NVIDIA GRID GPUs perfect for keeping up with the Raspberry Pi and the next generation of end points


Citrix have been making a fair bit of noise about their end-client (Receiver) being available and supported in-conjunction with partner ThinLinx on the Raspberry Pi, which with peripherals is proving a sub-$100 thin-client, capable of handling demanding graphics and frame rates (fps) of 30fps or more (YouTube is usually 30fps).

The Raspberry Pi and other low-cost end-points such as the Intel NUC are capable because they support hardware decode of protocols such as H.264 and JPEG used by HDX/ICA, they have SoC (system on a chip) hardware designed to handle graphics really very well.

There has been a lot of excitement in the industry and community, with traditional thin-clients typically costing $300-$600 the Pi offers potential cost savings but also the opportunity to use VDI/Application remoting (XenDesktop or XenApp) in scenarios where it wouldn’t have made financial sense.

There have been some stunning videos demonstrating the potential of this new class of low-cost endpoint such as:

And what do all these videos have in common – THEY WERE ALL RECORDED USING SERVERS BACKED BY NVIDIA GRID virtualized GPUs (vGPU or GPU-sharing via XenApp and GPU pass-through)! Because:

  • If you have effective hardware decode on the client you need your server to be able to keep up and pump out high frame rates and visual quality
  • With low cost clients virtualized (shared vGPU or XenApp GPU-sharing via passthrough) offers a cost effective way to boost the graphical power of the server without the need for a dedicated GPU per user

The Citrix Pi project highlights the power of using hardware encode and decode on GPUs. With the GPU able to take the brunt of the workload leading to:

  • Battery and power savings
  • The opportunity to offload server CPU for the protocol encode on the server boosting scalability, whilst Citrix only currently have hardware encode for their Linux VDA. With VMware Blast Extreme and NICE already offering it. The opportunity for NVIDIA GRID customers to see future value, as Citrix catches up, are there. The Pi project and VMware / NICE developments are vindication that this is where the industry is going – there are simply many tasks associated with virtualizing graphics that GPUs are best suited to.
  • Tests with blast extreme and NVENC shows up to 51ms lower latency on screen updates and lower CPU usage http://blogs.vmware.com/euc/2016/02/vmware-horizon-blast-extreme-acceleration-with-nvidia-grid.html so the potential for Citrix to emulate is clear.

Savings on the end-client costs is likely to change the balance of VDI deployment costs, allowing customers to invest more in the server data centre and freeing up budget for GPUs to access the user-experience improvements, benefits of consolidation and power savings. It really doesn’t make sense to have high-end workstations or PCs dotted around remote locations in use for a few hours a day for many customers anymore.

Once you get GPUs into a data centre, many sys admins report a reduction in costs associated with sluggish performance and  helpdesk calls. It’s not just high-end graphics benefit from GPU-acceleration but regular office applications, browsers and unified communications (Cisco Jabber, Skype, etc). You can read more here: https://www.virtualexperience.no/2015/11/05/mythbusting-browser-gpu-usage-on-xenapp/ and also here: https://www.citrix.com/blogs/2015/12/03/gpu-for-the-masses-with-xenappxendesktop/.

Exciting times! I somehow suspect we’ll see Citrix make (quite rightly) quite a bit of fuss over the Pi at their upcoming Synergy event! A performant end-client though needs to be fed by a performant server and NVIDIA GRID is a great match.

More Info:

4 thoughts on “NVIDIA GRID GPUs perfect for keeping up with the Raspberry Pi and the next generation of end points”

  1. With a GRID back end and a Windows 8.1 XenDesktop VM, I get about 60-80 msec latency from afar and no sign of delay. The RPI + GRID combination works exceptionally well and already the following was possible back in November 2015 with a RPI2 and Citrix Receiver supported under ThinLinX TLXOS and with Framehawk kicking in: http://www2.nau.edu/tjk/RPI-ThinLinX-2015Nov24.mp4

    Exciting times are definitely ahead and the RPI3 offers even more (around 50-60%) performance gains over the RPI2.

    Liked by 1 person

  2. I have had the pleasure of doing a larger vGPU project with thin clients as endpoints (No vendors mentioned means none forgotten) and enabling HW decode for the H.264 stream meant the difference between an “ok”-experience and “wow, that’s amazing”! The customer may have spent a little more on the endpoints than they could have, as opposed to a budget-model, but seeing how much a difference the HW Decoding introduced, the price-subject was never mentioned again.


  3. Oh, and definitely looking forward to Citrix’ response to “BlastExtreme” for the Windows-VDA’s! I guess we will know more after Synergy…


Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s