VMware announces High Availability (HA) for NVIDIA GRID vGPU VMs with vSphere 6.5

I was very pleased to see Pat Lee from VMware’s PM team tweet about this yesterday…

[Image: Pat Lee’s tweet]

It’s something we knew VMware had added to vSphere 2016, and vSphere 2016 is supported in the GRID 4.1 (Nov 2016) release. As this is a VMware-implemented feature, we at NVIDIA had to wait for them to announce it. I think there have been a few problems with the staging of the documentation update, which is why this has been a rather quiet feature release. I’ll update this blog with links to the documentation when it becomes available, which should be soon!

But since Pat has let the cat out of the bag… it’s probably best to answer a few basic questions straight away.

What is High Availability (HA)?

Basic HA is a feature to ensure VMs are up and running again as soon as possible in the event of host failure. The VM will automatically restart on another host if one is available with sufficient resources. For vGPU-enabled VMs that means a host with an appropriate GPU and so on. The user will experience some downtime, but it is minimized and no manual intervention by a system administrator is needed.
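
To make the "sufficient resources" idea concrete, here is a minimal sketch of how a restart target might be chosen for a vGPU-enabled VM. This is not VMware's placement logic; the `Host` and `VM` classes, the host names and the use of the profile name "M60-1Q" are all assumptions made purely for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Host:
    name: str
    free_ram_gb: int
    # Free vGPU slots per profile name (hypothetical model); empty if no GPU.
    free_vgpu_slots: Dict[str, int] = field(default_factory=dict)


@dataclass
class VM:
    name: str
    ram_gb: int
    vgpu_profile: Optional[str] = None  # None means no vGPU attached


def pick_restart_host(vm: VM, surviving_hosts: List[Host]) -> Optional[Host]:
    """Return the first surviving host that can take the VM, or None."""
    for host in surviving_hosts:
        if host.free_ram_gb < vm.ram_gb:
            continue  # not enough memory on this host
        if vm.vgpu_profile and host.free_vgpu_slots.get(vm.vgpu_profile, 0) < 1:
            continue  # no free GPU capacity for the VM's vGPU profile
        return host
    return None  # nowhere to restart: the VM stays down


# Illustrative cluster after one host has failed.
hosts = [
    Host("esx-a", free_ram_gb=64),
    Host("esx-b", free_ram_gb=16, free_vgpu_slots={"M60-1Q": 2}),
    Host("esx-c", free_ram_gb=128, free_vgpu_slots={"M60-1Q": 1}),
]
target = pick_restart_host(VM("cad-01", ram_gb=32, vgpu_profile="M60-1Q"), hosts)
print(target.name if target else "no host available")
```

The point of the sketch is simply that a vGPU VM has one more constraint than a plain VM: a host with spare RAM but no suitable GPU capacity is not a valid restart target.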

Guaranteed High Availability…

HA features can provide this by allowing resources such as RAM/CPU to be reserved on hosts, e.g. maybe 15% of a host’s capacity, which guarantees that resources will be available to restart VMs for up to a certain number of host failures. I believe that VMware’s configuration does not extend to GPU resource reservation, so the support announced today will not offer guaranteed HA. It is a feature VMware could add in the future if they saw sufficient demand; it is not a feature engineered by NVIDIA.
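
As a rough worked example of the reservation arithmetic, the sketch below estimates how many host failures a percentage reservation can absorb. It assumes identical hosts and that every host holds back the same percentage of its capacity; the function name is hypothetical and this is not how any particular hypervisor implements admission control.

```python
def tolerated_host_failures(host_count: int, reserved_percent: int) -> int:
    """Estimate host failures covered by a uniform capacity reservation.

    Assumes identical hosts. With N hosts each reserving p% of capacity,
    the running load is at most N * (100 - p)% of one host, so it still
    fits on the N - k survivors as long as k <= N * p / 100.
    """
    if host_count < 1 or not 0 <= reserved_percent <= 100:
        raise ValueError("need at least one host and a percentage in 0..100")
    # Integer arithmetic avoids floating-point rounding surprises.
    covered = (host_count * reserved_percent) // 100
    # At least one host must survive to restart anything on.
    return min(covered, host_count - 1)


print(tolerated_host_failures(8, 15))   # 8 hosts, 15% reserved each
print(tolerated_host_failures(20, 15))  # 20 hosts, 15% reserved each
```

Under these assumptions a 15% reservation covers one host failure in an 8-host cluster and three in a 20-host cluster. The same sum cannot currently be done for GPU capacity if, as I believe, GPU resource reservation is not configurable.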

Can HA provide continual up-time?

No, not alone. Many hypervisors, though, offer Fault Tolerance (FT), which can provide such support. It is a very expensive feature to use, as it relies on running what is essentially a duplicate VM on mirrored hardware, phase-locked to the original (i.e. milliseconds behind). In the event of failure the user is switched to the duplicate with only a momentary glitch in user experience. Because it is so costly to implement, it is essentially only used in a few safety-critical or mission-critical use cases.

So is Fault Tolerance (FT) supported for vGPU?

No, not today. The technology to, in essence, continually snapshot a live GPU is not available. This is also a prerequisite for live migration/motion (e.g. vMotion) and for regular snapshots.

The Future

NVIDIA and all the partners such as Citrix and VMware appreciate that live motion and snapshotting are key enterprise datacenter needs, so we continue to work towards making such technology happen (it’s technically very hard, I’m told!). We all know what you want and what you want our priorities to be!!!

NVIDIA GRID is architected with a software model that lets us add support for new OSs on customers’ existing hardware, allowing them to pick up new features.

NVIDIA GRID: Linux Guest OS support for Linux distributions on Citrix and VMware

I was recently involved in a support inquiry where a user wanted to know if NVIDIA GRID vGPU was available on Linux VDAs with the Linux guest OS, OpenSUSE LEAP (the answer at the time of writing is that it’s NOT!). Finding the answer was a lot harder than I expected as both VMware and Citrix documentation took a bit of hunting around.

Much of the marketing around Linux VDAs mentions support for “SUSE”, “CentOS” or other Linux distributions. It is important that customers check the official support matrices of both their hypervisor and their VDI solution, as both Citrix and VMware only certify, QA and support specific versions of Linux guest OSs (usually only enterprise-supported versions). Customers may find themselves unsupported by the virtualization vendors if they fail to check that the OS and its specific version are supported by both their hypervisor and VDI solution (especially if mixing vendors, such as Citrix XenDesktop on VMware ESXi).

Both vendors are evolving their Linux support rapidly and customers must check the documentation associated with the relevant versions of VMware/Citrix products they intend to use.

NVIDIA cannot provide support for guest OSs unsupported by the relevant virtualization vendor and as such customers are recommended to contact VMware/Citrix if they wish to use alternative versions/distributions. It is very likely many other varieties of Linux will “work” but customers should be aware that they will be unable to obtain hypervisor or VDI support in the event of an issue.

At the time of writing Horizon 7 on ESXi supports:

  • Ubuntu 12.04 and 14.04
  • Red Hat Enterprise Linux (RHEL) 6.6 and 7.1
  • CentOS 6.6
  • NeoKylin 6 Update 1 (Chinese)
  • SUSE Linux Enterprise Desktop 11 SP3

 

At the time of writing Citrix XenDesktop 7.9 on XenServer supports:

  • SUSE Linux Enterprise:
    • Desktop 11 Service Pack 4
    • Desktop 12 Service Pack 1
    • Server 11 Service Pack 4
    • Server 12 Service Pack 1
  • Red Hat Enterprise Linux
    • Workstation 6.7
    • Workstation 7.2
    • Server 6.7
    • Server 7.2
  • CentOS Linux
    • CentOS 6.7
    • CentOS 7.2

Going forward, if you want to check the OSs available for a Linux VDA, you should follow the advice below.

Citrix

XenServer Support for Linux Guest OSs

This is documented in the “Citrix XenServer® Virtual Machine User’s Guide” for the relevant version of XenServer e.g. for 7.0, here: http://docs.citrix.com/content/dam/docs/en-us/xenserver/xenserver-7-0/downloads/xenserver-7-0-vm-users-guide.pdf

XenDesktop Guest OSs Supported by the Linux VDA

This can be found in the Linux VDA product documentation for the relevant version of XenDesktop, under the section “System Requirements”, e.g. for XenDesktop 7.9 please see http://docs.citrix.com/en-us/xenapp-and-xendesktop/7-9/install-configure/suse-linux-vda.html (This is where I had to hunt around, as bizarrely Citrix detail the distributions and versions of Linux supported under each supported OS rather than in a master list, so the SUSE documentation is where you can find RHEL and other supported versions listed.)

VMware

ESXi/vSphere Support for Linux Guest OSs

Supported Linux OSs are listed in the “VMware Compatibility Guide”: https://www.vmware.com/resources/compatibility/search.php?deviceCategory=software

Horizon Support for Linux Guest OSs

The versions and distributions supported by Horizon are listed in the FAQ for the appropriate release e.g. for Horizon 7, here: http://www.vmware.com/content/dam/digitalmarketing/vmware/en/pdf/products/horizon/vmware-horizon-for-linux-faq.pdf

New Cisco Validated Design featuring UCS B200 M4 with NVIDIA GRID M6 vGPU – available now!

It’s great to see a new validated design released by Cisco in recent weeks, particularly as it features the NVIDIA GRID M6 options for blade servers to enable virtualized GPU acceleration (vGPU). This reference architecture joins others available for UCS, but in particular it features a reference blueprint for Citrix XenDesktop/XenApp 7.7 and VMware vSphere 6.0 for 5000 seats. Key features include:

  • Citrix XenDesktop/XenApp 7.7
  • Built on Cisco UCS (including Cisco B200 M4 Blade Server) and Cisco Nexus 9000 Series
  • With NetApp AFF 8080EX
  • VMware vSphere ESXi 6.0 Update 1 Hypervisor Platform

Cisco have done a great job providing a comprehensive guide and reference for a full VDI/XenApp deployment that includes networking, storage and graphics acceleration considerations.

 

Cisco-NVIDIA Relationship

There are plenty of case studies, whitepapers and webinar recordings covering Cisco’s long-standing investment in NVIDIA GRID and vGPU too.

VMworld 2016 – VMware let users set the agenda! Go VMware!

VMware Democracy at VMworld – it seems you can vote for which sessions you’d like to see. I think this is a super idea, as it allows the community, partners and customers to actually pre-screen the balance of talks and speakers.

This kind of openness, where VMworld lets its community see what has been submitted (and subsequently rejected/accepted), is great. It allows others to become aware of potential speakers who, whilst perhaps not suitable for VMworld, may fit other events/platforms better. It’s also a very strong message that this conference is for the users. Go VMware!

I wish more conferences did this.

How-to-vote

  • From now until May 24th you will be able to cast your vote on the 1500+ submissions that came in through the Call for Papers. To VOTE, go to http://www.vmworld.com/en/call-for-papers.html and log into your account, then just click the “like” button and you’ve cast your vote.
  • Public Session Voting is the opportunity for the VMware community of experts, customers, partners, bloggers and enthusiasts to cast their votes and help shape the agenda for VMworld 2016.

Session Voting is open to everyone and anyone. You will be required to login to your vmworld.com account. If you do not have a vmworld.com account, you can set one up for free.

 

NVIDIA GRID at VMworld

If you are interested in seeing sessions on NVIDIA GRID and vGPU, just search on key words like “NVIDIA” and “vGPU”. Some of the submissions I know of include:

  • Title: Why does Siemens use High-Performance Desktops with VMware Horizon and NVIDIA GRID – Submission from: Soeren Reinersen, Siemens Wind VMware, Sarah Mannion, NVIDIA, ID: 7888
  • Title: Sizing your NVIDIA GRID with VMware Horizon 7 – Submission from: Erik Bohnhorst, NVIDIA, ID: 8211
  • Title: Accelerating VMware Horizon Blast Extreme with NVIDIA GRID – Submission from: Erik Bohnhorst, NVIDIA, ID: 7517
  • Title: Selecting the right NVIDIA GRID edition with VMware Horizon – Submission from: Manvender Rawat, NVIDIA, ID: 8232
  • Title: A Technical Deep Dive on Performance, Scalability and Deployment Best Practices of GPU-accelerated workloads with VMware Horizon View and Nvidia GRID vGPU – Submission from: Lan Vu, VMware and Manvender Rawat, NVIDIA, ID: 8447
  • Title: Scientific Methodology to Determine User Experience for VDI – Submission from: Deepti Jain, NVIDIA, ID: 9202
  • Title: Customer Success at TSP: NVIDIA vGPU on Horizon View – Submission from: Jeff Weiss + NVIDIA Customer, ID: 8254
  • Title: NVIDIA vGPU on Horizon from Pilot to Production Deployment – Submission from: Jeff Weiss, ID: 8178
  • Title: Real world NVIDIA GRID vGPU sizing for optimized user experience with VMware vRealize – Submission from: Milan Diebel, NVIDIA, ID: 8464
  • Title: How Architectural Design Firms Leverage Virtual GPU Technology for Global Collaboration – Submission from: Randall Siggers, NVIDIA, ID: 9045

 

Many of these speakers have spoken at NVIDIA’s GTC events, and recordings are available on GTC On-Demand so you can get an idea of the expertise of the speakers and the technical depth. Perhaps you have seen them and can comment below on sessions you are looking forward to at VMworld?

VMworld will be held on August 28 – Sept 1 2016 in Las Vegas. There is still time to sign up. If you’d like to find out more about our partnership with VMware, why not visit our community site, full of forums, FAQs, webinars and product overviews.

Confessions of a VMware Secret Agent – NVIDIA GRID and Blast Extreme – answers!

I have a new secret double life! I’ve recently been involved in doing the live chat and Q&A for NVIDIA GRID webinars. If you have never attended one of our webinars but are interested in NVIDIA GRID technologies, you should consider trying one. They usually take the format of a 1+ hour webinar hosted by internal technology specialists from teams like Support, Readiness or Product Management. We show live demos, hints and best practices, and we regularly have guest speakers or partners involved.

Our next webinar is on Thursday 12th May 2016 (8am PST/11AM EST/4PM UK): “See How Virtual GPU Technology Can Increase User Productivity and Reduce IT Cost.”
