Delivering 3D Graphics from the Private or Public Cloud with ...
Transcript of Delivering 3D Graphics from the Private or Public Cloud with ...
Q3 2013
Delivering 3D Graphics from
the Private or Public Cloud
with XenDesktop and GRID
Derek Thorslund, Director of Product Management
Citrix Systems
© 2013 Citrix
Rich Apps as a
Service
© 2013 Citrix
Citrix milestones in 3D graphics remoting
2006
Project K2 delivers CATIA
to Boeing Dreamliner designers
2009
GA of XenDesktop HDX 3D Pro with Deep
Compression
2011
XenServer 6.0 hypervisor introduces
GPU Passthrough
2012
Higher fps via NVIDIA GRID™
API plus improved
compression
2013
XenDesktop 7 GPU Sharing
with high density GRID™
K1/K2 cards
Business Drivers for virtualizing 3D graphics apps & workstations
Global talent base
Secure IP
Work-from-home
Disaster recovery
Mobile device access
Improve time-to-market
Cost efficiency
© 2013 Citrix
Leverage worldwide talent pool
© 2013 Citrix
Intellectual property: Do you recognize this car?
© 2013 Citrix
Centralize and secure design IP
Engineering drawings
Bills of Materials
Cost info
Supplier info
Customer info
Lifecycle data
Product design decisions
© 2013 Citrix
Work-from-home & Disaster recovery
© 2013 Citrix
Leverage mobile devices
© 2013 Citrix
Improve efficiency & agility
Reduce operational costs
© 2013 Citrix
Global Product Development Teams – Real Example
Korea
Germany
Brazil
Australia
United States
India
China
© 2013 Citrix
Global Development Effort – Real Example
• 30,000 CAD files or 70 GB of data to be synchronized every day
• Across 26 design centers (30,000+ users)
• Across 16 countries
• It took 2 weekends to sync all code updates!
• More challenging for 4,000+ suppliers and partners
© 2013 Citrix
Enhanced IP control, collaboration and global agility
R & D
Manufacturing & Logistics
Sales & Marketing
Supplier
Support
QA
R & D
Data stays in data center Access via LAN or WAN
Case studies
© 2013 Citrix
Case study and customer reference Manuel Killer, Project Manager CAx Technologies ABB Switzerland Ltd – Power Electronics & MV Drives
Global CAD accesss with HDX 3D Pro October 2011
© 2013 Citrix
Requirements from Business to IS
Turgi
Global Document Mgmt
Global Software Development and Engineering Tool Landscape
Global Product Release Process
Global Change Mgmt Process
Global Product Development Process
Global Development
Global Engineering
Global Production
Engineers in India need to be able to work as if they were sitting in Switzerland
Extended Engineering Workbench in India
© 2013 Citrix © ABB Group July 17, 2013 | Slide 18
Challenges Of course there were more than one…
3D CAD data is large
Transferring our largest assemblies took 2.5 hours!
ABB’s corporate network
Latency
Bandwidth
Like one
team
© 2013 Citrix © ABB Group July 17, 2013 | Slide 20
Learnings
50%
75% 80%
90%
0%10%20%30%40%50%60%70%80%90%100%
0
50
100
150
200
250
300
350
Latency effect (subjective scores)
Latency [ms] System Quality [%]
Service quality is a
subjective matter
Using Dassault SolidWorks, 5-6 hours per day;
Designers can work from India as if in Switzerland!
(results without CloudBridge)
© 2013 Citrix
© 2013 Citrix
Wind turbine manufacturer HDX 3D Pro case study
Citrix Confidential - Do Not Distribute
• Delivering PTC Pro/E and Dassault SolidWorks from Europe to other continents since 2008 (2,000 remote users)
• HDX 3D Pro protects Vestas’ intellectual property, supports workforce globalization, eliminates inconsistencies in engineering design versioning and overcomes regulatory challenges
• Reduced cost per running hour by 30% from €416 (traditional CAD workstations) to €291 (data center blade workstations) via follow-the-sun utilization (Denmark, UK, US, India, China)
© 2013 Citrix
Major European heavy vehicle manufacturer HDX 3D Pro case study
• Access from Germany, Mexico and Brazil to Dassault CATIA apps hosted in Sweden
• At 220ms roundtrip latency, good performance working on models with 1500+ parts; bandwidth usage rarely reaches 2.5 Mbps
• Using 3D Space Mouse
© 2013 Citrix
Daimler Digital Factory HDX 3D Pro case study
• The Daimler Digitale Fabrik (Digital Factory) team can simulate an entire manufacturing plant in software
• HDX 3D Pro serves users across various Daimler plants and offices
• Siemens NX applications
• HP ws460c data center blade workstations
• NVIDIA Quadro Fermi Q2000 cards
Product Overview XenDesktop HDX 3D Pro
© 2013 Citrix
Segmenting the user population
Tier 1: Professional users (e.g. design engineers, radiologists)
• Top rendering performance (dedicated GPU)
• Deep compression on WAN links
• 3D mouse
Tier 2: Power users (users who need to view/edit large 3D models)
• GPU sharing
Tier 3: Knowledge workers
• Highly shared GPU
1
2
3
© 2013 Citrix
XenDesktop HDX 3D Pro
• XenDesktop feature for high-end and
second-tier 3D professional graphics
• GPU acceleration for hardware
rendering of large 3D models
• Multiple compression options including
deep compression codec for access
over narrow WAN links
• VDI and Hosted Shared workloads
© 2013 Citrix
XenDesktop: Powerful and flexible infrastructure
Universal client Enterprise app store High-Definition User Experience
Flexible Desktop and App delivery
Citrix Receiver
NetScaler Gateway
XenDesktop
© 2013 Citrix
VDI versus RDS (hosted shared) XenDesktop workloads
Tier 1: HDX 3D Pro on VDI Tier 2: HDX 3D Pro on RDS
• GPU acceleration of Direct3D, OpenGL, CUDA, OpenCL
• H.264-based Deep Compression
• Full desktop or seamless apps
• DirectX/OpenGL GPU sharing planned via GRID™ vGPU
• 3D mouse support
TOP PERFORMANCE
• GPU acceleration of Direct3D, OpenGL, CUDA*, OpenCL*
• H.264-based Deep Compression
• Full desktop or seamless apps
• DirectX/OpenGL GPU sharing
• Lower cost Microsoft licensing
• Apps must be RDS compatible
MOST COST-EFFECTIVE * Experimental pending field validation
© 2013 Citrix
What’s new with HDX 3D Pro in XenDesktop 7?
• Self-tuning codec technology ᵒ Adaptive Display automatically detects transient and/or video images
ᵒ Image quality dynamically adapts to network bandwidth
ᵒ Fine Drawing codec eliminated; improved H.264 codec performs much
better
• HDX 3D Pro now available for Windows Server RDS workloads ᵒ Adaptive H.264-based Deep Compression
ᵒ GPU acceleration and sharing for OpenGL and DirectX (including WPF),
plus experimental support for CUDA and OpenCL
ᵒ Faster frame rate at higher resolutions compared to XA6.5 GPU Sharing
© 2013 Citrix
What’s new with HDX 3D Pro? (cont’d)
• Auto screen resolution detection ᵒ No longer necessary to disconnect /reconnect when changing resolution
• 5 versions of Receiver now include decoding of H.264-based Deep
Compression: Windows, Linux, iOS, Mac, Android
• HDX Monitor now reports on HDX 3D Pro ᵒ Details on fps, codec, performance
ᵒ Replaces previous HDX 3D Pro Health Check Tool
• Quad monitor support ᵒ Not a hard limit but we tested with up to 4 monitors with good performance
© 2013 Citrix
RDS-compatible professional graphics apps Some examples from autodeskandcitrix.com, Citrix Ready site, etc.
• Lots of Autodesk apps, including: ᵒ AutoCAD
ᵒ Inventor
ᵒ Revit
ᵒ Navisworks
• Bentley MicroStation
• ESRI ArcGIS
• Intergraph SmartPlant 3D
• Adobe PhotoShop
© 2013 Citrix
• Feature of XenDesktop Enterprise and Platinum editions
• Broad app compatibility • OpenGL, DirectX (incl. WPF), CUDA, OpenCL
• No API hooking
• Blade/rack workstations are ideal, but any form factor can be used for the host • Multiple users per host using GPU passthrough
XenDesktop HDX 3D Pro
© 2013 Citrix
Desktop Virtualization for High-end Graphics Users HDX 3D Pro client options
© 2013 Citrix
HDX 3D Pro on thin clients • HDX Ready Premium thin
clients support Deep
Compression decoding
• Min. 1.6 GHz CPU required
• More to come, including
lower cost HDX SoC devices
Photos not to scale
© 2013 Citrix
Host requirements
• XenServer 6.x or vSphere 5.1 or physical machine
• Quad core CPU at 2.3 GHz or higher, or four vCPUs
• 4 GB of RAM minimum
• GPU card supported by ISV
© 2013 Citrix
Seamless Application Delivery
ICA
XenDesktop VDA
HDX 3D Pro
Host End Point
Citrix Receiver
© 2013 Citrix
Compression Options
Users can switch between codecs if desired:
• Deep Compression codec (default)
• CPU-based, min. 1.5 Mbps bandwidth
(see next slide for real world examples)
• 100% lossless compression (e.g. medical imaging)
© 2013 Citrix
Customer-reported bandwidth utilization on long-haul connections Deep Compression codec technology
• First user requires 1.5 to 2 Mbps minimum
• Heavy equipment manufacturer: Branch with 12 concurrent users requires 700-800 Kbps per user
• Control valves manufacturer: 20 Mbps WAN link serves branch with 17 users, i.e. 1.2 Mbps/user
• Bandwidth requirement does not scale linearly
© 2013 Citrix
Pixel-Perfect Lossless for Medical Images Lossless Compression (pixel-perfect)
Lossless Systray icon
Text displayed on hovering the mouse over the icon
© 2013 Citrix
Support for up to 4 monitors
•Citrix Receiver for Windows or Linux
•Efficient use of bandwidth
© 2013 Citrix
3D mouse support available on VDI
USB redirection for 3D mice and
similar devices
Virtual Channel can be prioritized
to maximize responsiveness
© 2013 Citrix
Ideal for low bandwidth and high latency connections Citrix CloudBridge for WAN optimization
Citrix Confidential - Do Not Distribute
• Improves responsiveness of apps delivered via HDX 3D Pro over high latency connections
• Reduces bandwidth consumption, enabling more users to share a given size of pipe (e.g. ABB reports 3:1 improvement at just 5 users)
© 2013 Citrix
GPU sharing for RDS workloads
• Usually one VM per GPU (and one GPU per VM) ᵒ On bare metal with OpenGL apps, multiple GPUs can serve one VM, but in general we
recommend one GPU per VM using a hypervisor that supports GPU passthrough
• Each VM is a multi-user Windows Server RDS workload
• XenServer GPU Passthrough now supports up to 12 GPUs per server ᵒ But typical high-end configuration is 3x NVIDIA GRID™ K2 for a total of 6 GPUs
• Direct access to graphics driver and hardware, unlike software-based vGPU
• User density depends on the apps, GPU processing power, video RAM, etc. ᵒ No fixed limit; one customer reports 32 users on a Q6000 with Dassault 3D Via player
• RDS limitation: One user could impact performance of other users ᵒ Recommend capping the number of users per VM
Available
now!
© 2013 Citrix
Reduced cost per user
GPU Passthrough (single-user & multi-user VMs)
Citrix Confidential - Do Not Distribute
• Introduced in XenServer 6 (October 2011)
• Now also in vSphere/ESX with “vDGA”
• Multiple GPUs per host Lower cost per user
• Servers with up to 12 GPUs currently on XenServer HCL
• One graphics-accelerated VM (single-user or multi-user) per GPU
• Depending on CPU power, same host may also support regular office workers
Hypervisor
VM VM VM VM
NVIDIA Confidential
NVIDIA GRID K2
GPU 4 Kepler GPUs 2 High End Kepler GPUs
CUDA cores 768 (192 per GPU) 3072 (1,536 per GPU)
Memory Size 16GB DDR3 (4GB per GPU) 8GB GDDR5 (4GB per GPU)
Max Power 130 W 225 W
Cooling solution Passive Passive
OpenGL 4.3 4.3
DirectX 11 11
GRID vGPU support Yes Yes
1 Number of users depends on software solution, workload, and screen resolution
NVIDIA GRID K1
© 2013 Citrix
Hardware platform
GPU Passthrough
GP
U
XenServer hypervisor
XenDesktop Windows VMs
GP
U
GP
U
GP
U
3D Pro VM
3D Pro VM
3D Pro VM
non-3D VM
3D Pro VM
non-3D VM
non-3D VM
non-3D VM
non-3D VM
non-3D VM
non-3D VM
non-3D VM
. . .
© 2013 Citrix
Hardware platform
GPU Passthrough with RDS workloads
GP
U
XenServer hypervisor
XenApp Windows Server VMs
GP
U
GP
U
GP
U
XenApp VM
XenApp VM
XenApp VM
Session 1
XenApp VM
Session 2
Session 3
Session 4
Session 5
Session N-1
Session N . . .
© 2013 Citrix
Multiple concurrent users per GPU
XenDesktop GPU Sharing on hosted-shared
• Ideal for second tier users of 3D professional graphics
• Supports all versions of DirectX and OpenGL
• GPU Sharing for DirectX has been available since XenApp 6.0
• XenApp 6.5 OpenGL GPU Sharing feature add-on was introduced in March at GTC
• Included in XenDesktop 7 for Hosted Shared workloads
• Works with Fermi-generation NVIDIA Quadro cards and with the latest Kepler-architecture GRID™ K2 (higher user density)
• Directly leverages the GPU video driver (unlike API Intercept vGPU)
• Includes experimental support for CUDA and OpenCL
© 2013 Citrix
GPU sharing scalability
• With two NVIDIA Quadro 4000 cards we ran 9 users per GPU using a test app
that works with ESRI ArcGIS, and we still had space for more
• Running Dassault SolidWorks, Ansys Workbench and Fluent, scalability was 6
to 10 users per Quadro 4000
• The Quadro 6000 was able to support 30 users running Dassault 3DVIA
Composer Player with only minor slowdown; and this test was harder on the
graphics card than the real world is!
• We are getting 30 users of SAP Right Hemisphere 3D on a physical XenApp
6.5 server with a Quadro 2000 card
New NVIDIA GRID K2 introduces even higher user densities!
© 2013 Citrix
GPU sharing for VDI workloads (coming soon)
• GPU sharing for single-user Windows desktop VDI workloads requires GPU
virtualization (vGPU)
• Earlier vGPU technologies (Microsoft Hyper-V RemoteFX, VMware
vSphere/ESX vSGA) are software-based (API Intercept approach)
ᵒ Designed for less demanding knowledge worker use cases
ᵒ Limited to smaller 3D models due to data transfer from user session to session 0
ᵒ Limited to older versions of DirectX/OpenGL
• XenServer/NVIDIA GRID™ vGPU is hardware-based
ᵒ High performance, even with large models
ᵒ Supports the latest versions of Direct and OpenGL
• Public Tech Preview September 2013
© 2013 Citrix
Hardware platform
GPU Virtualization
GP
U
XenServer hypervisor
XenDesktop Windows VMs 3D Pro
VM 3D Pro
VM 3D Pro
VM 3D Pro
VM
vGP
U
3D Pro VM
. . .
vGP
U
vGP
U
vGP
U
vGP
U
. . .
1 2 3 4 N
. . .
NVIDIA GRID Enabled OEM Platforms
IBM iDataPlex dx360 M4
2 GRID K1 or 2 GRID K2
HP ProLiant WS460c Gen8 1 GRID K1 or 1 GRID K2
Dell PowerEdge R720 2 GRID K1 or 2 GRID K2
SuperMicro SYS-2027-TRF 2 GRID K1 or 3 GRID K2
SuperMicro SYS-1027-TRF 2 GRID K1 or 3 GRID K2
Cisco UCS C240 M3 2 GRID K1 or 2 GRID K2
HP ProLiant SL250 Gen8 2 GRID K2
Available
H1 2
013
Available
H2 2
013
HP ProLiant SL270 4+ GRID K2
Asus ESC 4000 G2 4 GRID K2
http://hcl.vmd.citrix.com/GPUPass-throughDeviceList.aspx
© 2013 Citrix
Summary: Citrix solution for 3D graphics
•Proven solution for high-end 3D graphics delivery
•Best WAN performance on the market
•Lowest cost per user
•Access from any device
•First vendor to adopt NVIDIA GRID™ API and offer
high performance GPU sharing
© 2013 Citrix
Additional Information
• Resources: www.citrix.com/xendesktop/HDX3D/
• Blog: www.blogs.citrix.com/product/xendesktop/
• Twitter: @xendesktop
Check out GRID talks @ SIGGRAPH Next Week:
“State-of the-Art of Virtualized Graphics”
Tuesday, July 23rd – 10:40 AM PDT
Wednesday, July 24th – 1:20 PM PDT
Thursday, July 25th – 11:20 AM PDT
“Bunkspeed: Bringing NVIDIA iray® and
Ease of Use to Designers”
Thursday, July 25th –1:20 PM PDT
Join us online – all talks will be streamed live!
www.nvidia.com/siggraph2013
GTC 2014 Call for Submissions
Looking for submissions in the fields of
Science and research
Professional graphics
Mobile computing
Automotive applications
Game development
Cloud computing
Submit at www.gputechconf.com
© 2013 Citrix
Work better. Live better.