Printing - GPU Technology Conferenceon-demand.gputechconf.com/gtc/2015/posters/GTC_2015_Life... ·...

1
1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 Similarity to native Pseudo energy Near native poses E Decoy poses GeauxDock: an ultra- fast molecular docking package for computeraided drug discovery Yun Ding, Ye Fang, W. Feinstein, D.M. Koppelman, J. Moreno, J. Ramanujam, M.Brylinski, M. Jarrell Louisiana State University, Louisiana Alliance for SimulaDonGuided Materials ApplicaDons BACKGROUND ComputaDonal modeling of binding drug to proteins has become an integral component of modern drug discovery pipelines. A typical applicaDon is structurebased virtual screening, which involves a large scale modeling of pharmacological relevant associaDons between small molecules and their macromolecular targets (Fig. 1 and Fig. 2). The desire to improve stateoftheart moDvated us to develop an ultrafast ligand docking approach that uses Monte Carlo as the sampling method and features computaDons on modern supercomputers. Combined with an effecDve scoring funcDon, this new method will provide accurate predicDons at a high performance/cost raDo, which is a criDcal factor for largescale virtual screening applicaDons. SCORING FUNCTION Figure 1. Computeraided drug development holds a significant promise to speed up the discovery of novel pharmaceuDcals at reduced costs. The naDvelike recogniDon capacity of the scoring funcDon was opDmized by maximizing the Zscore calculated across the training dataset. The Zscore is defined as: CASE STUDY One example of successful pose idenDficaDon is shown to demonstrate the accuracy of our approach; GeauxDock was used to find the near naDve pose of the inhibitor in Cathepsin K P43235. BENCHMARKS IMPLEMENTATIONS CONCLUSIONS Figure 6. Performance comparison on different pla^orms. The benchmarks are carried on four pla^orms, a 6 core Xeon E52620 CPU, a Xeon Phi 3210A, two Tesla M2090, and a GeForce GTX 980. We use intel compiler 15.0.0 and CUDA 6.5 with opDmizaDon flag – O3. The results demonstrate a considerable speedup using heterogeneous accelerators. Our early tuning of a Maxwell GeForce GTX 980 GPU provides a twofold speed up over a Fermi Tesla M2090 GPU. Figure 2. Docking simulaDons predict the naDve pose of the ligand by searching for the global minimum in the energy space. Z score = E L E H σ L 2 + σ H 2 Figure 3. The naDvelike recogniDon capability of our force field is opDmized by maximizing the Z score. Figure 4. Receiver operaDng characterisDc (ROC) analysis for the recogniDon of naDvelike conformaDons in docking ensembles by GeauxDock compared to other scoring funcDons. TPR – true posiDve rate, FPR – false posiDve rate. IniDalize Monte Carlo Perturb compute Replica Exchange Monte Carlo IteraDons ~=10 Monte Carlo Perturb compute Monte Carlo Perturb compute The docking code is implemented with a highly efficient Replica Exchange Monte Carlo (REMC) sampling to take advantage of massively parallel capabiliDes provided by GPUs. In a nutshell, docking calculaDons exploits two levels of parallelism: coarse and finegrained. The former considers a simultaneous processing of many ligand configuraDons, which will be mapped to hardware thread blocks. The finegrained level parallelizes the calculaDon of interacDons for each individual configuraDon using mulDple threads within each block. We implement and maintain fairly similar codes for three pla^orms: CPUOpenMP, Phi OpenMP, GPUCUDA. The applicaDon front end loads data from input files, then apply strength reducDon opDmizaDon and construct Struct of Array (SoA) data for GPU, or Array of Structure (AoS) data for CPU/Phi. A set of common wrappers for offload compuDng are developed to abstract the work flow on three different pla^orms: (1) data allocaDon, (2) copyin, (3) kernel computaDon, and (4) copyback, thus that CPU/ Phi/GPU code can share the same high level infrastructure. While the low level kernel code for different architecture are forked from a highly opDmized sequenDal code, and specifically tuned for the parallel performance. GeauxDock provides sufficient accuracy for its applicaDon in largescale virtual screening projects to support the reconstrucDon of proteinligand interacDon networks. GeauxDock adapts mulDple commodity backends, among which, GPU provides considerable speedup. GeauxDock is freely available from our website: hmp://www.insDtute.loni.org/lasigma/package/dock/ This work is supported by the NSF EPSCoR LASiGMA project under award #EPS1003897 and the Louisiana Board of Regents through the Board of Regents Support Fund [contract LEQSF(201215)RDA05]. Figure 5. Docking simulaDon for Cathepsin K P43235. From up let to down right. (A) Decreasing pseudo energy drives system to higher CMS regions during the Monte Carlo simulaDon. (B) Scamer plot of pseudo energy vs CMS. Three typical data points were selected from the simulaDon trajectory and their corresponding configuraDons (same color) are plomed in (C). (D) Closeups. The crystallographic configuraDon is light blue. Decoy Intermediate native-like A D C B Figure 8. The mapping from computaDonal resources to domain science objects. A proteinligand replica is mapped to a GPU thread block; atom coefficients are stored in consecuDve memory addresses, and mapped to GPU threads. Figure 7. The program’s flow chart. Many replicas are updated simultaneously using replica exchange Monte Carlo (REMC) algorithm. CONTACT NAME Yun Ding: [email protected] POSTER P5191 CATEGORY: LIFE & MATERIAL SCIENCE - LS04

Transcript of Printing - GPU Technology Conferenceon-demand.gputechconf.com/gtc/2015/posters/GTC_2015_Life... ·...

Page 1: Printing - GPU Technology Conferenceon-demand.gputechconf.com/gtc/2015/posters/GTC_2015_Life... · Printing: This poster is 48 wide by 36 high. It s designed to be printed on a large-format

Printing: This poster is 48” wide by 36” high. It’s designed to be printed on a large-format printer.

Customizing the Content: The placeholders in this poster are formatted for you. Type in the placeholders to add text, or click an icon to add a table, chart, SmartArt graphic, picture or multimedia file. To add or remove bullet points from text, just click the Bullets button on the Home tab. If you need more placeholders for titles, content or body text, just make a copy of what you need and drag it into place. PowerPoint’s Smart Guides will help you align it with everything else. Want to use your own pictures instead of ours? No problem! Just right-click a picture and choose Change Picture. Maintain the proportion of pictures as you resize by dragging a corner.

10 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

Similarity to native

Pseu

do

ener

gy

Near native poses

∆E

Decoy poses

GeauxDock:  an  ultra-fast  molecular  docking  package    for  computer-­‐aided  drug  discovery  Yun  Ding,  Ye  Fang,  W.  Feinstein,  D.M.  Koppelman,  J.  Moreno,  J.  Ramanujam,  M.Brylinski,  M.  Jarrell  Louisiana  State  University,  Louisiana  Alliance  for  SimulaDon-­‐Guided  Materials  ApplicaDons  

BACKGROUND  ComputaDonal   modeling   of   binding   drug   to   proteins   has   become   an  integral   component   of   modern   drug   discovery   pipelines.   A   typical  applicaDon   is   structure-­‐based   virtual   screening,   which   involves   a   large-­‐scale   modeling   of   pharmacological   relevant   associaDons   between   small  molecules  and  their  macromolecular  targets  (Fig.  1  and  Fig.  2).  The  desire    to     improve  state-­‐of-­‐the-­‐art  moDvated  us  to  develop  an  ultra-­‐fast   ligand  docking   approach   that   uses   Monte   Carlo   as   the   sampling   method   and  features   computaDons   on   modern   supercomputers.   Combined   with   an  effecDve   scoring   funcDon,   this   new   method   will   provide   accurate  predicDons  at  a  high  performance/cost  raDo,  which  is  a  criDcal  factor  for  large-­‐scale  virtual  screening  applicaDons.  

 

SCORING  FUNCTION  

Figure   1.   Computer-­‐aided   drug  development   holds   a   significant  promise  to  speed  up  the  discovery  of  novel   pharmaceuDcals   at   reduced  costs.  

The   naDve-­‐like   recogniDon   capacity   of   the   scoring   funcDon   was   opDmized   by  maximizing  the  Z-­‐score  calculated  across  the  training  dataset.  The  Z-­‐score  is  defined  as:  

 

CASE  STUDY  

One   example   of   successful   pose   idenDficaDon   is   shown   to   demonstrate  the   accuracy   of   our   approach;   GeauxDock   was   used   to   find   the   near-­‐naDve  pose  of  the  inhibitor  in  Cathepsin  K  -­‐  P43235.    

BENCHMARKS  

IMPLEMENTATIONS  

CONCLUSIONS  

Figure  6.  Performance  comparison  on  different  pla^orms.  The  benchmarks  are  carried  on  four  pla^orms,  a  6  core  Xeon  E5-­‐2620  CPU,  a  Xeon  Phi  3210A,  two  Tesla  M2090,  and  a  GeForce  GTX  980.  We  use  intel  compiler  15.0.0  and  CUDA  6.5  with  opDmizaDon  flag  –O3.  The  results  demonstrate  a  considerable  speedup  using  heterogeneous  accelerators.  Our  early  tuning  of  a  Maxwell  GeForce  GTX  980  GPU  provides  a  two-­‐fold  speed  up  over  a  Fermi  Tesla  M2090  GPU.  

Figure  2.  Docking  simulaDons  predict  the  naDve  pose  of  the  ligand  by  searching  for  the  global  minimum  in  the  energy  space.  

Z − score = EL −EH

σ L2 +σ H

2

Figure  3.  The  naDve-­‐like  recogniDon  capability   of   our   force   field   is  opDmized   by   maximizing   the   Z-­‐score.  

F i g u r e   4 .   R e c e i v e r   o p e r a D n g  characterisDc   (ROC)   analysis   for   the  recogniDon   of   naDve-­‐like   conformaDons  in   docking   ensembles   by   GeauxDock  compared   to   other   scoring   funcDons.  TPR   –   true   posiDve   rate,   FPR   –   false  posiDve  rate.  

IniDalize  

Monte  Carlo  

Perturb  

compute  

Replica  Exchange  

Monte  Carlo  IteraDons  ~=10    

Monte  Carlo  

Perturb  

compute  

Monte  Carlo  

Perturb  

compute  

The   docking   code   is   implemented  with   a   highly   efficient   Replica  Exchange   Monte   Carlo   (REMC)  sampling   to   take   advantage   of  massively   parallel   capabiliDes  provided   by   GPUs.   In   a   nutshell,  docking   calculaDons   exploits   two  levels   of   parallelism:   coarse-­‐   and  fine-­‐grained.   The   former   considers  a  simultaneous  processing  of  many  ligand  configuraDons,  which  will  be  mapped  to  hardware  thread  blocks.  The   fine-­‐grained   level   parallelizes  the   calculaDon   of   interacDons   for  each   individual   configuraDon   using  mulDple  threads  within  each  block.  

 

We  implement  and  maintain  fairly  similar  codes  for  three  pla^orms:  CPU-­‐OpenMP,  Phi-­‐OpenMP,  GPU-­‐CUDA.  The  applicaDon  front  end  loads  data  from  input  files,  then  apply  strength   reducDon   opDmizaDon   and   construct   Struct   of   Array   (SoA)   data   for  GPU,   or  Array   of   Structure   (AoS)   data   for   CPU/Phi.   A   set   of   common   wrappers   for   offload  compuDng   are   developed   to   abstract   the  work   flow  on   three   different   pla^orms:   (1)  data  allocaDon,  (2)  copy-­‐in,  (3)  kernel  computaDon,  and  (4)  copy-­‐back,  thus  that  CPU/Phi/GPU  code  can  share  the  same  high  level   infrastructure.  While  the  low  level  kernel  code  for  different  architecture  are  forked  from  a  highly  opDmized  sequenDal  code,  and  specifically  tuned  for  the  parallel  performance.  

GeauxDock   provides   sufficient   accuracy   for   its   applicaDon   in   large-­‐scale   virtual  screening   projects   to   support   the   reconstrucDon   of   protein-­‐ligand   interacDon  networks.   GeauxDock   adapts   mulDple   commodity   back-­‐ends,   among   which,   GPU  provides   considerable   speedup.   GeauxDock   is   freely   available   from   our   website:  hmp://www.insDtute.loni.org/lasigma/package/dock/    This   work   is   supported   by   the   NSF   EPSCoR   LA-­‐SiGMA   project   under   award  #EPS-­‐1003897   and   the   Louisiana   Board   of   Regents   through   the   Board   of   Regents  Support  Fund  [contract  LEQSF(2012-­‐15)-­‐RD-­‐A-­‐05].      

Figure  5.  Docking  simulaDon  for  Cathepsin  K  -­‐  P43235.  From  up  let  to  down  right.  (A)   Decreasing   pseudo   energy   drives   system   to   higher   CMS   regions   during   the  Monte  Carlo  simulaDon.   (B)  Scamer  plot  of  pseudo  energy  vs  CMS.  Three  typical  data  points  were  selected  from  the  simulaDon  trajectory  and  their  corresponding  configuraDons  (same  color)  are  plomed  in  (C).  (D)  Close-­‐ups.  The  crystallographic  configuraDon  is  light  blue.    

Decoy

Intermediate

native-like

A

DC

B

Figure   8.  The  mapping   from   computaDonal   resources   to   domain   science   objects.  A  protein-­‐ligand  replica  is  mapped  to  a  GPU  thread  block;  atom  coefficients  are  stored  in  consecuDve  memory  addresses,  and  mapped  to  GPU  threads.  

Figure   7.   The   program’s   flow   chart.   Many  replicas   are   updated   simultaneously   using  replica   exchange   Monte   Carlo   (REMC)  algorithm.  

contact name

Yun Ding: [email protected]

P5191

category: Life & MateriaL Science - LS04