P. E Oak Ridge, TN 37831-6224 of TN

8
CLASSICAL MOLECULAR SIMULATIONS OF COMPLEX, II"DUSTRIALLY-IMPORTANT SYSTEMS ON THE INTEL PARAGON H. D. cochran and P. E Lncascio Chemical Technology Division Oak Ridge National Laboratory* Oak Ridge, TN 37831-6224 P. T. Cummings Department of Chemical Engineering Knoxville, TN 37996-2200 University of Tennessee . Presented and published: Intel Supercomputer User Group (ISUG '96), Khoxville> Tennessee June 19-22> I996 Submitted to: J. Comp. &Math June, 2996 The submitted manuscript has been authored by a contractor of the US. Government under contract No. DE-ACO5-96OR22464. Accordingly, the US. Government retains a nonexclusive, royalty-pee license to publish or reproduce the published form of this contribution, or allow others to do so, for U.S. Governmentpwposes. *Managed by Lockheed Martin Energy Research Corp. for the U.S. Department of Energy under contract number DE-AC05-%OR22464. MASTER

Transcript of P. E Oak Ridge, TN 37831-6224 of TN

Page 1: P. E Oak Ridge, TN 37831-6224 of TN

CLASSICAL MOLECULAR SIMULATIONS OF COMPLEX, II"DUSTRIALLY-IMPORTANT SYSTEMS ON THE INTEL PARAGON

H. D. cochran and P. E Lncascio Chemical Technology Division

Oak Ridge National Laboratory* Oak Ridge, TN 37831-6224

P. T. Cummings Department of Chemical Engineering

Knoxville, TN 37996-2200 University of Tennessee .

Presented and published: Intel Supercomputer User Group (ISUG '96), Khoxville> Tennessee

June 19-22> I996

Submitted to:

J. Comp. &Math June, 2996

The submitted manuscript has been authored by a contractor of the US. Government

under contract No. DE-ACO5-96OR22464. Accordingly, the US. Government retains a

nonexclusive, royalty-pee license to publish or reproduce the published form of this contribution, or allow others to do so, for U.S. Government pwposes.

*Managed by Lockheed Martin Energy Research Corp. for the U.S. Department of Energy under contract number DE-AC05-%OR22464.

MASTER

Page 2: P. E Oak Ridge, TN 37831-6224 of TN

CLASSICAL MOLECULAR SIMULATIONS OF COMPLEX, INDUSTRIALLY-IMPORTANT SYSTEMS ON THE INTEL PARAGON

H. D. Cochran and P. F. LoCasciot Chemical Technology Division Oak Ridge National Laboratory Oak Ridge, TN 37831-6224

P. T. Cummings Chemical Engineering Department University of Tennessee Knoxville, TN 37996-2200

Advances in parallel supercomputing now make possible molecular-based engineering and science calculations that will soon revolutionize many technologies, such as those involving polymers and and those involving aqueous electrolytes. We have developed a suite of message-passing codes for classical molecular simulation of such complex fluids and amorphous materials and have completed a number of demonstration calculations of problems of scientific and technological impr- tance with each. In this overview paper we will outline the techniques for for classical molecular simulation of these industrially-important systems on the Intel Paragon and we will summarize some of the important scientific and technical results of the varied applications, including the following: 1) Parallel codes for quaternion dynamics using techniques for handling long-range Coulombic forces allow study of ion pairing in supercritical aqueous electrolyte solutions.'-3 Ion pair- ing lies at the heart of technological problems with corrosion and solids deposition in industrial processes utilizing high temperature water. 2) Non-equilibrium, multiple time step molecular dynamics lets us investigate the rheology of molecular fluids.&' Such calculations enable the molecular-based design of new synthetic lubricants of importance in the automotive engines of the future. 3) Cbain molecule Monte Carlo simulations in the Gibbs now permit calculation of phase equilibrium of long-chain molecular systems. With complementary equilibrium molecular dynamics (with multiple time steps) we have been able to gain fundamental insight into the technologically-important problem of liquid-liquid phase separation in polymer blends."

Computational Techniques

Figure 1 is a cartoon showing a trajectory through the phase space. The abscissa represents the 3N-dimensional position coordinates of the N particles (say, atoms) in a system. The ordinate represents the 3N-dimensional momentum coordinates Of the particles. The trajectory, then, traces the evolution with time of the positions and momenta under the influence of the total potential energy of the system which consists of the potential energy of each particle due u) its position in any extern2 fields plus the potential energy of interaction of the particles with one another, including both bonded and non-bonded interactions. Classical molecular dynamics simulation solves Newton's equations of motion for such a system for a small portion of the trajectory. Application of the laws of statistical mechanics, then, allows calculation of the bulk properties of the substance from the positions and momenta of the particles. An alternative approach to classical molecular simulation samples randomly (rather than deterministically) from the points in the configuration space of ?he system according to their probability distribution; this technique is called Monte Carlo simulation. The general tecbniques of classical molecular shulation on sequential computers are described by Allen and Tildesley.'2 Usually, to make the computation tractable, a number of approximations are employed: interactions among particles are treated pairwise, the range of such interactions is truncated, and small systems 0(1@ - IO3 particles) are surrounded on all sides by periodic images of themselves to elim- inate surface effects. The great bulk of the computational work is in calculating the painvise interactions. The use of neigh- bor lists and/or link cells can reduce this to an 0 0 problem. Even with these simplifications, classical molecular simula- tions are challenging computations, and until the advent of massively parallel supercomputers such as the Intel Paragons, calculations such as those described here, on many industrially-important systems, were beyond the horizon of practicality.

Figure 2 shows two approaches to parallel molecular simulation calculations (for clarity of presentation a 2-dimensional system is shown). Figure 2a shows a domain decomposition approach in which the area (volume in 3-D) containing the sample of particles constituting the simulated system is divided into subareas (subvolumes). In a molecular simulation per- formed using domain decomposition, the particles within each subdomain (and their positions and momenta) are assigned to a separate processor which calculates the interactions among particles within its subdomain. Then, movement of parti- cles between subdomains and interactions which extend into neighboring subdomains are communicated each timestep. For bomogeneous systems (single phase in the absence of external fields), this domain decomposition is intrinsically load- balanced. Pinches et a ~ , ' ~ presented a domain decomposition code for classical molecular dynamics of simple atomic sys- tems that showed good scaling. As might be expected, domain decomposition approaches permit the simulation of very large systems, spreading the memory required for the positions and momenta of the many particles over the local memory of a number of processors. However, the computation of the pairwise interactions of the particles in the subdomain of each p r ~ o r is performed only at the speed of a desktop workstation; so, large but not long long simulations are possible using domain decomposition.

t currently Center f a Computational Sciences, OWL.

Page 3: P. E Oak Ridge, TN 37831-6224 of TN

- 2 -

figure 2b shows a replicated data approach to molecular simulation. In this approach identical representations of the entire syskm are replicated in the local memory of a number of processors. Then, the computational work of evaluating the pair- wise inkractions is divided among thc processors and Ihc local results are summed. 7he interaction is evaluated within a double loop b a t compares uniquely every particle with every other pamcle. The computational work is divided up among processors by wsigning a portion of the range of the outer loop to each processor. These calculations can be done in paral- lel, and the simplicity of the division permits some straightforward load balancing to take place. Improved load balancing divides the work for a set of P processors, as if we had 2P available nodes. One then assign iterations from oppositc ends of the outer loop, in effect, giving each processor a large and a small portion to work on. Clearly, this replicated data approach allows small systems (so that all data fit in local memory) to be simulated for long times. The latency of global communica- tion (in summing the local contributions to the total interaction) limits the ullimate scaling of the replicated data approach. Current implementations on the Paragon Limit the scaling to the range lox to 1OOx.

Figure 3 illustrates the complementary advantages and disadvantages of the two approaches. Unfortunately, for real physi- cal systems, larger system size requires simulation for longer times. For example, one part of a system can communicate its influence to other parts of the system no faster than tbe speed of sound so large systems must be simulated for long enough time for sound to propagate from one extremity to another, else the parts could be simulated independently. In fact, most large systems require much longer simulations than this to encompass the range of timescales intrinsic to the problem. For example, in a large protein molecule the time constant of the interatomic interactions (which ultimately determine all of the dynamics) is of order femtoseconds while the time constant for folding of the entire protein molecule (which determines its biological function) may be of order seconds or longer. Substantial improvements m both parallel supercomputer perfor- mance and molecular simulation programs will be required before the dynamics of protein folding can be simulated atom- i s t i d y .

Because calculations of the painvise interaction constitute the bulk of the computational cost in al l cases, we have invested m making that part of the calculation as efficient as possible. The first step was to rewrite €he source code so that it vector- Ues to the greatest extent possible and so that the cache is utilized as effectively as possible. Then, assembler routines for the kernel calculations were written. These steps have achieved @-up ranging from 4x to lox on our various calcula- tions. We have managed to achieve 60 MFLOPS on each node (30 MFLOPS/processar) in some of these kernel calcula- tion~.'~ Other massively parallel supercomputers utilize microprocessors for which far better compilers have been written than are available for the i860, so, such specialized coding might not be needed and might not yield such significant perfor- mance improvements.

Industrially-Important Applications

We describe briefly some of the significant scientific and technological results from classical molecular simulations of industrially-important systems on the Intel Paragons. These applications are described more fully elsewhe~e.'~'~ Addi- tional computational techniques have been required to meet the demands of these specific applications; these more special- ized techniques are described briefly with the appropriate application.

Zon Pairing-Ion pairing lies at the heart of technological problems with corrosion and solids deposition in industrial processes utilizing high temperature water; because of severe corrosion problems, experimental study has not been possible for many important cases. In our molecular simulations, water has been modeled as a small rigid rotator with imbedded charges. We have employed quaternion coordinates to avoid the difficulty of trigonometric functions (which go to zero at some angle) in the denominator of the usual equations of Newtonian dynamics. Tbe long range of Coulombic forces requires special treatment. We have employed two approaches. The more efficient technique involves truncating the forces at some distance and assuming a uniform dielectric beyond the cutoff (the reaction field technique). A more rigorous but more costly technique uses Ewald summation for the forces beyond the cutoff. W e have developed both replicated data and domain decomposition codes for such simulations. m e domain decomposition approach has allowed study of the very large systems which are required with dilute electrolytes.

Figure 4 shows the probability of finding an oxygen atom at a given distance from anotha oxygen atom normalized to the bulk average probability (the intermolecular oxygen-oxygen pair correlation function) from a simulation of 42,592 water molecules at a supercritical state. Also shown are intermolecular hydrogen-hydrogen and oxygen-hydrogen pair correlation functions. Tbe very good statistics fiom such a large simulation yidd smooth curves that can be precisely compared with the results of neutron scattering experiments. Both our simulations and the experiments of a European group have been improved through efforts to quantify the degree of hydrogen-bonding in supercritical water which can be inferred from such results. Figure 5 shows the pair correlation function between sodium ions and chloride ions in a dilute, high- temperature aqueous solution. Because the solution is dilute, the results show considerable statistical uncertainty even in the largest aqueous electrolyte simulation ever performed. Tbe sodium chloride dissociation constant, calculated from these results, is in good agreement with exuapolations fonn experiments performed at less severe conditions.

Page 4: P. E Oak Ridge, TN 37831-6224 of TN

- 3 -

Rheologv ofAlkanes-Thc variation of the viscosity of lubricants with temperature and strain rate (rheology) is a techno- logically important problcm for which existing scientific understanding cannot give needed quantitative predictions. To study rhcology of lubricant molecules (alkanes) by molecular simulation we have used the techniques of non-equilibrium molecular dynamics with both replicated data and domain decomposition codes. Figure 6 shows the two approaches (deforming-cell and sliding-brick) we have employed to handle the periodic boundary conditions when a velocity profile is imposcd. The deforming-cell approach facilitates communications in doinain decomposition codes, and the sliding-brick approach is simplest in replicated data codes. Chain molecules like the n-alkanes have internal degrees of freedom (bond vibration, bond-angle bending, and torsion about the dihedral angle) which occur on a faster timescale than the overall molecular motions. A Trotter factorization of the Liouvillian of the motion permits the use of small time steps in the integration of the fast motion and larger time steps in the integration of the slower motion. The combination of all of these techniques ha% enabled calculation of the rheology of lubricants on the Intel Paragons.

Figure 7 shows the viscosity of the normal alkanes with 1, 10, 16, and 24 carbon atoms versus shear strain rate, The observed changes in the transition from Newtonian to nowNewtonian behavior can be understood in terms of the modes of intramolecular and intermolecular motion which can be visuahzed from the simulation results. The smallest molecules show behavior in agreement with mode-mode coupling theory, and the longer chains show behavior consistent with Rouse dynamics. Studies in progress of branched alkanes will enable lubricant manufacturers understand how molecular structure effects properties and to design improved synthetic lubricants for the high-speed, high-temperature automobile engines of the future.

Phase Separarion in Pofymr Blends-By mixing togethex two polymers it is possible to make new materials (polymer blends) with new properties but without requiring new synthesis. However, such systems have a tendency to phase separate. Theoretical and computational approaches to studying this problem are actively sought. The most direct molecu- lar simulation method to studying phase equilibrium employs Monte Carlo simulation in the Gibbs ensemble.'* Experimen- tal studies often employ small angle neutron scattering to determine the x parameter, which is a measure of the interaction between chain segments. Behavior not predicted by theory, composition dependence of the x parameter, which has been observed experimentally can be studied by molecular dynamics employing the same technique to detennme x as is used in analyzing experimental data, and the unexpected result can be understood. Both techniques involve simulations of long- chain molecules; Figure 8 shows how we can handle such molecules in domain decomposition codes. m e part of the molecule within the subdomain of each processor is treated as an independent molecule, and its bonded interaction with its other part in the subdomain of another processor is handled just like the interaction between any pair of particles, one in each subdomain. We have implemented several approaches to parallel Monte Carlo simulations. A domain decomposition approach employs the Time Warp algorithm for parallel discrete event ~imulation'~ in which each processor calculates independently until it receives a message denoting interaction from another processor. Events must be kept in correct sequence; so, if the receiving processor has progressed in sequence beyond the point of the received interacting event, it rolls back its sequence, inserts the interacting event, and begins calculations anew fiom that point. A replicated data approach2o attempts P ambitious Monte Carlo trial moves on P processors in parallel. Tbe most favorable move is tested for acceptance, the bias being corrected for in the acceptance criterion. The maximum speed-up in the replicated data approach is limited by the global communication latentcy just as in the molecular dynamics case.

Figure 9 shows the results of vapor-liquid phase equilibrium calculations for model chains of 32, 100, and 500 segments, results which are well beyond those previously attainable. Figure 10 shows the x parameter as a function of composition for two model polymer blends. From the simulations we can determine the relative contributions of effects of 1) isothermal compressibility, 2) excess volume of mixing, and 3) composition fluctuations to the obsenred composition variation of whicb is not predicted by Flory theory.

These calculations have been performed on Intel Paragon supercomputers at Oak Ridge National Laboratory. Based upon these demonstrated capabilities, we have established an ORNLRTT Joint Institute for Molecular-Based Engineering and Science to make these new computational capabilities available for academic, government, and industrial research and development. Further information is available on the Web at http://flory.engr.utkedundrd/projectfldtd-ann~95.h~ and http://flOry .engr.utk.edu/ldrd/w .html.

Acknowledgments I W e are pleased to acknowledge the the many contributions of A. A. Chialvo, S. T. Cui, R K. Bhupathiraju, S. A. Gupta, F. A. Escobedo, M. J. Greger, and J. J. de Pablo, upon whos outstanding work this paper is based.

The work of HDC has been supported by the Division of Chemical Sciences of the U. S. Department of Energy (DOE) at Oak Ridge National Laboratory (ORNL). This work was sponsored by the Laboratory Directed Resear& and Development Rogram of ORNL. 'Ihe authors acknowledge the use of the Intel Paragon supercomputers in the Center for Computational Sciences at ORNL, funded by the DOE'S Mathematical, Information, and Computational Sciences Division. ORNL is managed by Lockheed Martin Energy Research COT. for the DOE under Contract No. DE-ACO5-96OR22464.

Page 5: P. E Oak Ridge, TN 37831-6224 of TN

- 4 -

References 1. A. A. Chialvo, P. T. Cummings, and H. D. Cochran, "Solvation Structure, Hydrogen Bonding, and Ion-Pairing in Dilute Supercritical Aqueous NaCI Mixtures," Int. J. of Thermphys., 17, 147-156 (1996). 2. A. A. Chialvo, P. T. Cummings, H. D. Cochran, J. M. Simonson, and R. E. Mesmer, "Na+/CI- Ion Pair Association in Supcrcritical Water," 3. Chem. Phys. 103,9379-9387 (1995). 3. A. A. Chialvo, P. T. Cummings, H. D. Cochran , J. M. Simonson, and R.E. Mesmer, "Solvation, Hydrogen Bonding, and Ion Pairing in Supercritical Aqueous Solutions. Simulation and Experiments", Innovations in Supercrificul Fluids: Science and Technology, K. Hutchenson and N. Foster eds., ACS Symp. Ser. 608,676 (1995) 4. S. T. Cui, P. T. Cummings, and H. D. Cochran, "The Calculation of the Viscosity from the Auto-Correlation Function Using Molecular and Atomic Stress Tensors, Mol. Phys., accepted (1996). 5. S. T. Cui, P. T. Cummings, and H. D. Cochran, "Multiple Time Step Nonequilibrium Molecular Dynamics Simulation of the Rheological Properties of Liquid n-Decane," J. Chem. Phys., 104,255-262 (1996). 6. R. K. Bhupathiraju, P. T. Cummings, and H. D. Cochran, "An Efficient Parallel Algorithm for Nonequilibrium Molecu- lar Dynamics Simulations of Very Large Systems in Planar Couette Flow," Mol. Phys., accepted (1996). 7. S. T. Cui, S. A. Gupta, P. T. Cummings, and H. D. Cochran, "Molecular Dynamics Simulations of the Rheology of Nor- mal Decane, Hexadecane, and Tetracosane," J. Chem Phys., accepted (1996). 8. F. A. Escobedo and J. J. de Pablo, "A New Method for Generating Volume Changes in Isobaric-Isothermal Monte Carlo Simulations of Flexible Molecules," Macromol. Theory Simul., 4,691-707 (1995). 9. F. A. Escobedo and J. J. de Pablo, "Simulation of Chain Molecules for Prediction of Thermodynamics Properties," Fluid Phase Equilib., in press (1995). 10. F. A. Escobedo and J. J. de Pablo, "Simulation and Prediction of Vapor-Liquid Equilibrium of Chain Molecules," Molecular Phys., in press (1995). 11. S. T. Cui, P. T. Cummings, H. D. Cochran, and S. K. Kumar,"Structural and Mixing Properties of Polymer Blends Studied by Molecular Dynamics Simulation," manuscript in preparation. 12. M. P. Allen and D. J. Tildesley, Molecular Simulation of Liquids, Clarendon Pres, Oxford, UK (1987). 13. M. R. S. Pinches, D. J. Tildesley, and W. Smith, "Large Scale Molecular Dynamics on Pardel Computers using the Link-cell Algorithm," Mol. Sim, 6.51-87 (1991). 14. P. F. Locascio, "High Performance Molecular Dynamics Calculations for the Intel Paragon," presented at the Intel Supercomputer User Group Conference, Knoxville, TN, June 19-22 (1996). 15. S. A. Gupta, S. T. Cui, P. T. Cummings and H.D. Cochran "A Nonequilibrium Molecular Dynamics Study of the Rheology of Alkanes," presented at the Intel Supercomputer User Group Conference, Knoxville, TN, June 19-22 (1996). 16. S. T. Cui, P. T. Cummings, and H. D. Cochran, "Molecular Dynamics Simulations of Polymer Blends on the Paragon," presented at the Intel Supercomputer User Group Conference, Knoxville, TN, June 19-22 (1996). 17. R. K. Bhupathiraju, P. T. Cummings, and H. D. Cochran, "Nonequilibrium molecular dynamic simulations on mas- sively parallel supercomputers," presented at the Intel Supercomputer User Group Conference, Knoxville, TN, June 19-22 (1996). 18. A. 2. Panagiotopoulos, "Direct determination of phase coexistence properties of fluids by Monte Carlo simulation m a new ensemble," Mof. Phys., 61,813 (1987). 19. R. M. Fujimoto, "Pardel discrete event simulation." C o r n ACM, 33,30-53 (1990). 20. K. Esselink, L. D. J. C. Loyens, and B. Smit, "Parallel Monte Carlo simulations," Phys. Rev. E, 51,1560-1568 (1995).

Page 6: P. E Oak Ridge, TN 37831-6224 of TN

a IQ

r!

I

position, F"

Figure 1. Trajectory through phase space.

1

3 3 1 E z E F Y

System S i

Figure 3. Advantages and disadvantages. IO N a ' C and 10956 SPC water molccala

T' = 1.05 p' = 1.m 15.0 I I .

5.0 r

0 5 10 15 20

Figure 5. Ion pairing of NaCI. r/a

0 .;*

* * :.

P l

Figure 2. a) Domain decomposition. b) Replicated datr

42592 SPC water molecules

T* = 1.05 p* = 1.00 3.000 &-, . . . . I . . . . , . . . , . . . - j

2.500 g(r)

2.000

1 so0

1 .ooo 0.500

0.000

;. $1 t

1 . 1 : . . . . I . . . . I . . . . I . . . - 0 1 2

Figure 4. Supercritical water.

3 4

ria

5

Figure 6. Defonnin_p-cell and sliding-brick BCs.

Page 7: P. E Oak Ridge, TN 37831-6224 of TN

I e **-3e9

1.0 L - 4 4

o.oO01 0.001 0.01 0.1 1 10' Dimensionless Srtain Rate

Figure 7. Rheology of some a-alkanes.

+ E + Q

0

+a

x

Figure 8. Chain spanning three subdomams.

O . l O [ . . . * , . . . ' 1 - - . . , . . . @ 0.00

-0.10 -

-0.40 y 9 1 -0.50 a

0 ao5 ai 0.1s 0 2 015 os 0 0.2s 0.5 0.75 1 0 P.aks--

Figure 9. Vapor-liquid phase equilibrium of cham molecules.

Figure 10. Composition dependence of x parameter.

DISCLAIMER

This report was prepared as an account of work sponsored by an agency of the United States Government. Neither the United States Government nor any agency thereof, nor any of their employees, makes any warranty, express or implied, or assumes any legal liability or responsi- bility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. Refer- ence herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise does not necessarily constitute or imply its endorsement, recom-

' mendation, or favoring by the United States Government or any agency thereof. The views and opinions of authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof.

Page 8: P. E Oak Ridge, TN 37831-6224 of TN

DISCLAIMER

Portions of this document may be illegible in electronic image products. Images are produced from the best available original document.