Empirical energy function Summarizing some points about typical MM force field In principle, for a...

18
Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need to be derived from scratch because once a functional group changes, it can change many things. But this will be inconvenient as you can image how many molecules are there? To get around this difficulty, a way to do is to use ‘unit molecular fragments’ to form the molecule we want to study. This way, we only need to build force field for these unit molecular fragments. (certainly, this is an approximation.) This approach has been widely used for polymer molecules like proteins and/or DNAs, etc. Q: How do we know the force fields are accurate? A: The force field parameters are chosen/optimized such that the calculated results will agree with experimental results well. Better agreement with experimental results and higher prediction ability of a force field, a better one. Note that, they are many type of carbon atoms, even in the ‘unit molecular fragments’ you choose. For example, carbons, sometimes, is singly bonded or doubly bonded etc. This will affect vdw parameters, and many other parameters. Usually, all the charges of atoms are FIXED. ( this is an approximation, considering the cis- and trans n-butane case) Then, given a geometry of a studied molecule, the energy of that molecule at that structure can be calculated. N N N H N N N N H N H 2 C

Transcript of Empirical energy function Summarizing some points about typical MM force field In principle, for a...

Page 1: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

Empirical energy functionSummarizing some points about typical MM force field

• In principle, for a given new molecule, all force field parameters need to be derived from scratch because once a functional group changes, it can change many things.

But this will be inconvenient as you can image how many molecules are there?

To get around this difficulty, a way to do is to use ‘unit molecular fragments’ to form the molecule we want to study. This way, we only need to build force field for these unit molecular fragments. (certainly, this is an approximation.) This approach has been widely used for polymer molecules like proteins and/or DNAs, etc.

Q: How do we know the force fields are accurate?

A: The force field parameters are chosen/optimized such that the calculated results will agree with experimental results well. Better agreement with experimental results and higher prediction ability of a force field, a better one.

• Note that, they are many type of carbon atoms, even in the ‘unit molecular fragments’ you choose. For example, carbons, sometimes, is singly bonded or doubly bonded etc. This will affect vdw parameters, and many other parameters.

• Usually, all the charges of atoms are FIXED. ( this is an approximation, considering the cis- and trans n-butane case)

• • Then, given a geometry of a studied molecule, the energy of that molecule at that structure can be calculated.•  • Besides the typical MM force field, there are empirical force fields which can deal with bond breaking and

formation cases.•  •  

N N

NHN

N N

NHNH2

C

Page 2: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

Molecular dynamics simulation

(Wikipedia)

Page 3: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

• For transition metal molecular systems, the empirical force field functional forms can be more complex due to complexity of d-orbitals. The bonding between metal ions and ligands can be bonded or non-bonded.

•  • Well-known force field for biomolecules: AMBER, Charmm, OPLS, etc.. Note that the force field

is still under development. The transferibility of force field at different conformations(e.g., cis and trans C-C-C-C, and ff of amino acids are derived from chosen conformations) and molecules (forming different molecules with unit molecular fragments) are not 100 % yet. Some people even argue that the force field need to be changed during MD simulation to achieve better simulation. In addition, polarizable force field is under development as well.

• In addition to the description of molecular interactions using empirical functional forms without including electrons explicitly, quantum chemistry calculation can be used to calculate molecular energy as well. This method includes electron orbitals in calculation explicitly. The calculation procedures in this "quantum chemistry calculation" can be categorized into two groups. One is semi-empirical method, the other is ab initio (this means "starting from the very beginning or the first principle") calculation method.

•  • Between the empirical MM force field and quantum chemistry calculation, there is a

combined /compromised method, named QM(quantum mechanical) /MM method. This method allows to examine local electronic structure (orbitals) in the large molecular systems.

Page 4: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

Ways to alleviate problem of sampling in MD and MC simulation

• Simulated annealing

• Parallel tempering, also known as replica exchange

Page 5: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

Simulation step

Replicate Exchange MD simulation

exchange with probability P = min{1, exp[-(E2-E1)(1/kT1 – 1/kT2)]}

• Enhance sampling. (alleviate the problem of being-trapped at a local minimum.)

• For a helical peptide of ~20 amino acids, this speeds up about 10 times, compared with conventional MD.

(PCCP)

• Remove net translational and rotational motion due to truncated error at chosen time interval.

Page 6: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

Random Number Generators/Monte Carlo Simulation

• An example of a uniform (quasi)-random number generator:

• (Fortran 77 version) • double precision function usran(ir)• c• c this subroutine generates random values between 0.0 and 1.0 using• c an integer seed• c it is based on the imsl routine ggubs.• c• c double precision version• c• implicit double precision (a-h,o-z)• parameter(da=16807.d0,db=2147483647.d0,dc=2147483648.d0)• ir=abs(mod(da*ir,db)+0.5d0)• usran=dfloat(ir)/dc• return• end

Page 7: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

Random Number Generators/Monte Carlo Simulation

• An example of a uniform (quasi)-random number generator:

• (Fortran 77 version, don’t mind the rigorousness of coding here.) • double precision function RANF(DUMMY)• c• c this subroutine generates random values between 0.0 and 1.0 using• c an integer seed• c it is based on the imsl routine ggubs.• c• c double precision version• c• Parameter ( L= 1029, C=221591, M=1048576)• DATA SEED /0/

• SEED= MOD(SEED*L+C,M)• RANF= REAL(SEED)/M• RETURN• END

Page 8: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

Protein Structure Basics

References :

1. Branden & Tooze’s “Introduction to Protein Structure (second edition)”, Garland Publishing, 1999, ISBN: 0815323050. ( 藝軒有賣 !)

2. Creighton's "Proteins: Structures and Molecular Properties (second edition)", Freeman and Company, 1993.

Page 9: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

There are several levels of protein structure

Primary structure: the sequence of amino acid residuesSecondary Structure: the polypeptide backbone conformationTertiary Structure: the three-dimensional structure of a proteinQuaternary Structure: the arrangement of one subunit relative to another in space

Page 10: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

Proteins are polypeptide chains

The basic repeating unit along the main chain is NH-CH-C’=O, which is the residue of the common parts ofamino acids after peptide bonds have been formed

Page 11: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

Amino Acids can be classified by their R groups (1)

Page 12: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

The right form is usually predominant in modelpeptides. However, which form is predominantdepends on the precise conditions in the localenvironment. Both forms (the left form & the right form) are found in proteins.

The left form The right form

His

Page 13: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

Cysteines can form disulfide bridges

Page 14: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

The conformation of the main-chain atoms is therefore determined bythe values of phi (), & psi () angles of each aminoacid.

Page 15: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

Ramachandran Plot

(a): a result from calculations of sterically allowed regions

Page 16: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

(b) Observed values for allResidue types except glycine

(c) Observed values for glycine: note that the values include combinations of and that arenot allowed for other amino acids.

Glycine residues can adopt many different conformations

Page 17: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.
Page 18: Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.

Rotamers: Most side chains have one or a few conformations thatoccur most frequently than theother possible staggered conformations. These are called rotamers. Today, collections ofthese favored conformations,or rotamer libraries are a standard tool in computer programs used for modelingprotein structures