Home
        DLPROTEIN 1.2 User Manual - KIST computational science
         Contents
1.       N    1 1 4 Adding entries to the PS file    The first line of this file indicates the force field type  charmm or gromos   More   over  a line is skipped if starts with the character         while each new entry is  indicated by the special string BLOCK  The order of appearence of the BLOCKs  must be the same as the PS file included in the distribution  Atomic labels are    consistent with those specified in the fourth column of the MBBT file     11    The database contains the following data     Number of records in the atom list   Atom names  a4   masses  f12 0  and repetitions  i5   Number of records in the bonds list   Prefactor for scaling the Van der Waals data between pairs    Bond pair names  2a4   force constants  f12 0  and equilibrium dis     tances  f12 0   Number of records in the angle bends list  Prefactor for scaling the Van der Waals data between atomic triplets    Angle triplet names  3a4   force constants  f12 0  and equilibrium an     gles  f12 0   Number of records in the general proper dihedrals  GPD  list    Prefactor for scaling the Van der Waals data between atomic quadru     plets of the GPD list    Potential type  cos   scaling factor for electrostatics and Van der Waals    interactions    GPD quadruplet names  4a4   force constants  f12 0   phase  f12 0   and multeplicities  f12 0      Number of records in the specific proper dihedrals  SPD  list    Prefactor for scaling the Van der Waals data between atomic quadru     plets of the SPD list  
2.     according to the Berendsen scaling factor is prop_ber     19    The Evans constraint on velocities  that allows to sample the isokinetic    ensemble  is achieved in the routine evans_scale     2 1 3 Constraints treatment    In the VV scheme treatment of the holonomic constraints is handled in a similar  way to the LF scheme  The main difference relies in imposing simultaneously the    condition of satisfied constraints    on  r      0 k 1 M  2 17     as handled by the routines rdshake_1 and sca_shake  SHAKE procedure  and    condition that momenta are orthogonal to the constraints surfaces  on   ri  Pit   0  2 18     as handled by the routines quench and guench1  known as the RATTLE proce   dure   We have modified the preexisting rdshake_1 routine into the new sca_shake  routine  The change has been made in order to solve the iterative sequence of con   straints by cyclic update of atomic positions  3   At the moment this procedure is  possible only on scalar architectures  Viceversa the previous rdshake 1 is applied  on the full sequence of constraints in a different way and it has been motivated by  parallelisation reason  1   The sca_shake  althought running on scalar platforms   converges in about half the iterations than the parallel version  In an analogous  way the new sca_quench routines uses a cyclic update of atomic velocities    For the NVE and NVT ensembles the sequence of propagators and con     straints impositions is  h h     A A  exp is Lan R exp izLe S ex
3.   12    Potential type  cos   scaling factor for electrostatics and Van der Waals    interactions    SPD quadruplet names  4a4   force constants  f12 0   phases  f12 0   and multeplicities  f12 0      Number of records in the general improper dihedrals  GID  list    Prefactor for scaling the Van der Waals data between atomic quadru     plets of the GID list    Potential type  harm   scaling factor for electrostatics and Van der    Waals interactions    GID quadruplet names  4a4   force constants  f12 0   and equilibrium  angles  f12 0      Number of records in the specific proper dihedrals  SID  list    Prefactor for scaling the Van der Waals data between atomic quadru     plets of the SID list    Potential type  harm   scaling factor for electrostatics and Van der    Waals interactions    SID quadruplet names  4a4   force constants  f12 0   and equilibrium  angles  f12 0      Number of records in the general Van der Waals parameters list  General Van der Waals parameters  Number of records in the specific Van der Waals parameters list    Specific Van der Waals parameters    13    e Number of records in the general 1  4 Van der Waals parameters list  e General 1  4 Van der Waals parameters  e Number of records in the specific 1  4 Van der Waals parameters list    e Specific 1  4 Van der Waals parameters    c    Here    general dihedrals     both proper or improper  specifies the quadruplets hav   ing specified a reduced number of atoms defining the dihedral  The wild card  sy
4.   z     2 5   Each operator is applied sequentially and gives rise to a shift  o  expla  f  2    f x  a   2 6     In DLPROTEIN the single time step molecular dynamics is hadled by the routine    sloop  and the action of the propagators is performed by the routines    A  respa_v_verlet   gt  exp izLe    respa_r_verlet  gt  exp  ihL      applied according to the Trotter splitting  The same routines are called when  performing NVT and NPT simulations with slight modifications for the NPT case    as explained in the following     2 1 1 NVT ensemble    DLPROTEIN implements the canonical ensemble and the isobaric isothermal en   semble with the Nos   Hoover style equations of motion 5   Other kinds of statis     tical behaviours  such as the the Evans and Berendsen schemes can still be used     17    The Nos Hoover NVT equations of motion read       p  tS eS   m  p   F xp   1 T  aes   2 7  X a          Accordingly the integration scheme is given by the splitting    exp  ihL    exp  rxn  exp  ihL Ham  exp Gag  2 8   where  o EE     ius oee enl 2 9  ve Pap    Th G Ox 20    For the NVT and NPT dynamics we make use of the general formula when applying    the effect of the thermostat and piston on the momenta and positions    exp GC  b  2    exp a x   bexp a 2   2 10     The sinh zx  x function is computed by expanding numerator and denominator to  a 8th order Taylor expansion    The Nos   Hoover propagator is further symmetrically splitted in order to  deal with intrinsic momenta depen
5.  and answer a few questions  Infos and warnings    will be issued as the molecular construction proceeds     The dlgen utility supports the    United Atom     UA  scheme  i e  CH  CH2  and CH  group are treated as united groups  when using the GROMOS87 force  field  and the    All Hydrogens     AH  scheme  polar and non polar hydrogens are  always explicitely treated  when using the CHARMm22 force field    dlgen uses an internal database to store the parameter set  PS  and the  molecular building block topology  MBBT   e g  the structure of the elementary  units  such as the amino acids  The PS file stores      atomic names  masses and connection tables      bond reference list  spring constants and eguilibrium distances      angle reference list  spring constants and eguilibrium angles      dihedral reference list  multiplicity  force constants and phases      Van der Waals parameters    The MBBT files stores the group names  internal connectivity and atomic    charges  In table 1 1 we report the database file names     PS file name MBBT file name  GROMOS87    PS gromos37c ff MBBT gromos37c ff  CHARMM22 PS charmm22 ff     MBBT  charmm22 ff    Table 1 1  Database file names for the GROMOS87 and CHARMM22 force  fields  PS   parameter set  MBBT   molecular bulding block topology    1 1 dlgen    The dlgen utility must be run with a pdb file that specifies a single molecule  i e   each atom of the molecule must be globally connected by chemical bonds     dlgen performs in order t
6.  for the grid density must be taken  So   it is recommended to determine the a and n parameters manually  as described in  the following  and as almost identically described in the DLPOLY manual     Preselect the value for rew  choose a value for a of about 3 2 reut and a    large value for n  say 50 50 50 or more   Then do a series of ten or so single    27    step simulations with your initial configuration and with a ranging over the value  chosen plus and minus 20   Plot the Coulombic energy  and the coulombic virial  with opposite sign     W  versus a  If the SPME is correctly converged you will  see a plateau in the plot  Divergence from the plateau at small a is due to non   convergence in the real space sum  Divergence from the plateau at large a is due to  non convergence of the SPME term  Redo the series of calculations using smaller  n values and by choosing the n parameters such that they can be only divided by  2 3 or 5  e g  50  48  45        The optimum values for n are the smallest values  that reproduce at the value of a to be used in the simulation the correct Coulombic  energy  the plateau value  and virial  Keep in mind that the virial depends on  forces and it converges more slowly than the energy   i e  one usually requires a    higher precision in the energy  say 1076  than in the virial     28    Chapter 3    Other differences between    DLPOLY and DLPROTEIN    3 1 Non bonded interactions    In DLPROTEIN the actual number of available non bonding poten
7.  polar hydrogens and acceptors to detect Hbond formation in run   time  The format of the FIELD file regards the atomic species description  and  appears as  a8 2f12 0 3i5 a3   see table 3 13  The keyword hprop is equal to             d      for a donor    h     for a polar hydrogen attached to a donor       a    for an  acceptor     d a    for an atom with both acceptor and donor propensity    The mandatory rule to use when specifying polar hydrogens is that all  polar hydrogens attached to a donor must follow in order the donor specification     Absence of a proper triplet will be set to           by default     36    3 14 Directives in the Makefile    In this section fwe describe the Makefile directives added or changed with respect    to the original DLPOLY2 0 package     1  VECTLIST  This macro compiles the code to run with a 1 d array for storing the  neighbors table  It is usable only with scalar architectures  It allows  an optimal save in memory and CPU time  The arguments are   e VECTLIST   1 d array  default   e NOVECTLIST   2 d array    2  STORE  This macro compiles the code to run with a 2 bytes declaration of  neighbor arrays  It allows a factor 2 in savins memory  Allowed only  for number of atoms less then 32767  The arguments are   e STORE2   INTEGER 2 declaration default   e STORE4   INTEGER 4 declaration    3  FFT  This macro uses a public domain or library versions of the Fast Fourier  Transforms to apply the SPME method  The arguments are   e PUBFFT   for
8.  serial public domain FFT  default   e SGIFFT   for serial Silicon Graphics complib library  e CRAYFFT   for serial Cray libraries    e PCCFFT   for parallel library with the pccfft3d distributed routine       available     4  SHAKE    This macro forces the use of the standard shake algorithm to treat    37    constraints on parallel architectures  constraints are not distributed  and all processors perform the same task   The argument are     e SERIALSHAKE   to force serial shake with parallel runs   e NOSERIALSHAKE  to use distributed shake algorithm  default        INTERPOLATE   Switch of the calculation of potentials and forces between 3 point in   terpolation and cubic spline  Arguments     e SPLINE   Spline potential   e NOSPLINE   3 point interpolation  default        REFOLD   Controls molecules refolding in the primary box when using periodic  boundary conditions  Arguments     e REFOLD   Molecules are refolded  i e  broken  in the primary cell  e NOREFOLD   Molecules are not refolded  default      38    3 15 Examples    A few examples are provided together with DLPROTEIN  The examples are kept    in the    examples    directory and they are      1  System composed of 500 argon atoms simulated for 1000 steps   Ensemble   NVE  Van der Waals interaction   shifted   smoothed  Direct evaluation  No electrostatics    Scanning   all pairs basis    2  System composed of 256 SPC water molecules  100 steps   Ensemble   NVT  Nos   Hoover   Van der Waals interaction   shifted   
9.  unshifted potential will be exactly the same apart  from a fluctuating term in the total energy  Consistently with the shifted potential  model  the long range corrections to energy and pressure are set to zero  In this  case the dynamics of the system is affected when performing NPT dynamics since  in the unshifted system the long range correction to pressure is part of the pressure    tensor and its act as a pseudo force for the volume rearrangement     3 6 Smoothing the Van der Waals interactions    at the cut off    A switching function can be used in order to eliminate discontinuities in the forces  at the cut off for the Van der Waals potential  Switching is obtained by a 3   rd order polinomial function that switches off the potential energy in the range   recut     dswch   lt  r  lt  rcut  where dswch is the healing length as specified in the    CONTROL file by the directive ljswitch     31    3 7 Hydrogen bond detection    In DLPROTEIN it is possible to analyse the presence of hydrogen bonds in the  system during the run  The detection of hydrogen bonds is stored in the output  file HBOND dat in formatted  2015  style  as the list of atomic indices of triplets  of donor hydrogen acceptor    Hbond formation is controlled by the directive hbond n dist angle in the  CONTROL file  where n is the frequency for dumping the detected Hbonds  dist  is the distance between donor and acceptor and angle is the angle formed between  the vector connecting hydrogen donor and the v
10. 0 0  rvdw 10 0    mult 232  2 25     is a possible choice for a multiple time step simulations  The multiple time step    algorithm is handled by the routine mloop     23    2 3 Scanning neighbors    DLPROTEIN retains the previous capabilities of DLPOLY to scan neighbors on a     all pairs     AP  basis by using a Brode Alrich algorithm to order the interacting  pairs and eventually split them on several processors    A direct scanning method based on the Link Cell  LC  method  2  is also  been preserved  but DLPOLY supported only cubic or orthorombic PBC for the  LC method  The LC method allows a drastic raise in CPU efficiency for large  systems since CPU time grows linearly with the system size  AP is quadratic      Those scanning methods have been modified in order to    e implement AP and LC on scalar platforms and optimize CPU and    memory use     e extend LC to different PBC   s  truncated octahedron and dodecahe     dron     Actually in the CONTROL file it is possible to chose whether to use AP or LC  method  The LC method is applied with different methods if the run is scalar or  parallel  In the case of a parallel run LC is activated only if it produces effective    CPU optimization and this condition is true if  cutoff  lt  min Lz  Ly  Lz  3  2 26     With a scalar run the condition is less strict  If LC is not usable  eventually due  to a cell shrinking as time proceeds with an isobaric run  automatic switching to    AP scanning is performed and the user is infor
11. 00 27 260 13 390  ATOM 1190 OT1 LYH 128 26 420 28 360 13 350  ATOM 1191 OT2 LYH 128 26 460 26 250 12 940    You need to change it into      ATOM 1 NTERALAZZ 1 15 850 25 550 23 340  ATOM 2 HT1 ALA 1 15 520 25 140 24 200  ATOM 3 HT2 ALA 1 15 700 26 540 23 360  ATOM 4 HT3 ALA 1 15 360 25 150 22 570  ATOM 5 CA ALA 1 17 270 25 280 23 200    ATOM  ATOM  ATOM  ATOM    1188  1189  1190  1191    HZ3 LYH  CTERLYH  OT1 LYH  OT2 LYH    128  128  128  128    31 830  27  000  26 420  26 460    21 930  27 260  28 360  26 250    16 050  13 390  13 350  12 940    1 1 3 Adding entries to the MBBT file    In the MBBT database the first lines concern the gromos vs charmm signal  title  and number of subsequent block entries   For each entry the format is specified as in the following example for the    alanine residue      residue   ala  atoms   6  1 N N  0 28  1H H  28  1CA CH1  0    2  3  4 3CB CH3  0  5 3C C  38  6    50 0    38    The first column  i4  is the atom indexing  second column  i4  is the index of  the preceeding atom linked to the current atom  The general rule is that each  attachment specifies only links with previously defined atoms  If the second col   umn index is omitted  since it concerns interblock connectivity  or if the complete  connectivity cannot be specified  as is the case for aromatic rings  dlgen builds the  connectivity by using geometric rules  as coordinates are extracted from the pdb  file    The third column  a4  regards the pdb atomic names  while the fourt
12. DLPROTEIN 1 2 User Manual    Contents    Overview    1    2    Building the molecular topology    el sieti  digi ge eck ae Re n KEN a  1 1 1  lt digen input files 0 4 5 ec en Kv  1 1 2 Editing the pdbcfiles  lt  lt  sa a      x    1 1 3 Adding entries to the MBBT file    1 1 4 Adding entries to the PS file       1 2 merge field  lt 2 Mending amo aran Weng    The molecular dynamics code    2 1 Integration Algorithms             2 1 1 NVT ensemble             2 1 2 NPT ensemble             2 1 3 Constraints treatment          2 2 RESPA implementation              2 3 Scanning neighbors 1 40 RAA A    10  11  14    2 3 1 Memory optimization on scalar workstation            2 3 2 CPU optimization on scalar workstation              2 4 The Smooth Particle Mesh Ewald method                 2 4 1 Choosing the SPME variables   n     3 Other differences between DLPOLY and DLPROTEIN  3 1 Non bonded interactions       s ke  3 2 Interpolation  spline or direct evaluation of the Van der Waals in   GOPACHIONS   10 4  as ea te oe Se a mej   Bee ae oe ee 5009 48 e  3 3 Refolding of molecules ssa N ea  3 4  OONSIKATMIS treatment 4 20 RO EE ee oksys ee a  3 5 Shifted Van der Waals interactions and long range corrections  3 6 Smoothing the Van der Waals interactions at the cut off         3 7 Hydrogen bond detection sf mm gc ee Rok 46008 BE BS  3 8 Short and long  OUTPUT files 1  0tma tanio eee be  3 9 I O format of trajectory data       iia wie 108 ew ee Ja  3 10 neludeileg 8225  eee ange Nee S
13. PA    Verbose mode in OUTPUT file  default     Silent mode in OUTPUT file    Three dumping files for trajectory  velocities and forces  Gromos sty  Hydrogen bond detection every n steps  Hbond is detected if   the donor acceptor distance is less than dist  A    and the hydrogen donor acceptor angle is less than ang degrees   One history file for dumping the MD run  DLPOLY original style    Activates the link cell method whenever the ratio between the cell c  Shift the Van der Waals potential at the cut off  Not shifted by def   Smoothed Van der Waals potential at the cut off in the   region rcut     f  lt r  lt  rcut  Unsmoothed by default    Uses the r interpolated or splined evaluation of the   non bonding potentials  Direct evaluation of VdW by default   Global refresh of velocities from a Maxwellian   every n steps  preferentially during equilibration     select SPME method for elecrostatics  with    a   Ewald convergence parameter          n   spline order    e 4 Et ee AI SEI MAAT ERE 10004 JN E    sitnam a8 atomic site name  unchanged   weight  12 0 atomic site mass  unchanged     chge f12 0 atomic site charge  unchanged     nrept id repeat counter  unchanged   iftz i5    frozen    atom  unchanged   igrp 15 neutral charge group number  unchanged     hprop a3 Hbond propensity  added     Table 3 2  Atomic sites specification in the FIELD file  3 13 Directives in the FIELD file    The only optional change in the FIELD file for DLPROTEIN is the specification  of the donors 
14. boxes    SPME uses a public domain FFT routine or  alternatively  calls to libraries  optimized on selected architectures  as Silicon Graphics and Cray machines  while  the user can extend the calls to other platforms  The use of public domain or  library calls to apply FFT is chosen in the compilation environment  as discussed  in the section on the compilation directives  The parallel implementation of SPME  is achieved by using the distributed routine    pccfft3d    to perform 3d FFT  present    on T3E and Silicon Graphics parallel architectures     2 4 1 Choosing the SPME variables    The precedure for choosing the SPME control variables resembles the procedure  usually applied in the standard Ewald sum method  e g  see the DLPOLY man   ual   The only difference relies in substituing the number of k vectors of the stan   dard Ewald sum with the number of grid points of the SPME real space method   Furthermore in the SPME another factor can be varied so to achieve optimal con   vergence  this is the order of the splines used in the method  It is advised to use  a well established and safe value of the spline order  say 6 or 8  and to focus on  a good choice of the SPME variables  since the spline order does not affect the  results if the spline order is large enough    In the original Ewald method the    ewald precision    directive was available  to find an optimal value of a and kmar  The same directive can be used in order to  find a good guess for a  but still a choice
15. c  2 27     where U ec is the reciprocal space term of the Ewald sum    I   amp  expl k  4a   Z     lt     ones ae   al     gt  gj exp    ik   r      2 28   2Vo   0  70 k j    as implemented in the ewald1 routine of the original DLPOLY code and it is  usually the main CPU eater in biosimulations  moreover scaling as N    with the  system size    Alternatively DLPROTEIN can use the smooth particle mesh Ewald  SPME   method  7  to compute the reciprocal part of the Ewald sum  The SPME method  does not compute Upec as a sum over k shells as for the original Ewald method   but it uses a three dimensional grid to apply 3d Fast Fourier Transforms  FFT   and obtain splined observables that reconstruct the requested energy term and  corresponding forces  The SPME is called by the routine ewaldlpme  a driver  for the original SPME routines    The SPME method requires as input parameters the order of the spline  and the number of grid points in the x   y  and z directions  These parameters  are specified in the CONTROL file as described in the following sections  Memory  requirements must be satisfied by changing the MAXN  MAXORDER and MAXT  parameters in the ewaldlpme routine  If memory is necessary  i e  for very dense  grids or high order splines  the program will write the memory requirements and    the code must be recompiled     26    The SPME method is currently available only for general triclinic boxes so  that it is forbidden for truncated octahedral and rhombic dodecahedral 
16. d third non bonding    forces are defined by introducing three interaction ranges    0 lt  Tij  lt  rcuti  rcutl  lt  ri   lt rcut2    rcut2 lt  rj  lt  cutoff  2 21     where for Van der Waals interaction cutoff is substituted by the variable rvdw    in the CONTROL file  The shells are handled via proper switching functions in    21    order to ensure continuity of forces at the shell borders  The switching functions  are polynomials of order three governed by the    healing length    parameter switch     The splitted propagator is of the form   exp  i  Ls   exp  iF 22   exp i3L1    exp ihL       exp iL1     exp if2 L2       exp if8L3     2 22     where A is the input time step and no  n   and no are chosen in order to have the    following time steps of increasing lengths    hy   noh  ho   nih    ninoh  hg   noh    non h     non Noh     2 23     When using this splitting within the VV scheme and with the previously reported  NVT and NPT dynamics  changes need to be applied with respect to the previous  LF scheme  The reader is referred to the cited literature for a detailed explanation  of technical points    Switching between single and multiple time step molecular dynamics is  achieved by changing the input parameters in the CONTROL file  see the mult  directive in the CONTROL file   For example setting    rcut1 0 0  rcut2 0 0  cutoff 10 0  rvdw 10 0    mult 111  2 24     22    is a typical setting for a single time step simulation whilst    rcut1 7 0  rcut2 9 0  cutoff 1
17. dence of the thermostat driving force  The    splitting is of the type    ae 2 E 1    1  J       2 472 Toni OX  h oO h 1 T o  S   a ee 2 11  exp   5057   E Tn  7      A        The effect of the Nos   Hoover propagator is given by the routine  A  prop_NH   exp io Lau  2 12     The same routine is applied for the NPT case with minor modifications     18    2 1 2 NPT ensemble    The NPT Nos   Hoover equations of motion are    t   P alr     Bo   m  D   F  x n p  1 T  E N E G  X TA       7     V  P     Peat   1   3NkpTonte s  V   3nV  2 13     where R  is the total center of mass of the system  The Trotter splitting is chosen    as  h h  exp  ihL    exp Gz exp  thLigam  XP  152   2 14   where         iL   4L am     10  3 ae  Ham 10H   n r Ro     Via  A     inn    X   Pa     1 T    i     if ed as  PoPa DE    7  7      Dx   3NkeToar   Dag 9    The NPT Nos   Hoover propagator is further symmetrically splitted in order to  deal with intrinsic momenta and coordinates dependence of the thermostat and  piston driving forces  The splitting is analogous to equation 2 11  The effect of    the NPT Nos   Hoover propagator is again given by the routine  eg  prop_NH   exp isbn  2 16     The NST ensemble eguations of motion  i e  the dynamics associated with  anisotropic volume fluctuations  are not yet implemented in the current release    For the Berendsen thermostat  the evolution propagator has been splitted  in an analogous way to the Nos   Hoover way  The routine that scales the velocities
18. e Rk eo BE ee eS Ge eae oy  3 11 T3E communication routine    lt   lt  kN 62 ee eee eee  3 12 Directives in the CONTROL file      lt  ss   3 13 Directives in the FIELD file     2588 K PEI TRE Kyra  3 14 Directives in the Makefile    aoaaa e    pole EXAMPlES    wed Sergel leretan e eat bon eRe K a     References    Overview    DLPROTEIN is a molecular dynamics package written by Simone Melchionna  and Stefano Cozzini in the framework of the Italian Institute for the Physics of  Matter  INFM   Network on    MD simulation of biosystems     DLPROTEIN is a  development of the original general purpose Molecular Dynamics code DLPOLY  written at Daresbury Lab  UK  by William Smith and Tim R Forester    The motivation underlying the development of DLPROTEIN has been to  specialize the original MD code to deal more specifically with biological molecules   with particular focus on proteins  and with high efficiency and low memory re   quirements for scalar architectures    DLPROTEIN has been realized with the contributions of several people  who have partecipated in the project in different percentages  Amonst others we  wish to thank Antonella Luise as a coauthor of dlgen and for designing the link  cell algorithm for generic periodic boundary conditions  Maddalena Venturoli as a  coauthor of dlgen and for many useful insights in the code  Marco Pierro  as the  author of the merge field utility  from the Physics Department of the University     La Sapienza    of Rome  A special thanks 
19. e refolding in the primary cell of periodic boundary condition setting  Re   folding means that molecules will be cut appear as cut in the primary cell  By  avoiding refolding routines dealing with constraints will strongly ameliorate in per   formance  Refolding of molecules  as used originally in DLPOLY  is activated at  the compilation level by the macro REFOLD  while by default the code will run  without any refolding    If refolding is absent the code will sew the molecules at step zero of the MD  run in the routine utaylor  When a molecular species is constituted by blocks not    connected by constraints or springs  refolding of molecules is mandatory     3 4 Constraints treatment    In order to have exactly the same trajectories on scalar and parallel platforms  i e   avoiding the different ways the shake algorithms is implemented in the sca_shake  and rdshake 1 routines  it is possible to force the code to use sca_shake also with    parallel runs  where all processors will process all the constraints in the system     30    This is achieved by specifing the macro SERIALSHAKE at the compilation level     3 5 Shifted Van der Waals interactions and long  range corrections    Better energy conservation is achieved when the Van der Waals potential is shifted  at the chosen cut off  The shift is activated through the ljshift potential directive  in the CONTROL file    The shift does not affect the forces but only the energy  Consequently the  NVE trajectory of a shifted and
20. ector connecting acceptor to donor    The specification of donors  polar hydrogens and acceptors in the system is    specified in the FIELD file  as explained in the following     3 8 Short and long OUTPUT files    When running biological molecules  i e  when using very large FIELD files  the  user may require that the OUTPUT file is shorter than the default value  A short  or long version of the OUTPUT file can be obtained by using the directives output  short and output long in the CONTROL file     3 9 I O format of trajectory data    A compact and formatted version of the dumped output trajectory is often useful  with biomolecules  A formatted GROMOS like version of the output data can  be generated by using the directive gromos dump in the CONTROL file  In  this case three different files  DUMPPOS dat  DUMPVEL dat  DUMPFOR dat   are used to dump positions  velocities and forces separately  This is the default  behaviour  When using the history dump directive the old style HISTORY file    is recovered     32    3 10 Include files    The main include file of DLPROTEIN is dlprotein inc that should be modified  for any change  in an analogous way to the dl_params inc file for DLPOLY  In  the utility directory the code parset dlprotein f has been added  analogous to  the parset utility for DLPOLY    A new include files has been added in DLPROTEIN that is used for declar   ing the arrays dimensions used in the SPME method  This file is called darden inc     where its analogous 
21. es  VDW params     and  SYSTEM    can be used by the user to check if the merging operation was    correctly achieved     15    Chapter 2    The molecular dynamics code    2 1 Integration Algorithms    In DLPROTEIN the algorithm that integrates in time the equations of motion is  of the form of the Velocity Verlet  VV  scheme whereas in DLPOLY the original  scheme was the Leap Frog  LF  scheme  In addition the VV scheme is applied  in the form proposed in ref   4  so to have  even in presence of non hamiltonian  dynamics  as it is customary for generating canonical or isobaric ensembles   time  reversible and simplectic integrators     Given the Hamiltonian equations of motion    r     I S     p    2 1     where velocities are related to momenta via the relation p   mv  the LF scheme    integrates the trajectory in time with the following updating scheme    r t h    r t   hv t h 2     v t   h 2    v t     h 2       F    2 2     together with the auxiliary equation to obtain velocities simultaneously to posi     16    tions       v t       v t     h 2    v t   h 2    2 3   The VV scheme  that for hamiltonian system produces exactly the same trajectory    than the LF one  reads    r t h    r t  ho t    E ro    v t h    v t    HIFO   F t h    2 4     The VV scheme can be casted in operatorial form by using the Trotter    factorization of the evolution operator  4     h h  exp  ihL ram    exp    r  exp  ihL   exp    r     hF    o hF     EN hF O Pees hF O 9  exp  i 2  exp  m 4 exp
22. for the T3E compilation is darden_t3e inc      3 11 T3E communication routine    A new version of the routines gdsum and gisum have been introduced to strongly    reduce communication times on the Cray T3E parallel architecture     33    3 12 Directives in the CONTROL file    In table 3 1 we report the directives in the CONTROL file that are changed or are  not standard in the original DLPOLY2 0 package    None of the added directives is mandatory  If not specified rcut1  rcut2   rvdw and the switching length are set to zero and the single time step scheme is    applied     34    directive     mult no n   n      rcut1 f    rcut2 f    cutoff f    rvdw f    switch f    output long  output short  gromos dump    hbond n dist ang    history dump   link cell   ljshift   ljswitch f  interpolate potentials    maxwell n    spme an Nz Ny Nz    meaning     multiple time steps cycles    Given the timestep h  the RESPA timesteps are given by   hy   noh  he   nihi  h3   noho   First cutoff  A  for C VdW interaction  region I of RESPA   Second cutoff  A  for C VdW interaction  region II of RESPA   Last cutoff  A  for C VdW interactions  region III of RESPA or  global interaction region with the single time step scheme   Maximum cutoff  A  for VdW interactions    VdW are globally computed within   the  0  rudw   swlen  sphere if rvdw  lt  cutof f   and within the regions I  II  III  otherwise    Healing length  A  for switching off the   non bonding interactions at the region borders for RES
23. h  column  a4  specifies the atom type name  coherently with the atomic names as  specified in the PS file   The fifth column    8 0  specifies the atomic charge  The  last column  f8 0   if present  specifies the repetition of the current atom data     The    nte    and    cte    block names as fictitious names and they need not to    10    be considered as new residue names  as they are needed only to specify data of the  terminal groups    Some specific patched can be added to the MBBT file  These patches can be  necessary if dlgen fails to build the full topology of complex molecules  The patches  are used to add or delete specific constraints  angles bendings  harmonic type   di   hedral torsions  cosine type   improper torsions  harmonic type  and bonds  har   monic types  potential terms from the final topology  The general format of each  patch should be in the form    add keyword num  list 1     n  par 1 m   in free format   or  del keyword num  list 1     n  par 1 m   in free format   where keyword can be constraint  angle  dihedral  improper  bond  num is  the number of patches specified subsequently  list 1     n  is the list of atomic  indices involved in the patch indexed as the atoms appearance in the block itself   par 1     m  is the list of parameters that need to be fixed  appearing in the same  order of the FIELD file  in the same units as speficied previously  An example  of patch can be found in the MBBT block specifying the HEME group  keyword       hem
24. he following actions    1  Read in the Brookhaven pdb file and the MBBT files     2  Generate the seguence of each block by merging the informations in    the pdb and MBBT files  edtgen routine      3  If hydrogen atoms are missing  add them with the proper chemical  geometry and connectivity  as specified in the MBBT database  hgen    routine    4  Read in the PS and FF dat files     5  Complete connectivity by adding the missing links  e g  link aminoacid    to aminoacid  complete the aromatic rings  and so on      6  Generate bond  angle and dihedral list  together with the Van der Waals    interactions  generic and I IV types      7  Dumps out the FIELD file     1 1 1 dlgen input files    dlgen uses the following input data     e Brookhaven pdb file  containing atomic and molecular names and atomic  configuration  This file must be edited and modified  as explained later on    e FF dat file  containing the topology title  constraints vs  harmonic springs    treatment and energy units  The constraints directives are      directive  meaning    constrain string where string can be equal to  all   all bonds are constrained  h   only bonds concerning hydrogens are constrained  none   all bonds are harmonic   specialbond n   n  ng     list of residues to treat harmonically     units string Output units as expressed in the FIELD file  Admitted    values for string are    eV        kJ        kcal    and     internal    for using dlpoly internal units     Moreover the user must in
25. is for Georges Destree  of the Universit    Libre de Bruxelles  who has managed several parallel fixes and new algorithms in  the code    During the development of DLPROTEIN collaboration and support from  Giovanni Ciccotti  University of Rome    La Sapienza      Mattia Falconi  Alessandro    Desideri  University of Rome    Tor Vergata     and Mauro Ferrario  University of    Modena  are kindly ackowledged  We thank Tom Darden for providing us with  the SPME original routines    The following manual is intended as an addendum to the DLPOLY 2 0  Manual  1   The reader is referred to the DLPOLY manual for general explanations  about the code philosophy and technical details whereas the present documentation    reports only modifications and addenda to the original DLPOLY package     Chapter 1    Building the molecular topology    Building the molecular topology of molecules is done by the utility build all   The  program build all takes care of building up in sequence a system composed of  one or more molecules  build all is a fortran program that calls the two different    utilities called dlgen and merge field         The three programs build all   dlgen   and merge field are provided with  a Makefile in the build topology directory  Once they are compiled  by simply  executing the    nake    command   these utilities can be run alltogether  by running  the driver build all   or singularly    dlgen converts a Protein Data Bank file  pdb file  into the files CONFIG  and FIELD 
26. mbol         indicates all possible atom types included in the quadruplet  On the  other hand    specific dihedrals    means the quadruplets explicitily indicated   Similarly    general    and    specific    Van der Waals interactions  for 1  4 inter   actions or not  refer to the parameter set used for all kind of atomic pairs and for    selected pairs respectively  The 1  4 parameters specify the interaction parameters    between pairs of atoms that are third neighbours in the molecular connectivity     1 2 merge field    This utility allows to merge different FIELD files into a single FIELD file  called  FIELD out  where each starting FIELD file must contain one molecular specie   Moreover each FIELD file must specify the Van der Waals parameters in    12 6     style    When running merge field   the user is requested to provide as standard  input the force field scheme  charmm or gromos   the name of each FIELD file   together with the relative MBBT file and the number of repetitions of each molec   ular specie  If the name of some atom in one FIELD file coincides with the name  in asecond FIELD files and those atoms have different interaction parameters  the  user is requested to change the name of atoms in the second FIELD file  This can    be done by the utility itself or by editing one of the FIELD files  The utility can    14    merge as many molecular species as needed by changing the mxmoltyps parame   ter in the merge field inc file and recompiling  The output fil
27. med     2 3 1 Memory optimization on scalar workstation    DLPROTEIN has routines that optimize memory handling by substituting a 2d     array array for storing interacting pairs into a 1d array  The method is suited    24    to run on scalar machines and a good speed up is achieved with respect to other  methods due to a better cache memory use    An extra factor 2 in memory is gained by using the  INTEGER 2    definition  for the interacting list  This is actually suited for number of atoms less than  32767  if this is not the case an error message is printed out and the code must be    recompiled     2 3 2 CPU optimization on scalar workstation    The actual version of DLPROTEIN handles the Link Cell method  2  for all pos    sible boundary conditions implemented in the code  A different algorithm than   in the original DLPOLY routine has been implemented  The routines that perform   LC scanning on scalar workstation are scalink_vlst_cube and scalink_respa_vlst_cube  for cubic and octahedral boxes  scalink vlst ortho and scalink respa vlst ortho    for general triclinic boxes     25    2 4 The Smooth Particle Mesh Ewald method    DLPOLY used the Ewald method to compute the electrostatic interactions when  using periodic boundary conditions  1   The Ewald method splits the Coulombic  interactions into two separate potentials in the form    1 X    On 1 aa Q er  Arm  Ue   er felarns        ys S aan  mse   TG      1 0   ATE  n lt j Trj ATE  molecules l lt m vT Tim          Ure
28. p  ihL   exp izl exp is Lou  2 19   where R and S are the operators that handle constraints on velocities and positions    respectively     20    In the NPT implementation the overall procedure is modified in order to  deal with intrinsic dependence of the pressure tensor on the constraining forces   as discussed in  4   In this case an iteration loop need to be applied in order to  converge the pressure estimate and piston update so to achieve proper integration  in time  Convergence of the piston updating is controlled by the routine checkexp    For the Berendsen isothermal   isobaric dynamics  the splitting of the prop   agator is rather crude  but it seems to produce a statistical behaviour rather similar    to the Berendsen implementation of the original DLPOLY package     2 2 RESPA implementation    A multiple time step implementation  generally known as the RESPA method  has  been applied in the framework of the VV scheme  4  and within a splitting of the    Hamiltonian well suited for biological systems  6   The propagator is in the form  exp ihL    exp ih L    L    Lo   L3    2 20     where L  takes into account positions shifts and updating of momenta according  to bond stretching and angle bending forces  L   shifts momenta due to dihedral  torsions and first shell non bonding interactions  L   shifts momenta due to second   shell non bonding interactions and reciprocal Ewald term  L3 shifts momenta due  to third shell non bonding interactions  6   First  second an
29. sert data as requested by the program in run time    at standard input     1 1 2 Editing the pdb file    Before running the dlgen program the pdb file must be edited and a few changes  must be applied  dlgen uses each building block specified in the MBBT database       as an independent unit so that in principle there is no difference when treating  a sequence of amino acids or a sequence of generic building blocks  In fact the    intrablock topology is specificied in the MBBT database  while dlgen uses the       geometry specified in the pdb file to construct the interblock topology     1  Terminal groups labels  Usually the peptidic sequence begins  or ends  with a NH3 group  The  pdb atomic label relative to the starting nitrogen atom must be changed    from the original pdb label into the new NTER label and eventually if    any hydrogen is specified  i e  the user does not require generation of  positions hydrogens from scratch  the hydrogen atoms must have the  labels HTn  with n 1  2  3  Hydrogen atoms attached to NTER must fol   low in order  In case the N terminal aminoacid is a Proline  the name  of the starting nitrogen should be left as N and the aminoacid name  should be changed from PRO to PRT    When the peptidic sequence has COO as a terminal group  the label  of the carbon atom must be changed into the CTER new label and the  oxygens must have the new labels OTn  with n 1  2  Oxygens attached  to CTER must follow in order    When having terminal PRO residues  
30. smoothed  Splined potential  Electrostatics   SPME    Scanning   all pairs basis    3  System composed of 256 SPC water molecules  200 steps   Ensemble   NPT  Nos   Hoover   Van der Waals interaction   shifted   smoothed  Direct evaluation  Electrostatics   Ewald sum    Scanning   all pairs basis    4  System composed of Myoglobin   1513 tip3p water molecules   9 Cl  counterions  20 steps   Ensemble   NPT  Nos   Hoover   Van der Waals interaction   shifted   smoothed  Direct evaluation  Electrostatics   SPME    Scanning   link cell method    39    In the    build topology    directory an example of use of the build_all utility  is provided  The example builds the topology of Myoglobin as used in the previous  test case  The utility build_all can be run by answering a few questions or by using  the prebuilt file with the command  build all  lt  input build all  gt  out build all  A number of files are produced as execution proceeds  The output file FIELD out  is the global topology of Myoblogin   solvent   counterions  The topology of  tip3p water and counterions are present as prebuilt FIELD files  The output files  FIELD 1 and CONFIG 1 are the partial topology and configuration for Myoglobin   The CONFIG 1 file can be used in conjunction with the DLPOLY utility water   add to fill the MD box with water     40    Bibliography     1           Smith  W   Forester  T R       The DIPoly 2 0 User Manual     T R Forester  and W Smith  CCLRC  Daresbury Laboratory  Daresbury  Warring
31. that contain respectively the topology and configuration of a single  molecule  Subsequently merge field merges each FIELD file produced by dlgen    eventually with other prebuilt FIELD files  into a global FIELD file  Theese utili   ties have been implemented thinking in terms of aminoacidic systems  e g proteins   as the main goal  and it is able to treat force fields of the type CHARMm22 and  GROMOS87  37c4 parameter set   Due to the general combination rules of  the CHARMm22 force field  dlgen works for proteins  proteins metallic groups   e g  heme group   phospholipids  sugars  and so on  i e  any kind of entry can be    safely added to the molecular database  Viceversa due to the more complex rules    adopted by the GROMOSST force field  e g  the way the dihedrals are defined for  sugars   the actual version of dlgen works only for proteins  eventually in presence  of metallic groups  The reader may ask the authors to obtain a non standard  version of dlgen to treat different systems within the GROMOS87 force field     An easy way to run the build all utility is to make the following steps      1  Link  or Copy  in your working directory the three executables pro   duced by the compilation  together with the database files  MBBT and  PS files  as explained in the following  needed by dlgen to run  In the  working directory you need to have also the PDB files to be converted    or any previously produced FIELD file the needs to be merged     2  Run the build all utility
32. the user must define from scratch  the complete terminal residue  PRO terminal group  in the MBBT file    as it is specified in the section about adding entries to the MBBT file       Non aminoacidic specification   In the standard pdb format  non aminoacidic atoms have as special  molecular label HETATM instead of ATOM  The user must respect this  labeling when specifying groups as prosthetic groups  such as heme  molecule or single metals that are specified as separate entities in the    MBBT file       Atom and block indexing  The pdb atomic indexing is ignored by dlgen   Viceversa the building  block  e g  residue  indexing is used to signal the change in the molec     ular building block  althought the order of appearance is unimportant     4  Protonation state and disulfide bridges  Residues that in the pdb file have the same names but intrinsic differ   ences  e g  different protonation states due to different pK s or disulfide  brigdes  must be named consistently with the database names in the  MBBT file so that they adhere with the corresponding database defi     nitions     In the following example  it is indicated how to change the head and tail specifi     cation of a peptide  Given the original pdb file     ATOM 1 N ALA 1 15 850 25 550 23 340  ATOM 2 HT1 ALA 1 15 520 25 140 24 200  ATOM 3 HT2 ALA 1 15 700 26 540 23 360  ATOM 4 HT3 ALA 1 15 360 25 150 22 570  ATOM 5 CA ALA 1 17 270 25 280 23 200  ATOM 1188 HZ3 LYH 128 31 830 21 930 16 050  ATOM 1189 C LYH 128 27 0
33. tials has been  restricted in order to deal only with potentials usually employed in biological sim   ulations  e g  the GROMOS and CHARMM22 force fields   This is the Coulomb  interaction  in the form bare 1 r form  shifted  reaction field or Ewald sum  and  the Van der Waals interaction  both in the 12 6 and lj form for the input FIELD  file     3 2 Interpolation  spline or direct evaluation of    the Van der Waals interactions    In order to increase performances DLPROTEIN can avoid the use of tabulated  potential when computing the non bonding forces  On the other hand splines can  be used as a third alternative  Higher order interpolation schemes  such as the 4     points and r sguare interpolation  as used in the original DLPOLY program  have    29    been eliminated  Interpolation  or spline  versus direct evaluation is managed in  the CONTROL file  The directive    interpolate potentials    is used to indicate the  use of an interpolation or a splining scheme  At the compilation level  when the  compilation macro    SPLINE    is present the splined potential will be used instead  of the 3 point interpolation  In the include file  specification of the grid points for  the spline is determined by the parameter mxspln  The ratio between the grid  points in a 3 point interpolation and spline potential can be up to 10 depending    on the desired accuracy     3 3 Refolding of molecules    An efficient way of reducing CPU time when dealing with molecules is to avoid  molecul
34. ton  WA4 4AD  England  1995      M P Allen e D J Tildesley     Computer simulation of liquids      Clarendon  Press  Oxford  1987     Ciccotti  G   and Ryckaert  J P   Comp Phys  Rep 4 1986   345    Martyna  J G   Tuckerman  M E   Tobias  D J   Klein  M L    Mol  Phys 87  1996   1117    Melchionna  S   Ciccotti  G   Holian  B L   Mol  Phys 78 1993   533  Procacci  P   Marchi  M   J  Chem  Phys 104 1996   3003    Essmann  U   Perera  L   Berkowitz  M L   Darden  T   Lee  H   Peder   sen  L G  J  Chem  Phys 103 1995   8577    41    
    
Download Pdf Manuals
 
 
    
Related Search
    
Related Contents
SH-R300 SH-R220 SH-WR80 SH-R160    manual de instruções do multímetro analógico ma-60  MIGGY 171 MIGGY 171 - Stel Welding Division  SPL31 User Manual 7-2014 V8  Important Information Guide WARNING CAUTION - Psn  Denon DCD-500AE CD Player User Manual  B - GE Healthcare Life Sciences  Manual de usuario - Domotica y Seguridad  平成15年横審第48号 引船あすか丸機関損傷事件 言渡年月日 平成16    Copyright © All rights reserved. 
   Failed to retrieve file