Home

User Manual - Duke University

image

Contents

1. resFormat Format of resonance assignments It can be CYANA format BMRB format or KEASY format resNameScheme Atom naming scheme It can be BMRB OLD BMRB NEW PDB NEW or PDB OLD resonance File name of resonance assignment list NOESY Format Format of NOESY cross peak list It can be XEASY format or NMRVIEW format 3D N15 NOESY File name of 3D 15N labeled NOESY cross peak list 3D C13 NOESY File name of 3D 13C labeled NOESY cross peak list isUnique Whether only those unique NOE assignments are outputted or all ambiguous NOE assignments are outputted isOutORFormat Whether multiple NOE assignments assigned to the same NOESY cross peak are outputted in OR format outNoeName File name of the output NOE table which is in the standard Xplor format The default atom name scheme used in the output NOE assignment table is PDB NEW CompareTwoNOETables CompareTwoNOETables input This command compares two NOE table and outputs the summary A typical input parameter file includes the following parameters Sequence File name of protein primary sequence NOEFormat Format of source NOE table It can be XPLOR format or CYANA format NOE Table File name of source NOE table for comparison RefNOEFormat Format of reference NOE table It can be XPLOR format or CYANA format Ref NOE Table File name of reference NOE table for comparison
2. IsOne20ne whether it is one to one packing or one to all packing namely packing one structure to all structure in an ensemble It is set to be 1 for one to one packing 0 for one to all packing SSES PDB names of two SSE fragments which are separated by outPackedStructures File name of the output packed structures only the common name of all structures in the ensemble NOE Table File name of NOE restraint table used for packing SSEs NOEFormat Format of NOE restraint table It can be XPLOR or CYANA format isSkipSymCheck Whether the symmetry checking step is inputted It can be 1 or 0 SymID Assigned symmetry ID i e 0 1 2 3 if previous isSkipSymCheck is set to be 1 isSkipOutEnsemble Whether the step of outputting the whole ensemble of packed is skipped If this step is skipped only the mean structure of all packed structures is outputted InputFileLocation Location of the input ensemble of SSE fragments in the all to one packing case addSideChains addSideChains input This commands places the side chains onto backbone using the NOE pattern matching technique A typical input parameter file includes the following parameters Sequence File name of protein primary sequence haErr hlErr cl3Err hnErr nErr Error windows in each dimension ppm resFormat Format of resonance assignments It can be CYANA format BMRB format or
3. BMRB OLD BMRB NEW PDB NEW or PDB OLD isWholeStructure Whether the whole structure i e including sidechains and backbone is used for pruning ambiguous NOEs It is set to be 1 if the original backbone PDB file is used for pruning ambiguous NOEs resFormat Format of resonance assignments It can be CYANA format BMRB format or KXEASY format resNameScheme Atom naming scheme It can be BMRB OLD BMRB NEW PDB NEW or PDB OLD resonance File name of resonance assignment list NOESY Format Format of NOESY cross peak list It can be XEASY format or NMRVIEW format 3D N15 NOESY File name of 3D 15N labeled NOESY cross peak list 3D C13 NOESY File name of 3D 13C labeled NOESY cross peak list noeLimit Distance upper limit used for pruning ambiguous NOE assignment The original distance bound calibrated if the value of noeLimit is negative isOutORFormat Whether multiple NOE assignments assigned to the same NOESY cross peak are outputted in OR format outNoeName File name of the output NOE table which is in the standard Xplor format The default atom name scheme used in the output NOE assignment table is PDB NEW IsOriginalUp In the output NOE table whether the original NOE upper bounds i e calibrated from NOESY peak intensity are displayed The calibrated distance is used if the parameter is set to be 1 If the parameter is 0 the distances from the st
4. See the GNU Lesser General Public License for more details Full licensing details including citation requirements for the various different modules of the software are found in the document license pdf enclosed with this package distribution RDC PANDA is specifically designed for automated NMR NOE assignment and protein structure determination package It provides a novel approach for computing the initial structure template for NOE assignment by exactly solving backbones from RDCs and systematically choosing rotamers based on NOE pattern matching RDC PANDA mainly consists of following three algorithmic modules 1 RDC ANALYTIC which computes orientations and conformations of SSE backbones Current version of RDC ANALYTIC is designed to compute protein global fold backbone using NH and CH RDCs in one alignment medium It determines the conformations and orientations of secondary structure elements SSEs by solving the RDC equations in closed form It applies a systematic search approach to compute the global optimal solution of each SSE fragment that best fits the RDC data The RDC ANALYTIC module has been implemented in a standalone package More details about configurations and input file formats of RDC ANALYTIC can be found in its user manual released with this package distribution 2 PACKER which packs SSE backbones using spare NOE restraints PACKER first extracts a set of sparse unambiguous NOE assignments using only chemical shift
5. KEASY format resNameScheme Atom naming scheme It can be BMRB OLD BMRB NEW PDB NEW or PDB OLD resonance File name of resonance assignment list NOESY Format Format of NOESY cross peak list It can be XEASY format or NMRVIEW format 3D N15 NOESY File name of 3D 15N labeled NOESY cross peak list 3D C13 NOESY File name of 3D 13C labeled NOESY cross peak list LocationBBEnsemble Location of backbone ensemble outStructureName Name of the output structure name including both backbones and side chains addRandomLoops addRandomLoops input This command simply merges the SSE PDB and loop PDB into a PDB that contains the whole structure It prepares the input PDB files to Xplor in the local minimization approach A typical input parameter file includes the following parameters LocationSSEEnsemble Location of the input SSE ensemble LoopPDBName File name of loop PDB outCompleteStructures File name of output structures HANA NOE Asg HANA NOE Asg input This command executes the HANA NOE assignment algorithm given the backbone solved from RDC EXACT A typical input parameter file includes the following parameters Sequence File name of protein primary sequence haErr hlErr cl3Err hnErr nErr Error windows in each dimension ppm Backbone PDB file name of the backbone PdbNameScheme Atom naming scheme of backbone PDB It can be
6. TE FE HE TE FE FE FE FE FE HE TE FE E TE FE HE EE EE ER ER EE E FE E EH AE E E E E E E E E E EE EE H Note For short loops we can directly obtain NOE assignments from the mean structures of the ensemble cd cd cp av xplor local min average all pdb xplor final cal extract the unique NOE assignments for long loops in residues 33 41 using only chemical shift information The result will be stored in xplor final cal ubiquitin noe loopsl O0 tbl java cp bin RdcPanda NOEAsgFromCS asg noe csl input LongLoopsNOEAsgl out extract the unique NOE assignments for long loops in residues 50 65 using only chemical shift information The result will be stored in xplor final cal ubiquitin noe loops2 O0 tbl java cp bin RdcPanda NOEAsgFromCS asg noe cs2 input LongLoopsNOEAsg2 0ut extract SSE fragments including short loops that is excluding two long loops java cp bin RdcPanda ReadPdbFragments readPdbFragments input readPdbFragments out java cp bin RdcPanda ReadPdbFragments readPdbFragments2 input readPdbFragments2 out fUse HANA to compute NOE assignments in SSE and short loop regions fThe result will be stored in xplor final cal ubiquitin noe sses tbl java cp bin RdcPanda HANA NOE Asg HANA NOE Asg SSE input gt DANA NOE ASG SSE out FEE AE E FE AE FE EFE AERE HE EH EH E iteration 1 for computing long loops in residues 33 41 and residues 50 65 cd SPWD xplor fin
7. ANDA 2 License Information The source header below must be included in any modification or extension of the source code of RDC PANDA Source Header RDC PANDA NOE Assignment and Protein Structure Determination Version 1 0 Copyright C 2001 2009 Bruce Donald Lab Duke University RDC PANDA is free software you can redistribute it and or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation either version 3 of the License or at your option any later version RDC PANDA is distributed in the hope that it will be useful but WITHOUT ANY WARRANTY without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE See the GNU Lesser General Public License for more details You should have received a copy of the GNU Lesser General Public License along with this library if not see http www gnu org licenses There are additional restrictions imposed on the use and distribution of this open source code including A this header must be included in any modification or extension of the code B you are required to cite our papers in any publications that use this code The citation for the various different modules of our software together with a complete list of requirements and restrictions are found in the document license pdf enclosed with this distribution Co
8. E ERE RE E E E E EE EH E E E EEE ERE 3 Use Packer to pack SSEs FE HE TE FE E TE FE FE HE FE FE HE FE FE HE TE FE HE TE FE E TE FE FE FE FE FE E HE EE EE EE EE HE HH HE EE E E E E E E E E E EE H After computing comformations and orientations of SSE bacbone fragments save the pdb of each SSE backbone fragment in the directory inputFiles Then run the following command for packing helix and beta sheet java cp bin RdcPanda SSEPacking packingSSEs input gt packingSSE out fThe ensemble of all packed structures will be stored in directory inputFiles packing HEHEHE TE FE FE HE EE EH EH EH HH HE TE FE HE HE EE HE EE EE EE EE HE HH E TE FE EE EE E E E E EE E H 4 Use HANA to place side chains HERE EEE EH EH EH EH EH HH FE E TE FE FE TE EE EERE ER ER E E EE EE EE E E HEE EE HE Run the following command line to place side chains onto all packed SSE backbone i e core structures java cp bin RdcPanda addSideChains addSideChains input gt addSideChains out The ensemble of outputted packed SSE backbones with computed side chains will be stored in directory inputFiles bbwsc HERE RHE HEHE RHEE EEE RHE HE EE FE FE HERE HE FE AE HEHE E FE E FE E FE HE HERE EEE H 5 Compute the loops FE HE AE FEAE FEAE FEAE FEAE FEAE FEE HE FE HEFE FE FE RHE HERE FE FE E HERE AE FE AE FE AE FE AE FE E FE AE HEHE HERE EEE H First random loops are added into the core structures i e packed SSE backbones with side ch
9. Location Location of the input ensemble directory outPackedStructures Name of the output packed structure only the common name of all structures in the enemble For exameple the name H2H1New will generate an ensemble of structures with file names H2H1Newl pdb H2H1New3 pdb H2H1New3 pdb etc resolCluster Resolution used in the clustering step 20 7 Examples Two examples including tests on ubiquitin and FF2 structures come with the distribution of RDC PANDA version 1 0 The input data parameter files and shell scripts for running these two examples can be found in sub directory experiments The details on running RDC ANALYTIC on these two proteins can be found in the RDC ANALYTIC manual Section 6 page 9 To run these two examples namely testing RDC PANDA on proteins ubiquitin and FF2 first clear the following directory and make them empty inputFiles Jxplor local min Jxplor final cal and then type chmod 700 test NAME Jtest NAME in which NAME can be ff2 ubq and eta for each testing case For example to test RdcPanda on ubq protein type chmod 700 test ubq nohup test_ubq The locations of intermediate results can be found in remark sections of the shell scripts It is recommended that different testing cases should run in different RDC PANDA working directories The following gives an example of the shell scripts for running the ubiquitin test Shell Script File test_ubq
10. RDC PANDA User Manual Version 1 0 Jianyang Michael Zeng and Bruce R Donald Copyright O 2001 2010 Bruce Donald Lab Duke University Contents I ug te MEM HT 3 PA License ruris TIRE T E a er ene 6 a Citation E OT E 7 EMI CHE ire EON TTL 7 5 System Cofi s rati Sese secorre rE rE EE EE EE E EEE E EEEE TAS EErEE TESS 9 6 RDC PANDA Commands RP ERER 11 ps cung p EER 2 B RDC PANDA CI2388 Summary iiie esencia p eo api tial hcec i emi a es Do pi ca eee DU IE ON Co n ad eub Uie 26 1 Introduction RDC PANDA RDC based SSE PAcking with NOEs for Structure Determination and NOE Assignment is a suite of programs for nuclear Overhauser effect NOE assignment and high resolution structure determination starting with a global fold calculated from exact solutions to the residual dipolar coupling RDC equations RDC PANDA is developed in the lab of Prof Bruce Donald at Duke University This user manual is for version 1 0 of the software RDC PANDA is free software and can be redistributed and or modified under the terms of the GNU Lesser General Public License as published by the Free Software Foundation either version 3 of the License or optionally any later version RDC PANDA is distributed in the hope that it will be useful but WITHOUT ANY WARRANTY without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE
11. ReadPDBEnergy ReadNoeEnergy input This command reads the energy values from Xplor PDB files and output the top structures with lowest energies A typical input parameter file includes the following parameters LocationSSEEnsemble Location of the input Xplor PDB ensemble IsCutOff Whehter a cutoff is used for selecting top ensemble of structure Cutoff The cutoff of energy if IsCutOff is 1 OutputNumber If the cutoff is not used the top number of structures with lowest energies rowPos colPos Position of energy terms in the REMARK section of Xplor PDB files starting from 0 isOutput Whether the same PDB with the remark is outputted OutName File name of output PDBs CheckNoeByEnsemble CheckNoeByEnsemble input This command uses the voting scheme to obtain the consensus NOE assignments which are consistent with a majority of the structure in the ensemble A typical input parameter file includes the following parameters Sequence File name of protein primary sequence InputPdbLocation Location of the input ensemble of PDBs used for pruning NOE assignments PdbNameScheme Atom naming scheme of backbone PDB It can be BMRB OLD BMRB NEW PDB NEW or PDB OLD NOEFormat Format of source NOE table It can be XPLOR format or CYANA format NoeAtomNamingScheme Atom naming scheme in the NOE table InputNoeTable File name of the input NOE table isMultiAssignment Whether th
12. ains using the following commands java cp bin RdcPanda addRandomLoops addRandomLoops input gt addRandomLoops out The ensemble of complete structures with random loops will be ouputted in directory xplor local min structure which will be used as initial templates for the local minimization approach in Xplor Then go to the directory xplor local min and run the following three Xplor scripts for the local minimization cd SPWD xplor local min xplor min inp min out 22 Sxplor dgsa inp gt dgsa out xplor refine inp gt refine out Ensemble of final complete structures both SSEs and loops after the local fminimization approach will be stored in xplor local min final 5 1 Use following Xplor NIH script to evaluate the well packed structures WPSs Sxplor py calEnergy py gt computeWPS out compute average structures for ensemble of packed structures computed by PACKER the average structure will be store in xplor local_min average_all pdb Sxplor lt average_all inp gt average_all out compute average structures for ensemble of well packed structures the average structure will be store in xplor local_min average_wps pdb Sxplor lt average_wps inp gt average_wps out FE HE TE FE HE HEH EE EH EH HE FE HERE FE HE TE FE E TE RE TE ER ER ER HE HE EE E E E E E E EE BE EE HE 6 Compute the long loop in residues 50 65 FE HE TE FE FE HE FE FE HE TE FE E TE FE HE
13. al_cal 23 cp av ubiquitin noe loopsl1 0 tbl ubiquitin noe loopsl tbl cp av ubiquitin noe loops2 0 tbl ubiquitin noe loops2 tbl fempty intermediate directories rm rf pdb rm rf accept rm rf top Sxplor lt ubq loops fix inp gt compute long loops 1 out cp av pdb top TES4SASASATEAEATSTET iteration 2 for computing long loops cd cd refine those unique NOE assignments in long loop regions other NOEs are unchanged java cp bin RdcPanda CheckNoeByEnsemb CheckNoeByEnsemble loopsl input gt checkLoopNoeByEnsemble 1l out java cp bin RdcPanda CheckNoeByEnsemb CheckNoeByEnsemble loops2 input checkLoopNoeByEnsemble 2 out cd SPWD xplor final cal fempty intermediate directories rm rf pdb rm rf accept rm rf top Sxplor ubq loops fix inp gt compute long loops 2 out move all 10 structures to xplor final cal top cp av pdb top compute the inital structure template for final structure calculation Sxplor average top fix inp gt average top fix2 out Go to RdcPanda working directory ed cd HEHEHE HH HH HH HEHE HE HEH iteration 3 for computing long loops java cp bin RdcPanda HANA NOE Asg HANA NOE Asg fix input gt HANA NOE ASG fix out cd SPWD xplor final cal cp av ubiquitin noe tbl ubiquitin noe 0 tbl empty intermediate direct
14. b Bruce Donald EE Eu X O OXP Duke University XX kK qp RDCs Department of Computer Science EK 1 7 Levine Science Research Center LSRC ZS RK pa Durham SS Ce ail el erg o NC 27708 0129 USA K CAM LC CI ud kk kk k k kk E J Zeng J Boyles C Tripathy L Wang A Yan P Zhou and B R Donald ui RON High Resolution Protein Structure Determination Starting with a Global AX RU Fold Calculated from Exact Solutions to the RDC Equations J Biom NMR ae SH 45 3 265 281 2009 FE Ck Ck ck ck kk ck Ck ck Ck kk Ck Sk ck Ck Sk ck Ck ck KK KK KKK KKK KKK ck kk KK KKK ck Ck ck ck kk KKK KKK ck kk Ck ck ck kk ck kk ck kk ko kk Ck kk ko kk kk ck ko ko KK KKK KK RDC PANDA Once the program is started the desired functions can be executed through the command line The details of commands and corresponding parameter files will be described next The templates of command lines can be found in doc command template txt The structure determination through RDC Panda is generally divided into steps 1 RDC ANALYTIC is used to compute the SSE backbones from RDCs 2 PACKER is invoked to pack SSEs 3 HANA is used to compute the rotamers based on NOE pattern matching technique The loops are computed through the local minimization approach The NOE assignments are computed by HANA 4 Xplor is called to calculate the structures based on the NOE assignment table computed by HANA How to compile the progra
15. bin sh HHEHHHHEEHEEEESESE EHH TESTSTAXHETAXHSESETXSHEEOAHSHERAHAHUHES EHE 1 initialization AE AE aE aE aT aE aE aE FE aE aE HE aE aE aE aE aE aE HE EE FE FE AE AE AE AE AE aE AE AE AE aE FE aE E AE aE aE aE aE HE aE aE aE 1 1 set the XPLOR NIH path xplor usr project dlab Users Software xplor NIH 64 xplor nih 2 16 0 bin xplor 1 2 Empty the inputfile directory dto be safe it would be better to empty these directories manually rm rf SPWD inputFiles rm rf SPWD xplor 1 3 Copy the input files from experiments directory 21 cp av PWD experiments ubq inputFiles SPWD inputFiles 1 4 Copy the Xplor scripts to the Xplor working directory cp av SPWD experiments ubq xplor SPWD xplor FE FEE HE E REE E FEAE EE RHEE FE FE RHEE EEE RHEE EEE EERE EEE HERE EEE HERE EEF 2 Run RDC Analytic program to compute the SSE backbones from RDCs HEHE HEHEHE HE FE HE HEHE FE FE FE FE FE EEE FE FE EE EEA E FE E FE E TE HEE EEE HEE EEF go to RDC ANALYTIC directory and compute SSE backbones cd RdcAnalytic java analytic RDCAnalytic RDC ANALYTIC out copy the PDB files from RDC ANALYTIC to the RDC PANDA input file directory cp av helixl pdb RdcPanda inputFiles cp av sheetl pdb RdcPanda inputFiles go to RDC PANDA directory cd RdcPanda FE HE TE FE E TE FE FE HE FE FE EH EH EH HH FE AE EEE EE E
16. bond vectors 5 the TALOS table of dihedral angle ranges from the chemical shift analysis optional 6 the rotamer library RDC PANDA has been used to solve the new structures of the FF Domain 2 of human transcription elongation factor CA150 RNA polymerase II C terminal domain interacting protein FF2 The new NMR structures have been deposited into Protein Data Bank PDB ID 2KIQ Stage 1 CH amp NH RDCs RDC ANALYTIC C C amp NC RDCs optional Orientations amp conformations of SSE backbones Packing algorithm PACKER Resonance assignments JP jal NOESY peak list Packed SSE backbones core structure HANA b e NOESY peak list Rotamer library 39 Side chain placement L BONS assignments Initial NOE assignment Lu Core structure with side chains Filtration of ambiguous NOE assignments i Unambiguous NOE restraints i Local minimization approach fixing core structure as a rigid body Low resolution structure Filtration of ambiguous NOE assignments Ambiguous NOE assignments NOESY peak list Resonance assignments NOE assignments XPLOR XPLOR NIH Calculated structures Prune violated NOE assignments Refined NOE assignments XPLOR XPLOR NIH High resolution structures Figure 1 Flow chart of RDC P
17. e following parameters Sequence File name of protein primary sequence chFile File name of CH RDCs nhFile File name of NH RDCs cocaFile File name of CaC RDCs This parameter is set to be null if CaC RDCs are not available conFile File name of NC RDCs This parameter is set to be null if NC RDCs are not available WtCoCA wtCoN Weighting factor between CH NH RDCs and CaC and NC RDCs in the scoring function used in systematic search for computing the dihedral angle solutions Syy Szz Diagonal elements of Saupe order matrix of alignment tensor SseBoundaries Residue boundaries for all beta strands The start and end residues should be separated by 5 PrePdbName The PDB file name of previous backbone fragment This is used for computing a long helix when the helix is divided into two parts This parameter is set to be null for computing a single fragment hBonds File name of hydrogen bonds used for packing bete strands OutPdbName File name of output PDB file name of computed backbone TALOS File name of the TALOS angle table This command is not longer under the maintenance This updated version of this function can be found in the latest version of RDC ANALYTIC SSEPacking packingSSEs input This command line packs SSEs using the PACKER algorithm A typical input parameter file includes the following parameters Sequence File name of protein primary sequence
18. e input checkNoeByEnsemble_2 out cd SPWD xplor final_cal fempty intermediate directories ri EE pdb rm rf accept tm rf top sxplor lt ubgq l inp gt final call ont Go to RdcPanda workding directory cd cd move all 10 structures to xplor final cal top java cp bin RdcPanda ReadPDBEnergy ReadNoeEnergy input ReadEnergy final 2 out cd SPWD xplor final cal Sxplor average top 2 inp gt average top 2 out Go to RdcPanda workding directory cd ca 8 RDC PANDA Class Summary This section provides a summary of the RDC PANDA classes which will be useful for users to modify and extend the RDC PANDA source code More details about the definitions of the RDC PANDA classes and methods in the java source code can be found in sub directory doc javadoc in html format e Assign This class provides functions that involve NOE assignment such as initial NOE assignment using only chemical shift information and prune ambiguous NOE assignments using given the structural information etc e BackNoe This class provides functions that involve back computed NOEs e Cartesian This class defines Cartesian coordinates for individual atom in protein structure e Const This class defines various global constants for the entire program 26 Dipolar This class provides data structures processing experimental RDC data EigenvalueDecomposition This class is extended from t
19. e input NOE table is in the multiple assignment format noeLimit Upper limit for pruning ambiguous NOEs When it is negative the original upper bound calibrated from peak intensity is used NoeCutOff Percentage threshold of the structures in the ensemble for checking the consistency of NOE assignments Suppose the NoeCutOff is 0 6 If more than 60 of structures in the ensemble are consistent with an NOE assignment this NOE assignment is outputted Otherwise it is pruned OutNOETable File name of new NOE table after deleting those violated NOEs isOutVioNOEs Whether those violated NOEs are outputted as the remarks isOutMultiAsgFormat Whether new NOE assignments are outputted in the multiple NOE Assignment format NoeStatistics NoeStatistics input This command gives the statistical summary of an NOE assignment table A typical input parameter file includes the following parameters Sequence File name of protein primary sequence NOEFormat Format of source NOE table It can be XPLOR format or CYANA format NoeAtomNamingScheme Atom naming scheme in the NOE table InputNoeTable File name of the input NOE table isMultiAssignment Whether the input NOE table is in the multiple assignment format mergeNClusterAll mergeNClusterAll input This command performs the clustering step over all packed structures in a single directory A typical input parameter file includes the following parameters InputFile
20. factor between CH NH RDCs and CaC and NC RDCs in the scoring function used in systematic search for computing the dihedral angle solutions Syy Szz Diagonal elements of Saupe order matrix of alignment tensor ResBounds Residue boundaries of the helix The start and end residues should be separated by PrePdbName The PDB file name of previous backbone fragment This is used for computing a long helix when the helix is divided into two parts This parameter is set to be null for computing a single fragment OutPdbName File name of output PDB file name of computed backbone TALOS File name of the TALOS angle table This command is not longer under the maintenance This updated version of this function can be found in the latest version of RDC ANALYTIC RdcExactHelixWOAT RdcExactHelixWOAT input This command line computes the helix backbone without requiring the alignment tensor information The input parameter file is similar to RdcExactHelix input except the inputs of diagonal elements of Saupe matrix are not required This command is not longer under the maintenance This updated version of this function can be found in the latest version of RDC ANALYTIC RdcExactSheet RdcExactSheet input This command is used to compute the beta sheet backbone from RDCs using the RDC EXACT algorithm CH and NH RDCs are required as input files while CaC and NC RDCs are optional A typical input parameter file includes th
21. he Jama package which provides the methods for the eigenvalue decomposition of a matrix HICS This class provides data structures that involve chemical shifts Hbond This class defines data structures related to h bonds Hdist This class provides data structures related to back computed NOE distance restraints IDof2aryStructure This class provides functions for processing files with secondary structure element information Maths This class is modified from the Jama numeric package which provides some math operations that are used in the program Matrix This class is modified from the Jama numeric package which provides some matrix operations that are used in the program Model This class provides functions that compute backbone from RDC EXACT algorithm ModelRdc This class provides methods for computing all the backbone dihedral phi psi angles of an alpha helix or a beta strand based on exact solution and a systematic search Noe This class provides data structures and functions for processing NOE assignment Noesy This class provides data structures and functions for processing NOESY spectra Pdb This class provides data structures and functions for processing and generating PDB files and extracting backbone angles etc PdbRdc This class provides various SVD methods for best fitting RDCs and a structural fragment and grid search method for computing the three Euler angles PhiPsi This class provides various method
22. information It then applies a 3D grid search to find all discrete translations that satisfy the sparse NOE distance restraints When packing each pair of SSE backbones all four fold symmetries of SSE orientations are also taken into account 3 HANA HAusdorff based NOE Assignment which uses the SSE backbones to place side chains and assign NOEs HANA first computes a set of initial NOE assignments by considering all pairs of protons that are possibly assigned to an NOE cross peak if the resonances of corresponding atoms fall within a tolerance window Then a Hausdorff based pattern matching technique is employed to deduce similarity between experimental and back computed NOE spectra for each rotamer from a statistically diverse library and drive the selection of optimal position specific rotamers for filtering ambiguous NOE assignments In addition to above three modules a local minimization approach is used to compute loops and refine side chain conformations by fixing the core structure as a rigid body while allowing movement of loops and side chains Figure 1 shows the flow chart of RDC PANDA The input data to RDC PANDA include 1 the primary sequence of the protein 2 the 3D NOE peak list from both 15N and 13C edited spectra 3 the resonance assignment list including both backbone and side chain resonance assignments 4 the RDC data including CH and NH RDCs and the RDCs of other bond vectors optional such as CA C and NC
23. le of an input parameter file ceading protein sequence sequence UBQ seq format of resonance assignments CYANA for the CYANA format BMRB for BMRB format XEASY for XEASY format resFormat CYANA atome naming scheme in the resonance list resNameScheme BMRB NEW eg HB1 HB2 for GLU etc name of resonance assignment file resonance ubc cyana prot convert txt 10 reading NOESY peak lists 3D N15 NOESY ubc n15 noe peaks 3D C13 NOESY ubc_c13_noe peaks All input parameter files are stored in directory inputFiles System Files All system files of RDC PANDA are stored in sub directory system This sub directory includes the rotamer library system parameters e g location of the input files and BMRB statistical information 6 RDC PANDA Commands RdcExactHelix RdcExactHelix input This command computes the helix backbone from RDCs using the RDC EXACT algorithm CH and NH RDC Ss are required as input files while CaC and NC RDCs are optional A typical input parameter file includes the following parameters Sequence File name of protein primary sequence chFile File name of CH RDCs nhFile File name of NH RDCs cocaFile File name of CaC RDCs This parameter is set to be null if CaC RDCs are not available conFile File name of NC RDCs This parameter is set to be null if NC RDCs are not available wtCoCA wtCoN Weighting
24. m optional 1 Install Java Development Kit JDK The newest version of JRE can be downloaded from http java sun com javase downloads 2 Modify the Makefile under the RDC PANDA working directory as needed In particular specify your working directory and the paths to your Java compiler and resources 3 Under the RDC PANDA working directory type make 5 System Configurations File Organization The directory structure of RDC PANDA is organized as follows Jexperiments This directory contains data and input parameter files used for the experiments with proteins FF2 and ubiquitin Jbin This directory contains the java executable files src This directory contains the java source codes inputFiles This directory contains input data and parameter files JoutputFiles This directory is used as the location where output files are written to Jdoc This directory contains the documentations of the program system This directory contains the system configuration files The directory path of input data and parameter files can be set in system system parameters txt Jxplor This directory is used for running Xplor scripts and storing corresponding results Required NMR Data and Formats RDCs CH and NH RDCs are required Other RDCs such as CaC and NC RDCs are optional Format residue index RDC value unscaled experimental error 0 0 In RDC ANALYTIC RDCs are in standard Xplor format Resona
25. nce assignments The resonance assignment list should contain both backbone and side chain resonance assignments Format XEASY format CYANA format or BMRB deposition format Sequence The primary sequence of the protein Format residue name residue index NOESY cross peaks 3D or 4D DN or PC edited NOESY peak list Format XEASY format or NMRVIEW format TALOS dihedral angle restraints Optional The TALOS table provides interval restraints of dihedral angles estimated from chemical chemical shift analysis Format residue index residue name Phi or Psi average value error bar Atom Naming Schemes In RDC Panda the following four atom naming schemes in the input files are supported PDB NEW eg HA1 HA2 Gly PDB OLD eg 1HA 2HA Gly BMRB NEW eg HA2 HA3 Gly BMRB OLD eg 2HA 3HA Gly In the program all atom names are first converted to the PDB NEW naming scheme Input Parameter File Format This section describes the different commands and the corresponding required parameters Each command line consists of command and the file name of input parameter file The command and the parameter file name must be separated by space In the input parameter file lines starting with are parsed as comments Parameter names are single words parameter values follow the corresponding parameter name on the same line and are separated by Each parameter line ends with the symbol The following gives an examp
26. ngs In Proceedings of the 2004 IEEE Computational Systems Bioinformatics Conference CSB 04 Stanford CA pages 319 330 2004 4 Installation System Requirements To use this software Java Runtime Environment from Sun Microsystems JRE must be installed The newest version of JRE can be downloaded from http java sun com javase downloads How to install the program 1 Install Java Runtime Environment from Sun Microsystems JRE The newest version of JRE can be downloaded from http java sun com javase downloads 2 Create a working directory Use gunzip and tar commands to decompress Rdc Analytic_Panda_V1 0_Jan_2010 tgz and extract all files in the working directory 3 Install Xplor Xplor NIH The newest version of Xplor Xplor NIH can be download from http nmr cit nih gov xplor nih How to run the program To run the RDC Panda software go to the RDC PANDA working directory which contains the bin sub directory and use the following command java cp bin RdcPanda Or alternatively use the following commands chmod 755 RdcPanda RdcPanda If the program runs successfully the following txt interface should be given Ck ck ck ck ck ck Ck ck ck kk Ck Sk ck Ck ck Ck ck Ck KKK KKK KKK ck Ck ck ck ck KKK KKK KKK KK KKK KKK KKK ck kk ck kk Ck kk kk ck kk Sk kk ck kk Sk kc k ko kc kc kckockokok ax rMr rMr RDC PANDA Version 1 0 EK XE CND 2ND Contact Info ae EK P xMx yM
27. ntact Info Bruce R Donald Duke University Department of Computer Science Levine Science Research Center LSRC Durham NC 27708 0129 USA e mail www cs duke edu brd signature of Bruce Donald 01 December 2009 Bruce R Donald Professor of Computer Science and Biochemistry 3 Citation Requirements You are required to cite our papers in any publications that use this code The primary citation corresponding to this software is 1 The papers that can be cited based on or related to this software are listed below 1 Jianyang Zeng Jeffrey Boyles Chittaranjan Tripathy Lincong Wang Anthony Yan Pei Zhou and Bruce Randall Donald High resolution protein structure determination starting with a global fold calculated from exact solutions to the RDC equations Journal of Biomolecular NMR 45 3 265 281 2009 2 Bruce R Donald and Jeffrey Martin Automated NMR Assignment and Protein Structure Determination using Sparse Dipolar Coupling Constraints Progress in Nuclear Magnetic Resonance Spectroscopy 55 2 101 127 2009 3 Lincong Wang Ramgopal R Mettu and Bruce R Donald A Polynomial Time Algorithm for De Novo Protein Backbone Structure Determination from NMR Data Journal of Computational Biology 13 7 1276 1288 2006 4 Lincong Wang and Bruce Randall Donald Analysis of a Systematic Search Based Algorithm for Determining Protein Backbone Structure from a Minimal Number of Residual Dipolar Coupli
28. ories rm rf pdb 24 rm rf accept rm rf top Sxplor lt ubg loops fix2 inp gt compute long loops 3 out move all 10 structures to xplor final cal top cp av pdb top compute the inital structure template for final structure calculation xplor average top fix inp average top fix3 out Go to RdcPanda workding directory cd cd TRE UEERERHHEUEREREREREREREREREREREUEERERRUEHERETSSSERdHESESESAHSTETIHTETSSA3HEAqdHHSHSH 7 Final structure calculation FEAE AE aE AE AE aE aE E FE aE AE AE AE E aE aE AE HE a HE aE HE aa aa E E aaa aE aaa HEE aaa FE FE H HEHEHE HEH EH EH ERE ERE ERE E E Round 1 Use HANA to compute the NOE assignments java cp bin RdcPanda DANA NOE Asg HANA NOE Asg fix input gt DANA NOE ASG 1l out cd SPWD xplor final cal cp av ubiquitin noe tbl ubiquitin noe 0 tbl fempty intermediate directories rm rf pdb rm rf accept rm rf top 9xplor lt ubq l inp gt final cal 1 0 Ut Go to RdcPanda workding directory cd ca move all 10 structures to xplor final cal top java cp bin RdcPanda ReadPDBEnergy ReadNoeEnergy input gt ReadEnergy final 1 out cd SPWD xplor final cal Sxplor lt average top l inp gt average top 1l out Go to RdcPanda workding directory cd ca TEXAS UEHERE RE RE HERE ERE RE ERE EHF Round 2 25 java cp bin RdcPanda CheckNoeByEnsemble CheckNoeByEnsembl
29. ructure after combining side chains and backbone are used as the NOE upper bounds IsCheck Whether checking the NOE assignment results by comparing them with the reference structure refPdb File name of the reference structure PDB when ischeck is 1 refNameScheme Atom naming scheme for the reference structure It can be BMRB OLD BMRB NEW PDB NEW or PDB OLD CalAlignmentTensor calAlignmentTensor input This command computes the alignment tensor given CH and NH RDCs and PDB Although CaC and NC RDCs can be inputted into the program only CH and NH RDCs are used for computing alignment tensor However the RMSD values between experimental and back computed RDCs for CaC NC CH and NH RDCs are all reported A typical input parameter file includes the following parameters 16 chFile File name of CH RDCs nhFile File name of NH RDCs cocaFile File name of CaC RDCs This parameter is set to be null if CaC RDCs are not available conFile File name of NC RDCs This parameter is set to be null if NC RDCs are not available InputPdbName File name of input PDB NOEAsgFromCS asg noe cs input This command assigns NOEs based on only chemical shift information A typical input parameter file includes the following parameters Sequence File name of protein primary sequence haErr Klee Gl3Err hnErr AETI Error windows in each dimension ppm
30. s for exact solution and systematic search including the computation of rotation to the first peptide plane the DFS search and solver of the quartic equation and the computation phi and psi angles from two vectors in consecutive planes etc Rotamer This class provides data structures and functions related to rotamers RotaPattern This class implements data structures and functions that deal with the rotamer properties such as NOE pattern for each rotamer 27 SingularValueDecomposition This class provides functions for SVD operations from Jama package SSEPacking This class provides functions for packing SSE fragments vdw This class provides methods for computing the vdw energy term 28

Download Pdf Manuals

image

Related Search

Related Contents

  Hitachi CP-WUX645N data projector  Xerox ColorQube 8880ADN, A4, Printer  600 SERIES USER MANUAL  XR10 SPOT user manual rel 1.0 eng  CD690/695 Portuguese user manual  「やる気と能力を引き出す」NLP研修  ATTENTION! Kit Type: HUGHES  Manual de Instrucciones PC95  

Copyright © All rights reserved.
Failed to retrieve file