Home
CzeekD Manual
Contents
1. Figure 10 b The number of bonding points as well as the number of R groups must also be defined Setting the R groups can be done using the R1 to R4 buttons in Marvin Sketch Structures that are created using other tools can also imported from files and by copying and pasting from other applications Please refer to the following link for details regarding the use of Marvin Sketch http www chemaxon com marvin help sketch sketch index html In our example the unit at the middle will be set to the structure of Quinazoline as shown in the Figure below File Edit View Insert Atom Bond Structuri Calculation Tools Help 0 o 8 b Figure 10 Drawing the core structure 3 3 2 Selection of fragment library There are two ways to select the desired fragment library 1 by right clicking on a unit then clicking on Select library item on the context menu and 2 by selecting a unit by clicking on it once then clicking on the Select library button This will open a window that will allow the selection of one or more desired fragment library from a list Several fragment libraries are available depending on the structure and intended usage Please refer to Table 2 for details The number of fragments contained within the selected fragment library can still be narrowed down by using one or more methods shown below e According to suppliers this is possible only with those libraries containing supplier information e According to qu
2. the distribution map is presented as the plot of molecular weight at the X axis Weight against the score at the Y axis Changing the parameters plotted in any of the axes can be done by clicking on any of the axis labels then selecting the desired parameters in a popup window Figure 19 a In this example CGBVS scores of GAK X axis and EGFR Y axis are plotted CzeekD GEG cos ee I I e n I DO gt eee lt HH Figure 18 Results analysis window Figure 19 a Axis display parameters and b Distribution map Kyoto Constella Technologies 13 CzeekD Manual Clicking and dragging on an area within the distribution map will select the compound present in that area Clicking the List view button will show the compound structures with its corresponding scores as well as other parameters Figure 20 Selecting a range by specifying values or selecting all the plotted compounds can be performed by clicking on the Range or Select all buttons respectively Canceling the selected range can be done by double clicking within the distribution map The maximum number of compounds that can be selected is 50 000 Czeek N a cartins are E ian te nce lane crm nmin dv tener o A dwn 1 Click and drag to select an area t_ Double clicking within the map will reset selection Nn Click List view button Click on the Numeric button to do numeric search Hebe Figure 20 Selecting a
3. of fragment display in the distribution map the X axis is initially set to freq that indicates the frequency in which the fragment is utilized in the generated compounds The Y axis is set to mean that indicates the average score of compound groups possessing that fragment In addition fragments that have available supplier data also show the name and ID of the supplier That is true in the case of building block library being offered by Namiki Shoji Namiki BB Library Kyoto Constella Technologies 17 CzeekD Manual Czeek D Ce oo rece coi rm try opto du r tved cea tee 7 eat os D n Eu 4 04 m 03 a i t TE FF E Ti E ARTIF TEEST EEr M X l hei f gt Wium Le rt o a Saen anni e sz mR ti i4 na Cee soot ness ua QUEST I WA ete RI LI owe le Co Pena a L 1 PEM sana di 21212424 lt lt lt T lt P E i i it Figure 26 Displaying structures of selected fragments 4 5 Refining compound search Normally CzeekD generates several hundred thousands of compounds during calculation While it depends on the selected target and synthesis frame settings the number of high scoring compounds can reach up to tens of thousands In our example calculation using EGFR and GAK alone there are 21256 compounds scoring 0 8 or greater and checking that list for compounds worthy of further examination would be very difficult if not im
4. setting window In this example the structure of Gefitinib is drawn while EGFR and GAK were set as the target and off target proteins respectively Figure 34 Kyoto Constella Technologies ae CzeekD Manual Czeek D SS jetta o Peoficdm Lint Search 2 Ure Camo Catan Carp Ara range Tama striitare Dese ve PIT e Gt bore ne 3 Draw structure ti sam asce 4 erat nre lt IM Inuteihe g taf A H tnt dii n y srein Me Figure 34 Calculation based on structure entered using the Marvin Sketch plugin The result of the above calculation is shown in Figure 35 una 44624404 COP COORD ddt 447 2 dit OEP det ont aay l Puen lone gee SS ieee Figure 35 Scoring result for Gefitinib 5 2 2 Uploading structure files If calculation of a set of compounds is desired the compound structures must be put in a file SD SMILES or mol format and then uploaded into the system Up to 100 compound structures can be uploaded The first line of the descriptors for all compounds must start with the compound name and unexpected results will be obtained if that information is missing A unit without any bonding point must be placed first in the synthesis frame area The unit is then selected before clicking on the Select library button A window similar to that in Figure 36 will open Click on the Browse button to select the file for uploading The number of compound structures
5. showing the information for the current user Figure 32 b Information can be modified by clicking the Edit account details button This will open a dialog box that allows the user to change the user name e mail address and password Figure 33 Click on OK to close the window and effect the changes or Cancel to cancel current operation tam informa bor it Welcome to CzeekD di User informatica Gau Click to check user information a Click to edit user information b Figure 32 Displaying user information Edit user information x Usemame E mail taylo k ct jp Password Re enter password Cancel Figure 33 Editing user information 5 2 Compound scoring by CGBVS With CzeekD scoring of a predefined structure against one or more protein targets can be performed and the structure can be specified in 2 ways 1 drawing or uploading the desired structure using the Marvin Sketch plugin or 2 by uploading SD or SMILES file within the fragment library selection window 5 2 1 Using the Marvin Sketch plugin First remove the default frame by clicking on the Clear frame button From the selection of units select the one without any bonding points then click on the frame area to place the unit Select the unit by clicking on it then click on the Draw structure button to activate the Marvin Sketch plugin Draw the desired structure then click OK to return to frame
6. Kyoto Constella Technologies Co Ltd CzeekD Manual December 2014 CzeekD Manual TABLE OF CONTENTS L PAREN ONC NO ON PEPPE CREO RR RO RR ORE 1 Lil IR it MM SC a E EE EE TE E suacocsusancdmacseass l L2 Logn andoperaional Pollyanna rain l 2 Creation of Compounds in CzeekD iscsiccsiticioiaricinicnaneaniana ia lcisivadinonreiniadbisasibciilanibviatiaialasegibciniazzbinri ani 2 21 COMPOUNAEFANOMSCHEIO iridata arreca ode 2 2 2 Retrosynthetic Combinatorial Analysis Procedure RECAP 2 2 3 Particle Swarm Optimization PSO Algorithm ceccsssceccccceeceseseeeeeccceeeecaaeseecesseeeeceesessaaeeeeees 3 3 Operation Specifying Calculation Settings and Actual Calculation vcrescrrrrrecereeeeneeecececeseeeeeeeere 4 dl BOI 5 gt IIS dae 7 AI RR A 7 E Bi nia TN 9 3 5 Regarding Combinations Chemical Space and Estimated Time i 11 3 6 Checking the status of a calculation JOD 12 A Calculation Results Analysis ccccscssscesteessssvscessavetassacseveseisenusseasecensnesvenvevesstaseantesseuensessenveseaveasioanesessassonens 13 Al Digione 13 42 ompound HL ope 110 9 innata TES 14 43 Nu mericandpariialstr tur SEARCH E a N 15 4 4 Compound display and fragment display I7 460 Liliana 21 S I I RCA RR RE RR RESA 22 5 1 Modifying user information changing passWord 22 52 COMPOLAd scornet by COBY S serseri
7. antitative physical properties number of heavy atoms molecular weight and ClogP e According to partial structure can be done by fragment structure and structure similarity search Kyoto Constella Technologies 8 CzeekD Manual After the conditions above have been set clicking on the Search button will start the search for fragments fitting the set conditions Clicking on the Show Fragment List button will reveal the selected fragments on the right side of the window Figure 11 a Figure 11 Fragment library selection window The example in this section sets the left unit to RECAP non aro ring 1 and the right unit to RECAP aro ring 1 fragment libraries RECAP non aro ring ge3 absent Table 2 Fragment library list RECAP chain ge6 Has 6 or more rotatable bonds but no ring structures 3 4 Bond settings In CzeekD the RECAP rule is applied to bonding between fragment units and the fragment that conform to these rule are automatically selected However if a particular structure is set for a unit the RECAP rule that must govern the binding must be explicitly defined The procedure is explained below As shown in Figure 12 a the bond to be set is selected by clicking on it once after which the Set bond rules button 1s clicked This will reveal a pull down menu from which the appropriate R group is selected A window similar to that in Figure 12 b and 13 will then be shown At the upper pa
8. atus is usually shown in the following order Queued Pre Processing Running shown as percentage Post Processing Normal end Results of complete jobs can be displayed and further analyzed A currently running job can be aborted by clicking on the STOP button which is displayed in place of the trash bin icon in case of a currently running job Figure 17 Job list window The trash bin icon appears in completed jobs as well as in aborted jobs Clicking on the trash bin icon will delete the job Deleting a job cannot be undone so it is advisable to download your results before doing anything else so as not to accidentally lose important data Kyoto Constella Technologies 12 CzeekD Manual 4 Calculation Results Analysis In this section display of results and analysis feature will be described We have used EGFR and GAK as example targets since section and we will continue to use the same in this section Double clicking on a job or clicking on the red icon at the rightmost column of the job list window will display the results of a calculation job 4 1 Distribution map A sample of the results analysis window is shown in Figure 18 below The left hand side shows the data manipulation area while the right hand side displays the selected compound structures with corresponding scores and parameters Compounds shown in the distribution map range in colors from blue to nearly read indicating the density number of compounds Initially
9. based fragment search refinement In compound display mode select an area by clicking and dragging within the distribution map Click on the List view button to show the compound structures then select one or more compounds by clicking on the checkboxes Figure 29 On the Frame panel click on the left or right fragment units The distribution map will change to show only the fragments present in the selected compounds Figure 30 Kyoto Constella Technologies 19 CzeekD Manual CzeekD GER cs Wweicome to czasy be etnea Peek ieee Oe Soe ipo Seed eworciws Bhs wget ee Dapy ptor Hee emrt beas Magnaat d rect were Camera COR PAPMAN GAK IRAAN Ac regno Wage 9 a 44 d Osy Ubi 1 4 TPA 377 2 440 374 000 FIT 73328294 Wo 0 Oe 01039 Magna te Wm Sewa Oa a Linee teint at Setociot 15467 Figure 29 Selecting compounds Before search refinement After search refinement 104 91370 9 4927 80836 04870 8964 77 48 1312 1147696406 154 919320 9 492 80030 84870 8984 71148 1352 14 761640 8 fr g treg J seoce TP numere Jo ne teamed Ri aR St TI Cl Aste be ua n di f 0 Figure 30 Change in distribution map after compound based search refinement For both cases above the results of the refined search can still be narrowed down further by setting different conditions and by clicking the checkboxes to select fragments or compounds Clicking the Save button located below the distributio
10. contained in the file will be shown at the lower left hand area of the window Clicking the Show Fragment List button to show the structures at the right Kyoto Constella Technologies 23 CzeekD Manual side of the window Click the OK button to set the compounds for calculation is Uptoed Me moU SOSMES r t List Select file EGFRI smy write ID HA Ma oy sopies Sabi homn Eaa y 377063 ji da i Library List z ded a von 1 Click to select file to upload N 2377104 HO nr 417387 O OS Supplier List ven Lul Soori Fragmants 3122 ALA g Ci Pa 412200 i ri UL r Refine Search E w 2 Click to check structures Fy uw lt eavy Morn lt hi F p x Ostructuce timer Y lag O 1 Ox Cancel Sa pou as x ami fy Figure 36 Uploading structure via the fragment library selection window Results for the above calculation is shown in Figure 37 below CzeekD ZEMmDE ca racerne te Conway dine eteri GEER H tz 423 U tr deed vespri eee es Oo Suor avanzo ae eee x Py mora gt n v artn er n O del Rod DOC 4 Bag 3 ETTI AIR dll ARA TARSIA dr al r cay nr me ae 2 Lt e na o Titas Seat o ee TI Te bc 5 3 Assigning s ry a 4 s i i us fa Figure 37 Calculation results for compound structures uploaded from file everal core structures Several core structures can be assigned using the Marvin Sketch drawing tool As shown in Figure 38 it can be done by
11. culation is currently being shared with everybody within the same group Clicking on the same icon will un share the data Only the owner of the calculation results can share and un share the data Unchecking the Show shared data checkbox will reset the job list to display only the currently logged in user s data Checking the box again will show currently shared data by other group members Kyoto Constella Technologies ZO CzeekD Manual Aga mew jod update Search Share Lise lob name Gian Chart Display options Vi Show shared data icon desonpbon ne Eno bme Running time Expected compton bme One ar on p test kinase egfr TETY 2014 8 21 11 30 2014 8 21 19 15 07 44 57 2014 5 21 18 53 ww ti teat kinase egfr i mary 407 4 6 22 10 42 eVl4i igs 14 54 04 12 32 2014 G 22 14 27 E G Taviofdmund EGFR GAK Testi Normas end 2034 32 16 10 25 2014 12 16 18 13 07 38 13 2014 12 16 18 09 G amp G g Go T Todo Edmund Geftinb EGFR GAK Test Norma end 2014 12 18 4 12 15 1 VOl o G T Normaond 2014 12 22 16 58 2014 12 22 10 39 00 91 07 G O Normans 2034 32 22 17134 2014 12 22 17 35 00 00 57 O gG O 2 Changes to red color indicating shared status 1 Click here to share data for this job Figure 39 Sharing calculation data 5 5 Expanding the range of compounds that can be generated The synthesis frame can be designed in such way that the number of virtual compounds the chemical space that can be generated will increase As shown in Figu
12. d by the following equation X Gia Fy In the above equation x is a vector that represents the properties of the fragment while u represents the unit Since a virtual compound is created by combining all the fragment units in the synthesis frame the position vector X can be represented by all the fragment vectors x connected in series Although the basic equation for PSO indicates that the position vectors are continuously changing with regards to de novo design the structure of the compound must be obtained from the value of the position vector X The approach that is used to realize that is to select all the fragments with vectors whose properties are very much alike with those of x that constitutes X and subsequently bind them to create the virtual compounds If several proteins are selected an overall score will be calculated Score calculation is represented by the following equation s f P positive i 1 P negative In the equation above n represents the number of proteins S represents the score of the protein and if negative that is off target protein is selected S is calculated using 1 P 3 Operation Specifying Calculation Settings and Actual Calculation In this section de novo design settings will be explained using Kinase as an example target The target proteins are EGFR and GAK Compounds that are active against EGFR while at the same time inactive against GAK will be Kyoto Const
13. drawin g several structures within Marvin Sketch Figure 38 a Structures not connected together by Kyoto Constella Technologies 24 CzeekD Manual bonds are recognized as independent structures Figure 38 b There are two important points to remember when using this method number of R groups should be the same for all the structures 2 R group symbols should be the same for all the structures For the second condition above if 2 R groups are present the R1 and R2 symbols must be used for all the structures and not R1 R2 for one then R3 R4 or R5 R6 and so on for the succeeding structures Failure to follow the conditions above will lead to unexpected results Please note the system does not give a warning if you commit such errors and will accept the invalid settings and continue on to calculation aon Marvin Sketch G hites denovo czeek jp creekd Ma File Edit View Insert Atom Bond Structure Calculation Tools Help vr 2 a cy t A b Figure 38 Drawing more than one structure within the Marvin Sketch plugin 5 4 Sharing calculation results Users belonging to the same group defined during creation of accounts are allowed to share their calculation results with each other This can be easily done by clicking on the purple icon containing white callout symbols Figure 39 This will make the gray button located at the leftmost column to turn to red color indicating that the results for that particularly cal
14. ed not to drastically increase the number of iterations If a much broader search range is required performing the same calculation several times is a better option You can however make changes to the settings when the need arises 2point Figure 42 Optimization algorithm settings Kyoto Constella Technologies 28
15. elected and these proteins will be shown on the lower right hand panel Figure 7 In the case of Figure 7 EGFR will be the target and GAK will be the off target Kyoto Constella Technologies 6 CzeekD Manual 3 2 Synthesis frame settings Setting the required conditions for the creation of virtual compounds is performed in the synthesis frame panel Operation for the required settings shown below introduced in the section 2 will be explained in this section set number and configuration of units topology 2 specify structures and or fragment libraries to be used 3 select rules that would govern binding between units Immediately after logging in to CzeekD the settings window will be shown Figure 6 wherein a synthesis frame with default topology 3 units arranged horizontally is initially set In our example this synthesis frame can be used without changing the topology But if the topology has to be modified the default synthesis frame must be deleted first by clicking on the Clear frame button Creating a new synthesis frame can be performed by first clicking on any of the desired unit located on the right side area of the Add units label upper left hand area of the panel Figure 8 Clicking on the frame area will create the selected unit After placing the desired units connect the units by clicking and dragging between the connection points of each unit There are 5 types of units that can be used in CzeekD each
16. ella Technologies 4 CzeekD Manual calculated searched for this particular example As shown in Figure 5 compounds will be designed having the Quinazoline part of Gefitinib as the core structure with both ends serving as fragment attachment sites Attachment site 2 Attachment site 1 Gefitinib core structure Figure 5 Compound design scheme 3 1 Target selection There are 3 general settings that must be performed by the user a select target protein calculation model b specify synthesis frame settings and c set calculation parameters The last is optional and default settings will be assumed if not set by the user The window below will be shown immediately right after logging in to CzeekD On the left side of the window is a panel that allows the user to select the target protein family and the actual target proteins itself The biggest panel located at the right side of the window allows the setting of the synthesis frame Below is where calculation conditions can be set and is also where the Start job button is located CzeekD GE mie o caan rr ES Men TE we U O w wa ape of ime toe rar Ln Seta fe Lia all t det te Figure 6 Settings window In selecting the target protein the CGBVS calculation model should be selected first Figure 7 Models can be selected after clicking on the dropdown list There are 6 standard models to choose from Table 1 Once a model is selected by clicking
17. est solution is the unique feature of PSO The general equation for the PSO algorithm is shown below Each particle has its own position X and speed V vectors and based on the equation searches the chemical space for the best solution Xa X V V1 WV tne Care rcz Came X In the equation above Monee refers to the best solution found by each particle personal best X gbest refers to the best solution found by the entire swarm global best w refers to the inertia constant r r indicates random digits from 0 to 1 while c4 c gt refers to the size of particles attracted to the best solution The position and speed vectors are constantly recalculated and changes in their values lead the swarm nearer to the best solution There have been other equations that are proposed to represent PSO but in our case we are adopting the equation above If you would apply PSO to de novo design a particle represents a compound On the other hand a particle s position vector X is a value that represents the properties of the compound represented by the particle The fitness function score that should be optimized is one that is independent of X and in the case of CzeekD the CGBVS Chemical Genomics Based Virtual Screening score is adopted This score is also the same score calculated when using our other product CzeekS when several proteins are specified the average score is presented The position vector X is represente
18. having from 0 to 4 bonding points Figure 9 If a unit having no bonding point is selected a simple scoring screening calculation will be performed refer to p 22 for details There is a limit of 9 units that can be used to compose a synthesis frame Furthermore one or more units that have one or more unattached binding points are not allowed CzeekD GG eee uz Select unit M tom dere Place unit ona Connect units Figure 8 Selecting arranging and connecting units SE Figure 9 5 types of units according to the number of bonding points 3 3 Unit settings Unit properties can be set in two ways 1 drawing a structure through the use of a plugin integrated within CzeekD and 2 selecting a fragment library 3 3 1 Drawing a structure With this method a structure fragment is set by using a drawing tool integrated within CzeekD As shown in Kyoto Constella Technologies CzeekD Manual Figure 10 a it can be done by right clicking on one of the units then clicking on the Draw structure item on the context menu The Marvin Sketch Figure 10 c plugin will appear in a popup window that would allow the user to draw the desired structure Another way to activate the drawing tool is to select a unit by clicking on it once then clicking on the Draw structure button Clicking on the OK button after drawing the structure will close the plugin and return to the CzeekD window already showing the structure drawn
19. ments referred to in CzeekD are basically structures created after cutting compounds using the RECAP method There are actually 11 basic rules that are followed in the RECAP method and these are indicated in Figure Kyoto Constella Technologies 2 CzeekD Manual 3 below 6 Ce NNI S O Za ii A Nd pm h Pm O Di Pr Am we 7 Th e i opt we omy k a a Al N 8 Ul 1 amide 2 ester 3 amine 4 urea Y ar b Ly 4 eu Eu a illa Np N Np HEX buy n 4 oe Gd H Mal Bk pP I Sig af N Sa a i lea Dal 3 La Pu Pu ty VA N 5 ether 6 olefin 7 quaternary nirogen 8 aromatic N carbon K K Vi O l e ai I P U i i Il H s i n Fy Yb N x Lu ur a Fd gt l r I 0 O S 0 5 9 lactam N carbon 10 aromatic carbon aromatic carbon 11 sulphonamide Figure 3 RECAP rules In CzeekD we employ heavy metal symbols such as rare earth elements as markers for cutting and binding sites These metals are not normally used in the creation of drugs therefore are used only as markers Markers differ based on the cutting rule and the type of bond created after cutting We also refer to these heavy metal symbols as attachments For example if an amide bond is cut La is attached to the acyl side and Ce is attached to the amine side please refer to Figure 3 for details In CzeekD fragment libraries refer to the collection of structure
20. n area within the distribution map and displaying the structures 4 2 Compound list operations Putting the mouse over a structure shown on the compound list will magnify the structure and clicking on the structure itself will open a structure viewer displaying the clicked structure Expanding the width of the structure column will also result to the magnification of the displayed structures Unchecking the Show structures checkbox will hide the compound structures while clicking the Show fragments checkbox will show the fragment structures Clicking on the column name of any parameter will reveal a dialog box displaying sorting and filtering options Clicking and dragging on the column name of any parameter will change the order of the columns Figure 21 Compounds can be selected by clicking on the checkbox to the left of the compound structures Selected compounds can then be exported in SDF or CSV format by clicking on the File export button Kyoto Constella Technologies 14 CzeekD Manual a b Figure 21 Compound list panel 4 3 Numeric and partial structure search Display filtering has been described in the previous section but actually in the first place only 50 000 compounds can be displayed However compounds to be displayed can be narrowed down by filling in search conditions for the distribution map There are two ways that this can be done first by using numerical parameters and second by partial st
21. n map will save the conditions and results of the current search Up to 3 searches can be saved and you can switch between them by clicking on the buttons x1 x2 or x3 buttons located above the upper right hand corner of the distribution map Saving more than 3 searches will Kyoto Constella Technologies 20 CzeekD Manual erase the least recent search Clicking on the Reset map button will reset the distribution map to display the default and will erase all the saved searches 4 6 File export Data of the selected compounds can be downloaded by clicking on the File export button and selecting the appropriate file format within the dialog box that appears Figure 31 Data can be saved in either SD or CSV file format Structures saved in CSV files are in SMILES format Clicking on the OK button allow the user to specify a filename before saving the file File Format Please select export file format Export SD file Export CSV file OK Cancel Figure 31 Selecting file format before saving data file Kyoto Constella Technologies 21 CzeekD Manual 5 Other Features This section describes the other functions of the CzeekD system not described in the previous sections 5 1 Modifying user information changing password Registered name e mail address and passwords can be changed by first clicking on the User information button Figure 32 a The display will switch to a new window
22. n scheme Virtual compounds are created according to the conditions that are defined in a synthesis frame Figure 1 The synthesis frame is generally composed of units and bonds Units represent areas where fragments are placed whereas bonds are parts where binding rules between fragments can be defined Appropriate fragments are placed into all the units and results to creation of a virtual compound after binding of the fragments involved Structures that do not conform to the defined conditions of the synthesis are not created The following should be defined in the synthesis frame number and configuration of units topology 2 structures and or fragment libraries to be used rules that would govern binding between units Lb QAL se O Figure 1 Example of a synthesis frame There are 5 types of units Figure 2 that are available in CzeekD each differing only in the number of bonds possible In creating the synthesis frame units are selected and arranged then are bound according to the number of bonds available for each unit Units with dangling bonds are not allowed Up to 9 units can be set in a synthesis frame Figure 2 5 types of units 2 2 Retrosynthetic Combinatorial Analysis Procedure RECAP Fragments libraries provided in CzeekD are created based on the RECAP rules Using the RECAP method a particular compound is fragmented cut based on the rules by which it was originally chemically synthesized Frag
23. on it available target proteins will then be shown in a tabulated list Figure 7 The Ligands Kyoto Constella Technologies D CzeekD Manual column shows the number of compounds used in machine learning for each target protein and the Compound Clusters column indicates the number of structural variations of those compounds Please keep in mind that in CGBVS higher number of structural variations between compound used in machine learning equates to higher prediction accuracy Standard model for Nuclear receptors Standard model for Proteases Standard model for Transporters Table 1 List of standard models Optimizations PoC Ligands 255158 Coren eens Figure 7 Protein selection panel Figure 7 shows an example in which the Kinase standard was selected A protein can be selected by clicking on the checkbox located on the left side of the protein ID Clicking on the box once will show A and a second click will show W A third click will return it to an empty checkbox A indicates that active compounds against the selected protein will be designed while V indicates that the compounds to be designed should not be active against the selected protein In short A and V indicate the target and off target respectively The protein IDs are hyperlinks to the UniProt database and clicking on them will open the web browser and show the page containing information about the clicked protein Up to 10 proteins can be s
24. orm de novo design calculation mati TE Optimization PSO Advanced Car oll Al EGFR E Estimated time 43 00 Ligands 755128 srt Combinations L S410 Figure 15 Summary of calculation settings Combinations refers to the total number of compounds that will be created based on the synthesis frame settings It is basically shown as a high value exponential notation Ligands refer to the default number of compounds actually included in the calculation This value depends on the number of particles and number of iterations set within the PSO Advanced settings Estimated time indicates the approximate time it will take for the calculation to finish and is meant only as a guide l1 Kyoto Constella Technologies CzeekD Manual After carefully checking the settings the calculation can be commenced by clicking on the Start job button A dialog box just like in Figure 16 will be shown and a job name for the calculation can be entered As much as possible use an easy to comprehend job name Job Name Input Job name Figure 16 Job name input dialog box 3 6 Checking the calculation job Status of calculation jobs can be checked by clicking on the Job List Window button Figure 17 Check the Status column to see whether your job is currently running or have already completed The status of a currently running job is usually shown as the percentage of the completed calculation Job st
25. possible In cases like these refining compound search is advisable and is a feature included in CzeekD 4 5 1 Refining fragment based compound search This is done by first switching to fragment display mode As shown in Figure 27 select an area in the distribution map and display the list at the right panel Select one or more fragments from the list by clicking on the checkbox to the left of each structure All the fragments in the list can be selected by clicking on the checkbox located at the left of structure column heading Next go to the Frame panel at the lower left hand side of the window then click on the Select entire frame button The distribution will then change into something similar to that shown in Figure 28 In this example the fragments having the highest average score mean gt 0 7 are selected so the center of the distribution map would be composed of those fragments Kyoto Constella Technologies 18 CzeekD Manual CzeekD Gaga Co ca Vulcan tp Cot ioe vee GER Figure 27 Selecting fragments Before search refinement After search refinement PIG 377 P4484 624 SOOCR ETT 763 2 820 4006 8O0 B O88 TRO 372244064524 66006 GT 7532 629 4 965 6 93181338 Meg Wepre 1E 1 beso i t mos Oe ty Nurmenc Sruct Cen be savod 2 Namor sn gt can te save Manat map DI Range See si CI Range Selected Figure 28 Change in distribution map after fragment based search refinement 4 5 2 Compound
26. re 40 adding a unit to the synthesis frame will expand the calculation at the R1 side In this case the chemical space increased from 1 83 x 10 to 1 96 x 10 and so is the variation of the chemical structures However increasing the number of units will also increase the time required to perform the calculation LD RECAP_ROM aro_nN Vi i ie RECAP_aro_nng_1 Total 53058 ko PP Tota 194590 Add a unit to the left side of the core structure I RECAP chain les ie tlie Tie RECAP aro nng 1 Fotal 15985 i f a Tota 194580 RECAP nori aro_mn Total 53088 Figure 40 Extending the synthesis frame 5 6 Using custom made fragments As described in section 5 2 structures can be uploaded by using the SD MILES mol file upload feature The same method can be used to upload user designed custom fragments However this type of usage requires understanding of the RECAP rules Kyoto Constella Technologies 26 CzeekD Manual As discussed in section 2 fragments provided in CzeekD are labeled with heavy metal element symbols that serve as markers for proper binding based on RECAP rules These markers are generally removed during the binding process The custom fragments must therefore contain these same markers before being uploaded into the CzeekD system Additionally the first line of information for every compound present in the SD file should contain the fragment name written with a maximum of 10 letters This i
27. re the search PIG 377 7 Atha 624 SHOR G77 7683 8220 4 OF ba rie 1 humen gt Struct Sane ani P ai N e VEO TI kange Select ai Selected 0 ate j o ty 72 2 446 4624 60006 877 7532 8254 06 6 981 B10S8 Wi ceghe P one ce Snxt di Sove E cas be sane Roat mag Lance Sasa 3 Selected Search After the search Figure 23 Transformation of the distribution map after partial structure search Kyoto Constella Technologies 16 CzeekD Manual Figure 24 Resulting compound list after partial structure search 4 4 Compound display and fragment display The Frame panel located at the lower left hand area of the results analysis window displays the synthesis frame Figure 25 At this area clicking on any of the fragments then on the Select entire frame button toggles between compound list display and fragment list display in the Compound List panel Click on any unit to display fragments Click to display compounds Figure 25 The Frame panel Figure 26 displays the fragment structures with the corresponding parameters after selecting the rightmost fragment unit within the synthesis fame then selecting an area within the distribution map In CzeekD the binding point in a fragment structure is designated by an atom of a heavy element not generally used in the creation of drugs Parameters displayed are values from which the effects of the heavy atoms were removed In the case
28. rr E S 22 5 3 Assigning several core Structures tira 24 54A Spanair 25 5 5 Expanding the range of compounds that can be generated 26 5 0 Using c stom made TAC rali 26 5 7 AdvanCed CaleulaviOn Parameters srana a a a a k a aa 2 Kyoto Constella Technologies 1 CzeekD Manual Trademarks All the company and product names appearing in this manual are trademarks or registered trademarks of the respective companies Furthermore trademarks are not appended to all the software and product names described in this manual 2012 Kyoto Constella Technologies Co Ltd Reproduction in whole or in part of this document is strictly prohibited Kyoto Constella Technologies 11 CzeekD Manual 1 Introduction 1 1 Regarding use of CzeekD This manual explains how a computational chemistry personnel can utilize CzeekD with particular emphasis to medicinal chemists Starting from the introduction of the calculation method involved in CzeekD description of operations from calculation up to analysis of results will be described while at the same time showing relevant CzeekD windows It is assumed that CzeekD will be used in a Windows environment and explanations in this manual are based on that assumption CzeekD is a Client server type of web application offered as a service that designs and analyzes new compound structures based on CGBVS Chemical Genomics Based Virtual Screening prediction score Computational servers are main
29. rt of the window the previously drawn structure will be shown with the selected R group as shaded 11 types of bonding rules are shown in Figure 14 and Kyoto Constella Technologies 9 CzeekD Manual with the exception of olefins and aromatic carbons variations are shown in pairs One or more bond types can be selected asa unite O 2 Click on Set bond rules button ton Setect Sbrary Draw stricture Delete unt Edit bomds Set bond rules St bond for IRL Set bond for R2 ar mm RECAP _non aro_rin Li a n RECAP_aro_nng_1 Total 53058 k Pi Total 194560 r n T Select bond a Figure 12 Bond rule settings window Select bonding rule D Gord nse description Nond types Bora cer ka AN e Aio amne z SN A i gin dita cune e ANY de OK Gancqui Figure 13 Setting bond rules for R2 Kyoto Constella Technologies 10 CzeekD Manual m sa amico i ii al Figure 14 Possible settings based on RECAP rules gn 2 lactam N carbon suiphonamide 3 5 Regarding Combinations Chemical Space and Estimated Time The settings required to start a de novo design Job is almost complete at this point Clicking on the accept changes button will make the settings take effect and if there are no errors encountered will give values for the Combinations equivalent to size of chemical space and Estimated Time approximate time required to perf
30. ructure search Clicking on the Numeric button will open a dialog box that allows entry of numerical values that be will be used for the numeric search Figure 22 In this example scores of 0 5 and above and molecular weights of 500 or less were used as numeric search parameters Executing the search changes the display of the distribution map depending on the conditions set in the numeric search parameters Compound search based on partial structures can be performed by clicking on the Struct button then clicking within a dialog box that appears afterwards This opens the Marvin Sketch plugin that allows entry of the desired structure Clicking OK after drawing the structure in Marvin Sketch returns you to the dialog box that also allows you to enter an optional lower limit for similarity with the structure previously entered The example in Figure 23 searches for compounds containing the Thiophene group Similarly as in quantitative search the distribution map changes depending on the partial structure search parameters After partial structure search compounds can be selected within the distribution map and the structures displayed in the Compound List panel Figure 24 Kyoto Constella Technologies 15 Before the search CzeekD Manual After the search S347 7377 SE SOR 4 84 498864602 479 6 500 fiat porto aa a de saved Selected D Figure 22 Transformation of the distribution map after numeric search Befo
31. s required to prevent unexpected results As an example benzoic acid is set as the fragment as shown in Figure 41 Since it is a carboxylic acid based on RECAP rules the bond used should be the amide bond However in CzeekD both the amide bond and the ester bond can be used so they are presented separately In Figure 41 in case of amide bond the OH group is replaced with LA On the other hand if the ester bond is to be used the OH group is replaced with Pr If both bonds are to be used during the calculation both fragments with the appropriate labels should be included in the upload file c is fac Neon uf Table 3 Heavy metal marker pairs that allow binding O If using amide bond La O on O O If using ester bond Pr Figure 41 Selecting the heavy metal markers 5 7 Advanced calculation parameters Clicking on the Advanced button in the settings windows will open a popup window similar to that shown in Kyoto Constella Technologies Al CzeekD Manual Figure 42 The upper left hand area of the window allows the selection of the optimization algorithm PSO or GA to be used in the calculation CzeekD uses PSO by default so in this case just keep the PSO radio button checked The default number of iterations is set at 2000 but can be set higher if a more refined chemical space search is required However increasing the number of iterations will also increase the amount of time required for calculation It is recommend
32. s with attachments and are classified according to the number of attachments Fragment binding also follow the RECAP rules unless otherwise specified That is binding based only on the above combinations of heavy metal markers is allowed For example amide bonding is only allowed with fragments having La and Ce attachments In Figure 4 below combinations that are not allowed in RECAP like Lu and Am below do not result to binding and therefore to the formation of new compound structures Using the RECAP rule to govern the binding of fragments results to creation of virtual compounds with high probability of synthetic accessibility A UL SIIT SC Ay Figure 4 Example of a fragment combination not conforming to the RECAP 2 3 Particle Swarm Optimization PSO algorithm This section describes the process by which appropriate fragments are arranged in each unit The use of PSO algorithm in CzeekD allows selection of fragments for binding such that the capabilities of the scoring function is maximized Kyoto Constella Technologies CzeekD Manual PSO is an optimization technique that mimics the movement of insect swarms and fish schools that are seeking food or safety It has been found to have better performance than genetic algorithm GA Exchange of information between each particle in a swarm regarding the best solution each has found on its own leading to the determination of the overall b
33. tained and managed at Kyoto Constella Technologies and therefore it does not require any installation on the part of the user Users can access the service using the web browser installed in their PCs Internet Explorer Firefox Google Chrome Below are the system requirements to use CzeekD e OS Windows 7 8 8 1 e Screen Resolution HD 1366x768 and above Full HD 1920x1080 is recommended Browser Firefox Google Chrome Internet Explorer e Java version 6 or 7 e Flash Player Adobe Flash Player 11 and above e Network connection Broadband 1 2 Login and operational policy Users can reach the CzeekD server by accessing the URL below https denovo czeek jp czeekd Once you reach the login page fill in your login name and password then click on the Login button Please change your password particularly during your first login Refer to section 5 for details Below is our server operation policy e Our servers are structured such that our application server is our frontend and the calculation server is our backend e We dedicate one CzeekD server for every customer e Data backups are performed automatically every Sunday midnight AM e In principle we retain customer data only for a period of one year Kyoto Constella Technologies Il CzeekD Manual 2 Creation of Compounds in CzeekD The principle behind the creation of virtual compounds using CzeekD will be explained in this section 2 1 Compound creatio
Download Pdf Manuals
Related Search
Related Contents
Marmitek LW12 MODO DE EMPLEO BCV43, BCV63 BA DRAFT V1 LA GRÂCE DÊTRE PARENTS DSP Radio Receiver GST Professional EMACO SFR® IDC mem - Orion France 戸別有料収集(大型ごみなど)(PDF:1.8MB) Copyright © All rights reserved.
Failed to retrieve file