Home
        Client User`s Manual
         Contents
1.         46    RECOG Client User   s Manual       Cluster dist The cluster distance and sub cluster distance  are displayed     Phylogenetic Pattern Coefficient In a similar phylogenetic pattern search  the    coefficient of correlation with a specified    pattern is displayed     11 3  Display Nondisplay of the homology cluster header    1  Click the right mouse button on the cluster header  click the menu  and click Show Hide    homology cluster label     11 4  Species color setting    The species color is set  The color set here is reflected in the background color of the PPM    species header and the locus tag label of the phylogenetic tree in the multiple alignment    analysis     1  Select the species on the Taxonomy Tree at the upper part of the Selected tab  click the    right mouse button  and click Color organism     Choose    in the pop up menu     The Color palette screen is displayed     2  Set the color on the Color palette screen and click the OK button to set the species color     11 5  Changing the PPM cell boundary color    1  Select Color     Base Color on the control panel  click the right mouse button and click    Edit to display the PPM tab on the Option screen           Taxonomy Tree  PPM Function Category Color Neighboring Cluster Multiple Alignment Tree General     V  Background color for cells containing inparalogs   gt     2   gene   ID   V  Display gene names or the number of genes      Z  cole tor border ines       Background color for ingroup cets
2.     result     39    RECOG Client User   s Manual       9 4  Display of the DomClust analysis result properties    For the DomClust analysis result  the parameters for the DomClust execution  the applied PPM  sorting conditions  etc  are displayed   Also  the RECOG server can be edited for default access by the currently displayed DomClust    analysis result     9  Click Option     Properties    to display the Properties screen     Ed Properties          ject   le name  Cluster Tab ID  Nurriber of Select Genomes  hgroup organisms  Outgroup organisms  Muriber of Phylogenetic Patterns  Murnber of Clusters    Organism Filter   Fitter by minimum number of organisms  DormeClust Options   Cutoff BLAST E value   Cutoff DP score   Alignment coverage for domain spitting  Score cutoff for domain splitting  Similarity measure for orfhology   Best hit criterion       10  To edit the RECOG server  select the server from among those listed by double clicking    value in the Server drop down menu  and click the Apply button     40    RECOG Client User   s Manual       9 5  Storage of the DomClust analysis results    The DomClust analysis result is automatically saved to the project directory or one of its  lower level directories when the analysis is conducted  To refer to the analysis result using  another tool  save the analysis result in DomClust format   dclst  or tab delimited format    Although the tab delimited format file is useful for loading it into and displaying it on Excel   et
3.    1  The ortholog analysis is performed and the results are displayed as described in    9 2    Execution of DomClust    and    9 3 Display of the DomClust analysis results        2  Click 89 gt   Core genome alignment  CoreAligner   in the Toolbox  The Core Genome  Alignment screen is displayed   3  Specify the conditions for the analysis carried out by the CoreAligner program on the Core    Genome Alignment screen  There are two ways of specifying the analysis conditions   Simple Mode and Expert Mode     Simple Mode Expert Mode   Assignable    Reference genome   Reference genome   item   Cutoff ratio of conserved orthologs   Cutoff ratio of conserved orthologs  and neighborhood relations   Cutoff ratio of conserved    Window size neighborhood relations    Use unuse species group   Window size      Use unuse species group    Display Nondisplay of species      Display order of species      Simple Mode    Gore Genome Alienment    Reference Genome     Window Size     Use sp group       RECOG Client User   s Manual         Expert Mode    Gore Genome Alienment          B orvyiotquetaciens FZB42  B aenyloliquefaciens F ZB42  Bsenyioiquetacions FZB42  B eevyloliquelaciens F ZB42  B aeryloiquelaciens FZB42  B cevyioiquetaciens FZB42    MO00000       C folt ratio of conserved genes  WY       Citolt ralio of conserved cormections  6        Required coverage of each genome for each chister          Citolt of deletion score for removing local regions       Other Options     F  Use sp g
4.    H Desulfurococcales 4   J oy Desulfurococcales 4   i  H Desulfurococcaceae 3    gt     Desulfurococcaceae 3   ameropyrum pernix K1 LeAeropyrum pernix K1    Lionicoccus hospitalis KINGA  L staphylothermus marinus F1    Lionicoccus hospitalis KINGA    La staphylothermus marinus F1  2 Aee eD   LEHyperthermus butylicus DSM 5456       LH perthermus butylicus DSM 5456  mo nana ana i eas  n     Supplement  Rank item modifications    To modify the Rank items in the drop down menu     1  Click Ve  Option  in the Toolbox to display the Option screen     2  On the Option screen  click the Taxonomy Tree tab     3  In the Rank Item form on the Taxonomy Tree tab  check the taxonomic rank to be       34    RECOG Client User   s Manual       displayed     4  Click the Apply button on the Option screen        35    RECOG Client User   s Manual       9  Ortholog Clustering  DomClust     Ortholog clustering is performed upon specifying a species  The result is displayed on the  PPM  This is the first analysis conducted in RECOG  and is the basis for all comparative    analyses     9 1  New analysis    If any DomClust analysis result is displayed  clear the result     1  Click New     New Analysis to clear the currently displayed DomClust analysis result     9 2  Execution of DomClust     Note  DomClust analysis is possible only in an environment where Internet connection is    available     2  Specify the ingroup outgroup by means of the method shown in 8 3 Spec ification    of the ingroup o
5.    group can be executed     Cluster ID The cluster ID is displayed    Gene Name The representative gene name of the cluster  sub cluster  is  displayed    Description The representative description of the cluster  sub cluster  is    displayed  As the background color  the color corresponding to the    typical function category of the cluster  sub cluster  is used     Gene information table  center table    The information on the genes belonging to the cluster selected on the PPM or the cluster  information table is displayed  Upon double clicking  a web browser is activated and the  details of the gene information are displayed  If multiple genes on the table are selected  and right clicked  various functions  including Multiple Alignment and the display of a    Regional Genome Map  for the selected gene group can be executed     Organism The species code is displayed   Locus Tag The locus tag of a gene  domain  is displayed  For the domain  the    domain number is shown at the word   s end     Detailed gene information table  lower table   Detailed information on the gene selected on the PPM or the cluster information table is    displayed  The registered gene properties are also displayed     Organism The species code is displayed   Locus Tag Locus Tag   Gene Name Gene name   Accession  P  Accession number   Position Gene region   Direction Direction of gene   Feature Key Feature key   GI number GI number   Gene ID Gene ID   Description Description       22    RECOG Client
6.    to select the relevant species on the Taxonomy Tree on the All tab Selected tab        52    RECOG Client User   s Manual       12  Color Display by Properties    Each cell on the PPM can be colored and displayed using the gene properties registered as    described in    29 1 Registration of gene properties     the correlation coefficient determined    based on the similar phylogenetic pattern search function  etc     ILL    Color display setting by properties    Click       Color genes by properties  in the Toolbox to display the Color genes by    properties screen     Specify the coloring conditions on the Color genes by properties screen     Organism  Specify a species     Property  Specify properties     Value type or enumeration type properties can be colored and displayed     Color  Set the color on the PPM     If the properties are of the value type      lt  gt     Threshold setting   The color as per a specified property value is set  labels 1 and 2 in the above figure    Click the Add color button to set up to four values and colors  The color s   between them is determined by the linear interpolation method  Click the Remove    color button to remove the intermediate values        Color by     Specify whether to set the color based on value  Value  or ranking  Rank         The method for assigning rank     If the color is set as per ranking  specify whether to rank all the species specified     All species  or each species  Every species         Automatic update 
7.   10    RECOG Client User   s Manual       3  Installation Uninstallation    3 1  Installation of RECOG for Windows    1  Set    JAVA HOME     an environmental variable  as follows   Variable  JAVA HOME  Value  the directory in which the JAVA JRE 1s installed     2  Double click recog client  lt version gt  exe to start the installation   Install RECOG by following the instructions on the screen  Upon completion of the    installation process  the RECOG menu is added to the start menu     3 2  Installation of RECOG for Mac    1  Double click recog client pkg tgz to create recog client pkg     2  Double click recog client pkg to start the installation   Install RECOG by following the instructions on the screen     During the installation  the administrator   s username and password are requested     3 3  Installation of RECOG for Linux    1  Set    JAVA HOME     an environmental variable  with the following command     bash  export JAVA _HOME  lt JAVA JRE home directory gt     csh  setenv JAVA_HOME  lt JAVA JRE home directory gt        2  Decompress recog client  lt version gt  tgz to create a recog directory     Create the recog directory in an appropriate place     11    RECOG Client User   s Manual       3 4  Uninstallation of RECOG for Windows    1  Click Uninstall RECOG in the start menu to remove the installation directory   2  Ifthe installation directory is not removed  remove it manually   3  The data directory C   Documents and Settings   lt user account gt   RECOG i
8.   294  Removing    OV CN ceca as aa a a acts stan EE A aaia 120  Registration and Management of Gene Cluster Sets o cccccccccecesessssccessesssessssesesesesseeevavavessesevavavenseseees 121  30 1  Registration of a POTS cluster SSL papalecaineserasantivciaraienwncnes entra eabierieiaearsiuisenavsininidiesienadonmmemnenniebande 121  30 2  Outputting a gene cluster Set tO a FIC    ee cccesesesesssessesesescscecevsceeescevevsvevesevaveveneetseevavaveveneeeeeetans 124  30 3  Editing a gene cluster set  removing ZENES       cccsecsscesecsesescsesecessesecersesececsessvsesecaeeesavecsesesaceeseceeseees 124  30 4  Registering additional genes clusters to a gene cluster Sti    eecececececeseseecesesesseseeseteeeseeeaees 125  30 3  Removing a gene  SIE oe aca sraraictsatcroannateressencnnrenasaniatnad nericanionnmmnatenaidatqaiaaenanieinemolatinnen  125  30 6  Referencing the list of gene ClUSter SOS    eeecesssscsesessecscecesesesesvaveveveceeeetevaveneseetevavaveveneeeetanans 125  PNM E  acacia cts vac AEE E vo epee piesa A ev eves cea acess esa sae esses R E AA A vars A 127  31 1  Registering a combined Se  t        osoeeoseseeeeeeeesneneserererunerurerurnrururururururisrsisknrsnsnsnnnnknrnnnnnnnrannrununnrunun nenene stusads 127  3 t2 Editing a Combed SEE raii aE 130  silo  Remove a Combined n      Cope one SP Pcs ee re ee 130  31 4  Specifying a combined set as a filter condition woe ccc cccee ces ceseeeeeceeceseesesseeeesessensaveeeessenenees 130  31 5  Specifying a combined set
9.   E  Gackyround color tor outgroup cess  P   C  Background color for outgroup cells considered to be ingroup    Background color for selected ine cetis 1    E  Background color tor setected ine cets 2   JE    Background color for search resut cells          RECOG Client User   s Manual       2  On the PPM tab  the background color of the cells  the display nondisplay and the color of    the boundary are set     Color for border line The color of the boundary between cells is set   Untick  and the boundary is not drawn    Background color for ingroup cells The background color of the ingroup cells is set    Background color for outgroup cells The background color of the outgroup cells is set    Background color for outgroup cells If the horizontal transfer option is specified  the   considered to be ingroup background color of the outgroup cells that are  considered to derive from the ingroup is specified    Background color for selected line cell  1  The background color of the cells of the selected  cluster is specified    Background color for selected line cell  2  The background color of the cells of the selected  species is specified    Background color for search result cells The background color of the cells of the searched    cluster is specified   3  Click the Apply button     4  To enable disable the color settings of the PPM cells  double click Color     Base Color on    the control panel     11 6  Color change according to the gene count within a    cell    The bac
10.   IK Bacillus cereus ES    In Basmus cereus NYH 391 98   IE Beciius ihuringensis serovar koniiuiiaon 97 27  In Basmus thuringiensis Al Hakam   IR Bactius clousd KSM K16    vee Mesie Uatethmene Om ear                   Bacillus amyhiquetaciens FZB42     Socilus cereus ATCC 10997       95    RECOG Client User   s Manual       25 3  Editing species group names    1  Select Sp Group on the Selected tab     2  Select the species group to be renamed in the lower view  click the right mouse button  and    click Rename  Rename screen is displayed     3  To rename the species group  edit the name on the Rename screen and click the Apply    button     25 4  Removing species groups and removing species    from a species group    1  Select Sp Group on the Selected tab     2  Select the species groups or species to be removed in the lower view  click the right mouse    button  and click Delete     96    RECOG Client User   s Manual       26  Genome Core Structure Alignment     CoreAligner     Genome core structure analysis consists of the extraction of genomic structures that are well  conserved among related genomes  That is  a pair of orthologous groups are extracted whose  genomic neighborhood relationship is conserved above a certain level  and the orthologous  groups are realigned based on this neighborhood relationsships  The CoreAligner program for    such analysis is run on the RECOG server  and the extracted core structures are displayed     26 1  Running the CoreAligner program 
11.   Switching the RECOG OEE rr oss scenes ererncieesteteaorstee sled niccede nnocen 29  7 3  Reference to the registered RECOG servers ooocicieccececccccescescesceseesescessesssseeseseeseuseseessesensessevensenseeees 29  TA  Registering the REGOG SQV 60 wicsiiciscesiicesiciacciscsceieisuiienstaccsadsacsiystanssiacustdecaneabdguddnedsbensiacdstantaatibdjecinadstacsindes 30  7 5  Editing the RECOG Servebuuui iccccecccccccscescecescsscessecessesseeeeseuseesueeeseasavseseasevseseuseuseveeseeseusesseseusesevessensenees 30  10  Rom  val or the RECOU Gyor enine AE aaa 31  Display and Manipulation of the Taxonomy BrowSer  aeeseseseeseeseeeeeseruorreereusrrisiensnrisnersnsrsnsrererererererernns 32  8 1  Expansion Collapse of the Taxonomy Tree wue e ciccceccsesssssssesssessscssesesesescsvsvevssesesssvavesesteeevavavensetseetaneven 32  8 2  Specification of a set of classification ranks to be displayed on the Taxonomy Tree           32  8 3  Specification of the INGrOUD OUTZLOUD      cececesescssscececesesescececeveceecsesvavaveveceecetevavsueseesevavaveveeeetevavaveveneeeatens 33  8 4  Automatic IZFOUD  OUTPFOUD specification smsisisctssuniniicsssshcisiitsvavcanntsnnchasecsiscaniiavndsainenadavntcasiativadiaraekeauanes 34  Ortholog Clustering  DOMCIUSE  uu    cseessscsssssscesscsscessccssessssesessescssesesesssseseseasseeseccsnesssesteassnsteassesteaesteseststeataee 36  SR Ce Nae  4 12  colo ae ree EE ee nee A eran eee aS ee eT 36  Ta ECCU OF Bog OLD Col emer terre eee errr a tremor ee t
12.   and specify the    ClustalW conditions on the Multiple Alignment tab on the Option screen                Taxonomy Tree PPM Function Category Color Neighboring Clusters    Mutiple Alignenert  Tree General     Path   C  projectm eclipse recog_clent clustatw exe      Options    TYPE Use defout v      OUTPUT OUTORDER MATRIX SEGNO RANGE    lchastal v   fasaned v  olosum v   FF v  START            GAPOPEN GAPEXT GAPDIST  100 02       GUICKTREE ON     OFF             KTUPLE VANDOW SCORE  1j 5   percert v          PYVONAMATRIX PYVGAPOPEN PYYGAPEXT  lolosurn v    100 04              Path    Specify the path of ClustalW  The default setting is the ClustalW attached to the    installer     83    RECOG Client User   s Manual          TYPE    Use the default   Multiple alignment is executed with the default parameter of ClustalW     Protein     Multiple alignment is executed as per the setting specified on the screen     2  Inthe Disaggregate Mode  select the cluster in the cluster information display table on the  Info tab  click the right mouse button  and click Multiple Alignment on the pop up menu  to display the progress screen and execute the multiple alignment  When the Confirm    screen is displayed  click the OK button        Info   Histogram    ossible aspartate transaminase  aminotran   Gene Name Description       Cluster ID    Regional Genome Map Comparison  Multiple Alignment       3  After the completion of the multiple alignment process  the Multiple Alignment screen is  
13.  3  Saving the list of genes    1  To output the list of gene property values in tab delimiting format  click the Export     button on the Gene List screen to display the Save gene list screen  enter the output file  name and click the OK button     76    RECOG Client User   s Manual       20  Display and Operation of the Circular  Genome Map  CGM     The Circular Genome Map  CGM  draws a circular linear genome map of the selected species     t Genome Map       20 1  Displaying the CGM    1  To display the CGM  select the species on the Taxonomy Tree on the Selected tab  click  the right mouse button and click Genome Map   Chromosome name     IR Bocifus cereus E33L  IE Bacilus cereus HYH 391 93                Ik alaes A  in E   Taxonomy  gt   IRE Sp Group  gt     EC        chromosome 1  IKE Gene List plasmid p819727  e   Gene order on bik   Color b       Ik       71    RECOG Client User   s Manual       20 2  Changing the selected region    1  Enter the region to be selected in From and To in the upper part of the CGM screen  and  click the Show button  The entetred region is highlighted and the displayed region of the  genome map in the lower part of the CGM screen changes     The selected region can also be changed by dragging the mouse on the circular genome     2  To move the selected region in the clockwise counterclockwise direction  click the cl    Previous  Next  button in the lower part of the CGM screen  respectively  It is    also possible to change the moving dist
14.  Clustering  PhyloPatClust      Execute      Local     Server  Target      Sub Cluster    Cluster     Use displayed clusters only     gt  gt  Parameters       Execute  If the clustering is carried out locally  specify Local  If the clustering is       carried out on the RECOG server  specify Server      Note  Server is available only in an environment with an Internet connection        Target  If the clustering is carried out based on the phylogenetic pattern of a cluster   specify Cluster  If the clustering is carried out based on the phylogenetic pattern of a    sub cluster  specify Sub Cluster       Use displayed clusters only  If this is checked  phylogenetic pattern clustering is  carried out only for the clusters currently displayed on the PPM  This reduces the    processing time when there are many clusters     3  Click the Parameters button to set the parameters on the parameter setting screen        60    RECOG Client User   s Manual       4  Click the Apply button on the Confirm screen to execute PhloPatClust  Upon completion  of the process  a dendrogram  clustering tree  1s displayed on the Clustering tab based on    the clustering result  and the PPM is sorted based on its arrangement     2  Phylogenetic Pattern Clustering  PhyloPatClust      Execute     Local     Server  Target     Sub Cluster     Cluster  C  Use displayed clusters only     lt  lt  Parameters     PhyloPatClust Parmeters          Minimum number of organisms   3  4          Miss Dist Ratio    EJ  
15.  Each line is color coded in accordance with the conditions     Color Details   Black There is no insertion between genes    Green There is an insertion between genes    Red The relative direction of the genes is reversed  inversion        Gene Direction  The triangular arrow in the upper part of ortholog group indicates the direction   The background is displayed in the color corresponding to the typical function category    of the ortholog group        99    RECOG Client User   s Manual       26 4     ZO     26 6     vs ona    Changing the display position    To scroll the screen in the direction of the clicked button  click    Move Lett  gt      Move Right  in the Toolbox on the Core Genome Alignment screen     The display position can be changed by dragging the mouse on the Overview window     Selecting an ortholog group    To highlight an ortholog group  click the ortholog group in the magnified view on the Core  Genome Alignment screen  Also  in the Genome Comparison Viewer  the corresponding  ortholog group is highlighted     Locating an ortholog group at center    Double click the relevant ortholog group on the Core Genome Alignment screen to display  it at center  Also  on the Genome Comparison Viewer screen  the corresponding ortholog  group is mainly displayed     Setting a reference genome    To display the Core Genome Alignment display modification screen  click View     View  Change     on the Core Genome Alignment screen     In the    Target    column  check th
16.  If this column is checked  the Core genes are displayed   Non Core genes Genes that were not extracted by the CoreAligner analysis are   called Non Core genes    If this column is checked  the Non Core genes are displayed     RNA If this column is checked  the RNA genes are displayed     109    RECOG Client User   s Manual       Switching options for ortholog line display   Options Details   Universal Only Ortholog groups that contain the genes of all species are called  universal ortholog groups   If this column is selected  only the universal ortholog groups are  displayed    1 vs  1 Only Ortholog groups that contain only a gene for each species are  called 1 vs  1 groups     If this column is checked  only the 1 vs 1 ortholog groups are    displayed   Unique Only If this column is checked  only unique genes that form no    Non Core genes only ortholog groups are displayed     3  Click the Apply button     27 11  Changing the display style of species names    1  Click View   View Change    on the Genome Comparison Viewer screen     Click the View tab on the screen for changing the display style of the Genome Comparison    Viewer screen     2  Select the display stype of species names in the Organism column on the View tab       Normal  The official species names are displayed     Abbreviation  The abbreviated species names are displayed       Show Strain  If this is checked  the strains are displayed     110    RECOG Client User   s Manual       27 12  Display Nondisplay o
17.  Nondisplay of the Locus Tae aiinnwsiunwncicnisendinvstieierrindivennedinieiveincintiveneniinnn 111       RECOG Client User   s Manual       28     29     30     31     32     PLN Ba ME GT e E a gee eee eee ele Pe Pe Om Em Pee ROE Rie Oe me A PA A em Pee EAE teen Pe 111  21 14  Automatic correction of the gene Orientation    cece cece ces cesceeessessesessesseeeesesseeseveesesseneeeens 112  21 15  Changing the display style of the scale MarkS woe cece ces cee ceeeeeeeceeceseesesseveesesseeseseeenseneneens 113  AR   Pl se all 1   gn net en eT ete ene ROE Em Une TOE Reena Se meee eR RC TR NOR Metter meee enn Meer oer ewe 114  Updating the Gene Information asic cis tara ned aches saccid assccdgawn tes sa nnaccidiabwaicndeavinisaceasienieassvedaneeeiaenbes  115  28 1  Updating the Taxonomy Tree based on the Update notice ou    ceeeeecessessessessesssssessesseseseenees 115  28 2  Updating gene information through Update Data wo    cece cece ceeceseeseeseseesessevenseeeenseneneens 116  Registration and Management of Gene Cluster Properties       ccecccecscsesesesssssessesesesssseseseeesseeseeeenean 117  29 1  Registration of gene DEO CTS segs sec cacc po bacs tes ac ceavetaccanneheoinasnetaracdencecectanictebdsbanenezedanniacesnansceeninants 117  29 2  Referencing a list of gene cluster Proper tie oo    eecceseseseccssecssesesessesssessseseesssevevesesseveveneseneeeetanes 118  p20 We EAE 01 6  6  i af   05 pee oO tor cr Ra ren reer net EERO VE REUSE CRE RE RON RU eos ee maT Ree 120
18.  REAM_OOOD10 REAALOOOOED FSIALOOOOE  REAM_OOO100 RBAAL_OOGDISO FETAL_OOCeS  FEREM_O0OS 0 REWLOO03D  REAU 0903S   OOD IC_ICoIL__ Secs sist Ls G SS     0 A i    o Omen  FAFELOOD  10 REAM_OD0170 REALIZZO  PEAM_000190  REAM 000200  FEPAL_OSS1  REAM O09210    REAM 033160  Bacillus anthracis Serne  5222084 45011    FAMR 225 1  tasiga TAAA  BAS0WDI  BASWDIS  BASIQ BASO0OS TRAA ILE   BASODIT BASODIS TRA SER  1  BASIOZI TERA ALA A  BASODOS BASIOUG REHA  IGS 1  REHA SS 1  BASDOIA BASDOZUBASOOZZ TFNA ILE 4  BASIOZI BASODSS GAS  OCCA SE a CO i ieee  Coot    Cj man  0  Byen0NS BaOO17 pasos  eistis  besO019    Bacillus cereus ATCC 10987  5218705 45012  ECE_5 7S3 BCE OMA ELE 0023 ECE 0034  BLE O02 BCE _Seat BCE 001A BLE_OO21 BCE SELI BLE OMS CEMS  BCE 00I BLE 5738 BCE 0010 BLE_0013 ECE_ODIS BIE 0023 SCEO  Bie 0022  BLE_O002  BCEL00OS  SCELSSA BLE_OOOSBLE_OOTI ECE SSA  BIE DOOBLE MOS SCE Bte cozi ete   BE00Ol ECE BLE _COO7 BCE S740 BCE O12 BLE _0019 BLE 0022 BLE _S643 BCE S743 ECE_0OS  BCE _O0GEE  Bee LILI  eS oo  CD een ee  Go  Oj   n  BtE_OOKS BCE OCS       21 1  Displaying the RGM    1  To display the Regional Genome Map  RGM  screen  select a cluster on the Info tab  click    the right mouse button and click Regional Genome Map on the pop up menu     Info   Histogram    lutamine amidotransferase pdxT  SNO family  Cluster ID Gene Name Description  09441 yaa    Regional Genome Map Comparison    Multiple Alignment       RECOG Client User   s Manual       ACOs    Zi     21 4     Zooming in o
19.  Server screen  enter the URL of the RECOG server and click the Apply    button     4  On the Server List screen  click the Apply button     7 5  Editing the RECOG server    1  Click File     Change Server     Server List    to display the Server List screen     2  On the Server List screen  select the RECOG server to be edited and click the Edit     button  The Edit Server screen is displayed     3  On the Edit Server screen  enter the URL of the RECOG server and click the Apply    button     4  On the Server List screen  click the Apply button     30    RECOG Client User   s Manual       7 6  Removal of the RECOG server    1  Click File     Change Server     Server List    to display the Server List screen     2  On the Server List screen  select the RECOG server to be removed and click the Delete    button  A warning message appears  Click the OK button     Q  Delete server  http   test server ac jp         3  On the Server List screen  click the Apply button     31    RECOG Client User   s Manual       8  Display and Manipulation of the Taxonomy    Browser    The Taxonomy Browser displays the taxonomy tree of organisms  On the tree  manipulations  can be performed  including the selection of the ingroup outgroup to be subjected to DomClust  analysis    The Taxonomy Browser comprises the All tab for specifying the species group to be subjected  to analysis from among all the available species  and the Selected tab for specifying various    manipulations of the species grou
20.  Set Cluster Set     Combined Set Name on the control panel     131    RECOG Client User   s Manual       32  Species Set    Multiple species can be specified as a set  The species set can be utilized for the phylogenetic    pattern filtering setting  the species color setting  and profile editing and operation     32 1  Registering a species set    1  To display the Organism Set screen  select a species on the Taxonomy Tree in the upper  part of the Selected tab  click the right mouse button  and click Organism set   New    organism set        Organism Set       Name   Organism Set 1       Target       New O Overwrite O Add       2  Enter the name on the Organism Set screen  In the Target column  specify one of the  following  New  new registration   Overwrite  overwriting of an existing species set      Add  additional registration      3  To display the species set in Organism Set on the set control panel  specify the conditions    and click the Apply button         Control     Sp Group     Set    Gene Set   Gene Set Conbination   Cluster Set   Cluster Set Conbination   Organism Set      C Organism Set 1      Bacillus cereus 4H187      Bacillus cereus 4H820      Bacillus cereus ATCC 10987    Organism Set 2       132    RECOG Client User   s Manual       32 2  Editing a species set name    To display the Organism Set screen  select a species set in Organism Set on the set control  panel  click the right mouse button  and click Edit     1  Modify the species set name and cli
21.  Tree screen  the following operations are possible       Reading the dnd files           Click       Open DND file   select the dnd file and click the OK button      Outputting the phylogenetic tree to a PDF file   Click Dr  PDF   specify destination and file name  and click the OK button       Switching between the rooted phylogenetic tree and the unrooted phylogenetic tree     To display the rooted phylogenetic tree  click    Rooted Horizontal   and to    display the unrooted phylogenetic tree  click  gt   Unrooted      Displaying the distance     Click ah  Show Length ON OFF         86    RECOG Client User   s Manual         Various settings        j  Option Settings       lt  Directory tab    Set the default directory for loading DND files and the defaut directory for outputting  PDF files      lt  Style  Specify the node style      lt   Color  Specify the Locus Tag colors     Species   Each species is displayed by the species color  the color of the species header in  the PPM table  specified in the Color Organisms menu on the Texonomy  Tree in the upper part of the Selected tab      OUT   IN Group  Each species is displayed by the ingroup outgroup color specified in Font Color  on the Taxonomy Tree tab on the Option screen  according to the current    ingroup outgroup specification     87    RECOG Client User   s Manual       23  Function Category Frequency Graph   Numerical Data Graph    On the Histogram tab  the frequency of the function category in the same phylogen
22.  User   s Manual       5 9  Histogram tab    On the Histogram tab  cluster properties are displayed in graphs of various types  In normal    disaggregate mode  annotation information is displayed           Gan    mode    Aggregate mode  Disaggregate    Bar graph for phylogenetic pattern frequency   Description   Function Category  The description is displayed  For the background color  the color of  the sub cluster   s typical function category is displayed     Value    1  The species    numerical data are displayed in a bar graph line    graph     Difference    2  The differences in the species    numerical data are displayed in a    bar graph line graph     nig Hayen         ED i    731  633  PL  335  m    ni I             97 _AD04E          s37 A420 ee    3            Pyridoxine blossr  bosis eroteln       BIE cerS   Seryi thath eyrhhetane    ese  thi Clycerate kirase Cid            64_AN250     64_AN042    164 A410    20 _ 4205   BCE 425s       Graph display of the property value       23    RECOG Client User   s Manual       5 10  Status bar    The Status bar displays the amount of used memory  PPM information and application update    information     136M6  1065ME6 Number of Selected Genomes  15 Number of Phylogenetic Patlerns  16910    1  Display of the amount of used memory  left side   The amount of memory used by the current application is displayed     Left side  the amount of memory used by the application    Right side  the amount of memory allocated to the appli
23.  as a Color CONITION Li  eee cscs ces ceeeeeessesceeessesseveesessevenseesenseneneens 130  31 6  Enabling  Disabling a filter SO CEI  csan ai 131  31 7  Enabling Disabling a color SEW ccczixtcestasncichsvinnibosesaddacinianncnedbsrcacheasasadnstaschuauvsabancsaidsiuinioseeasieaniaceicusnds 131  e E     aE E E EE AE 132  32 1  Registering a species Sts ursrisnidssioniied dianid niania niaaa endi diania 132  32 2  Editing a species set NAME ouw ieeccececceccescsses ces ceseseeseesesseseesessessesessesseesesesseusesesseusesesseuseseesenseusevensesseneneens 133  323  Removing a Species St anusia nee en ne STS CT Pe mura TDS ee ee Tee 133  32 4  Setting colors using a species SO escees cca acta recctem eg chateau aed nneteanonatabiticeutaaleuenttieeeeintilecs 133  32 9  Taxonomy filtering Using a Species SO ris cacaccscssccssoccasatisnteencsnanctnsscseseraerstanrsacesacncaasetasastanusigeeaneipeniae 133       RECOG Client User   s Manual       33  Similar Phylogenetic Pattern Search aana A aA E Eai 134  33 1  Profile registration TVONICS i gs eco sc ccs scnencustasucccupascdacnsenaccxancdsedecaesseccksniceactnauedansetsaaceeiacs 134  Se Me AIM et OE ccat sect caer eiscee sce Acacias deie cg cate EA EEAO NAATA 135  a REMOVE a WOE accent TE 135  33 4  Similar phylogenetic pattern searCh       eeseseeeseeseseeseeerserisrersnersnsrsusrsnsrsnsrsnnrnrunurunnrneurunnrneuruenrnenrnennnn 136  33 5  Uses of the results of the phylogenetic pattern similarity Search    eeeeeesesseeeseeeeeeeee
24.  click File   Update Data        2  Select one of the following on the Update data screen  and click the Apply button     Update data    Update data         Updated data only      Updated data only  Force       All data          Taxonomy data only  Only the Taxonomy Tree data are updated  Other data are retrieved as the need arises     Usually  this mode is the most efficient        Updated data only  Regarding the Taxonomy Tree data and the gene information currently held by the  RECOG Client  only the data updated by the server are updated  This function is used    to update necessary information collectively        Updated data only  Force   The Taxonomy Tree data and all the gene information held by the RECOG Client are  updated  This function is used to force an update in cases where the RECOG Client   s    information is defective        Alldata  All the gene data are downloaded      Note  If All data is specified  it may take about several tens of minutes to download the data     depending also on the network transfer rate     116    RECOG Client User   s Manual       29  Registration and Management of    Gene Cluster Properties    It is possible to incorporate any type of gene information by importing gene property files   Cluster properties are properties defined for each ortholog cluster  which typically calculated  using gene properties  Gene properties and cluster properties can be utilized for various    analyses such as sorting and PPM coloring     29 1  Registr
25.  is displayed in the color specified by the user in Color on the control panel     Genome Map Options     Function Category  Function Cetegory     on Category ee tr  1 Amino acid biosynthesis    1 7 Histicine family  2 Purines  pyridines  nucleosides  and nucleotides                                              ener       Positive Negative  Positive Negative ME       Use the coke of the control tres       20 5  Displaying gene information in a browser    i    Double click a gene on the genome map in the lower part of the CGM screen to display  the information on the default external resource specified in   35  External Resource URL  Management    in the browser    Click the external resource URL displayed upon right clicking  and the information on the    external resource is displayed in the browser        19    RECOG Client User   s Manual       21  Display and Operation of the Regional  Genome Map  RGM     To allow users to compare the chromosome map around the genes belonging to a specified  ortholog cluster  the Regional Genome Map  RGM  displays the genome map  where these    genes are arranged at the center     thd Alinment List    Pa    j  a    Bacillus anyloliquefaciens FZB42  33913340 45341  REAY_ OWDSD REAL CONTO FRPAOWSO FOAL  FEAH OCOOAO REALU 030140 REALL OI02GO FRPH_OOOS4  REALLUDIQADE  REALL OCO0SO RBAL 090159 REA OOOI ROAU COUSO RALL 090359 REWAL OO0SS0  REALL OCOOCO  RSALLCOOITO REAL COUJSIRSRA 00120 REALL OIO1ICO FBA 00a  FEPALOOOSIOREAM_OOOSSO 86 REAULOO0 
26.  meee te ene ee eee OUT Peer To ene Ent EO PeE on nena oe ere eee eee ere ees 91  24  Clustering Neighborhood a          lt  1 gt  aoa ene ee ee eon se Pe PR Pe 92  24 1  Execution of the clustering of neighborhood genes uo  cesceceesessesceseesessesesessessesssessessessnses 92  Display  Nondisplay of the clustering results ca  ccscccccscccsccccscssssecescsesaseceeasseacenscssensesvacevssatceaesvenesssentsitsstoasseieas 93  24 2  Changing the color of a group of neighborhood GENES    eee ees ces cseeseecesseseesesseseesssesseeesenses 94       RECOG Client User   s Manual       25     26     ei     Pe  Ook ol aC   gc  a en ae Cena ee A EAT PT PE AT E A EEE ee Oe EP AE CT eae ne eee A 95  23  l  Displaying Se secs casa a aa AES AA TATANA aT E 95  29 2  Registration of species Bi Sasi tos cis accsicssttnccat san eaaa aaia A a a Aa AN AEE ARAARA NAARS 95  25 3  Editing species group NamMesS anes eet eee een eee er aR On SS ee RO EEE OOS ene Sn RAN mre evn CN nme eae 96  25 4  Removing species groups and removing species from a species group ou    eeeeeeseeseeseeeeeeeeees 96  Genome Core Structure Alignment  CoreAliginer     c scccscsssscsssesessesecessssesessesesseseseceesesacsesecessesevereesesieeeraesee 97  26 1  Running the CoreAligner program ciel scczccscisaitiiacesscisobdpectensiesssaidsicensecssecsutesnidesaneiddadiniebdocctebisitectieddaacucd  s 97  26 2  Displaying the CoreAligner analysis results wo    cece ccc ces cesceseesescesceseesesseesesesseusesseveeseuseveesensen
27.  on the set control panel  click the right mouse button  and click Delete set    combination  When the warning message is displayed  click the OK button     31 4     SEs    Specifying a combined set as a filter condition    To set a combined set as a filter condition in Filter   Gene Set Filter Cluster Set Filter  on the control panel  select the combined set in Gene Set Combination Cluster Set  Combination on the set control panel  click the right mouse button  and click Register    filter     Specifying a combined set as a color condition    To set a combined set as a color condition in Filter   Gene Set Cluster Set on the control  panel  select the combined set in Gene Set Combination Cluster Set Combination on    the set control panel  click the right mouse button  and click Register color     130    RECOG Client User   s Manual       31 6  Enabling Disabling a filter setting    1  Select Filter   Gene Set Filter Cluster Set Filter on the control panel  click the right    mouse button  and click Enable Disable     2  To enable disable each individual condition of a combined set  double click the condition    in Filter   Gene Set Filter Cluster Set Filter   Combined Set Name on the control panel     31 7  Enabling Disabling a color setting    1  Select Color   Gene Set Cluster Set on the control panel  click the right mouse button     and click Enable Disable     2  To enable disable each individual condition of a combined set  double click the condition    in Color   Gene
28.  presence or absence of genes  phylogenetic pattern  is  displayed as a heat map  Basically  species are shown in the order of appearance on the  taxonomy tree  but if outgroup species are specified  ingroup species are displayed on the left  side and outgroup species are displayed on the right side  However  the order of display can  be changed via the operation panel  In each cell  the  locus_tag of the gene  which belongs to  the cell  or the number of genes is displayed  When a cell is clicked  the relevant cluster and  gene data are displayed on the Info tab at the right end    If DomClust analysis is carried out upon specifying outgroup species  the cluster table is  displayed as a nested table  That is  genes in the outgroup species form an outgroup cluster  which corresponds to multiple swb clusters that consists of genes in the ingroup species  All  of these are included in the  upper hierarchical  cluster  see the figure below   Furthermore   as its upper hierarchical cluster  a homologous cluster is defined as an accumulation of  homologous ortholog groups  In normal disaggregate mode  the function categories and gene  names  or cluster ID   s  of the sub clusters are displayed in the columns on the left side  the  function categories and gene names of the clusters are displayed in the columns on the right  side  and homologous clusters are displayed on the right side thereof  These columns are  called cluster headers  On the other hand  in the aggregate mode  clus
29.  results are saved to a file  they can be used even if the DomClust result    file is reloaded       Profile column  Specify the profile type    lt  Phylogenetic pattern    Use as a profile the binary vector representing the occurrence pattern of species     expressing the presence or absence of species as 0 or 1  respectively         136    RECOG Client User   s Manual        lt  Gene property    Use as a profile the numerical vector representing a property value of each gene    based on the specified gene property     Type of distance    Specify the index for calculation to be used as the value of dissimilarity  in which 0 is    the nearest and 1 is the farthest     lt  Normalized hamming distance   lt   Correlation coefficient    lt  Correlation coefficient  absolute     lt  Mutual information      If   Gene property    is specified in the Profile column  only the    Correlation coefficient       can be specified     Sort by the calculated result  If this is checked  the PPM table is sorted using the calculated distance values   3  To set the following conditions  click the  gt  gt Option button        Treatment of missing values  In cases where there is no gene corresponding to a species in a cluster  specify the  calculation method to cope with any missing values  This setting becomes effective    when Gene property is specified in Profile    lt  Ignore  Calculate in disregard of the species    lt  Average  Calculate by applying the mean of the values of the other genes 
30.  seeseeeeeeeeenenenereseresurerurururururururururuninininnntnnukannknknknknnn nunana nanunananunan aankan nnnn nanne nnana 41  11 5  Changing the PPM cell boundary Color         s sessenonsenennsenenenenensenenesnenrennsneenenrernnrsnenrnnrnrennsnrnronrnrenrerreronne 41  11 6  Color change according to the gene count within a Cell    ccc cee ce see sesceseeseeseseeeeeeenes 48  11 7  Color display corresponding to each functional Cate QOry     eceececcee cece cee cesceseesesceseesesseseeesees 49  11 8  Aggregated display of the PPM  ccc cc scecesceseseeceusesesseeseuseseeseusesseseeseueveesouseaseveeeenseneess 50       RECOG Client User   s Manual       11 9  Limited display of a selected region on the PPM Wu cece cece cesceeeeeeeseeseeeeseeseveeveeeensenens 51  11 10  Highlighting a species by selecting it on the Taxonomy Tree aneen 52  11 11  Selection of a species  phylogenetic pattern  in a cluster    e ecececesssessesesesestesesesessesesseseseceneeees 52  12  Color Display by Propertie oo    icc ccccccceccsceseececeeeeseuseeeeseesevsessesssseuseveeseuseseeseuseveessesesseaseveuseusesesseuseveesenses 03  12 1  Color display setting by DLO CICS asus cas riccedancccantseacsweniccdesreiaciseteeccieievdiaedauetneawdns 53  12 2  Enable Disable property color setting avisciessarcicanitvasvonsstsarveiaiavacinatsursroenuveivet en itavduitvndtounmvdeiounmanieriia 55  To Fe O a AE T A A A AOA rare 56  13 1  PPM sort m disaggregate Mode ee ne aaa ne nr 56  13 2  PPM sort in the aggre
31.  set screen  and click the Apply button     If    Overwrite the gene cluster set of the same name    is checked  the registration is made    upon overwriting the gene cluster set of the same name if such a set exists     Registered gene cluster sets are displayed in Gene Set Cluster Set on the set control       122    RECOG Client User   s Manual       panel     30 1 3     1  Display the Keyword Search Result screen     ti Keyword Search Result   _Neene   a 02956 1 5 5  adaA     SCE_s772 ada A  O SCE4 254_Ass4adas     6C3740     SCAHIS7 _A37iedsA     SCAKS20_S7SiadaA      0292910315    09178104      0373510415    Z02415       02039115    Z 02049 1415    5  02056 1415    Z 020871015    5  02201108    29023121045    3  02707108        028221008      Z  Display only matched genes   M Highight matched genes       a  K   m   a  m   m   K   m    2   m   m   m   a  a   2   m   a  a    roy  story    ry protein Te    Registration from the keyword search results    aai       _  ile    oreintory erotein        rehire rere datory erctein  tional regulatory protein            2  To display the Register _gene cluster set screen  check the column to the right of the    gene cluster to be registered  and click the Register gene set Register cluster set button     3  To register the gene cluster set  specify the set name and the destination for registration on    the Register gene cluster set screen  and click the Apply button     If    Overwrite the gene cluster set of the same name    is ch
32.  the    Any    conditions in Filter     Taxonomy Filter     Any on the control panel   Also  the species names meeting the    Any    conditions are displayed on the tree in light    blue     A name can be attached to each of the    Any    conditions in order to distinguish between the    conditions        All  Selected   Clustering    9 root 6      S Bacteria 6        Bacillales 6     S Bacillaceae 6    IN Bacillus cereus AH187  IN Bacillus cereus AH820  IN Bacillus cereus ATC  IN Bacillus cereus ATCC 14579  IN Bacillus cereus B4264  OUT Oceanobacillus iheyensis HTES31    i  7    Li    o        Control    Sp Group    Set   9 Fitter    Ga    S Aur      Bacillus cereus 4H187     Bacillus cereus AH820  B    y Any Bacillaceae   20 0   2      Bacillus cereus ATCC 10987     Bacillus cereus B4264       To specify the    Any    conditions more easily  select more than one species name  click the    right mouse button and click Taxonomy     Any New   For all any conditions     Any          64    RECOG Client User   s Manual       condition shown below to display in the lower view the species names registered for the       Any    conditions     To add species to the    Any    conditions  select the species name or names  click the right  mouse button and click Taxonomy     Add     Any Add      Additional    Any    condition    The relevant species are added to the    Any    conditions in the lower view and the relevant    species names are displayed on the tree in light blue     All  S
33.  the Keyword Search Result screen is displayed     The conditions corresponding to the search results are displayed in Filter   Color on the       70    RECOG Client User   s Manual       control panel  On the PPM  only the searched clusters are highlighted     Ed Keyword Search Result         Control     Sp Group ada  Fitter ee    E pra pa    Taxonomy Filter     gt  Gene Organism Count Filter  4 Keyword Search     Keyword Search Cluster dna   Gene Set Filter  Cluster Set Fitter  Organism Column  Sort  Color     Keyword search Cluster dna     L703 L Jars O       y   rth  y    aR  y   D   D   i    5       5  Check    Display only matched clusters    on the Keyword Search Result screen to display on  the PPM only the searched clusters  Upon unchecking  all the clusters are displayed     Check    Highlight matched clusters    to highlight the searched clusters  Upon unchecking   and the highlight is cancelled  Also  by clicking the color setting column  the color of the  highlight can be set     71    RECOG Client User   s Manual       17 2  Search of genes       p  1  Click      p  Keyword Search  in the Toolbox to display the Keyword Search screen  On    the Keyword Search screen  click the Gene tab     2  Specify an item and a keyword  and click the Search button     To specify multiple conditions  click the Add Condition button to add conditions     To clear a condition  click the Clear Condition button     Upon specifying the condition s   click the Search button     bea Ke
34.  the Option button on the Core Genome Alignment Preview screen to display the  Core Genome Alignment Print Option screen  and specify the option     Core Genome Alienment Print Option    Row count par one page 10     Show legend    Show organisa name every row    C  Show page number    Page setting       Option Details   Row count per page Specify the number of rows displayed on a page    One page Print so that the Core Genome Alignment image fits into a page   Show legend If this is checked  the legend is displayed     Show organism name on If this is checked  the species names are displayed in all the rows    every row If unchecked  the species names are displayed only in the first  row of each page    Show page number If this is checked  the page number is displayed     Page setting Specify the paper size and orientation     3  Click the Print button on the Core Genome Alignment Preview screen  and the printer  selection screen is displayed  Specify the printing conditions and click the OK button     26 16  Saving the CoreAligner results    The CoreAligner analysis results are saved automatically in the Project directory or its  sub directory when an analysis is conducted  Save the analysis results in the CoreAligner    format   coaln      1     Click te  Save Core Genome File  in the Toolbox  specify the destination for saving    and the file name  and click the OK button        104    RECOG Client User   s Manual       27  Genome Comparison Viewer    The Genome Compariso
35.  the completion of the DomClust analysis  the DomClust analysis result is displayed  on the PPM  When the completion message is displayed  click the OK button     If the analysis is executed in the background  the    Load DomClust file     message is  displayed  Click the OK button     Confirm    Q  Load DomClust file        38    RECOG Client User   s Manual       9 3  Display of the DomClust analysis results    The previous DomClust analysis results are displayed           Size La Update Cluster 10 1237315012_08302  1 679 KB 2009 02 24 225923  Cluster Count  9334  cluster 20020213 deist 3 284 KB 2009 02 20 11 0 amp  57  Server   cluster _20090224_gene_colordcist 4 294 KB 2009 02 24 9 16 53 http    mbsd  genowe  ad  jp       cluster 20080327 deist 2 156 KB 200900327 14 57 32  OPtions        eval   0 0010  luster sample deist 3 034 KBI2008 06 25 21346    store i 60 0      coverage    0 6      cutoff    80 0     Sim_measure   score    besthit   none     clusiwede   orthology    phylocut   0 5     taxonlevel   none            adiovlo   f 8       A             2  On the Open files screen  select the file filter DomClust File   dclst   cluster   and then    select a project and a DomClust analysis result file     Upon selecting the DomClust analysis result file  the relevant information  species set and    parameters  on the analysis result is displayed on the right side of the screen     3  Click the Apply button on the Open files screen to display the selected DomClust analysis
36.  to register    it on the New Server screen     New Server       Server  http   recog server  ac jp          5  When the Apply button on the New Project screen is clicked  the project is    registered and opened      Note  To register a project  ensure that you can connect to the RECOG server     25    RECOG Client User   s Manual       6 2  Open a project    1  Click File     Open Project  and click the relevant project  The clicked project opens        View Search Run Option    New    Change Server  gt     Gd Open file     Open Project 4   Test Project        Project List     fas a    6 3  Reference to a list of registered projects    1  Click File     Open Project     Project List    to display the Project List screen   The registered project names and the RECOG server URL are displayed on the Project List  screen  Also  the Select field of the project in use is displayed and checked     bd Project List    http  Wrecog server  ac jp             26    RECOG Client User   s Manual       6 4  Project registration    l     Click File     Open Project     Project List    to display the Project List screen   On the Project List screen  click the Add    button to display the New Project screen     On the New Project screen  enter the project name and set the URL of the RECOG server    that will conduct the analysis     To register the URL for a new server  click the New Server    button and register it on the    New Server screen   On the New Project screen  click the Apply butto
37. 20    Probability Cutoff    4 0 v                 Type of Distance Normalized hamming distance             14 2  Operation of the clustering tree    On the clustering tree  the display of distance can be switched on off  and by clicking a branch    point  clusters belonging to points not higher than the branch point can be selected   To switch on the display of distance  do the following   1  Click the right mouse button on the Clustering tab and click and check Show Distance to  display the distance on the clustering tree     To select a cluster not higher than a given branch point     1  On the Clustering tab  click a point near the desired branch point of the clustering tree to  select the clusters not higher than the clicked branch point     AE Selected   Clustering        bay bat be be ber bey b  k Bt  bel bho bli BIS b       RECOG Client User   s Manual       15  Taxonomy Filtering    15     Specify the filtering conditions on the Taxonomy Tree on the Selected tab to display on the    PPM only the clusters with phylogenetic patterns that meet the conditions  In setting the    filtering conditions  specify a set of species and conditions therefor at the same time  One of    three sets of species listed in the table below can be specified  Of the three sets     All    and       None    are special sets with fixed conditions  whereas for    Any     the conditions can be set    freely  If simply specifying a condition of existence nonexistence in a genome  use       All  No
38. 21 2 Deoxyrbonucieotida metabolism   2 2 Nucleotide and nucleoside intercorrrersions  2 3 Purine ribormciectide biosyrithesis   2 4 Pyrimidine ribonucleotide biosyrthesis          3  To change the functional category color on the lower list  click the Color column  specify    the color on the displayed Color palette screen  and click the OK button        49    RECOG Client User   s Manual    4  Click the Apply button     11 8  Aggregated display of the PPM    In aggregate PPM mode  clusters of the same phylogenetic pattern are aggregated into a single    line     1  Click ap  Aggregate Mode  in the Toolbox to display the PPM in aggregate mode     2  The PPM can also be displayed in aggregate mode by clicking View     Aggregate Mode   or by clicking Aggregate Mode after right clicking the PPM     tev bru gta       To exit from the aggregate mode  do the following     1  liek D  Disaggregate Mode  in the Toolbox to exit from the aggregate mode     3  One can also exit from the aggregate mode by clicking View     Disaggregate Mode  or by  clicking Disaggregate Mode after right clicking the PPM     BERR    E BE  PRET EE       RECOG Client User   s Manual       11 9  Limited display of a selected region on the PPM    The limited display of a selected region is possible by dragging the mouse on the PPM to  specify the region        To exit from the limited display mode  click the right mouse button and click Limitation    Release on the menu     accB      Select Oreanism       gee 
39. 29 4  Removing a property    To display the Gene Property   Cluster Property screen  click Option   Gene Property    Cluster Property List        On the Gene Property   Cluster Property screen  select the property to be removed  and  click the Delete button  When the Confirm screen is displayed  click the OK button     To remove the property  click the Apply button on the Gene Property   Cluster Property    screen        120    RECOG Client User   s Manual       30  Registration and Management of    Gene Cluster Sets    Multiple genes clusters can be registered as a set  The registered gene cluster sets can be    utilized for sorting  color setting and filter setting     30 1  Registration of a gene cluster set    A gene cluster set can be registered by the following three methods      Registration from a file     Registration from a cluster selected on the PPM    Registration from the keyword search results    30 1 1  Registration from a file  1  Create a gene cluster set file     The format for the gene cluster set file should be one of the following three     a  dclust format  The dclust format should comprise the following    lt Species code gt   lt LocusTag gt         t  lt Species code gt   lt LocusTag gt         lt Species code gt   lt LocusTag gt         Example     ban BA0001  ban BA0002    bca BCE_0009 bce BC0013 0inOB0010       b  clusttab format  The file format should be one of the following    lt  The clusttab format output by clicking File   Export   lt  The fi
40. 8 082131 ack  ERRELE CE 3635 BCSB IS GK 194  GTNG_1I206 IB16S ia    BAS2536 BCE 2504 BC2779 GRO  10 GTNG_061  acoA       The display nondisplay of the color presentation can be switched on off by ticking unticking    Show color genes by properties on the pop up menu on the PPM     occ anoa   pean    Sort b  Select Organism    wv  FH Neighboring Clusters     w    Bm      Color genes by properties       ee    Histogram       Ageregate Mode                       Show color genes by properties    12 2  Enable Disable property color setting    1  To enable or disable the property color setting  double click Color     Gene property on    the control panel        99    RECOG Client User   s Manual       13  PPM Sort    The PPM can be sorted and displayed according to various conditions based on the    phylogenetic pattern     13 1  PPM sort in disaggregate mode    In the disaggregate mode  the rows are sorted for each cluster or sub cluster                       ban bat bea tee sho gta oih  dm A uAN     BAsoo01 BCE 000 ediad GTNG_O00  C000  daak  dral E Ceo ere nare  Select Organism Gene order or  ban  358 Neighboring Clusters  Phylogenetic pattern lexical order   ee Phylogenetic pattern similarity based on the chaster 1071  BES Cole cones by properties  Phylogenetic pattern clustering  PhyioPatChist    rd Gene properties     Helam   gt   r      fgereesic Mode    1  Ifthe DomClust result contains an outgroup  click the Cluster Mode Sub Cluster Mode         button in the Toolbox to s
41. Add  gt            __ Genome Map  gt      Control Gene List   Cancel       15 7  Enable Disable conditions     an  Selected   Clustering         root 6     I Bacteria 6     H Bacillales 6      9 Bacillaceae 6   IN Bacillus cereus 4H187  IN Bacillus cereus AH820    IN Bacillus cereus ATCC 10987  IN Bacillus cereus ATCC 14579  IN Bacillus cereus B4264   OUT        Sp Group       Fitter     O Taxonomy Filter   a S anc      Bacillus cereus 4H187     Bacillus cereus AH820     9 None 1      Oceanobacillus iheyensis HTE831   E  amp  Any Bacillaceae   20 0    2      Bacillus cereus ATCC 10987     Bacillus cereus B4264    1  In Filter     Taxonomy Filter on the control panel  select the conditions  click the right    mouse button and click Enable Disable to enable disable the selected conditions     15 8  Removal of conditions    1  Select the conditions in Filter     Taxonomy Filter on the control panel  click the right    mouse button and click Delete to remove the selected conditions     68    RECOG Client User   s Manual       RECOG Client User   s Manual       16  Filtering by Gene Count Species Count in the  Phylogenetic Patterns    Set a threshold value for the gene count species count in the phylogenetic patterns and filter the    data based thereon  The results are displayed on the PPM     16 1  Setting the conditions    1  Select Filter     Gene Organism Count Filter on the control panel  click the right mouse  button and click Gene Organism Count Filter to display the Gene 
42. COG  Research Environment for Comparative Genomics  is a workbench software  program which is used to conduct comparative genome analyses on a massive scale  The  main feature of RECOG is the function of ortholog analysis between genomes of numerous  species based on the display of an ortholog table  rows  ortholog groups  columns  species    RECOG can also conduct various comparative analyses for detailed examination  based on  this table    RECOG was developed based on the MBGD  Microbial Genome Database for Comparative  Analysis   RECOG is a dedicated client software program that is available immediately after  connecting to the MBGD server  Meanwhile  the analysis of newly determined microbial  genomes  eukaryotic genomes  efc  that are not included in the MBGD can also be  conducted in a local environment  by installing the RECOG server locally    The latest version of RECOG is available from http   mbgd genome ad jp RECOG      This manual explains how to use the RECOG Client software program         Organisms    MBGD  Web    a  MBG            RECOG Client User   s Manual       2  Operating Environment of RECOG    2 1  Operating system      Mac OS 10 3 and upward    Windows XP  Vista    Linux    2 2  Compatible Java version      Java 1 4 and upward  Note  Make sure that Java JRE 1 4 and upward is installed before installing the RECOG  Client software program  If it is not installed  install Java JRE 1 4 and upward before    installing the RECOG Client software program   
43. Gene properties file  and select the destination for registering the    gene property             a He Dv       CI Open file    Horizortal Zoom        Open Project      ba  H Save Cluster file    Select      OomClust file       Gene propety file  gt  Current Project    mat                       15  Export  Global  Pace setup  ee     indata Nata       Select the destination from among the following        Current Project  project name    Register the gene property so that it becomes available only in the current project or in    the specified projects        Global    Register the gene property so that it becomes available in all projects     3  The Import gene property file screen opens  To register the gene property  specify the gene  property file created  and click the Open button     29 2  Referencing a list of gene cluster properties    On the Gene property   Cluster property screen  the registered properties can be checked     1  To display the Gene property Cluster property screen  click Option   Gene  Property Cluster Property List        2  To display a gene property  click the Gene tab  and to display a cluster property  click the    Cluster tab  The details of each screen that can be displayed are as follows       Name  property name       118    RECOG Client User   s Manual         Type  type       Multi  multiple attribute value flag  If this is checked  the gene has multiple attribute    values     Location  location for storage    Gene properties only     lt  G
44. Neighboring Clusters      a  age Color genes by properties     nocd CE    SE Histogram   Pgcrecate Mode  acet A1132 B Limitation Release          5     RECOG Client User   s Manual       11 10  Highlighting a species by selecting it on the    Taxonomy Tree    Select and click a species on the Taxonomy Tree above the Selected tab to highlight the  relevant species on the PPM     Also  to exit from the species selection mode on the Taxonomy Tree  do the following     Windows Linux  Ctrl   left click on the selected species      Mac  Apple key   left click on the selected species    Al Selected   mie    O fe O Sp Group    Ska    IN Bacius cereus ATCC 10567  IN Bacus cereus ATOC 14579  IN Bacus cereus ESIL  IN Bac  us cereus NVH 351 99  S 99 foa turngensis 2   IN Bac  us thuringiensis serovar ko  ik Bac  us thuringiensis Al Hakam  3 OY Btu chuiii   IN Bocdus clausi KSM KIG  B S Estus haiodurans 1   1    Botma  habdrans C 125  S OY Bacius kchenitormi  IW  IU Gocis bcherilomis OSM 13  ATOC  S OY Boctus p ft       al ae       wee oS    L 5       11 11  Selection of a species  phylogenetic pattern  in a    cluster    The species in a cluster specified on the PPM is selected and displayed on the Taxonomy Tree  on the All tab Selected tab  This function is useful when searching a phylogenetic pattern    similar to that of a specified ortholog group  etc   1  Click to select a cluster on the PPM     2  Click the right mouse button on the PPM and click Select Organism on the pop up menu 
45. Organism Count Filter    screen     Oreanism Gene Count Filter           Sp Group       Fitter    Taxonomy Filter     or more    Gene Organism Count Fitter cs  eels Gene count Cluster count Filter  Gene  4   orless   0    a  lt  Enable  a  lt  Disable                 v    or more   2       2  Specify the conditions on the Gene Organism Count Filter screen  and click the Apply       button to display the conditions in Filter     Gene Organism Count Filter on the control    panel     16 2  Enable Disable conditions    1  Select Filter     Gene Organism Count Filter on the control panel  click the right mouse    button and click Enable Disable     69    RECOG Client User   s Manual       17  Keyword Search    Regarding the cluster results  both clusters and genes can be searched  The gene search is    conducted in concert with the RECOG server     Search of clusters Search of genes  Search   Representative gene name   Gene properties provided by the  target   Representative description of each RECOG server  description  gene name   cluster etc      Gene cluster properties registered  through the use of the import function     etc     17 1  Search of clusters       1  Click  7  Keyword Search  in the Toolbox to display the Keyword Search screen     2  On the Keyword Search screen  click the Cluster tab     3  Enter the keyword s  in the Keyword s  column and click the Search button to begin the    search     Ba Keyword Search          4  Upon the completion of the search process 
46. RECOG Client User   s Manual       Client User s Manual    Ver 1 0 7    Octover 15  2009    iF    RECOG Client User   s Manual       Table of Contents    aTa Se gcse cece ET cco cece E AE E E EE 9  1 1  What is RECOG asinine aaiae asaan iaai eiaa aaae urate aneeciec Easa aaE aaa aoaaa Eaki 9  Operating Environment of RECOG css octcsesivcsctncascaietorsedcbeanscpecavdndesisdnlacgecclosesscesicdadadscensaisiooniaseneaetedeeniecs 10  BM csc kc S a cece NAANA 10  2 2  Compatible Java VersiON eee mene een mn e OO Ue ae cere re eee ne ere meee ee neerT 10  inetalkanorny Unin alato tcc cceacn ceca a aa aaiae aai 11  J1  etalatonof REGOG for WIndows sisareen n er oe te ee eet 11  32 IMAM Nid or RECO G Tor MaG cance cece cece acco scarcesevesearequcecaaocoscceconesceesuceayaesosovessucanseteteeeesaysueeeiesnees 11  3 3  Installation of RECOG for LIMUX             cccssssscseesssscssescesssssssssssenseecsnscessesesessssseeseaesessenssesseseesensseraearseseesseses 11  Of  Unmstallation of REG OG fof WINdOWSssinssmsrenniamncesnasaniianns enced 12  3 5  Uninstallation of RECOG for Ma0 eee eceaciczsacessan te searecpceceaianesicastavavirastadonntoneemeviasieseattesuane anana ees 12  3 6  Uninstallation of RECOG for Linux sescess cccncececetdectecocescnpescasoeseitcetd en deodhitviedesnatetieeilinemceiecsaeeeee eatin 12  Startme and Terminating RECOG oinaan a Aa a a AER REE 13  Ae a E   C SAA E A EAA E AP EAT AE AATE E N E A 13  A Terminate RECOU zeuria aE a E TAANE ANADA NARREN en NOTE nS 
47. able on  the same genome belong to a neighborhood gene cluster  Note that the same color may be  used for different clusters  cells in different genomes or cells far away from each other are    not related even if they have the same color in the table     Display Nondisplay of the clustering results    1  To switch between display nondisplay  double click Color   Neighboring cluster on the    control panel            Sp Group    Fitter  Organism Column  Sort  Color     Neighboring cluster Cluster 5 Range 5000      Gene Set       Cluster Set     Gene count    2      Base Color  Phylogenetic Pattern       93    RECOG Client User   s Manual       24 2  Changing the color of a group of neighborhood    genes       1  To display the Option screen  click ae  Option  in the Toolbox  On the Option screen     click Neighboring Cluster     Ei Option       Taxonomy Tree PPM   Function Category Color   Neighboring Chester    suttiple Alignment   Tree   General  Coler pattern for clustered cells             2  Set the color pattern        Color pattern for clustered cells    Set the color pattern for coloring genes clustered by neighborhood gene clustering   The color is assignmed to the neighborhood clusters in accordance with the order    specified on the list  The assignment is repeated from the top of the list after reaching  the end of the list     Add button   Add a color pattern    Delete button   Delete the color pattern selected from the list   Up button Down button    Shift the s
48. am The Histogram tab is used to create graphical representations    based on the gene property numerical data     Taxonomy Tree Pane Whether or not to display the taxonomy tree pane is specified   Function Category Pane Whether or not to display the function category pane is  specified     5 3 3  Tool menu    Tool   Run Option      Keyword Search       Organism Gene Count Filter       Keyword Search    A keyword search is conducted regarding the gene cluster properties   Organism Gene Count Filter The filtering conditions are specified based on the gene count species in  a cluster     5 3 4  Run menu      Run   Option       9 Ortholog Clustering  DomClust       es Phylogenetic pattern clustering  PhyloPatGlust      9   Core Genome Alignment CoreAligner        w  H  Neighboring Clusters     Ww    Ortholog Clustering  DomClust     DomClust analysis is conducted   Phylogenetic pattern clustering  PhyloPatClust     Phylogenetic pattern clustering is carried out   Core Genome Alignment  CoreAligner     CoreAligner analysis is conducted   Neighboring Clusters    Neighboring gene clustering is carried out     17    RECOG Client User   s Manual       5 3 5  Option menu    Option     URL       Gene Property        Ve Option         Gene Propety Cluster Propety List       Gene Set Cluster Set List       Properties     The option screen is displayed     The URL for displaying genetic information from an external resource is set     The list of registered gene properties cluster properti
49. an external resource  Name   the URL  and    the gene property  Key  to be used as the key for the URL   To embed a gene property value in the URL  enter      lt number gt     in the URL  click the    Add button  and specify the gene property in the Key column      Example  MBGD gene data URL  http   mbgd genome ad jp htbin RECOG RECOG _gene_info_frame pl     spec  1  amp name  2       142    RECOG Client User   s Manual       2 25     1  Gene property Organism   2  Gene property Locus Tag    To display the registered external resource URL on the URL screen  click the Apply  button on the Edit URL screen     Click the Apply button on the URL screen     Editing an external resource URL    Click Option   URL    to display the URL screen     To display the Edit URL screen  specify the external resource to be edited on the URL  screen and click the Edit button     On the Edit URL screen  edit the external resource information   On the Edit URL screen  click the Apply button     On the URL screen  click the Apply button       Removing an external resource URL    To display the URL screen  click Option   URL        On the URL screen  specify the external resource to be removed  and click the Delete    button  When the Confirm screen is displayed  click the OK button     On the URL screen  click the Apply button     143    RECOG Client User   s Manual       36  Appendix    36 1  DomClust parameters    Parameter    Cutoff BLAST E value    Cutoff DP score    Cutoff PAM distance    Cut
50. ance on the pop up menu upon right clicking the    button     3  To zoom in zoom out on the selected region  click the    Zoom in  A  Zoom    out  button in the lower part of the CGM screen  respectively  By right clicking on the    button  the zoom ratio can be changed on the pop up menu     20 3  Linkage between the PPM and CGM    1  Click a gene on the genome map in the lower part of the CGM screen to select on the PPM  the cluster to which the clicked gene belongs     2  Click a cell on the PPM to highlight the location of the clicked gene on the circular  genome on the CGM  Also  the displayed region of the lower genome map changes and    the clicked gene is displayed     3  Click the header in the upper part of the PPM to change the genome displayed on the  CGM  You can compre the locations on the chromosome of the genes in a given cluster  between genomes by selecting a cluster and switching the displayed genome one after    another     20 4  Changing the color of genes       LCI  Option  in the Toolbox on the CGM screen to display the Genome Map    Options screen        78    RECOG Client User   s Manual       2     Specify the gene color           Function Category     The gene is displayed in the color corresponding to the representative function category    of the cluster to which the gene belongs           Positive Negative       The gene is displayed in the color corresponding to the direction of the gene           Use the color of the control tree       The gene
51. ation of gene properties    1  Create a gene property file     The format of a gene property file is as shown below       The first row describes the header    lt  In the first column   sp   a three letter code for a species  is given  and in the  second column   locustag  is given    lt  In the third column  the arbitrary gene property name and type are given    Example  If the gene property name is Expression and the type is numerical  the  description should be Expression  Num      The following four types of gene properties can be specified     Type Code Example   String type Char BC2639   Numerical type Num  10 3   Enumeration Enum  elementl  element2      Yes  No   type   element   possible value  that is  only   Enum Yes No   that is  Yes    the element specified here is assignable and No are the specified  as a value  elements     Hierarchical Hierarchy EA  type      In the second row  gene property values are described    lt  In the first column  a three letter code for a species is given  and in the second  column  Locus Tag is given    lt  In the third column  arbitrary gene property values are given   If a gene has two values  the type should be specified as  Multi   and the two    man    values should be delimited by      semicolon         117    RECOG Client User   s Manual        Example     sp  locustag GO Char  Expression Num  Pathway Char Multi     ban BA0001 Cellular component 2000 Glycolysis   Gluconeogenesis Citrate cycle       2  Click File     Import   
52. axonomy Tree    The Taxonomy Tree shown on the Selected tab    z       147    RECOG Client User   s Manual       Function category    Functional classification attendant on and characterizing genes and clusters    G  Gene Property  Property values associated with each gene  which can be given by the user   Genome Comparison Viewer  A genome map in which the gene core structure is colored based on the results of    DomClust analysis and CoreAligner analysis    Ingroup  The set of target species in phylogenetic analysis  In the DomClust analysis  ingroup    species is specified as the set of related species on which the attention of the user is focused    M  MBGD  http   mbed genome ad jp   Microbial Genome Database for Comparative Analysis  which is the standard genome    database in RECOG     N  Neighborhood gene clustering  A method for identifying genes that are located near both in the phylogenetic pattern map  and in the genomic sequence   O    Ortholog group  cluster    A group of homologous genes made by clustering based on the orthologous relationships  between genes  In RECOG  ortholog groups are constructed by the DomClust program and an  ortholog table created based on the ortholog groups is used as a basis for any comparative  anaylsis  In the genomic core structure analysis  a core structure alignment is generated by  reordering ortholog groups based on the conserved gene order along each chromosome    Ortholog table  A table showing orthologous relationships amon
53. c   the information on the domain boundary and other types of information are lost  To reload    the analysis result into RECOG  save it in DomClust format     9 5 1  Storing a file in DomClust format    1  Click H  Save Cluster file  in the Toolbox to display the Save Cluster file screen     2  On the Save Cluster file screen  specify the file name and the destination folder  and click    the OK button     9 5 2  Storing a file in tab delimited format    1     Click aE  Export  in the Toolbox to display the Export screen     2  On the Export screen  specify the file name and the destination folder  and click the OK    button     41    RECOG Client User   s Manual       10  Control Panel and Set Management Panel    On the control panel  the filtering settings  alignment sequence settings and display nondisplay    settings for species  sort settings  color settings and phylogenic patterns can be specified     Also  on the set management panel  the gene sets cluster sets  combined sets and species sets    can be managed         Sp Group    Fitter      Taxonomy Filter     Gene Organism Count Filter    Keyword Search         Gene Set Filter         Cluster Set Filter  Organism Column   IN Bacillus cereus 4H18    IN Bacillus cereus 4H620   IN Bacillus cereus ATCC 10987   IN Bacillus cereus ATCC 14579   IN Bacillus cereus B4264   OUT Oceanobacillus iheyensis HTES31  Sort    Category gene name  Color        Cluster Set    Gene Set      Keyword search Gene Descritpion aminogly      G
54. cation    2  Display of PPM size information  center     Left side  the number of species shown on the PPM    Right side  the number of clusters  or total number of phylogenetic patterns in the    aggregation mode  shown on the PPM    3  Update information  right side   The update notice icon is displayed on the right side of the status bar when any application    or public data is updated  The update information is accessed by clicking this icon     Infor mation    i  RECOG Client Version  0 9 148  Released   You can get from the download site       Inthe future  this message should not be displayed    TAR       24    RECOG Client User   s Manual       6  Project Creation and Editing    A Project is a saved collection of related analysis results  In default mode  the default project  is selected  and all analysis results are stored therein  It is advisable to create a dedicated    project before proceeding with any analysis     6 1  Creation of a new Project    3  Click File     New     New Project      The New Project screen appears     New Project       Name Test Project           Server URL http   imbgd genome ad jp v          4  On the New Project screen  enter a project name and the URL of the RECOG server that  will conduct the analysis  The RECOG server can be selected from the menu from among  all the registered servers  If an official server is used  the default settings do not have to be  changed  To register the URL for a new server  click the New Server    button
55. cesaevesisicornscnmnacicanaicttovasaleiunnaicaasutieig eee E ea 81  21 3  Display  Nondisplay of the Locus  TaB ac aricsncatvevnoiesssececnseianacnesceavunnintniestausenveciaunavenisuetbieuaagntensionvataciains 81  21 4  Setting the gene COlOK cece cece cesceeceessseesecseseuseeseseuseeseveesessevsesesseusevesseuseuseseeseuseuseseusessevesseuseesesenses 81  21 5  Displaying gene information in a Web DrOoWSe lL    cece cece csseeeseeesesceseesessessesesseusesseseeseuseseeeeusasseees 82  22  Multiple Alignment and Phylogenetic Tree wu    cccccccccccccescscescscsscesseeeeseeseesesesseesessesesseuseveeseuseusesesseusevsesenses 83  22 1  Execution of multiple alignment ce i eee Sema catcecurente ce ace dewininechnciecestt tee euem de eameeatbneca cna ubeinevedeciens 83  22 2  Changing the colors of the amino acid letter Strings    cesses ces ceceseesesceseesessessevesseuseeseeseses 85  22 3  Displaying the phylogenetic ab 8g     lt  enn ee er ae ont oP noe en Pe ee 85  23  Function Category Frequency Graph  Numerical Data Graph ou    eeeeseseseeeeseseseseeteteeseseseeeeeeen 88  23 1  Function category frequency graph wu   eecc ccc ccsccee ces cesceeeesesseseeseesesseveeseuseuseseeseuseuseseusesseeesseuseesesenses 88  23 2  Displaying a numerical data graph  a description or the function category                     88  23 3  Switching between the display nondisplay Of the wo    ceecsssesesesessesesesesesssseveseseseseeteteveseeeaneneeen 90  PROB Mig   oe  at  19 eo   9 nner ene ee
56. ck the Apply button     32 3  Removing a species set    Select a species set in Organism Set on the set control panel  click the right mouse button  and    click Delete organism set  When the warning message is displayed  click the OK button     32 4  Setting colors using a species set    1  In Organism Set on the set control panel  select a species set  click the right mouse button     and click the sub menu of Color organism     For the color setting method for species  refer to    11 4 Species color setting        32 5  Taxonomy filtering using a species set    1  In Organism Set on the set control panel  select a species set  click the right mouse button     click Taxonomy  and click the condition menu for the phylogenetic pattern filtering     For the operation procedure of phylogenetic pattern filtering  refer to    15  Taxonomy    Filtering        133    RECOG Client User   s Manual       33  Similar Phylogenetic Pattern Search    Register a reference phylogenetic profile to evaluate the similarity between the profile and the    phylogenetic pattern of each cluster  The results can be utilized for color setting  filtering     sorting  etc     33 1  Profile registration from a cluster    1  Select a cluster on the PPM  click the right mouse button  and click Create phylogenetic    pattern  The Register phylogenetic Pattern screen is displayed  and phylogenetic patterns    according to the presence or absence of genes in the selected cluster are displayed thereon     Reg
57. dacialnuierakewnen 104  26 16  Saving the CoreAligner results oo  ccccc cece scescesescescesesesceseseesesseuseseesesseveesauseveesauseveesensesseneeeess 104  Genome Comparison Viewer etc ites ts cng ccs eee reenter  105  27 1  Displaying the Genome Comparison Viewer uu      cccceccesccseesescesceecesescescessecescessesesseseseeseesessevensessensneens 106  21 2  Ghanginge the display dees       gt  Reena en eee nn ee eno ae ee ne ee een ne ee een ee ere 106  Fg dle OMA sce E A dsc cc bata dats ceeccth cucpcestbancn eachiessdecaaieeaseeteinntaadaceeatiesasstananeds 106  21 4  Moving a specified ortholog group to the center of the SCreennuucece eee cece teers 107  21 5  Displaying gene information in a VOW SOU ccc ccc sede ceedch se acacsbetacedlacasincccpsetacacctetacedlarasbany 107  Lro Oane bale ONE eee Reece eee ene are er tea aCe Des one ee nt N epee rev em re are AT CeeeeertT ra 107  21 1  Recovering the OREN cca ai iaaa eA A sesh eis gcc EEA AKA AA AAN 107  27 8  Display Nondisplay of SPEC le Sia aniisawoaltietenriensedndenanemniauanaiienaeninmndnnetoe  108  21 9  Changing the display order of species aii craia tice tee siice stented een deceeecastenalabes EnciaeiMtcaslqcasiees lace tna cieattoe 109  27 10  Display Nondisplay of genes or ortholog lIN   S wo  ceceesesesessesecesecescessveseeesesveveveveseseetevveneneeeetanens 109  21 11  Changing the display style of species NAMES ou    eee ese cceeeee cee ceeceeessesseeeesesseeeesesseuseveeenseneneens 110  27 12  Display
58. date function of the RECOG Client  the gene information     chromosome information  Taxonomy Tree information and Function Category information can    be updated     28 1  Updating the Taxonomy Tree based on the update    notice    If any gene information provided by the RECOG server is updated  the    icon is displayed    on the lower right of the screen  To display the update notice  click the icon  To update the    Taxonomy Tree data on the All tab  click the Update taxonomy only button in the notice    message     Ed Infor mation    New gene data  2008  12 24  is released        Inthe future  this message should not be displayed    Update taxonomy only         Tf    In the future  this message should not be displayed    is checked and the Close button is    clicked  no update notices will be displayed thereafter        To display update notices again  click aa  Option  in the Toolbox  check    Update    information is regularly checked    on the General tab on the Option screen  and click the    Apply button                Taxonomy Tree PPM Function Category Color Neighbor Clustering Multiple Alignment _ Tree    General      Update   V  Updated information is regularly checked     115    RECOG Client User   s Manual       28 2  Updating gene information through Update Data    The RECOG Client can download data from the RECOG server  This function is used to update  the RECOG Client data when the RECOG server data have been updated     1  To display the Update data screen 
59. ded 66  19  Enab Disable COnditioNS aesrespaiianio piin atts eee erence ee rete 67  130  Ramoval ol condition sosie a A 67  16  Filtering by Gene Count Species Count in the Phylogenetic Patterns     cccccsssssccecesesseseeeseeseeens 69  10i  cette the conditio    sae ener a eee one ne en Pf eens tv err eee ve er PErErEN eT eTe nT 69  16 2  Enable Disable Conditions         ccccccsssscssessssessesesessescssssssssssssssssessescssssessssessescssssesnsanessenesesceassneseeseseeseseesensens 69  iie Fk A haad SEA 70  e e n a S E a A E O 70  EEE crs  el Ws E E OA AA AE O OAE N EAA AAA A A A AE E E EA eee 72  17a  Redisplayine the Search results ee ements Pt Pee Rn 14  17 4  Enable Disable filter settings by the search results    eceecsccceseseseccsccesessssscevsveveresssseseveneseeees 14  17 5  Enable Disable color settings by the search results    eecccccsesesesescecesecessssseecsveveresesseeevareneeees 14  18  Changing the Display Order of Species or Display Nondisplay Status of Species       c eee 75       RECOG Client User   s Manual       18 1  Changing the order of display of SCC Saigo cee clecatcee testes cen ticuelecsti ce cteastl aa sieetitaclacaiimaloeatileata  715  18 2  Setting the display nondisplay Of SPECIES       eeccesecesesesessssssescsessecsccesescecevavavecusevavaveseusseevavavveneseees 75  18 3  Adding species to be displayed jie cst eas icsccccnacusicscacestieatnnarnienakisina acesaasaindtunmcdeintebeessbusesetsieatianestnntateet  75  18 4  Removing displayed S
60. display the selected species on the PPM  select the species on the Taxonomy Tree on    the Selected tab  click the right mouse button and click Show organism columns     18 4  Removing displayed species    1  Select species in the Organism Column on the control panel  click the right mouse button  and click Delete organism columns  When the warning message is dislayed  click the OK    button        79    RECOG Client User   s Manual       19  List of Genes    A list of genes of the selected species is displayed        be     Taxonomy b           Pp Genome Map b       Gene order on bat  Sort by gene properties       tl Gene Ligt  ASS Oromis Looto Genshime Accession teomaro    Stet                        sca  Cando co   Pert  18352 esa ash kos booa poa   1300 Ceda co    Oe  ss   a mal fos ana prae   aie    13 420 20010 13        lt OS  j       0260005   1 100 Cotter co    4  2033 21 612 626  000 Cebser co    Pert                        19 1  Displaying the list of genes    1  To display the Gene List screen  select the species on the Taxonomy Tree on the Selected  tab  click the right mouse button and click Gene List    on the pop up menu  You can also  click the right mouse button on an abbreviated species name in the header row of the PPM    table to open the Gene List screen     19 2  Sorting the list of genes    1  To sort the gene properties in the ascending desceding order and display the gene property    values  click the gene property name on the Gene List screen     19
61. displayed       il      s0  sto_ST2096    ecj_JW2884    hal YNGOB 3G  nph_NP2  104  sai_SACI_0062  sso_SS02266  sto_ST2096  ecj_JW2884    hal VNGOB 3G  nph_NP2  104  sai_SACI_0062  sso_SS02266  sto_ST2096  ecj_JW2004    hal   NGOB 3G  nph_NP2  104  sai_SACI_0062  sso_SS02266  sto_ST2096  ecj_JW2884    QIASL RIRARODAAY  AALNAL   HAAGQUN NLLAAAVNAARYRA LG ISDALEYAFD    WS EDSTRNYRCL YAK   GLDGHDRG   MWGMS TEGEQRT IRCLYAK   YGLDGHDRG   MKRIK  LYAKLGLDGHDRG   MMITTRRIKYIVAKLGLDGHDRG   MSORRIKYLYARLGLDGHDRG   RY LYPSOCYTGYIAQS HOS RSAS FDATYAQT  gt  QFLADNGRRPRIL TAK MGQDGHDRG  ODDS RR    AHYI TRAFRDAGFEYIYSGLHKAPDE IYQAAVOQ DYDYLGISTILSGAHNTLYPKITCGLA  AHY IARAFRDAGFEYTYSGLHNSPDE IVOAAVOEDYDYLGISILSGAHOTLYPK ILDGLK  ARVVARALKDAGH  YYY TGLROTPEQIVKAAIQEDVDVIGYSILSGAHIELISMLYKEMK  ARYYARALKDAGH  VYY TGLROTPEQIVRAALQEDADVIGISILSGAHLELIPKYYE IMK  AKYIARALKDAGMEYYYTGLROTPEQIVKSAIQEDYDVIGISILSGAHLELYPYVYNLMR  AKVIASAYSDLGFOYDLSPMFSTPEE IARLAYENDVHVYGASSLAAGHK TLIPELYEALK  Oe  OE BOEID I D DRIDE I Bee  RDE E EI   E RIL II  l    EYDAAEDTLYIYGGI IPDEDRADLEAAGYDAIFGPGTPNADTISFIESNAPHRD  EYDAFEDTLYITGGI IPDEDHDELYDAGYDE IFGPGASHEETITFYRENAFER         D KKLNOYGLYYGGYIPPODIPALKOLGYDEYFLPGSSLKEITEKIIKYASDKRGLSFEP  Q NGLNOYGLIYGGYIPPEDIKKLKEMGYDEYVFLPGSSLKEIVEKYKEYAREKRGISYE   E KGLNOYYLYYGGYIPPADIPKLKEMGYDEYFLPGSSLKEYVEKITKAVETKRGIKIA  K WGREDICYVAGGYIPPODYAFLOERGYAAI YGPGTPMLOSYRDYLNLISQHHO   DE O ILOEEDEE DE E    WE II ELI I              84    RECOG Client User   s Manual       22 2  Changin
62. dow consists of the following components   5 2 Window header  5 3 Menu bar  5 4 Toolbox  5 5 Zooming scale bars  5 6 Taxonomy Tree  5 7 Phylogenetic Pattern Map  PPM   5 8 Info tab  5 9 Histogram tab  5 10 Status bar       14    RECOG Client User   s Manual       5 2  Window header    The RECOG Client version  name of the current project and name of the DomClust result file    are displayed     ba RECOG version 1 0 0   default   cluster sample       File View Search Run Option    PS EI ona i e AWN eee    ee r r    5 3  Menu bar            5 3 1  File menu   View Tool Run Opti  New    Change Server b  al Open file     Open Project r  ja  Save Cluster file        Import r    2  Export     Genome Mapping  gt   Page setup     pa Update Data     F Exit   New New Analysis    The currently displayed analysis is cleared   New Project  A new project is created   Change Server    RECOG server  The RECOG server to be used for analysis is specified   Server List       The list of registered RECOG servers is displayed     Open file Analysis result files  including DomClust result files  are opened   Open Project Project    An existing project is opened        15    RECOG Client User   s Manual       Save Cluster file       Import    Export    Genome Mapping    Page setup     Update Data  Exit    Soe    Aggregate Mode   Disaggregate Mode    Cluster Mode Sub cluster Mode    Expand Taxonomy Tree  Collapse Taxonomy Tree    Select all clusters on PPM    Project List     The list of existing pr
63. e Disable     The same operation can be carried out by checking unchecking    Display only matched    clusters    on the Keyword Search Result screen     17 5  Enable Disable color settings by the search results    Select Color     Keyword Search on the control panel  click the right mouse button  and    click Enable Disable     The same operation can be carried out by checking unchecking    Highlight matched    clusters    on the Keyword Search Result screen     74    RECOG Client User   s Manual       18 Changing the Display Order of Species or  Display Nondisplay Status of Species    In the Organism Column on the control panel  the order of species displayed on the PPM can  be changed or the display nondisplay of species can be set         Control     Sp Group    Fitter   Organism Column  IN Bacillus cereus AH187  IN Bacillus cereus 4H620  IN Bacillus cereus ATCC 10987  IN Bacillus cereus ATCC 14579  IN Bacillus cereus B4264    18 1  Changing the order of display of species    1  To change the order of display of species  drag the species in the Organism Column on  the control panel to the destination     On the PPM  only the species displayed on the Organism Column are displayed     18 2  Setting the display nondisplay of species    1  Double click the species in the Organism Column on the control panel     You can also click the right mouse button on the species and choose Show Hide to set the    display nondisplay modes     18 3  Adding species to be displayed    1  To 
64. e column on the right of the species to be set as the    reference genome     Click the Apply button     100       RECOG Client User   s Manual    d Gore Genome Alignment       B amyloiquefaciens FZB42  B arthracis Ames Ancestor  A2     B anthracis Ames   B anthracis Sterne   B cereus ATCC 10987   B cereus ATCC 14579         Up I  Down            Abbrevistion       Cluster ID     Presentative gene nome of the cluster      Gene name of the reference genome      Display Nondisplay of species    To display the screen for changing the Core Genome Alignment view  click View   View  Change    on the Core Genome Alignment screen     To display species  select the species in the Candidate column and click the  gt  gt  button   To hide species  select the species in the Target column and click the  lt  lt  button     Click the Apply button       Changing the display order of species    To display the screen for changing the Core Genome Alignment view  click View   View  Change    on the Core Genome Alignment screen     To change the display order of species  select the species in the Target column on the    screen for changing the Core Genome Alignment view  and click the Up or Down button     Click the Apply button        101    RECOG Client User   s Manual       26 10  Resetting the window size    1  To display the screen for changing the Core Genome Alignment view  click View   View  Change    on the Core Genome Alignment screen     2  To reset the window size of the neighborhood r
65. e for use    in analysis        Ifthe Save column is checked  the entered name       Ifthe Save column is unchecked     Phylogenetic Pattern Coefficient        33 5  Uses of the results of the phylogenetic pattern    similarity search    The results of the phylogenetic pattern search can be used in the following functions       Display on the cluster header      PPM sorting  see    13 PPM Sort           138    RECOG Client User   s Manual          Filtering by keyword search  see    17 Keyword Search           Color setting by properties  see    12 Color Display by Properties        33 6  Removing the phylogenetic pattern similarity    search    1  The coefficient of correlation saved to a file upon checking the Save column can be    removed on the Cluster property tab on the Gene property Cluster property screen  For    the method of removal  refer to    29 2 Referencing a list of    gene cluster properties        139    RECOG Client User   s Manual       34  Downloading the sequence information    Download the protein DNA sequences of the genes contained in the cluster selected on the    PPM     34 1  Downloading the sequence information  1  Select a cluster on the PPM     2  Click the right mouse button and choose Download sequence     The Download sequence       screen is displayed    Download sequence          Additional bases   upstrearn       dovenstrearm            Split in delimiter      Target genes     All genes onthe cluster table     Genes include the select clus
66. e sets only     Project  registration destination project     Cluster sets only      DomClust File  registration destination DomClust result file name   Cluster sets only        Cluster Count  registration destination DomClust result file name   Cluster sets only     Gl Gene set Cluster set             3  To register  edit or remove a gene cluster set  click the Add     Edit    or Delete    button     respectively     126    RECOG Client User   s Manual       31  Combined Set    A combined set consisting of multiple gene cluster sets can be created  The combined set can    be utilized for coloring  filtering  etc  based on the results of set operations using multiple sets     31 1  Registering a combined set    1  Registering a combined set of gene sets  Select gene sets in Gene Set on the set control  panel to display the Gene set combination screen  click the right mouse button  and click    New gene set combination        Registering a combined set of cluster sets  select cluster sets in Cluster Set on the set  control panel to display the Cluster set combination screen  click the right mouse button     and click New cluster set combination        ta Gene set conbmation    Condition   Test gene set 1  amp  Test gene set 2  amp  Test gene       Test gene set 1  amp  Test gene set 2   Test gene set 2  amp  Test gene set 3   Test gene set 1  amp  Test gene set 3             Down             2  On the Gene Cluster set combination screen  specify the gene cluster set names and 
67. ecked  registration is made upon    overwriting the gene cluster set of the same name if such a set exists     Registered gene cluster sets are displayed in Gene Set Cluster Set on the set control    panel     123    RECOG Client User   s Manual       30 2  Outputting a gene cluster set to a file    1  To display the Export gene cluster set screen  select gene cluster set in Gene Set Cluster  Set on the set control panel  click the right mouse button  and click Export gene cluster    set     2  On the Export gene cluster set screen  select the file name to be saved  and click the OK  button     30 3  Editing a gene cluster set  removing genes     1  To display the Edit gene cluster set screen  select a gene cluster set in Gene Set Cluster    Set on the set control panel  click the right mouse button  and click Edit gene cluster set     2  On the Edit gene cluster set screen  change the name destination for registration  only for    gene sets   or remove a gene cluster     To output the list of genes clusters registered as a gene cluster set to a file  click the    Export button   3  On the Edit gene cluster set screen  click the Apply button     td Edit gene set       S CB4264_A4710 lacetyLCoA carboxylase  cosets knal     bee  i   lt tsst  iw   of  carboxylase Serouteeneters    ber BCAH18  44726 of  carboxylase Itranster     bou       CAHE20 4715   Cod  carboxylase  carboocyl transfer       oh ss 0B2173   acety CoA carboxylase carhoxyitransfera         124    RECOG Client Use
68. ed    See    31  Combined sets       Species sets are displayed     See    32  Species sets        45    RECOG Client User   s Manual       11  Display and Operation of the Phylogenetic  Pattern Map    On the Phylogenetic Pattern Map  PPM   the appearance patterns of species belonging to a    cluster are displayed     11 1  Display of the locus tags of genes belonging to a    cell    The locus tags of genes belonging to a cell are displayed on the cell  If the cell region is    narrow  the gene count is displayed  The display can be switched on off as follows        1  Click     Option  in the Toolbox to display the Option screen     Click the PPM tab on the Option screen   2  Check  Display gene names or the number of genes    on the PPM tab     3  Click the Apply button     11 2  Display of the cluster property on the cluster    header    The value of the property corresponding to the cluster is displayed in the display area  cluster    header  on both sides of the PPM  To change the displayed property  do the following     1  On the cluster header  click the right mouse button  and on the menu PPM Label click the  property to be displayed  this displays the property value on the cluster header    Cluster ID The homology cluster ID  cluster ID and  sub cluster ID are displayed    Gene name The typical gene name of the cluster and the  typical gene name of the sub cluster are  displayed  default value     Cluster score The cluster score and sub cluster score are    displayed
69. ed ortholog group is relocated to    the center of the screen     27 5  Displaying gene information in a browser    27 0     Click aL  Show the gene information at clicking gene  in the Toolbox on the Genome    Comparison Viewer screen  Upon double clicking in this state  the information on the  default external resource set in    35  External Resource URL Management    is displayed in    a browser     To display the information on the external resource in a browser  click the right mouse    button  and click the displayed external resource URL   Saving the origin    To save the current origin  center point   click View     Save Origin on the Genome    Comparison Viewer screen     27 7  Recovering the origin    To relocate the genome map with the origin saved immediately before as the center point      I  click    ie  Recover Origin  in the Toolbox on the Genome Comparison Viewer screen        107    RECOG Client User   s Manual       27 8  Display Nondisplay of species    Specify the species set to be displayed on the Genome Comparison Viewer screen     1  Click View   View Change    on the Genome Comparison Viewer screen  The screen for  changing the display of the Genome Comparison Viewer screen is displayed     Click the View tab on the screen for changing the display of the Genome Comparison    Viewer screen   2  To display species  select the species in the Candidate column and click the  gt  gt  button   3  To hide species  select the species in the Target column and c
70. ee eet 13  Display and Operation of the RECOG Main Window uu    ccc seeceecesceeessesseseeseeseveeseeseuseneeeensenenss 14  FTN Pee gt N EN   deen eee eee ee ne eee eee ee ee eee ee 14  Se Ue NN asses ce cee AA NE A baat A cect eee eee 15  SU A ace eens reese E E sese vecencseoeeee eerereeee tie 15  aTe e S a Sean ae eee EA ee ENE EA eae nae A E A anne Ao me nner E ne A ne ae  18  Be Ot 0 ON racine eden eeee science eo nere nee 19  ig Me Faas rset E gleoimsceen mutants ist ecdoimtaeedam tiesto 19  5 7  Phylogenetic Pattern Map  PPM  oonicccccssssssssssssscsssssssssesstsessesssesatssstsesessnsaeensaesnseen 20  Bs NRE Or NN EE EEEE A cast E ccc N cee E E EEN E ste E EEE E E E AE EAN 21  PEE e E E    E EE E E ES N E E E EA EE AAA E E OAA NS P E AE 23  ER Se aT cy  AEA OE E AA E A O A A N O A E N E AN E 24  Project Creation and EdItINE senrose aia E aa 25  0 CGreatonof a new Proj  tt ecas aaa a O 25  EARE a EE  lt  N EIES ALEA AE P VE AEE EEE E A EEEE AE N EEA AE EE EE E T 26  6 3  Reference to a list of registered ProjeCts oo   ec cece cece ssscsceeeessssessesesseesesseseesessesseseuseuseseusessevessenseeees 26  04 Froect reci ccc eects ec E E 21       RECOG Client User   s Manual       0 9 Froject normato edine ae eee md et ne a te ne 2   ON essa eee cee eget case eset cece asec scatter TAE OA 28  Switching the i OP sisiane aiaiai idiniin 29  7 1  Checking the RECOG Server USE    cece ceccseecscesceseeceecessesessessesessessessevseeesseusesesseusesesseusessaveneensenees 29  7 2
71. eeter rtm 36  9 3  Display of the DomClust analysis FES OILS sia cecusissscactiivazcateansicrasesinrerasavecaneadoinesenasiaternastvooaniadineeiuasiiecens 39  9 4  Display of the DomClust analysis result properties    cece ce ceeceseeceecesesessevsesesseseuseesnses 40  9 5  Storage of the DomClust analysis results ascii secs ccchcscshcacescenciceinetciondiaciavndcventineedadcenils 41    Control Panel and Set Management Panel    ccc cess ceseesesceseesescesseesesessessessesesseusesesseusevenseusevseeneess 42  10 1  Display on the control eect secre n sc cca cteete vet eweoat cman ectearc act aeta const encematsdaseenteeeeeeceeond  42  10 2  Controli panel operatio meee een etree Na ten ty ean nn nr pe SO ev ee ne Pv Pe eT 43  10 3  Display of the SEE management panel caiicccisscceccscceccicescetctendlecsectcisceceiceusckaveioncemsustanideos  doatebaatinnaveneeuades 44  10 4  Operation of the set management Panel uu    eee ese eee seecesceseeseesesseseeseseseeseesessessessseusevensessensees 45    Display and Operation of the Phylogenetic Pattern Map    ccc cescecesescescesessessevessesseeseneeeens 46  11 1  Display of the locus tags of genes belonging to a Cello    csceceesceeceeeeseeseeeeseeseneeeeeeeees 46  11 2  Display of the cluster property on the cluster header    eee cece ceeeceeceeeeseeseeeeseeseneeeeeeeees 46  11 3  Display Nondisplay of the homology cluster Meader    ceecscccssesesescscesesesessseecsvavesesssseeeveneseeees 41  11 4  Species color setting            o
72. elation  set the value in the Window Size    section     3  Click the Apply button     26 11  Changing the display style of species names    1  To display the screen for changing the Core Genome Alignment view  click View   View  Change    on the Core Genome Alignment screen   2  Select the display style in the Organism column on the screen for changing the Core    Genome Alignment view       Normal  Species are displayed according to their official names     Abbreviation  Species are displayed according to their abbreviated names        Show Strain  If this is checked  the strains are displayed     26 12  Changing the ortholog group labels    1  Click View   View Change    on the Core Genome Alignment screen  The Core Genome  Alignment view is displayed   2  Specify the items to be displayed as labels for the ortholog group in the Label section on  the screen for changing the Core Genome Alignment view       Cluster ID   The cluster ID corresponding to the ortholog group is displayed       Representative gene name of the cluster    The representative gene name of the cluster corresponding to the ortholog group is    displayed        Gene name of the reference genome    The gene name of the reference genome is displayed  If gene name is undefined in that       102    RECOG Client User   s Manual       genome  the Locus Tag is displayed     If these items are specified  the gene name display column on the PPM is switched    synchronously     26 13  Zoom    1  To zoom in zoom o
73. elected   Clustering j  cr  ey root 6       C Bacteria 6        Bacillales 6  of        C Bacillaceae 6   oe   ee acl   IN Bacillus cereus 4H18   IN Bacillus cereus 4Hb20 brB n 87 A008 8  h cereus ATCC 10987  pe    IN Bacillus cereus ATCC 14579 eck  IN B   p  eae Taxonomy All    Sp Group b Any New P    Genome Map  gt  Any Add  gt     Gene List    None        Control Gene order on beb Cancel          Bacillaceae             15 4  Chang  ng the names of the    Any    conditions    On the control panel  select Filter     Taxonomy Filter      name of    Any    condition   click    the right mouse button and click Rename to display the Rename screen           Name   Bacillaceae          On the Rename screen  edit the name  and click the Apply button     65    RECOG Client User   s Manual       15 5  Changing the threshold value of the    Any       conditions    15 6     On the control panel  select Filter     Taxonomy Filter      Name of    Any    condition  to    display the conditions for phylogenetic pattern filtering     From among the conditions for phylogenetic pattern filtering  select the    Any    condition   click the right mouse button and click Any Threshold     Edit to display the Any    Threshold screen     On the Any Threshold screen  specify the    Any    conditions   For details  refer to Item 3 of    15 2 Setting the    Any    conditions        To specify the    Any    conditions more easily  select and click    Any     click the right mouse  button and click Tax
74. elected color pattern upward downward        94    RECOG Client User   s Manual       25  Species Groups    Multiple species closely related to each other can be registered as a species group  The species    groups registered here can be used in the analysis such as the CoreAligner program     25 1  Displaying species groups    1  To display the species groups set in the lower view of the Taxonomy Tree  click the Sp    Group button in the upper view of the Taxonomy Tree on the Selected tab     A Selected   Clustering       Fiter    Sp Group O Select       It Bocus omylotquetaciens FI642   In Bacius artivactt Sterne   IR Gocis cereus ATCC 10987   In Bacus c  r  ut ATCC 14579   th Bactius cereus ES    In Bacius cereus NYH 391 96   IE Bacius thuringiensis serovar korubion 97 27  IN Bacius thuringiensis Al Hakam   IE Bacillus clousd KSM K16    cre Mee Me Vafethmene AAAF          Sacilus amyhiquetaciens FZB42     Socilus cereus ATCC 10997    25 2  Registration of species groups    1  To create a new species group with a specified set of species  select the species on the  Taxonomy Tree in the upper view of the Selected tab  click the right mouse button  and    choose Sp Group   New Group  The species group is displayed in the lower view     Al  Selected   Clustering        Fiter    Sp Group O Select    An Selected   Chasterina       Fiter   Sp Group    Select ik Bocus omyiotquetociens FZG42   in Bacdus artivacit Sterne   IE Bacillus cereus ATCC 10987   In Bacius cereus ATCC 14579 
75. ene count   gt  2       Base Color       Phylogenetic Pattern 2    Control panel    10 1  Display on the control panel    Click the Selected tab on the right side of the screen and click the downward Control button    to display the control panel        42        Sp Group  Gene Set      Test gene set 1 4     Test gene set 2 3     Test gene set 3 4     Gene Set Conbination      Gene Conbination 1    Cluster Set      Test cluster set 11       Test cluster set 2 1      Test cluster set 3 1   Cluster Set Conbination     Cluster Conbination 1  Organism Set     CH Organism Set 1      Bacillus cereus 4H187       Bacillus cereus AH820     Bacillus cereus ATCC 10987      Organism Set 2       Set management panel    RECOG Client User   s Manual       10 2  Control panel operation    10 2 1  Filter settings  Filter   Filter refers to the conditions for selecting the row  cluster  that is displayed on the PPM  In  the Filter folder on the control panel  the filter conditions that are currently applied to the  PPM are displayed  Only those clusters that meet all the filter conditions are displayed on the  PPM  Click Enable Disable to enable or disable the filter settings     The following filter conditions can be specified     Taxonomy Filter Phylogenetic pattern filtering   See    15  Taxonomy Filtering      Gene Organism Count Filter Filtering by gene count species count   See    16  Filtering by Gene Count Species Count in  the Phylogenetic Patterns      Keyword Search Filtering by key
76. es is displayed     Cluster Property List       Gene Set   Cluster Set List    The list of registered gene sets cluster sets is displayed     Properties       The properties of the displayed analysis results are displayed     5 4  Toolbox    Ub a       RO CRE p v ist    The Toolbox can be moved by dragging and dropping the left side of the Toolbox with the    mouse     To return the Toolbox to its original position  click the Close button on the upper right of the    Toolbox            Open file   Save Cluster File   Import DomClust file   Export   Aggregate Mode   Disaggregate Mode  Cluster Mode   Sub Cluster Mode  Expand Collapse Taxonomy Tree  Color genes by properties   Histogram   Keyword Search   Ortholog Clustering  DomClust   Phylogenetic pattern clustering  PhyloPatClust     Core Genome Alignment  CoreAligner        18    RECOG Client User   s Manual       mg    aH Neighboring Clusters    Wd Option    5 5  Zooming scale bars    Each zooming scale bar expands or contracts the horizontal vertical size of the PPM       Horizontal Zoom      _  1L       Vertical Zoom scale bar       Vertical Zoom     The longitudinal size of the PPM cell is expanded or contracted by sliding the scale bar  laterally        Horizontal Zoom scale bar  The transverse size of the PPM cell is expanded or contracted by sliding the scale bar    laterally     5 6  Taxonomy Tree    The Taxonomy Tree displays the taxonomic tree of the given species     1  All tab for selecting the species to be anal
77. etic pattern    can be displayed by graphs  or the numerical data of the specified gene properties can be  displayed by bar graphs     23 1  Function category frequency graphs    In the Disaggregate Mode  the frequency of each function category in the same phylogenetic  pattern is displayed in a graph on the Histogram tab     wh Moye    wW i  71    5    m   i       23 2  Displaying a numerical data graph  a description or    the function category  In the Disaggregate Mode  a graph of numerical gene property values or the description and    the function category of each cluster is displayed on the Histogram tab     1  To display the Histogram screen  click    Histogram  in the Toolbox     be Histogram          Organism E cereus 4H1S 7          Property Ist art        Difference       Organism 1 5 cereus AH157       L  Organism 2  E cereus AH320          Property              V  Show value          RECOG Client User   s Manual       2  Specify the graph to be displayed on the Histogram tab        Description Function Category    The description of the cluster is displayed with the color of the function category of the    cluster in the background     Pyridonine bloastAhes s erotcin       Glycerate kinase Cid         Value  The specified numerical property data are displayed by a bar graph  Cluster property  data or gene property data of a specified species can be displayed      reo  Hoty            ga gtn oih  7500        GKOSSS            Difference  The difference in the nume
78. f the Locus Tag    Click View   View Change    on the Genome Comparison Viewer screen     Click the View tab on the screen for changing the display style of the Genome Comparison    Viewer screen     To display the Locus Tag on the genome map  check Show in the Locus Tag column on  the screen for changing the display style of the Genome Comparison Viewer screen  In the  Row count  specify the number of rows for displaying the Locus Tag  If multiple rows are    specified  their space is used so that names do not overlap each other   To hide the Locus Tag on the genome map  uncheck Show in the Locus Tag column     If 0 is specified in the Row count  the distance between genomes takes its minimum value    and the Locus Tag is not displayed regardless of whether Show is checked or unchecked     Click the Apply button     27 13  Color setting    The color can be set based on the gene position on the reference genome or it can be set on the    Color tab on the control panel     k     Click View   View Change    on the Genome Comparison Viewer screen   Click the Color tab on the screen for changing the display of the Genome Comparison    Viewer screen     On the Color tab  set the colors for genes and ortholog lines       Reference genome    Core genes are colored with color gradations from green to red based on the gene  positions on the reference genome  Non Core genes and RNA are colored in grey and    deep blue  respectively        Use the color of the control tree    For colori
79. g species  where each row represents    ortholog group and each column represents species  In RECOG  an ortholog table is represented       148    RECOG Client User   s Manual       as a phylogenetic pattern map  PPM    Outgroup  The set of control species in phylogenetic analysis  which are located outside of the ingroup  on the phylogenetic tree  In the DomClust analysis  outgroup species is specified as the group    of species which do not belong to the ingroup     P  Phylogenetic pattern  In a strict sense  a binary vector that indicates the presence  1  or absence  0  of a gene for  each species defined for each orthologous group  In a more broad sense  a numeric vector  that indicates some gene property value of each species   Phylogenetic Pattern Map  PPM     A matrix that shows the presence or absence of species based on the DomClust analysis    results    R  RECOG  http   mbgd genome ad jp RECOG    Research Environment for Comparative Genomics  A client server type software program that is a comparative genome workbench for    conducting various comparative analyses based on DomClust analysis results    RECOG server    The server that conducts the DomClust analysis and CoreAligner analysis and provides  gene information in concert with the RECOG Client  Regional Genome Map  RGM     A genome map for ortholog comparison  I    Taxonomy Tree    A phylogenetic species classification in tree form    149    
80. g the colors of the amino acid letter strings    ius Color Settings x    1  Click the  Color Setting  button on    the Multiple Alignment screen to display       Cystein          the Color Setting screen  Aspartic Acid    Glu Glutamic Acid    Phenylalanine  Glycine   His Histidine   le Isoleucine  Lysine    Leu Leucine    2  On the Color Setting screen  you can  specify the colors of the amino acid letter    strings            Met Methyonine    Asn Asparagine    a  A  D  E  _  M  N  a  a    Glutamine    Apply   Restore Close         f       22 3  Displaying the phylogenetic tree    1  To display the Phylogenetic Tree screen  click    Clustal Tree  on the Multiple    Alignment screen     2  To change the parameters used in the creation of the phylogenetic tree  click Ve   Option  in the Toolbox  Specify the parameters on the Tree tab on the Option screen   Taxonomy Tree PPM Function Category Color Neighboring Clusters Muttiple Alignment    Tree    General  KMURA iON v  TOSSGAPS ON vl OUTPUTTREE phijip v     pootstRae     on Oof COUNT   1000  seo   111            85    RECOG Client User   s Manual       ta Phylogenetic Window 0 9 121  File View Option Settines    SEX S Bw    1222403069650  phb  bt I _BALH_3793 bez 8CZK3938    0 1 bce _BC4134  btk _BT9727_3927 bat _BAS4089  bea BCE 4258 bey BCERSB_2878    zka_GK2400  ztn_GING_2331     bha_BH2788    bl d_BL102606       het _ABC2469 7 PUN ry  pu BPUM 2167 022680       oih_OB 1886        638 RE xe OND os      3  On the Phylogenetic
81. gate Mode oe Pe re ee EET aT  13 3  S0rt based on I  96  61    gl    lt  lt  sirnaan nO AD 58  13 4  Display of the sort CANIS cca cs ced act ceca ctn arcades had alse eee dec ndonwossececulietaluaetaonte 59  14  Phylogenetic Pattern Clustering  PhyloPatClust       c  cecssesssscsscsssesesessesessesesssesessesesecseseserereesevatsesersesecees 60  14 1  Execution of PUI IOP BSE sccsiticsnsininsrcenrinsiestcasionmnssrorasintcstaitin iestadenontsdun E eai ianei 60  14 2  Operation of the clustering tree aac reas scesticcaca bisa snadendeecavead iuctniphecceneacsescanetbimaimcedtaetancstpinaeddetiinimueaarelibe 61  US mm  01g  S106 Weg Loc a   lt  eee eee ee ee een oe te eT ene ene een Pn 62  15 1  Displaying the taxonomy filtering conditions iaceisiiccccscdccccasdsccccsosnasliescnscasscssscnsoncasaeseamsbarssbscseanmenecnance  62  15 2  Setting the    Al  conditions sccessescasscactcsensttvesasccsisoasadsincessicsbuedstscexncanin  avansisl deunwsaiveueuaseceuseisncouemesnnnsesaeas 63  15 3  Setting the    Any     COnCICIONS       ccccccsssecsscsesssssesesseesecsecsecsessessecsessesatsaesucsussecsecsessessessessesatsatsessacsessessecsesees 63  15 4  Changing the names of the    Any  conditions       ccscsssssssecsesssssessesecsessecsecsessessessessesetsatsecsecsessessesseeees 65  15 5  Changing the threshold value of the    Any  conditions         cccccsessssscsessessesecsesecsesseseesecsesseseesecsecseees 66  15 6  Setting the None    conditions seeen a ngs A EA aa AE scales cee
82. genes     27 15  Changing the display style of the scale marks    Scale     Relative postion from center     Position of reference genome       1  Click View   View Change     on the Genome Comparison Viewer screen   2  On the Scale tab  set the display style of the scale marks       Relative position from center    The relative position from the center is displayed on the scale marks        113    RECOG Client User   s Manual         Position of reference genome  The position based on the coordinates of the reference genome is displayed on the    scale marks     27 16  Printing    Genome Comparison Viewer Preview    O Specification      100   lt       Page setting       1  Click File     Preview    on the Genome Comparison Viewer screen  The Genome  Comparison Viewer Preview screen is displayed     2  On the Genome Comparison Viewer Preview screen  the following settings are possible        Zoom  Specify the image magnification   If Auto is specified  the image magnification is adjusted so that the image fits onto a    single sheet of paper       Page settings  Specify the paper size  etc     3  Click the Apply button     The print setting screen is displayed  Printing is carried out in accordance with the screen    display     To directly print without displaying the Genome Comparison Preview screen  click File      Print    on the Genome Comparison Viewer screen     114    RECOG Client User   s Manual       28  Updating the Gene Information    Using the gene information up
83. ging to A and B  that are adjacent to each other  and Faaji and Fagj2 are  parameters satisfying 0  lt   Faaji  lt   aaj  lt   1    Coverage ratio  fagz  for absorbing See above  Note that this parameter is not effective  if Fadj2  lt   aaj     adjacent small clusters    Use domclust dump If this is checked  the DomClust analysis is conducted using the  cache for previous DomClust analysis results  This shortens the  processing time of the analysis     146    RECOG Client User   s Manual       37  Glossary    A  Aggregation  Integration of clusters of the same phylogenetic pattern into a single row in the    phylogenetic pattern map     C  Cell  A square corresponding to a species belonging to an ortholog group on the PPM  CoreAligner  http   mbgd genome ad jp CoreAligner    A software program for creating a core structure based on the preservation of gene  alignment sequences between affinity genomes  Circular Genome Map  CGM   A gene map on which genes  etc  are drawn in a circle  ClustalW  A software program for performing the Multiple Alignment  Cluster  A group of genes grouped together based on DomClust analysis results  Cluster ID  A unique ID given to each cluster    COG  http   www ncbi nlm nih gov COG    Clusters of Orthologous Groups of proteins  COGs     D  DomClust  http   mbgd genome ad jp domclust    Hierarchical clustering program for orthologous protein domain classification  which is the    standard method to construct ortholog groups in RECOG     E  Extra T
84. he Genome Comparison Viewer screen        For changing the display area  there are two modes  as shown below  The two modes can    be switched by clicking the relevant button in the Toolbox              t  Simple Mode   In this mode  the display area is moved within a certain interval without adjusting the    gene display position          o  I  Adjust Mode     In this mode  after changing the view area  the display is made upon relocating so that  the genes belonging to the ortholog group that contains the gene near the center of the  view area on the reference genome align or stand in a straight line  Also  regarding the  orientation of the genes  the display is made upon making the gene orientation uniform    according to the settings described in    27 14 Automatic correction of gene orientation        27 3  Zooming       1  To zoom in zoom out on the display area  click       Toolbox on the Genome Comparison Viewer screen  respectively     2  If the zooming in exceeds a certain scale  the screen automatically switches to the detailed       106    RECOG Client User   s Manual       gene view     27 4  Moving a specified ortholog group to the center of    the screen    In normal mode  the mode without the operation set out in 26 5 below   upon  double clicking on the genes belonging to the ortholog group on the Genome Comparison  Viewer screen  the double clicked ortholog group moves to the center of the screen  Also   on the Core Genome Alignment screen as well  the click
85. in the same cluster    lt  Specified value    Calculate by applying a specified value        137    RECOG Client User   s Manual         Representative value of multiple gene property values    Specify the method of determining the representative value when multiple values are    defined for a gene      lt  gt      lt  gt    lt  gt    lt  gt     Min  The minimum value among multiple values is used   Max  The maximum value among multiple values is used   Median  The median among multiple values is used     Average  The mean value of multiple values is used          Representative value of multiple genes in one cell    Specify the method of determining the representative value when there are multiple    genes in a cell      lt  gt      lt  gt    lt  gt    lt  gt     Min  The minimum value among multiple genes is used   Max  The maximum value among multiple genes is used   Median  The median among multiple genes is used     Average  The mean value of the gene properties of multiple genes is used       4  To calculate the dissimilarity of the specified pattern with each cluster  click the Apply    button after specifying the conditions     After the phylogenetic pattern similarity search is completed  the calculated dissimilarity    value is displayed on the label on the side of the PPM  Also  if Sort by the calculated    result is checked  the PPM is sorted based on the value of dissimilarity     The dissimilarity value is registered as a cluster property under the following nam
86. ing two most  similar sequences  or clusters   the algorithm searches for  another sequence  S3  that matches one of the merged  sequences  S1  in the region outside the alignment between the  merged sequences  The algorithm splits the sequence S1 if  such a sequence S3 is found and the alignment between S1  and S3 satisfies the coverage condition specified by this  parameter and score condition specified by the next parameter   Raise this parameter to avoid excessively short domains  generated due to partial matches    Cutoff score for the match between S1 and S3 described above  to split the sequence  The effect of this parameter is similar but  possibly complementary to that of the previous parameter    This option specifies which similarity or dissimilarity measure   score or PAM  for use in orthology identification or clustering   Note that the scores depend on the alignment lengths while  PAMs do not    The bi directional best hit criterion  i e  gene pairs  a b  of  genomes A and B  such that a is the most similar gene to bin A  and vice versa   is a conventional approach for ortholog  identification between two genomes  The uni directional version       144    Cutoff ratio of the score against the    best    Score for missing relationships    Clustering Mode    Cutoff ratio of paralogs for tree    splitting    Phylogenetically related organisms    Overlap ratio  fag1  for merging    RECOG Client User   s Manual       is also routinely used for predicting gene function
87. ions  double click the Color    column in the Condition column and modify it on the Color selection screen displayed     3  In the Filter Color setting column  specify whether or not to apply a combined set to the    color filter setting     Filter and Color  Register a combined set to add it as the condition for the color filter setting   Color    Register a combined set to add it as the condition for the color setting        128    RECOG Client User   s Manual          Filter   Register a combined set to add it as the condition for the filter setting     None   Register only combined sets     To display the combined conditions in Gene Set Combination Cluster Set Combination  on the set control panel  specify the conditions on the Gene Cluster set combination screen    and click the Apply button     If Filter and Color or Filter is specified in the Filter Color setting column  the filter  conditions are displayed in Filter   Gene Set Filter Cluster Set Filter on the control    panel     If Filter and Color or Color is specified in the Filter Color setting column  the filter    conditions are displayed in Color   Gene Set Cluster Set Filter on the control panel     The specified filter settings and color settings are reflected on the PPM  in the comparative    genome map view  etc        An  Selected Chictening   I roet s     E S Bacterias    5 E Bacitaies 5      9 Bacitacese 5    IN B    c  us cereus ANIET  IN Bact cereus AH220  IN Bactazt c  reus ATCC 10057  IM Bacthzt cere
88. is opened  the default RECOG server that is set for the project    conducts the analysis     7 1  Checking the RECOG server used    1  Click File     Change Server to access the list of URL   s of registered RECOG servers  The    server currently used for analysis is checked        View Search Run Option       New b TS see    SS oe   http   mbed eenome ad jp  J  Open file    R a  server List       Open Project      Save  hicter file       7 2  Switching the RECOG server    1  Click File     Change Server  and click the RECOG server to be used  The RECOG server  in use is Switched  and the Taxonomy Tree on the All tab is updated using the data from    the newly specified RECOG server      Note  When the DomClust analysis results are displayed  the RECOG server cannot be    switched  Click File     New     New Analysis to clear the display and start a new analysis     7 3  Reference to the registered RECOG servers    1  Click File     Change Server     Server List    to display the Server List screen  A list of  registered RECOG servers is displayed  The RECOG server used for analysis is checked  in the Select field     29    RECOG Client User   s Manual       Server List    Name  http     mbgd  genome ad jp    http     recog server  ac jp O             7 4  Registering the RECOG server    1  Click File     Change Server     Server List    to display the Server List screen   2  On the Server List screen  click the Add    button to display the New Server screen     3  On the New
89. ister Phylogenetic Pattern       Bacillus cereus AH820 iO  Bacillus cereus ATCC 10987 i000       2  On the Register Phylogenetic Pattern screen  specify the profile name and weight for each    species       To change the weight  select the species in the list of species on the lower right and    specify the weight in the Setting Value column    lt  Value  Specify the weight value      lt  Ignore  Specify the species to be ignored in determining the coefficient of    correlation        134    RECOG Client User   s Manual       After selection  click the Set Value button         You can change the weight using a species set  To do so  select the species set in the  Species Set column above the list of species  Upon selection  the species contained in  the species set are selected in the list of species  Then  set the weight of specified    species in the Setting Value column     3  After specifying the conditions  click the Apply button  The profile is displayed in    Phylogenetic Pattern on the control panel             Control    Fitter   Organism Column   Sort   Color   Phylogenetic Pattern      Phylogenetic Pattern 2     Phylogenetic Pattern 1           Sp Group    33 2  Editing a profile    1  To display the Register phylogenetic Pattern screen  select a profile in Phylogenetic    Pattern on the control panel  click the right mouse button  and click Edit pattern   2  Edit the profile  For the editing method  refer to    33 1 Profile registration from a cluster        3  Af
90. kground color of a cell can be changed according to the gene count within the cell by    setting a threshold value     1  Select Color     Gene count on the control panel  click the right mouse button and click    Edit to display the PPM tab on the Option screen     2  On    Background color cells containing inparalogs   gt     genes     on the PPM tab  specify       48    RECOG Client User   s Manual       the threshold gene count and the background color of the cell     Click the Apply button to display the color setting condition on Color     Gene count on  the control panel     To enable disable the color setting  double click Color     Gene count on the control panel     11 7  Color display corresponding to each functional  category    The color corresponding to the typical functional category of a cluster is displayed in the gene  name display field        l  Click WA  Option  in the Toolbox to display the Option screen  Click the Function    Category Color tab on the Option screen     2  On the Function Category Color tab  specify the functional category for drawing        Taxonomy Tree   PPM   Function Category Color   Neighbor Clustering   Multiple Alignment   Tree   General      Function Category   MBGD  v          1 Amino acid biosynthesis   1 1 Aromatic amino acid famity  1 2 Asportote famiy   1 3 Branched chain famity   1 4 Ghtamate family   15 Pyruvate family   15 Serine family   1 7 Histidine famiy    S     2 Purines  pyrividines  nucleosides  and nucleotides  
91. l on the control panel and to display the relevant species  names on the tree in light red     asl Selected   Gustering All  Selected   Clustering      root 6  9 root 6        I E Bacteria     9 BacterialS   S OY Soctiaies 6     A Bacillales 6          Bacifacene 6              9 Bacillaceae 6   IN Bacillus cereus 4H187  IN Bacillus cereus AH820  IN Bacillus cereus ATCC 10987  IN Bacillus cereus ATCC 14579  IN Bacillus cereus B4264  OUT Oceanobacillus iheyensis HTE831    IN  IN Be Taxonomy    ik Be Sp Group  ik Be Genome Map  our Ox Gene List    Gene order on bcu          15 3  Setting the    Any    conditions    1  Select two or more species names on the Taxonomy Tree in the upper part of the Selected    tab     2  Click the right mouse button and click Taxonomy     Any New     Setting    to display the    Any Threshold screen   Any Threshold          Name   siete  cle  ot              AN tanaan  ctustering   O Coun    1     less   o  1     a Bact      v    or more 14 A      3 C Bacilses 6  oft    So  aa   sac   Percent  C    orless   0     IN Eachus careus AH167  IN Eactus cereus AH320  IN a A AL 10967  IN Bactus cereus ATCC 14579                A   ormore   50    0    209 40 60 8 100                   js fe  Taxonomy a All  Sp Group A fey New  gt  Setting  Genome Map po   ny Add  gt  For of Any Corxitions  Gene List  None One or more  Gene order on bcb Cancel 20  or more  Fitter    a 40  or more  Samm aego ee moreris  ndh ener mre  a OJAD FHH Neighboring Chisters_ SOK or m
92. le format output by clicking Export gene cluster set    Ce Gene property format  This is the file format described in    29 1 Registration of gene properties           121    RECOG Client User   s Manual       2  Click File     Import   Gene set file  and select the destination for registering the gene set     Select from among the following        Current Project  project name  Register the gene set so that it becomes available only in the current project or in the    specified projects        Global    Register the gene set so that it becomes available in all projects     Register gene set       Name          File             Location            Overwrite the gene set of the same name       3  The Register gene set screen opens  To register the gene cluster set  specify the set name   gene cluster set file and the destination for saving  and click the Apply button     If    Overwrite the gene cluster set of the same name    is checked  the registration is made    upon overwriting the gene cluster set of the same name if such a set exists     The registered gene cluster set is displayed in Gene Set Cluster Set on the set control    panel   30 1 2  Registration from the cluster selected on the PPM  1  Select the cluster on the PPM     2  To open the Register gene cluster set screen  click the right mouse button and click Create    gene cluster set     3  To register the gene cluster set  specify the set name and the destination for registration on    the Register gene cluster
93. lick the  lt  lt  button     4  Click the Apply button              B cereus AH187  B cereus AH820  B cereus ATCC 10987  B cereus ATCC 14579  B cereus B4264     OUT Oheyensis HTESS1       OO000  8             Up   Down       Display elements Organism Morne   V  Core genes One       Lines      Universal Onity 1vs1 Onity  Non Core genes     V  Unique Only         Abbrevistion     v Line      Universal Only  v 1vs1 Only  Show Strain       108    RECOG Client User   s Manual       27 9  Changing the display order of species    1  Click View   View Change    on the Genome Comparison Viewer screen  The screen for  changing the display of the Genome Comparison Viewer screen is displayed     Click the View tab on the screen for changing the display of the Genome Comparison    Viewer screen     2  To permute the species  select the species in the Target column on the View tab and click    the Up or Down button     3  Click the Apply button     27 10  Display Nondisplay of genes or ortholog lines    1  Click View   View Change    on the Genome Comparison Viewer screen     Click the View tab on the screen for changing the display of the Genome Comparison    Viewer screen     2  Inthe Display elements column on the View tab  specify the display nondisplay of genes  and the display nondisplay of the lines that represent ortholog relationships     Genes and RNA display switching options    Option Details  Core genes The genes extracted by the CoreAligner analysis are called Core  genes    
94. ll     If a cell contains multiple genes  specify the method of determining the  representative value for the sorting    lt   Min  The minimum value among the multiple genes is used    lt   Max  The maximum value among the multiple genes is used    lt  Median  The median among the multiple genes is used    lt  Average  The mean value among the multiple genes is used        Only numerical type gene properties apply     3  After specifying the sort conditions  click the Apply button to sort and display the    phylogenetic patterns on the PPM according to the sort conditions     13 4  Display of the sort conditions    The currently valid sort conditions are displayed in the Sort folder on the control panel         Control     Sp Group    Fitter   Organism Column        q Category gene name      Function Category MBGD Cluster      Gene name Cluster       Cluster ID      Sub Cluster ID        59    RECOG Client User   s Manual       14  Phylogenetic Pattern Clustering   PhyloPatClust     Upon the analysis of the phylogenetic pattern clustering  PhyloPatClust   each ortholog group  is clustered based on the similarity of the phylogenetic patterns and sorted based on the results     Also  the clustering tree is displayed on the Clustering tab     14 1  Execution of PhyloPatClust    1  Glick Wee  Phylogenetic pattern clustering  PhyloPatClust   in the Toolbox to display    the Confirm screen     2  Specify the conditions on the Confirm screen     El Confirm    Q  Phylogenetic Pattern
95. lobal  uneditable   System defined gene properties provided by the RECOG server     They cannot be edited or removed      lt  Global    Gene properties that can be referred to in all projects      lt  Project name    Gene properties that can be referred to only within a project     Category  category    Cluster properties only    lt   DomClust  Property corresponding to the DomClust results    lt  Homology Cluster  Property corresponding to a homology cluster      lt  Cluster  Property corresponding to a cluster     lt  Sub Cluster  Property corresponding to a sub cluster          119    RECOG Client User   s Manual       29 3  Editing properties    The property name and type can be modified     L    To display the Gene _Property Cluster Property screen  click Option   Gene  Property Cluster Property List        To display the Edit gene property screen  select the property to be edited on the Gene  Property   Cluster Property screen  and click the Edit button     Edit gene property       Name   Expression          Type Num          Enum elements          Multi value       On the Edit gene property screen  specify the gene property name  Name   type  Type   and the presence or absence of multiple attribute values  Multi value    If the enumeration type is specified  specify also the possible values  Enum elements  in    comma delimited form   On the Edit gene property screen  click the Apply button     On the Gene Property   Cluster Property screen  click the Apply button     
96. lts similar to that by single linkage clustering  The default  value   blank  is  score_cutoff   0 95  or  pam_cutoff   0 95      This option specifies whether orthologous or homologous  groups shall be created  It is simply equivalent to omitting the  tree splitting procedure described below by specifying phylocut   gt  1     In MBGD  orthologous groups are created by splitting the trees  of homologous clusters created by the hierarchical clustering  algorithm  A node with two children  A and B  is split when      Intersect Ph A  Ph B       min   Ph A      Ph B      gt  phylocut     where Ph A  denotes the set of species contained in node A   phylogenetic pattern    Ph  denotes the cardinality of Ph  and  Intersect A B  is the intersection of sets Aand B  This parameter  is not effective when ClusteringMode    homology  is specified    When counting the number of species in the above calculation   one can incorporate taxonomic information by counting related  species only once  A taxonomic rank can be specified to  determine which set of organisms are considered to be related    After the tree splitting procedure described above  two clusters       145    RECOG Client User   s Manual       adjacent clusters of domains are joined when they are almost always adjacent to  each other  More precisely  two clusters A and B are joined  when     adjacent A B     max  A   B    gt   raaj1  or   jadjacent A B     min A   B    gt   ragj      where adjacent A B  is a set of domains belon
97. n     Click the Apply button on the Project List screen       Project information editing    Click File     Open Project     Project List    to display the Project List screen     On the Project List screen  select the project and click the Edit button to display the Edit  Project screen     On the Edit Project screen  change the project name and server URL     After editing  click the Apply button on the Edit Project screen to display the details of the  edited information on the Project List screen     Click the Apply button on the Project List screen     27    RECOG Client User   s Manual       6 6  Project removal    1  Click File     Open Project     Project List    to display the Project List screen     2  On the Project List screen  select the project to be removed and click the Delete button  A    warning message appears  Click the OK button     Confirm    Q  Delete project  Test Project         3  On the Project List screen  click the Apply button      Note  Upon the removal of the project  all the analysis results  including the DomClust results     of the project are removed      Note  The files created in a project are saved in the following folder in the user   s home  directory  RECOG project project_ name  So  unnecessary files can be removed by    directly accessing this folder     28    RECOG Client User   s Manual       7  Switching the RECOG Server    The RECOG servers that conduct DomClust analysis and CoreAligner analysis can be  switched  When a project 
98. n Viewer assigns colors to each gene based on the core structure  extracted by the CoreAligner analysis  and displays a genome map that connects the  corresponding ortholog groups by straight lines  Upon zooming in  the Viewer automatically    switches to the detailed gene view  see the figure below         Ga Genome Comparison Viewer    ene PE     MB   25 0 25                    Banyloliquefaciens    Bandiracts    Bantiraciy    Ranthiracts    Berens    B cereus    Bee a es es ee       m_   lt     R cereis       25 50    REAM O00010 ABAM 000020  REAM 000030 RBA                 B anyloliquefaciens  WAM OIR 10    Bandiracts    Bantiracty       Benthracts                  E BCE 0 01  Beirens  BCE 50  5033  codi  B cereus      l aki i  BCZKL wt BCZHDOC BCZHDOC BCZKOO  i  EA TA  B cereus a Ga g Y  BCZKS 180 BCSK5 ISI BCzKS IS  EcCZKS5I84          105    RECOG Client User   s Manual       27 1  Displaying the Genome Comparison Viewer    The Genome Comparison Viewer is displayed after the CoreAligner procedure is finished   Therefore  the Viewer can be displayed in the same manner as that described in    26 1 Running  the CoreAligner program    and    26 2 Displaying the CoreAligner analysis results        Also  the viewer can be displayed by clicking View   Genome Comparison Viewer on the    Core Genome Alignment screen   27 2  Changing the display area    1  To scroll the view area in the direction of the clicked button  click      Move Left         Move Right  in the Toolbox on t
99. n the Neighboring Clusters    screen       Search range of clusters  Specify the range of clusters considered to be neighborhood on the PPM table  If the  range specified is N  N clusters above and below each cluster are considered to be    neighborhoods of that cluster       Range within which genes are considered to be in each other   s neighborhood  Specify the distance between two genes on the chromosome considered to be in each    other   s neighborhood        Recalculate when the table is re sorted  If the order of clusters is changed due to the sort  etc  of clusters on the PPM  and if  this column is checked  neighborhood gene clustering is automatically executed under  the conditions specified immediately before  If unchecked  the results of the    neighborhood gene clustering are cleared after the order of clusters is changed     Netehborine Clusters    Search range of clusters    Range consider to be neighborhood gene B       Recalculate when the table is re sorted          92    RECOG Client User   s Manual       To execute neighborhood gene clustering  click the Run button on the Neighboring  Clusters screen    Upon the completion of the process of neighborhood gene clustering  the group of genes  considered to be in each other   s neighborhood on the PPM is clustered and displayed in    the same color  Also  Neighboring cluster is displayed in Color on the control panel        In this figure  only the cells having the same color in the same neighborhood in the t
100. ne  respectively  Using    Any     more complicate conditions can be specified  such as       Existence in more than half of each of bacteria and archaea        Species Display on the PPM    set   All The clusters that are present in all the species in the set are  displayed    Any The clusters that are present in more less than a certain    number percentage of species in the set are displayed     None The clusters that are absent in all species in the set are displayed     Species  color  Light red    Light blue    Grey    1  Displaying the taxonomy filtering conditions    The taxonomy filtering conditions are displayed in Filter     Taxonomy Filter on the control       panel  lan Selected   Clustering      root 6     S Bacteriac6     H Bacillales 6      9 Bacillaceae 6   IN Bacillus cereus 4H187    IN Bacillus cereus 4H820       IN Bacillus cereus ATCC 1096   IN Bacillus cereus ATCC 14579  IN Bacillus cereus B4264   OUT        Sp Group       Filter A     A Taxonomy Filter     29 aa      Bacillus cereus 4H187     Bacillus cereus AH820      9 None 1      Oceanobacillus iheyensis HTE831     29 Any Bacillaceae   20 0   2      Bacillus cereus ATCC 10987     Bacillus cereus B4264    62       RECOG Client User   s Manual       15 2  Setting the    All    conditions    1  Select the species on the Taxonomy Tree in the upper part of the Selected tab     2  Click the right mouse button and click Taxonomy     All to display the registered species in  Filter     Taxonomy Filter     Al
101. ng  the color settings on the Color tab on the control panel  i e  the same    color settings in the PPM table  are used        111    RECOG Client User   s Manual    Color         Reference genome       Use the color of the control tree          27 14  Automatic correction of the gene orientation    In accordance with the setting for the automatic correction of the gene orientation  the gene    orientation is made uniform in the ortholog group displayed in the center     The setting for the automatic correction of the gene orientation is valid only in Adjust Mode       View   Color   Arange    Scale  Arrangement of orthologous genes      Arranged in the positive direction         Arranged in the consensus direction of the core genome           Arranged in the direction of the reference genome          1  Click View   View Change    on the Genome Comparison Viewer screen        112    RECOG Client User   s Manual       Click the Arrange tab on the screen for changing the display of the Genome Comparison    Viewer screen     2  Specify the method of automatic correction of the gene orientation on the Arrange tab        Arrange in the positive direction    Place all genes in the positive direction        Arrange in the consensus direction of the core genome    Place all genes in the consensus direction of the ortholog group obtained by the    CoreAligner analysis        Arrange in the direction of the reference genome    Place all genes in the direction of the reference genome 
102. ntity   Alignment coverage   Alignment coverage for domain spitting   Score cutoif for domain splitting    Sknlarty meseure tor orthology     score    pan   Cort DP score IPAM oistence fpercentilertty tring    S S      lt    CSsSY  Best ht crterion  none Obidrec CQunidirec  Cufoft ratio of the score against the best   Score tor missing relstionshigs   Chastering Mode                               Cott ratio of paralogs for free seling  Overleg ratio  radj1  for merging acfacerd clusters  Coverage ratio  radj2  tor absorbing agacer small clusters          Relalive weight for horiz  transfer       Use deenclust dump       After specifying the DomClust analysis result name and parameters  click the Run button    to display the progress screen and execute the DomClust analysis     Click the Run in background button on the progress screen to execute the DomClust    analysis in the background  In this setting  other operations can be performed concurrently     To see the progress screen of the DomClust analysis that is underway in the background        37    RECOG Client User   s Manual       double click the progress bar displayed at the bottom right of the screen     Update data     Ortholog Clustering  DomClust   Reading       Ortholog Clustering  DomClust   Sorting       Ortholog Clustering  DomClust   Indexing       Ortholog Clustering  DomClust   Clustering       Ortholog Clustering  DomClusty   Create Result Table       Reading Cluster File       Run in background       Upon
103. obacillus kaustophilus HT4 426  OUT Geobacillus thermodenttrificans NG8O z    33    RECOG Client User   s Manual       2  To cancel the selection of the ingroup outgroup species  select and click the relevant  species or taxonomic group on the Taxonomy Tree and click Group     Cancel  To cancel    all species or groups  click Cancel on the uppermost root node     8 4  Automatic ingroup outgroup specification    The best way to evenly select the species to be analyzed is to select a representative species for  each taxonomic rank  RECOG can automatically select a representative species from each  taxonomic group for choosing the target species  Here the weight of species is determined by  the date of publication of the genome sequence  that is  the earlier the determination of the    genome sequence  the higher its significance     1  From the Rank drop down menu located above the Taxonomy Tree  specify the normative    taxonomic rank     2  On the Taxonomy Tree  select and click the taxonomic group  with a rank higher than the    normative rank specified above  to be subjected to analysis     3  Click the right mouse button  and click Group     Auto Ingroup or Auto Outgroup  From  among the selected classification groups  the species with the highest weight is chosen as  ingroup outgroup species for each normative rank taxonomic group       All   Selected   Clustering   Selected   Clustering      Rank  family v Rank  family v                          A root 598  A root 598   
104. off percent identity    Alignment coverage    Alignment coverage for domain    splitting    Score cutoff for domain splitting    Similarity measure for orthology    Best hit criterion    Details    This value specifies a cutoff E value for the BLAST results  The  maximum value is 1e 2  Note that  in MBGD  the E value is  adjusted so that the size of the search space  the database size  times the query length  is 1e9    Cutoff score of the optimal local alignment with the JTT PAM250  scoring matrix  Jones ef al   1992   The same cutoff is used for  both the selection and the clustering steps when score is used  as a similarity measure    PAM is a unit of evolutionary distance defined as the number of  accepted point mutations per 100 residues  Dayhoff et al    1978   The PAM distance is estimated based on the PAM  substitution matrix which gives the best alignment score  The  same cutoff is used for both the selection and the clustering  steps when PAM is used as a dissimilarity measure    Percent identity is defined as  the number of identical residue  pairs     alignment length    100  The alignment length  includes the internal gaps    Alignment coverage is defined as  alignment length     length  of the shorter sequence    100  Raising this parameter  removes matches only in short regions before the clustering  procedure  MBGD does not conduct this check by default    In MBGD  a domain splitting procedure is incorporated in the  hierarchical clustering algorithm  When merg
105. ojects is displayed     The currently displayed analysis results are saved    DomClust file      DomcClust result files are imported    Gene property file      Gene property files are imported    DomClust results are output in tab delimited format  and PPM images are output in  PDF format    Regarding imperfect genomes  the contig alignment sequence  direction  etc  on the  chromosome are set    The size of the PPM image to be saved is specified    The local genetic data are updated     The RECOG Client is terminated     View menu      View   Tool Run Option      gb Ageregate Mode          Expand Taxonomy Tree        Collapse Taxonomy Tree    Select all clusters on PPM  PPM Label    ia    Histogram          Taxonomy Tree Pane     Information Pane       Color genes by properties          The PPM display mode is switched between the aggregate mode    and the disaggregate mode     Regarding the analysis of sort  etc   whether to conduct an  analysis based on clusters or sub clusters is specified    The tree in the taxonomy browser is expanded one level    The tree in the taxonomy browser is collapsed one level    All the clusters on the phylogenetic pattern map  PPM  are    selected        16    RECOG Client User   s Manual       PPM Label Whether to display the gene name or the cluster ID in the label  display fields on both sides of the PPM is specified    Color genes by properties Each gene on the PPM is classified by color in proportion to the  gene property value    Histogr
106. onomy     Any New   For all any conditions     Any    condition    shown below  The    Any    conditions are changed to the clicked    Any    conditions         Control     Sp Group    Set     H Fitter    a     O Taxonomy Filter  a 9 Alc      Bacillus cereus 4H187     Bacillus cereus 4H820    Slee pany Bacillaceae  gt  20 0          Bacillus cereus ATCC Rename       Bacillus cereus B426  Os Edit                  E Gene Organism Count Filter Disable For all Any Conditions    Keyword Search Delete One or more    Gene Set Filter 0  OF more    Cluster Set Filter 40  or more  H Organism Column 60  or more  IN Bacillus cereus 4H187 80  or more  IN Bacillus cereus 4H620 20  or less  IN Bacillus cereus ATCC 10987 40  or less  IN Bacillus cereus ATCC 14579 60M or less  IN Bacillus cereus B4264    80  or less       nut Oeeannhacilliis ihewensic HTFA31    Setting the    None    conditions    Select the species on the Taxonomy Tree in the upper part of the Selected tab     Click the right mouse button and click Taxonomy     None to display the registered species    names in Filter     Taxonomy Filter     None on the control panel  The relevant species       66    RECOG Client User   s Manual       names are displayed on the tree in grey     5 root 6      9 Bacteria 6     H Bacillales 6      H Bacillaceae 6   IN Bacillus cereus 4H187  IN Bacillus cereus AH820             IN Bacillus cereus ATCC 10987  IN Bacillus cereus ATCC 14579  IN Bacillus cereus B4264  Taxonomy b All  Sp Group b Any 
107. ore  D C  GerefOrgaris     Show organism columns 206 o less  S _ KeywordSea     Color oresnism  gt  ceB   4B or less    SJ  Cene SetFite     Organism set  gt    60  or less  D Z  Custer Set Fa l   kA   80K or less    Cenaniem Prime     V O phylogenetic pattern          3     s    RECOG Client User   s Manual       Specify the    Any    conditions on the Any Threshold screen        Count  Specify the conditions for the number of species in a cluster    lt    or less  The number of species is lower than or equal to      lt     or more  The number of species is higher than or equal to     Example 1  The conditions    or more  3    or less  5  are met if the number of species  in a cluster is 3 or higher and 5 or lower   Example 2  The conditions    or more  5    or less  3  are met if the number of species  in a cluster is 5 or higher or 3 or lower      Note the use of and and or in the two examples        Percent  Specify the conditions for the percentage of species in a cluster    lt    or less  The maximum number of species accounts for    of all    lt    ormore  The minimum number of species accounts for    of all   Example 1  The conditions    or more  30    or less  50  are met if a species in a cluster  accounts for between 30 50    Example 2  The conditions    or more  50    or less  30  are met if a species in a cluster    accounts for 50  or more or 30  or less     After specifying the    Any    conditions  click the Apply button on the Any Threshold screen  to display
108. p is mandatory  while the specification of outgroup is optional   If the  outgroup is specified  groups are severed in creating an ortholog group so that the ingroup    species form a single phylogenetic group against the outgroup species   1  On the Taxonomy Tree  select and click the relevant species or taxonomy nodes     2  Click the right mouse button  and click Group     Ingroup or Outgroup  The selected    species are chosen as the ingroup or the outgroup species  respectively     IN is displayed for the ingroup species  and OUT is displayed for the outgroup species       H Bacillales 43      N Bacillaceae 22        q Bacillales 43        eens FiB42      Bacillaceae 22     Bacillus anthracis Ames Ancestor  420  IN Bacillus anthracis Ames             Bacillus amyloliquetaciens FZB42 IN Bacilus anthracis Sterne     Bacillus anthracis Ames Ancestor  420 IN Bacillus cereus ATCC 10987  a    Bacillus cereus ATCC 14579       Bacillus cereus E33L       Bacillus cereus NYH 391 98     Bacillus thuringiensis serovar konkukiar     Ba Color  gt  Outeroup    Bacillus thuringiensis AI Hakam     Bacillus cereus E Auto Ineroup    Bacillus weihenstephanensis KBAB4     Bacillus cereus  Auto Outeroup    Bacillus clausii KSM K16     Bacillus thuringie Cincai   Bacilus halodurans C 125    Resilhic thi winai    Bacillus licheniformis ATCC 14580  DSM     Bacillus licheniformis DSM 13  ATCC 14     Bacillus pumilus SAFR 032     Bacillus subtilis 168     Exiguobacterium sibiricum 255 15    OUT Ge
109. p to be subjected to analysis     8 1  Expansion Collapse of the Taxonomy Tree    The classification hierarchy of the Taxonomy Tree can be expanded or collapsed in a stepwise    manner by clicking the Toolbox buttons     For expansion  click  Expand Taxonomy Tree  in the Toolbox  One expansion    increment is made for each click            For collapsing  click  Collapse Taxonomy Tree  in the Toolbox  One collapsing    increment is made for each click     8 2  Specification of a set of classification ranks to be    displayed on the Taxonomy Tree    Users can choose a set of taxonomic ranks  species  genus  family  order  etc   to be displayed    on the Taxonomy Tree        1  Click W6  Option  in the Toolbox to display the Option screen     2  On the Option screen  click the Taxonomy Tree tab     3  On the Taxonomy Tree tab   s Display Rank form  specify the classification rank to be  displayed        32    RECOG Client User   s Manual         All  All the taxonomic ranks are displayed     Select  Only the checked taxonomic ranks are displayed       Click the Default button to return to the default settings     ace Option          8 3  Specification of the ingroup outgroup    On the All tab  users can choose a set of species to be compared by the DomClust program    Specify a set of species belonging to the interested phylogenetic group as ingroup  and specify   as the need arises  a set of distantly related species for comparison as outgroup  the  specification of the ingrou
110. pecies azciesiccsn discs hace ccesidcuisdiitessaccecstacsctuidicounednieatedsialesovecdadtonseceetneuidowentaecteie  715  E eo  ON eee een ca eee ee ee ee ees 76  19  i  Displaying the lst of pen  s icone cde cacesce act bencn get vet weceacetatdanctaslectj a 76  19 2  Sorting the list oan ci  gt  ee re ene oe en ee ee ORE nen Sere renee enter ee 76  19 3  Saving the list of genes wove cece ce ceccecescessecesseueeeeseuseveeseeseusevsesessevsevesseuseseeseeseuseueeseuseuseveseusesesenss 76  20  Display and Operation of the Circular Genome Map  CGM        ccccscssscsssssesessssesessesesesessesscsesesersesecaeseseees 77  20  L MVS Ry MN AI cs ce ace AAR AERA AREA AA ANETAR 77  20 2  Changing the selected FE CIO ae ecciicasa see ietaace eecala idcaeedcndc itu eeeealts easiness ene ececelesinteoiteeebaes 78  20 3  Linkage between the PPM and CGM ouuw ccc cece cece see ceeeeseeseeceseesousessesseseuseuseseuseusesesseuseeseeenses 78  20 4  Changing the color of NCS cscs cccesasonccdind cacccetecanzdce ctionentedledeausanacessiecoletenanbiacdsasiscusdusladoladnonsicaciuntlecues 78  20 5  Displaying gene information in a DrOWSEL Luu    ieee cece cescesee see cescescesessesceesesesseuseseuseesesseveeseuseveeseusannees 719  21  Display and Operation of the Regional Genome Map  RGM        c cccssscsssssesessesesesesseseseesessesesesereeserseseseees 80  2h Diplavne the RON aeeer ene errr nd nr EE rt Uren gS EP DST Eee PS Un ee Toate ne eT 80  21 2  Zooming in out on the RGM sreiniascavna
111. pecify whether to sort for each cluster aac  Cluster Mode  or    designation is not effective     Cluster Mode Sorting is carried out for each cluster   If a sub cluster belonging to a cluster contains at least one species   sorting is carried out  given that the cluster contains at least one  species     Sub Cluster Mode Sorting is carried out for each sub cluster     2  Click the right mouse button on the PPM  select Sort on the pop up menu and click the  sort method on the sub menu  Sort and display the phylogenetic pattern on the PPM as per  the specified sort method  However  for Gene order and Phylogenetic pattern similarity     the species or cluster to be sorted depends on the location on the table of the clicked point        56    RECOG Client User   s Manual       Sort method Details   Category gene name The data are sorted by function category gene  name    Gene order on  lt genome name gt  The data are sorted in ascending order of the    position of the specified species on the  genome    Phylogenetic pattern  lexical order  The data are sorted in lexical order of the  phylogenetic patterns    Phylogenetic pattern similarity based on the The data are sorted in order of similarity to the   cluster   phylogenetic pattern of a specified cluster  As  the similarity indicator  specify one of the  following   1  Normalized hamming distance  2  Correlation coefficient  3  Correlation coefficient  absolute  4  Mutual information   Phylogenetic pattern cluster  PhyloPatClu
112. r   s Manual       30 4  Registering additional genes clusters to a    gene cluster set    The cluster and the genes contained in a cluster selected on the PPM can be added to a    registered gene cluster set   1  Select a cluster on the PPM     2  To add the genes cluster to a registered gene cluster set  select a gene cluster set in Gene  Set Cluster Set on the set control panel  click the right mouse button  and click Add    selected genes clusters to     30 5  Removing a gene cluster set    1  In Gene Set Cluster Set on the set control panel  select a gene cluster set  click the right    mouse button  and click Delete gene cluster set     2  To remove the gene cluster set  click the OK button when the warning message is    displayed     30 6  Referencing the list of gene cluster sets    On the Gene set Cluster set screen  the list of registered gene cluster sets can be checked   1  To display the Gene set Cluster set screen  click Option   Gene Set  Cluster Set List        2  To display the gene property  click the Gene tab  and to display the cluster property  click    the Cluster tab  On each screen  the following items are displayed     125    RECOG Client User   s Manual         Name  property name      Location  location for saving     Gene properties only      lt  Global  A gene property that can be referred to in all projects      lt  Project name    A gene property that can be referred to only in the named project     Gene Count  the gene count of a gene set   Gen
113. r setting based on a combined cluster set condition   See    31  Combined set      Gene count Color setting based on the gene count in a cell   See    11 6 Color change according to the gene count in a cell      Base color Standard color setting   See    11 5 Changing the color of the PPM cell boundary        10 2 4  Phylogenetic pattern registration  Phylogenetic Pattern   The profile used in the similar phylogenetic pattern search function is displayed     For the operation procedure  see    33  Similar phylogenic pattern search        10 2 5  Organism column setting  Organism Column   The species to be displayed on the PPM is displayed   For the operational procedure  see    18  Changing the display order and display nondisplay of    species        10 3  Display of the set management panel    Click the Selected tab on the right of the screen and click the Set button below to display the    set management panel     44    RECOG Client User   s Manual       10 4  Operation of the set management panel    On the set management panel  the following sets are displayed and can be operated     Gene Set    Gene Set Combination    Cluster Set    Cluster Set Combination    Organism Set    Gene sets are displayed    See    30  Registration and management of gene  sets cluster sets       Combined gene sets are displayed    See    31  Combined sets       Cluster sets are displayed    See    30  Registration and management of gene  sets cluster sets       Combined cluster sets are display
114. rence are specified  and if a cell contains multiple genes  specify the  method of determining the representative value for graph display    lt  Min  The minimum value among multiple genes is used    lt  Max  The maximum value among multiple genes is used    lt  Median  The median among multiple genes is used    lt  gt     Average  The mean value of the properties of multiple genes is used      5  To display the graph on the Histogram tab  click the Apply button on the Histogram    screen       The numerical values displayed on the Value and Difference graph can be switched    between display nondisplay on the popup menu Show value  displayed upon    right clicking on the graph     23 3  Switching between the display nondisplay of the       90    RECOG Client User   s Manual       Histogram tab    1  To display the Histogram tab on the right of the screen  choose and check from the menu    View   Information Pane     2  To hide the Histogram tab  uncheck View   Information Pane     91    RECOG Client User   s Manual       24  Clustering Neighborhood Genes    This function groups genes that are located in the vicinity of each other on the phylogenetic    pattern map  PPM  table and in terms of the genome sequence  and assigns a color for each    group     24 1  Execution of the clustering of neighborhood genes    ae  1  To display the Neighboring Clusters screen  click ese  Neighboring Clusters  in the    Toolbox     2  Specify the conditions for neighborhood gene clustering o
115. rical gene property data between two species is displayed by    a bar graph  Two species and their gene properties for comparison should be specified        Info  Histogram         bat   bea bee    I    ga gtn oih          7500000    omg oho    dhbF    EGETI  BASBOT    BAIO94 6451022 2 HAREE          If Value and Difference are specified  the numerical data are displayed on the graph    upon checking    Show value           89    RECOG Client User   s Manual       3  If Value and Difference are specified  specify also the display range and graph style in the       View    column     Range    Specify the display range  To automatically specify the display range as between the  minimum and the maximum values of the target property  click the Auto range    button      Graph Style  Select one of the two graph styles     Bar     bar graph  or    Line     line graph    4  To set the following conditions  click the Option button       Representative value of multiple property values for a gene  If Value and Difference are specified  and if multiple values are set for a gene property  of a gene  specify the method of determining the representative value for graph display    lt  Min  The minimum value among multiple values is used    lt  Max  The maximum value among multiple values is used    lt  Median  The median among multiple values is used    lt  gt     Average  The mean value among multiple values is used       Representative value of multiple genes in a cell  If Value and Diffe
116. roup       After specifying the conditions  click the Run button  The progress screen is displayed and  the analysis by the CoreAligner program starts     Upon the completion of the analysis  the Core Genome Alignment screen and the  Genome Comparison Viewer screen are displayed     tid Core Genome Alinnment           Overview               e ee ee ere e e e    P cemginos  Pcerugnesa  P cemgnese  Puundocine  P entomophila  P fluorescens  P flucrexens  P putida  Pputda  Puen  Pyringae Magnified view  Paringar        Psyringae       98    RECOG Client User   s Manual       26 2  Displaying the CoreAligner analysis results    The previously executed CoreAligner analysis results are displayed     260 3     Click Ll  Open files  in the Toolbox  The Open file screen is displayed     On the Open file screen  select the file filter    Core Genome File   coaln   coregenome     and  then select the project and CoreAligner analysis result file   Upon selecting the CoreAligner analysis result file  the information on the analysis results    is displayed on the right of the screen     Click the Apply button on the Open files screen to display the selected CoreAligner    analysis results and DomClust results     Components of the core structure display     Ortholog group  A longitudinal gene group corresponds to an ortholog group        Node    Each node is represented by a circle or square     Shape Details    Contains only one gene   a Contains two or more genes  inparalogs       Line   
117. s  MBGD does  not use such a criterion in the selection step by default  since  the UPGMA algorithm itself must involve it  but in some  situations  it might be useful for the purpose of filtering out some  apparent paralogs before clustering  See the next section for  details    This parameter is not effective when the best hit criterion above  is not used    Orthology need not be a one to one relationship  As the  bi directional best hit criterion  two genes  a b  are considered  to be orthologs when score a b  satisfies    score a b   max  max_y  score a y     max_x  score x b         100  gt   cutoff_ratio     where x and y are any genes of genomes A and B  respectively   Using cutoff_ratio  100 corresponds to the exact bi directional  best hit criterion    Similarly  as the unidirectional best hit criterion  two genes  a b   are considered to be orthologs when    score a b    min  max_y  score a y     max_x  score x b         100  gt   cutoff_ratio     Although the usual hierarchical clustering algorithm requires a  complete similarity dissimilarity matrix  here only significant  similarities found by the search are used  This option specifies a  value to be assigned for the relationships missed by the search   The value must be smaller  larger  than the similarity   dissimilarity  cutoff  Specifying an extremely small  large  value  will result in a classification similar to that by complete linkage  clustering  whereas specifying a value close to the cutoff gives  resu
118. s 138  33 6  Removing the phylogenetic pattern similarity S   arch uo    ce ceeescsseesescesseccessessesseseessensasseseasens 139   34  Downloading the Sequence information  u    cceccccececesccesecscessesceeessesseeeesesseveesesseusenseseeseuseveeseusesevesseusensenees 140  34 1  Downloading the Sequence information uu    cece cece cee cesceseecescescesescessessesesseuseeseeeeseuseveeseuseveeeenenees 140   35  Management of External Resource URL    S      c cccssccsscessessecsssseseecescessecsecsessessessessesassecsersessessessesatsatsasseseateeee 142  39 1  Registering an external resource URL wie cece cece ses ceeeesescosseseesousesseseesensessesesseuseeesseuseeeesens 142  35 2  Editing an external resource I ac scien cata cece cee caine ctacee bandana gediasdes Ceaedacedcenteucaay 143  39 3  Removing an external resource URL wie eee ce cesceeee ses ceeeesesseeseseeseuseuseveesessessevesseusevensenseneneees 143   e L112  Oc  g   amp   gt   Ge Renna ET EPTE an Et oe en RC eS eRE ET Ur RPO PTR PER SE SIORED NY oS eR EHR RE neESE UTS Per ce oe eet aw nee nega LES A a 144  36 1  DomClust parameters  n   cceccccccccsccccecsceececessesseeeeseuseeseseuseuseveesessevsesesseuseveeseusessuseesesseuseseuseseevesseuseneeeees 144   Be FM 20 ecasiaavecursastcsiedesiagsreniecasas vacua cvidsdnnresiicad sedans nanan asa dali kag vaste rena ela aeadannnaeaeea eee eeie 147    RECOG Client User   s Manual       1  Outline of RECOG    1 1  What is RECOG     Ortholog groups  Clusters        RE
119. s not    removed by the above uninstallation procedure  If you do not need it  remove it manually     3 5  Uninstallation of RECOG for Mac    1  Remove the following directory manually      Application recog app     Library Receipts recog client pkg       2  The data directory  Users  lt user account gt  RECOG is not removed by the above    uninstallation procedure  If you do not need it  remove it manually     3 6  Uninstallation of RECOG for Linux    1  Remove the recog directory manually   2  The data directory  home  lt user account gt  RECOG is not removed by the above    uninstallation procedure  If you do not need it  remove it manually     12    RECOG Client User   s Manual       4  Starting and Terminating RECOG    4 1  Starting RECOG      Windows  Click Start     All programs     RECOG     RECOG menu       Mac    Open  Applications in the finder window and double click the RECOG icon        Linux    From the terminal  move to the recog directory  and execute the following command     4 2  Terminating RECOG    On the menu screen  click File     Exit     13    RECOG Client User   s Manual       5  Display and Operation of the RECOG Main  Window    5 1  Screen structure    EI RICOG vercion 1 1 2   Toot   chartes 29000016 out  Fie Ver Toot An Ostin    JH we G mPORt p vy SRE      Hortror  a Zoo    Traraition state transcriptional reg  Gutter    GoneMe   Cererpton  meton state tre       3 Oreste leone    Amon porh    Douurococos ia       fe ayo    2i          The main win
120. senin            lt  o                          o          o        oZ oOo      O  ada Ada regulatory protervs O methyiguanine DNA  m       a a  r LA  m loss       BCEs264_Ass4adaA Ada reg rvG O methyiguanine DNA m          63740       BCAHIS  _AS7iadsA     BCAH320_3751adsA    020291315   5  09178102   90379510485   5  029141085      02039 10185      02049 1 6485      02086 1 485   02087108   5  02201108   5  02312 16485   S027 108   2  02822108     ripe    pa62  rocR2  teri    ani Se ee  ptional reguistory protein         rrer POA    Trancervdinnal rare tadnr ra dab   criptional regu  stor  putative      F  Display onty matched genes   M Highight matched genes       73          RECOG Client User   s Manual       Check    Display only matched clusters    on the Keyword Search Result screen to display  and only the searched clusters on the PPM  Upon unchecking  all the clusters are    displayed     Check    Highlight matched clusters    to highlight the searched clusters  Upon unchecking   the highlighting is cancelled  Click the color setting column to set the highlight color     17 3  Redisplaying the search results        o  Click    Keyword Search  in the Toolbox to display the Keyword Search screen     On the Keyword Search screen  click the Show search results button to display the last    search results     17 4  Enable Disable filter settings by the search results    Select Filter     Keyword Search on the control panel  click the right mouse button  and    click Enabl
121. snees 99  26 3  Components of the core structure display       cc cece cee cesceseesescescessesessesseseesosseaseveeseuseveeseasensnses 99  204  CSA EV SAY OSI Meise cst ca nant iea elt pdeateceeamdea tities 100  20   Selecting an ortholog 0 tacts esa cac sascaasenecteacacuat tens ananin aieiai riirii iniinis 100  26 6  Locating an ortholog group at center uu    cceesscescseeesesceseesessesseseeseusesseseeseusesseseuseseveeseuseeeesees 100  26 7  Setting a reference genome  gt  ene eee een ee EI ener nc St nee a RTE nee EN CE 100  26 8  Display Nondisplay of SOC CIGS pasiccnvacecniveasaviicarniietonssinsicecandnticencnaia vee odeiundsns aietiswuididvihwuledatotneninanehansisearsens 101  26 9  Changing the display order Of species  u  ee cece ccsceeeecsceeceseesesceseesesseeesesseusesseseeseusesseseusesseeeuseuseeensens 101  26 10  Resetting the window SIZE ou    ceccsescescesceseecssessesessessesessesseesessesesseusesesseusesesseaseseeseesausevenseusensneens 102  26 11  Changing the display style of species NAMES ou    eee ccs c see ce ces cesceeescesceseesesseveesesseuseveesenseneneens 102  26 12  Changing the ortholog group NA CNS sca cise ccc sect acuwasedaceetanttlaccaehaelenacbiedecamnatlaaeiantieceaaned 102  ASA DIN APAP eeepc EEN EE E EAE E A O ET 103  26 14  Searching by gene name Locus Tagiiwi  cceecsssscesssesessecesesesescscecsvesesescevavsveseseesavaveneseseevavaveveneeeetesen 103  26 15  Printing the core structure IAD a cescccicccecia ese ceenceceneiadauceeniiea
122. st  Phylogenetic pattern clustering is carried out   and the data are sorted according to the  hierarchical tree    Homology Cluster ID The data are sorted by the homology cluster  ID cluster  D sub cluster ID    Gene properties    The data are sorted based on the specified gene    properties  see 12 3      13 2  PPM sort in the aggregate mode    In the aggregate mode  the data are sorted based on the phylogenetic pattern of the aggregated    Phylogenetic pattern  lexical order     Select Organism Phylogenetic pattern frequency  v Disageregate Mode Phylogenetic pattern similarity based on the cluster  Phylogenetic pattern clustering  PhyloPatClust _    clusters                 1  Click the right mouse button on the PPM  click Sort on the menu  and click the sort    method on the sub menu to sort and display the phylogenetic patterns on the PPM        97    RECOG Client User   s Manual       Sort method Details   Phylogenetic pattern  lexical order  The data are sorted in lexical order of the  phylogenetic patterns    Phylogenetic pattern frequency The data are sorted in descending order of the    occurrence frequency of the sub clusters with  phylogenetic patterns   Phylogenetic pattern similarity based on the The data are sorted in order of similarity to the  cluster phylogenetic pattern of a specified cluster  As  the similarity indicator  specify one of the  following   1  Normalized hamming distance  2  Correlation coefficient  3  Correlation coefficient  absolute  4  Mut
123. syrthests     41 4 Arosat amiro acid teenity  1 2 Aspartate fasity   1 3 Branched chain fanty       14 Caternate family   15 Pyruvate landy   1 6 Siira liniy   1 7 Hstidna farnty  2 Punines  pyrimidines  racicomdes  and muciootiges  2 4 Z Deoxyriboraclontide metabolism    2 2 Nxlecide and rexiooude rterconversions       2 3 Purine rorscioctide biosynthesis  2 4 Pyresdine rhorwciecntide bosyrthesis  25 Salvaje of nucieosices and nuciedtides       25 Suagae tescloctice bins yntherstit tend Goerwer tines    O Use the color of the control tree    Lasse   restore      creste core _     21 5  Displaying gene information in a web browser    lif    By double clicking a gene  you can display in a web browser the informationof the default  external resource that is set according to    35  External Resource URL Management     You    can also click the right mouse button on a gene and choose an external resource URL to  display     82    RECOG Client User   s Manual       22  Multiple Alignment and Phylogenetic Tree    Multiple alignment among genes belonging to a cluster can be created using the ClustalW    program  Also  a phylogenetic tree can be created and displayed based on the results of the    multiple alignment     22 1  Execution of multiple alignment     Note  Multiple alignment can be executed only in an environment where Internet connection is    available     1  Specify the execution parameters of ClustalW     Clicking Ve  Option  in the Toolbox to display the Option screen
124. ter       3  On the Download sequence screen  select amino acid sequence or DNA sequence     Protein  Download the amino acid sequence     DNA    Download the DNA sequence  For the DNA sequence  the extra sequence count obtained  upstream and downstream and the delimiting letter between the extra sequence obtained    and the gene DNA sequence can be specified   4  In Target genes  specify the target gene     All genes on the cluster table  The genes contained in all the clusters displayed on the PPM are the targets     Genes included in the selected cluster    The genes contained in the cluster currently selected on the PPM are the targets        140    RECOG Client User   s Manual       5  After specifying the conditions  click the Apply button  When the screen for file saving is  displayed  enter the file name and click the OK button     141    RECOG Client User   s Manual       35  Management of External Resource URL   s    To display the information on external resources in a web browser from the Info tab and the    Regional Genome Map  register the external resource URL   s     35 1  Registering an external resource URL    1  Click Option   URL    to display the URL screen        http  Jimbgd genome ad jp       Organism  LocusTag          2  To display the Edit URL screen  click the Add button on the URL screen               FRECOG RECOG_gene_info_frame pl spec  1  amp name  2                  Organism  LocusTag             3  On the Edit URL screen  specify the name of 
125. ter editing the profile  click the Apply button     33 3  Removing a profile    1  Select a profile in Phylogenetic Pattern on the control panel  click the right mouse button     and click Delete pattern  When the warning message is displayed  click the OK button     135    RECOG Client User   s Manual       33 4  Similar phylogenetic pattern search    1  To display the Calculate distance by pattern screen  select a profile in Phylogenetic       Pattern on the control panel  click the right mouse button  and click Calculate distance    by pattern     Calculate distance by pattern Calculate distance by pattern          Name          Name         Profile    Profile     Phylogenetic pattern            Phylogenetic pattern    Gene property  Start                Type of distance Normalized hamming distance       O Gene property Start         Sort by the caluculated result       Type of distance Normalized hamming distance              Treatment of missing value       Ignore     Average     Specified value  Presentative value of multiple gene s property value       Min     Max     Median     Average  Presentative value of multiple gene in one cell       Min     Max     Median     Average             Sort by the caluculated result       2  On the Calculate distance by pattern screen  specify the conditions for determining the    pattern similarity      Name column  Save column    To save the calculation results to a file  check the Save column and enter the name  If  the calculation
126. ters of the same  phylogenetic pattern are displayed in the same row in an aggregated manner  and no cluster    headers are displayed     20    RECOG Client User   s Manual       ingroup outgroup Homology cluster    Subcluster Cluster   Outer ortholog group    Homology Cluster  Inner ortholog group     Subcluster 1  Subcluster 2 Outgroup  cluster       Subcluster 3  Subcluster 1  cluster    2824 1       Nested table structure RECOG screen display    5 8  Info tab    The Info tab displays the cluster selected on the PPM and the gene information in that cluster     In the aggregate mode  multiple clusters with the same phylogenetic pattern are displayed       Info   Histogram    Aminoglycoside N3 acetyltransferase  Cluster ID Gene Name Description  Aminoglycoside N3 a       Organism Locus Tag  ber BCAH187_A2976    BCAH820_2929  BCE_2969  BC2919    BCB4264_42939    Contents  i bce   LocusTag   BC2919    GeneName    a o OOo  e    An ler       Expression  567 0    1  Cluster description field  at the top        The cluster selected on the PPM is described        21    RECOG Client User   s Manual       Cluster information table  upper table    The cluster information selected on the PPM is displayed  When the table is  double clicked  a browser is activated and the details of the cluster information are shown   Also  by right clicking the selected cluster on the table  various functions  including  Multiple Alignment and the display of a Regional Genome Map  for the selected gene 
127. the    logical conditions between the sets       To add gene cluster sets to the combined conditions  select the sets in the Set column    and click the  gt  gt  button        To add multiple gene cluster sets to the combined conditions as a common set  select    the sets in the Set column and click the  amp  gt  gt  button       To add gene cluster sets to the combined conditions as a negative condition  select the       127    RECOG Client User   s Manual       sets in the Set column and click the Not gt  gt  button     To add multiple gene cluster sets to the combined conditions as a common set of    negative conditions  select the sets in the Set column and click the Not amp  gt  gt  button     To remove a condition from the combined conditions  select the condition in the    Condition column and click the Delete button     To modify the order of priority of the combined conditions  select a condition in the    Condition column and click the Up Down button     To change a condition in the combined conditions into a negative condition  select the    condition in the Condition column and click the Flip Not button     To modify the negative condition in each set contained in the combined conditions   double click the condition name in the Condition column and modify it on the Edit    condition screen displayed     El Edit condition       Test gene set 1  Test gene set 2  Test gene set 3    Flip Not    Delete                  To modify the color applied to the combined condit
128. this is checked  the gene properties retrievable on the RECOG server are searched on    that server  If unchecked  all the gene properties are searched locally          Representative value of multiple values set for a gene property       If Value and Difference are specified  and if multiple values are set for a gene property    of a gene  the method of applying the search conditions is specified      lt  At Least One  If at least one of the multiple values meets the condition  it is    considered that the search condition is met      lt  All  If all the multiple values meet the condition  it is considered that the search    condition is met      lt  Average  If the mean value of the multiple values meets the condition  it is    considered that the search condition is met        This applies only to numerical type gene properties     4  Upon the completion of the search process  the Keyword Search Result screen is displayed     In Filter   Color on the control panel  the conditions corresponding to the search result are    displayed  On the PPM  only the clusters containing the searched genes are highlighted         Control     Sp Group    Fitter  Taxonomy Filter  Gene Organism Count Filter  4 Keyword Search     Keyword Search Gene Descritpion r  Gene Set Filter  Cluster Set Filter  Organism Column  Sort  Color          Keyword search Gene Descritpion reguli       Set    al Keyword Search Result    9 9 9 9 9 o o o o o o o o o o ook    E 02356155     _GeneName     Organism De
129. ual information  Phylogenetic pattern cluster  PhyloPatClust  Phylogenetic pattern clustering is carried out   and the data are sorted according to the    hierarchical tree     13 3  Sort based on properties    The phylogenetic patterns are sorted based on the gene properties provided by the RECOG    server or the gene properties registered as described in    29 1 Registration of gene properties        1  Click the right mouse button on the PPM and click Sort     Gene properties    to display  the Sort by gene properties screen     Sort by gene properties       B anthracis Sterne          Asc     AA length       Asc v          Asc Y v    2  On the Sort by gene properties screen  specify the gene properties for the sorting and the                   ascending order  Asc  descending order  Desc      To add the sorting conditions  click the Add Condition button        08    RECOG Client User   s Manual       Click the Option button  and the following setting column is displayed          Representative value of multiple values set for a gene property     If multiple values are set for a gene property of a gene  specify the method of  determining the representative value for the sorting    lt  Min  The minimum value among multiple values is used    lt  Max  The maximum value among multiple values is used    lt  Median  The median among multiple values is used    lt  gt     Average  The mean value among multiple values is used           Representative value of multiple genes in a ce
130. us ATCC 14579  IN Bactht cereus 64254          Controt    Sp Group   Be  ACen Set    D Gene Set Conbirstion  S OY Gere Corbiration 1     Test cere st 1  amp  Teal gers sel 2  amp  Test gene set3     Tesi cere se 1 3 Test gers sel 2     Tesi cere s 23 Test gare sel S     Tesi cere sti  amp  Test gere sel S     Tesi cere 2241     Tesi cere se 2     Tesi cere se 3   E  Custer Set  I Cluster Set Corkunation  S Dy ster Conkination 1     Tesi cluster set 1  amp  Test cluster set 2  amp  Test cluster set 3     Test cluster set 1  amp  Test chester set 2     Test cluster set 2  amp  Test cluster set 3     Test cluster set 1  amp  Test cluster set 3     Test cluster set 1 SCC SCAHIG  A2S22 ECAH 20 2776   BLE 2801     Test chuster set 2     Test chister set 3 ACP SCAHIS7_ASSSO BCAHI20_ 5505   ECE 5539    Bre 2804    BLE 2603          129    RECOG Client User   s Manual       Silas    31 3     Editing a combined set    To display the Gene Cluster set combination screen  select a combined set in Gene Set  Combination Cluster Set Combination on the set control panel  click the right mouse    button  and click Edit gene cluster set combination     Modify the conditions on the Gene Cluster set combination screen  For the method of    setting the conditions  refer to    31 1 Registering a combined set        After modifying the conditions  click the Apply button     Removing a combined set    To remove a combined set  select the combined set in Gene Set Combination Cluster Set    Combination
131. ut on the RGM          JE  Zoom out  in the Toolbox     respectively  on the Regional Genome Map screen     Display Nondisplay of the Locus Tag    Click LE     Label ON OFF  in the Toolbox on the Regional Genome Map screen  When    the button is displayed in color  the Locus Tag is displayed  and when the button is  displayed in grey  the Locus Tag is hidden     Setting the gene color    To display the Color Setting screen  click is  Color Setting  in the Toolbox on the    l    Regional Genome Map screen    2  Set the gene color on the Color Setting screen    Sort Display on the PPM   Function Category The color allocated to the current Function Category is used    Cluster ID A unique color is allocated to each cluster  The color is  allocated according to the gradation based on the cluster ID   Although it is difficult to distinguish between the colors  the  allocation remains the same even if the dsplayed region  changes    Pattern A unique color and pattern are allocated to each cluster  The  color and pattern are allocated to each currently displayed  cluster in the order of cluster size  Although the color and  pattern are easily distinguishable  the allocation changes with  the displayed region    Use the color The color currently in use on the PPM  the color set in Color on    currently assigned on the control panel  is used   the PPM    3     Click the Apply button        81    RECOG Client User   s Manual          Color Setting    Function Category   1 Amino ochd Dlo
132. ut on the core structure image  click  Zoom in    Zoom out     in the Toolbox on the Core Genome Alignment screen     26 14  Searching by gene name Locus Tag    1  In the column for item selection in the Toolbox on the Core Genome Alignment screen   select Gene Name  Locus Tag or Both  If Both is selected  a search according to both the    gene name and the locus tag is conducted   2  Enter a keyword in the keyword entry column     3  Click the Search button  and the ortholog group to which the searched gene belongs is  highlighted on the Overview  and the viewing area of the magnified view is scrolled so    that one of the ortholog groups hit by the search is displayed in the center of the screen     4  If the Search button is clicked under the same conditions  the viewing area of the  magnified view is scrolled so that the next ortholog group containing the searched gene is    displayed in the center of the screen     id Core Genome Alignment          e w Fe Fe Fo G   6 S F  E vb YD s se     FF ha ah Keer ah eh SP a SMP we Ee EE ah eh oh FF   a ae ed a ee IISI  a a ee ee et   ot       a   Pcenginesa   P cemginesa   P cereginesa   P nendocina   P ertomophtla   P flucrexens   P flucremens   Pputda   P putida   P stutzen   Pay rningae   Py ringae   Payringae       103    RECOG Client User   s Manual       26 15  Printing the core structure image    1  Click File     Preview on the Core Genome Alignment screen  The Core Genome  Alignment Preview screen is displayed     2  Click
133. utgroup     3  Click Y  Ortholog Clustering  DomClust   in the Toolbox to display the    Execute domclust screen     4  On the Execute domclust screen  enter the analysis name in the Name field  The result is  automatically saved under the analysis name entered here   If the default parameter set is used  select    Default parameters     if the parameter set  executed immediately theretofore is used  select        lt  lt Last Parameters gt  gt      if DomClust is  conducted using a parameter set saved previously  select the parameter set saved in the  Parameter set field     ta Execute domclust    Q  Start Ortholog Clustering  DornCusty        Name cluster _2008xxXXx   ingroup 5  Outgroup 1    frente eau Prone JE                RECOG Client User   s Manual       To specify a new parameter set  click the Parameters    button and set it on the displayed  parameter setting screen  For details on the parameters  refer to 37 1 DomClust    Parameters     To save the parameters that were set on the parameter setting screen  click the Save  parameters button  To remove the saved setting  specify the parameters to be removed in    the Parameter set field and click the Delete this parameter set button     bel Execute domclust       Start Oriholog Clustering  DorChsty     Name cluster_2009XXXX        Outgroup 1    Farts  ersa Parana z    Orholog Clustering  Domt kest  Setting      Delete this paramler set    Cott BLAST E value   Cutoft DP score   Cufott PAM distance   Cutott percert ide
134. value     If this is checked  the threshold value is automatically divided equally in  consideration of the possible range of the properties immediately after the change    thereof        03    RECOG Client User   s Manual        lt   Color of genes don   t have the gene property       This is used to set the color of the gene with no specified property value     be Color genes by properties       Organism ll    Property lAa length    Remove color       Ario value       Value     Rank  The method for assigning arank     Al species  O Every species    actomelically update value  Coker of genes dont have the gene property LJ          If the properties are of the enumeration type      lt  Set the color for each possible value      lt   Color of genes don   t have the gene property       This is used to set the color of the gene with no specified property value     Ed Color genes by properties       Organism All    Property estat a A    Coke of genes dont have the gene property al       RECOG Client User   s Manual       3  Click the Apply button on the Color genes by properties screen   Each cell is displayed in color as per the conditions specified on the PPM        BAIFA    5d    GX0030 GTNG_0029 TENES abrB   ENTERT EPSEN GTNG_2665 AFERE acci  BCE 3533 acca   BAsd039 BCE 4258 2400 CTNC 2331 accB  BASdO38 M2398 GTNG_2330 081385 acct  BAS4495 2742 GTNG_2866 need  HI HIRERT  083010 EGG   BASIO52 GTNG_0533 0E2404 ace    BASIOBS   BCE_1228 062405 aceB  BAS4535 60463  KEREN GTNG_268
135. word search   See    17  Keyword search      Gene Set Filter Filtering by combined gene set condition   See    31 4 A combined set is set as the filtering  condition      Cluster Set Filter Filtering by combined cluster set condition   See    31 4 A combined set is set as the filtering    condition        10 2 2  Sort setting  Sort   The sort condition that is applied to the PPM is displayed in the Sort folder of the control    panel  For the operational procedure  see    13  PPM sort        10 2 3  Color setting  Color   In the Color folder on the control panel  the gene cluster color that is displayed on the PPM  and the comparative genome map view is set  The coloring is applied in descending order of  the alignment sequence on the control panel  and is overwritten  the upper one has priority  over the lower one   The order can be changed by dragging and dropping an object  Also  the  color settings can be enabled disabled by clicking Enable Disable     The following color settings can be specified        43    RECOG Client User   s Manual       Gene property Color setting based on gene cluster property   See    12  Coloring by property      Neighboring cluster Color setting based on the result of neighboring gene clustering  See     24  Neighboring gene clustering      Keyword search Color setting based on the keyword search result   See    17  Keyword search      Gene Set Color setting based on a combined gene set condition   See    31  Combined set      Cluster Set Colo
136. yword Search      Cluster   Gene         All species                   AND v              AND v vi                      Add Condition Clear Condition       Show search results Close    For the search  the following marks can be entered in the keyword entry column     Type of search Example  Match search Word   Partial match search   word    Prefix search word    Suffix search   word   Or more  gt  10   Or less  lt  10   More than  gt 10   Less than  lt 10   With keyword inside  ABC  Without keyword inside  word  Multiple word search word  word2  Phrase search    wordl word2          Genes that include a phrase matching    word    are  searched        6    Genes that include the phrase     lt   word     are  searched    Genes that include the phrase    word     are searched   Genes that include the phrase     word    are searched   Ten or more genes are searched      Ten or less genes are searched      More than ten genes are searched      Less than ten genes are searched       Genes with    word    inside are searched     Genes without    word    inside are searched   Genes that include word1 or word2 are searched     Genes that include word  and word2 as a phrase are    searched        In the    Description    field  a partial match search is conducted        72    RECOG Client User   s Manual         The inequality sign is valid only for numerical type gene properties     3  Click the Option button to specify the following conditions           Search on the server       If 
137. yzed  The taxonomic tree of all the species registered in the RECOG server is displayed   On the tree screen  it is possible to choose the species  ingroup  outgroup  for DomClust    analysis and to color the species   2  Selected tab for PPM manipulation    This tab consists of upper and lower views    The upper view displays the taxonomic tree for the organisms  shown on the PPM  that  are currently targeted for analysis    On the tree screen  phylogenetic pattern filtering conditions  species groups  efc  can be  set     The lower view changes by selecting one of the following upper buttons        Control  A view for controlling the display of ortholog table  including the color settings  filter  settings  sort settings  and the switching of the order of display or the  display nondisplay of the PPM species  is displayed        19    RECOG Client User   s Manual          Sp Group  Species groups are displayed       Set  Currently registered gene sets  cluster sets  compound conditions and species sets are    displayed     3  Clustering tab    The hierarchical clustering tree obtained from phylogenetic pattern clustering is displayed     5 7  Phylogenetic Pattern Map  PPM     The Phylogenetic Pattern Map  PPM  is the main feature of the RECOG system  In the  PPM  genes that belong to each ortholog group are listed in a table in which ortholog groups  and species are arranged in the rows and columns  respectively  When the display area is  reduced  a pattern representing the
    
Download Pdf Manuals
 
 
    
Related Search
    
Related Contents
SAM3-P256 development board user`s manual  USER MANUAL - NODC  Magnat Pro Power 203  会社案内パンフレット/【PDF】  Manuel d`utilisation  EP5555T Datasheet (English)  WIKING Scandic WIKING Nordic 7 WIKING Nordic 9  Manuale dell`utente Ammortizzatori posteriori 2004  GUÍA DE INSTALACIÓN  RAD-ZIPI Manual    Copyright © All rights reserved. 
   Failed to retrieve file