Home

GOlorize User Guide

image

Contents

1. YCRO12W a LYNL307 i Node Attribute Browser Edge Attribute Browser Network Attribute Browser Welcome to Cytoscape 2 4 0 Right click drag to 200M Middle click drag to PAN The BiNGO Settings panel pops up In this panel the user can specify several parameters for the GO over representation analysis such as the type of a statistical test and multiple testing correction used as well as the significance level e g p lt 0 05 which controls the number of enriched categories that will be outputted For more information about the BINGO settings and its operation please see the BINGO manual at http www psb ugent be cbd papers BiNGO manual htm The particular selections for our example case are shown below Press Start BiNGO button to proceed the example GAL C Paste Genes from Text cy T ia i i lA al T 5 as oO it mM c Z m a a ae La Patt A A t A Wi c rb c S a EL rb 3 r fa a 3 a i t a m a C Underrepresentation EJNo Visualization o o Tm p Tm TT Tm wi Tm co Ea rH Hypergeametric test Benjamini amp Hochberg False Discovery Rate FDR correction 0 05 Overrepresented categories after correction Test cluster versus whole annotation La La La m Li La me me me z me me rb rt aa o it im a mn ri 5 m m iA La 7 rt fl A 3 mm 5 rst
2. category and other available data from GO The icons below the Layout column V indicate whether or not the corresponding categories will be used in the layout process and the last column X can be used to exclude the categories from the subsequent node coloring and placement steps GOlorize e x Start BINGO Apply coloring All nodes in view Coloring effect Default pie size w Level of Details f Selected Layout i GAL i All i Biological Process K Cellular Component Molecular Function Other CO default Saccharomyces cerevisiae pracess Add GO category Class Description Attribute 6096 glycolysis CO default Saccharomyces cerevisiae process FO49 cell cycle CO default Saccharomyces cerevisiae fpracess 6666 intracellular protein transport CO default Saccharomyces cerevisiae fprocess 15031 protein transport CO default Saccharomyces cerevisiae process 161592 vesicle mediated transport CO default Saccharomyces cerevisiae process 6082 organic acid metabolism CO default Saccharomyces cerevisiae process 6519 amino acid and derivative metabolism CO default Saccharomyces cerevisiae process Step 4 Coloring the selected nodes If desired the visualization can be focused only on the neighborhood of the selected categories by node selection mechanisms The Select nodes button applies to the categories with the left most check boxes selected It can be used in conjunction with other selection options in Cytoscape choo
3. compact node classes an example below Such a layout emphasizes the connections between the classes or metanodes while the within class connections are not so clearly visible anymore In the Advanced Settings the user can adjust also several other parameters including the number of iterations performed in the two layout phases the strength of the attraction within a class in the initial layout phase 1 and the strength of the standard attraction between two connected nodes Density of nodes Two alternative modes to the initial placement of the nodes is implemented In the first one all the nodes start from random positions whereas in the second one the standard and class nodes are initially placed on two circles within each other this provides reproducible solutions within a session Advanced Settings Iterations in Lst phase Iterations in nd phase 400 Intra Group Attraction 1 Density of nodes Initial node placement Randomize A stable solution can be obtained even for a large network consisting of hundreds of nodes in a few seconds on a standard workstation The fast operation of the layout algorithm makes it possible to experiment with different parameter combinations and different starting positions The aim of the tool is not to enable a fully automated visualization tool but rather to facilitate in discovering whether or not there is an intrinsic GO class structure hidden in the network of original interaction
4. o m m ei ee D m a E T bere T T on m by Sy im m A m an rat TT a mm mm ri TT i al m h A rE 2 z Se g ri m T m a LA im ri pu n co tv q q q q 4 GO_Biological_Process Select organism annoatation and gene identifier Saccharomyces cerevisiae Le a pu a OL am 2 2 n oo um OL rt mo mm 7 oO OL rt wi ri rt ri ZT co a mm LA ral wo Cc Fa Start BINGO Step 3 Choosing the GO categories Having parsed the annotations and calculated the tests and their corrections the BiNGO results appear in the GOlorize tab named after the BiNGO Settings Cluster name GAL in the example below This table lists the GO categories overrepresented in the selected subnetwork The columns include the GO ID term of the category and its description along with the original and corrected statistical significances p val and corr p value the number of nodes in the selected subnetwork and in the complete annotation that belong to the particular category cluster freq and total freq and the node IDs annotated to the category either directly or to its parent categories Start BINGO Apply coloring All nodes in view w Coloring effect Default pie size w Level of Details i Selected Layout i GAL GO default Saccharomyces cerevisiae process Class Description corr p C 6096 glycolysis 6007
5. GO over representation analysis Attributes gallRGexp Auto Threshold Threshold Class Name MIN Add Remove 0 209 Add Remove 0 0115 Add Remove Add Register Classes The below simple example demonstrates how the four classes defined on the basis of the expression data can be used together with GO categories cell cycle in the sample case in the network visualization Color all the five classes with the Auto Colors option and layout the selected network without taking into account the less significant classes 2 and 3 Blue color identifies now under expressed nodes and red the over expressed ones The purpose of this ad hoc example was just to show that different types of attributes and class definitions can be freely mixed in the GOlorize layout GOlorize Start BINGO Apply coloring All nodes in view Coloring effect Default pie size w Level of Details ooo Selected Layout GAL gallRGexp gallRGexp Intra G Attraction 2 3 0 Inter Group Distance 10 Select All Unselect All ti E A Ee E f a Advanced Settings The final sample use case defines classes from string attributes This example is done in the case of gene ontology annotations which are managed in Cytoscape as lists of string attributes for nodes However any other string attribute could be used instead e g imported attributes from an external file in which case the node classification can be freely defined by the user Th
6. GOlorize User Guide for Cytoscape version 2 4 x and BiNGO version 2 x GOlorize is a Cytoscape plug in for advanced network visualization which uses Gene Ontology GO categories as a source of class information to direct the layout process and to emphasize the biological function of the nodes The implementation of GOlorize version 2 4 is based on the BiNGO plug in version 2 x an efficient tool to determine the GO categories that are overrepresented in a selected part of a given network The both plug ins are used within Cytoscape which is an open source bioinformatics software platform for visualizing and integrating molecular interaction networks The main advantage of GOlorize compared to other graph layout tools is the possibility to incorporate the GO class information already in the node placement phase using a modified version of the force directed layout algorithm An extra attraction force is mediated through additional class nodes which represent the GO categories of interest This version of GOlorize enables also the usage of other node attribute information than GO categories when defining the node classes of interest GOlorize takes full advantage of Cytoscape s sophisticated filtering analysis and visualization properties allowing the user to produce customized high quality network images Installation This version of GOlorize and the user guide 1s compatible only with the Cytoscape version 2 4 which can be freely downloaded fr
7. e list of strings attributes makes it possible to define also overlapping node classes or clusters Go to the import menu File gt Import gt Ontology and Annotation select the Gene association file for Saccharomyces cerevisiae and Gene Ontology Full After pressing the Import button the new string attributes are available in the GOlorize attributes window Choose e g annotation GO BIOLOGICAL_PROCESS and press Select and the following new table should appear that lists all the distinct strings for the attribute Attributes protection fram non homologous end joining at telomere snoRNA 3 end processing ribosame export from nucleus unfolded protein response positive regulation of target gene tr GTF biosynthetic process cell cycle cytokinesis contractile ring formation SRP dependent cotranslational protein targeting to membrane t regulation af pH protein amino acid phosphorylation rRNA modification ER to Golgi vesicle mediated transport exocytosis cellular response to phosphate starvation mRNA cleavage protein targeting to vacuole Register Classes Type e g mito in the text field and press enter The resulting table includes all the GO terms that contain the particular string By entering several terms subsequently the user can constraint the class definition further Double clicking on one of the annotations defines the class based on the single string only exact match option Registration adds the new cla
8. en default value of these parameters After validating the selections and applying the color coding to the Cytoscape s Spring Embedded layout the selected subnetwork view galFiltered child should look something like the layout below due to randomness in the layout process the results from different runs of the Spring Embedded layouts may not be identical oF Cytoscape Desktop New Session tll x File Edit Wiew Select Layout Plugins Help SA QQQQ B Abem on H B CytoPanel 1 O E Yeast Network galFiltered gml child 2222232233 LE TT ELE pa Nodes Edges 331 188 362 0 Bi Yeast Network 188 0 184 05 v GOlorize Start BINGO Apply coloring AIl nodes in view Coloring effect Default pie size Details Selected Layout GAL Biological Proces GO default Saccharomyces cerevisiae process w Add GO category iD Class Description Super Class Layout x GO default Saccharomyces cerewisiae process Ree ee III C0 default Saccharomyces cerevisiae process intracellular protein transpot GO default Saccharomyces cerevisiae process protein transpot O_O default Saccharomyces cerevisiae process ea G0 faut Saccharomyces cerevisiae process sail le isi ll CO defauty Saccharomyces cerevisiae process amino acid and derivative metabolism 000 CO defaut Saccharomyces cerevisiae process Select All Unselect al Validate Auto Colors_ The above layout shows that while the color coding can efficiently facili
9. fine tune the placement of the actual nodes Neither the virtual edges nor the class nodes are shown in the final visualization In the Layout tab the user can specify the parameters of the above layout algorithm The two key parameters are the strength of the attraction within a class in the layout phase 3 termed Intra Group Attraction 2 and the extent of which the class nodes are moved in the separation step 2 Inter Group Distance The example layout shown below corresponds to the default values of these two parameters 3 and 10 after pressing the Layout button Again the results can be different between two Cytoscape sessions and even within the same session due to random starting positions and movements ba Cytoscape Desktop New Session B File Edit View Select Layout Plugins Help QAQAA 2 sampien Search z CytoPanel 1 H F gaFitered gmi child fg Network Editor Network 331 188 362 0 galFiltered gmi188 0 184 0 iv GOlorize Start BINGO Attributes Apply coloring All nodes in view M Coloring effect Default pie size M Level of Details Selected Layout GAL Class Description Group Separation GO default Saccharomyces cer GO default Saccharomyces cer GO default Saccharomyces cer protein transpot co aefauitt Saccharomyces cer E CO default Saccharomyces cer rganicacidmetanolism tN CO default Saccha
10. for expression data based class definition should now look like the screen shot below Attributes gallRGexp v gallRGexp Auto Threshold Threshold Class Name MIN Add 1 MAX Register Classes The class definition mechanism recognizes automatically that the selected node attribute gall RGexp contains floating numbers The Threshold column shows the values that separate two consecutive classes and the Class Name column numbers the corresponding classes The thresholds can be adjusted by double clicking the cells and typing in new value By clicking on the cells of the last column the user can add or remove threshold values resulting in a split or merge of the corresponding classes By default only one class is defined that covers the whole range of the particular attribute from minimum to maximum see the example above The below example was created by clicking on the Add cell three times and pressing the Auto Threshold button These data driven thresholds are based on equally distributed quantiles evaluated from the nodes present in the selected network not from the total set of nodes that have a value associated with the gallRGexp attribute more than 6000 proteins The class definitions can be validated by clicking on the Register Classes button This adds four new classes into the GOlorize Selected tab These classes can be utilized in the same way as the node classes obtained with BiN
11. glucose catabolism 19320 hexose catabolism 46365 monosaccharide catabolism 46164 alcohol catabolism 16052 carbohydrate catabolism 44275 cellular carbohydrate catabolism C 6006 glucose metabolism 6092 main pathways of carbohydrate metabolism 19318 hexose metabolism 5996 monosaccharide metabolism 42257 ribosomal subunit assembly 6066 alcohol metabolism L 42255 aoe Su dra ee ee ee be ama oS ca a cluster fr total freq 22 5638 n 32 5638 35 5638 40 5638 43 5638 57 5638 57 5638 64 5638 n 73 5638 85 5638 n 92 5636 53 5638 162 563 62 5638 genes annotated to the term YOROLAW YORZ54W YALOSBW YDROSOC YPLO SW Y YOROLZW YOR2Z54W YALOSBW YDOROSOC YPLO SW Y YOROLZW YGOR2Z54W YALOS8W YDROSOC YPLOF5SW Y YOROLZAW YOR254W YALOSBW YDOROSOC YPLO FSW Y YCROLAW YOR2Z54W YALO38W YDROSOC YPLOTSW Y YCROLAW YOR2Z54W YALO38W YDROSOC YPLO 75SW Y YCROLAZW YGR254W YALO38W YDROSOC YPLO7SW Y YECROLZW YGR254W YALO38W YDROSOC YPLO SW Y YOROLZW YGR254W YALO38W YDROSOC YPLO SW Y YECROLZW YOR2Z54W YALO38W YDROSOC YPLO SW Y YCROLZW YOR2Z54W YALO38W YDROSOC YPLO7SW Y YFPR102C YOROSSC YLROVSW YOLI27W YMLO24W YCROLAW YOLO66C YOR254W YALOSBW YOROSOC Y YPRIO2C YOROS5C YLROVSW YOL12 7W YMLO2Z4W SP ee A a on h Select All Unselect All Validate Select nodes Do Doiayout The user can choose t
12. he categories that will be applied in the layout by checking the corresponding rows It is also possible to select categories from multiple overrepresentation analyses by re starting the BiNGO again from the GOlorize panel perhaps with different ontologies or parameter settings Each BiNGO run is identified by its name in the tab list The ontology being used is displayed above the categories GO Biological Process of Saccharomyces cerevisiae ontology in the example case After checking the categories of interest press the Validate button and click the Selected tab In the Selected tab all the GO categories selected from the BiNGO result s are shown In this panel the user can also manually add arbitrary GO categories by pressing the Add GO category button and typing the corresponding ID terms In our example case we have selected rather arbitrarily seven GO categories shown below which will be used in the network visualization process Although class overlapping is allowed we recommend not use more than 6 terms per node It may be beneficial to select the categories of interest from the the higher level terms rather than using very broad categories Clicking on he GO Term ID number the first column below in the Selected tab opens an AmiGO web page for the particular GO category The page contains a detailed view of information on the GO annotation and it also allows the user to browse query and visualize the cross links between the selected
13. om the Cytoscape project website http www cytoscape org If not already installed on the computer download and install also the Java 2 Runtime Environment version 1 5 or higher from http www java com en download index jsp Place the GOlorize2 4 jar file into the local Cytoscape plugins directory Note that in this version the BINGO annotation files are incorporated directly in the GOlorize jar file and the version does not support the import ontology and annotation mechanism of Cytoscape 2 4 in the BiNGO over representation analysis Therefore for updating the NCBI based GO annotations open the GOlorize jar file with your favorite archive manager and replace the original annotation files with the new files copied from the updated BiNGO yjar file The new BiNGO ar files can be downloaded from http www psb ugent be cbd papers BiNGO index htm Alternatively the GO Consortium annotation files can be updated by substituting the old annotation or ontology files with their new versions downloaded directly from the Gene Ontology project homepage http www geneontology org Tutorial The usage of the plug in is demonstrated with the following walk through example Step 1 Starting the plug in Start Cytoscape version 2 4 x and use the File menu from the Cytoscape main window to import the example network galFiltered xgmml from the sampleData folder Select GOlorize from the Cytoscape Plugins menu to start the interactive layout process The
14. re are two modes how to define the node classes being used in the network visualization The default mode corresponds to finding overrepresented GO categories in the selected network using BiNGO version 2 x whereas the alternative mode allows the user to define the node classes based on other attributes such as expression data or attributes imported from external files The default mode is demonstrated in detail in Steps 2 5 and some illustrative examples of the attribute based class definitions are given in Step 6 Step 2 Using the BINGO settings Select a cluster of nodes in the network view indicated in yellow nodes which will be used as an input in the determination of GO categories that are statistically overrepresented in the network see an example screen shot below Click on the Start BiNGO button in the GOlorize window 3YQYU6ja 0t0 007 hdd Cytoscape Desktop New SESSION sossiiiiiiiiiiiiiiiiiiiihhhiihhhhhhhhhhhhh hihih LAARA We File Edit Wiew Select Layout Plugins Help QAQA F Gi sean search B EAURANEKA OL FE Yeast Network galFitered gmp Hiii fs Network Editor Te TEENE E G E I ra C C S F f 2 T gt N C gt O l Rifi i ca EA y oS D lt LH i redes A A o j me D e A gt s 2 CytoPanel 2 A ID YDR171W aj YMLO24W YILOG9C mM YOL127W YOL120C YDROSOC
15. romyces cer aij acid anid Gerivative metabolism NNN GO defaut Saccharomyces cer Intra Group Attraction 2 3 0 Inter Group Distance 10 Select All Unselect All G 2e g gt Ea Advanced Settings KR 7049 6886 15031 16192 6082 6519 intracellular protein transport KIKIKIKIK KIKIKIKIKIKIK r a A KK x Nodes that belong to the same class indicated by the same color are grouped together and the nodes with multiple GO class memberships pie coloring lie typically in between their main classes Unclassified nodes white color are placed according to their connection structure only the edges The Layout tab also allows user to control whether the placement of a class node in the in the final layout phase 3 is free or fixed to the location determined in the separation step 2 This is specified using the check boxes under the column Group Separation and it can help to decide placements for small or heavily overlapping node classes A nice feature of the layout algorithm is that it is capable of grouping the class members close to each other even if the original network was disconnected especially if the Intra Group Attraction is substantially larger than one This is because the parameter is directly proportional to the standard attraction between connected nodes Increasing the value of Intra Group Attraction and decreasing the value of Inter Group Distance respectively results in more
16. s In some cases such a structure can not be discovered even after trying a multitude of parameter combinations Step 6 Using other attributes This version of GOlorize plug in can deal with several types of attributes values in addition to ontologies annotations when defining the node classes for the visualization Therefore the annotation and term columns in the tables have been renamed as Attribute and Class description respectively Three types of node attributes can be used depending on the data type continuous attributes handle integer or floating numbers such as expression data from e g microarray or mass spectrometry experiments string attributes can handle e g node labels such as lethal or non lethal gene or protein and list of strings attributes can be used as a generic way to define classes based on e g external files To test this feature import an expression data by going to File gt Import gt Attribute Expression Matrix and choose the file galExpData pvals which contains the expression changes together with their associated p values in the particular experiment Click on the Attributes button in the GOlorize panel At the upper left corner of the new panel is a pull down menu listing all the attributes loaded to Cytoscape Choose the attribute gallRGexp which represents the expression changes in response to altered Gall transcription factor and validate this choice by clicking on the Select button The Attributes panel
17. se Small pie size in the Coloring effect to see the selections in the main network view In the example we have selected all the nodes from the galFiltered network that belong to the seven categories as well as their first neighbors and added them into a new network a subnetwork with 184 nodes and 181 edges Pressing the Auto Colors button generates automatically a color coding for the selected node classes Alternatively the user can manually choose the color of choice for each GO category Due to hierarchical organization of the categories each node can belong to none unique or several classes The unclassified nodes have the default node color of Cytoscape visualization adjustable in Set Visual Style menu item If a node belongs to several classes a convenient pie coloring is applied The user can also specify whether coloring is applied to all nodes in the network view or to selected nodes only Displaying and coloring the nodes with GOlorize depends on two Cytoscape rendering properties render nodeBorderThreshold and render coarseDetailThreshold which can be manually modified from the Edit gt Preferences gt Properties menu The Level of details slide bar provides an interactive way to adjust these parameters for the given network and for the selected zoom level The maximum value right corresponds to those values needed to visualize all the colored pie charts in the whole network display The minimum value left corresponds to the giv
18. ss into the GOlorize Selected tab Note that the ontology structure is not taken into account when defining the class membership of a node based on its attributes This means that even if an ancestor of a term match the query string but not the term itself the corresponding node does not belong to the particular class As the BiNGO classes preserves the hierarchy of the terms the two mechanisms for defining node classes can be considered as complementary to each other
19. tate the visual interpretation of the class information it is in many cases not enough for discovering whether or not the network contains a GO class structure superimposed on the underlying connection structure Step 5 Lay outing with GO classes Our modified force directed layout algorithm finds the placement of the nodes based on both their connection structure the original edges and class structure the selected GO categories Globally the operation of the class directed layout algorithm is organized through the following three phases 1 Initial node placement using force directed optimization where in addition to the standard attractive forces between each connected node an extra attractive force is applied to the nodes belonging to the same class The extra attraction is directed by adding virtual edges between class members and a virtual class node representing the particular class This phase finds good initial positions for the class nodes 2 Subsequent separation of the classes by moving the class nodes in the same proportion away from the center of gravity of all nodes This phase aims at providing maximal distinction between the different classes while still preserving the relative placement of the nodes obtained in the initial layout phase 3 Final layout phase uses the same force directed optimization process as in the step 1 but with class nodes fixed to the positions determined in the step 2 The aim of the step is to

Download Pdf Manuals

image

Related Search

Related Contents

製造業で役立つ技術英語基礎講座    Sony LA-EA2 Operating Instructions  Manual de Usuario  Manual de instrucciones Instruction Manual  

Copyright © All rights reserved.
Failed to retrieve file