Home

User Manual for GIGI-Pick v1.06

image

Contents

1. 2 Marker file this is a composite file that contains the map positions of framework markers in centiMorgans assuming the Haldane map function allele frequencies of framework markers and genotype data of framework markers 3 Parameter file used to run gl_auto Refer to the documentation of MORGAN for guidance on setting up these files and on running gl_auto Example files used to infer Vs using gl_auto are included under the example gl_auto_example directory Overview From running gl_auto we need to obtain 2 files a Framework IVs file GIGI Pick uses this file which contains IVs at framework positions b Pedigree file see Case A for the format 10 Obtaining the files a Framework IVs file In gl_auto s parameter file we must instruct gl_auto to display the output as Meiosis Indicators instead of Founder Genome Labels Please make sure we use the option output meiosis indicators instead of output founder genome labels in the control parameter file of gl_auto 11 Other Information 1 The coverage metric does not differentiate between subjects who are and subjects who are not likely to carry a variant that affects the trait Branches of a large pedigree that are thought to be uninteresting should be designated as unavailable for sequencing in order to focus attention on the pedigree components that may carry variants of interest to the trait It can also be useful to manually select at least one subjec
2. 2 The pedigree data must be ordered in the form that ancestors must preceed descendants For example subject 101 and 102 must be specified before 201 and 202 Subject 201 and 2010 must be specified before subject 301 Please refer to http www stat washington edu thompson Genepi MORGAN Morgan shtml for more information Case B pedigree Here the pedigree refers to the pedigree meiosis file obtained from gl_auto s console output The pedigree meiosis specifies the pedigree structure and the index of meioses that GIGI Pick needs to use to read in the corresponding Inheritance Vectors GIGI Pick imputes genotypes on subjects specified in this pedigree Please refer to the Inferring IVs using gl_auto section for creation of this file frameworkIV The Inheritance Vectors file describes the descent pattern of chromosomes at the positions of the framework markers It is the output file that gl_auto generates The content of this file is generated by gl_auto When you run gl_auto you should instruct gl_auto to print Meiosis Indicators instead of Founder Genome Labels numlV This corresponds to the number of samples that the user actually prints to the the frameworkIV file frameworkMap map positions of framework markers The marker map positions of the framework markers file is a text file which contains the map distance in centi Morgans cM based on the Haldane map function Markers must be ordered in ascending order and consistent
3. also understand the importance of backward compatibility You may continue to use gl_auto s output from the pre v3 2 and the pedigree meiosis file in place of the pedigree file GIGI will detect which kind of the file you are using and read it properly Installing GIGI Pick Simply unzip the files navigate to the code directory and type make If make does not work go to the GIGI Pick cpp s directory and install the program by g GIGI Pick cpp o GIGI Pick Running GIGI Pick GIGI Pick accepts a parameter file To run GIGI Pick type GIGI Pick param lt parameter file gt lt options gt To run the example file go to the main GIGI Pick program directory and type GIGI Pick Then type Example without the quotation and press enter or equivalently for this example that comes with the software GIGI Pick param selectionExample param IV_formatted_v32 txt Users can run GIGI Pick using the interactive mode or the automatic subject selection batch mode The interactive mode allows users to have extensive interaction with the program to evaluate different design choices This mode is assumed by default To run the batch mode instead use the batch e g GIGI Pick param selectionExample param txt batch 10 to ask GIGI Pick to select 10 subjects for sequencing and outputs to the current working directory and GIGI Pick param selectionExample param txt batch 10 outD selectionExample batchOutput as
4. with the order used in gl_auto Each line contains the position of a marker position of Markerl position of Marker2 position of Marker3 position of MarkerN eg 1 0 2 0 3 0 4 0 samplingPoints Region on the chromosome to sample in Haldane map position in centi Morgan For example if the bounding intervals of the region of interest are 48 and 52 the file would be a single line containing the two numbers e g 48 52 otherPedigreeInformation This file specifies additional details about the pedigree This file contains three columns The first column is the subject ID The second column specifies who are available for sequencing in the pedigree O no 1 yes The third column specifies who have already been chosen for sequencing O not pre chosen 1 pre chosen e g 10100 10210 20110 20211 201000 30110 In this example subject 102 201 202 and 301 are available for sequencing and subject 202 has been pre chosen for sequencing Inferring IVs using gl_auto for Case B The first step to impute genotypes is to use framework markers to infer IVs Framework markers are a relatively sparse set of markers that are used to infer Vs on a chromosome of interest For this purpose we use gl_auto a program in the MORGAN package that is freely available at http www stat washington edu thompson Genepi MORGAN Morgan shtml To infer IVs in gl_auto we need to supply the required files in MORGAN format 1 Pedigree file
5. NY WARRANTY without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE See the GNU General Public License for more details There is NO WARRANTY for the program to the extent permitted by applicable law In no event unless required by applicable law will any GIGI Pick copyright holder be liable to you for damages including any general special incidental or consequential damages arising out of the use or inability to use the program including but not limited to loss of data or data being rendered inaccurate or losses sustained by you or third parties or a failure of the program to operate with any other programs Acknowledgement We thank Elizabeth Thompson for providing valuable comments This research was supported by funding from National Institutes of Health grants R37GM046255 PSOAG05 136 RO1AG039700 RO1MH094293 and ROOAG040184
6. User Manual for GIGI Pick v1 06 Author Charles Y K Cheung cykc uw edu Ellen M Wijsman wijsman uw edu Department of Biostatistics University of Washington Last Modified on 2 3 2015 Contents IntrodU CHO inst td A A data 3 Citing CAI r 3 Olt Ware UREA e e a 3 Files in GIGI Pick software distributiON ooocccccononononannnonnononnnnnonononocnononnnnononnnnonononnnnn no nnnnncononnnnanononnnos 3 What s new in version 1 00 tii tia 3 T st llin Ce PU dd dias 4 Rima GIGEPICK ar A AS 4 A chcc sess e a a n a a a ai aa a aiaia a 5 TWO USE Cas A A dt 6 GIGI Pick s Parameter Ple uu ica 6 Pile HOP AUS ccs trade 7 Inferring IVs using el auto for Case Bani 9 Other Information viii dais 11 O o OO A 12 ME AA EN A E A aE 13 License St adan 13 a A cabeautledncinsndes 13 Introduction GIGI Pick is a computer program to evaluate sequencing choices and to suggest who to select for sequencing in pedigrees We assume that a common goal is to collect sequence data on subjects in pedigrees to identify risk alleles and to accomplish this goal we seek to first sequence a subset of subjects in the pedigrees and then use imputed genotypes to identify risk alleles GIGI Pick can use the location where there is evidence of linkage to increase the chance of imputing well in a particular region of interest In this case GIGI Pick uses existing markers from linkage analysis to infer inheritance vectors which are then used to recomm
7. e chromosomal region is not available or if multiple trait phenotypes are collected on a pedigree To compute the genome wide coverage GIGI Pick only requires a pedigree structure file B Local coverage may be desired if the user intends to prioritize the search of causal variants on a specific region on a chromosome To compute the local coverage GIGI Pick needs the sampled inheritance vectors IVs file from gl_auto and other corresponding files See the section Inferring IVs using gl_auto for Case B GIGI Pick s Parameter File The parameter file tells GIGI Pick where to look for the required files For case A GIGI Pick only requires the minimum of one file that describes the pedigree structure For case B GIGI Pick requires the minimum of a pedigree meiosis file that user prepares from the output of gl_auto an IVs file from gl_auto a file specifying the map positions of framework markers and a file specifying the map position of interest to optimize for genotype imputation An example of the parameter file is found in the example directory under selectionExample param IV_formatted_v32 txt In GIGI Pick_v1 06 the parameter file is organized as follows pedigree selectionExample ped32 MORGAN ped frameworkIV selectionExample frameworkPanel IV v32 txt numIV 100 frameworkMap selectionExample framework map samplingPoints selectionExample dense_48_52cM map otherPedigreeInformation selectionExample ped52_can_need_select txt For Ca
8. end subjects to be sequenced This situation usually arises when linkage analysis has already suggested a chromosomal region to prioritize Alternatively GIGI Pick leverages the pedigree structure to suggest subjects for sequencing which may be more suitable in situations when multiple phenotypes are of interest or when prior evidence of linkage is not available Citing GIGI Pick Cheung CYK Blue EM Wijsman EM 2014 A statistical framework to guide sequencing choices in pedigrees American Journal of Human Genetics 94 257 267 Software URL GIGI Pick is developed under the linux environment The program is available for download at http faculty washington edu wi sman software shtml Files in GIGI Pick software distribution GIGI Pick software code and its dependency files the Mersenne random number generator e example folder What s new in version 1 06 We have made changes so that GIGI is now compatible with the new Inheritance Vectors file format generated by MORGAN s gl_auto version 3 2 Because this new file format no longer requires users to provide GIGI with the meiosis indexes GIGI can now directly use the MORGAN pedigree file In the past version that uses MORGAN s gl_auto version 3 1 or earlier users must create the pedigree meiosis file from manually parsing the console output of gl_auto Therefore to avoid confusion we suggest our users to use the pedigree file instead of the pedigree meiosis file from now on We
9. ks GIGI Pick to writes the output files to the specified selectionExample batchOutput directory Options The flags are case sensitive Flag param FILE outD DIRECTORY m MAF batch MC Purpose To specify the path of the parameter file To specify the absolute directory path of where the output files will be created If this flag is missing the output files will be saved to the user s current directory Note the outputs are only saved to a file when the batch mode is run To specify the parameter value for the joint prioritized selection algorithm default m 1 When m 1 the joint prioritized selection algorithm reduces to forward stepwise selection The GIGI Pick paper refers to this parameter as y To specify the minor allele frequency between 0 and 0 5 used to calculate coverage The GIGI Pick paper refers to this parameter as a To run GIGI Pick as the batch mode specifies the number of subjects to select Number of independent random Inheritance Vectors samples to take when genome wide coverage is used as the metric The default is 1000 Two Use Cases GIGI Pick uses a metric called coverage to compare sequencing choices The genome wide coverage measures the coverage at a random locus in the genome and the local coverage measures the coverage at the position of interest in a specified region A Genome wide coverage may be desired if prior information about a candidat
10. se A only the pedigree is required otherPedigreeInformation is optional For Case B pedigree frameworkIV numI V frameworkMap and samplingPoints are required otherPedigreeInformation is optional Notes I suggest using the absolute paths of the filenames instead of relative paths The parameter file in the example folder is created using a relative path File Formats Examples of these files are provided in the selectionExample directory refer to the param txt for the filename of these files Case A pedigree The pedigree file should either be the ordered MORGAN compatible pedigree structure OR the pedigree meiosis file Please refer to Case B for the description of the latter file An example of the ordered MORGAN compatible pedigree structure file input pedigree size 6 input pedigree record names 3 integers 2 input pedigree record trait 1 integer 2 28 2K 2k 2 ok 1010010 1020020 201 101 102 10 202 101 102 2 0 20100020 301 201 201010 In bold are the content that can be changed The first 3 lines contains information describing the pedigree The 4 line is simply five In the remaining lines the first column is the subject ID the second column is the ID of the subject s father the third column is the ID of the subject s mother the fourth column is the sex 1 male 2 female and the fifth column is the dummy trait integer value just put 0 Note 1 The pedigree data must begin on the 5 line
11. t with an unusual phenotype of interest to start with e g a subject who is affected 2 A future topic is likely to add other metrics for subject selection 12 Appendix A In versions prior to 1 06 GIGI Pick only supported inheritance vectors from MORGAN gl_auto version 3 1 or earlier If you generated the inheritance vectors file using MORGAN gl_auto version 3 1 or earlier you must use the Pedigree Meiosis file instead of the Pedigree file See selectionExample param txt and selectionExample ped52 pedMeio Pedigree Meiosis file for versions prior to GIGI Pick ver 1 06 We need to create the pedigree meiosis file from the console output of gl_auto It is very easy to make this file When we run gl_auto the program prints a huge amount of output to the console This console output actually contains the content of the pedigree meiosis file that we need to extract 1 In order to extract this content we first need to direct the console output to a file by using the gt directive so we can subsequently extract the content from this file ie gl_auto gl_auto_parameter_file gt glauto_console_output txt 2 Then we extract the pedigree meiosis content from the console output to a new file To simplify the creation of this file use the Perl script extractPedMeiosis pl e Usage perl extractPedMeiosis pl glauto_console_output txt FILENAME_PED_MEIO o We need to have Perl installed in linux o assuming glauto_console_outp
12. ut txt is in the same directory as extractPedMeiosis pl e Alternatively this file can also be easily extracted by the user from the console output of gl_auto Refer to the example below or the selectionExample ped52 pedMeio file to see what this file looks like Creating the Pedigree Meiosis file manually from the console output of gl_auto Using a text editor we open the console_output txt and fetch the line that begins with name name pa name ma Compnt pat meio mat meio We copy this line and table below We paste this table to another file and save it The file includes the header line and looks like this name name pa name ma Compnt pat meio mat meio 2100_6 0 Or 1 0 0 2100_21 0 0 1 0 0 2100_25 0 0 1 0 0 2100_29 0 0 1 0 0 2100_31 0 0 1 0 0 2100_39 0 0 1 0 0 2100_907 0 Oo 1 0 0 13 2100_908 2100_901 2100902 1 2 1 2100_909 0 0 1 0 0 2100_910 2100_901 210092 1 4 3 2100_911 2100_901 21009002 1 6 5 2100_915 2100_907 21009008 1 8 7 until the end of table ChangeLog The ChangeLog describes the recent changes to the program It is located in the root directory of the uncompressed file License GIGI Pick is a free software you can redistribute it and or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation either version 3 of the License or at your option any later version This program is distributed in the hope that it will be useful but WITHOUT A

Download Pdf Manuals

image

Related Search

Related Contents

7 - Beomanuals.com  取扱説明書 - AsReader Developer  Sears 385.1915 Sewing Machine User Manual  PDF形式の東村山市情報化計画  Desq 1551 flat panel desk mount  135/150/175/200 Verado FourStroke  twall®Basic16 standard Color mobile foldable User Manual Rev. 1.0  Samsung Galaxy Spica Käyttöopas  SBook: - Simson Garfinkel  User's Manual for Bending Presses  

Copyright © All rights reserved.
Failed to retrieve file