Home
User Manual 5.0
Contents
1. ADARCS ADARC6 lt ADARCS ADARC4 lt ADARCS 4 ADARC2 Exporting contig summaries When you are displaying the Summary view of a contig see Summary Report View in Chapter 12 Editing Contigs you can export the contents of the report to a text file Go to the File menu and choose Export then the Summary submenu This is particularly useful if you wish to add annotations to a report of your assembly in standard page format SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 163 20 Output In this chapter we discuss printing options We will explain how to set up pages copy pictures print and use different report formats Copy picture Sequencher can copy the images from many of its windows including text windows to a picture on the clipboard Select the window you want copied go to the Edit menu and choose Copy As then click on Picture Print setup Before printing go to the File menu and choose Print Setup to change the options that are specific to your printer Some options that may be available on your printer include portrait or landscape printing vertical or horizontal orientation paper size reduction enlargement or special handling for color printing See Chapter 21 Customizing Sequencher and User Preferences for more information
2. Contig The Contig preferences control the vertical sorting of the fragments when a new contig is created or when fragments are added to an existing contig You can use the Contig Font option to change both the font and the font size This allows you to use fonts other then the default Monaco 9 point font in the contig editor You can see a sample of your current font selection below the font Size drop down menu in the Sample portion of the preference pane SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 178 Figure 21 14 Contig preference settings User Preferences General B Settings Sort New Contigs Auto Save C Don t Sort e by Position ae C by Date C by Size Menu C by Name C by Strand Sound New Project i E Display Contig Font Chromatogram Font Lucida F Contig Chromatogram Feature Motif Size nn ei Format Ruler Sampl e ACGTN Start Stop Variance Table Input Output File Import Report Feature Motif The Feature Motif preferences control the default settings for new features and motifs When you import a sequence from GenBank into Sequencher these settings determine the name and appearance of the feature within both the Overview and Sequencher s editor windows For more information on GenBank Feature tables refer to the Appendix 28 Feature Keys and Qualifiers When you click the Define Feature Key Default Styles
3. seeeeeeeeeeeeeeeeeeeeees 196 28 APPENDIX FEATURE KEYS AND QUALIFIERG ccccccssseeeseeeeees 199 GenBank Feature tables i2 ssc ceisscts cxcsigeacniacets EEN 199 Pe ature table qualitiers 2 25 sts eis acts tal e lee 201 29 APPENDIX DEFAULT FEATURE QUALIFIERS cccccceessseeseseeees 203 30 GLOSSARY ccwceicanicateciucnateucysantvenuubtewspiceneaviuatoauuseiuccnnicaniannostwesnuccaminnnaate 206 xiii 1 Preface Sequencher is the premier DNA software choice for DNA sequence assembly and analysis Its capabilities include Multiple configurable DNA assembly algorithms Comprehensive DNA sequence editing tools Full support of sequence data confidence values Powerful Reference Sequence and Variance Table to find SNPs quickly and easily Restriction Mapping Extensive data import amp export capabilities including customizable GenBank Feature handling Specialized tools for Forensic mtDNA analysis Comprehensive HTML Help throughout Almost 20 years of daily use by biologists in labs around the world have refined Sequencher s tools and interface You get the power and the speed to get the best most accurate results from your DNA analysis and get back into the lab more quickly Gene Codes Gene Codes is a biotechnology oriented software developer whose headquarters are in Ann Arbor Michigan Our goal is to write powerful software tools that are easy to use If you have any suggestions on how to im
4. Base counts are displayed at the bottom of the window If your sequence also includes confidence scores you will also see a base count for each of the three confidence ranges If you use an ABI sequencing system click the Show ABI Info button Sequencher displays a window with information from the ABI data sheet shown in Figure 8 6 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 50 Figure 8 6 ABI data sheet info Chromatogram Info Quality E Quality eight Data Collection File Name gipt12 br3_k38 1512899 ab1 Sample Name k38 1512899 Comment gipt12 br3 15 Lane Number Number Of Pts 11060 Number Of Bases 755 Run Started At Thu Sep 16 2004 7 57 PM Run Stopped At Thu Sep 16 2004 8 53 PM Gel Data File mot availabe Dye Set Primer KB_3730_POP7_BDTv1 mobj Comb Name grot available Instrument Name DORI 1522 005 Collection Vers 2 0 Data Analysis Primer Peak Loc 1264 Base Call Start 1264 Base Call End 11060 Signal G 2326 A1258 T 1114 C 1948 Metrix Filename grot available Channels Awgd 1 Foalysis vers KB 1 1 Base Spacing 13 9044 Selecting Bases Sequencher offers a number of ways to select bases You can choose commands from the Select menu or you can use a keyboard combination To select bases in the opposite orientation press and hold the Shift key when using any Select Next command from the menu or with a keyboard shortcut
5. Although the Selected Columns radio button is checked you can change your selection to include all rows by clicking the Entire Table radio button You can copy your Translated Variance Table into another document or application that accepts tabbed text To copy the contents of your Translated Variance Table Report click on the Copy as Text button The data can now be pasted into its new location SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 121 Exporting data for selected variants Each row in the table represents a variant If you need to export selected rows make your selection by using Shift click for a continuous range or Ctrl click for a discontinuous range see Figure 13 30 Then click on the Reports button The Translated Variance Table Reports dialog box appears The Selected Rows radio button is checked Three buttons are enabled Cancel Copy as Text and Save as Text Save as Text is the default setting Figure 13 30 Translated Variance Table selected variants z Compare Translation to Reference K DR Description 12 consensus sequences compared to Reference 3 exon Reference Comparison Range Unfiltered Base Positions 15394 15644 Amino Acid Positions 1 83 Cee RL E e A 400 SE AAG 15 406 5 15 412 ATG TAT TAT TAT H M Y Y Y E E E E 12 a e SI WT ti C C eM The rows you have selected will be saved in one table Click the Save as Text b
6. Select All Matches Deselect All Matches Select None Done SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 156 Have Names Containing If you want to select all the sequences in the Project Window whose names contain a specific text string go to the Select menu and choose All Items That and then Have Names Containing to display a dialog box Figure 18 9 Figure 18 9 Have Names Containing dialog box All Items Whose Names Contain Find What Case Sensitive 0 items selected out of 8 Select All Matches Deselect All Matches Select None Done Were Edited On Or Since If you want to select all the sequences and contigs in the Project Window that were edited on a specified day go to the Select menu and choose All Items That and then Were Edited On Or Since to display a dialog box You can use the drop down menu to specify a preset day such as today yesterday 1 week ago or even 2 months ago You can also type a date in the Find What box Note You can also include items edited on all the days since a specific date by checking the And All Days Since box SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 157 19 Exporting Data In this chapter you will learn how to export your work This chapter discusses how to export sequences consensuses and contigs and also how to export defined data such as selected bas
7. button a new window also called Define Feature Key Default Styles appears On the left hand side of the window is a pane listing Feature Keys To set new attributes select the appropriate Feature Key On the right hand side you will see a pane where you can set attributes such as the Default name Display style and Color You can restore the default settings at any time by clicking the Restore to Defaults button Click the Done button when you have finished When you click on an item in the left hand Feature Key pane you will see the same text appear in the Default Name box In most instances there will be extra text enclosed in square brackets The enclosed text is the default feature qualifier GenBank have defined several Feature Qualifiers for each Feature Key You can replace the text within the square brackets with your preferred Feature Qualifier For more on Feature Qualifiers see Appendix 28 Feature Keys and Qualifiers and for a listing of Sequencher s default Qualifiers see Appendix 29 Default Feature Qualifiers Note If a Label indicated by label some text is used in a Feature Table it takes precedence over any default keys or qualifiers since it unambiguously identifies a feature or item Click the Display Feature in Editors checkbox if you want a specific feature to be displayed in the sequence or contig editors SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights
8. 1991 2011 Gene Codes Corporation Inc All rights reserved 111 Figure 13 18 Review mode showing Sister Tables Compare Consensus to Reference E D x B ABV EE E E E EE Description 5 consensus sequences compared to reference JE NC_001807a 3 end TGTAGTACATAAAAACCCA CACATCAAACCCCCCccc s sequence NC_001807a 2 end 4E _ABV19999CPrimeriR ITGTAGTACATAAAAACCCA Comparison Range Unfiltered AE _ABV_12251924_PrimertRIITGTAGTACATAAAAACCCA Base Positions 15884 16571 AE _ABV_12261924_PrimertF TG TAGTACATAAAAACCCA 5 frag bases selected at B Chromatograms from ABV _ABV_1999 X_Primer1R Fragment base 16 172 Base 148 of 328 i TAAA AAC C AC nET TETE ES T A d d Compare Translation to Reference E EI KA T i i 8 i VO oi Ak wier ele ec We a ER Se bach EI T Description 5 consensus sequences compared to reference Tai a T T 7 ee sequence NC_001807a 3 end Comparison Range Unfiltered Base Positions 15894 16571 Amino Acid Positions 1 229 ga Y A T ALA A A Lap O T Pn Delt SCH t M 172 L v2 of K C Variance Table Reports The format of each Variance Table report is broadly similar The report starts with a Common Header This header includes the date the project name the type of table Comparison Range base positions and options This is followed by a header containing more pertinent information and finally the body of the report Variance Table Report The Variance Table
9. 1991 2011 Gene Codes Corporation Inc All rights reserved 126 Enabling the protein display To enable the protein display in the summary view go to the View menu and click on Translation You can then choose from the following list of menu sub options Single Stranded Double Stranded Protein First Frame Protein Second Frame Protein Third Frame and Protein All Three Frames To see the protein translations for the opposite strand go the View menu and select Reverse amp Comp Then select the desired option Where the assembly algorithm has inserted a gap in a fragment Sequencher will translate the codon correctly Figure 14 3 Summary Report displaying translation across a sequence gap WE HLA HLAOO 617 CGGACCCCCCCCAAGACACATATGACCCACCACCCCATCTCTGACCATGAGGCCAC AspProProGlndspThrTyraspProProProHisLeuSTPProSTPGlyHis WE HLA HLAOO 617 CGGACCCCCCCCAAGACACATATGACCCACCACCCCATCTCTGACCATGAGGCCAC AspProProGlndspThrTyraspProProProHisLeuSTPProSTPGlyHis E HLA HLAOL 617 cocaB cGCCCCCAAAACGCATATGACTCACCACGCTGTCTCTGACCATGAAGCCAC Asp AlaProLysThrHisMETThrHisHisAlaValSerAspHisGluaAlaThr Protein translations for the Consensus or Reference Sequence This option is context sensitive If you do not have a Reference Sequence or you have selected the Consensus radio button you will be able to show protein translations for the consensus Click on the Consensus check box at the bottom of the Summary Options dialog window If you have a R
10. Fragment Sequences You can choose to hide the fragment sequence by clicking on the Fragment Sequences box This gives you a very compressed report that can be useful when viewing protein translations Figure 14 2 above shows a summary line without the consensus line but with Bullets Pluses and amp Dashes all checked Matching Bases as Dashes Matching Bases As Dashes is a very powerful option when you are interested in any kind of mutation detection When you check this option all the bases that match each other are converted to dashes If you have selected Display Color Bases the dashes will be marked in the color of the original base call If you also go to the View menu and chose Colors as Background each dash will appear embedded in a small color block Bases that disagree are displayed as characters and as such stand out in stark relief With this option you can easily see mutation hot spots or clusters and potential SNPs Protein translations and the Summary Report Sequencher can show protein translations for individual fragments and mark matching amino acids with dashes This helps you see whether a particular discrepancy in an alignment has functional implications or whether it may simply be a silent mutation If you combine options you can turn off the DNA display altogether and just focus on your translations Alternately you can use a mixture of options to tailor the report to your specific needs SEQUENCHER User Manual
11. Gi Polylinker SS Use Vecbase File Name P Polylinker Use Vecbase File If the cloning technique you are using places your insert between two sites replace the vector sequence between the two sites with a single bullet just as your insert replaces this sequence Specifying the insertion site manually If the vector you select in VecBase does not have polylinker information or if you are screening for contamination using a sequence not already in VecBase you must enter the information manually In the Specify Vector Insertion Sites window paste the sequence maximum 250 bases into the polylinker window Insert an asterisk by holding down the Shift key and the 8 key not the one from the number keypad to represent the sequence you wish to retain Saving and loading vector sites You can save manually entered vector information to a file by clicking the Save Sites button You can reload the file in the future by using the Load Sites button This allows you to save your preferences and parameters for a later session SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 37 Setting Vector Trimming Criteria Sequencher has a set of default criteria to facilitate the recognition of any contaminating sequence These criteria are listed in the top region of the Vector Insertion Sites window If these parameters do not work for your specific circumstances you c
12. Id e Ek ln Ee eters e Cancel Note To import traces you must import the actual trace file If you import a text file containing ASCII characters you will not import traces Text files will be only a couple of kilobytes in size whereas trace files will be well over 50 kilobytes Sequencher uses a very efficient algorithm to compress the imported traces This conserves disk space so a project file containing several sequences from an automated sequencer may be smaller than any one of the original trace files Folder of sequences When you have many sequences to import you can import all those files from within a single folder Select the Project Window and go to the File menu and choose Import and Folder Of Sequences from the submenu The standard Browse for Folder dialog box appears select the folder that holds the data you want to import and click OK Use the Include TXT and SEQ files check box to include or exclude these files from your import Sequencher tells you how many files are in the folder and asks whether you want to import them all If you do click Import All Files in Folder Sequencher imports all the files assigning a separate icon to each You can specify that you want files imported into the Project Window or directly into the trim window see Chapter 6 Preparing your Data for Assembly SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 19 Figure 4 2 Browse for
13. Jase Soronia n EREE OOC ERER OCORRE 153 Any Lut Ee E E 153 Finding bases by Number eieiei REES 153 Extend Select Oi eege 153 Other Search options EE 153 Project Window SElECHOM sk weiieccseeastateaccecedsasiaacnescacdewna EE A E R deans 153 Finding a item by E 153 Selecting allat ms thatessi eels aie Ag Ae Aa a aaa hee ae headed ete ages Agi eats elie eked 154 WEE H WEE 155 Contamiliems Named EE 155 Contain Items With Names Containing eresie e E A AAAA EEE AANS 155 Haer store 156 Have Comments Contain g os vt ccssecd sch asct sich ocak sedtascs ced deier iaiaaeaia 156 Have Labels Contamin gis Ae 156 Have Names Contain EE 157 Were Bdited On On Since secs te Meese tacts dees teete ee 157 19 EXPORTING DATA TE 158 Exporting data from Sequencher ios siyi2seiccs cas avdaceende ceacudges lanes Goeaacciaetaavaats osageagustaventuens 158 EX POPU SEENEN 158 PX porting a CONSENSUS age 159 Fern SEELEN 160 Exporting Selection as Subproject 2 ssiiic chcssencicees estes ee EENS 160 Exportselected DEE 161 Egeter eieiei eege REES 161 Ee 161 Exporting protein translations sccis s setieisieazisdccsstieccieagtaigiead teeta EES edel 162 EXport OAS A A TAEAE AAEE E EEE Ee 163 EXpOrtin CO Sa ES a is coal Gest on dele oe gis 163 20s ei n EK EE 164 Copy pIChitGs ta dee Eeer 164 PrintSetup EE 164 Print Trace in One Page ee EE EE 164 Printing imc E 164 Setting header footer and margin Options 0 0 eee cess eseeeeseeeeseeeesee
14. Sep 13 2004 3 46 49 PM E IS_AS_2001 X_Primer2R 392 BPs Auto Seq Frag Mon Sep 13 2004 3 45 40 PM aE IS_AS_2001 X_Primer3F 442 BPs Auto Seq Frag Thu Sep 23 2004 8 19 31 PM AE S_AS_2001 X_Primer4F 414 BPs AutoSeq Frag Thu Sep 23 2004 8 19 31 PM E IS_AS_2001 X_Primer4R 322 BPs Auto Seq Frag Sep 13 2004 3 46 44 PM E IS_AS_2001 X_PrimerSF 226 BPs AutoSeq Frag Sep 23 2004 8 19 31 PM E A_FDR_01301882_Primer1 F 440 BPs AutoSeq Frag Sep 23 2004 8 19 33 PM a A_FDR_01301882_Primer1R 327 BPs AutoSeq Frag Sep 13 2004 3 46 49 PM L A_FDOR_01301882_Primer2R 446 BPs AutoSeq Frag Sep 13 2004 3 46 40 PM You can enable Assemble By Name by clicking on the Assembly Parameters button and checking the Enabled checkbox at the bottom of the window Once you have clicked on the APN button this box will already be checked Setting Assembly Parameters Sequencher uses the values set in the Assembly Parameters window to control the way it assembles sequences Once you have set your required options you can store these as your default Assembly Parameters Click on the Set as Defaults button at the bottom of the window and Sequencher will use your preferred Assembly Parameters whenever it launches If you just click OK to dismiss the Assembly Parameters window Sequencher will use your new parameters only with this project or session of Sequencher Performing an Assembly with Assemble by Name Once you have chosen your Name Delim
15. Td Figure 11 2 Contig overview options dialog box Overview Options Iw Scale Diagram To Window Size Cancel Iw Display Diagram Key Labels Of Fragments Names Of Fragments AtLeft Positions Of Fragments a Ia Start amp Stop Codons Base Numbers At Transitions Base Numbers Every x Bases Condensed Base Numbers 1141 Scale Diagram To Window Size Selecting Scale Diagram To Window Size shrinks the whole overview to fit in the window even if the names of the individual sequences become unreadable When you turn this option off you make the names of the sequences readable but you may have to scroll the window to see all the sequences in the diagram Diagram Key Clicking this checkbox displays or hides a key that explains the Overview Labels Of Fragments You must have already defined and applied some labels to your sequences with User Preferences see Chapter 21 Customizing Sequencher and User Preferences and the Label command Clicking Labels of Fragments will display or hide any fragment label names Names Of Fragments Clicking this checkbox displays or hides the sequence names in the Overview You may have this and the previous item checked at the same time Names Of Fragments At Left Clicking the At Left box in conjunction with Names of Fragments lists the fragment names down the left side of the window instead of on top of each arrow as shown in Figure 11 3 If for example the sequences are al
16. The Assemble Interactively window provides you with detailed information regarding overlap mismatch and gapping for any given sequence and a set of candidate sequences The candidate assemblies are driven by your user defined assembly parameter settings Performing an interactive assembly Bring the Project Window to the front and select the items you want to assemble Click on the Assemble Interactively button at the top of the Project Window or go to the Contig menu and choose Assemble Contigs and then Interactively from the submenu Sequencher displays the Assemble Interactively dialog box shown in Figure 9 4 The sequences and contigs you selected are displayed in the Candidates list The panel on the right showing a face in the upper left corner is called the agent It displays information about the comparisons you have requested In the center is the Matches panel which displays the possible matches For each match in the panel Sequencher lists the Match the approximate number of Overlap bases the number of Mismatch bases the number of Gaps and the New Contig Length The top line shows the Candidate sequence and the line below this shows the Match sequence Below these two lines is the consensus sequence There is a small button to the right of the consensus labeled with a black circle If you click repeatedly on this button you can display the first second third reading frame or all three reading frames at once The label
17. To select a discontinuous list of sequences hold down the Control key while clicking on the sequences Go to the Edit menu and choose the Remove From Table command Note This does not remove sequences from the underlying contig You can then export the remaining data as normal To export all the data in the table click on the Reports button The Variance Table Reports dialog box appears A radio button called Entire Table will be checked Below that you will see a drop down menu called Report Format Choose either Variance Table Report or Individual Variance Table Reports If you choose Variance Table Sequencher will generate a single table If you choose Individual Variance Table Reports each sample will appear in a separate report There are four buttons below the Report Format drop down menu Cancel Reports Copy as Text and Save as Text Save as Text is the default setting If you click on the Save as Text button Sequencher presents you with a dialog to assign the name and location of your export The report is saved in a tabbed text format Note You can select and then remove columns from your table using the Remove From Table command before you export your data Exporting selected sample sequences Each column in the table represents a sequence or consensus If you need to export specific columns in the table make your selection by using Shift click for a continuous range or Ctrl click for a discontinuous range see Figure
18. amp Name Menu Sound New Project Display Chromatogram Contig Chromatogram Contig Feature Motif Format Ruler Start Stop Variance Table Input Output File Import Report Confidence Ranges Wen Low Medium High Specify values to set the confidence ranges Sequencher combines the medium and high ranges to calculate your Quality SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 173 Label amp Name The Label amp Name preferences control the labels and descriptions used to mark sequences and contigs Use the boxes grouped under Available Labels to choose label colors use the pull down color menus at the left of the label name and descriptions for marking individual sequences Once you have set your preferences here you can apply them to sequences contigs and refrigerators while you are working in a project by selecting the icon s in the Project Window Go to the Edit menu and choose the Label submenu Under the Default Names section of the Label amp Name pane you can enter default names for the new contigs and fragments A bracketed asterisk tells Sequencher to add a number to the name and increase it for each new contig or fragment you add to the project Figure 21 7 Label and name preference settings User Preferences General Settings Available Labels Ato Save MEI Confidence Label 2 Label amp Nam
19. fields Changing the selected enzyme You can also bring up the dialog by clicking the Select Enzymes button in the button bar of any cut map Alternately you can go to the Window menu and choose Specify Restriction Enzymes The enzyme editor dialog box is shown in Figure 16 7 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 140 Figure 16 7 Restriction enzyme editor Restriction Enzymes Load Enzymes Enzyme Name Recognition Sequence Saci Effect of NaCl Concentration gt 30 active 10 30 active lt 10 active J J d Not specified a Om 50mhM 100mM 150m Cancel Available enzymes appear in the scrolling list on the left side of the window To view information about a particular enzyme click on its name Sequencher highlights the enzyme and puts the appropriate information in the Enzyme Name and Recognition Sequence fields If an enzyme is currently selected for display in the map the square to the left of the name is filled with an x An enzyme not selected for the map has an empty square When you move the cursor to the checkbox column it changes to a check mark To change your map selections click the checkmarks to toggle them on or off Figure 16 8 shows the list in detail SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 141 Figure 16 8 Checkmarked enzyme list Select All v If you are inte
20. go to the Select menu and choose Bases by Number A dialog box lets you enter the range of bases you want to highlight Extend Selection You can extend the current selection to the beginning or end of your sequence by going to the Select menu and choosing Extend Selection Then choose the suboption to Left End or To Right End Note In a contig chromatogram the Extend Selection command can extend your selection across the entire sequence in one click Other search options You can also search for ambiguous bases by going to the Select menu and using the Next Ambiguous Base command You can search for edited bases using the Next Edited Base option These are described in more detail in Chapter 8 The Sequence Editor You can also search for potential open reading frames using the Next Met to Stop gt 0b option This is described in more detail in Chapter 21 Customizing Sequencher and User Preferences Project Window selection Finding an item by name To locate a specific sequence or contig select the Project Window Type the name of the item you are trying to find for example Sequence C As you type Sequencher starts looking in the Project Window for an icon with the name you are typing and highlights a match If it finds more than one it highlights the first one SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 153 Note If you pause between characters for more th
21. in the consensus line Therefore if you delete gaps from the consensus this will not affect the Reference Sequence unless it also has gaps at the position of the deleted bases SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 43 For example you can delete a range of bases from all fragments by selecting a range within the Consensus sequence When the deleted bases are in the middle of a fragment they will be replaced with gaps to maintain the alignment with the Reference Sequence Since the action cannot be reverted using the Undo command you must use the Revert to Experimental Data command from the Sequence menu to recover Fill Coverage Gaps with Reference Under normal circumstances the Reference Sequence does not contribute to the calculation of the consensus However there is a special instance where the only data available in a particular region may be the data in the Reference Sequence This might happen if you have aligned cDNAs to a genomic reference The command Fill Coverage Gaps with Reference from the Contig menu will allow you to fill the gap with bases from the Reference Sequence The bases from the Reference Sequence are dropped into the consensus line but only in those places where there is no other data Figure 7 4 Figure 7 4 Filling gaps with the Reference Sequence B Fill Gaps OX P Ultanan 20 303 bases Contains D ambiguities 0 gaps amp 0 edits Select Next Ambi
22. or handle1_handle2_handle3 However a Name Delimiter can also be a combination of letters numbers and characters To define your Name Delimiter Sequencher provides a set of commonly used characters as a drop down menu in the Assemble by Name Settings dialog You will see this menu if you go to Assembly Parameters and click on the Name Settings button When your naming convention does not use simple characters as Name Delimiters you can use formal descriptors known as regular expressions instead which we describe later in this chapter Once you have defined an Assembly Handle you will be able to assemble all of the sequences in your project Only sequences that share the same Assembly Handle will be analyzed for assembly into the same contig In a clinical application for example several contigs might be created with each contig representing sequence data from only a single patient provided that each sample patient has a unique Assembly Handle in its name Setting the Assemble by Name Conditions Configuring Assembly Handles using a Single Delimiter Starting in the Project Window click on the Assembly Parameters button At the bottom of the Assembly Parameters window click on the Name Settings button The Assemble by Name Settings window appears in Figure 10 1 below Figure 10 1 Assemble by Names Settings window Assemble by Name Settings Name Delimiters _ Underscore EI Assembly Handles e 1 Date C 2 Clon
23. slider To set the minimum number of bases that must overlap use the Minimum Overlap slider SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 60 Figure 9 2 The Clean Data algorithm with Maximum Loop Out Size shown Assembly Algorithm Dirty Data e Clean Data C Large Gap The clean data algorithm is FASTER than the dirty data algorithm Unknown base calls are considered matches to anything Maximum Loop Out Size 2 H Assembling with Large Gap The Large Gap algorithm is specifically for data that has or might have large insertions or gaps This might include data from evolutionary studies different organisms or comparing cDNA to genomic DNA When you select the Large Gap algorithm there are two sliders for changing minimum match values To set the proportion of bases that have to match use the Minimum Match Percentage slider To set the minimum number of bases that must overlap use the Minimum Overlap slider Not e Assemble to Reference cannot be used with the Large Gap algorithm Assemble By Name You can enable Assemble By Name by clicking in the checkbox called Enabled Click the Names Settings button Choose your delimiter from the Name Delimiters drop down menu Then type names for the Assembly Handles into the fields in the Assembly Handles pane To activate an Assembly Handle click the adjacent radio button When you are done click the OK button You will notice that the drop d
24. 2011 Gene Codes Corporation Inc All rights reserved 51 Figure 8 7 ORF selected by Next Met to Stop command E Quality four D Par ES RT RR Espen Residue 88 to 192 D Low Confidence 0 0 0 Ambiguous 0 0 NNN NNN NNN NCT CTC GNN NCG GGT CCT GGT GAA TGC CTG GAA ATC AAA GAN NAN CCA TGG GGT TCT ra CCC GC G TTi T G TCA CAG AGC GGC GCG GTT GGT GCC GCG GCG GCG TGC TGA CGA CTA GCT GCG GTA ATA CCC TCA CAT Sequence features Looking at codon maps Click the Codon Map button located in the button bar at the top of the Sequence Editor or go to the View menu and choose Display Codon Map The three outlined bars at the top of the editor window represent the three forward frames of translation Each green flag sticking partially into the top of a frame marks the position of a start codon while each red line cutting through the frame and extending below marks the position of a stop codon Clicking between any two markers highlights that section of the sequence Figure 8 8 Sequence Editor with codon map button selected B Quality four D Fra Pn GES RTR RSR Sow Cronaozon Residue 1 Sequenced Strand Sequence Length 794 Frame 1 7 me Frame 2 7 s NNN NNN GGN NNN NNN NCT CTC CTG GNN NCG GGT CGA GTT CCT GGT GAA TGC CTG GAA ATC AAA GAN NAN ACC CCA TGG GGT TCT CAT ATG ACA CCC GCT GTT TTG ACT CAA CGG TCA CCG AGA GTG ATA TCC GTG TTG AGG AGT CAA TTT ACC AAT GTT GTG ACT TGG CCC CCG AAG CCA GA
25. 3 but you can change this using the Complement check box Use the Display Style pull down menu Figure 17 6 to specify strand type color case and underlining You may select one attribute from each of the four style categories When you have finished entering or editing your features click Done If you have chosen a GenBank Feature Key Sequencher will insert a default name in the Feature Name field using the Feature Key and Feature Qualifier You may edit this if you wish Enter the starting and ending base numbers for the feature in the Feature Location boxes The default strand is 5 gt 3 but you can change this using the Complement check box Use the Display Style pull down menu Figure 17 6 to specify strand type color case and underlining You may select one attribute from each of the four style categories When you have finished entering or editing your features click Done Check the Display Feature in Editors checkbox and then go to the View menu and click on Display Features to have your feature appear in the sequence or contig editor and the associated overviews When you set a feature to include Protein or DNA RNA this will only be displayed in appropriate views SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 148 Figure 17 6 The display style menu Features from Feature Display Style Feature Key Sequencher sl Feature Name New Feature Ne
26. 821 BPs 550 BPs Ref Contig of 3 Refrigerator Ref Contig of 3 Ref Contig of 3 Auto Seq Frag ABI Fer ON Fragment Auto Seq Frag ABI Ref Contig of 3 Ref Contig of 3 AutoSeq Frag ABI AutoSeq Frag ABI AutoSeq Frag ABI AutoSeq Frag ABI Ref DNA Fragment Fri Mar 24 2006 2 02 14 PM Fri Mar 24 2006 1 59 39 PM Fri Mar 24 2006 1 58 59 PM Fri Mar 24 2006 1 58 59 PM Fri Dec 16 2005 4 06 10 PM Tue Dee 13 2005 4 39 43 PM Fri Dec 16 2005 4 06 10 PM Fri Mar 24 2006 1 58 59 PM Fri Mar 24 2006 1 58 59 PM Fri Dec 16 2006 4 06 11 PM Fri Dec 16 2005 4 06 11 PM Fri Dec 16 2006 4 06 10 PM Fri Dec 16 2005 4 06 10 PM Tue Dec 13 2005 4 39 43 DM Has SNPS that need to a accessing Help then the Project Window context page will appear This will be the case for most of Sequencher s major windows If you see a flagged icon this means you can click on a link to get further information If you place the cursor over this icon a Tool Tip with the title of the Help article will be displayed The button bar The button bar contains buttons for frequently used commands It appears just below the top of many windows see Figure 2 2 Figure 2 2 The Project Window button bar D Assembly Parameters Assemble Automatically Assemble Interactively Assemble to Reference On many Sequencher windows you can use the smaller left hand button which has an up arrow on it to hide the button bar Click on the but
27. Agent Describes current selection and the current status of the spacebar shortcut D Splitter bar Controls the width of list message area and the editing area dragging the splitter bar to the left or right changes the size SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 81 Label Name Function of these items E Sequence Displays the sequence bases aligned Bases F Sequence Scrolls up and down through the sequence bases in the Scroll contig G Base Shows which bases you are viewing numbered from the Numbers beginning of the contig H Consensus Shows the calculated consensus of the sequences that were line aligned to form this contig l Ambiguities Displays a plus below any ambiguous base calls in the contig line and a bullet below any base calls with disagreeing bases above them This area also displays protein translations with a bullet J Base Scroll Scroll left and right through the bases of the contig K Translation This button toggles the display of protein translations on and Button off Five options are available Frames 1 2 or 3 all three reading frames or In Frame with the Reference Sequence L Notation Displays translation of Reference Sequence and user text Line notes M Reference This button toggles the display of the Reference Sequence Translation translation on and off Toggle Viewing sequence names If the names of your sequences are very long you
28. Figure 21 19 File import preference settings User Preferences General Settings E 2 Puto Save E Confidence Review Trim Parameters bel amp N See ER Add Imported Sequences to Trim Window Sound Alllmported Sequences New Project e OF Sie ep All That Have Less Than 1500 Bases Gorki See All That Have a Chromatogram Contig IT All That Have Unknown Bases Feature Matt Format Ruler Import Options Start Stop Prefer File Names to Data Names Variance Table Input Output File Import Report The Report preferences command controls the default page setup for report printouts Use the fields in the Page Margins box to set the margins and gutter Click Page Setup to open the printer page setup dialog box The page setup settings will apply to all windows you open thereafter Figure 21 20 Report margin preference settings User Preferences General g S Settings Page Margins Futo Save Left os in Top 0 5 in Kette ar 0 5in Bottom 0 5 in Label amp Name k en Gutter 0 5 in Sound New Project Sequenchentm Project Name Display Enter your default header and footer Chromatogram into the fields above and below lo EEN Fay of the following bracketed strings Feature Matt included in the header or footer will be Format Ruler replaced with the requested Start Stop information Page Total Pages Mime Date Project Name amp User Variance
29. Finding Ambiguous Bases One of the most frequently executed commands is Next Ambiguous Base under the Select menu You can use this command in the Sequence Editor or the contig editor By using this command in the consensus sequence of a contig Sequencher will display the next position in your contig that contains an ambiguity or disagreement The shortcut for this command is Ctrl N for more on keyboard shortcuts see Appendix 23 Keyboard shortcuts If you use the Shift key with this key combination you will go back to the previous disagreement Once you have used any of the Select Next commands Sequencher will activate the space bar to execute that operation If you use either the menu command or the shortcut key combination two consecutive times Sequencher will remind you that this easier alternative is available Finding Open Reading Frames ORF Choose Next Met to Stop gt 0b to highlight the next pair of start and stop codons in one of the three forward reading frames The number of bases shown in this command depends on whether you specified a preference for a minimum length in the User Preferences For instructions on how to set a specified minimum length for open reading frames see Chapter 21 Customizing Sequencher and User Preferences If you set preferences to highlight ORFs only when they are of some minimum length the menu command will display the number of bases you specified SEQUENCHER User Manual 1991
30. Folder dialog box Browse For Folder Select Folder Location Include TXT and SEQ files O Gene Codes B O Sequencher 4 6 Sample Data E Mitochondrial Sequence Sample Fragments O T G Sample Data oma Folder ADARC Projects in specific formats Sequencher supports importing projects as ACE projects GCG contigs and as CAF projects The contents of these files include confidence values feature annotation and assembly information Select the Project Window and go to the File menu and choose Import and then choose the specific project format from the submenu The standard Open dialog box appears select the folder that holds the data you want to import and click Choose Read more about importing confidence values in the section Importing Confidence Scores later in this chapter Sequencher projects At times you may want to merge two projects or combine your own work with that of a colleague To combine Sequencher projects first open one of the projects Go to the File menu and choose Import and then Sequencher Project from the submenu Select your other project and click the Open button Sequencher displays all the imported projects in the initial Project Window retaining all comments edits and even the relative positions of the icons To keep track of which sequences came from which project you may want to establish a naming convention for sequences before combining them Since the importe
31. Gutter fo in Foy of the following bracketed strings included in the header or footer will be replaced with the requested information Page Total Pages Time Date Project Name amp User Name Cancel Report formats Special report formats are used for printing the contents of the contig editor Specify the report format you want by clicking the Report Manual or Poster radio buttons from the Style pane of the Set Header amp Footer dialog box Sequencher then adjusts the report accordingly The Report option is for reports printed on single sided pieces of paper that need to be displayed one page at a time Each page is fully labeled The Manual option is for two sided printouts when the user can look at a left and a right page simultaneously Only the left hand page of each pair will have all the labels for each sequence printed in a report the right page uses the additional space for more data The Poster option is for displaying all your data together It minimizes the amount of labeling and assumes that the entire report is to be laid out as one big rectangle for paste up and display Page breaks Page breaks which are shown on screen as in Figure 20 2 appear in the Sequence Editor and in the Summary Report view of a contig SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 165 Figure 20 2 A page break in the Summary report 213 tgCGCCGCTACAGGGCGCGTCCATTC
32. Next Commands cere ceaiac a teh A A dues atbsues Dt lesyat aden sbesavapal AAA A 133 Editing b ses froni traces sereine iii aes A das A A EE lt eb OAE ag 134 Reverting to experimental data ee NEEN ee 134 Reverting assembled sequences back to experimental data s ssssessssssesssssestststseseerestststsrsrsrsrsesresesestsese 135 Adjusting trace POSITIONS tee Ee ENNEN EE 136 16 RESTRICTION MAPS iieiinsssicssneusreseneratenseensineenncensinmndieninnainninnnsiindauniitias 137 Displaying a Restriction Map 2 ntentecustiahdedtiatedastieducadhantedwetewuuuackundydn 137 Restriction Map display Optom lt aiccsivesssdsccc dec esdpucslancededesaes saysensaeicescouaesasate Gewese boone 137 How to select EE 137 How to show cut positions or fragment satzeg eee escseeseeseeeseeeeseseeseseeseeeesesscseeseseeseecseeaeseeaesetaeseeas 138 How fo settheimap style eege 138 Setting the map Widths ee EEN 139 Setting the map Capon EE 139 Getting more information about an CNZYME ou cesses eseeeeseeseseeseseeseeeesceecseeecsecacseeseseeseacseeesseeaeeeaeeeeas 140 Changing the s lect d enzyme eege 140 Changing the default enzy mesic nerone nna AROE AER ARAA OO ORROA R RRRA 142 Adding and editing EE 142 Changing the recognition sequence 00 eee eesceeseeseteeseeeseeeseescsseseseeaesecsesecseaesesacseeaeseeseacseeasseeaeseeaeeeeas 143 Salt concentration CLICCIS ege 143 Saving new or modified enzwvmes eee ese eeeseeeeseeseseeseseeseseeseeeseeecseeacseeseseesee
33. Red 2 gene 3 misc_signal a promoter 1 CAAT_signal 2 TATA_signal 3 35_ signal 4 10_signal 5 GC_signal b RBS c polyA_ signal d enhancer e attenuator SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 196 Feature Display Color Style Second Strand f terminator g rep_origin h oriT 4 misc_RNA a prim_transcript 1 precursor_RNA a mRNA f exon y Blue g CDS y Blue Protein 1 sig_peptide 2 transit_peptide 3 mat_peptide h intron y Cyan Invert case i polyA_site j rRNA k tRNA 1 sCRNA m snRNA n snoRNA 5 Immunogobulin related a C_region b D_segment c J_segment d N_region e S_region f V_region g V_segment 6 repeat_region a repeat_unit b LTR c satellite d transposon 7 misc_binding primer_bind protein_bind STS primer a b c d 8 misc_recomb a iDNA SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 197 Feature Display Color Style Second Strand 9 misc_ structure a stem_loop b D loop 10 gap 11 operon 12 source Unrecognized Note The Unrecognized key is at the same level at misc_feature SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 198 28 Appendix Feature Keys and Qua
34. Report consists of three parts the Common Header the Variance Table and the Comparison Range Coverage table This table lists the name of each sequence the type of coverage complete or incomplete and the range of bases in the comparison see Figure 13 19 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 112 Figure 13 19 Extract from Variance Table Report Ah Variance Table Report Date July 12 2007 Project Name 1 exon HPS4 Tutorial Compare Consensus to Reference 3 exon Reference 30 end 50 end Comparison Range Unfiltered Base Positions 15 394 to 15 644 Options Large gap insertions 10 or more bases excluded Matches to ambiguous reference positions excluded Reference 128 152 Total 15 576 15 517 15 401 1 15 400 15 395 15 394 15 612 15 403 15 624 15 615 Total BO N AALALA A a Hj r ml al u b i wl wl wlw DI N Sequence Name Comparison Range Coverage 128 Complete 15 394 to 15 644 152 Complete 15 394 to 15 644 Individual Variance Reports You can create a separate report for each sample in the table The Individual Variance Report lists the variants in a single column This table is followed by the Comparison Range Coverage table which lists the name of each sequence whether coverage is complete or incomplete and the range of bases in the comparison see Figure 13 20 Figure
35. The Project Window then displays the sequences and contigs as shown in Figure 3 4 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 10 Figure 3 4 Project Window with Large Icons Untitled Project SEE D Assembly Parameters Assemble Automatically E Assemble Interactively f Assemble to Reference Parameters Dirty Data With ReAligner 3 gap placement Min Overlap 20 Min Match 85 ZS ef e e ef Quality eight Quality Quality Quality Quality two Quality five Quality nine peer seven Quality three Note A new project defaults to the icon view unless you previously specified a different view View as Small Icons To fit more information onto the screen you can view project items as smaller icons Go to the View menu and choose Project Icons As and then choose Small Icons from the submenu The Project Window then displays the sequences and contigs as shown in Figure 3 5 Figure 3 5 Project Window with Small Icons Untitled Project DER Assembly Parameters Assemble Automatically Assemble Interactively Assemble to Reference Parameters Dirty Data With ReAligner 3 gap placement Min Overlap 20 Min Match 85 AE Quality eight aE Quality four AE Quality one aE Quality six AE Quality two AE Quality five E Quality nine AE Quality seven aE Quality three Moving icons To reposition a sequence or contig icon click the icon and drag it
36. User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 114 Comparison Range coverage This report consists of two types of tables a Population Table and individual Detail Tables The Population Table lists the variants in each population group It displays information on the number of samples which contain each variant base as well as the total number of samples in each population group Each population group is given a name based alphabetically on the first member of the group see Figure 13 22 The Detail Table includes information on the frequency of the population group and the location of the variants with reference to the exemplar It also lists the names of the contributing samples Figure 13 22 Extract from a Population Report Participating Data 5 population groups consisting of 5 samples Nonparticipating Data 0 samples dropped from the report due to incomplete comparison range coverage Populations Reference 116 128 150 152 156 Total Samples 15 576 G PLELT UTI IS 15 517 A GiG iG GjG js 15 401 1 Li TIT TIT YIS 15 400 TIAJTAITATA IA IS 15 395 G Cle tcic ic is 15394 T Afa JaA fafa IS 15 612 G w RIRI3 15 403 E S S 2 15 624 G R 1 15 615 G 1 Total Samples 1 1 1 1 1 116 Like Frequency 20 00 Variants 7 15576 T 15517 G 15401 1 T 15400 A 15395 C 15394 A 15403 S Samples 1 116 Working with Vari
37. anar Fragment base 16 is selected Base 16 of 727 Editing modes The Edit command allows you to replace overstrike insert or erase a base call The specific option you choose from the Edit menu will create that behavior When you use Replace When Editing the base you have just edited will remain highlighted When you use Overstrike When Editing the cursor will move one base to the right and this base will then be highlighted Insert When Editing allows you to insert bases before the currently selected base pushing all of the following bases forward This also allows you to add bases beyond the end of a sequence Sequencher uses Replace When Editing as its default setting Note If you insert or delete a base from any one sequence the sequence may no longer align with other sequences after the point where you made your change The consensus line will reflect this by showing lots of ambiguities in the form of bullets e If you have made only one change go to the Edit menu and the Undo command will remove it Viewing Base Edits To assess the quality of your data you need to be able to see how much modification has already been done to your contig as you edit it When a base is edited Sequencher changes its appearance by displaying it in magenta and bold faced text If you want edits to stand out even more for instance in a black and white printout you can also invert the case of the characters e g change all edits from
38. and choose View Options A dialog box Figure 16 2 lets you change several attributes of the restriction map SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 137 Figure 16 2 Restriction map options Restriction Map Options Cutters r Show Iw Unique Cutters Iw 2 Cutters C Fragment Sizes Iw 3 Cutters Ze Cut Positions Iw 4 amp More Style wh Single Line Inches 12 Se 7 Multiple Lines C Text l Scale to window J L Iw List enzyme names at bottom of map Cancel Use the Cutters checkboxes to display enzymes according to the number of times they would cut For example you could select just 2 Cutters that is only those enzymes that cut in two locations If you select the All button this turns on all of the checkboxes simultaneously How to show cut positions or fragment sizes Use the Show options to choose the Fragment Sizes display or the Cut Positions display by clicking on one of the radio buttons How to set the map style Use a radio button from the Style pane to choose one of the three basic map styles The Single Line map is the default display Choose Multiple Line map Figure 16 3 if you want to show each enzyme on its own line Figure 16 3 Multiple line restriction map KT C Apal 2 f E HA Drall 5 ES Kpal 1 PstI 2 SacI 2 e 234 k E Sacll 4 f ES Xmalll 1 M
39. between the exemplar sequence and your chosen sequence s The results may be from just one contig but can represent an assembly of hundreds or even thousands of sequences If you are performing activities such as de novo sequencing clone checking or resequencing you may want to use this form of the Variance Table The Variance Table and consensus sequences You can also create a Variance Table that summarizes all of the differences between selected contig consensus sequences see Figure 13 1 below in your project and a common Reference Sequence consensus Variance Table When you are working with multiple samples from multiple sources and have used Assemble by Name with a Reference Sequence you should use this Variance Table If you are performing activities such as comparative sequencing clone checking or SNP analysis you probably want to use this form of the Variance Table SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 94 The Translated Variance Table If you are interested in focusing on the differences between sequences or contigs at the amino acid level you should use the Translated Variance Table This table differs from the Variance Table by displaying both codons and their associated amino acid residues Each row in the Translated Variance Table summarizes the variations among all of the selected sequence translations at a given amino acid position relative to the exemplar You can als
40. button to dismiss the Assembly Parameters window Assembly Algorithms To choose which assembly algorithm you want to use click on the Assembly Parameters button on the Project Window and then click on one of the three radio buttons at the top of the window Your choice will depend on your data SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 59 For instance if you click the Clean Data radio button contig assembly is faster but Sequencher may miss some possible matches if the ends of your sequences contain a large number of ambiguities For best performance make sure that you have trimmed poor quality data from your sequences The Dirty Data algorithm is somewhat slower because Sequencher is performing more rigorous comparisons between the sequences You should note that automated sequencers create dirty data and Sequencher s algorithm is optimized to deal with that The Large Gap algorithm allows you to assemble sequences that are expected to contain insertions and deletions gaps greater than 10 bases long Such gaps are typically found in evolutionary studies and sequence comparisons from different organisms The Large Gap algorithm is similar to and slower than the dirty data algorithm but allows larger gaps to occur when performing the assembly Typical examples of assemblies that require the Large Gap algorithm include the comparisons of a cDNA sequence to a genomic sequence and the assembl
41. by the smallest category of ambiguity that covers all available data In the example in Figure 11 7 a column with one sequence containing a G and the other containing a C is shown in the consensus line as an S the IUPAC code for C or G This consensus is particularly useful in any work involving mutations Figure 11 7 Consensus Inclusively example CCCCATGGGGTTCTCATATGAC CCCCAT GGGGTTCTCATATGAC CCCCAT GGGGTTCTCATATGAC CCCCATGGGCTTCTCATAT GAC 670 CCCCATGGGSTTCTCATAT GAC Consensus By Plurality Consensus By Plurality is Sequencher s default setting With this consensus calculation Sequencher determines the consensus line bases by majority rule If there is no clear majority Sequencher chooses the smallest category that fits the available data In the example in Figure 11 8 below a column with two sequences that contains a T and an A is still shown in the consensus line as a W because there is no clear majority However a column with three sequences that contains a gap in one sequence is shown in the consensus line as an ambiguity code in the consensus line only if the other two sequences do not agree Figure 11 8 Consensus by Plurality example CACCCGCTGTTTTG CACCCGCTGTTTTGACA CACCCGCTGTTTTGACTCAACGGTCA CACCCGCTGTTTTGA CAACGGACA 710 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 83 Consensus to Forensic Standards In certain applications such as forensic mtDNA sequen
42. cancel the operation simply press the Cancel button Chromatograms occupy a lot of memory but Sequencher uses a very efficient algorithm to compress trace data However we suggest that you only duplicate the text if you don t need the chromatograms See Chapter 15 Chromatograms for more information Reverse and Complement The data you bring into Sequencher may not necessarily be in a particular biologically relevant orientation To change the orientation of the sequence go to View and choose Reverse amp Comp Sequenced Strand To display the original orientation go to the View menu and choose Sequenced Strand Note You will always have two copies of your data in Sequencher the original data and the edited version This duplication allows you to revert to the original data as discussed in more detail below About experimental data Creating a baseline Sequencher notes the original base sequence when you import your data so that after editing the sequence you can still discard all of your edits and revert to the baseline experimental data You can also do this for sequences that you enter from the keyboard When you close a Sequence Editor that has not had its data recorded as experimental data the program explicitly asks whether the contents of the Sequence Editor should be recorded as experimental data see Figure 8 4 Click on Record As Experimental Data if you are finished editing the sequence If you are not finished
43. click the Not Yet Finished button Figure 8 4 The Record as Experimental Data dialog box Sequencher i Are you finished entering Frag 0001 from the keyboard If you are LD finished this sequence of bases will be recorded as experimentally determined data Record As Experimental Data SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 48 Note When a sequence becomes part of a contig the data are automatically recorded as experimental data if they have not previously been stored in that form Once a version of the sequence has been recorded as experimental data all future changes are considered to be edits to it As a data quality tool all such edits can be marked in color and displayed in Bold Magenta or lower case text This feature is invoked in the View menu under the Base Edits As command The default is set to Bold Magenta but you can also choose Not Highlighted or bOLD amp cASE cHANGE Resetting a baseline If you decide that some of the original data was incorrect perhaps a simple typing error that has been corrected in the edited version of the sequence go to the Experimental Data version of the sequence by clicking the Show Experimental button and then click the New Baseline button Note Setting a new baseline for a sequence resets all of the bases to a not yet edited state None of the bases that were edited will appear in Bold Magenta letters Viewin
44. click the OK button SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 149 If you create a feature while selecting bases in the consensus of a contig you will actually create a feature for each of the sequences that contribute to the consensus at the selected bases Note If a fragment spans only a portion of the bases selected for a feature that fragment will not carry the new feature If you do not define display properties the display attributes will default to the ones you specified in the Feature Motif section of the User Preference settings See Chapter 21 Customizing Sequencher and User Preferences for more information Figure 17 7 The Mark Selection As Feature dialog box Mark Selection As Feature Feature Key variation Feature Name variation Feature Color ME Feature Style Invert Case V Underline Display C Complement e Single Strand Only Protein RNA Translation To alter default styles go to the Feature Mot panel in user preferences Cancel Showing and hiding features Go to the View menu and use Display Features to toggle feature display on and off For each individual feature you must also enable or disable its display in User Preferences See Chapter 21 Customizing Sequencher and User Preferences for more information Listing a sequence s features If you want to get a complete list of all your sequence s fea
45. features Use the Edit Features command to create a series of features which share the same name followed by a bracketed number CDS 01 CDS 02 see Figure 13 13 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 104 Figure 13 13 Creating a Joined feature Features from Reference Feature Key CDS Feature Name CDS 02 CDS 01 v Display Feature in Editors Feature Location 376 570 Which strand Complement Display Style v St Remove To alter default styles go to the Feature Motif panel in user preferences Setting the Variance Table Display Options Throughout Sequencher you can customize the display of your sequences using the commands from the View menu When creating a Variance Table you can also specify a number of display options see Table 13 1 Table 13 1 View Options Translated Translated Variance consensus Table Variance Table Variance consensus Table Variance Table Base Ambiguities Base Edits Display Color Bases Display Motifs Colors As Backgrounds Display Base Confidences Labels m Im Je Je ys In In In S O e Ir e Ir Features me me Im me O EELER mm Is Is Im To highlight variants in your data go to the View menu and click on Display Base Confidences This will add a background color to the cells in your Variance Table to SEQUENCHER User Manual 1991 2011 Gene Codes Co
46. found Assemble Interactively Candidates Matches Watch Overlap Mismatch Gaps Hew Contig Len Assembly Parameters HLA HLAO0002 Alert Jl Sory nothing matches the selected candidate within the HLA HLAODOOS parameters currently set HLA HLAO0009 HLAHLAO0010 i R HLAHLA00011 HLA HLAO0012 ATGGCCGTCATGGCGCCCCGAACCCTCCTCCTGCTACTCTCGGGGGCCCTGGCCCTGACCY 46 Candidate EE 1 10 20 30 40 50 6 When you click a possible match in the Matches list Sequencher recalculates the optimal positioning for gaps The agent then displays a message about the selected sequences and their computed overlap SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 65 The start of an overlap and the base consensus appear in the fields at the bottom of the interactive assembly box Figure 9 7 You can explore the alignment further by moving the scroll bar to the right Figure 9 7 Interactive Assembly window showing overlap Assemble Interactively Candidates Matches Overlap Mismatch Gaps New Contig Len HLA HLAOO001 HLA HLAO0004 HLAcHLAODOD2 HLA HLAO0002 HLA HLAD0004 HLA HLAO0009 iJ Overlap of 1098 bases HLA HLAOD00S HLA HLAO0018 49 bases mismatch HLA HLAO000S HLA HLAOOD17 Inserted 0 Gaps HLA HLAO0008 HLA HLAO0005 9553 of Overlap matches Contig Length of 1098 bases HLA HLAO0010 HLA HLAO000S HL HLAO0011 HL cHLA00016 HLA HLAOOD12 HLA HLAOOD 12 AT GGCCGT CAT GGCGCCCCGAACCO
47. report is a text box that automatically displays pertinent warnings In the report shown in Figure 22 1 the two mtDNA profiles disagree The bases in conflict are highlighted in yellow Also there are positions where there is data for one mtDNA profile but not for the other The base position is distinguished by a yellow highlight and the cell with the missing data has a pink X though it The cells are unshaded where the two profiles both agree and confirm each other SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 184 Figure 22 1 The Validate mtDNA Profiles window S Validate mtDNA Profiles 9 07 07 16 26 PM Export CME mtDNA Profiles Disagree Reference Profile 1 Profile 2 HV1 and HV2 Ranges Disagree Reference Sequence rCRS Profile 1 Incomplete Coverage for HV1 Name ABV HV1 Range 16298 to 16569 HV2 Range 1to 576 Profile 2 Name JK HV1 Range 16024 to 16569 HV2 Range 1to 576 Wamings Create Reports At the top of the Validate mtDNA window is a button bar Click on the Create Report button A new window called Analyst Report opens This contains a version of the Validate mtDNA report that you can send to a printer by clicking the Print button You may also save this report by clicking on the Save button Note The report includes spaces for the analysts and reviewer s signatures Export CMF You can create a CMF report by clicking on the Expor
48. rights reserved 110 Figure 13 17 The Translated Variance Table in Review Mode Cox Description 10 sequences compared to reference sequence NC_001807a 3 end of contig GWB Fragment base 16127 is selected Base 103 of 332 B Chromatograms from GWB a BG Idi 16 220 113 16 226 115 Note You cannot edit your sequences directly from the Translated Variance Table although you can do so in the ordinary Variance Table Any edits to the Translated Variance Table must be made in the Contig Editor In order to see the results of your changes in the Translated Variance Table click on the Refresh button in the button bar The Review Mode and Sister Tables You can generate a Sister Table when you are working in an individual Variance Table If you have generated a Variance Table you can see the Translated Variance Table for the same data set by going to the button bar and clicking on the Translation button see Figure 13 18 Similarly if you have already generated a Translated Variance Table you can see the Variance Table for the DNA sequences of the same data set by going to the button bar and clicking on the Bases button If you are in Review mode and have generated a Sister Table you can arrange all four windows and save the layout by going to the Window menu and choosing Remember Window Layout Then click on Double Table Variance Review from the submenu SEQUENCHER User Manual
49. s button bar You can also SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 143 display this view by going to the View menu selecting Bases Map Overview and then choosing Restriction Map from the submenu If you click between two cut sites Sequencher highlights the resulting restriction fragment Hold down the Shift key and click in the rectangle between other cut sites Your selection is extended from the original site you selected When you return to the Bases view you will see that the sequence representing the chosen region has been highlighted also See Figure 16 9 Figure 16 9 Fragment selected in map view and its equivalent bases highlighted E Contig 0001 D Bases Overview Select Enzymes Options B Contig Editor D Pa Era En RH stow Cronaoganel Reser ITCTGCAG itctctd CTGTGTGi ITCTGCAG ktctctd CTCTGTG TCTGCAG GGGG CTGTGTG 111 frag bases amp 37 consensus bases selected at consensus position 474 Select Next Ambiguous Base spacebar Copying the map After you have made a selection in the restriction map as in Figure 16 9 above you can copy the bases between the two cut sites by going to the Edit menu and choosing Copy Selection You can also copy the entire diagram for use in an illustration program by going to the Edit menu and choosing Copy As and Picture from the submenu SEQUENCHER User Manual 1991 2011 Gene Codes Corpora
50. supports confidence values from PHRED ACE and Trace Tuner files as well as SCF ABI 3730 and ESD files Importing PHRED called data Sequencher can read in the SCF FASTA and Qual files generated by PHRED and supports PHRED base calls and confidence values To view the confidence levels you must first set the user preferences for displaying confidence values See Chapter 21 Customizing Sequencher and User Preferences for more information Then go to the View menu and choose Display Base Confidences To import the data into Sequencher it must first be organized in a folder containing three subfolders These subfolders are chromat_dir which contains the trace files edit_dir which contains the text files and phd_dir which contains the quality scores Select File then go to Import and then Folder of Sequences to import the phd_dir folder To import the sequences you would like to view you may also go to the File menu and choose the Import command with the Sequences submenu Importing PHRAP Files Sequencher can import the ACE projects created by PHRAP To import an ACE project select File then go to Import and then the ACE Project submenu Browse to a file that ends with ace or ace x such as ace 1 ace 2 etc This file will typically be located in the subfolder edit_dir Select the highest numbered ACE file to account for all editing and reassembling Sequencher automatically imports and incorporates as much auxiliary da
51. the first occurrence of the search string anywhere and then selects the string in the editor Find Next finds the next occurrence after the current selection whether it is a match found by Sequencher or a selection that you marked The Find Bases dialog box stays in front until you click Done If you want to repeat the same subsequence search but without invoking the Find Bases dialog box you can go to the Select menu and use the Find Bases Again command SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 152 Refining a subsequence search You can refine any of your searches by using one of the radio buttons The default setting is Exact Matches Exact matching If you click Exact Matches the search is based on exact character matches and disregards matches that occur due to ambiguities For example AGGTW would only match AGGTW Matching ambiguous bases If you click Specified Bases the search finds only matches that are identical or more specific than the sequence you entered in the Find What field For example the string AGGTW would match AGGTT or AGGTA since W is the ambiguity code for T or A Any ambiguous match If you click Any Ambiguous Match the search will find any degenerate DNA that may match the sequence you have entered including matches consisting of all unknown bases For example AGGTW would match NNGTN or even NNNNN Finding bases by number To select a range of bases quickly
52. the reader that the gene is alcA the product of the gene is alcohol dehydrogenase and that the evidence is experimental Figure 28 2 Qualifier examples Key Location Qualifiers CDS 6006 gt 6253 gene alcA experiment experimental evidence no additional details recorded note ADHI ethanol regulon codon_start 1 product alcohol dehydrogenase I Information on the Feature Keys and Tables and Qualifiers has been taken from the official GenBank documentation which can be obtained from the following URL http www ncbi nlm nih gov collab FT index html SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 202 29 Appendix Default Feature Qualifiers In this table you will find the default Feature Qualifier for each Feature Key as implemented in Sequencher Feature Key Default Feature Qualifier Sequencher A misc_feature misc feature note 1 misc_difference misc difference note a conflict conflict compare b unsure unsure difference c old_sequence old sequence compare d variation variation note e modified_base gene modified base mod_base Allele allele Mutation mutation 2 gene gene gene 3 misc_signal gene misc signal a promoter gene promoter 1 CAAT_signal gene CAAT signal 2 TATA_signal gene TATA signal 3 35_ signal gene 35 signal 4 10_signal gene 10 signal 5 GC_signal gene GC signal b RB
53. to the new location Note Icons can be positioned on top of each other thereby obscuring one or more of them from view Cleaning up Sequencher has clean up commands to help you organize the icons in your Project Window Go to the View menu and choose the Sort Cleanup command to see a submenu SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 11 that will let you rearrange the icons according to date edited name kind size etc See Figure 3 6 Figure 3 6 Small icons sorted by name S small icons SPF E O x D Assembly Parameters Assemble Automatically Assemble Interactively Assemble to Reference Parameters Dirty Data With ReAligner 3 gap placement Min Overlap 20 Min Match 85 PS ON 46 88_123_for 4E 0D_123_for 363 FF_123_for 46E 00_456_for PS JE Sp 13 re AE 00_123_rev 46 FF_123_rev afg MORI PS 46 CC_123_for 46 EE_123_for 46 6G_123_for afg MORI E CC 133 re FS EE_123_rev AE GG_123_rew View as a list You can show your project items as a list rather than as icons Go to the View menu and choose Project Icons As and then select A List from the submenu The Project Window will look like the one shown in Figure 3 7 Figure 3 7 Project Window with list view S Untitled Project DER Assembly Parameters Assemble Automatically Assemble Interactively zz Assemble to Reference Parameters Dirty Data With ReAligner 3 gap placement bells Overlap 20 Min M
54. your sequences You will learn how to use these functions when you add sequence data to a project We discuss the various criteria you can use such as trimming off ambiguities trimming off low confidence data how to screen for specific vector contamination and how to change all the trim tools criteria to your own specifications Removing undesirable data Sequencher s trimming tools allow you to trim one sequence at a time or thousands of sequences at a time The two trimming commands under the Sequence menu are Trim Ends and Trim Vector Both the Trim dialogs share a similar interface All the sequences to be examined are collected in a single table The criteria that you set determine how much data you trim off The trimmed data are fully recoverable because Sequencher always stores two copies of every imported file the original sequence and the edited version In this way you can always use the Revert to Experimental Data command from the Sequence menu to recover the original untrimmed sequence Trimming poor quality data Selecting and trimming fragment data Select one or more fragment s in the Project Window by clicking on them Then go to the Sequence menu and choose the Trim Ends command Note that if you select contigs the Trim Ends function only affects unassembled fragments All the sequences are collected in to a single table At the top of the table is a button bar Figure 6 1 that you use to manage trim criter
55. 13 20 Extract from Individual Variance Report AA Individual Variance Reports Date July 12 2007 Project Name 1 exon HPS4 Tutorial Compare Consensus to Reference 3 exon Reference Comparison Range Unfiltered Base Positions 15 394 to 15 644 Options Large gap insertions 10 or more bases excluded Matches to ambiguous reference positions excluded V Owe Nw Vi MII VI MII Vi Sequence Name Comparison Range Coverage 128 Complete 15 394 to 15 644 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 113 Note This can produce a very long report Variance Detail Report As its name indicates the Variance Detail Report contains comprehensive information on the variants in a Variance Table Below the Common Header you will see detailed information for each of the described variants The information for each variant consists of the sample name and comparison range for the parent sequence The Detail Table is divided into two parts The first part contains the sequence name orientation confidence score where available base call primary peak base call secondary peak base call and the height of the secondary peak as a percentage of the primary peak The second part of the table contains a tracelet which is an extract of the sequence trace of the variant combined with some flanking bases see Figure 13 21 The tracelet only appears in
56. 13 27 Then click on the Reports button The Variance Table Reports dialog box appears the Selected Columns radio button is checked Choose a Report Format from the drop down menu SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 118 Select Individual Variance Reports if you want to save each column sample as a separate file Select Variance Table Report if you want the selected columns in one table Click the Save as Text button Sequencher presents you with a dialog to assign the name and location of your export You can copy your Variance Table into another document or application that accepts tabbed text To paste the contents of your Variance Table Report in the document of your choice click on the Copy as Text button The data can now be pasted into its new location Figure 13 27 Variance Table with discontinuous range of columns selected Sequencher Compare Consensus to Reference D File Edit Select Contig Sequence View Window Help Options Comparison Range Description 8 consensus sequences compared to Reference 3 exon Reference Comparison Range Unfiltered Base Positions 15394 15644 Display Options Large gap insertions 10 or more bases included Matches to ambiguous reference positions excluded e KR EECH RRE Total Ew AF o gt Se E EAN La an on an cn cn on an on olin ey eed ees 0 ojojo o ea 4 03 Sjaj s G R R Exportin
57. 13 7 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 100 Figure 13 7 Translated Variance Table for consensus sequences from different contigs S Compare Translation to Reference E ml x Description 5 consensus sequences compared to reference sequence NC_OOT8O07a 3 end Comparison Range Unfiltered Base Positions 15984 16571 Amino Acid Positions 1 229 ATC ACC T Setting the Variance Table Conditions Restricting the Comparison Range You can restrict the Comparison Range by Feature Key if you have any Features in your exemplar Construct a Variance Table then go to the button bar and click on the Comparison Range button The Comparison Range dialog window appears see Figure 13 8 In the default setting the Unfiltered radio button is checked Leave this radio button checked if you decide to leave your table unfiltered If you want to restrict the Comparison Range click the Filter by radio button Once you have clicked the Filter by radio button the Feature Key drop down menu becomes active You can now choose which Filter Key to use by making a selection from this menu Once you have selected a Feature Key you will see a list of features annotated with that key in the pane located below the Feature Key drop down menu You can select a single feature a continuous range or a discontinuous range of features You can also select all the features within this pane by c
58. 173 kabel amp Name eavededdia ied id ted tladta had ial dita lia tlie tlt alittle dere 174 Me eer ecrprre rere ere ener pe ceprre ne eelere 174 SOUNG EE 175 KEE EE 175 Displayen etare nnen Sesaee ge e Se d a E a e aa aa a EEEE 176 ERT E 176 Contig Chromatograms ees EENS EEN 177 CONG pcs ee ee ce ih Ae eo A E ee 178 EE eendeitege egal iesse EE E ofc 179 Format E 180 Start DO REESEN e 181 teil e EE 182 Pee Kr EE 182 REPOME GE 183 22 FORENSIC FEATURES siciisctisctiscdicctisctienadcutiaumiacnacceiaseasuusauetsehitceiauancdads 184 Working with mtDNA Promles iisstadion ea ea A E E EAT 184 Validating mitochondrial DNA profiles AE 184 Create Reports E 185 Export CMB so eet 185 Purther Commands Or EE 186 Set Circ lar Genome S enone nren a tege geseet ker 186 How to mark a sequence as a Reference Sequence 0 ieee eeseeeeeeseeeeseeeeseesesesacseeseseeseecseeeeseeseseeaeeeeas 187 New Project From Templaten aneen arae AO verdad taalerd endian ben egener See 187 Display secondary peak iesietent she asenenatien ani nlanienhinda niet enanienvitin ENTOTE 187 23 APPENDIX KEYBOARD SHORTCUTS ccccccsseeeeseeeeeeeeeeeeeeeeeeeneeeees 189 24 APPENDIX ADVANCED EXPRSSIONS cccccssseeeeseeeeeeeeeeeeeeeeeeseneees 191 25 APPENDIX IUPAC IUB STANDARD CODE ccss eeeeeeeeeseeeeeeeeeees 193 26 APPENDIX STANFORD EXTENDED CODE eeeeceeeeeeeeeeeeseeeees 194 27 APPENDIX DEFAULT FEATURE STYLES
59. 24 View bw SUMIM ALY eege NEEN geleed 124 Summary Teporbidisplay Options eler gelee eelere 125 I Colo EE 125 Bullets Pluses and Dashes cseiiviciecsvcestesshtestesvstan tea tea hes EENS 126 VE DEE 126 Lage UE 126 Matching Bases ag Dashess 2a2 250i genee Be bat oie Batak oda ta O 126 Protein translations and the Summary Report 126 Enabling the protein displays cic scsi seivesesceecciss eb sven cede seas eaa GEARR RRR RRR REER RA 127 Protein translations for the Consensus or Reference Sequence 0 ei eeeeesseeseeseeeeseeeeseeeeseeeeseeseeeseeeens 127 Protein translations for Fragment e ec eesceeseeseeeeseeeeseeeeseeecsesseseeseseescacseesesesscseeseseeseseesesasseeseseeaseeeas 127 Matching Proteins As Dashes eise ee 127 Using the formatting Merron rs cae etn eileen ENEE 128 Reverse and Complement a contig oo cesceesesseseeseeeeseeeeseesesessesessesesseseeseesesesacsesseseesesecseeesseeseseeaseeeas 128 15 GRROMATOGRANIS oseegeeeueueukENegugAgENeVENEgEgEeSEEEEEENEKEEEEEeEgEENE NENNEN 129 Working with automated sequencer data i cc s isssssscieccestedeseadancsceestedesesasncncanateseseesveens 129 Viewing chromatograms from a Sequence Editor eeseeseeseeeeseeeeseeeeseesesecaeseeseeesesesseeeeseesseeaeeeeas 129 Viewing chromatograms from a Contig editor oo eee eeeseeseseeseeeseeeeseeeseescsesaesecseseeseeeseeseseeaeseeaeeeeas 130 Understanding the Trace Display ees 131 Display Secondary peak eege geess ee EEN 132 Select
60. 6 This lets you select items in the Project Window where the search string is based on only part of the complete name Figure 18 6 Contain Items With Names Containing dialog box All Items That Contain A Fragment Name Containing Find What Case Sensitive 0 items selected out of Select All Matches Deselect All Matches Select None Done Have Chromatograms If you want to find out which fragments have a chromatogram or which contigs contain fragments with chromatograms go to the Select menu and choose All Items That and then Have Chromatograms Have Comments Containing If you want to select all the sequences in the Project Window whose comments contain a specific text string go to the Select menu and choose All Items That and then Have Comments Containing to display a dialog box Figure 18 7 Figure 18 7 Have Comments Containing dialog box All Items Whose Comments Contain Find What Case Sensitive 0 items selected out of 8 Select All Matches Deselect All Matches Select None Done Have Labels Containing If you want to select all the sequences in the Project Window whose label contains a specific text string go to the Select menu and choose All Items That and then Have Labels Containing to display a dialog box Figure 18 8 Figure 18 8 Have Labels Containing dialog box All Items Whose Labels Contain Find What IT Case Sensitive 0 items selected out of 3
61. 8 37 PM J10_74969_0045E7 SNPS8R_ Pt 670 BPs Fri Dc 16 2005 4 06 10 PM Fererence 550 BPs Tue Dc 13 2005 4 39 43 PM J10 74969 OO4SEISHPSBF PI 667 BPs Fri Dec 16 2005 4 06 70 PM Viewing the information for a sequence or contig The Get Info command displays additional information about a selected or open sequence or contig To view additional information about an open sequence or a contig editing window go to the File menu and choose Get Info If you do not have an open sequence or contig editing window you can select the icons of your sequences or contigs from the Project Window and then go to the File menu and choose Get Info Figure 3 10 The Get Info window Get Info JO4_14210_0045E1SNP88R_P1 E J04_14210_0046E1SNP88R_P1 Kind AutoSeq Frag ABI Size 824 BPs Where In contig 14210 Original Imported from disk Created Tue Feb 24 2004 9 49 AM Modified Fri Dec 16 2005 4 06 PM Version Modifiable Comments Base Count 226 As 240 Ts 157 Cs 171 Gs 30 Ambigs D Edits 0 Gaps Confidence 270 Low 255 Med 299 High Show ABI Info SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 14 If you have sequences from an ABI sequencing system Sequencher can display information from the ABI data sheet In the Get Info window shown in Figure 3 10 note the Show ABI Info button Editing the information for a sequence or contig You can annotate the sequence b
62. 95_0045E1SNP83R_P1 738 BPs i Dec 16 2005 4 06 10 PM 2 J14_15135_0045E1SNP88F_P1 662 BPs i Dec 16 2005 4 06 10 PM 2 J14_15135_0045E1SNP88R_P1 752 BPs i Dec 16 2005 4 06 10 PM J16_15155_0045E1SNPS88F_P1 663 BPs i Dec 16 2005 4 06 10 PM 2 316_15155_0045E1SNP88R_P1 673 BPs i Dec 16 2005 4 06 10 PM A J18_15239_0045E1SNPS88F_P1 664 BPs i Dec 16 2005 4 06 10 PM J18_15239_0045E1SNP83R_P1 727 BPs i Dec 16 2005 4 06 10 PM 2 J20_15303_0045E1SNP88F_P1 661 BPs i Dec 16 2005 4 06 10 PM 3 J20_15303_0045E1SNP88R_P1 668 BPs i Dec 16 2005 4 06 10 PM Selecting using menu commands Selecting all icons You can change the icons selected in the Project Window by going to the Select menu and clicking on the Select All Select None and the Invert Selection commands Selecting all items containing sequence To select all items in the Project Window that contain a particular subsequence go to the Select menu and choose All Items That command Then select Contain Subsequence from the submenu Sequencher displays a dialog box that lets you type two sequences the box has buttons to specify how you want the sequences matched to items in the Project Window You can use the drop down menus to specify restriction enzyme recognition sequences Finding assembled sequences To locate data assembled in a contig go to the Select menu and click on one of the following options from All Items That then Contain Subsequence or C
63. A A A For more details on regular expressions see Appendix 24 Advanced Expression and the tutorials in the Tutorials folder Setting your Assembly Handle Names Starting in the Project Window click on the Assembly Parameters button At the bottom of the Assembly Parameters window click on the Name Settings button The Assemble by Name Settings window appears as in Figure 10 4 You can describe up to eight Assembly Handles of which only one is active at any one time For each Assembly Handle you can use the default name Sequencher automatically gives your contigs or you can type a descriptive title into the text box Click on the radio SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 71 button to the left of the handle you wish to activate Sequencher displays the number and the name of the Active Handle at the bottom of this dialog Click on the OK button to dismiss this window and return to the Assembly Parameters window Figure 10 4 The Assemble by Names Settings window Assemble by Name Settings Name Delimiters Advanced Expression v Define Assembly Handles 1 fOrigin 2 Clone C 3 Date 4 Primer C 5 Handle5 C 6 Handle6 C Handle C 8 Handle8 Active Handle 2 Clone Cancel Choosing a New Assembly Handle There are two ways to choose an Assembly Handle If you are in the Assembly Parameters window you can change the handle by
64. ATC 46 Ji l e rp o o H The table header indicates how the Comparison Range was restricted For some Feature Keys the actual feature is composed of several components joined together see Figure 13 9 The Base Positions indicate the exact range of bases used for each component Features which share the same Feature Key such as CDS and which are joined the Feature has a Join qualifier will appear in the Feature Listing as CDS 01 CDS 02 The same Feature will appear in the Translation Range dialog window as a single item see Figure 13 11 See Chapter 17 Motifs and Features and Appendix 28 Feature Keys and Qualifiers for more information SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 103 Figure 13 11 Feature key with more then one component Translation Range C Translate Entire Sequence Feature Key misc_structure x misc structure 3 features Select All Cancel If you prefer to have each component of a Joined Feature as a separate item you must give them different names You can do this by editing the Feature using the Edit Features command see figure below Figure 13 12 Appearance of features once they have been renamed Translation Range C Translate Entire Sequence Feature Key misc_structure x Fred 131 150 Ginger 291 340 mise structure 391 400 Select All Cancel You can also create your own joined
65. BPs Guto Zeg Frag Interesting FS AA 466_rew 591 BPs Auto Seq Frag Interesting AE BB_123_for 589 BPs Auto Seq Frag Mutations AE BB_123_rev 594 BPs Auto Seq Frag Mutations AE CC_123_for 581 BPs Auto Seq Frag Mutations AE CC_123_rew 594 BPs futo Seq Frag Mutations FS FF_123_rev 592 BPs Futo Seq Frag Mutations FS GG_123_for 590 BPs Futo Seq Frag Mutations DS DD 3 tor 583 BPs Futo Seq Frag Resequence AE DD 23 re 594 BPs Futo Seq Frag Resequence PS EE_123_for 591 BPs Auto Seq Frag Resequence Gf EE_123_rev 594 BPs futo Seq Frag Resequence Saving your work Saving your Project To save a project go to the File menu and choose Save Project which saves all your changes to the current project file When you use the Save Project command for a new project Sequencher will prompt you to name your project with the Save As dialog Enter a name for the new project in the File name field then click Save Auto Save If you are working on complex projects you may find it helpful to have Sequencher automatically save your work at pre set time intervals You will need to enable the Auto Save user preference from User Preferences in the Window menu See the Chapter 21 Customizing Sequencher and User Preferences for more details Saving a duplicate of a current project If you wish to create a second copy of a current project you can use the Save Project As command to create the new copy of your project with a different name When yo
66. Boolean and search using two specifiers for the subsequence Enter the first specifier in the Contains field and the second in the And field There are also two drop down menus to assist you in the search for specific restriction enzyme sites Figure 18 4 Contain subsequence dialog box Add To Selection All Items Containing C Exact Matches e Specified Bases C Any Ambiguous Base Contains 1 items selected out of 8 Select All Matches Select None Reverse amp Comp Deselect All Matches Done Contain Items Named If you want to find which contig contains a specific fragment go to the Select menu and choose All Items That and then Contain Items Named to display a dialog box Figure 18 5 This lets you select items in the Project Window that contain a fragment bearing the specified name Figure 18 5 Contain Items Named dialog box All Items That Contain A Fragment Named Find What I Case Sensitive 0 items selected out of 8 Select All Matches Deselect All Matches Select None Done Contain Items With Names Containing If you are looking for several fragments with related or similar names and need to check which contig or refrigerator they are in go to the Select menu and choose All Items That Click on the submenu option Contain Items With Names Containing to display SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 155 a dialog box Figure 18
67. C AGG CCA TAA GGT CGC TCA CAG AGC GGC TTT ACG TCG GGG GTC CCC TGA CNA ATT CAA AAG GGC AGA ACT Frame 3 In the example shown in Figure 8 9 a click between a start and a stop codon of the first reading frame has selected the corresponding bases in the editor Note that User Preferences have been set to highlight ORFS of more than 100 bases There is one such ORF in Frame 2 marked with a hash pattern To dismiss the map click on the Codon Map button again Figure 8 9 Sequence Editor with codon map E Quality four Dan ep Pue RR Emtee Residue 88 to 192 D Low Confidence 0 0 D Ambiguous 0 0 Frame 1 Frame 2 a Frame 3 5 r3 NNN NNN GGN NNN NNN NCT CTC CTG GNN NCG CCT GGT GAA GAN NAN ACC CCA TGG GGT TCT d TT C CC GTG TTG sT CAA T G T TGC CGC TCA ACG TCG TGA CNA ATT CAA AAG GGC AGA ACT Looking at Restriction Maps You can look at a restriction map for the sequence by clicking the Cut Map button at the top of the Sequence Editor A restriction map replaces the sequence in the window See Chapter 16 Restriction Maps for a general discussion of restriction maps SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 52 Sequencher can add a pane showing the restriction map for a sequence into the same window as the sequence itself At the top of the Sequence Editor is a button showing an icon for a staggered enzyme cut shown in F
68. CGGGGAC TCTACCTGCTCCCCAACC gatnacg Trim 18 five prime bases 11 Ambigs SK iz Trim 7 three prime bases 1 Ambigs d Quality four Has Chromatogram 674 good bases 4 Ambigs AnAnnnggnnnnnnnnetctectygnnn CGGGTCGAGTICCTGGTG CCTTATGGGCAAGGATGA ttctgatgactcacttcttctecatcc Trim 28 five prime bases 17 Ambigs SK iz Trim 92 three prime bases 3 Ambigs 3 Quality nine Has Chromatogram 634 good bases 0 Ambigs Annnectnnnnngnaggagtaggeace ACATGAACCAGCGGGGTC GGAGGAGCATGATGTTAT cagctccaagtcgtattctggtcgggg Trim 42 five prime bases 20 Ambigs 5H Hz Trim 120 three prime bases 3 Ambigs Sorting items In order to assist you with the review of your data prior to trimming you can sort the fragments Click on the Sort Items button at the top of the Ends Trimming window to order the fragments being trimmed Figure 6 5 shows the sort criteria Figure 6 5 Sort Fragments dialog box Sort Fragments by Fragment Name ees by Items Checked For Trimming C by Original Sequence Length C by Amount To Be Trimmed by Length Of Fragment After Trimming Executing the trim Once you have reviewed your data and are happy with the criteria you have set you proceed with the trim by clicking on the Trim Checked Items button Sequencher warns you that this command cannot be reversed with the Undo command To proceed click the Trim button Any red lines or bases will disappear from the Ends Trimming window Now you
69. C_000022 of contig Comparison Contig Comparison Range Filtered on CDS plus 0 flanking bases Base Positions 2093 2136 4503 4590 6719 6864 10918 11023 11437 11555 A aeo CZ i 152 3815 A E 32490 BPs DNA Fragment fl 19863 c Passed ojr ass o _ asa o _ as o _ am o aso o aa o s30 e nme o E Pe r E WT ET te D O Olo oO o o Setting the Consensus Calculation for a Variance Table You can generate a Variance Table where the exemplar is a consensus sequence rather then the Reference Sequence by using the Compare Bases To command from the Sequence menu You generate a consensus Variance Table using the Compare Consensus to Reference command from the Contig menu SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 106 In either case the method you choose to calculate your consensus can affect your results Generally using the command Consensus Inclusively will report more variants than Consensus by Plurality Sequencher recalculates the consensus on the fly so if you change the methods you are using to compute your consensus you can reconstruct the Variance Table quickly Just click on the Refresh button in the button bar at the top of the Variance Table to display the results of the new consensus calculation For more information on consensus calculations see Chapter 11 The Contig Editor Variance Table Options If you need to chang
70. ER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 168 Figure 21 1 Ambiguity editor Ambiguity Editor h Load Ambiguities IUPACAUB Ambiguity Set Adenine Color Green e Character to Display Keystroke 1 Ia Keystroke 2 cm I Extended Ambiguity Set Jase the Clear All command in the Edit menu to clear all the key definitions Cancel Each base and each ambiguity has a display character which is drawn on the screen and sent to the printer to represent that base or ambiguity The display character interprets ASCII text files imported into Sequencher Each code is also assigned a color which can be displayed in fragments or contigs by going to the View menu and clicking on Display Color Bases Select a base or ambiguity from the scrolling list at the left by clicking on its name When the item is highlighted information about it appears on the right side of the dialog box To change the display character click the box to highlight it Replacing an existing character keystroke code Each base code can be assigned up to two alternate keys from the keyboard Using either of these keys generates the desired base code when using the sequence or contig editors These keystroke settings do not affect importing of data To change a keystroke setting for a base code select the base code from the list on the left of the window Click on one of the keyboard keys to highlight it and then press the
71. GCCATTCAGGCTGCGCAACTGTTGGGA AGGGCGATCGEGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGat AGGGCGATCGGTECGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGat 42 anarci gTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGT gTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGT 4 anarce TAAAACGACGGCCAGTGAATTGTAATACGACTCACTATAGGGCGAATTGGG x Blannen TGTAAAACGACGGCCAGTGAATTGTAATACGACTCACTATAGGGCGAATTGGG TGTAAAACGACGGCCAGTGAATTGTAATACGACTCACTATAGGGCGAATTGGG E EE Eanarca gt 1 gt GNTGCTCGAGCGGCCECCANTGTGatgGATATCTGCAGAATT k P 4Baarce 52 CCCTCTAGatgCatgCTCGAGCGGCCECCAGTGTGat gGATATCTGCAGAATT 42 anarci 425 COCTCTAGatgCatgCTCGAGCGGCCGECCAGTGTGat gGATATCTGCAGAATT SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 166 21 Customizing Sequencher and User Preferences In this chapter we discuss how to set your view preferences such as colors custom codes features and motifs and the genetic codes you want to use We also discuss how to set your user preferences such as confidence scores labels and names assembly parameters and how you want to display chromatograms You will learn that all of these preferences can be saved as individual or group preferences View Preferences Sequencher offers you a number of options for specifying how you want to view your data in the sequence and contig editors These options can provide visual cues which help to differentiate your data Ambiguous bases Sequencher can u
72. K AAG AAA GAG AGC ATC CTG CCT GGG CAA CAT AGT GAA ACC CTG CCT K K E 5 I L P G Q H S E T L P SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 54 The sequence margins you set with the formatting ruler are not the same as your absolute print margins which allow room at the left to show the first base number of a line To change the absolute limits go to the File menu and choose Set Header amp Footer and enter new values into the Margin boxes or drag the triangular markers just under the Translation and Base Grouping icons as shown in Figure 8 13 above Translation On the left just under the word Residue and above the ruler is a set of three small icons that let you specify sequence representation single stranded double stranded or single stranded with a protein translation If you select the third translation icon the numbered button at the right controls which frame or whether all three frames will be displayed You can also use the translation commands in the Translation submenu of the View menu Single Stranded Double Stranded Protein 1st Frame Protein 2nd Frame Protein 3rd Frame or Protein All 3 Frames see Figure 8 14 Figure 8 14 Translation icons on the Formatting ruler im SE EBB BIE I Io IGN Font Di i gt SR 3 1 4H 4 15 See Chapter 21 Customizing Sequencher and User Preferences for information on editing the genetic code or using
73. Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 120 If you prefer to copy your Translated Variance Table to another document or application which accepts tabbed text click on the Copy as Text button The data can now be pasted into its new location Exporting selected sample sequences Each column in the table represents a sequence or consensus If you need to export specific columns in the table make your selection by using Shift click for a continuous range or Ctrl click for a discontinuous range see Figure 13 29 Then click on the Reports button The Translated Variance Table Reports dialog box appears see figure above A radio button called Selected Columns will be checked Figure 13 29 Discontinuous selection of columns O Compare Translation to Reference E EI EN Description 12 consensus sequences compared to Reference 3 exon Reference Comparison Range Unfiltered Base Positions 15394 15644 Amino Acid Positions 1 83 See Le Te Kette GR GST GCT GCT D A AAG AAG AAG TAT TAT TAT ee y F AGT GET A AAG K G K K K T A Z 415 Choose one of the three enabled buttons in the Translated Variance Table Reports dialog box Cancel Copy as Text and Save as Text Save as Text is the default setting If you click on the Save as Text button Sequencher presents you with a dialog to assign the name and location of your export The report is saved in a tabbed text format
74. Ot W 1M OW ee dee dE EA dee 23 E Wu CO EE 23 5 ORGANIZING PROJECT VIEWS cc ccccccsceeesseneeeeeeeeeeeeeeeneeeeeeeeeeeseeeeeeeeenee 25 Working with icons WE 25 Changing between project VieWS eege SEENEN 23 EEn Uer 25 E Ee EE He Selecting Sequences and Bn 26 Selecting with the MOUSE meena ra a A T ireland etre eeivin tein aaa 26 Selecting by ty pine nrinn et ari a GENEEN EOE ROTERER 26 Selecting multiple items with a Marquee cesceseeseeseeseeseeseeseeseeseeseeseeseeseeseeseeseeseeseeaeeseeseeseeseeaeeaeeaeeaes 26 Selecting using menu commands eege eat Net 27 GEDEELT A AEE E EET 27 Selecting all EE RE UE 27 Finding ass mbled Sequences eer dE EE 27 iv ru DER EE 28 Renaming Using the Mouse een ee Ae ie ee eal ele Ia A OEMS he 28 Renaming sng the MenWiaeuescssndaudddddndvaddduwdiaddiidadtaddddaddanddiwdnaddadadiewe 28 Labeling items in the Project WindOWaauaddcndgudiid cine uadiaaddanaad lei ae 28 pe RIGID TAD 50 ME deed 29 Savine your Projectors chee eA EE AOO RR R AA AA ET A E A es 29 a UONE NA A ER E ee Eet EE E E E E 29 Saving a duplicate of a Current Project iiri rnanan EAE E E ERARA RARA EEEE 29 Reverting to the saved version of the project cece sesessesseseeeeseeecseeeesesscseesesecscecseeeeseseesesassesaeeeaeeeeas 29 Closing the Project WNdoW eegene 30 Exporting datano ai a aA R ee ENEE SE 30 Exiting th pro gratin E 30 6 PREPARING YOUR DATA FOR ASSEMBLY sscssseeeeeeeeeeeeeeeeeeeeeeees 31
75. Print Trace in One Page This command appears within the File menu and allows you to print an individual trace so that the entire sequence fits on one page Printing in detail Most windows that contain data can be printed To print select a window by bringing it to the front Then go to the File menu and choose Print and enter your print specifications Setting header footer and margin options Specify header footer and margins for any printout by going to the File menu and choosing Set Header amp Footer You enter the text of the header and footer and a code style for elements into a dialog box Figure 20 1 that Sequencher will automatically increase incrementally The code elements are Date Time Page Total Pages SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 164 Project Name and User Name You can also add your own free text in either the header or the footer Headers and footers are supported in all reports Set the margins by typing appropriate numbers into the Left Right Top Bottom and Gutter boxes in the Margin pane Remember not to set margins smaller than those your printer can handle Figure 20 1 Setting margins for printouts Set Header Footer amp Margins Header Sequencher tm Project Name Foote Deel Page Pageti ot TotalPages Margin Style Left man Report Right asin Book Top 0 5 in o C Poster Choose a printer Bottom D I in
76. Project From Template command allows you to open a new project containing all the sequences settings and preferences associated with your chosen template Go to the File menu and click on New Project From Template Select a template from the submenu A new blank Project Window will open Import from Template You can add preferences settings and sequences to an existing project by importing a template If you have an open project go to the File menu and click on the Import command and then choose From Template Select a template from the submenu The settings and or sequences from the template will be applied to your open project automatically SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 16 4 Importing data In this chapter we will discuss how to import data from automated sequencing equipment and other programs into Sequencher projects The different techniques we will discuss include dragging and dropping copy and paste and using import commands We also explain how to deal with projects in specific formats how to import confidence scores and how to remove unwanted sequences and contigs from a project Quick and easy importing of data Dragging and dropping files You can drag files from the desktop right into the Project Window and Sequencher will import and load those files Sequencher will alert you if it cannot read and load the files If you drag sequences onto the Sequencher icon on
77. Remoymeg undesirable data E 31 Trimming poor q ality Mata EE 31 Selecting and trimming fragment data 31 Performing a trim with default settings 00 eee eseseeseeseeseeseeseeseeseeseeseeseeseeseeseeseeseeseeaeeaeeseeaeeaeeaeeaeeaeeaeeaeees 32 Automatically selecting sequences for ends triMming 0 eee eee ese cece eseeeseeeeseeeseeeeseeseseesesesaesesaeeeeas 32 How to set the Ends Trimming criteria eesceceseeseseeseseeseeeseeecseeecsessesesacsesseeeseeeseseesesseseeaseeeaeeeeas 32 Using Contidencesto TUM Em Svecsiseets iesscsszheiesssaigleag ganiesd eos lacd soalecte eas laesesigeged EE 33 Reviewing the Trim ee 33 SOINE Items 268i eet ee tes ee Rees a RA etn SAR ten Aes en Ren a RA 34 Set lr RE 34 Trimming vector contamination EE 35 Selecting sequences for Vector Trim eoa Ea RAEE E E Aia 35 Specifying a vector using VecBaErecocuio onena a RAA A O EA T AA E 35 Sp cifyine GEN LEE 37 Saving and load ne e EE 37 setting Vector Trimming Criteriatec ntecaieitadalelatedelivewlednitintedaeelytadal elated 38 How Sequencher screens Sequences sin ve eae cadena eege ege deet deed eege 39 Executing the trim command Ee 39 7 JHE REFERENCE SEQUENCE piicscisscccistisesiastoamnccuraantavteasneaemugnmanteamnagemi 40 Whats Reference Sequence erone ee 40 Why use a Reference Sequence 20 nioi iaaa aAa RR RARER dele onsite 40 Reference Sequence e EE 40 Basic Reference Sequence properties oo ee ceeseeeeseeseseeseseseseeseseeseeecseeeesessesesacsesacse
78. S gene RBS c polyA_ signal gene polyA signal d enhancer gene enhancer e attenuator gene attenuator SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 203 Feature Key Default Feature Qualifier f terminator g rep_origin h oriT 4 misc_RNA mRNA g 1 sig_peptide 2 transit Deptide 3 mat_peptide h intron i polyA_site j rRNA k tRNA 1 seRNA m snRNA n snoRNA 5 Immunogobulin related a C_region c J_segment d N_region e S_region f g V_segment 6 repeat_region a repeat_unit b LTR c satellite Transposon 7 misc_binding a primer_bind b STS c protein_bind Primer 8 misc_recomb a iDNA 9 misc_ structure gene terminator gene replication origin gene oriT gene misc RNA gene primary transcript gene precursor RNA gene MRNA gene 5 clip gene 3 clip gene 5 UTR gene 3 UTR gene exon gene CDS gene signal peptide gene transit peptide gene mature peptide gene intron gene polyA site gene rRNA gene tRNA gene scRNA gene snRNA gene snoRNA Immunogobulin related gene C region gene D segment gene J segment gene N region gene S region gene V region gene V segment gene rpt_type repeat region rpt_family gene rpt_type repeat unit gene LTR gene rpt_type satellite transposon misc binding of bound_m
79. Sequencher User Manual The complete software solution for sequencing DNA Windows Gene Codes Corporation THIS IS AN AGREEMENT BETWEEN YOU THE END USER AND US GENE CODES BY OPENING THE SEALED DISK PACKAGE YOU ARE AGREEING TO BE BOUND BY THE TERMS OF THIS AGREEMENT IF YOU DO NOT AGREE TO THESE TERMS PROMPTLY RETURN THE UNOPENED DISK PACKAGE AND ALL ACCOMPANYING MATERIALS TO US OR TO YOUR DEALER FOR A REFUND License We grant you the right to use the enclosed software program the Software on a single terminal connected to a single computer The Software is accompanied by a hardware locking device that prevents its full use on any computer without the device if the Software is used without the locking device it operates in demonstration mode only You may not use the Software on a network without obtaining a license for each terminal on which the Software will be used You may not use the Software for any purpose related to the use development analysis testing or research on biological weapons Copyright The Software is protected by United States copyright laws and international treaty provisions You must treat the Software like any other copyrighted material such as a book or musical recording You may however either a make copies of the Software for backup or archival purposes b transfer the Software to a single hard disk provided you keep the original or c make copies of the Software to give to others for purposes o
80. T CCT CCT GCTACTCTCOGGGGCCCTGGCCOCTGACCY yf Candidate AT GGCCGTCATGGCGCCCCGAACCCTCGTCCTGCTACTCTCGOGGGGCTCTGGCCCTGACCY AE Match 1 10 20 30 40 50 6 Assemble If you choose to create this assembly click on the Assemble button Name the new contig in the Set Name dialog window and click OK Sequencher takes only a short time to assemble or reassemble a contig and the name of the new contig now appears at the top of the Candidates list Note that as a sequence is assembled its name is removed from the Candidates list Figure 9 8 The new contig displayed in the Assemble Interactively Window Assemble Interactively Candidates Matches Overlap Mismatch Gaps Rew Contig Len gt HL HLA00017 Assembly Parameters HLAcHLAOD002 HLA HLAO0004 HLA HLAOD00S HLA HLAO000S HLA HLAOD00S HLA HLAO0009 HLA HLAO0009 HLA HLAO0016 HL amp HLAOO010 HL HLAO0012 HLA HLAO0011 HL HLAO014 HLA HLA00012 f HLAHLA00020 ATGGCCGTCATGGCGCCCCGAACCCTCSTCCTGCTACTCTCGGGGGCYCTGGCCCTGACCY E Candidate EE 1 10 20 30 40 50 6 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 66 The new contig will appear in the Project Window when you close the Assemble Interactively dialog by clicking on the Done button Changing parameters for an interactive assembly If you suspect that your fragments should form a contig but Sequencher cannot suggest candidate matches you can change the parameters for assembly To do this whi
81. Table Name Input Output Page Setup d File Import Date Page Page of Total Pages You can specify headers and footers on reports generated from any window in which you are working Any text you type into the field above the page layout will appear in the header and text you type in the field below the layout will appear in the footer For variable elements date time page and total number of pages enclose the word typed with an initial capital in square brackets as shown in Figure 21 20 above SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 183 22 Forensic Features If you have the forensic version of Sequencher you have access to special functions for DNA based identification In this chapter we describe how you can validate mtDNA create reports and export your results Working with mtDNA profiles Validating mitochondrial DNA profiles The Validate mtDNA Profiles command allows you to compare the results of separate analysts Select two contigs which have been assembled to the same Reference Sequence Go to the Contig menu and click on the Validate mtDNA Profiles item This command creates a report in a new window The title bar includes a date and time stamp as shown in Figure 22 1 below The report includes the name of the Reference Sequence used the name of the contigs being examined and the HV1 and HV2 ranges Underneath the main body of the
82. The Variance Table also displays the total number of differences for each position to the right of each row There are two Total buttons at the bottom left and top right corners of the Variance Table These act as Sort buttons SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 107 The Total button on the bottom left of the table sorts the sample columns so that the samples with the greatest number of differences move to the left Click Total again and the columns sort in the opposite order There are two ways to define the sort order for the rows You can sort by total number of differences or you can sort by 5 3 order In the default unsorted display of the Variance Table rows are sorted by base position in ascending 5 3 order When you click on Total in the top right corner of the Variance Table the rows are rearranged so that those with the greatest number of differences sort to the top Click the Total button again to reverse the order To restore the order by base position click on the button in the top left corner of the Variance Table Click here again to have the rows sort by position in descending 3 5 order Note The text on the sort button is context sensitive and will change depending on the Compare Bases To command used to generate the Variance Table Removing columns from the Variance Table As you start to work with your Variance Table you may decide that some se
83. The New Project From Template command allows you to open a new project containing all the sequences settings and preferences associated with your chosen template Go to the File menu and click on New Project From Template Select a template from the submenu A new blank Project Window will open Note A template called rCRS is provided This contains the Revised Cambridge Reference Sequence and is the default template for this command For more information on Templates see Chapter 3 The Project Window There is a tutorial on analyzing mtDNA with Sequencher in the Tutorials folder Display secondary peak You can identify heterozygotes by identifying the second highest peak beneath the primary peak If you have a particular candidate for a heterozygote first open the chromatogram for the individual sequence then click on the base call s Go to the Sequence menu and choose Call Secondary Peaks A dialog box lets you specify how the secondary peak should be called The slider lets you specify how significant the second highest peak must be to generate a change SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 187 If you want Ns to be replaced then select the Allow Ns to be replaced checkbox If you ve already edited bases by hand and don t want them changed you must deselect the checkbox called Allow edited bases to be replaced If you select Only make changes that result in an ambig
84. UENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 95 The range of bases included in a Variance Table is called the Comparison Range The length of the exemplar determines the default Comparison Range and sets the base numbering The extent to which the exemplar matches any sample sequence is called the Coverage Range The Coverage Range can be complete full range or incomplete If the range is full the entire length of the sample sequence has been compared to the entire length of the exemplar sequence If the range is listed as incomplete the display of the sample sequence comparison runs along only part of the exemplar sequence If a sequence extends the full length of the Comparison Range its column header will be shaded in gray If a sequence in the Table does not extend the full length of the Comparison Range its column header will be shaded in pink Figure 13 2 Components of a Variance Table Description 5 sequences compared to Reference Reference Comparison Range Unfiltered Base Positions 1 856 Display Options Large gap insertions 10 or more bases excluded Matches to ambiguous reference positions excluded EC a a CAE EEN Sort Rows by Number of Variants Sort by Position of Variants a 6 Shading Reflects Confidence of Base Call A 356 c ez 6 Resize Column S Sort Columns by Number of Variants Posi
85. VecBase File button in the Vector Insertion Sites window Sequencher displays a list of the vectors Select the vector you want and click the Select button Figure 6 6 Selecting a vector Vector Selection Change Current VecBase File Current File 3 Vectors from VecBase TXT Vectors BluekSm M13mp8 DLC ie Cancel The polylinker region of the vector you select is displayed in the window shown in Figure 6 7 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 35 Figure 6 7 The polylinker window Polylinker Prime Other Strand Select All SacI Bet 5 GTTGTABAACGACGGECAGTGAATTGTAATACGACTCACTATAGGGC GAATTGGAGCTC CACCEGGCTCE SacII XmalIT Spel Smal EcoRI HindIII Sall CGGCCECTCTAGAACTAGTGGATCCCCCGGGETGCAGGAATTC GATATCAAGCTIATC GATACCCTCGAC NotI Xbal Bank PstI Ec okt Clal Drall KpnI qre GAGEGGCGGE QCGGCTACCCAGCTITTGTICCCTITAGCTGAGGETTAATTCCGAGCTIGGCGTAATCA Si Xhol amp pal a Bluek Gm Cancel Note Not all vectors in VecBase have polylinker information Sequencher cannot work with VecBase vectors that do not include polylinker data Add these manually see below Click the enzyme restriction site s where your sequence fragment was inserted into the vector The name of a selected site s changes to green and is boxed If you change your mind and decide on a different site you must click the selected site again to deselect it It will then return to its or
86. WB_2001 05_PrimertF CGGTACCATAAA 3 A GWB _07061948_Primer1F CGGTIACCATAAAT d A_GWB_2001 05_PrimerF 2 A_GWB_07061948_Primer3F 3 A_GWB_2001 05_Primer2R 3 A_GWB_2001 05_Primer4F 3 A GWB _07061946_Primer4F 5 frag bases selected at 6120 16 O 1 consensus position 16 127 25 7 o o ce op a E rm cca GAAT ATT Vtt 16 226 16 237 EB Chromatograms from GWB A_GWB_2001 05_I T een base 16 127 Base 103 of g A Ta T GTA k 3 wy rags 127 Ga ee S T TA S AT EE EE TG i J haa byl yw dvr aioli A_GWB_2001 05_Prim 9 ger paze VG 127 Zi 103 E T E ATAT ARTA GE f Cat A D AA Cette on D SS bi The Review Mode and the Translated Variance Table The Translated Variance Table also has a Review button in its button bar When you click this button you will see the associated Contig Editor and chromatograms see Figure 13 17 When you click on an individual cell in the table its codon is highlighted in the Contig Editor and the first base is selected You will notice that the equivalent codon in the exemplar sequence is also highlighted Once you have arranged the individual windows to your satisfaction you can save the layout by going to the Window menu and choosing Remember Window Layout Then click on Single Table Variance Review from the submenu SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All
87. WHICH VARY FROM STATE TO STATE NO LIABILITY FOR CONSEQUENTIAL DAMAGES IN NO EVENT SHALL WE BE LIABLE FOR ANY DAMAGES WHATEVER INCLUDING LOSS OF PROFITS ARISING OUT OF THE USE OR INABILITY TO USE THIS SOFTWARE EVEN IF WE HAVE BEEN ADVISED IN ADVANCE OF THE POSSIBILITY OF SUCH DAMAGES BECAUSE SOME STATES DO NOT ALLOW THE EXCLUSION OR LIMITATION OF LIABILITY FOR CONSEQUENTIAL OR INCIDENTAL DAMAGES THIS LIMITATION MAY NOT APPLY TO YOU U S Government Rights The Software and documentation are provided with RESTRICTED RIGHTS Use of duplication or disclosure by the Government is subject to restrictions set forth in subparagraph c 1 ii of the Rights in Technical Data and Computer Software clause at 52 227 7013 The contractor manufacturer is Gene Codes Corporation Ann Arbor Michigan 2011 Gene Codes Corporation Inc All rights reserved Sequencher is a trademark of Gene Codes Corporation Microsoft is a registered trademark of Microsoft Corporation Macintosh is a trademark of Apple Computer Inc ABI Applied Biosystems and 3730 are trademarks of Applied Biosystems Inc ALF is a trademark of Pharmacia Biotech All other trademarks are the properties of their respective owners er E 1 Gene EE EE 1 2 USING THIS MANUAL eessen Vekeseeug ee KENNEeNREeENENKEENANEENES ENEE 3 Conventions used in this mamm si 3osc ssn se sass ceseye eens asda euasocaig sd cashed shin SEENEN tee 3 Installing ScQuenchen ee e 3 Sequencher H
88. able and go to the button bar and click on the Translation Range button The Translation Range dialog window appears You will see that the Translate Entire Sequence radio button is checked This is your default Leave this radio button checked if you decide to leave your table unfiltered SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 102 If you want to restrict the Comparison Range click the Filter Translation by radio button Once you have clicked the Filter Translation by radio button the Feature Key drop down menu becomes active You can now choose which Filter Key to use by making a selection from this menu Once you have selected a Feature Key you will see a list of features annotated with that key in the pane located below the Feature Key drop down menu You can select a single feature a continuous range or a discontinuous range of features You can also select all the features within this pane by clicking on the Select All button Once you have made your selection click the OK button The Translated Variance Table will be rebuilt automatically see Figure 13 10 Figure 13 10 Translated Variance Table restricted by Translation Range Compare Translation to Reference Translation Range Description 58 consensus sequences compared to Reference Reference Comparison Range Translate only CDS Base Positions 169 249 376 570 Amino Acid Positions 1 92 a en en 8 430 ACC
89. ains a listing of the standard feature keys which have been implemented in Sequencher Table 28 1 Feature keys groups and hierarchies Feature Specificity or detail Functional grouping A misc_feature 1 misc_difference a conflict b unsure c old_sequence d variation e modified_base f mutation g allele Difference and change features 2 gene 3 misc_signal a promoter CAAT_signal TATA_ signal 35_signal 10_ signal GC_signal b RBS polyA_ signal enhancer attenuator terminator g rep_origin h oriT 1 2 3 4 5 c d e f Expression signal features 4 misc_RNA a prim_transcript 1 precursor_RNA a MRNA 1 sig_peptide 2 transit_peptide 3 mat_peptide h intron Transcript features SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 200 Feature Specificity or detail Functional grouping i polyA_ site j rRNA k tRNA 1 scCRNA m snRNA n snoRNA C_region D_segment J_segment 5 Immunogobulin related N_region 6 repeat_region repeat_unit Repeat features LTR satellite transposon primer_bind Binding features protein_bind STS primer iDNA Recombination features stem_loop Structure features D loop 7 misc_binding 8 misc_recomb a 9 misc_structure 10 gap 11 operon 12 source F
90. ambiguous matches and large gaps will be treated If you wish to include ambiguous matches in your table click the Include Ambiguous Matches checkbox If you wish to exclude large gaps from your table click the Exclude Large Gaps checkbox When you generate a Variance Table you would expect to have blank cells where the base at that position matches the exemplar If you would like to see all bases across the comparison range whether they match or are different to the exemplar then click the Populate All Cells checkbox see Figure 21 18 Figure 21 18 Variance Table preference settings je Include Ambiguous Matches Jv Exclude Large Gaps Input Output File Import File Import preferences control how Sequencher applies the automatic trim function to imported files Use the checkboxes grouped under Add Imported Sequences To Trim Window to specify which sequences Sequencher should examine for poor quality ends and then trim You can also specify how you want to resolve conflicts between the names of imported files and names stored with particular sequences If you click the Prefer File Names To Data Names checkbox Sequencher uses the file name Clicking the Review Trim Parameters button displays the Ends Trimming dialog box in which you can set trim criteria See Chapter 6 Preparing your Data for Assembly for details SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 182
91. amed or Have Chromatograms from the Select menu You can indicate if you want these names to be case sensitive or not Note The subcommand Contain Items with Names Containing only works on contigs For more detailed information on these commands see Chapter 18 Finding Items Selecting with the mouse First make sure the Project Window is at the front When you click on an icon or hold down the Shift key while clicking on several icons Sequencher highlights the icon s When you have finished selecting a group of icons release the Shift key before you try to do anything with the group You can deselect an icon by clicking on it while holding down the Ctrl key If you do not hold down the Shift Ctrl key Sequencher deselects the last highlighted icon s as soon as you click on another If you click in the white space between icons and names Sequencher deselects all the highlighted icons When you display the list view of a Project Window or a contig you can select a range of fragments by using Shift click You can make a discontinuous selection by using the Ctrl key and clicking on the individual fragments Selecting by typing Type the name of a sequence fragment or contig As you type Sequencher searches the Project Window for an icon with that name If it finds the name it scrolls to the point at which the icon is visible and highlights the icon Note If you pause for more than a second or two while typing a name Seque
92. an a few seconds the program assumes you ve started typing a different name and looks for the new name Another way to locate a sequence is to go to the Select menu and choose Item Named This brings up the dialog box shown in Figure 18 3 Figure 18 3 Item Named dialog box Find What my sequence frd Erann Done Type the name of the sequence you want to find and click Find Sequencher searches through all the sequences displayed in the Project Window for a name that matches the name you entered e When it finds a sequence the sequence is displayed and highlighted in the Project Window If Sequencher does not find a sequence with the exact name you typed it looks for a sequence with a name starting with the first few letters you typed e If Sequencher still does not find a match it looks at all the sequences contained in contigs If it finds a matching sequence within a contig it highlights the contig that contains the matching sequence If Sequencher can find no matches anywhere it stops searching and informs you that the item was not found e If there is more than one sequence with a matching name Sequencher selects and highlights the first one If that is not the one you want click Find Next This continues the search in alphabetical order Note The Find window is movable Selecting all items that The Select menu s All Items That command provides a series of suboptions based on different cri
93. an alter the vector screening parameters As an example if there is ambiguity in the region of the vector junction you can increase the likelihood that Sequencher will recognize the vector sequence by decreasing the approximate match percentage If it is essential to remove even a single base of vector contamination you can decrease the minimum overlap to be considered as vector contamination You can change any of the five independent parameters in order to improve your trim Elevator buttons are used to increase or decrease the parameter value The following table describes the recognition criteria for contaminating sequence Parameter Minimum overlap to consider as vector contamination Approx match percentage to consider as contamination Minimum overlap allowed without exact matches Additional bases to remove from a contaminated 5 end Additional bases to remove from a contaminated 3 Description The smallest amount of overlap to consider as vector contamination The default value is three bases The percentage match Sequencher needs to find before considering the matching sequence to be vector contamination Ambiguities do not count as full matches or mismatches The smallest number of matching bases within which an inexact match is permitted If fewer bases than this match they must match exactly Extra material if any to be trimmed off the 5 end of the remaining DNA sequence along with the vector co
94. ance Table Reports You now have a number of ways to present the data from your Variance Table You can use the table s Reports function to print your data or to export it The options vary depending on the data you have chosen to use in your report Printing a Variance Table Report You can print the data from an entire table selected columns or selected rows The type of report you can generate will depend on this initial selection see Table 13 2 Table 13 2 Table 13 2 Variance Table print options Variance Table Individual Variance Detail Population Report Variance Table Report Report Reports Entire Table Yes Yes Yes Yes Selected Yes Yes Yes Yes Columns Selected Rows Yes n a Yes n a SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 115 To print all the data in a table click on the Reports button The Variance Table Reports dialog box appears see Figure 13 23 A radio button called Entire Table will be checked Below that you will see a drop down menu called Report Format Choose either Variance Table Report Individual Variance Table Reports Variance Detail Report or Population Report Click the Open Report button The Sequencher Report Viewer opens in a new window Figure 13 23 Variance Table Reports drop down menu Variance Table Reports e Entire Table ts E Report Format Population Report nd Variance Table Report Cancel Individ
95. appen Deleting bases You can delete bases in the middle of a sequence by simply highlighting them and then using the Delete key Sequencher asks if you want to fill the gap with bases moved from either the left or the right of the void you have created Click on either Yes Fill Void From Left or No Fill Void From Right button Figure 12 4 Sequencher then moves the rest of the bases accordingly Click on Cancel if you do not want to delete anything You can also perform the deletion by going to the Sequence menu using the Delete Bases command and then choosing either Fill Void From Left or No Fill Void From Right from the submenu SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 89 Figure 12 4 Fill Void dialog box 1 When deleting the bases to the right or to the left of the deletion can fill LD inthe empty space The relative positions of the fragments seem to indicate filling in bases from the left would be appropriate Delete Bases Fill From Right Yes Fill Void From Left TAGAGTGGAGGTG AGGIgTGACAGCCGTAGG No Fill void From Right pd GCCGTAGCATTTT Cancel Delete Bases Fill From Left Iw Continue Editing Hints Deleting bases from the 5 or 3 ends Although you may have automatically trimmed your sequences for poor quality data when you imported them see Chapter 6 Preparing your data for assembly you may still need to delete a few 5 bases to correct any
96. apping all cutsites Hoa Catters BomiI Bsp106 BetXI EcoRI EcoRV KindIl NotI Sall Smal Spel XbaI amp XhoI SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 138 Choose the Text representation Figure 16 4 to list the cut sites in tabular form In this view the number in parentheses following each enzyme shows the number of times that particular enzyme cuts Figure 16 4 Text restriction map Anal 2 48 707 Drall 5 44 45 328 476 704 Sail 431 PstI 2 631 800 SacI 2 781 1029 Bacll 4 123 234 425 496 Xmalll 1 117 Mapping all cutsites Hoa Catters BamHI B p106 BstXI EcoRI EeoR HindIII NotI Sall Smal Spel XbaI amp XhoI Setting the map width The restriction map normally scales to the width of the window When you print a restriction map it scales to the size of the printed page To set the width of the diagram in inches turn off the Scale to window checkbox and use the elevator button to set the width in inches see Figure 16 5 This option is particularly useful when copying a map to paste into a drawing or presentation program Figure 16 5 Map options showing width in inches Restriction Map Options Cutters Show Iw Unique Cutters V 2 Cutters C Fragment Sizes Iw 3 Cutters e Cut Positions Iw 4 amp More ER Style width Single Line Inches 18 Se Multiple Lines Ls C Text l Scale to widow Iw List enzyme names at bott
97. arch field at the top right corner of the Help Viewer When your search is complete double click on an item to display it Figure 2 1 The Sequencher Help dialog window E Sequencher Help GEN GH Hide Back Home Print Options Contents Index Search Project Window Screen 0 Sequencher Applic A E Assembly Paran E Contig Chromato E Contig Editor Ow E Contig Editor Cut E Sequence Editor E Motif Definitions S Project Window no AbN SPF A Gees me n SE Assemble il a hameters Dirty Data O items Name i Modified E Assemble by Ne B Project Window B Project Window E Trim Ends Scree E Trim Vector Scre E User Preference H Sequencher User v o Preface Using This Mant CH The Project Wint 9 Sequencher amp Project views B View as la E View as st Y gt mE 14210 Je be edited S567 Fe T4839 ER Reference VW J08_14839_U09SEt CnER EI 14969 EI 15095 3 P20_1011MARa_0046E15 2 P20_1011MARa_0045E1S 3 e 13808 enee A J02_13806_0045E1 SNP88 3 Reference Context Sensitive Screens Context sensitive screens provide easy links to information about the region of Sequencher you are currently using If you have the Project Window displayed when SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 912 BPs 69 items 930 BPs 763 BPs 674 BPs 550 BPs 666 BPs 758 BPs 831 BPs 679 BPs 609 BPs 663 BPs
98. at are not covered by sequence data indicating that the status of the base at that position is unknown SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 99 Figure 13 6 Translated Variance Table for sequences in the same contig Compare Translation to Reference Se _ oK Description 10 sequences compared to reference sequence NC_001807a 3 end of contig GWB Comparison Range Unfiltered Base Positions 15884 16571 Amino Acid Positions 1 229 St erre 16 127 TAC 82 Sj 16 172 ACC 97 T 16 184 CCC 101 R 16 199 CCC 103 E 16 196 ATG 105 M 16 202 ge 107 16 205 Se 108 16 208 109 a E TT eeng The Translated consensus Variance Table Sequencher allows you to compare the translation of a consensus sequence to the translation of a Reference Sequence Go to the Contig menu and click on Compare Translation to Reference You can also use the Compare Translation to Reference command to compare the translations of consensus sequences from multiple contigs to the translation of their common Reference Sequence These contigs are formed when you use the Assemble By Name with a Reference Sequence command For both of these operations select one or more contigs in the Project Window that share a common Reference Sequence Then from the Contig menu choose Compare Translation to Reference The Translated consensus Variance Table appears in a new window see Figure
99. atch 85 ety l a leng d oaea AE Quality ei 755 BPs Auto Seq Frag n 2004 AE Quality fi 758 BPs g FutoSeq Frag S 2004 lt AE Quality 794 BPs d futoSeq Frag i 2004 AE Quality ni 796 BPs 5 FutoSeq Frag 2004 AE Quality 798 BPs FutoSeq Frag i 2004 AE Quality 791 BPs h futo Seq Frag A 2004 AE Quality si 805 BPs FutoSeq Frag i 2004 lt AE Quality 796 BPs f FutoSeq Frag i 2004 AE Quality 798 BPs i Guto Zeg Frag You can navigate the list by using the up and down arrow keys To select a continuous list of sequences choose the first sequence in the list and then holding down the Shift key select the last sequence in the list To select a discontinuous list of sequences hold down the Control key while clicking on the sequences Project Window Columns When you are in the list view Sequencher displays the attributes such as the item name or kind of each sequence as a series of columns You can control which columns are displayed Go to the View menu and choose the Project Window Columns command Then choose your desired option from the submenu If you are working with ABI sequences you can view the sample name by going to the Project Window Columns command and choosing the Sample command SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 12 The column labeled Quality displays a value for ea
100. ation in the Enzyme Name and Recognition Sequence fields When you click Done Sequencher adds the new enzyme to the list in alphabetical order SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 142 Changing the recognition sequence The recognition sequence field shows a double stranded sequence If you type in the sense strand the anti sense strand will be calculated automatically Sliders above and below the recognition sequence window are used to move the lines which indicate the cleavage positions You can use the mouse to move the sliders left and right Note The maximum length for a recognition sequence is fifteen bases If you exceed this length the right end will be truncated Salt concentration effects You can record buffer sensitivity with the Effect of NaCl Concentration slider controls The default is an unspecified value for all concentrations of NaCl Use the slider controls to indicate the percentage activity for a given NaCl concentration Saving new or modified enzymes If you enter an enzyme that you will want to use later click Save Enzymes to save the enzyme set as a file Click Load Enzymes to retrieve a saved file Setting the sequence selection in a Sequence Editor When you click on the Cut Map button a full page cut map is displayed If you click between two cut sites Sequencher highlights the resulting restriction fragment Hold down the Shift key and click in the re
101. atogram button in the button bar or go to the Window menu and choose Chromatogram You can scroll either vertically the default or horizontally To change orientation click the appropriate button in the left bottom corner of the fragment chromatogram window SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 129 Note Remember that even if a sequence fragment is incorporated into a contig you can still open its editor by double clicking the sequence name in the list at the left of the contig editor Viewing chromatograms from a contig editor The Sequencher contig editor allows you to view simultaneously all chromatogram data relevant to a consensus base call If you are at the 5 end of the contig Sequencher selects the most 5 base in the consensus and displays all the associated traces If you are elsewhere in the contig Sequencher selects the center most consensus base in the contig display and displays the associated traces This allows you to check the signal strength of any peak in conjunction with its base call and edit your data accordingly Select a contig column by clicking on its base in the consensus line Figure 15 1 Then click the Show Chromatogram button or go to the Window menu and choose Chromatogram Sequencher opens a window to display the traces Figure 15 1 The bases you highlighted in the contig editor will also be highlighted in the trace windows Figure 15 1 Selected c
102. ave a growing data set Sequencher compares all the selected new items to the unselected old items It will also compare the new items to each other It has been designed to increase the efficiency of handling large numbers of sequences in efforts such as DNA library clustering and genome assembly Therefore it will not compare any unselected items to each other again Go to the Contig menu and click on Add Selected Items To Others and then on Assemble Contigs Assemble to Reference The Assemble to Reference command allows you to assemble samples to a single Reference Sequence regardless of inconsistencies between the individual fragments Because it is a many to one comparison instead of the normal many to many comparison Assemble to Reference is much faster than the standard Assemble Automatically algorithm To use this command you need to have a Reference Sequence already designated Click on your sequence and then go to the Sequence menu and select Reference Sequence Now select the sequence fragments you want to assemble including the Reference Sequence Click the Assemble to Reference button or choose the Assemble Contigs command from the Contig menu and Assemble to Reference from the submenu SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 63 Interactive assembly The Assemble Interactively algorithm can be used whenever you want complete control over the assembly of a batch of sequences
103. ble in Review mode Select the cell to be edited in the table then type in your desired change The change will be reflected in the Variance Table and the Contig Editor immediately Note You can create features in the Variance Table by going to the Sequence menu and clicking on the Mark Selection as Feature command Figure 13 15 Comparing Sequences in Review Mode D Green Bases to Reference Sequence DIS Bow o o OX ai E E EE EE DEE RER EE E Description 10 sequences compared to reference sequence E NC_001807a 3 end CAGCCACCATGAATATTGTACGGTACCATAAATACTTGAC NC_001807a 3 end of contig GWB 3 A_GWB_2001 05_PrimerIR ICAGCCACCATGAATATTGMACGGTACCATAAATACTTGAC Comparison Range Unfiltered 3 A_GWB_07061946_PrimerIRIIC AGC CACCATGAATATTGCACGGTACCATAAATACTTGAC Base Positions 15884 16571 s 3 A GWB_2001 05_PrimeriF JCAGCCACCATGAATATTGCACGGTACCATAAATACTTGAC 3 A GWB _07061946_PrimeriF CAGCCACCATGAATATTGCACGGTACCATAAATACTTGAC A_GWB_2001 05_Primer3F A_GWB_07061946_Primer3F 2 A_GWB_2001 05_Primer2R 2 A GWB _2001 05_Primer4F 2 A GWB _07061948_Primer4F Hir e highlight base call disagreements CAGCCACCATGAATATTGCACGGTACCATAAATACTTIGAC 4 highlight ambiguities B Chromatograms fom GWB A_GWB_2001 05_Primer1R Fragment base 16 127 Base 103 of 332 ble C A GCC A C TG A A Ca CIGR AC GGT dd T Idd T d ALT FAT AA Td Td JT JT AT al BR ER The Review Mode and Compare Consensus to Reference The C
104. can change these settings later from within a specific editor When you click an icon in the sets of icons above the ruler your selection is explained and confirmed in the panel below the ruler Figure 21 16 For example use the Preferred Bases per Line counter to increase or decrease the preferred number of bases in each line Note that if you request a line length that will not fit on the printed page Sequencher uses the maximum line length that will fit If you want the margin to be automatically adjusted when you increase the size of a window check the Auto Adjust Margin if Window is Resized checkbox SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 180 Figure 21 16 Format ruler preference settings User Preferences General Settings Use the ruler below to choose the default settings for Futo Save rulers that appear in Sequencher s editor windows Confidence Menu Sound New Project Single Stranded DNA Display Break every 10 bases Chromatogram Single spaced Contig Chromatogram UPPERCASE BASES Contig Feature Motif Font Courier New l2pt Format Ruler Start Stop Preferred Bases per Line 40 gt Variance Table Input Output Auto Adjust Margin if Window is Resized File Import Report Laval E aE 0 EE IESE Ta E Foe DI e El E KEE KENE E Start Stop The Start Stop preferences control the locating of open reading frames fro
105. can dismiss the window with the command from the File menu called Close Window Remember to save your work at this point by going to the File menu and using the Save Project command since none of your trim work will have been recorded onto disk Alternatively use the Auto Save function to save your work at user defined intervals see Chapter 21 Customizing Sequencher and User Preferences SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 34 Trimming vector contamination Selecting sequences for Vector Trimming Select one or more fragment s in the Project Window and go to the Sequence menu and choose the Trim Vector command Note that even if you select contigs the Trim Vector function only affects unassembled fragments Specifying a vector using VecBase After you have selected the Trim Vector command and if you have not previously specified vector insertion sites a window will appear Click the Choose Insertion Site Now button if you are ready to work with the vector trim function Otherwise click on OK to dismiss You can also go to the Window menu under Specify Vector Insertion Sites to see this dialog Once you have pressed the Choose Insertion Site Now button the Vector Insertion Sites dialog box appears In the bottom part of the window you will see that you can screen for up to five vectors at once Load vector information from the VecBase database by clicking the Use
106. ce w ox Options Description 9 sequences compared to reference sequence NC_001807a 3 end of contig JK Comparison Range Unfiltered Base Positions 15884 16571 i SES Pe tee Perel et The consensus Variance Table When you want to compare a consensus sequence to a Reference Sequence you use the Contig menu s Compare Consensus to Reference command You can also use the Compare Consensus to Reference command to compare the consensus sequences from multiple contigs that share the same Reference Sequence These contigs are the result of using the Assemble By Name command with a Reference Sequence For each of these operations go the Project Window and select one or more contigs that share a common Reference Sequence Then from the Contig menu choose the Compare Consensus to Reference command The Variance Table appears in a new window see Figure 13 5 Each cell in the resulting Variance Table will display a consensus base when that base differs from the Reference Sequence Pink shading in the header and footer cells of the Variance Table highlight contigs where the consensus sequence does not cover the span of the Reference Sequence The pink X s mark specific cells where coverage is missing SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 98 see Figure 13 5 It is possible for a sequence to have partial coverage a pink header and footer but not have any pink X s in
107. ch Auto Save Use the Idle Time Before Auto Save slider to set how long your computer should be idle before Sequencher executes an Auto Save The latter setting will prevent Sequencher from performing an Auto Save while you are typing SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 172 Figure 21 5 Auto Save preference settings User Preferences S e e Settings P al Jv Auto Save Project Files A Confidence Label amp Name Menu Sound New Project Display Chromatogram Contig Chromatogram Contig Feature Motif Format Ruler Start Stop Variance Table Input Output File Import Report Confidence Iw Ask Before Each Auto Save Time Between Saves minutes Idle Time Before Auto Save minutes SE 2 Many of the new base callers generate confidence values associated with each base Sequencher supports confidence values from PHRED ACE Trace Tuner SCF ABI ESD and compatible files The Confidence preferences control the display of base calls and confidence values from these files Sequencher s default settings show confidences below 20 as dark blue the midrange values as a medium blue and high confidences that is above 40 as a light blue Change the threshold values by altering the numbers in either Confidence Ranges box Figure 21 6 Confidence Ranges settings User Preferences General Settings Futo Save Confidence Label
108. ch sequence that has confidence scores The Quality value is the percentage of bases in a sequence that is above the Low Confidence Range threshold The default setting is 20 You can alter the Low threshold value in the Confidence pane of the General settings See Chapter 21 Customizing Sequencher and User Preferences for more details You can sort your list by item name size quality kind label or modification date Click on the title of the column you want to use for your sort For example to sort by size click the word Size at the top of that column Sequencher will reorder the list by size You can also sort the list by going to the View menu choosing Sort Cleanup and then selecting from the options on the submenu If you sort by Kind the order of precedence is first an unassembled Reference Sequence or any Contigs which contain Reference Sequence fragments then Contigs without Reference Sequences and finally Sequence Fragments at the bottom of the list Figure 3 8 Project Window Columns submenu Project Icons As LW Sort Cleanup D Project Window Columns Sample Expand ll v Size Collapse All v Quality ms EL v Kind Base Ambiguities As v Label Base Edits As es Display Color B v Modified isplay Color Bases Conners Display Features Display Motifs Colors As Backgrounds 4 vy Display Base Confidences Note You can toggle between types of views if you hold down the Alt key and click in the header area just ab
109. choosing a new one from the drop down menu Note The probable number of contigs that can be formed using this handle is indicated in square brackets to the right of the handle Figure 10 5 Assembly Handles drop down menu Origin amp Clone 16 4 Origin 10 Clone 16 g You can also click on the Name Settings button and then click on the radio button next to your preferred handle Enabling Assemble by Name There are two places from which you can enable Assemble by Name From the Project Window click on the AbN button in the button bar If you are in the List View you will notice a new column titled Handle which lists the active handle for each sequence You will also see that the text of the assembly command buttons has changed to Auto Assemble by Name and To Reference by Name The new assembly status is also reflected in the current parameters above the column headers SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved ZS Figure 10 6 The Project Window with Assemble by Name enabled Assemble by Name SPF A OX Assembly Parameters ap Auto Assemble by Name Assemble Interactively To Reference by Name Parameters Assemble by Clone Dirty Data With ReAlignen Min Wa 20 Min Match 85 2 items selected out of 212 po Name Harte Size ing Megtei E 1S_AS_2001 X_Primert F 408 BPs AutoSeq Frag Thu Sep 23 2004 8 19 31 DM a E IS_AS_2001 X_Primer1R 332 BPs AutoSeq Frag Mon
110. cing there is an established standard reference for human identification This calculation is based on the coverage of fragments at a given position and taken from work at the US Armed Forces DNA Identification Laboratory AFDIL in Rockville Maryland e All positions with a coverage of only one fragment are marked as N ambiguous e All IUPAC codes apart from ACG and T are treated as N e If any positions disagree where the coverage is between two and four this position is marked as N If the coverage is between five and seven any position with a disagreement will be marked as ambiguous e If there are two or more disagreeing bases at a position this will be called an N regardless of the coverage Figure 11 9 Consensus to Forensic Standards example GCAATCAA CCTTCAACTA GCAATCAA CCTTCAACTA GCAATCAA CCTTCAACTA GGAAATAGNTCAACTA ANTA 30 40 GCANNNAANNNNTCAACTA Assigning a Reference Sequence To designate a sequence as a Reference Sequence select the sequence icon in the Project Window and check the Reference Sequence item in the Sequence menu From now on that sequence icon will include a small letter R to remind you that it has been marked as a Reference Sequence see Chapter 7 The Reference Sequence for more information SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 84 12 Editing Contigs In this chapter we explain how to find bases such as ambiguities and low conf
111. code You can take any one of the pre existing genetic codes and edit it in the genetic code editor to change the codon translations The genetic code editor has two main parts abbreviations and three letter or one letter codes Go to the Window menu and choose Genetic Code Choose the code you want to edit and then click the table entry you want to change Finally click the abbreviation you want in that position To undo any changes select the appropriate genetic code from the Reset Code pull down menu Abbreviations To edit the abbreviations click the Abbrvs button in the bottom right corner of the dialog box Another dialog box Figure 21 3 then lets you change both the three letter abbreviations for the amino acid names and the corresponding one letter codes The standard abbreviations and one letter codes are available as the default cases SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 170 Figure 21 3 Abbreviations editor Amino Acid Abbreviations 1 Letter Codes For Aspartic Acid 3Letter Asp Set to Defaults Remember Window Layout The Remember Window Layout command lets you define default positions for the Contig Editor and Contig Chromatogram when you are editing in the Bases view First you must organize the windows Then go to the Window menu and choose the command called Remember Window Layout Figure 21 4 Select the submenu option called Contig Chromatogram E
112. ct the project you want to work with Click the Open button in the lower right area of the dialog box or click the Return key Working with items in the Project Window To work with items in the Project Window first select the item s you need by clicking on them Then choose the appropriate menu command SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 9 Icon types In the Project Window a sequence is represented by an autorad icon However there are other data types you may have in your project To get familiar with some of the icons Sequencher uses for these data types see Figure 3 3 Figure 3 3 Examples of Sequencher icons A sequence fragment E RSR Frag 0001 g A sequence fragment with edited E q g comments Frag 0002 A contig Contig 0065 H A refrigerator Storage Refr igerator E A Reference Sequence R ref E A sequence in the inverse complement Frag 0002 orientation Other information may be displayed in addition to the basic icon For example the downward pointing arrow on a sequencing image icon shows that this sequence is stored in its original orientation the sequence travels down the gel The arrow points up when the data is inverse complemented as shown in Figure 3 3 above Project views View as Large Icons To display large icons in your Project Window go to the View menu and then choose Project Icons As Then select Large Icons from the submenu
113. ctangle between other cut sites Your selection is extended from the original site you selected When you toggle back to the Bases view you will see that the sequence representing the chosen region also has been highlighted If you wish to display the sequence and the cut map at the same time click the button showing the staggered enzyme cut icon A window with the restriction map appears You can adjust the size of the panes by dragging the splitter bar that separates the scroll bars for each pane Like the full page cut map this window is linked to the Sequence Editor If you click on the name of an enzyme Sequencher boxes the recognition sites If you click between two cut sites Sequencher highlights the resulting restriction fragment Hold down the Shift key and click in the rectangle between other cut sites Your selection is extended from the original site you selected You can copy the bases between the two cut sites by going to the Edit menu and choosing Copy Selection You can use this command to excise a particular restriction fragment by clicking on it in the cut map and switching back to the Bases view Once you are in the Bases view you can remove the highlighted fragment by going to the Edit menu and using the Cut Selection command Setting the sequence selection in a contig editor When you are working in the Bases view of a contig editor you can display a restriction enzyme map by clicking on the Cut Map button in the editor
114. d ReAligment Complete will appear This window displays a summary of the disagreements gaps and ambiguities before and after the ReAlignment Note ReAlignment is not the same as reassembly If you wish to reassemble your sequences you must first dissolve your contig See Chapter 9 Sequence Assembly for more information Moving sequences If you hold down the Ctrl key the grabber tool is invoked It resembles a small hand You can use the grabber tool to drag an entire sequence to the left or right This method of moving entire sequence fragments also works in the Overview The grabber tool is shown in Figure 12 3 You may notice that some or all of your sequence no longer aligns and that a large number of bullets appear in the consensus line If you decide that the new position is not correct you can use the Undo command to restore it to its previous place Figure 12 3 Grabber tool E Contig 0001 DER G CCATTTTTCTCCACTAATATAT CCATTTTTCTCCACTAAGATAT TCCATTTTTCTCCACTATTATA CCATTTTTCTCCACTAATATAT CCATTT TY TCCACTAATATAT 0 540 550 disagreements c CAT TT T TC T CCACTAATATAT s highlight e ee ese esee oeeee ambiguities Select Next See Base pacebs Note Moving sequences in a low coverage region could leave a hole in the middle of the contig This is not recommended since the hole may cause the contig to align inappropriately with other data Sequencher will warn you if this about to h
115. d sequences and contigs are highlighted after import you could also use sequence Labels to differentiate them See Chapter 21 Customizing Sequencher and User Preferences Files from VecBase VecBase is a file of vector sequences in your Sequencher application folder You can import a vector from VecBase just as if it were any other DNA sequence Go to the File SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 20 menu and choose Import then the subcommand Sequence From VecBase The Vector Selection dialog box appears choose the desired file and click Select If you wish to import more than one file you can use Shift or Command click for continuous or discontinuous listings of sequences Creating a new sequence Another way to bring new data into Sequencher is to create a new sequence fragment To do this go to the Sequence menu and choose Create New Sequence Sequencher displays a dialog box asking you to name the new sequence Figure 4 3 The default name in this case Frag 0001 can be set as a user preference A sequence name can be a maximum of 255 characters including any punctuation and spaces but 31 characters is recommended for export functions When you have typed the name you want or if you accept the default name click OK or press the Enter key A new Sequence Editor appears showing the name you selected in the title bar The new Sequence Editor lets you enter and cha
116. dvaadeeta arden tediavebah eee 88 Moving sequencess 3 EN 89 Deleting bases s 2 sh sh eid each dele ced AE AE dad Aen ele ne ee ed aed eh eee 89 Deleting bases from the 5 or 3 end oo sees esseeeseeeeseeseseeseseeseseeseseeseceeseeseseesesessesesseeesesassesaeseeaseeeaeeeeas 90 Inserting gaps Or bases into a Contig eee eseeeeseeeeseesesesscseescseeseseeseaeseescseescsecseseeseeesessesesassesaeseeaeeeeas 90 Making edits from the consensus line oo eee sees esceeeseeeeseeseseeseseesesecsceeseacseeacsesacsesaeesseeeseeesseeaesees 90 Split After Selectionner fied led ed eich hash tel Rel ceed Mee see Raed les eh dee 91 Auto Match tose ssc ai sd haved iere Hes teases Sleds Heath had Heng ee eh en Rhee ee eed ee lh Benes 91 Create new sequence from a Consensus on ese eseeeeseeeeseeseseeseseeseseeseseeseeeseeseseeaesesseseeseseesesasseeasseeaesesaeeeeas 92 Removing sequences from a CONTIG ese eeeeeseeeeseeseseesesesscseeseseeseeesceacseeacsseacsesacseeseseeseeessessesesaeseeaeeeeas 93 Dissolvin ea COnte eniin ee tee RA AA a A Me e 93 L THE VARIANCE TABLE wwinsostsccccsicieieiastsneedaceisiennetiasnnceddebeasneneucanncetanuceeds 94 dhe Variance Lable Stated y EE 94 W atisa Variance aleet 94 The Variance Table and consensus sequences eects eeseeseseeseeeeseneeseescsecscseescsecacsesseseeseeessesasseeaeseeaeeeeas 94 The Translated Variance Table emgeet 95 The Stuenmte oat the Variance Table geet 95 Structure of the Translat
117. e CES Primed K Handie4 C 5 Handle5 C 6 Handle6 CF Handle c 8 Handle Active Handle 1 Date Cancel SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 69 If your Assembly Handles are separated by a single character included in the Name Delimiters drop down menu choose that character from the list Otherwise follow the directions for an Advanced Expression Figure 10 2 The Name Delimiters drop down menu Assemble by Name Settings Name Delimiters Y Underscore Bar Colon l Dash Dollar Equals Period Plus m E Handles C E Handie6 m 7 Handle7 C amp Handles Active Handle 1 Handle1 Cancel Configuring Assembly Handles using Advanced Expressions You will need to create an Assembly Handle using a regular expression if your Assembly Handle is not separated by one of the characters in the Name Delimiters drop down menu What is a regular expression A regular expression is a way of describing a text pattern using letters numbers and special characters These expressions follow certain syntactical rules Starting in the Project Window click on the Assembly Parameters button At the bottom of the Assembly Parameters window click on the Name Settings button The Assemble by Name Settings window appears Choose Advanced Expression from the Name Delimiters drop down menu You will notice that the Define butt
118. e Calls e to Unknowns F Convert all ambiguities to unknown Remove inserted gaps from sequences The Browse button opens a window called Choose a Folder From this window you can choose where to store the new text file s You may create a new folder in which to store your sequences by clicking on the New Folder button at the bottom left of the window Click on the Choose button to record your chosen location The Format drop down menu allows you to choose a format type for your exported sequence You may set the case of the exported sequence the sequence strand to export and the treatment of ambiguities and gaps To choose these settings click on the Options button select the options you want and click OK see Figure 19 2 Finally click on the Export button to execute the command Figure 19 2 Export Sequences drop down menu Export Sequences Export Location CADocuments and EES Settings Administratorn Desktop _Browse Forma ASCII Plain v Options V UselASCII Plain a Unformatted bases onb Note If you have added features to a sequence and chosen GenBank as your export format these will be exported as a GenBank Feature Table Exporting a Consensus To export a consensus sequence click on the contig icon s in the Project Window Then go to the File menu and choose Export and Consensus from the submenu The SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserv
119. e Detail REpOre E 114 Population Repottgis tase s iere ack adh Meta abes geed eieiei 114 Working with Variance Table Reports eeeeeesscececcceeeeeeeenneeeeeeceeeeeeeeenaeeeeeeeeeeeeeea 115 Printing a Variance Table Report 115 Printing selected data from a Variance Table A 116 Exporting a Variance Table Report cece eeceecseeeeseeseeeseseeseeeseeecseeeesessesecsesecsceeseessseesssesseeeeseeesseeseeges 118 Exporting selected sample SeQUeNCES 0 eee cess eeeeeseeeeseeeeseesesecsesecsesecsesecseacsesscsesseseesesacseeasseeaesetaeeeeas 118 Exporting data for selected vartantz rerien aa a 119 Translated Variance Table Report sissscsciccccedeedssanavatie ctdeea Saauatesccadecivsaaviansctlacddeeneetavaans 120 Translated Variance Table Report 2s cneytieine denny heads cdldkn nd ddedunaeuandien 120 Working with a Translated Variance Table Report 120 Exporting a Translated Variance Table Report 120 Exporting selected sample SeQUueNCES eects eeeseseeeeseeeeseeseseeseseesesecseseeseesesesacseeseseeseaeseeaeseeaeseeaeeeeas 121 Exporting data for selected vartantz seis eeseeseeeeseeeeseeeeseescsesseseescsecseeeseesesecacseeseseeseacseeasseeasseeaeeeeas 122 Removing data from a table before export ee eee eee eseeeeeeseseeseseeseeeseecseesesesacsecseeeseecsesaesesseseeaeeeeas 123 14 THESUMMARYRERORT ever eeesgugesugeegekCgegegNDEEEEENENEEegNESEEEEENEEKE Een 124 Ehesummary Report VIG W is scsiecteseea E a ean ates eee eee eae 1
120. e User Preferences for your Variance Table click the Options button in the Variance Table button bar The User Preferences dialog opens in a new window See Chapter 21 Customizing Sequencher and User Preferences for more detailed information Working with the Variance Table Navigating a Variance Table To navigate through any form of Variance Table use the arrow keys to move across the rows or down the columns When you are working with any Variance Table and only want to focus on differences use an arrow key with the Alt key This will skip any cells that match the comparison sequence If you try to move the cursor beyond the edge of the table you will be alerted by a beep Note If you are using the Compare Consensus to Reference command you will see the consensus base s for each contig you have selected in the same position If you are using the Compare Translation to Reference command you will be viewing the consensus codon for each contig you have selected in the same position Sorting the Variance Table The Variance Table s Sort functions are useful when for instance you need to segregate clone sequences that perfectly match your Reference Sequence or cluster sequences of like alleles see Figure 13 14 The Variance Table simultaneously displays differences and calculates summary information about your selections The Table lists the total number of differences for each sequence at the bottom of that sequence s column
121. e bases 0 Ambigs pi BlueKSm 370 good bases 0 Ambigs BlueKSm SEENEN EES You can print out the Vector Contamination window as a record for your lab notebook by going to the File menu and choosing Print This is particularly useful if you use the Show Bases button to show which sequences are to be trimmed You can also sort the window using various criteria by clicking on Sort Items Executing the trim command To perform the vector screen and trim go to the Sequence menu and choose Trim Vector Once you have reviewed your data and are happy with the criteria you have set proceed with the trim by clicking on the Trim Checked Items button Sequencher warns you that this command cannot be reversed with the Undo command To proceed click the Trim button Any red lines or bases will disappear from the Ends Trimming window Now you can dismiss the window with the command from the File menu called Close Window If you wish to revert some or all of your sequences to their original state you can go to the Sequence menu and click on the Revert to Experimental Data command Remember to save your work at this point by going to the File menu and using the Save Project command since none of your trim work will have been recorded onto disk Alternatively use the Auto Save function to save your work at user defined intervals see Chapter 21 Customizing Sequencher and User Preferences SEQUENCHER User Manual 1991 2011 G
122. e es Wenu weg Sound Je label 4 New Project Display Label 5 Chromatogram Iesel Label 6 Contig Chromatogram Il SOS S S Label 7 or Feature Monit Default Names Format Ruler Start Stop E Contigs Contigr Variance Table JE Other Fragments Frag Input Output File Import The 1st asterisk in each name will be replaced with an item number Report which is incremented each time an item of that type is created in the project Menu The Menu preferences control command options that let you optimize the way you work in your own lab Use the buttons under Use Ctrl O For to program the Ctrl O key combination to open a project or open a window Use the pull down menus grouped under User Defined Command Keys to create keyboard shortcuts for any of the menu commands After you have assigned a keyboard shortcut to a menu item the shortcut will be listed to the right of the menu command on the drop down menus SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 174 Figure 21 8 Menu preference settings User Preferences Settings Futo Save Confidence T Label amp Name User Defined Command Keys Use Ctrl 6 For ond E Sequence gt Rename Sequence v New Project Display Use Ctrl For Chromatogram None Contig Chromatogram Contig Use Ctrl 8 For Feature Motif None Format Ruler Start Stop Use Ctrl 9 For Variance Table None Inpu
123. e the Summary Report This is a static report which has a number of options making it a useful addition to the Variance Table for mutation hunting verifying re sequenced fragments comparisons to a Reference Sequence or just as a report for your lab notebook The Summary Report view View by Summary You can display this report by going to the Contig Editor and clicking on the Summary button in either the Bases View window or the Overview Alternatively you can go to the View menu choose Bases Map Overview and then click on Summary Report You will be presented with a new window which shows the assembled sequences within your chosen contig There are five buttons arranged in a button bar see below Button Description Bases This button returns you to the contig editor Overview This returns you to the contig overview Ruler This button brings up a simple text editor like ruler Options This controls which information will be displayed Find This button brings up a window into which you can type a substring and perform a search see Chapter 18 Finding Items SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 124 Summary report display options Sequencher has a default format for the summary report but you can change it to display different types of information Click the Options button or go to the View menu and choose View Options The Summary Options are divided into those
124. eature table qualifiers Genbank Feature Qualifiers convey many different types of information To accommodate this breadth there are several value formats Table 28 2 Feature Qualifiers Qualifier Type Description Free text Free text qualifiers are usually descriptive phrases enclosed in double quotation marks Controlled vocabulary or Controlled vocabulary or enumerated values enumerated values qualifiers are selections from a controlled vocabulary list and are entered without quotation marks Examples of controlled vocabularies can be found in the Appendices to the GenBank Feature Table documentation http www ncbi nlm nih gov collab FT index html Citation or reference numbers A citation or reference number is enclosed in square SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 201 Qualifier Type Description brackets to distinguish it from other numbers Sequences Since 1998 it has not been acceptable to use a literal sequence of bases as the Sequence Qualifier Feature labels The feature label qualifier supports clarity in reporting by making sure your references are unambiguous An example of a feature label would be alcA see example below The figure below shows a typical example of a feature table entry with qualifiers In this example the feature key is CDS The location is expressed as two numbers The qualifiers tell
125. ecial windows that the operating system or Sequencher displays when it needs to alert you to something you need to know or needs to ask you a question Usually a character such as or which separates elements of a sequence name Click twice in quick succession You can set the amount of time you have before the machine decides that you have performed a second single click rather than a double click Press the mouse button and hold it down while you move the mouse and thus the cursor across the screen A very small scroller that allows you to increase or decrease a numerical value by using the mouse An elevator button has an up arrow and a down arrow In this manual enzyme always refers to a restriction endonuclease Chosen or primary sequence Literal meaning typical example or excellent model A rectangle in which the user can or in some cases must enter names or other information The portion of a sequence name between two consecutive delimiters Also known as Assembly handle A hidden menu that looks like a command until you invoke it When you move the cursor over the name of the hierarchical menu it appears to the right of the parent menu A hierarchical menu is indicated by a right pointing triangle A dialog box that requires a response before you can continue with regular actions in the application In Sequencher this is a short subsequence of 50 bases or less The mouse is integral to the interface It c
126. ed 159 Format drop down menu allows you to choose the format type for your exported sequence Clicking on the Options button displays the window where you turn options on or off In most cases you will want to just click on Export and use the original consensus name The Browse button opens a window called Browse For Folder From this window you can choose where to store the new text file s You may create a new folder in which to store your sequences by clicking on the Make New Folder button at the bottom left of the window Click on the OK button to record your chosen location Uncheck the Use Default Names checkbox if you wish to set a specific export name Exporting Contigs To export a contig click on the item you want to export Go to the File menu and click on Export and Contig from the submenu Sequencher displays the export dialog The browse button allows you to choose a disk and folder to store the new text file s The format button allows you to choose the format type for your exported contig You can choose from MSF Nexus Interleaved Nexus Sequential Aligned FastA and CAF You may also set a number of options Clicking on the Options button displays the window where you turn options on or off Finally clicking on the Export button executes the command The Use Default Names option gives each export the name of its contig Uncheck the Use Default Names checkbox if you wish to set a specific export name The Br
127. ed Variance Table 97 Creatine BV AULA CSM KEE 98 Variance Table for sequences in the same contig oo eeeeeeeeeeeeeeseneeseeseseeeeseescseeseseeseeeseeaesesasseeaseeaeeeeas 98 The consensus Variance Table 05290 98 Translated Variance Table for sequences in the same CONTIG 0 eee eeeeeseeeeseeeeseeeeseeeeseeasseeaeseeaeeeeas 99 The Translated consensus Variance Table oo eeeseeeeseeseseeseseeseecsenecseesesecscsesseseeseeeseeasseeaeeesaseeeas 100 Setting the Variance Table e te 101 Restricting the Comparison Ran eee A R E E A 101 Restricting the Translation Range 102 Setting the Variance Table Display Options scceim io aE E 105 Setting the Consensus Calculation for a Variance Table oo ieee eeeeeseneeseeeeseeeesessesesseeeseeeeseeeeaes 106 Mariance Table Opuons tc E E E oo a le oo le ta alee a ate 107 Working with the Variance Table i ciicccciscececesetscncnsecancctdscdaven de caceceavedseesdecnnececenaaeensecae 107 Navigating a Variance Tablette een esses ected d e 107 Sorting the Variance ablenne geesde eas codes Ae aed nde Ea 107 Removing columns from the Variance Table A 108 REVIEW Melen EE EE 108 The Review Mode and Compare Consensus to Reference 0 eee eceeesceecseeseeeeseeeeseeeeseeecsessesesseseeseeeeas 109 The Review Mode and the Translated Variance Table A 110 The Review Modeand Sister Tables est see ee ENEE Bake 111 Variance Table Reports EEN 112 Variance able Report EE 112 Individiial V atiance Report 20 113 Varianc
128. eeaeeseeaeeaeeaeeaeeaeeaeeaeeaeeaeeaeeaeeaes 71 Choosing a New Assembly Handle 0 ccc cccscesceseeseeseeseeseesesseeseeseeseeseeseeaeeseeseeseeaeeseeaeeaeeaeeaeeaeeaeeaeeaeeaeaeeas 72 Enabling Assemble by Name eidele g eieegtegegetegdk od Sege sinus dvacdvacdie sdeasivandvesdea duasdvacdiesese ctrandvagdnerdeecdveedts 72 Setting Assembly Parametersis tics cencsesdiscdeescvacdvesdie clnezevisanecevacdvarendvevaciie cduasesievesdeycdvasevazaresdna sleasevivenereren vacate 73 Performing an Assembly with Assemble by Name 73 11 THE CONTIG EDITOR sscesscetecnustes viens ietsteuscetctarsiatrstscctscetviatecetscctsentwecisies 75 COMES ee 75 EE 75 k nt OVERVIEW WINKOW asia AAEE ET EE EE E AETA E E A TA A 75 Selection UE 76 Checkins coverage UHT EE 76 Overview TSatUre Saia aaa A AREE E RA E A A aR 77 EE RPO OD RRO A EOE BPO PORTO OP ODEO CARTE DEO PE E 77 ett 77 DOM E 77 MIST DEANA BIE LOI SEARE ic A a ae E ere le urea acne le arg ae Rie et 11 Scale Diagram To Window Size 0 0 eesssecsscsecssceesseeseevsceessccacsesecsnescsavscenesecesssesesnsecsavscacsseeesneesenseeeaseaees 78 Da STAM Key EE 78 Labels CH Fragments ais ist 78 n 78 Names Of Fragments At Weft s ctscaiecossiacctisgedageaediaccleigecesszedaagescdeacds vedeasdeaadeacgeaadeacdosedesedeastecedouedsaedeecheaage 78 Positions Of Era Sm EE 79 Start ee EE 79 vii Bage Nuberg At Transitions ee 80 Base Numbers At Every X Bases tee ees 80 Condensed Base Numbers ca
129. eege EENS EENS 61 Refining the Assembly Conditions 0 ceeceessececceeeeeeeeeenneeeeeeeeeeeeeenaaeeeeeeeeeeeeeenaaaees 62 Minim m Match Percentage aanerer a T AE AA EAA vars 62 Minimum Overlap E 62 Maximum Loop Out Size tege dee deed AAT dE de idee 62 Optimization of gap placemen tiis 0c entetesni ene dean gegr nd deet ere 62 Performing the assembly ebessen artgerecht 63 AUILOMAatl GC ASSCM DL EE 63 Adding selected items to others incremental building of contigs 0 0 elec ee eeeseeeeseeeesetseeeseeetaeeeens 63 Assemble to Reference geseet eege eiiiai K Ra REA EA AAVA EEN 63 ele E E 64 Performing an interactive assembly 000 eee eteeeeeeeeseeseseeseseescseeseeeseaesesseseesesesaesesseseesesessesssseeaeeeeaseeeas 64 Changing parameters for an interactive assembly 0 eee eee eseeeeeeeeseeeeseeeeseeseseeseseeseeeseeessesaeseeseesaeeeeas 67 geegent lade hes 67 T0 ASSEMBLE BY NAME wisicssscsssoswcesscessinssterecutccansantvantbacnscetsseisactbacteuteceaites 68 Setting the Assemble by Name Conditions 200 000 eeeeeeessnccceceeeeeeeeeennaeeeeeeeeeeeeeeenaeees 69 Configuring Assembly Handles using a Single Delimiter 0 eeseesceeeseeeeseeeeseeeesesaesesseeetseneeas 69 Configuring Assembly Handles using Advanced Expressions 0 ccseessesesseeeeeseeeeseeeeseeseseeseseeaesesaeeeeas 70 What isa regular expressio neeaaea Ana ERATA OA AOO 70 Setting your Assembly Handle Names ccecesceseeseeseeseeseeseeseeseeseeseeseeseesees
130. eeseesesesseecseeeseseeseesesesaeeeeseeeeseaeaes 164 epes eegenen 165 PapeiDreak Siac decades faecal tied Eeer 165 21 CUSTOMIZING SEQUENCHER AND USER PREFERENCEG 167 VIEW E 167 AIMDISUOUS DASCS ee 167 Edited Bases gerne ee Ae ee e deeg Age CCRT TITER TCC TESTS 167 Display Color ee Ee 167 Colors a8 Backgrounds iticsi 53 45 st het seh ss eee lessees AE sade Sash at Mech ee dea dE AE EA 168 Display Base Confidences 9 20h heehee eh heeds wh ee eel ee Ad 168 Display H s anoen ei i Pace caeaisatvcca ta T ENO O AE E 168 Display MO EE 168 Setting Up CUSTOM E 168 Standard cOding Suter ee eege 168 Configuring Ambiguity Codes ccccccccsssscecsscesencsceaveceacscscensescsnseseavsecasrcsesssecssvecsevsaeassesserecscasecearteses 168 Replacing an existing character keystroke code 0 eescecseeseseeseeeseeeeseeecseesesecacseeseseeseecseeaeseeaeseeseeeeas 169 Saving and loading custom COdES 0 eee eseeeeseeseseeseseeseseeseseeseseeseacseesesecseseesesecseeaesesassesasseeseeeaeaeaes 169 Sh w and hiderambisuity belger e RA TE ee 169 Choosing a g netic code aaien a AAR E E AA IA A E R 170 Editinp a Penetic E 170 breede Eeer ege ee AE OE A E 170 Remember Window Tatta iestteedieeteediengiegeeg A T E E RAAT E A eege 171 UE nisrnsnnnsoniinnia en AR nea E OA Ata a a 172 General gerni nea a E E E T E O E N A AEN a 172 GEIER roeren e AAR A A AAR R E ees e 172 E e 172 Conhidence td stadt tt vied tlt tlhe tal tute iat tulle ta
131. eference Sequence in your contig and you have selected the Reference radio button you will be able to show translations for the Reference Sequence Click on the Reference check box at the bottom of the Summary Options dialog window Protein translations for Fragments To show protein translations for fragments click on the Fragment check box in the Summary Options dialog box Matching Proteins As Dashes To show Matching Proteins As Dashes click on the check box in the Summary Options dialog box SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 127 Using the formatting ruler You can use the formatting ruler to change the blocking for example to groups of 3s instead of 10s or to show protein translations See Chapter 8 The Sequence Editor for more information on the formatting ruler Click the Ruler button in the button bar or go to the View menu and choose Display Format Ruler You can use the smaller font options in conjunction with the landscape page format to obtain full width displays of your assembled sequences for use in posters and figures Reverse and Complement a contig You can change the orientation of the contig while displaying any of its views by going to the View menu and choosing Reverse amp Comp You can display the contig s original orientation by choosing Reverse amp Comp again SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights r
132. eferences see Chapter 21 Customizing Sequencher and User Preferences Note You can use the Next Low Confidence Base command to help direct your editing Under the Window menu click on User Preferences and set the Low Confidence range appropriately for the value range you wish to find Go to the Select menu and click on Next Low Confidence Base to move your cursor directly to the bases that require attention Finding Edited Bases You can quickly review edits already made to a contig by typing Ctrl E or by going to the Select menu and choosing Next Edited Base Note This function will not find deletions Performing edits The Edit Command To edit a base first set the Edit mode you wish to use then select the base by clicking on it see Figure 12 1 Now type the new character over the old If you accidentally type the wrong character go to the Edit menu and choose Undo or just type over the mistake Sequencher will recalculate the consensus line automatically If you resolve an ambiguity at a particular position the ambiguity marker below the consensus line disappears see Figure 12 1 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 86 Figure 12 1 A base is selected in the contig editor amp Contig 0012 DoR CATGGTCCTGATCCATTGAACTNGATTATC CATGGTCCTGATCCATTGAACTT CTTATC CATGGTCCTGATCCATTGAACTT CTTATC CATGGTCNTGATCCATTGACCTT CTTATC Ci TGGTCCTGATCGATTGAACTT CTTATC iE
133. elp vik lesc ip ccctessy E E ade ee AS 3 Navigatins Sequench r Helge eesti ee ek lal al ehh tate aac Stl Ahh eh A Sell i ee Sade 4 Context Sensililve SCreenS stat enh Ah AAR EARL A NGA Aah AAS RINT RaR 4 THe b tton EE 5 SEHR ee ee 5 SOQUCU CHEE EE 5 Pile mei EE 5 QIU CI fo este dceteeateced dee ee 5 Selectmen she susie leila eee deena etal a tel aleal a levee AA Beha 5 Contig Mens one E eet ace eee E A ea hana talent eae lL ahead aan a y 6 SEQUENCE IMM ee 6 WAC W MCI E 6 Wao E NEE 6 VSN A N EEE E A E EE N E AAEN 6 EECHER eenegen 6 Contextual MENUS orcii dE eegnen EE genee ege eege 6 Key board SHOrtCuts 2a eneen Se caceseuvanceesde cacgesuvedeven AER Ee 7 3 THE PROJECT WINDOW coviisciioncousmcucccvasneneenibonnmcuninesnnninebueenmeunminainnamonbae wns 8 Seg encher EE 8 About the Project Window eege REENEN EEN NEE 8 Creating a E EE 8 Opening an existing Proj eC eieiei EENS 9 Working with items in the Project Window 0 i eeceesceesceseseeseeeseeeseeeseeeesesseseesesecseseeseceesesasseesesesseeeeseeaeaes 9 ICON Ype Seine iopen rererere tite testes NEETER NOG OTELE ETEA EEO EENE E RER 10 Project VIEWS denu a E A E E S 10 VIEW as Large ICOS sirrimi EENS 10 View as Small Icons suoda eiiiai eee EA d e Meek eds een eh Bish ed eas cad ested enna Meh eh See ee 11 Moving E 11 Clen E 11 View a a LISU ei ceerecccevetie e 12 Project Window Columns EE 12 KRITT 13 Viewing the information for a sequence OF Con 14 Editing the inf
134. emble to Reference Parameters Dirty Data With ReAligner 3 gap placement Min Overlap 20 Min Match 85 Modified Comments AE Quality 755 BPs FutoSeq Frag Sep 27 2004 4 10 09 PM E Quality 758 BPs FutoSeq Frag Sep 27 2004 4 10 07 PM AE Quality 794 BPs FutoSeq Frag Sep 27 2004 4 10 06 PM E Quality 796 BPs FutoSeq Frag Sep 27 2004 4 10 09 PM AE Quality 798 BPs to Zeg Frag Sep 27 2004 4 10 05 PM E Quality 791 BPs go zeg Frag Sep 27 2004 4 10 08 PM AE Quality si 805 BPs to Zeg Frag Sep 27 2004 4 10 07 PM aE Quality 796 BPs FutoSeq Frag Sep 27 2004 4 10 06 PM a Quality 798 BPs futoSeq Frag Sep 27 2004 4 10 05 PM All the sequences and contigs added to this particular sequencing project are displayed in this window Note If you have so many open windows on the screen that it s difficult to find the Project Window go to the Window menu and click on Project Window This will bring the current project to the front As you work if you close an existing project and then want to start a new project go to the File menu and choose New Project Sequencher will open a new untitled project Sequencher will remember the last projects you opened You can see these as a list by going to the File menu and selecting the Open Recent command Opening an existing project To open an existing project go to the File menu and choose Open Project to sele
135. encher and User Preferences Figure 6 2 User Preferences for setting trim criteria when files are imported ER Review Trim Parameters Add Imported Sequences to Trim Window All Imported Sequences OF All That Have Less Than 1500 Bases All That Have a Chromatogram F All That Have Unknown Bases How to set the Ends Trimming criteria The amount to be trimmed is based on the criteria you set for both the 5 and 3 ends of the sequence These criteria can be set independently of each other To review or change those criteria go to the Ends Trimming window and click on the Change Trim Criteria button As you can see in Figure 6 3 you can trim off dye primer peaks set windows of acceptable reliability percentage of the highest peak height 3 end and even set an absolute number of bases For example you could specify that fragments must have SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 32 fewer than 3 N s in the first 20 and last 30 bases To activate specific trim criteria you must first select the checkbox that governs the criteria fields Figure 6 3 The Ends Trimming Criteria window Ends Trimming Criteria 5 prime end Trim ABI primer blobs where GHB consecutive bases remain off the scale Iw Trimming no more than 25 trim until the first 25 bases contain less than 3 ambiguities Iw Trimming no more than 25 trim until the first 25 bases conta
136. ene Codes Corporation Inc All rights reserved 39 7 The Reference Sequence In this chapter we explain Reference Sequences sequences or fragments used as baseline or benchmarks for subsequent comparison We will describe the special features of Reference Sequences We will then discuss how to designate a sequence as a Reference for your project how to assemble sequences to the Reference and how to edit contigs containing a Reference Sequence What is a Reference Sequence A Reference Sequence is any sequence that you have selected to act as a model for your subsequent comparisons You may have obtained this sequence from one of the public DNA databases or you may have characterized it yourself Once you have marked your sequence as a Reference Sequencher attaches certain properties to it Why use a Reference Sequence There are many situations where you might want to designate one fragment in a project as a Reference Sequence In forensics human identification depends on comparing differences with a standard reference sequence In medical genetics a lab might want to compare the same gene from family members to the specific variant that is known to be part of a disease process In a population study a lab might compare the same gene from thousands or tens of thousands of individuals to a wild type sequence In all of these circumstances using Sequencher s Assemble to Reference command with a Reference Sequence allows you to pinpoin
137. epresents a subsequence or region of a sequence which has some biological significance Assigning a feature is a method of describing and naming such a region Sequencher supports both personal features and GenBank standardized features The Feature Editor enables you to name and highlight sequence regions of special interest Note Features will not appear unless you go the View menu and select Display Features SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 147 Creating and editing features To create or edit features in a sequence click on the sequence icon Go to the Sequence menu and choose Edit Features to display the Feature Editor Figure 17 5 Figure 17 5 Feature Editor dialog box Features from rCRS Feature Key misc_feature Feature Name ui Hil Iw Display Feature in Editors Feature Location 73 Gan Which strand Complement Style Preview Add Remove To alter default styles go to the Feature Motif panel in user preferences Done To create a new feature click the Add button in the dialog box From the Feature Key menu choose Sequencher if you are creating personal annotations or choose one of the GenBank keys If you have chosen Sequencher as your Feature Key type a name in the Feature Name field Enter the starting and ending base numbers for the feature in the Feature Location boxes The default strand is 5 gt
138. er Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 125 Bullets Pluses and Dashes Clicking the Bullets Pluses boxes highlights disagreements and ambiguities in a line below the consensus If you also click on amp Dashes you are marking positions where the bases in individual fragments match so that the disagreements are more visible Note You can display Bullets Pluses without the amp Dashes but you cannot display amp Dashes without the Bullets Pluses However see Matching Bases As Dashes below Consensus Sequence You can choose to hide the consensus sequence by clicking on the Consensus Sequence box Figure 14 2 shows a summary line without the consensus line but with Bullets Pluses and amp Dashes all checked Figure 14 2 Summary report with no consensus line with bullets pluses and dashes Ha HLA HLAGGG16 301 GACCTGGGGA CCCTGCGCGG CTACTACAAC CAGAGCGAGG CCGGTTCTCA 4EE HLA HLAGGG1G 361 GACCTGGGGA CCCTGCGCGG CTACTACAAC CAGAGCGAGG CCGGTTCTCA D i Reference 361 RNCCTGGGGA CCCTGCGCGG CTACTACNAC CAGAGCGAGG MNCGGTTCTCA DS HLA HL ppo 361 GACCTGGGGA CCCTGCGCGG CTACTACAAC CAGAGCGAGG CCGGTTCTCA DS HLA HLAG G Z 361 AACCTGGGGA CCCTGCGCGG CTACTACAAC CAGAGCGAGG ACGGTTCTCA DS HLA HLAG G 4 361 AACCTGGGGA CCCTGCGCGG CTACTACAAC CAGAGCGAGG ACGGTTCTCA DS HLA HLAGGG1S 361 GACCTGGGGA CCCTGCGCGG CTACTACAAC CAGAGCGAGG CCGGTTCTCA 4E2 HLA HLAGBGG1 301 AACCTGGGGA CCCTGCGCGG CTACTACAAC CAGAGCGAGG ACGGTTCTCA
139. er case letters You can save the motifs in a file on your hard disk by clicking on the Save Motifs button Enter a suitable name for your motif file in the Save dialog window Figure 17 2 Display style pull down menu Motifs Load Motis Find Exact Ambiguous Reverse Matches Matches Allowed v None Blue Cyan Green Magenta Red Yellow v Invert Case Underline KKK ceeneeveevevere To view motifs in editors select the Display Motifs option in the Mem menu Quick motif entry Select the bases in a Sequence Editor then go to the Edit menu and click on Enter Selection As Motif If you choose this command the default style will be used see Chapter 21 Customizing Sequencher and User Preferences Displaying motifs To use the motifs you have entered go the View menu and choose Display Motifs Figure 17 3 shows how motifs are displayed in the contig overview they are the small downward facing colored blocks on the arrows representing the sequences The colors are those you set in the Motif Default Style menu see Chapter 21 Customizing Sequencher and User Preferences Figure 17 3 Overview showing motifs SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 146 C Hole in contig gt E Single fragment Bumps on ES Multiple fragments same direction fragments Ei Both strands show motifs B i Both strands plus hollow Start c
140. es and then browse to and select the file you want to import Click the Open button in the lower right area of the dialog box or press the Return key You may need to select a file format from the drop down menu called Files of Type in order to see the files of interest in the dialog window To see all the files in a particular location choose All Files You can also select from a Folder of Sequences ACE Project CAF Project GCG contig Sequencher project or Sequence from VecBase Working with specific formats is discussed below If Sequencher cannot import the file you will get a warning message to that effect Importing GenBank features When you import a sequence in GenBank format you will automatically import the features including the location the feature keys and feature qualifiers The feature key used in sequences published in GenBank EMBL and DDBJ describes the biological nature of the annotated feature or indicates information about changes to the sequence For more information on features and feature keys see Chapter 17 Motifs and Features and Appendix 28 Feature Keys and Qualifiers Where feature locations described in the GenBank file are ambiguous Sequencher will warn you that it cannot import these features and will list the features in a dialog box you can print out for your records Importing lists of sequences Sequencher s functionality has been modified to take advantage
141. es and subprojects Exporting data from Sequencher Sequencher has a rich set of export formats You can export sequence s contig s or consensus sequence s subsets of your data the Variance Table the Summary Report and even protein translations Sequencher offers you a number of options depending on the type of data you choose to export For each data type you can select a choice of format and the location for the export For some data types there are additional options Sequencher may offer a default for the export Unless you deselect the option to use default names Sequencher saves the file s in the default name The Export dialog box will default to the last location and choice of format used All of the export commands are accessible from the File menu Exporting Sequences To export a sequence or a number of sequences click on the item s you want to export Go to the File menu and click on Export and choose Sequence s from the submenu Sequencher provides a context sensitive dialog box for changing the export options See Figure 19 1 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 158 Figure 19 1 Export Options dialog box Export Options Bases should be exported as Upper Case C Lower Case Strand to export e Sequenced Reverse Complement Current Orientation en exporting extended ambiguities to formats other i than IG convert them to Bas
142. es from a contig When you have aligned several sequences from a contig and displayed the chromatograms several traces appear in a single window one above the other across the width of the window SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 177 Use the Columns For Multiple Traces buttons to tile the data into columns so you can look at more traces at once For example if you select 3 you Il see three columns of traces If your screen and window size allow you to see three rows of traces at once you will be able to view a total of up to nine traces without having to scroll the window Click the Always Scroll To Selected Column checkbox to have Sequencher automatically update the chromatogram window whenever you select a new column of data in the contig window You can have Sequencher remember your window position Sequencher can also compress SCF chromatogram displays Figure 21 13 Contig chromatogram preference settings User Preferences General Setti Barca nae EEan Futo Save OI Confidence Label amp Name Columns for Multiple Traces Men o KS 2 E we Sound New Project Display Jv Always Scroll to Selected Column Chromatogram Compress SCF Chromatogram Display Contig Chromatogram Note SCF data cannot be compressed in Narrow Width mode Contig Feature Motif Format Ruler Start Stop Variance Table Input Output File Import Report
143. eseeaesesaeseesseeeseeeeseeeeats 143 Setting the sequence selection in a Sequence Editor oo eee eeeseeseseeseeeseeeesesecseeeesesseseeaeeesseeeeseeeeaes 143 Setting the sequence selection in a contig editor ee eee eeeseeeeseeeeseeseseeseseeseecseeseseeseseesesetseeeeseeeeaes 143 Copyime the EE 144 17 MOTIFS AND FEATURES iinan praana neidin oonan eneidiaa anaana eiui aaia 145 tere E 145 Entering LCE 145 QUICK REENEN EE edel dE dE 146 Displaying e 146 Ia ET 147 Wh atiS a Teater EIERE AE aes deen nen 147 Creating and editing EE 148 Editing an existing Teature goanen ona an EAE AERO ia 149 Oial EU Te GE 149 Showing and hiding features tee eseeeecesceecssensscecsnescnsocavsceassccscsssessnsscsavscenssseacsssessneeceavseeeseseasensess 150 Listing a sequences E ences wae ee eas eens eee sree eed 150 18 FINDING ITEMS wssociscccstcctsnetndaincaicdneiiattenenctbedatiienndasedalodieaninasnneodatanns 151 Finding the Project Wand ow EE 151 Finding open windows with menu commands 0 eee esceeseeseseeseeeseeeesceeeseesesesacseeseseeseeeseeaeseeseseeaeeeeas 151 Finding open windows in the Project View 0 eseeeeseeeeseeseeeseseeseeeeseeecsecacsesacseeseeeseseeseeasseeaeseeaeeeeas 151 Finding the ctirrent selectron EE 152 Searching in Open Windows ee sed cacisectdvia Veedel 152 eet DEN UE 152 Refining a subsequence Searchh sicce 2cecacs ccceesassaciease cacevessatauunsheacectaveauiaadhasaaceavercvardueas 153 een LEE 153 Matching ambiguous
144. eseeeseeassesssseeaseeeaeeeeas 41 Working with a Reference EIERE Eed eier 42 How to mark a sequence as a Reference oo eeeesceesceseseeseseeseseeseseeseeeseeeesecscseeacsesseseeseseeseseesessssesaeeesaeeeeas 42 Assemble to Reference eniinn aina a aa ERRER REER EARRA eevee oe 42 To Reference Dy NaMe 21 42 Trim to EH 43 Contig Editing with a Reference Sequence oo eeseeseseeseseeseeeseecseeeesesseseesesesseecseeeseseesesasseeaseeaeeeeas 43 Fill Coverage Gaps with Reference ci ege Ee 44 Reference S quence Translation ee 44 8 THE SEQUENCE EDITOR E 45 Opening an existing sequente for editing seseris ieren ir e e RE E E E RE 45 Locked Edit rS seen a EE 45 Basic SGquence edi EE 46 Setting the base nUMD CHiN tee nis vase haves ee ee teeta atin 47 Set Circular Genome EE 47 e Mag Dt RU iat aise neces tis E seated E teat ete nett San et ees Sot at iti ET E Le 47 Duplicating a UE 48 Reverse and Complement ce EE 48 Abo texperime ntal data tele 48 Creating a EE EE 48 Res tting a baseline nenio EAT OAOE OTEO E A E E i 49 Viewing experimental data yz 21 22 testes secs ie aa ARARA RARER R A OARA RARR RR RARE RRE a 49 Reverting to experimental data scscnccccocicciinicirineniciii niee aiiai ia a dee 49 Viewing experimental data for Chromatograms s ssesessssssessestsrststesststststststststnenrsetstststntsrsrsrenrseeresentsent 49 Viewing summary information in the Sequence Editor ssesssessssesrsesessesestsrststststsrsrrreststs
145. eserved 128 15 Chromatograms In this chapter you will learn how to use the chromatograms from an automated sequencer to help you when editing your contigs We discuss how Sequencher displays traces and secondary peaks to find heterozygotes and how you can edit bases from traces or revert to experimental data Working with automated sequencer data When you import a trace file from an automated sequencer Sequencher imports the chromatograms and the confidence values if these are available Note If you import a text file containing just ASCH characters you will not have imported traces To import traces you must import the trace file Text files from an automated sequencer will be only a couple of kilobytes in size whereas trace files will be well over 50 kilobytes When you copy a Sequencher project file from one machine to another you don t need to copy the trace files separately because Sequencher stores the trace information along with everything else in the project file Sequencher compresses the imported traces to conserve disk space so don t be alarmed if a project file containing several sequences from an automated sequencer is smaller than any one of the trace files Viewing chromatograms from a Sequence Editor Sequencher s chromatogram display is tightly integrated with its other editors You can view the entire original trace for a sequence fragment by opening the Sequence Editor for that sequence Click the Show Chrom
146. f demonstrating the Software You may not copy any written materials accompanying the Software Other restrictions You may not rent or lease the Software but you may transfer it and the accompanying written materials on a permanent basis provided you retain no copies and the recipient agrees to the terms of this agreement You may not reverse engineer decompile or disassemble the Software Limited Warranty We warrant that the Software will perform substantially in accordance with the accompanying written materials for a period of 90 days from the date of receipt Any implied warranties on the Software are limited to 90 days Some states do not allow limitations on duration of an implied warranty so this limitation may not apply to you Remedies Our entire liability and your exclusive remedy shall be at our option either a return of the price paid or b repair or replacement of the Software that does not meet our limited warranty and that is returned to us Our limited warranty is void if failure of the Software has resulted from abuse or misapplication We warrant the replacement Software under this limited warranty for the remainder of the original warranty period or 30 days whichever is longer NO OTHER WARRANTIES WE DISCLAIM ALL OTHER EXPRESS OR IMPLIED WARRANTIES INCLUDING THE IMPLIED WARRANTIES OF THE MERCHANTIBILITY AND FITNESS FOR A PARTICULAR PURPOSE OUR LIMITED WARRANTY GIVES YOU SPECIFIC LEGAL RIGHTS YOU MAY HAVE OTHER RIGHTS
147. f the consensus line of a contig For example if a T in the Reference Sequence overlaps a G in another fragment the consensus line will show a Q SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 41 Figure 7 2 Base numbering showing decimal numbers E decimal numbering OX TTTATAAACTTGATGTBAGTACCAATAGAT AAT G TTTATAAACTTGATGTBAGTACCAATAGATAATG TTTATAAACTTGATGTBAGTACCAATAGATAATG TTTATAAACTTGATGTBAGTACCAATAGATAATG TTTATAAACTTGATGT AGTACCAATAGATAATG 5 frag bases selected at consensus position 419 1 Select Next Ambiguous Base spacebar Working with a Reference Sequence How to mark a sequence as a Reference To designate a sequence as a Reference Sequence select the sequence icon in the Project Window and under the Sequence menu select the Reference Sequence command From now on that sequence icon will include a small letter R to remind you that it has been marked as a Reference Assemble to Reference Assemble to Reference is a powerful command which allows you to assemble all the samples you select to a single Reference Sequence regardless of any inconsistencies between the individual fragments Because it is a many to one comparison instead of the normal many to many comparison Assemble to Reference is much faster than the standard Assemble Automatically mode To Assemble to Reference select the sequence fragments you want to assemble Click the Asse
148. file stored in your Sequencher folder Selecting a voice file First make sure your computer sound system is enabled and that the volume is not set on mute Then go to the Sequence menu and choose Speech Then go to the submenu Select Voice File You will not get any audio feedback until the Select Voice file is loaded Identifying a speaker You ll probably be familiar with the voices Gene Codes has persuaded to provide audio for you To hear whose voice is being used go to the Sequence menu and choose Speech and then the Identify Speaker subcommand You can only select this command if a sound file was selected previously Audible keystrokes Any meaningful keystroke you type into a Sequence Editor will be spoken aloud if you go to the Speech command in the Sequence menu and select the Say Sequence Keystrokes submenu That is if you press the A key a voice will say A but if you press the shift key you will not hear any voice Read sequence selection To check the sequence you have entered into the Sequence Editor first select the bases you want the computer to read to you Then go to the Sequence menu click on Speech and then the Read Sequence Selection submenu Note You must place the cursor somewhere in the small window that appears above the sequence as in Figure 8 15 To pause the reading just move the cursor out of the window Figure 8 15 Speak selection cursor and instruction menu p53 genomic Move spea
149. g Gel A T7 dog A 2 Gel A T7 dog Gel A T7 dog d 2 Gel A T17 dog Gel A T dog If your sequence name is complex you may need to write a regular expression that describes the delimiters and the Assembly Handles In the example below the text between the parentheses defines the Assembly Handles Anything not enclosed by parentheses will be treated as a delimiter Assembly Handle 1 Delimiter Assembly Handle 2 Delimiter Assembly Handle 3 You would use this kind of expression with the Expression is a delimiter box unchecked Regular expressions are powerful tools There are many resources on the internet which describe how to use regular expressions Here is one to help you get started http www macadamian com column regular_expressions html SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 192 25 Appendix IUPAC IUB Standard Code O e Q o Base s qa QA QA gt U AorG CorT or U Gor T or U S aK Roud oo gt A or Loi GorC Aor T or U D C G T or U G A T or U A C T or U A C orG zZz lt D A C G T or U Meaning Adenine Cytosine Guanine Thymine Uracil RNA Purine Pyrimidine Keto Amino Strong Weak not A not C not G not T Any SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 193 26 Appendi
150. g data for selected variants Each row in the table represents a variant If you need to export selected rows make your selection by using Shift click for a continuous range or Ctrl click for a discontinuous range Then click on the Reports button The Variance Table Reports dialog box appears The Selected Rows radio button is checked There are four buttons Cancel Open Report Copy as Text and Save Save as Text is the default setting The chosen rows are saved as one table The options in the Reports Format drop down menu are limited to Variance Table Report and Variance Detail Report Click the Save as Text button You will see a dialog window prompting you to assign the name and location of your export You can type a name for the table in this window Click the Save button to save your table as a tabbed text file Although the Selected Rows radio button is checked you can change your selection to include all rows by clicking the Entire Table radio button You can copy your Variance Table in to another document or application that accepts tabbed text To paste the contents of your Variance Table Report in the document of your SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 119 choice click on the Copy as Text button The data can now be pasted into its new location Translated Variance Table Report Translated Variance Table Report The Translated Variance Table Report consists of
151. g experimental data After you enter a new sequence Sequencher archives a copy that is locked not editable In the course of assembling your data you may make a great number of edits to any particular sequence This locked version enables you always to refer back to the sequence as it was originally entered To see the Experimental Data version of the sequence click the Show Experimental button in the Sequence Editor button bar Reverting to experimental data If you edit a sequence incorrectly you can delete all the changes to that sequence and go back to the archived baseline To do so open the Sequence Editor Then choose Revert To Experimental Data from the Sequence menu If you edit a contig incorrectly you can delete all the changes to an individual sequence and go back to the archived baseline To do so open the contig editor Select the sequence by clicking on its icon which can be found on the left hand side of the contig editor You can now remove the sequence from the contig by using the Remove Selected Sequences from the Contig menu Then choose Revert To Experimental Data from the Sequence menu You can then reselect the fragments and rebuild the contig Viewing experimental data for Chromatograms If your data came from an automated sequencer click the button labeled Show Chromatogram to view the experimental data of a sequence This will display the original trace data Sequencher allows you to scroll either vertical
152. g the same function over and over by simply pressing the space bar Figure 15 5 Base change generated by secondary peak Select Next Commands The Select menu provides you with several commands for navigating to bases which need further review or attention see Table 15 1 As you move your selection from each position that requires scrutiny to the next you are likely to need to repeat these commands frequently Therefore Sequencher creates a shortcut for that command in the space bar after the first time you invoke a Next command Press the space bar to move from one selected base to the next candidate To change any Next command to search in the reverse direction combine the command with the Shift key If you use the Next Met To Stop gt 0bp you can highlight the next pair of start and stop codons in one of the three forward reading frames The number of bases shown in this command depends on whether you specified a preference for a minimum length in the User Preferences SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 133 Table 15 1 Summary of Select commands Select command Contig Next Ambiguous Base Ambiguous bases Disagreements First base in large gap run Next Contig Disagree Disagreements Next Edited Base Edited bases Next Low Confidence Base Any consensus base that is derived from a low confidence base Next Met To Stop gt 0bp Any pair of Start and Stop codon
153. ge of sequences and choose the first sequence by clicking on its column header Then holding down the Shift key select the last sequence in the same fashion To select a discontinuous list of sequences hold down the Control key while clicking on the sequences Go to the Edit menu and choose the Remove From Table command Note This does not remove sequences from the underlying contig If you want to print selected samples from your table you can choose a continuous or a discontinuous range of sequences by clicking in the column header as described previously Click on the Reports button The Variance Table Reports dialog box appears see Figure 13 26 A radio button called Selected Columns will be checked Below that you will see a drop down menu called Report Format Choose either Variance Table Report Individual Variance Table Reports Variance Detail Report or Population Report Click the Open Report button The Sequencher Report Viewer opens in a new window Figure 13 26 Variance Table Reports dialog window Variance Table Reports Entire Table Report Format Population Report Cancel Open Report Use the Print Preview button from the button bar at the top of the Report Viewer to preview your report Click on the Print button to send the report to a printer If you want to print selected variants from your table you can choose a continuous or a discontinuous range of base positions by clicking the equivalen
154. gory name to expand the list and show the various preference topics within that category When a category list is expanded the triangle changes to point downwards When you click a preference topic name a Preference Pane appears on the right side of the window and enables you to specify your preferences for that topic Note Each preference setting you select will be stored when you quit the program General Settings In the File menu you will find a command called Open Recent The submenu for this command is a list of projects You can specify the number of recent projects Sequencher will remember in this submenu The maximum number is 99 Sequencher is installed with a set of default values As you work with the program it replaces the default values with your preferences However if you want to restore the original settings you will find a button at the bottom of the Settings user preferences called Reset user preferences to factory defaults This option resets all of the user preferences to their original status Auto Save Auto Save preferences control the timing of automatic saving of project files To use this preference you must have saved your data and given your project a name at least once To activate Auto Save click the Auto Save Project Files checkbox Click the Ask Before Each Auto Save checkbox if you want to confirm the save each time Use the Time Between Saves slider to set how long Sequencher should wait between ea
155. gram as you would any other information SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 162 Figure 19 3 Copy As Protein Translation options dialog box GE Selected Bases Translated Ser 1st Frame Entire Sequence C 2nd Frame Entire Sequence C 3rd Frame Entire Sequence Alternatively you can also go to the File menu Click on the Export command and choose Selected Bases as Protein from the submenu You will be presented with a window similar to that above see Figure 19 3 You will then be prompted with a Save As window Export Overviews If you are looking at the schematic view of a contig you can generate a text version map of the fragments as shown in Figure 19 4 for viewing in other programs or word processors There are two ways to do this 1 Go to the File menu and choose Export then Overview as Text from the submenu Sequencher will treat the map as any other exported file 2 Alternatively you can copy the overview as a picture and paste it into other programs Select the overview window by clicking on it then go to the Edit window and choose Copy As and Picture from the submenu Note When viewing the output you must specify a fixed width font to preserve the scale of the arrows in relation to each other and the fragment names Figure 19 4 ASCII map of sequence Overview of contig Contig 00801 from project ADARC SPF ADARC 4
156. guous Base spacebar Reference Sequence Translation This command is only available when the Reference Sequence is assembled into a contig To see a translation of the Reference Sequence in the notation line below the consensus sequence go to the View menu and choose Reference Sequence Translation The translation has a pale gray border above and below the amino acids to distinguish it from the consensus sequence translation Note As you cycle through the Consensus translation button modes you will notice one mode which translates the consensus sequence in the same frame as the Reference Sequence In this mode the button icon is marked with an r SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 44 S The Sequence Editor In this chapter we explain basic sequence editing setting base numbers and how to reverse and complement your data We then go onto discuss working with experimental data and how Sequencher enables you to find ambiguities work with codon and restriction maps and annotate features You will also learn how to customize your sequence view and use voice verification to help you check base calls Opening an existing sequence for editing To open an existing sequence select a sequence icon and then go to the File menu and choose Open Window You can also just double click the icon to open its editor window Locked Editors If an editor displays experimental data or a seq
157. her and User Preferences for details The IUPAC IUB codes also are included in the appendix to this manual The Sequence Editor works like any other text editor The insertion point can be moved using either the mouse or the arrow keys Note If you hold down the Alt key and use an arrow key the insertion point moves three bases If you hold down both the Ctrl and Alt keys the insertion point moves nine bases If you hold down the Ctrl key and the up arrow the insertion point moves to the beginning of the sequence if you hold down Ctrl key and the down arrow the insertion point moves to the end of the sequence The editor also performs typical operations including Cut Copy Paste and Undo Use the mouse to highlight a selection hold down the Shift key and click elsewhere in the sequence with the mouse to extend the selection There are also two commands for extending a current selection to the beginning or end of a sequence To Left End and To Right End both located under the Select menu in the Extend Selection submenu Once you have made a selection you can copy it by going to the Edit menu and using the Copy Selection command You can also cut the selection by using the Cut Selection command also located in the Edit menu The Paste function will remove non sequence characters You can define a sequence character by going to the Window menu and then using the Ambiguity Editor in the Ambiguity Key Codes option For instructio
158. hin Sequencher it has special properties A special folder in which to store subsets of sequences which need to remain in your project Standard Chromatogram Format A format used to store analyzed data or fragment data for a single sample A chromatogram peak whose height may be less than that of the primary peak and whose presence may indicate that a heterozygote exists at that position A small piece of DNA decoded into its component bases in order represented as text Sometimes in reverse order depending upon the vagaries of the sequencing procedure The window used for editing DNA sequences This editor looks much like a word processor and supports the standard text editing commands Cut Copy Paste and Undo An on screen control that simulates the action of a slide control that might normally be used as a volume or dimmer control on a household appliance The region at the top of a window on the computer A cloning vector sequence database prepared by Friedhelm Pfeiffer in collaboration with William Gilbert Also a file with this name in VecBase CODATA format Part of the Sequencher distribution 208
159. his Fragment 16569 Base Pairs Note Circular numbering can only be used in conjunction with a Reference Sequence You must designate your sequence as a Reference Sequence before you can enable circular numbering See Chapter 7 The Reference Sequence for more information Splitting a sequence You can split a sequence fragment into two pieces for instance at a known exon boundary Put the cursor where you want to split the sequence Go to the Sequence menu and choose Split After Selection Sequencher will ask you if you want to split the sequence after the last currently selected base Click on the Split button to proceed Sequencher will split the sequence into two pieces and label the new fragments for you If SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 47 your sequence was called MyPUC it will be divided into two sequences called MyPUC 5 end and MyPUC 3 end Duplicating a sequence You can duplicate a sequence fragment To do this highlight the icon then go to the Edit menu and choose Duplicate Seq Fragment Sequencher will use the name of the original fragment as a basis for the new fragment s name If the fragment you want to duplicate has a chromatogram associated with it Sequencher will ask whether you also want to duplicate the chromatogram or duplicate the sequence without chromatograms Click on the appropriate button to proceed If you wish to
160. ia review which bases will be trimmed and perform the trim command SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 31 Performing a trim with default settings You can specify how much dirty data should be trimmed off as a function of the number of ambiguous base calls per run of bases by altering the Ends Trimming criteria Each pane in the End Trim window shows how much sequence should be trimmed off There will be a checkbox at an end if it requires trimming Bases to be trimmed will be marked in red You can proceed with the trim by clicking on the Trim Checked Items button Once you have trimmed your data dismiss the window with the command from the File menu called Close Window Figure 6 1 The Ends Trimming button bar Trim Checked Items Change Trim Criteria Automatically selecting sequences for ends trimming You can set the User Preferences so that sequence files are automatically added into the trimming window as they are imported During Import Sequencher loads the sequences that require trimming directly into the trim window To determine which sequences are directly imported into the trim window go to the Window menu and under User Preferences click on Input Output and then select the submenu File Import Figure 6 2 shows the criteria you use to specify which sequences are candidates for direct import to the Ends Trimming window For more information see Chapter 21 Customizing Sequ
161. idence bases We discuss how to perform your edits move and delete bases and sequences insert gaps or bases and create a new sequence from a consensus You will learn how to compare sequences to highlight differences and how to create customized reports Finding bases which need attention You are likely to see a number of issues in your data Some will need to be examined and perhaps edited These issues might be indicated as an N as a disagreement as a gap or as having a low confidence rating You can use one of the Next commands from the Select menu in the consensus sequence of a contig to find any of these problem areas Once you have used any of the Next commands Sequencher will activate the space bar to execute that operation If you use either the menu command or the shortcut key combination two consecutive times Sequencher will remind you that this easier alternative is available Viewing an individual sequence from the contig To look at the data for one of the sequences in your contig double click the sample name in the sequence list Sequencher opens a locked sequence viewer containing the data for that sequence You can also select a sequence in the contig editor and then go to the File menu and choose Open Window Finding disagreeing ambiguities You can find ambiguities quickly by typing Ctrl N or by clicking in the consensus line and going to the Select menu and choosing Next Ambiguous Base This will find data that is i
162. ig menu and choosing Create New Seq From Consensus In addition to creating a new sequence you can use this feature to track the changes made to your contig over time Sequencher displays a dialog box that tells you that you are about to create a snapshot consensus of the selected contig This dialog allows you to control some basic features of the new sequence Figure 12 6 shows the options dialog box for this command Figure 12 6 Create New Sequence From Consensus options dialog box Create New Sequence From Consensus You are about to create a snapshot of the selected contigs consensus sequences Each new fragment will inherit the name of its parent contig followed by an optional time stamp Iw Remove Gaps Retain Large Gaps Iw Include Features Iw Append Time Stamp to name Cancel Check the box to indicate how you want the new sequence to appear If the default setting Remove Gaps is checked then Sequencher will remove all gaps large and small when it creates the new sequence If you want to keep large gaps independently from other gaps then select the checkbox called Retain Large Gaps In Sequencher a large gap consists of 10 or more consecutive bases Such a gap might appear if for example you have used the Large Gap algorithm to assemble genomic and cDNA The Retain Large Gaps option is not available if Remove Gaps is checked Note To see the new sequence go to the Project Window Remember this is a snapshot of
163. ights reserved 131 If you select the Contig Chromatogram section you can specify how many columns of traces you want to view at once whether Sequencher should scroll to the selected column when you make a new selection in the contig and where the window should appear Note If you hold down the Ctrl key while changing the scale the new scale will apply to all the traces in that window If you hold down the Ctrl key while clicking any of the buttons to hide a lane the lane s will be hidden for all the traces in the window See Figure 15 2 above Display secondary peak You can identify heterozygotes by identifying the second highest peak beneath the primary peak If you have a particular candidate for a heterozygote first open the chromatogram for the individual sequence then click on the base call s as shown in Figure 15 3 Figure 15 3 Trace with base selected 20 1DM m P10X2AF 102 Fragment base 48 pere 48 or Ve A TAT T GI AT T Go to the Sequence menu and choose Call Secondary Peaks A dialog box lets you specify how the secondary peak should be called Figure 15 4 Figure 15 4 Secondary peak parameters Secondary Peak Search Minimum lower peak height as percentage of upper Iw Allow Ns to be replaced Allow edited bases to be replaced Iw Only make changes that result in an ambiguity e Iw Search Selection Only Cancel The slider lets you specify how significant the second highest peak must be to ge
164. iginal color The bases from the selected site s are automatically entered into the Vector dialog box shown in Figure 6 8 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 36 Figure 6 8 Vector Dialog box displaying vector and insertion site S Vector Insertion Sites OX Minimum overlap to consider as vector contamination Approx match percentage to consider as contamination Minimum overlap allowed without exact matches Additional bases to remove from a contaminated 5 end gornogrnorigrip Additional bases to remove from a contaminated 3 end Use an asterisk to indicate insertion sites to be screened BlueKim D BamHI T amp Name Lei Polylinker GTTGT AASACGACGGCCAGT GAATT GT AAT ACGACTCACT AT AGGGCGAATTGGAGC CCACCGCGGT GGCGGCCGCTCT AGAACT AGT G GATCCCCCGGGCTGCAGGAATTC Use Vecbase File GATATCAAGCTTATCGATACCGTCGACCTCGAGGGGGGGCCCGGTAC J H Q PstI D Name A Polylinker SCTITACACTTT Af GCTTCCGGCTCGTATGTTGTGTGGAATT GT GAGCGGAT AAC AA i TTCACACAGGAAACAGCTATGACCATGATTACGAATTCCCGGGGATCCGTCGACCT Use Vecbase File GCA GCCAAGCTTGGCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTG GCGTTACCCAACTTAAT putlic Q EcoRI D Name Le j Polylinker CGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGAC GTT GT ABAACGACGGCCAGT GCC AAT ET GCAGGTCGACTCTAGAGGATCCC Use Vecbase File CGGGCGAGCTCG AATTCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTA CCGCTCACAATTCCACACAACATAC D Name
165. iguous pieces OK Remove Items Dissolving a contig To dissolve a contig first select the contig s icon from the project Go to the Contig menu and choose Dissolve Contig You can also perform this command while you are in the contig editor Sequences from the dissolved contig are shown highlighted in the project It is important to remember that all edits made to a sequence while in the contig will be retained If you want to restore the original sequences you must go to the Sequence menu and click on Revert to Experimental Data SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 93 13 The Variance Table In this chapter we explain how to use the Variance Table an interactive display of the differences among your sequences or contigs We go on to discuss how you find differences and work with your trace and confidence data The Variance Table is useful for SNP analysis mutation detection and clone checking The Variance Table Strategy What is a Variance Table In its simplest form a Variance Table compares and displays the differences between two sequences The data in the Variance Table are dynamically linked to the data in the underlying contigs You can compare some or all of the sequences in a contig to the Consensus sequence the Reference Sequence or the Top Sequence of that contig see Figure 13 1 below This generates a Variance Table which will display the differences
166. igure 8 10 Figure 8 10 Restriction Map button selected with map and sequence shown E p53 genomic DER EIS Sequenced Strand Sequence Length 20303 Residue 1 pal BamHI 4 Bsp106 3 pa 20 Drall 22 EcoRI EcoRV CI Hindili 6 TTCCCATCAA GCCCTAGGGC TCCTCGTGGC TGCTGGGAGT TGTAGTCTGA ACGCTTCTAT CTTGGCGAGA AGCGCCTACG CTCCCCCTAC CGAGTCCCGC GGTAATTCTT AAAGCACCTG CACCGCCCCC CCGCCGCCTG CAGAGGGCGC AGCAGGTCTT GCACCTCTTC TGCATCTCAT TCTCCAGGCT TCAGACCTGT CTCCCTCATT CAAAAAATAT TTATTATCGA GCTCTTACTT GCTACCCAGC ACTGATATAG GCACTCAGGA ATACAACAAT GAATAAGATA GTAGAAAAAT TCTATATCCT CATAAGGCTT ACGTTTCCAT GTACTGAAAG CAATGAACAA ATAAATCTTA TCAGAGTGAT AAGGGTTGTG AAGGAGATTA AATAAGATGG TGTGATATAA AGTATCTGGG AGAAAACGTT AGGGTGTGAT ATTACGGAAA GCCTTCCTAA AAAATGACAT TITAACTGAT GAGAAGAAAG GATCCAGCTG AGAGCAAACG CAAAAGCTIT CTTCCTTCCA CCCTTCATAT TTGACACAAT GCAGGATTCC TCCAAAATGA TITCCACCAA TTCTGCCCTC ACAGCTCTGG CTTGCAGAAT TITCCACCCC AAAATGTTAG TATCTACGGC ACCAGGTCGG CGAGAATCCT GACTCTGCAC CCTCCTCCCC AACTCCATIT CCTITGCTTC CTCCGGCAGG CGGATTACTT GCCCTTACTT GTCATGGCGA CTGTCCAGCT TTGTGCCAGG AGCCTCGCAG GGGTTGATGG GATTGGGGTT TTCCCCTCCC Click the button a window with a multi line restriction map appears You can adjust the size of the panes by dragging the splitter bar that separates the scroll bars for each pane see Figure 8 10 You can also invoke this command by going to the View menu and choosing Display C
167. ii a a A RRRA AERAR AAAA E dii 80 Getting M re TOT AU OM sae e E E E AE E 80 ale NEE 80 Diet nt Aerer lear veces verde ccvacdvusdvacibecevacaviniva eeh A A AE 80 The Bases NICW osscsinanaissatutaest ane sua a E A EE E 80 Workin inthe Bases ME ee Eege 80 VieWINS SEQUENCE NAMeS aii ee ieee E OA ale ere ics eel alate AT 82 Before you e dE LEE 82 Setting the base UMD Sri S sonenn deka decvatends decease eelere AO EO ATERN 82 Consensus Galulatop ege ete a eh AAA A Ee ee 82 CONSENSUS INCIUSIVELY irch Zeie zelt rdbieh det gehk dech ged die fede andudedhecdedzduacdh det gchh de ged dief Zefgeddeh zelt reves 83 Consensus By Pl rality E 83 Consensus to Forensic Standarde 84 Assigning a Reference Sequence cricicccccidisiririiiniiiiiieci tinianu iini iaei 84 12 EDITING ei el AE 85 Finding bases which Melek EE 85 Viewing an individual sequence from the conpg eee ese eseeeeseeeeseeeeseeseseeaesecseseeseeeseeessesseseeaesesacseeas 85 Finding disagreeing Ambiguities 0 eee ee eeeeeseeeeseesesessesesseseesesecseeeesesseseesesesscseesesessesessesaeseeaseeaeeeeas 85 att GE 86 Findins Beet E 86 Finding Edited Bases eerste EENS 86 Pertormin a EE 86 The Edit Commian E 86 Lise 87 Wie Wine EE 87 Collect Gaps ue tea Roches Ete De ROA AERO ERROR RRM 87 Mowing Bases and Gaps sccsstudecbacisdacvapdedcta durch ecduaechesdaus E aust ek dechtetdek et deele EEOAE TRA 88 Using ReAligner to clean tp COmt gs syisstecdesidhestioreepdeavsbendenebetheebedievenesi
168. in the consensus line and enter the character that represents the edit you want to make in that column If you currently have a base selected in a sequence fragment go to the Select menu and choose Contig Column Ctrl K SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 90 Any bases in the sequences above the contig column you select that do not match the character you type will be changed to match the character you type in the consensus line Your edits will appear according to the preferences you set in the View menu Generally this will be magenta and bold text You can also delete bases from the consensus line You cannot use the Undo command to reverse this action if your contig contains a Reference Sequence Figure 12 5 The base is changed E Contig 0001 DER GTGCTTGTCT TATCTCCTA GTGCTTGTCT TATCTCCTA GTGCTTGTCT TATCTCCTA GTGCTTGTCT TATCTCCTA GTGCTTGTCT TATCTCCTA H frag bases selected d Ig TGCTTGTCTMATATCTCCTA consensus position 569 HO Select Next Ambiguous Base spacebar Note Remember that changing bases from the consensus line edits only the sequences after which the consensus is recalculated and displayed Your action does not change the consensus directly Changes in the consensus result from matching sequences with each other and evaluating them according to the type of consensus calculation you have chosen e g by plurality inclusi
169. in less than 13 bases with confidences below 25 Aways trim at least jo bases from the 5 end 3 prime end Trim chromatogram files before the first EY consecutive peaks below 25 of the highest peak Iw Starting 100 bases after 5 trim trim the first 25 bases containing more than f3 ambiguities V Trim from the 3 end until the last 25 bases contain less than 3 ambiguities V Trim from the 3 end until the last 25 bases contain less than 3 bases with confidences below 25 Post fix htaximum desired length after trimming is jo bases trim more from the 3 end if necessary Iw Remove leading and trailing ambiguous bases cancel Using Confidence to Trim Ends In addition to trimming data based on the number of ambiguities at the 5 and 3 ends you can trim sequence data based on the confidence scores assigned by your base caller Confidence scores also called quality values are numbers associated with each base call and which define the likelihood that a base call is incorrect The most common scale is from 1 60 where 60 represents a 1 10 chance of a wrong call and 20 represents a 1 10 chance Depending on the program used the confidence score may be based on peak height the presence of more than one peak and or the spacing between the peaks See Figure 6 3 the Ends Trimming Criteria window for trimming by confidence criteria Reviewing the Trim Each pane in the trimming windows shows schematically how
170. ished assembling your sequences Figure 10 7 The Assembly Completed window Assembly Completed Time Elapsed 00 00 03 Assembled by Origin amp Clone ltems Selected 212 Assembled Completely 9 Comparisons Performed 3 778 Assembled Incompletely 6 This dialog window gives a summary of the assembly process If this information is sufficient click the Close button This will dismiss the Assembly Completed window If you require more detailed information click on the Details button You will see a report that contains further information about the assembly You can save a text copy of this report by clicking on the Save As button Dismiss the dialog window by clicking the Close button Note The name of each contig formed using Assemble by Name will be based on the Assembly Handle used SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 74 II The Contig Editor In this chapter you will learn about the overview and bases view of the contig editor You will learn how to use these options to explore your contig We will discuss the steps to take before you start editing base numbering setting a consensus calculation and assigning a Reference Sequence Contig editing DNA sequencing is an error prone process and so the base calls from an automated DNA sequencer may not correctly represent the sample being sequenced This means that when working with automated DNA sequencing data it is usuall
171. iters and Assembly Handles you are nearly ready to perform your assembly In addition to the standard Auto Assemble by Name command you can use the Assemble by Name function to assemble your sequences with a Reference Sequence or to Mindlessly Join your sequences Under the Contig menu the following Assemble Contigs menu items will change if you have enabled Assemble by Name The Mindlessly Join command becomes Join by Name Similarly the Assemble to Reference button will now be labeled To Reference by Name Before you use the To Reference by Name command you will need to define a Reference Sequence You will also need to choose and set your assembly algorithm and any associated parameters For more details see Chapter 9 Sequence assembly SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 13 Starting from the Project Window go to the Select menu and click on the Select All command to select all the sequences in your project Click on the Auto Assemble by Name button or the To Reference by Name button if you have a Reference Sequence The Assembly Preview window will appear This window contains the names of the contigs that may be generated using your chosen handle and the probable number of fragments in each contig Click on the Assemble button to continue with the assembly or click on the Cancel button Sequencher displays the Assembly Completed dialog window when the assembly has fin
172. ker cursor out of this rectangle to pause DER Click the mouse button or use Command to cancel D Low Confidence 0 0 D Ambiguous 0 0 Selection Length 1 TICCCATCAA GCCCTAGGGC TCCTCGTGGC TGCTGGGAGT TGTAGTCTGA ACGCTICTAT eTrecgcaca AGCGCCTACG CTCCCCCTAC CGAGTCCCGC GGTAATTICTT AAAGCACCTG CACCGCCCCC CCGCCGCCTG CAGAGGGCGC AGCAGGTCTT GCACCTCTTC TGCATCTCAT TCTCCAGGCT TCAGACCTGT CTCCCTCATT CAAAAAATAT TTATTATCGA GCTCTTACTT GCTACCCAGC ACTGATATAG GCACTCAGGA ATACAACAAT GAATAAGATA GTAGAAAAAT TCTATATCCT CATAAGGCTT ACGTTICCAT GTACTGAAAG CAATGAACAA ATAAATCTTA TCAGAGTGAT AAGGGTTGTG AAGGAGATTA AATAAGATGG TGTGATATAA AGTATCTGGG AGAAAACGTT AGGGTGTGAT ATTACGGAAA GCCTTICCTAA AAAATGACAT TTTAACTGAT GAGAA rr SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 56 To speed up or slow down the rate at which Sequencher s audio reads your data go to the Window menu and choose User Preferences Under the General section you will find an item called Sound Click on this to display a slider bar which controls the speed of read back in bases per second To end the read back just click your cursor anywhere on the Sequence Editor window SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 57 H Sequence assembly In this chapter you will learn how to set assembly conditions by using Sequencher s rich variety of assembly algorithms and parameters You will lear
173. key you want to assign to that base code To change a color setting for a base code select the base code from the list on the left of the window Select a different color from the Color drop down menu Saving and loading custom codes You can load and save your customized ambiguity settings by going to the button bar at the top of the Ambiguity Editor and clicking on Load Ambiguities and then Save Ambiguities Show and hide ambiguity helper Sequencher can show a small quick reference window that lists the current ambiguity symbols To bring up this window go to the Window menu and click on Show Ambiguity Helper Close the window with the close box in the upper right corner SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 169 Choosing a genetic code You will find that Sequencher already has the major genetic codes built in You can choose from among these predefined coding systems by going to the Window menu and choosing Genetic Code Select the code you want from Reset Code To pull down menu The genetic code dialog box is shown in Figure 21 2 Use the buttons at the bottom of the box to choose three letter or one letter abbreviations If you chose one letter abbreviations the letter will be aligned with the first left base of the codon Figure 21 2 Genetic code editor Genetic Code oo OOS DEOC oksoe e ZLetters ResetCodeTo w 1 Letter Left Editing a genetic
174. l Output Use Ctrl 0 For File Import Report None oe Eis Use Cmd 0 For C Open Project Open Window Sound Sequencher provides audio feedback to help you maintain the quality of data input When you select the Sound icon you will see the preference pane shown in Figure 21 9 The slider controls how quickly Sequencher reads bases to you Click the Play Audible Welcome at Startup checkbox if you want Sequencher to say hello at startup Figure 21 9 Sound preference settings User Preferences General Settings El Futo Save lt D Confidence Label amp Name Reading Speed bases sec Menu New Project S Display Chromatogram Jv Play Audible Welcome at Startup Contig Chromatogram Contig Feature Motif Format Ruler Start Stop Variance Table Input Output File Import Report Note You have to select a voice file to get audio output For more details see Chapter 8 The Sequence Editor and review the section on Voice Verification New Project This User Preference determines what type of project will be opened by the New Project command SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 175 Click the Use Blank Project radio button if you want your New Project to be blank Click the Use Project Template radio button if you want the project settings to be controlled by a templa
175. le O Compare Translation to Refe TOX Description 10 sequences compared to reference sequence NC_001807a 3 end of contig GWB Comparison Range Unfiltered Base Positions 15984 16571 Amino Acid Positions 1 229 ee Re P 16 127 TAC 82 Y 16 172 ACC 97 HE 16 184 CCC 101 P 16 190 CCC 103 P 16 196 ATG 105 M 16 202 ACA 107 T 16 205 AGC 108 fz Note When Sequencher encounters a gap it skips to the next available base to include it as a component of the codon If an N is in the first or second position of the codon Sequencher displays a question mark instead of an amino acid If the N is in the third position of the codon a residue may be displayed This will depend on the genetic code redundancy SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 97 Creating a Variance Table Variance Table for sequences in the same contig When you are ready to begin creating your Variance Table either select individual sequences from within a contig or go to the Project Window and select a contig by clicking on its icon This will select all of the contig s sequences Then go to the Sequence Menu and choose Compare Bases To and either Consensus Reference Sequence or Top Sequence from the submenu The Variance Table appears in a new window Figure 13 4 Figure 13 4 Variance Table Created with Compare Bases To Reference Sequence S Compare Bases to Reference Sequen
176. le sequence using the lasso tool The selection can be a single base or gap or a number of bases or gaps Begin by making your selection by dragging the cursor across the bases or gaps you wish to move The region will be highlighted Then hold down the Alt key and your cursor will turn into a little lasso tool Hold down the Alt key and click on the highlighted selection you want to move Then drag it to its new location The selection will change to Bold Magenta text in its new position Note If you have used the Large Gap algorithm to assemble sequences you can double click anywhere within the gap region to select it Figure 12 2 The lasso tool around a base to be moved E Contig 0001 DER z CCATTTTTCTCCACTAATATAT CCATTTTTCTCCACTAAGATAT TCCATTTTTCTCCAQTATTATA CCATTTTTCTCCACTAATATAT CCATTTTTCTCCACTAATATAT 0 T540 550 CCATTTTTCTCCACTAATATAT eee cooo eeses Select Next Mwana Base pacebs Using ReAligner to clean up contigs The ReAligner button on the Contig Editor button bar lets you apply the ReAligner algorithm to optimize gap placement in your sequences without first dissolving your contig This is especially useful when you are left with gaps and unaligned bases after removing one or more sequences from a contig SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 88 Click the ReAligner button Once the ReAlignment has finished a new dialog window calle
177. le using Assemble Interactively click the Assembly Parameters button The Assembly Parameters dialog box will appear When you have changed the settings to your satisfaction click OK to return to the Assemble Interactively window Mindlessly Join Mindlessly Join allows you to put sequences together without using the conventional sets of algorithms Examples of situations where you might wish to do this include making a plasmid or constructing an artificial CDNA sequence Select the sequences you want to join by dragging a box around them or Shift clicking them Note They will be listed in the new contig in the order in which you selected them Go to the Contig menu and choose Assemble Contigs and then select Mindlessly Join from the submenu If Assemble by Name is enabled the Mindlessly Join submenu item becomes Join by Name Join by Name uses handles to group the sequences appropriately Sequencher will then ask if you want all the sequences aligned at the left of the contig All Left at the right of the contig All Right or joined End to End Click the appropriate button The selected items will be assembled and replaced by a contig icon in the Project Window Figure 9 9 Join by Name dialog box Join To Form Contig Cancel e All Left C End To End C All Right SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 67 10 Assemble by Name Many labs apply strict sample naming convention
178. licking on the Select All button You can widen your Comparison Range by typing a number into the Flanking Bases box For example by typing the number ten in this box the Comparison Range will include 10 flanking bases on either side of the chosen Feature Key SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 101 Once you have made your selection click the OK button The Variance Table will be rebuilt automatically Figure 13 8 Comparison Range dialog window Comparison Range C Unfiltered ep Feature Key misc_structure x jo Flanking Bases Fred 131 150 Ginger 291 240 mier structure 391 400 Select All Cancel Restricting the Translation Range Reference Sequences are often marked up with GenBank style features See Chapter 17 Motifs and Features for further information You can reduce the scope of your Translated Variance Table to a single feature Some features such as mRNA or CDS may be composed of several elements These elements are concatenated and treated as a single feature Translation of the DNA sequence starts in the first frame with the first base in the feature range Figure 13 9 illustrates an unfiltered Comparison Range a joined feature blue and a single feature pink Figure 13 9 Restricting the Comparison Range using Features Unfiltered Comparison Range LC TI Joined feature Single feature View your Translated Variance T
179. lifiers GenBank Feature tables GenBank Feature Tables provide a way of describing and locating features in a DNA sequence using internationally agreed upon layout and vocabulary Although these tables look complex all tables are comprised of three basic elements which are described in the table below Element Description Feature Key A single word or abbreviation which indicates a functional grouping Location The instruction for where to find the feature in the sequence Qualifier Additional information about the feature Feature tables may look simple or complex depending on the number of annotations in the table The following figure shows a typical entry Figure 28 1 A simple feature table entry Key Location Qualifiers source 1 1509 organism Mus musculus strain CD 1 In the example above the feature key is source and the location is represented by a range 1 1509 A location can be a single base a range of bases a single base within a range etc The qualifier which can be free text controlled vocabulary citation or SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 199 references numbers sequences or user defined feature labels is separated from the location by a forward slash For a more detailed discussion of table entries and their standard typography see http www ncbi nlm nih gov collab FT index html The table below cont
180. ls If you select a sequence and then click on the Trash Can button that sequence will be removed from the project Templates A template is a special type of project that can be set with your choice of parameters and preferences Templates can contain ordinary sequences or a Reference Sequence Preferences can include Feature settings enzyme sets and display settings With templates you can also set Assembly Parameters Assemble by Name handles and Trim parameters These will be discussed later in this manual A template can be used as a new project or imported into an existing project You can use templates to set up standard operating procedures and methods to reuse or distribute throughout your lab SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 15 Save Project As Template You can save any project as a template by using the Save Project As Template command Set the User Preferences and parameters that you want to use You can include a Reference Sequence or ordinary sequences Once you are satisfied with your choices go to the File menu and click on the Save Project As Template command A new dialog window called Template Name will appear Type a name for your template in the text box and then click on the OK button Your project will now be saved in a Templates folder The name of the template will appear in the New Project From Template submenu New Project From Template The New
181. ltiple items Note Unlike other Export commands which may create a number of files on your hard disk this command will create only one new project Export Selected Bases You may want to export selected bases rather than an entire sequence To do so drag the mouse to highlight your selected bases or go to the Select menu and use the Bases by Number command Then go to the File menu and choose Export and click on Selected Bases from the submenu You will then see the Save As window which allows you to choose your format options and preferred location Export Options If you click the Options button in an export dialog box Sequencher provides a context sensitive dialog box for changing the export options Select the options you want and click Export Export Formats When you export a file you may want to specify a file format so other software can access it Click the Format dropdown menu in the export dialog box The menu will display a list of the available formats The Sequencher Export menu is context sensitive which means that it varies based on whether your selection contains a sequence a contig or both For each kind of export you will be prompted to apply appropriate formats See Table below for more information SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 161 Table 19 1 Examples of Sequencher s Export formats Export format type Description Sequence amp ASCII plain U
182. ly the default or horizontally To change the scroll orientation click the appropriate button in the left bottom corner of the fragment chromatogram window See Chapter 15 Chromatograms for more information SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 49 Viewing summary information in the Sequence Editor You can bring up summary information on your data from the Sequence Editor Go to the File menu and choose Get Info The Get Info window in Figure 8 5 provides summary information on the sequence you are currently editing Figure 8 5 Sequence information window Get Info Quality eight E Quality eight Kind AutoSeq Frag ABI Size 755 BPs Mere In project window Original Macintosh HD Users naomithomson Documents Created Won Sep 27 2004 4 10 PM Modified Mon Sep 27 2004 4 10 PM Version Modifiable Comments Type your comments here Base Count 160 As 179 Ts 189 Cs 186 Gs 41 Ambigs D Edits 0 Gaps Confidence 117 Low 58 Med 580 High Show ABI Info If you want to annotate your data with more information than the comment box allows you to see go to the Edit menu and click on Edit Comments Sequencher will open a small text editing window that stores about 250 characters of text with each sequence After you have added text with Edit Comments a small T for Info will appear at the upper left corner of the icon
183. m an editor Use the buttons grouped under On Codon Maps Highlight to indicate whether you want None which gives you the plain map of start and stop codons Use Any Start to Stop gt or Any Unstopped Run gt to give you a shaded box showing open frames of a length you specify Figure 21 17 shows selections in the Start Stop preference pane that will make Sequencher highlight ORFs beginning with a MET and longer than 60 bases Figure 21 17 Start stop codon preference settings User Preferences General Settings futo Save Confidence Label amp Name On Codon Maps Highlight Menu Soe None New Project C Any Starto Stop gt 0 Bases Display C Any Unstopped Run gt fo Bases Length not including start and stop codons Chromatogram Contig Chromatogram Contig Feature Motif Format Ruler Select Next MET to Stop Start Stop The Next MET to Stop command should only Yariance Table make selections that are longer than D Input Output File Import Report Under Select Next MET To Stop you can define the minimum length of ORF selections in a Sequence Editor If you specify a minimum length here for example 30 bases the Next MET To Stop command under the Select menu in the Sequence Editor will change to read Next MET To Stop gt 30b SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 181 Variance Table The Variance Table preferences control how
184. may want to move the splitter item D in the key to the contig window Before you start to edit Setting the base numbering You have control over the base numbering of the consensus For instance you can select any base in the consensus and make it the origin base base 1 All the bases before that position will have negative numbers To set the numbering select a base then go to the Sequence menu and choose Set Base Number with an appropriate suboption Note If you have already used a Reference Sequence in your assembly this will set the base numbering Consensus calculation Under the Contig menu there are three choices for calculating the consensus of your sequences Consensus Inclusively Consensus By Plurality and Consensus to Forensic Standards The consensus is displayed in the bottom line of the contig editor SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 82 and is continuously updated as you edit Thus the consensus will alter after any change you make whether that is altering bases or adding new fragments to your contig Bullets under the consensus highlight discrepancies and pluses indicate ambiguities If you choose a consensus calculation for a project and then save and close the project the consensus choice will be set as the default calculation for that project Consensus Inclusively With Consensus Inclusively Sequencher determines the base in the consensus line
185. mble to Reference button or go to the Contig menu and choose the Assemble Contigs command then click on Assemble to Reference You will get a warning dialog box if you have not included a properly designated Reference Sequence among your selected sequences Note that you use the Assembly Parameters to change the conditions under which the assembly will take place See Chapter 9 Sequence Assembly for more information To Reference by Name To Reference by Name is a special case where the behavior of the Reference Sequence is modified to work with the Assemble by Name command The To Reference by Name button will only be displayed if Assemble by Name has been enabled For more information on Assemble by Name see Chapter 9 Sequence Assembly SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 42 When the To Reference by Name command is used each contig created will contain a Reference Sequence and will also retain the numbering of the Reference Sequence To use To Reference by Name select the sequence fragments and Reference Sequence you want to assemble Click the To Reference by Name button or go to the Contig menu and choose the Assemble Contigs command and then click onto Reference by Name from the submenu Trim to Reference The Trim to Reference Sequence command is used to remove bases in a contig that flank the Reference Sequence on both the 5 and 3 ends see Figure 7 3 T
186. mistakes made during the amplification process Select the bases you want to delete If you are at the 5 left end then click the Forward Delete key Sequencher will delete those bases without moving the entire sequence to the left Similarly if you choose bases on the 3 right end use the Backspace key and Sequencher will automatically delete those bases without realigning your sequence Note When editing from the consensus line you do not have to worry about the bases shifting to the left and right Inserting gaps or bases into a contig If you want to insert gaps go to the Sequence menu and click on Insert Gaps amp Move Bases then choose Right or Left from the submenu You can also create gaps by selecting a number of bases equivalent to the gap size you want to create Press the Tab key if you want the selected bases to move to the right To move to the left hold down the Alt key when you press Tab The space created will be filled with gap marks small colon marks that will appear without disrupting the rest of your contig Once your gaps are in place you can either type the bases you want into the gap using Overstrike When Editing mode or use the lasso and grabber tools to move existing bases into the new position Making edits from the consensus line You can simultaneously edit all the bases in the column above the position you choose in the consensus line instead of individually changing each one To do this select a base
187. mmand allows you to set the number of bases in your DNA circle Select a sequence that has already been defined as a Reference Sequence Then go to the Sequence menu and choose the Set Circular Genome Size command The dialog shown in Figure 22 3 below will appear Enable the circular number by clicking in the checkbox If you have enabled the Cambridge Reference Sequence as your Reference Sequence the number of base pairs in your fragment will automatically be set to 16 569 Dismiss the dialog window by clicking the OK button Note Circular numbering can only be used in conjunction with a Reference Sequence You must designate your sequence as a Reference Sequence before you can enable circular numbering SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 186 Figure 22 3 The Circular Genome Size dialog window Circular Genome Je Enable For This Fragment 16569 Base Pairs How to mark a sequence as a Reference Sequence To designate a sequence as a Reference Sequence select the sequence icon in the Project Window and under the Sequence menu select the Reference Sequence command From now on that sequence icon will include a small letter R to remind you that it has been marked as a Reference For more information on Reference Sequence properties and how to work with a Reference Sequence see Chapter 7 The Reference Sequence for more information New Project From Template
188. much sequence should be trimmed off Blue lines represent the acceptable sequence data The blue line is then flanked by red scissors which mark both the 5 and 3 suggested trims based on the current Trim criteria The remaining red regions of the sequence define the portion of the sequence to be trimmed Click the Show Bases button to see more information on the bases After you have clicked Show Bases the button toggles to Show Overview Sequencher also provides additional control within the Trim Ends window If you want Sequencher to ignore the SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 33 trim command for one or the other ends of the fragment s without modifying the overall specifications just click off the checkbox associated with that trim position Figure 6 4 shows a portion of the base detail for these sequences You can double click on the sequence icon and invoke a Sequence Editor to examine the data in more detail Figure 6 4 Bases view in Ends Trim window Ends Trimming DER Zeiss 9 items in list 9 items have at least 1 check d Quality eight Has Chromatogram 585 good bases 2 Ambigs Antggnnnnennnnnagganceentca COGGCCTGAGGAAGGGTGG CGTCCTCACTAGCCTCCT cgctcacggtagaccaagaccnnnnnn Trim 34 five prime bases 19 Ambigs SK iz Trim 136 three prime bases 20 Ambigs d Quality five Has Chromatogram 733 good bases 1 Ambigs Angnngnntggntngnnn TCTGCCCCCCC
189. n about automatic assembly assembling interactively assembling groups of sequences using sequence name information the large gap algorithm for assembling cDNA and genomic DNA and assembling to Reference Sequences You will learn how to set assembly conditions for each of the different algorithms The Assembly Strategy Sequencher provides you with powerful options for performing your sequence assembly In many cases the default options will perform well However experimental variation can affect your data so there will be times when you may need to change the assembly parameters The following is a basic strategy for assembling your data e Start with strict parameters to minimize the possibility of incorrect matches Select the sequences you want to assemble and use the Assemble Automatically option Automatic assembly relies on an exhaustive search and compare algorithm e If some sequences do not assemble decrease the stringency of the parameters and try Assemble Automatically again e Ifthe selected parameters become relaxed enough that the Assemble Automatically process may result in scientifically insupportable alignments you should switch to interactive alignment so you can exert more control over the process The most frequently used options Assemble Automatically Assemble Interactively and Assemble to Reference are available as prominent buttons in the Project Window In addition Auto Assemble by Name and To Reference by Name
190. name your contigs reposition sequences remove selected sequences and Fill Coverage Gaps with the Reference Sequence or Trim to Reference Sequence Sequence menu The commands in the Sequence menu facilitate most of your work in sequence editing You can set up data entry preferences such as trimming poor quality data or vector contamination edit features create and rename sequences set up a Reference Sequence set up base numbering and Revert To Experimental Data View menu The commands in the View menu let you control how data is displayed as you work on it You can add extra information to the display specify the format and marking of the data and organize project files from the View menu Some of the functions in the View menu have options which you can change by going to User Preferences see Chapter 21 Customizing Sequencher and User Preferences for more information Window menu The commands in the Window menu offer access to different kinds of user help to reference information on various aspects of your data to different windows you have opened and to User Preferences Help menu The command in this menu offers access to user help Context Sensitive menus and buttons Sequencher has a number of menus and buttons that may change depending on the context Some items will become enabled The labels of other items will change This feature is designed so that Sequencher can offer you extensive options within the same eas
191. ncher assumes you have started typing a different name and starts looking for the new one instead Selecting multiple items with a marquee You can select multiple sequences by drawing a box around the icons or list items you want Position the mouse so it is not touching any icons or names The cursor should be approximately where one corner of the box should be Hold the mouse button down and drag diagonally across to where you would expect the opposite corner of the box As you drag the marquee appears as shown in Figure 5 1 As you move the mouse any icons or names touched by the marquee will be selected SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 26 Figure 5 1 Selecting with the marquee Variance Table SPF fel Assembly Parameters AbN Assemble Automatically Assemble Interactively Assemble to Reference Parameters Dirty Data With ReAligner 3 gap placement Min Overlap 20 Min Match 85 Quality Modified P Fri a 663 BPs Dec 16 2005 4 06 10 PM 821 BPs i Dec 16 2005 4 06 10 PM 664 BPs i Dec 16 2005 4 06 10 PM 824 BPs RN i Dec 16 2005 4 06 10 PM 661 BPs i Dec 16 2005 4 06 10 PM 842 BPs i Dec 16 2005 4 06 10 PM 665 BPs i Dec 16 2005 4 06 10 PM 673 BPs i Dec 16 2005 4 06 10 PM 664 BPs i Dec 16 2005 4 06 10 PM JUVE Wt 668 BPs i Dec 16 2005 4 06 10 PM A J12_15095_0045E1SNP88F_P1 663 BPs i Dec 16 2005 4 06 10 PM J12_150
192. nderline all ambiguous bases in the sequence or contig editors Go to the View menu and choose Base Ambiguities As Go to the submenu and choose Underlined to mark ambiguities or Not Highlighted for unmarked ambiguities Edited Bases Sequencher can highlight bases that have been edited which helps you audit the integrity of your data Go to the View menu and choose Base Edits As Select Not Highlighted for unmarked edits Bold Magenta for colored edits and bOLD amp cASE cHANGE for bold and a changed case Display Color Bases Sequencher can display bases in color To turn on this feature for either a sequence or contig editor display choose Display Color Bases from the View menu For information on how to change the color assignments see the section Ambiguity Codes in this chapter SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 167 Colors as Backgrounds Sometimes displaying bases in color does not differentiate them enough You can further enhance the colored bases by going to the View menu and choosing Colors As Backgrounds Sequencher then displays the base as a colored rectangle with the base character printed over it Display Base Confidences Sequencher can represent imported confidence scores as a colored background In order to view base confidence values as colored backgrounds go to the View menu and click on Display Base Confidences Sequencher will only display these backgro
193. ndicated as an N or as a disagreement or gap The Next Ambiguous Base command will also find positions in the contig where the only contribution to the SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 85 consensus is from a base with a low confidence score This type of base will be marked with a plus or a bullet below the consensus line Each time you press Ctrl N Sequencher moves the cursor to the next ambiguity in the consensus starting after the currently selected character After establishing this function you can move to the next ambiguous base by hitting the spacebar or going to the Select menu and using the Repeat Select command If you use the Shift key in conjunction with a Next Ambiguous Base key combination the cursor will move in the reverse direction Finding Disagreeing Bases You can find disagreeing bases quickly by typing Ctrl D or clicking in the contig line going to the Select menu and choosing Next Contig Disagree This type of base will be marked with a e bullet below the consensus line Sequencher moves the cursor to the next disagreement in the consensus starting after the currently selected character and ignoring ambiguities Finding Low Confidence Bases You can quickly locate low confidence bases by typing CtrI L or by going to the Select menu and choosing Select Next Low Confidence Base You can set the ranges for Low Medium and High confidence in the User Pr
194. nerate a change In Figure 15 4 above the slider is set so the secondary peak has to be at least 75 as high as the highest peak If you want Ns to be replaced then select the Allow Ns to be replaced checkbox SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 132 If vou ve already edited bases by hand and don t want them changed you must deselect the checkbox called Allow edited bases to be replaced If you select Only make changes that result in an ambiguity Sequencher may change an ambiguous base call to an A C G or T if it does not meet the secondary peak criteria If you prefer it to remain ambiguous do not select this option If you wish to search just a range of bases highlight them and check the Search Selection Only checkbox When you click OK to make the changes a dialog box notes how many bases will be changed you can choose to Cancel or Continue at that point If you continue the changes take effect Once they have taken effect unlike normal edits to a base they cannot be undone Once you have pressed the Continue button you can find all the changes made during the session by selecting the first base in the sequence you can click on it right on the chromatogram and going to the Select menu and clicking on Next Edited Base After you execute any Select Next command Sequencher creates a shortcut for that command in the spacebar so you can continue to search for your edits usin
195. nformatted Plain text Consensus formats AFDIL Fitch Genentech Specialist and legacy formats IG Strider GenBank NBRF EMBL Database formats FASTA FASTA Commonly used formats concatenated GCG Nexus PAUP interleaved Phylogenetic program format Nexus PAUP sequential Phylip Phylip 3 Phylip4 SCF 2 0 SCF 3 0 Standard Chromatogram Format Contig formats CAF Common Assembly Format MSF Multiple Sequence Format FASTA aligned Phylogenetic program format Nexus Paup Phylogenetic program format Subproject formats CAF Common Assembly Format FASTA Commonly used formats Sequencher Older project file formats Note SCF Standard Chromatogram File format is a file format for exchanging data between different sequencing systems that use four color chromatograms to show sequencing data Most current sequencing hardware supports this file standard Exporting protein translations Sequencher can automatically translate bases into proteins as it copies items to the clipboard These protein sequences can then be exported to another file or program To perform this select a portion or all of the sequence and then go to the Edit menu and choose Copy As and Protein Translation from the submenu Sequencher asks you whether you want to translate only the selected bases or whether you want the entire sequence but in a specific frame The default setting is to translate only the bases you selected You can then take this copy and paste it into another pro
196. nge sequence data Just type as you would with a standard word processor When you are finished entering and or editing a sequence click the close button in the top left corner of the window at the left end of the title bar or go to the File menu and choose Close Window Sequencher asks if you have finished editing At this point you can record your data as experimental data or you can tell Sequencher you have not yet finished editing by selecting the appropriate button Sequencher then removes the Sequence Editor window from the screen The sequence is now represented by a sequencing icon in the Project Window Figure 4 3 Create New Sequence dialog box Set Name New sequence s name Frag 0001 Cancel Double clicking a project icon When you double click on a sequence project icon this launches Sequencher and opens the project If a Project Window is already open when you double click on a project icon Sequencher will ask whether you want to save changes to your current project before closing it Click the Cancel button if you do not want to proceed If you want to close the project and save any changes you have made click the Yes button If you want close the project without saving your changes click the No button SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 21 Importing Confidence Scores Many base callers generate confidence values associated with each base call Sequencher
197. ns on trimming ambiguous data and vectors at the time you load your fragments see the chapters Importing Data Preparing Your Data for Assembly and Customizing Sequencher and User Preferences elsewhere in this manual SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 46 Setting the base numbering Open the Sequence Editor and highlight a base Go to the Sequence menu and choose Set Base Number From the submenu choose either As Base 1 or As Base Number If you choose As Base Number you will see a dialog box and can then enter a new number for the selected base Click OK to dismiss the dialog box Note If you set a base to be base 1 and this is not the first base in your sequence every base before it will have a negative number Set Circular Genome Size Sometimes you will be working with circular DNA such as plasmids or even small genomes The Set Circular Genome Size command allows you to set the number of bases in your DNA circle Select a sequence that has already been defined as a Reference Sequence Then go to the Sequence menu and choose the Set Circular Genome Size command The dialog in Figure 8 3 below will appear Enable the circular number by clicking the checkbox Then enter the number of base pairs in your fragment Dismiss the dialog window by clicking the OK button Figure 8 3 The Circular Genome Size dialog window Circular Genome x je Enable For T
198. ntamination The default is zero bases Extra material if any to be dropped off the 3 end of the remaining DNA sequence along with the vector contamination The default is zero bases Once you have finished choosing vectors and setting recognition criteria close the Vector Insertion Sites window SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 38 How Sequencher screens sequences When you execute the Trim Vector command Sequencher screens the selected sequences for vector contaminants using the vector sequences you specified Sequencher starts at the designated insertion site the bullet and then scans outwards Any sequences that are found to contain vector bases are collected in the Vector Contamination window See Figure 6 9 Please note that any contigs you have selected will be ignored as will any sequences containing fewer than 21 bases Figure 6 9 The Vector Contamination window Vector Contamination Trim Checked Items H Sort Items 10 items in list 10 items have at least 1 check AE trev BlueKSm 240 good bases 0 Ambigs BlueKSm Trim 170 five prime bases 0 Ambigs Sh iz Trim 247 three prime bases 0 Ambigs BlueKSm 448 good bases 0 Ambigs BlueKSm Trim 232 five prime bases 0 Ambigs Sh Hz Trim 20 three prime bases 0 Ambigs BlueKSm 245 good bases 0 Ambigs BlueKSm Trim 92 five prime bases 0 Ambigs 5 Ke Trim 304 three prim
199. ntral to Sequencher Users work within the framework of a project A Sequencher project comprises a collection of DNA sequences and contigs contiguous alignments of overlapping sequences that are built from those sequences A project can be as large or as small as you want Sequencher stores the sequences you enter the information on how these sequences fit together to create any contigs you have formed from them and information on user specified parameters that control the alignment operations all combined into a single data file The Project Window displays all of your sequences and contigs Creating a new project When you launch Sequencher by double clicking the program icon a new empty project is created see Figure 3 1 Figure 3 1 An empty Sequencher Project Window Untitled Project DER D Assembly Parameters Assemble Automatically Assemble Interactively Assemble to Reference Parameters Dirty Data With ReAligner 3 gap placement Min Overlap 20 Min Match 85 Guaity Kind Label Modified SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 8 Until you create or import sequence fragments the new project will remain empty As you add sequences to your project and build contigs from them the Project Window begins to look more like Figure 3 2 Figure 3 2 A new project S Untitled Project Assembly Parameters Assemble Automatically Assemble Interactively Ass
200. nts any digit 0 9 Represents any digit A character or special character which is followed by matches zero or more instances of the character A character or special character which is followed by matches one or more instances of the character A character or special character which is followed by matches zero or one instances of the character N A character or special character followed by N where N a whole number matches that number of instances of the preceding character The contents of square brackets describe a list of matching items regardless of order A Use this in square brackets to exclude the other items contained within the brackets The is known as the escape character and it allows the character to be interpreted literally It can also give a standard character a special meaning as in d Matches whatever you type within the parentheses The acts as an or SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 191 You may require a more complex delimiter then those provided in the Name Delimiters drop down menu You can use regular expressions to define these more complex delimiters Table 24 1 Examples of delimiter expressions Delimiter Sequence Name Handle Handle2 Handle3 Handle4 Handle5 Handle Expression Gel A T dog Gel A 17 dog 2 Gel2 A T72 do
201. o create a Translated Variance Table that summarizes the differences between the translations of selected contig consensus sequences in your project and the translation of a common Reference Sequence Translated consensus Variance Table When you are working with multiple samples from multiple sources and have used Assemble by Name with a Reference Sequence you should use this form of the Translated Variance Table Figure 13 1 Variance Table Menu Commands Sope Sequence View Window Saach View Window Help Assembly Parameters Create New Sequence le to Reference Assemble Contigs Consensus 2 aed Compare Bases To Compare Translation To Consensus Inclusively v Consensus By Plurality Top Sequence Consensus to Forensic Standards Fill Coverage Gaps with Reference Call Secondary Peaks Compare Con us to Reference A Compare Translation to Reference Revert To Experimental Data validate Mitotypes Trim to Reference Sequence Trim Ends Create New Seq From Consensus Trim Vector Rename Contig Dissolve Contig Speech r The Structure of the Variance Table Each row in a Variance Table summarizes the variation among all of the selected sequences at a given base position relative to the exemplar see Figure 13 2 The exemplar can be a Reference Sequence a consensus sequence or any other sequence you choose In the consensus Variance Table the exemplar is limited to the Reference Sequence SEQ
202. o use this command go to the Contig menu and choose Trim to Reference Sequence This command is only available when you are in a contig that contains a Reference Sequence Note You cannot undo this action If there are any sequences which do not overlap the Reference Sequence they will be removed from the contig Sequencher will warn you which fragments have been removed in a window which you can print for your records Figure 7 3 Contig with Reference Sequence flanked by overhanging sequence before and after E Before trim to Reference TOX B After trim to Reference TOX D Pa Pr f canen WR Espen p Era f Sunmay ff camen WR Peer AE Quality three ATGGCAAAAAATGAGGTTTT AE Quality one aE Quality two afi Reference aE Quality one E Quality two dei Reference AE Quality seven ES ES 5 frag bases isee Next selected at Ambiguous Base spacebar consensus position 1 Select Next Ambiguous Base spacebar Contig Editing with a Reference Sequence If you try to edit a Reference Sequence which has been assembled into a contig using either the standard Assemble Automatically or Assemble to Reference a warning dialog will be displayed asking if you are sure this is what you want to do You can proceed with the edit by clicking the Yes button You can cancel the warning and continue to edit by clicking the No button The Reference Sequence is protected from any changes you make while you are editing
203. odon frame 1 rectangles i Stop codon frame 2 show features Diagram Key In the Bases view of a sequence motifs will appear highlighted in the manner you specified You may need to deselect Display Color Bases Colors As Backgrounds or Display Base Confidences to see the Motifs more clearly In Figure 17 4 for example all start codons are in lower case type and blue with only Colors as Backgrounds on Figure 17 4 Sequence Editor showing motifs E felc CGTAGACAGGGAGGGAAAGAAGTGTGCAGAAGGCAAGCCCGGAGGCACTTTC 1981 RAGABERIAG CATATCTCATCTTCCCGGAGARARARARARAAGABEIGTACGTCTG koss AGABRRIAAATTTTGAAAGAGTGCZE GGTCGTTTGATAATTTGTCGGGAAA 2091 AACAATCTACCTGTTATCTAGCTTTGGGCTAGGCCATTCCAGTTCCAGACGCAGG CTGAACGTCGTGAAGCGGAAGGGGCGGGCCCGCAGGCGTCCGTGTGGTCCTCCGT GCAGCCCTCGGCCCGAGCCGGTTCTTCCTGGTAGGAGGCGGAACTCGAATTCATT TCTCCCGCTGCCCCATCTCTTAGCTCGCGGTTGTTTCATTCCGCAGTTTCTTCCC Eame ACCTGCCGCGTACCGGCCACTTTGTGCCGTACTTACGTCATCTTTTTCCTA AATCGAGGTGGCATTTACACACAGCGCCAGTGCACACAGCAAGTGCACAG caac GTTTTGGCCCCTAACCGCTCCGTG CCTACCAAGTCACAGACCCTTTTCA If you have a certain set of motifs you want to use for several projects for example if you use motif definitions to highlight certain restriction sites you can save your definitions to a file To do so click Save Motifs at the top of the motifs window and follow the instructions Then click the Load Motifs button to use those definitions in other projects Features What is a feature A feature r
204. of the MacOS X file system If you use the Import menu item with the suboption Sequences OSX allows you to select multiple files for a single import command To select a continuous list of sequences choose the first sequence in the list and then holding down the Shift key select the last sequence in the list The import window selects the two sequences and all of the files in between Click the Open button to import all of the selected sequences in to your project To select a discontinuous list of sequences hold down the Control key while clicking on the file Click the Open button to import all of the selected sequences into your project The Files of Type drop down menu allows you to filter the files that are displayed in the file browser The filters are With Chromatogram Sequences Without Chromatogram SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 18 Sequences Supported File Formats and All Files You may see some files becoming grayed out with different file filters because these files do not meet the import criterion of the current filter see Figure 4 1 where With Chromatogram Sequences is set but some of the files are text only files Figure 4 1 File import Enable option set to With Chromatogram Sequences Look in 9 Sample Data EI ch ES E ADARCI E ADARC E ADARCZ E aDARCS E aparc3 E ADARC4 E aparcs E ADARCE File name Open Files of type
205. oiety gene primer binding site STS locus_tag bound_moiety protein binding site primer Misc recombination gene map gene iDNA misc structure note SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 204 Feature Key Default Feature Qualifier a stem_loop gene stem loop b D loop gene D loop 10 gap gap of estimated_length 11 operon operon operon Source organism Unrecognized x Sequencher Feature Key is for personal annotation ae Legacy keys Se This is used for Feature Keys which are unknown to Sequencher so that they can be imported in to a project SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 205 30 Glossary Name Administrator Agent ASCII Assemble by Name Base caller Button bar CAF Checkbox Click Confidence score SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved Definition Also known as Admin or admin user A user account that typically has privileges which allow the user to change system settings and install software A region in some dialog boxes and also in the contig editor that contains information in text form American Standard Code for Information Interchange the most widely accepted coding system that permits letters numbers and punctuation to be represented in binary computer code A feature
206. oject Window any icons that already have open editors are dimmed To move an open editor to the front just double click the dimmed icon for example the icon for Contig 0002 shown in Figure 18 1 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 151 Figure 18 1 A dimmed icon Sample Data SPF DEAR D Assembly Parameters Assemble Automatically Assemble Interactively Assemble to Reference 3 items _ Contig 0001 Contig 0002 Contig 0003 Finding the current selection In a sequence or contig editor you can always scroll to the currently selected bases by using the Enter key on the number keypad Searching in Open Windows Finding a subsequence You can search for a specific subsequence in an open sequence or contig editor Go to the Select menu and choose Find Bases Sequencher displays the Find dialog box shown in Figure 18 2 In the Find What field type the string of bases you want to find Use the drop down menu to specify the recognition sequence of one of a subset of restriction enzymes available Use the Other option in the drop down menu to type in the name of your enzyme of interest If it is in the database the recognition sequence will be placed in the Find What box Figure 18 2 Find Bases dialog box Find Bases C Exact Matches Specified Bases C Any Ambiguous Base FindwWhat oO SSSS Find Reverse amp Comp Done Find locates
207. olumn chromatograms E Contig Editor LG Overview f Summary cut man po _ show Chromatograms Goes TTACACAGGCCTGTCCAAAGGTATC TTTGAACCAATTCCCATACATTATTS TTACACAGGCCTGTCCAAAGGTATCETTTGAACCAATTCCCATACATTATTS TTACACAGGCCTGTCCAAAGGTATC TTTGAACCAATTCCCATACATTATTS TTACACAGGCCTGTCCAAAGGTATC TTTGAACCAATTCCCATACATTATTS 4 frag bases selected at consensus position 1 003 Select Next Ambiguous Base spacebar B Chromatograms from Contig Editor ment base 635 Base 635 of 727 T T T CB TI T T G Bai ADARCS Fragment base 573 Base 5 3 of 714 A T Ce TT G AT Cc G ZS ADARC4 Fragment base 282 Base 282 of 739 TAT CC Tf T TG A 8 a aa n en ADARCS Fragment base 227 Base 22 A T CHT T T Ad d AA SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 130 When you click the Show Chromatograms button in a region without trace data you will see a No Chromatogram Data message in the Contig Chromatogram window When you move the cursor back into a region where there are fragments with chromatogram data the traces will reappear Understanding the Trace Display Above the traces are two lines of bases The upper line displays the sequence as it currently appears in the editor including any edits you may have made The lower line displays the base calls as originally imported including the trimmed data The arrows to the left show the orientation and relative lengths of
208. om of map Cancel Setting the map caption Use the checkbox List enzymes names at bottom of map to toggle the caption at the bottom of the restriction map It lists all of the enzymes used to make the map see Figure 16 5 above The caption also lists as Non Cutters enzymes that were not mapped For SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 139 example if you were displaying only unique cutters then enzymes that cut more than once would be excluded They would be listed as Cutters that are not mapped Enzymes that did not cut at all would be listed as Non Cutters Getting more information about an enzyme You can get more information on the enzymes in your restriction map or edit the enzymes included in the map In Figure 16 6 where the Multiple Lines style is being used the SacI enzyme is selected Figure 16 6 SacI enzyme selected E Epal 1 c 7 631 800 PstI 2 a o_O O _ 781 1 029 SacI 2 123 234 425 496 Bac 4 i i i i i If you click on an enzyme label in the map and go the File menu and then choose Get Info you can bring up the Restriction Enzymes dialog box You can also use the keyboard shortcut Ctrl I Available enzymes appear in the scrolling list on the left side of the window Sequencher highlights the enzyme you have selected and puts the appropriate information in the Enzyme Name and Recognition Sequence
209. on is now enabled Click on the Define button The Advanced Expression for Name Parsing window appears At the top of the window is a text box where you can type in a regular expression Assemble by Name uses regular expressions in two ways When the Expression is a delimiter box is checked the regular expression you type in lets you define a delimiter option other than one available from the drop down menu SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 70 If the Expression is a delimiter box is not checked your regular expression must completely define each Assembly Handle and Name Delimiter in your sequence name The regular expression you write will not work unless it describes the entire name of your sequences When you type the regular expression into the text field it should be in the form Assembly Handle1 Name Delimiter Assembly Handle2 In the example in Figure 10 3 below a regular expression is used to combine the first two parts of the sequence name Origin and Clone to create a new Assembly Handle Origin amp Clone The Preview button displays the results of your regular expression If you are satisfied click on the OK button Figure 10 3 Advanced Expression for Name Parsing dialog window Advanced Expression for Name Parsing AACA Preview Expression is a delimiter 01301882 01301882 01301882 01301882 01301882 01301882 01301882 1933 45 Or A A A A A
210. on the button changes to reflect which of these choices is active Figure 9 4 Assemble Interactively dialog box Assemble Interactively Candidates Matches Watch Overlap Mismatch Gaps Hew Contig Len HLA HLAO001 Assembly Parameters HLAcHLAO0002 HL amp HLAO0004 HLAcHLAODD0S HLH oppe HLA HL ng HL amp HLAOO010 HL HLA00011 HLA HLAOOD12 disagreements s highlight ambiguities Candidate Match To have Sequencher compare any single sequence or contig with the other items in the Candidates list click on the name of that sequence or the contig in the Candidates list When the comparison is complete the possible matches appear in the Matches list to the right of the Candidates list as shown in Figure 9 5 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 64 Figure 9 5 Interactive assembly showing a candidate and its matches Assemble Interactively Candidates Overlap Mismatch Gaps Rew Contig Len al HLA HLAOODO4 Assembly Parameters HLA HLAODDO2 HLA HLAO0002 HLA HLAO0004 HL amp HLAO0009 HLA HLAOO00S HLA HLAO0018 HLA HLAO000S HLA HLAO0017 HLA HLAO0009 HS HL p HL amp HLA00010 HL HL op HLA HLAO0011 HLA HLA00016 HLAHLAODD12 HLA HLAODD12 ATGGCCGTCATGGCGCCCCGAACCCTCCTCCTGCTACTCTCGGGGGCCCTGGCCCTGACCY 46 Candidate EE EES 1 10 20 30 40 50 6 If Sequencher fails to find a match as in Figure 9 6 the agent panel displays a message to that effect Figure 9 6 Match not
211. one letter abbreviations for amino acids Base grouping To the right of the translation display icons are the icons that let you choose blocking the number of bases to be displayed in a single group The available settings are 3 5 10 20 and Fill to margin In Figure 8 14 above the icon for a grouping of 10 is selected Line spacing You can choose single double or triple line spacing The three small icons that let you control the line spacing are located to the right of the base grouping icons Figure 8 14 above shows the icon for single spacing selected Case and Font To the right of the line spacing icons is a pair of icons that let you choose upper or lower case for the display In Figure 8 14 above the icon for upper case is selected To the right of the upper and lower case icons is a pull down menu which lets you choose a font and size Only fixed width fonts are displayed ensuring that the rows of bases will line up vertically SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 55 Voice verification Sequencher provides two forms of audio feedback to help you maintain the quality of data input when you are entering information manually from the keyboard You can have Sequencher announce each base as you type it in You can have the computer read the sequence aloud back to you while you look at the original source of the data Sequencher stores the sounds that it uses in a sound
212. ontain Items With Names Containing Sequencher will select the contig s that match your selection criteria You can also use the Select and then Item Named menu option to locate which contig contains a particular fragment With this command you must type the full name in to the Find What box and then press the Find button The contig containing the fragment will be highlighted SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 27 Renaming a sequence or contig Renaming using the mouse To rename a sequence fragment or contig with the mouse click twice on the name of the item This selects the icon and puts a text box around the name Just type in the new name and when you are done click elsewhere in the Project Window If you only want to change part of the name drag the cursor over what you want to replace and then type the replacement text in the highlighted area You can also use the Copy and Paste functions to rename sequences Renaming using the menu If you wish to rename a sequence you can go to the Sequence menu and choose the Rename Sequence command To rename a contig go to the Contig menu and click on the Rename Contig command Sequencher displays a dialog box Figure 5 2 with a field for you to enter the new name Click the OK button when you have finished Figure 5 2 Renaming a sequence Set Name New name My New Name Cancel Labeling items in
213. ontig menu s Compare Consensus to Reference command creates a Variance Table that reports differences from multiple contigs see Figure 13 16 The Review mode lets you navigate efficiently between a difference in one contig and the equivalent position in the next Each column in the Variance Table contains the differences from separate contigs As you change your selection across the Variance Table rows Sequencher closes the Contig Editor and the Contig Chromatogram Editor from your previous selection It will then open the editors corresponding to your current selection Just as you can edit contigs from the consensus line in a contig editor you can also edit contigs from a selection in the Variance Table Review mode Your selection in the Variance Table corresponds to a selection in the consensus of the underlying contig Note Remember that when you type over a base in the Variance Table the underlying data in your contig will also change SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 109 Figure 13 16 Comparing Contigs in Review Mode Compare Consensus to Reference oOx B GWB ey ET Description 5 consensus sequences compared to reference E NC_001807a 7 end ACC AT Aa We sequence NC_001807a 3 end 3 A_GVWB_2001 05_Primer1R CGGTACQCCAT AA A Comparison Range Unfiltered 3 A_GVWB_07061948_Primer1 pl CGOGTACCAT AAA Base Positions 15884 16571 3 A_G
214. ontrols the position of the cursor on the screen and is used to enter some kinds of information See Click Double click Drag Multiple Sequence Format A format used by multiple sequence 207 Name NEXUS Paup Project Project Window Quality score Radio button Reference sequence Refrigerator SCF Secondary peak Sequence Sequence Editor Slider Title bar VecBase SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved Definition alignment programs Format used by phylogentic and cladistic software A project contains sequences information on how these sequences fit together to form larger pieces and parameter information specified by the user that controls alignment operations The window in which Sequencher displays the contigs and unincorporated Sequences of a project Items in the Project Window are can be shown as icons small icons or as a sorted list A number associated with each base call and which defines the likelihood that a base call is incorrect See also Confidence score A control that lets you choose among alternate possibilities Radio buttons always appear in sets of at least two Click one of the buttons to select it A button that is selected is shown with a filled circle buttons that are not selected are shown with open circles A sequence used as a prototype or benchmark sequences If a sequence has been designated as Reference wit
215. ormation for a sequence OF contig eeesceseeseeseeseeseeseeseeseeseeseeseeaeeseeseeseeaeeaeeaeeaeeaseaseaeeaeeas 15 Fhe Project Window bution Dari 29istesidugeteegi eiis geseid siadated a aa geseid saatuaen 15 Template ET 15 Save Project AS Templit nni e e EE AE ANE Th 16 New Project From Templite 2 ees 16 Iipott fromi Template erte SEENEN AE AE 16 A IMPORTING DATA wisi onc cccccscicsiniesinsiccuideninieeiiabinedddaiiieinestauabisecdiueiessneminnaud 17 Quick and easy importing of CAA ee ege E E 17 Dragging and dropping Tiles iii eeiege eene EENS 17 COpy and EE 17 How to import using menu COMMANGS2 eivediesecsnneedecswckestcamnasvecsnneneedamaenvon meee iaamennces 18 NMPOrtin S SEQUENCES mrena EE 18 Importing GenBank features nanena cdhagebagensadve ctnardbecesogehe avagisazesacdes cabazevaeabereee cfhareravenar east eitg 18 Importing lists Of seg entes ee SEENEN EENEG 18 Folder OF sequences mon ranra EE cedkacde EE EE 19 Projects inspecitic KON N a 0E 1 EAA AAN ech EE sav ancusets A R tees 20 Segue cher EE 20 Files froni Near nanan eE ORAE OTO OROORO AEA 20 Creatine EE 21 Double clicking a project 1COM sireroncrnencitiuiicniiiinin niii iiaii iiaii iiia 21 Importing Confidence Serres Aude 22 Importing PHRED called daf renin ear O ORO R RRRA OR AARAA 22 Importing PHRAP Fee ee AAA re eee oe 22 Removing sequences and contigs from a Project sess ssetesseeeeseneeseeseseeseseesesecscseeseeeseeeesesaeseeaeeeaceeeas 23 Clos itis ant Git
216. ove the icons and below the parameter information Viewing constituent sequences When you have contigs in list format click the plus sign to the left of the contig icon Figure 3 9 to see a sublist of the sequences incorporated into that contig That plus sign will turn in to a dash sign and the sequence list will expand see Figure 3 9 To hide a sublist of sequences click the dash sign again it turns to a plus and the sequence list will collapse SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 13 Go to the View menu and click on the Expand All and Collapse All commands to either display or hide all of the contents of the project s contigs Figure 3 9 Viewing a sublist of a contig s constituents E Variance Table SPF SEE D Assembly Parameters Assemble Automatically Assemble Interactively Assemble to Reference bt Parameters Dirty Data With ReAligner 3 gap placement Min TE 20 Min Match 85 1011MARa 705 BPs Sun Mar 12 2006 4 58 37 PM a EN E 13806 913 BPs Sun Mar 12 2006 4 58 37 PM E Si 14210 912 BPs Sun Mar 12 2006 4 58 37 PM J04_14270_0045E7 SMPS8R_ Pt 824 BPs o Fri Dc 16 2005 4 06 10 PM Fererence 550 BPs Tue Dc 73 2005 4 39 43 PM J04_14270_0045E7 SMPS8E_ Pt 665 BPs i Fri ec 16 2005 4 06 70 PM ES B 14667 930 BPs Sun Mar 12 2006 4 58 37 PM ES IS 14839 763 BPs Sun Mar 12 2006 4 58 37 PM E E 14969 758 BPs Sun Mar 12 2006 4 5
217. own menu in the Assemble By Name pane under the Assembly Parameters window now contains the names of your handles Next to each handle is a number enclosed in square brackets This indicates the probable number of contigs you can generate using that handle If you want to use a more complex delimiter you must select Advanced Expression from the Name Delimiters drop down menu Clicking the Define button will take you to the Advanced Expression for Name Parsing window In this window you write and preview the regular expression you want to use You can find more information on regular expressions by going to Appendix 24 Advanced Expressions or for more detail consult the tutorial in the Sequencher Tutorials folder You can find more information on Assemble by Name in Chapter 10 Assemble by Name For information on regular expressions see Appendix 24 Advanced Expressions or for more detail consult the tutorial in the Sequencher Tutorials folder SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 61 Refining the Assembly conditions Minimum Match Percentage The Minimum Match Percentage can be used with all three assembly algorithms It is used to set the proportion of bases which must match in candidate sequences before Sequencher accepts the sequences as actually overlapping The default value of 85 can be changed by moving a slider If this value does not give you any ove
218. owse button opens a window called Browse For Folder From this window you can choose where to store the new text file s You may create a new folder in which to store your sequences by clicking on the Make New Folder button at the bottom left of the window Click on the OK button to record your chosen location If you execute the Export command with the Sequence suboption while a contig is selected Sequencher will create a folder with the name of the contig The folder contains all of the sequences in the contig Exporting Selection as subproject If you wish to share your data with colleagues who may have older versions of Sequencher or another program choose Export from the file menu and then Selection As Subproject then select the appropriate version or type from the Format dropdown menu In certain cases you may want to move some but not all items from one Sequencher project into another Working in the Project Window select the sequence fragments and contigs you want to export Go to the File menu and choose Export and then choose Selection As Subproject Sequencher creates a new project containing only the items you selected Now you can import the newly created sub project into the chosen Sequencher project SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 160 You should remember to provide a relevant name to define the new project because you have derived it from a selection of mu
219. prove this product please contact us You can telephone us at 734 769 7249 fax us at 734 769 7074 or write to us at Gene Codes Corporation 775 Technology Drive Suite 100A Ann Arbor Michigan 48108 http www genecodes com SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved We welcome your comments or support questions by electronic mail at support genecodes com We welcome your general inquiries by electronic mail at gcinfo genecodes com SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 2 Using this manual This chapter will introduce you to the conventions we will be using in this manual and how to get help We will then introduce you to the button bar and Sequencher s menus which offer you a rich range of options while maintaining a scientist friendly interface Conventions used in this manual This manual follows certain conventions for displaying text and pictures and for alerting you to special information Installing Sequencher Sequencher is easy to install Your installation package will include a CD a Sequencher key dongle and an installation guide Follow the instructions in the installation guide to install your software You must have Administrator privileges in order to install Sequencher Sequencher Help Sequencher offers help through a Help viewer To display Help go to the Help menu and choose Seq
220. quences are no longer of interest The Edit menu s Remove From Table command allows you to delete sequences selectively from your Variance Table To select a continuous range of sequences choose the first sequence by clicking on its column header and then holding down the Shift key select the last sequence in the same fashion To select a discontinuous list of sequences hold down the Control key while clicking on the sequences Go to the Edit menu and choose the Remove From Table command Note The Remove From Table command does not remove sequences from the underlying contig Review Mode In Review mode the Variance Table lets you view the data supporting each cell see Figure 13 15 You can access the Review Mode by clicking on the Review button in the Variance Table button bar or by double clicking on any cell in the table Sequencher will open both the Contig and Chromatogram editors with the associated base or column of bases selected You can specify the layout of these three Review windows the Variance Table the Contig Editor and the Contig Chromatogram Editor and then set the new organization as your default layout After you have arranged your windows go to the Window menu and choose the Remember Window Layout command then select Single Variance Table Review from the submenu SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 108 You can edit your data directly in the Variance Ta
221. rands show motifs i B hollow i Start codon frame 1 rectangles SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 79 Base Numbers At Transitions Clicking Base Numbers at Transitions displays or hides the numbers showing approximate positions of the coverage transitions Base Numbers At Every x Bases Selecting Base Numbers Every x Bases shows the base numbers at constant intervals Condensed Base Numbers Selecting the Condensed Base Numbers option changes the font of the base numbers to a compressed version This option can be used with either Base Numbers at Transitions or Base Numbers at Every x Bases Getting more information Find You can search for a specific subsequence in your contig Fragments which contain the subsequence will appear with a heavier line in the Overview You can choose to find the first occurrence by clicking the Find button You can search for subsequent occurrences using the Find Next button Three radio buttons allow you to control the specificity of the search by choosing Exact Matches Specified Bases which would allow you to use ambiguity codes such as W to find T or A You can use the Any Ambiguous Base command which will find any degenerate DNA that may match There is also a drop down menu from which you may choose restriction enzyme sequences If you have used a Reference Sequence and want to include it in the search select the Include Reference Seq
222. references Switch between large icons small icons and list view Names at Left Sequence editor Contig Insertion point moves one base to the editor ummary view right Insertion point moves one base to the left Insertion point moves 3 bases to the right Insertion point moves 3 bases to the left Insertion point moves to 3 end of sequence Insertion point moves to 5 end of sequence Display Formatting Ruler Frame 1 Frame 2 Frame 3 Windows Ctrl L Ctrl M Ctrl Ctrl R Ctrl 4 Ctri 5 Ctrl 1 Ctrl 2 Ctrl 3 Ctri Ctrl Ctrl T Ctrl U Alt click in Title line Ctrl Shift lt Right arrow key Left arrow key Alt right arrow key Alt left arrow key Ctrl right key Ctrl left arrow key Ctri Alt R Ctri Alt 1 Ctrl Alt 2 Ctri Alt 3 Ctri 6 Ctrl 7 Ctri 8 Ctri 9 Ctrl 0 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 190 24 Appendix Advanced Exprssions A regular expression is a special way of describing a search pattern using text strings according to certain syntax rules Sequencher uses regular expressions to help you break down your sequence naming schema in order to define your Assembly Handles Table of Regular expression special characters Represents any character A Za z Represents any letter A Z Represents any capital letter a z Represents any lowercase letter d Represe
223. replace Assemble Automatically and Assemble to Reference when Assemble by Name is enabled see SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 58 Chapter 9 Sequence Assembly for more information There are numerous other options under the Assembly Parameters and the Contig menu Setting the Assembly Conditions Setting assembly parameters To change the parameter settings select the Project Window Then either click the Assembly Parameters button in the Project Window or go to the Contig menu and choose the Assembly Parameters command Sequencher displays the dialog box shown in Figure 9 1 Figure 9 1 Assembly parameters dialog box Assembly Parameters Assembly Algorithm e Dirty Data C Clean Data C Large Gap The dirty data algorithm is SLOWER and more rigorous than the clean data algorithm Ambiguous base calls are considered poor matches to exact base calls Optimize Gap placement Iw Use ReAligner Use ReAligner Anson amp Myers 97 a ggatc to optimize gaps for small inserts and double called bases Manu remove hand entered gaps Iw Prefer 2 Gap Placement Minimum Match Percentage 60 Minimum Overlap Assemble By Name Enabled Handle1 0 7 Handle2 0 Name Settings Handle3 0 Set as Defaults Cancel Sequencher uses the values set in this window to control the way it assembles sequences Once you have set the options you require click the OK
224. reserved 179 If you are making changes to the way Feature Keys are currently displayed you can click on the Update Project button to update your project You can decide whether to apply the current styles name or both to All features in your project If you do not want to do so just limit your choices to Selected features in your project by clicking the appropriate button When you are finished click on the Done button to return to User Preferences Use one or more of the three Display Feature Numbering For check boxes to indicate which of the second line features Complement Bases RNA Bases or Amino Acids should have its own numbering Use the Motif Default Style pull down menu to set color case and underlining style for new motifs You can see a preview of the motif style in a box on the right hand side of the Motif Default Style preference pane Figure 21 15 Features and motif preference settings User Preferences General Settings futo Save Define Feature Key Default Styles Confidence SE SS Display Feature Numbering For Sound Complement Bases New Project E Display RNA Bases Chromatogram Contig Chromatogram a Amino Acids Contig Feature Matt Format Ruler Motif Default Style Start Stop Variance Table Input Output File Import Report Format Ruler The Format Ruler preferences controls the display defaults for any view which has a format ruler These are just defaults you
225. rested in enzymes with special characteristics such as blunt ends or 3 overhangs explore the commands in the Select menu within the dialog Start by choosing Select None to clear all selections Each menu item you choose after that will add all of the enzymes with the specified characteristic Selections are cumulative no command except Select None removes any enzymes from the list If for example you want all the enzymes that recognize sequences either 4 or 5 bases in length choose Select None followed by 4 base and then 5 base When you are finished selecting enzymes click OK in the lower right corner of the dialog box or press the Return key Sequencher redraws your restriction map with the newly selected enzymes Changing the default enzymes To change the default enzyme selections for items in a project select the fragments or contigs you want to change Go to the Window menu and choose Specify Restriction Enzymes while viewing the Project Window Sequencher will warn you if no sequences are chosen The enzymes you choose at this time are now the default set for any selected sequences or contigs in your project Adding and editing enzymes Sequencher lets you add new enzymes to the list Go to the top of the enzyme editor and select Allow Changes All the sliders will now change from gray to blue Click on New Enzyme to open a blank entry in the enzyme editor Note that the OK button changes to Done You can now enter inform
226. rlaps you may need to decrease the Minimum Match Percentage by moving the slider to the left to lower the percentage If you have an incorrect or unexpected overlap you may need to increase the Minimum Match Percentage by moving the slider to the right higher percentage Minimum Overlap The Minimum Overlap can be used with all three assembly algorithms It is used to set the minimum number of bases that must overlap before Sequencher accepts the sequences as actually overlapping The default value of 20 can be changed by moving a slider If this value does not give you any overlaps you can decrease the Minimum Overlap by moving the slider to the left to indicate a smaller number of bases that must overlap If you have an incorrect or unexpected overlap you may need to increase the Minimum Overlap by moving the slider to the right larger number of bases that must overlap Maximum Loop Out Size The Maximum Loop Out Size is the largest number of consecutively mismatched bases in this instance a gap also counts as a mismatch that are acceptable in a potential overlap The number of mismatching bases is set by moving an elevator button up or down The maximum value that can be set manually is 6 Remember that this option can be used only with the Clean Data algorithm Optimization of gap placement ReAligner is an optional step which can be used with the Clean Data and Dirty Data algorithms It evaluates the disposition of gaps within a con
227. roject from the File menu Sequencher will ask if you want to save the changes to your project Click Yes to save your changes and close the project Click No to close the project without saving your changes Click Cancel if you wish to continue working with the project You can then start working with another project by going to the File menu and choosing New Project or Open Project SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 23 If you have finished working with Sequencher go to the Sequencher menu and choose Quit Sequencher SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 24 5 Organizing project views In this chapter on organizing project views you will learn more about the structure of a Sequencher project as you begin to work with your data We will discuss how to select individual and multiple sequences using various techniques renaming items and saving your work Working with icons and lists Changing between project views Sequencher provides you with a variety of ways to view and organize the contigs and unincorporated data in your Project Window You can view your data as large or small icons or in list form for more information see Chapter 3 The Project Window You can toggle quickly between types of views by holding down the Alt key and clicking in the header title area just above the icons and below the parameter information Yo
228. rporation Inc All rights reserved 105 indicate that there is an associated confidence or quality score range for the variant Variants that require additional checking because they have a low confidence or quality score will be highlighted with dark blue Those with medium or high confidence scores will be highlighted with a medium or pale blue background If you click on Display Color Bases and Display Colors as Backgrounds it can help you to see patterns in your data see Figure 13 14 You can use Base Ambiguities As to distinguish ambiguous or edited bases You can also toggle the display of features from the View menu You can use the Label command to mark sequences in the table Set your preferences using the Label amp Name preferences pane Use the boxes grouped under Available Labels in the preference pane to choose label colors and descriptions for marking individual sequences Select the sequence you want to label by clicking its column header Go to the Edit menu and choose the Label submenu The display name of the sequence will change to match the color of the Label See Figure 13 14 Notice how the tool tip provides the name of the sample its label and other attributes Refer to Chapter 21 Customizing Sequencher and User Preferences for more detailed information Figure 13 14 The Variance Table Display Options O Compare Bases to Reference Sequence K e r Description 15 sequences compared to reference sequence N
229. s checkbox if you do not want them to appear in your chromatogram windows SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 176 Figure 21 11 Chromatogram preference settings User Preferences General B Settings Es futo Save H Cal Confidence Label amp Name Height of Chromatograms Menu C Tiny C Short Sound Standard C Tall New Project Display Lane Identification By Chromatogram me Color C Pattern Contig Chromatogram Contig Options Feature Motif Je Colors Match Current Bases Format Ruler Hide Original Base Calls Start Stop Print Scale Factor Variance Table Rev amp Comp Displayed as Lower Case Input Output File Import Width Peak to Peak Report Narrow Wide If you click the Print Scale Factor checkbox and you used the slider shown at the left in Figure 21 12 the scale factor will appear on any printout of that chromatogram If you click the Rev amp Comp Displayed as Lower Case checkbox the original base calls for a reversed and complemented sequence are shown in lower case as opposed to backwards as the second of two lines of bases above the traces as in Figure 21 12 Figure 21 12 Reversed and complemented sequence Under Width Peak to Peak you can stretch your traces horizontally by selecting the Wide button Contig Chromatogram The Contig Chromatogram preferences control the display of chromatograms of sequenc
230. s in one of forward ORFs Editing bases from traces Now that you can easily locate the bases that need attention you can edit your data in either the contig editor window or the chromatogram display As you edit both windows update automatically Refer to Chapter 12 Editing Contigs for more information on editing To edit a base while viewing a chromatogram click on the current base call upper line of text you want to edit then type the new base call Figure 15 6 You can also select multiple bases for deletion by dragging the cursor across them Figure 15 6 Clicking on a base displayed in a trace Experimental Data chromatogram sequence SEE a rot e E Tac co e om Au aw d Reverting to experimental data If you have deleted too much of a sequence or have made edits you want to remove you can copy the original base calls back into the current sequence From within the chromatogram select the original bases lower line of text you want to restore Figure 15 7 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 134 Figure 15 7 Original base calls selected Go to the Sequence menu and choose Revert To Experimental Data Figure 15 8 shows the original bases restored Figure 15 8 Bases restored Revert To Experimental Data is a powerful feature It allows you to recover original data very precisely as described above or it can be used to globally revert an en
231. s to their sequences with the name indicating something about the data Assemble by Name harnesses the information from your sequence names to automate the process of selecting and naming your contigs In this chapter you will learn how to use Assemble by Name a powerful tool that lets you separately assemble and automatically name multiple contigs from a single selection and assembly command For example in just a few clicks of the mouse Assemble by Name will create ninety six appropriately named contigs from ninety six forward and reverse sequences The Assemble by Name Strategy The names of your samples frequently contain information such as your primer template clone name or sample source Sequencher provides you with tools to separate the descriptive parts of your sequence name and use these to manage the assembly These tools are the Assembly Handles and the Name Delimiters An Assembly Handle is any set of characters from your sequence name that provides information about that sequence such as the primer or clone name A Name Delimiter separates each Assembly Handle from the following one Sequence names frequently have several Assembly Handles and Name Delimiters In its simplest form a Name Delimiter may be a character such as a dash or an underscore When you use a Name Delimiter it will look like this SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 68 handle1 handle2 handle3
232. so sorted by position a faint blue line will connect the name with its fragment arrow SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 78 Figure 11 3 Names of fragments at left E Contig Overview DOR gt oam F ADARC1 ADARC2 i ADARC4 ADARCS ADARCE ADARCS ADARC T ES 1 436 Positions Of Fragments Clicking Positions of Fragments displays the numerical range positions of the sequences in the Overview If you do not wish to see this information leave the box unchecked Start amp Stop Codons You can display a start and stop codon map showing all the start and stop codons in the three reading frames of one strand Start codons are indicated by green flags and stop codons are denoted by vertical red lines To display this map check the Start amp Stop Codons box The map will be positioned at the bottom in the Overview In Figure 11 4 the three frame map of start and stop codons is located just above the diagram key To see start and stop codons in the other strand go the View menu and choose Reverse amp Comp This will reverse and complement your fragments Choosing Reverse amp Comp again will restore your contigs original orientation Figure 11 4 Start and stop codon map E Contig Overview OX oa Ip Onions Fa ADARC1 i C Hole in contig i EH Single fragment Bumps on ES Multiple fragments same direction fragments Diagram Key ES Both st
233. t CMF button A new window called Export CMF will open Choose the Export Sample and Specimen Category from the relevant drop down menus The Fragment Owner Date the Fragment Owner Time and Specimen ID fields are filled in automatically You will need to enter information into the remaining text fields There are a number of optional fields The Source Identified field is a drop down menu There are two text fields for the NCIC ID and ViCAP ID There is also a free text Comment field When you are finished click on the Export button If you have made an error you can click on the Clear All button to reset the form or you can click the Cancel button to return to the Validate MtDNA window Note If you move your cursor to pause over a field you will see a brief description summarizing the information required for that field SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 185 Figure 22 2 Export CMF report Export CMF 4 1 Export Sample mDNA Profe sl Destination ORI Ae ONE Source Lab Name LABONE Submit by User ID Done Fragment Owner User ID Inmn o Fragment Owner Date 20070307 Fragment Owner Time heen 0 Specimen Lab ORI Lem Specimen ID ABY Specimen Category Missing Person r Optional Source Identified NCIC ID ViCAP ID Comment emm Clear All Cancel Further Commands of Interest Set Circular Genome Size The Set Circular Genome Size co
234. t exemplar base position as described previously Click on the Reports button The Variance Table Reports dialog box appears see Figure 13 26 A radio button called Selected Rows will be checked Below that you will see a drop down menu called Report Format Choose either Variance Table Report or Variance Detail Report Click the Open Report button The Sequencher Report Viewer opens in a new window Use the Print Preview button from the button bar at the top of the Report Viewer to preview your report Click on the Print button to send the report to a printer SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 117 Note Not all reports are available when you print selected rows Exporting a Variance Table Report You can export the data from an entire table selected columns or selected rows The type of report you can generate will depend on this selection see Table 13 3 Table 13 3 Variance Table Report export options Variance Individual Variance Population Table Report Variance Table Detail Report Report Reports Entire Table Yes Yes No No Selected Yes Yes No No Columns Selected Yes n a No n a Rows You can remove sample sequences from your table before you export your data Select a continuous range of sequences choose the first sequence by clicking on its column header Then holding down the Shift key select the last sequence in the same fashion
235. t mutations and variants rapidly Reference Sequence properties Before you start work with the Reference Sequence you should be aware of a number of important properties SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 40 Basic Reference Sequence properties When you assemble sequences which include a Reference Sequence the base numbering of a contig will be the same as the base numbering of the Reference Sequence see Figure 7 1 Figure 7 1 Contig showing base numbering set by Reference Sequence E Numbering by Reference Sequence OX ATTCGCCCTTGTGGGTCACCGTCTATTATAGGG EE E ATTCGCCCTTGTGGGTCACCGTCTATTATQGGG GGG 4 frag bases selected at consensus position 1 Select Next Ambiguous Base spacebar When a contig is built that includes a Reference Sequence the contig will always remain in the same orientation relative to that Reference This way you can keep the contig from flipping as more fragments are added Because of this there can only be one Reference Sequence in a contig You can easily spot the Reference Sequence in a contig editor because it is highlighted along its entire length with a gray border When you assemble sequences that include a Reference Sequence you will notice that gaps in the Reference Sequence are given decimal numbers thereby preserving the absolute numbering of base positions The Reference Sequence does not contribute to the calculation o
236. t with a single unsupported sequence Then a second fragment starts to overlap with the first Next there is an area with full coverage both strands are sequenced with additional confirming fragments At the extreme right end of the contig there are just two fragments in opposite directions Overview features Bases To get from the Overview to the view used for editing click the Bases button You can also go to the View menu and choose Bases from the Bases Map Overview submenu Sequencher displays the contig editing window Summary You can press the Summary button to view a compact report of the contig You can also see this report by going to the View menu then choose the Bases Map Overview submenu and click the Summary Report option The features of the Summary Report will be dealt with in more detail later see Chapter 12 Editing Contigs for more information Sort You can sort the sequences in a contig vertically Click the Sort button to display a dialog window of options Choose an option by clicking on its radio button and then pressing OK You may also use the sorting options in the View menu under the Sort Cleanup submenu Overview options To choose the display options click the Options button or go to the View menu and click on View Options An Overview Options dialog box will appear where you can set your choices SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved
237. ta as it can find The auxiliary files must be in the same folder as the ACE file or arranged in the traditional PHRAP assembly hierarchy chromatogram files in a chromat_dir folder and PHD files in a phd_dir folder If the singlets file the qual file the PHD files and the chromatogram files all exist and are available Sequencher will import them Alternatively you can import parts of the PHRAP project by going to the File menu and choosing Import and then Sequences or by going to the File menu and clicking on Import and then Folder of Sequences e If you import the chromatogram files from the chromat_dir these fragments will have PHRED base calls and chromatogram data but no base qualities e Ifthe _ phd files are loaded and _ fasta qual files are available you will have imported all of the fragments from the PHRAP project They will have base quality and chromatogram data e The qual files alone will not load your sequences e Ifyou import PHD files from the phd_dir and the corresponding chromatogram files are available these files will import with PHRED base calls chromatogram data and base qualities SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 22 Note When using ftp to transfer your data to your local computer it is ALWAYS important to transfer these files in binary mode If you are not familiar with ftp speak with your network administrator Removing sequences and con
238. tart and stop map Notice the diagram key at the base of the window Figure 11 1 Contig overview E Contig Overview OX GI ES Set Info ADARC1 cic Deeg EJ Single fragment Diagram Key Bumps on PZ Multiple fragments same direction fragments ES Both strands show motifs 7 E Both strands plus hollow Start codon frame 1 rectangles E Stop codon frame 2 show features Selection marquee In the Overview you will notice a dashed rectangle This animated marquee sometimes called the marching ants highlights the section of the contig displayed when you switch to the Bases View You can change the location you want to view by dragging the marquee to a new location As you move the cursor over the marquee you will see it change to a hand icon You can now hold down the mouse button and drag the marquee to the desired location Checking coverage in the Overview The bar across the bottom of the overview shows various levels of sequence coverage indicated by different graphic patterns For instance a wide solid green line indicates that the sequence coverage comprises both strands A region with no color or pattern indicates a hole in the coverage The full array of patterns indicating coverage are shown at the bottom of the Overview in the figure above SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 76 In the example in the figure above the overview shows the contig starting ou
239. te Specify the template you want by clicking on a name in the drop down menu Templates are stored in a folder on your system If you click the Open Templates Folder button Sequencher will open a window displaying the templates You can store up to 1 000 templates on your system For more information about templates see Chapter 3 The Project Window Figure 21 10 New Project User Preferences User Preferences General B Settings futo Save fis Confidence Label amp Name New Project Menu Sound C Use Blank Project New Project Display Chromatogram Contig Chromatogram Contig e Use Project Template Feature Motif My Template Project sl Format Ruler My Template Project v Start St Caransa Th Open Templates Folder Input Output File Import Report Display Chromatogram Chromatogram preferences control the display of chromatograms You can specify the Height Of Chromatograms by clicking one of the buttons from Tiny to Tall If you use a black and white monitor click the Pattern button for Lane Identification By You can set lanes to be identified by color or pattern by selecting the appropriate radio button When viewing multiple sequences you will notice that a reversed and complemented sequence displays the lane colors according to the original data Click the Colors Match Current Bases checkbox to display all current base calls in the same color Click the Hide Original Base Call
240. teria For example you can choose sequences that have chromatograms or that include a particular annotation in the comments field The menus are context sensitive so not all options may be available for a particular view Most options will have a dialog box containing a number of important buttons which control the results of your search The Select All Matches button will select matches based on your search criteria If you change your criteria and search again these results will be added to your initial search If you don t want this information added you must choose Select None SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 154 A status line will indicate how many items matched out of the total number of items available You can dismiss the window by clicking on Done or by simply clicking on the Project Window or performing another action If you want to select all the items that do not contain the specified string go to Select All Items That click on All Items That command and click the Done button Then go to the Select menu and choose Invert Selection Contain a subsequence To select all the sequences and contigs in the Project Window that contain a specific string of bases you can use the Contain Subsequence command Go to the Select menu click on the All Items That command and then choose the Contain Subsequence menu item to display a dialog box Figure 18 4 Note You can do a
241. tersrsrsrsreeresestsest 50 Selecting Bases neen anena nanan a a a a a a a a 51 Finding Ambiguous Bases ss eege ENEE E EERE AE RERE R E E 51 Finding Open Reading Frames ORFE aaa A A R R E EAE RER ts 51 e EE 52 ET ae codon Maps EE 52 Looking at Restriction Mapsen esses sdvace ea casde ceased T T EET T T E TA 52 Ammotating Sequences EIERE AER 53 Customizing your SEQUENCE VIEW 253 cc25i sacddeains ataiesiecddedauenaiatspeddvarvendanaetedeanneeetaennade 54 Formatting Klerus ee ee EE 54 Mano in EE 54 DADS LAC OMe ses eRe ced ee ee OEE a es de ae ed Edel E 55 BaSe e geirinn ot iE EE EER tas stage EEE cl dear ane reg ad ureutesv RAE EE ER 55 IEN ET 33 Case and EOS E E E E E AETA EE EEEE EEEE E EEEE 55 Voice Verification EE 56 Selecting a voice Bled 56 Identifying a sp akersiciirniicciiriisiiii NEEN ai 56 Audible Keystrokes vii Arie geg ieii enia a ORAE ed aad an Me AA AA cea EE eld Reed 56 Read sequence selection EE 56 9 SEQUENCE ASSEMBLY tiseiccntisndsctacuticandivtisebisesscuuiduhddesteutddundeuedduhddeuaace 58 Th Assembly S trates ET 58 Setting the Assembly Conditions sssseeeseeessssssseereeesssssserrreeesssssstreereesssssserreeeesssssseee 59 Setting assembly parameters insonini AAE TAEAE TARR 59 Assembly EL EE 59 Assembling with Dirty Data scsssssssesesceesecencsceesescoesesosenescseveceaesccacsscessnsecsnesesevecensrssessseeeseveceavsesarecess 60 Assembling with Clean Data 60 vi Assembling with Large Gap
242. that affect the sequence fragments and those that affect the display of protein translations You can change the consensus calculation while in this view Any change you make will be reflected immediately in the report Sequencher displays the window shown in Figure 14 1 Figure 14 1 Summary report options Summary Options Compare To Consensus Reference Cancel Display Iw Icons Iw Bullets Pluses amp Dashes Reference Sequence I Matching Bases As Dashes Iw Fragment Sequences When viewing proteins show translations for V Reference Matching Proteins As Dashes Fragment Compare To Consensus or Reference Sequence If you have a Reference Sequence in your contig you can choose whether to display a Summary Report compared to the consensus sequence or to the assembled Reference Sequence The default setting is Compare To Consensus but you can change this by clicking on the Reference radio button Some of the Display options are context sensitive and will change to reflect your choice The Reference radio button will be grayed out when the selected contig does not contain a Reference Sequence Icons If you check the box for the Icons display you will see the sequence fragment icons positioned to the left of the fragment name Unchecking this option will hide the icons Note When you have a Reference Sequence in your contig its icon is distinguished by the addition of an R SEQUENCHER Us
243. the Project Window You can organize items visually in the Project Window by applying Labels to the icons Before using Labels for the first time you should customize the list of labels under the General section of User Preferences You will see a list of Label amp Name options You can also select from the default labels available under the Edit menu and Label submenu See Chapter 21 Customizing Sequencher and User Preferences for more information When viewing your project as a list you will see the name of the label in the list Labeling your icons allows you to sort or find your information easily For instance you might define red labels as cDNA data and blue labels as genomic data Clicking on the label heading in the Project Window will sort your project items by label see Figure 5 3 You can also find sequences bearing a particular label by going to the Select menu and choosing the All Items That command with the Have Labels Containing suboption SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 28 Figure 5 3 Viewing labels as a list S labels SPF S o D Assembly Parameters Auto Assemble by Name Assemble Interactively Parameters Assemble by Handle3 Dirty Data With ReAligner 3 gap placement Min Overlap 20 Hande size quality kind Tree i 4 AA 123_for 337 BPs FutoSeq Frag Interesting a DS G _123_rev 335 BPs FutoSeq Frag Interesting PS G 456_for 600
244. the column Note When comparing multiple consensus sequences the View menu s Display Base Confidences option is not relevant because confidence values exist only in fragments and not in consensus sequences Figure 13 5 Compare Consensus to Reference Variance Table D Compare Consensus to Reference K DOK Fre Description 5 consensus sequences compared to reference sequence NC_001807a 3 end Comparison Range Unfiltered Base Positions 15884 16571 of See Ce oa Ce E For more information about Assemble by Name and Reference Sequences see Chapter 10 Assemble by Name and Chapter 7 The Reference Sequence Translated Variance Table for sequences in the same contig When you are ready to begin creating your Translated Variance Table either select individual sequences from within a contig or go to the Project Window and select a contig by clicking on its icon This will select all of the contig s sequences Then go to the Sequence menu and choose Compare Translation To and either Consensus Reference Sequence or Top Sequence from the submenu The Translated Variance Table appears in a new window see Figure 13 6 The Translated Variance Table header summarizes the table s contents including a Description the Base Positions and the Amino Acid Positions If a sequence in the Table does not extend the full length of the Comparison Range its column header will be shaded in pink The pink X s mark cells th
245. the consensus at the time of its creation Future edits will not be marked SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 92 Removing sequences from a contig To remove sequences from a contig click on the name of the sequence you wish to remove from the list at the left side of the Contig Editor window or in the Overview window To remove more than one sequence at a time hold down the Shift key while you click the names Next go to the Contig menu and choose Remove Selected Sequences Sequencher will ask you if you are sure you wish to remove the sequence s Click on the Remove Fragments button to delete the sequences from the contig The sequences you just removed from the contig are moved out of the contig and back into the Project Window They will also stay selected Note This action may break a contig into multiple pieces If this is about to happen Sequencher will display a dialog box Figure 12 7 to ask if this is really what you intend to do Press the Cancel button if you do not wish to proceed Any new contigs generated in this way are named by appending an alphabetic character to the name of the original contig Figure 12 7 Warning dialog when about to make a hole in a contig Sequencher in a contig that has a hole in the middle If you proceed with this A Removing the sequence fragments that you have selected will result action the contig will be seperated into smaller cont
246. the report if the sequence has associated chromatogram data Figure 13 21 Extract from a Variance Detail Report 128 Variant 1 of 9 e Position 15 576 G T Sequence Orientation Confidence Base Primary Secondary Secondary as Call Peak Peak of Primary Gene4A_09 Reverse 47 T T n a lt 5 Gene_128 R Gene4A_09 Forward 31 T T A 18 Gene_128 F Gene4A_09 Gene_128 R Fragment base 15 576 Base 184 of 216 Y TTG AGA A C GG CG TV TT T TA TG CTA A AAD TIT TFT d A3 DADA A AA AAT AD U AT Gene4A_09 Gene_128 F Fragment base 15 576 Base 111 of 197 TE TT TG AGA AC GCG GE WG TE TT T TAT CC TA FETTO 2G KR AO OG SG SG EG EST EH UDF Uh AT SKS TS Note If the Detail Table displays a gap for a specific base position this will still be given a confidence value The confidence value will be equal to the average confidence ratings of the bases flanking the gap Sequencher will append an asterisk to the confidence value and a footnote will explain how the value was derived Population Report The Population Report is a special report which groups together all sequences sharing the same set of variants This report contains the Common Header and a report specific header The report specific header lists the number of population groups and the how many samples this consists of This header also indicates whether any samples have been dropped from the report This will only happen if a sample does not have complete SEQUENCHER
247. the trimmed and untrimmed data Below the arrows is a slider for changing the height of the trace Below the slider are four buttons that let you hide the lanes for any or all of the traces Click on the button to hide any lane Click it again to display the lane Figure 15 2 shows a contig with adjusted scales and hidden lanes Scrolling through a single chromatogram is simple Position your cursor on the small rectangular slider button to the right of the trace With the mouse button held down you can move the slider down or up depending on your current position within the sequence Alternately you can use the left arrow and right arrow keys to step through the sequence trace base by base When you are working in a contig if you highlight a base in the consensus line you can use the arrow keys to move through the consensus sequence Note that as you do so all the contig chromatograms will move left or right in step with the highlighted base Figure 15 2 Trace with hidden lanes and adjusted scales You can specify the settings for trace display by going to the Window menu and choosing User Preferences and selecting Chromatogram You can specify the height of the trace the width from peak to peak the way the lanes are identified and various screen display attributes See Chapter 21 on Customizing Sequencher and User Preferences for more information SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All r
248. three parts the header information the Comparison Range Coverage and the Translated Variance Table The Translated Variance Table contains the name of the sample sequence the variant codon the amino acid residue and the type of coverage whether the coverage is complete or incomplete for each variant in the table The Comparison Range Coverage table lists the name of each sequence and the range of bases in the comparison Working with a Translated Variance Table Report There are a number of ways to present the data from a Translated Variance Table You can export the data from an entire table selected columns or selected rows You can also remove data from the table before you export it Exporting a Translated Variance Table Report To export all the data in the table click on the Reports button The Translated Variance Table Reports dialog box appears see Figure 13 28 A radio button called Entire Table will be checked Figure 13 28 Translated Variance Table Reports dialog window Translated Variance Table Report Entire Table C C Cancel Copy as Text You will see three enabled buttons in the Translate Variance Table Reports dialog window Cancel Copy as Text and Save as Text Save as Text is the default setting If you click on the Save as Text button Sequencher presents you with a dialog to assign the name and location of your export The report is saved in a tabbed text format SEQUENCHER User
249. tig and optimizes their placement ReAligner facilitates editing in the consensus sequence and clearly displays the effect of insertions and deletions To use ReAligner select the checkbox in the Assembly Parameters dialog box In some areas of DNA analysis the standard is to gather the gaps to the right If you require this select Prefer 3 Gap Placement Note This option cannot be used with the Large Gap algorithm SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 62 Performing the assembly Automatic assembly Bring the Project Window to the front and select the sequence fragments you want to assemble Click the Assemble Automatically button or go to the Contig menu choose the Assemble Contigs command and Automatically from the submenu Sequencher compares all the selected sequences including reverse complements and assembles the best matches that fall within the chosen assembly parameters When assembly is done the advisory window appears See Figure 9 3 Click Close to go back to the Project Window Figure 9 3 Assembly done alert Assembly Completed Time Elapsed 00 00 00 Assemble by Name Off Items Selected 8 Number of Contigs 1 Comparisons Performed 82 Number of Fragments 0 Close Adding selected items to others incremental building of contigs The assembly option Add Selected Items To Others which is a special case of automatic assembly is very useful when you h
250. tigs from a project To remove items from the project select the items you want to remove Go to the Edit menu and choose Remove From Project or click the Trash Can button in the button bar Removing an item from a project is permanent so Sequencher asks whether you really want to do it Note Remember you cannot back out of this action with a simple Undo command but if you close the project without saving all changes including removed items will be forgotten The Revert to Saved Project command has the same effect Closing an editor window When you have finished entering and or editing a sequence click the close button in the top right corner of the window at the right end of the title bar see Figure 4 4 or choose Close Window from the File menu Sequencher then removes the Sequence Editor window from the screen Once you have closed the window a sequencing icon will represent your sequence Figure 4 4 Figure 4 4 Clicking the close box i Frag 0001 KPP E Dis Residue 275 Sequenced Strand Sequence Length 274 AAACTGATTA TATCCNCATG CATCTGTTCT ACCATGTTAT TTTTCCACAT GTTAAAATTT TCTGTCACAT TITCCAATAC TATTTCTTGT GGGTTAGGGT CTGTGGGTAC ACAGGCATGT GTGGCCCAGA CATTATGTAC CTCTGTGTCA TAGGCTTTAG CATCTGATGC ACAAAATAGA GTGGTGGTTG CTTCTTTCCA CACAGGTACC CCATAATAGA CGGTGACCCA CAAGGGCGAA TICTGCAGAT ATCCATCACA CTGGCGGCCG cTcq Closing a project If you have finished working with the project but not with Sequencher choose Close P
251. tion Inc All rights reserved 144 17 Motifs and Features In this chapter we describe Sequencher s ability to locate and highlight subsequences that you specify as motifs You will learn how to create edit and display features and list a sequence s features Motifs Sequencher lets you define up to 12 patterns of 50 or fewer bases as motifs Entering motifs You define new motifs by going to the Window menu and clicking on the Motif Definitions command Figure 17 1 Motif entry window Motifs Load Motifs Find Exact Ambiguous Reverse Vetchez Matches Com Only Allowed Display Style w plement MUM EU GK 399999999990 To view motifs in editors select the Display Motifs option in the dew menu The default setting shows exact matches only To show the reverse complement for a motif click the Find Reverse Complement checkbox Use the radio buttons to specify SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 145 whether you want to highlight the motif using the exact bases you entered or using ambiguous matches To specify how you would like the motif you are entering to be highlighted click on the Display Style pull down menu Figure 17 2 to choose color case and underlining while the cursor is still blinking in the entry field In the example Figure 17 2 every instance of AAAAA will be underlined and every start codon ATG will be shown in low
252. tion with No Coverage The first column of the Variance Table displays the position and the second column displays the base call of the exemplar sequence but only at positions that differ The additional columns in the Variance Table are created by the sequences or contigs you select for comparison You can view or edit the data in any version of the Variance Table by double clicking on any of the cells in that table In Review mode when you type a new base in the Variance Table you edit the underlying sequence or contig automatically SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 96 Structure of the Translated Variance Table The structure of the Translated Variance Table follows the format of the Variance Table but its display consists of an amino acid and its codon in each cell see Figure 13 3 When you see a cell in the table with an amino acid and its codon written in black typeface this indicates a difference between your exemplar and your sample You may also see some cells in the table where the amino acid and codon are displayed in light gray Amino acids in these cells match the exemplar but use either an identical or synonymous codon The numbers which appear in the first column of the Translated Variance Table refer to the position of the first base of the codon in the upper location and the position of the amino acid in the lower one Figure 13 3 The Translated Variance Tab
253. tire sequence or selection of sequences You can revert unassembled sequences to original data by selecting the sequences in the Project Window and executing the Revert To Experimental Data command Reverting assembled sequences back to experimental data If you need to revert a sequence that has been assembled in a contig select its icon from the left hand side of the Contig Editor Go to the Contig menu and use the Remove Selected Sequence command Now you can use the Revert To Experimental Data command as described above If your sequence has trace data select all of the original bases Go to the Sequence menu and choose Revert To Experimental Data Note The Extend Selection command allows you to select all of the bases in a chromatogram without scrolling SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 135 Adjusting trace positions As the four dyes migrate in different lanes on certain automated sequencers the synchronization among the four traces can deteriorate in output files Sequencher allows for adjusting or re tracking the position of one trace relative to the others When you are looking at a chromatogram specify which of the four traces you want to shift by selecting a base in the original data line the lower of the two lines of sequence characters Go to the Edit menu and choose Shift to display a submenu that lets you shift the base s line left or right by a fe
254. ton and the button bar will hide behind the title bar When the button bar is hidden you can still see the bottom of the left hand button under the title bar It will show a down pointing arrowhead Click on the arrowhead and the button bar will reappear Sequencher s menus Sequencher The Sequencher menu gives you information on the version of Sequencher you have contact information for Gene Codes Corporation and Sequencher technical support access to the User Preferences and the Quit Sequencher command File menu The commands in the File menu let you open close and save projects as well as import and export data You can also open and close windows specify page setups and print from the File menu Edit menu The commands in the Edit menu let you cut copy and paste selected data You can also duplicate fragments mark motifs reposition sequences and choose your editing mode from the Edit menu Select menu The commands in the Select menu cover a variety of items you might want to find and or highlight to help you in your analysis These commands use a wide variety of criteria to help you locate items from sequence fragments subsequences and individual bases SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 5 Contig menu The commands in the Contig menu facilitate most of your work in contig editing You can set assembly parameters and consensus mode assemble dissolve and re
255. tures go to the Sequence menu and choose Feature Listing This will display a text list of each feature giving its name Feature Key range style and any Feature Qualifiers You can print out this listing for your records Note If you have imported a GenBank sequence and Sequencher does not recognize any Feature Keys this list may differ SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 150 18 Finding items In this chapter you will learn how to find items according to specific criteria It also discusses how to refine your subsequence search and search for and find particular items in the Project Window This chapter also discusses how to find open windows and editors Finding the Project Window You can always bring the Project Window to the front by going to the Window menu and choosing Project Window Finding open windows with menu commands Sequencher has a number of commands in the Window menu for finding open windows hidden behind other windows Project Window Variance Table Translated Variance Table Chromatograms Contig Editors and Sequence Editors Choose the appropriate command to bring the window you want to the front of your screen If one or another of the commands is dimmed you do not currently have that editor open If you have several windows of a type open these will be listed as submenus of that type Finding open windows in the Project View If you are looking at a Pr
256. u save the new project a new project icon appears on your disk Reverting to the saved version of the project If you are working in a project and want to cancel all the changes you have made since you last saved it go to the File menu and choose Revert To Saved Project Sequencher displays a dialog box that tells you when your project was last saved If you click the Yes Revert to Saved Version button you will lose the most recent changes Click on the Cancel button to cancel the command Note Remember if you click the Yes Revert to Saved Version button you cannot undo this action SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 29 Closing the Project Window When you have finished working on an open project and want to work on another you must first close the open project You can either close the project by going to the File menu and clicking on Close Project or you can close the Project Window by clicking on the close box In either case Sequencher will prompt you to save your work if you have made any changes since your last save To continue working in Sequencher either go to the File menu to create a new project by choosing New Project or click on Open Project to work on a previously saved file Note If you want to save the data you have entered you must save the project Saving a project is different from recording a sequence as experimental data Only saving the project actuall
257. u can display a list of sequences in a single contig in the list view by clicking on the plus sign to left of the contig icon You can see a list of sequences in all your contigs by going to the View menu and choosing Expand All Collecting sequences in refrigerators Sequencher lets you create refrigerators in the Project Window to hold certain subsets of your project Refrigerators hold only raw sequences not contigs or other refrigerators To put away a few sequences in cold storage select the sequences and then go to the Edit menu and choose Refrigerate Sequencher collects the sequences and puts them in the refrigerator To remove sequences from the refrigerator open the refrigerator by double clicking on it You will see a list of items Select the items you want to remove and then click the button called Move Selected Items To Project Window to execute the command SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 25 Selecting sequences Selecting sequences and contigs To select sequences you can go to the Select menu and use the Select All Select None and Invert Selection commands Invert Selection will give you the opposite of your previous selection There are also selection functions that are increasingly specific You can choose data based on All Items That and then further specify the data by choosing narrower classes such as Contain Subsequence Contain Items N
258. u want to assign a GenBank feature key make a selection from the drop down menu called Feature Key To give the feature a name type your text into the Feature Name box You can use the default display style or you can assign a Feature Color and Feature Style such as inverted case or underlined from the dialog box You can also use the Display radio buttons to determine whether single stranded DNA is displayed or RNA protein translation or the complement will be shown For more information see Chapter 17 Motifs and Features Customizing your sequence view Formatting Ruler Formatting can be controlled with a ruler shown in Figure 8 12 that lets you adjust several display parameters Click the Ruler button in the Sequence Editor or contig Summary Window Alternately you can go the View menu and choose Display Format Ruler Figure 8 12 The formatting ruler imal SIC E BIE N IIE Font DI Uri Li gp 25 E SG 1 13 15 Margins You can drag the margin triangles on this ruler to set the width of your sequence Figure 8 13 shows margins that are limited to 42 bases per line Figure 8 13 Sequence Editor with ruler and translations E p53 genomic SEE Residue 9424 Sequenced Strand Sequence Length 20303 lbs ID lS EIEN a WIG Font DI Mra woo 13 A LS IB i KE CAA ACA CTT ACA GTA GGC ATG TTT CTT AGC AAA TCT GAT GAC AAA TTT GGC ATA AAG AAA GAG AGC ATC CCT GAA AAA AAA AAA AAG AAA F G I K K E 5 I P E K K K K
259. ual Variance Reports Variance Detail Report Use the Print Preview button from the button bar at the top of the Report Viewer to preview your report see Figure 13 24 Figure 13 24 Reports Viewer button bar Print Print Preview Save as PDF Page Setup Close The Report View is replaced by the Preview View Note the Zoom In and Zoom Out buttons in the button bar see Figure 13 25 Click the Report View button to return to the previous view Figure 13 25 Print Preview button bar Print Report view Save as PDF Page Setup Zoom In Zoom Out Close Use the Page Setup button to choose paper size Click on the Print button to send the report to a printer You can also save the report in PDF format by clicking the Save as PDF button If you wish to dismiss the window without performing any of these actions click on the Close button Printing selected data from a Variance Table You can print selected data from your table You have several options You can remove sample sequences from the table using the Remove from Table command You can print selected variants by choosing selected rows and you can print selected samples by choosing selected columns You may already have restricted the data in the table by using the Comparison Range command SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 116 If you want to remove sample sequences from your table select a continuous ran
260. uence in Find checkbox Close the dialog box by pressing Done Get Info Summary information about the open contig can be obtained by pressing the Get Info button or go to the File menu and click on the Get Info command The Bases View Working in the Bases View To get from the Overview to the view used for editing a contig click the Bases button Alternatively you can go to the View menu and then go to the Bases Map Overview submenu and choose Bases Sequencher displays the contig editing window shown in Figure 11 5 SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 80 Figure 11 5 The contig editing window E Contig Editor D Ga Sumas f cuwan WR Stow Cronstoganelf Reale Bor highlight base call disagreements Gu highlight ambiguities Select Next Ambiguous Base spacebar To return to the overview click the Overview button or go to the Bases Map Overview submenu and choose Overview The labels in Figure 11 6 identify the different parts of the contig editor Figure 11 6 Identifying parts of the contig editor E Contig Editor D Pa Ern Cutan RR Fatze Den D Ee ATATAATCAGTTTATGGGATCAAAGTCTAAAG Hee highligh base call spacebar Table 11 1 Key to the Contig Editor Label Name Function A Button Bar Contains several buttons you click to perform frequently used functions B Sequence Shows which sequences make up the contig you are editing list c
261. uence that has been incorporated into a contig it is locked A padlock icon shown to the left of the word Residue in the information bar above the sequence in Figure 8 1 shows that the file is locked Figure 8 1 A locked editor E p53 genomic Eek es fa Residue 1 Sequenced Strand Sequence Length 20303 ITTCCCATCAA GCCCTAGGGC TCCTCGTGGC TGCTGGGAGT TGTAGTCTGA ACGCTTCTAT CTTGGCGAGA AGCGCCTACG CTCCCCCTAC CGAGTCCCGC GGTAATTCTT AAAGCACCTG CACCGCCCCC CCGCCGCCTG CAGAGGGCGC AGCAGGTCTT GCACCTCTTC TGCATCTCAT TCTCCAGGCT TCAGACCTGT CTCCCTCATT CAAAAAATAT TTATTATCGA GCTCTTACTT GCTACCCAGC ACTGATATAG GCACTCAGGA ATACAACAAT GAATAAGATA SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 45 A locked editor is read only You cannot directly edit data you can only view it However if the sequence fragment has been incorporated into a contig you can use the contig editor to make changes See Chapter 12 Editing Contigs Basic sequence editing The Sequence Editor shown in Figure 8 2 is a text processor that is specially optimized to work with DNA sequences Figure 8 2 The Sequence Editor Frag 0001 DER E De Residue 40 Sequenced Strand Sequence Length 39 CTGACTGACT GCTGACTGAC TGCGCGTACG TACGTAGC The default system for coding in Sequencher is the UPAC IUB coding system You can define your own coding system for bases and ambiguities See Chapter 21 Customizing Sequenc
262. uencher Help or press F1 Sequencher Help is organized into two main areas application screens and the Sequencher User Manual The Sequencher application screens contain context sensitive links to relevant sections of Sequencher Help Note You can leave the Help window open and return to Sequencher by clicking on an open window SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 3 Navigating Sequencher Help The Sequencher Help window consists of three main parts There is a button bar at the top of the window There is also a navigation pane on the left side of the window and a content pane on the right side of the window The button bar has Back and Forward icons which function like a web browser s Back and Forward buttons If you click the Home button you will return to the Welcome screen You can also type keywords into the Search entry text box The navigation pane displays a Table of Contents Click on the relevant icon to display any subtopics Click on a page link icon in the navigation pane to display the Sequencher Help article in the content pane Some of these pages contain underlined hyperlinks to related topics in the manual and a hyperlink to the following topic in the Table of Contents You can also get information on specific topics such as Assembly Parameters or the Sequence Editor from context sensitive screens To search for a specific topic type the topic name into the se
263. uity Sequencher may change an ambiguous base call to an A C G or T if it does not meet the secondary peak criteria If you prefer it to remain ambiguous do not select this option If you wish to search just a range of bases highlight them and check the Search Selection Only checkbox For more information see Chapter 15 Chromatograms SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 188 23 Appendix Keyboard shortcuts Menu Window Command Windows Help F1 Open window or Open project set in Ctri O User Preferences Close window Ctri W Get Info Ctrl Save Project Ctr S Print Ctrl P Exit Ctri Q Undo Ctrl Z Cut Selection Ctrl X Copy Selection Ctrl C Paste Ctrl V Edit Comments Ctrl Select menu Select All Ctri A Select None Ctri Find bases Ctrl F Find bases again Ctrl G Contig column Ctrl K Select Next Ambiguous base Ctri N Select Next Contig Disagree Ctrl D Select Next Edited Base Ctrl E SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 189 Menu Window iew submenu Overview Options Command Select Next Low Confidence Base Select Next Met to Stop Mark Selection As Feature Display Format Ruler Reverse amp Comp Sequenced Strand Display 1 frame translation Display 2 frame translation Display 3 frame translation Single stranded Double stranded Chromatogram User P
264. unds if the original data included base quality scores For information on how to change the threshold values see the section User Preferences in this chapter You may need to turn off Colors As Backgrounds Display Features Sequencher can display features you have entered Go to the View menu and click on Display Features to toggle this view on and off For each individual feature you must also enable or disable its display in User Preferences See also Chapter 17 Motifs and Features for more information Display Motifs Sequencher can display motifs you have entered Go to the View menu and click on Display Motifs to toggle this view on and off See Chapter 17 Motifs and Features for more information Setting up custom codes Standard coding system The IUPAC IUB ambiguity set is the most common coding system for representing DNA Some commercial and academic software packages also use the Stanford Extended System codes which include one character representations for maybe A probably G etc Sequencher defaults to the UPAC IUB system but includes a copy of the Stanford extended set See Appendix 25 and Appendix 0 for tables of the IUPAC IUB and Stanford Extended code systems Configuring Ambiguity Codes If you wish to configure Sequencher s ambiguity coding system yourself go to the Window menu and choose Ambiguity Key Codes Sequencher displays the dialog box shown in Figure 21 1 SEQUENCH
265. uppercase to lowercase text To see the options for displaying edits go to the View menu and click on the Base Edits As submenu to choose from Not Highlighted Bold Magenta or bOLD and cASE cHANGE Collect Gaps The Collect Gaps command allows you to select a range in a fragment or contig consensus and move all of the gaps within this selection to either the 5 or the 3 end of the selection If you wish to collect the gaps within a single fragment you must select the range within that fragment Go to the Sequence menu and choose the Collect Gaps command To SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 87 move the gaps to the 5 end select Left from the submenu To move gaps to the 3 end of the selection choose Right from the submenu If you wish to collect the gaps within a region of a single contig you must select the range from the consensus line Go to the Sequence menu and choose the Collect Gaps command To move the gaps to the 5 end select Left from the submenu To move gaps to the 3 end of the selection choose Right from the submenu Note The Collect Gaps command may have the effect of increasing the number of ambiguities Check the consensus line to see if the number of ambiguities has increased or decreased You can go to the Edit menu and use the Undo command to reverse the Collect Gaps command Moving Bases and Gaps You can move a selection within a sing
266. ut Map Inset Like the full page cut map the window restriction map is linked to the Sequence Editor If you click between two cut sites Sequencher highlights the resulting restriction fragment as shown in Figure 8 11 If you click on the name of an enzyme Sequencher highlights the recognition sites Figure 8 11 Marking restriction and recognition sites E p53 genomic DER g Residue 10382 to 10943 D Low Confidence 0 0 0 Ambiguous 0 0 Selection Length 562 Bsp106 GO Bert 20 Drall 22 Annotating Sequences You can assign a standardized GenBank Feature Key in addition to personal feature annotations to describe subsequences Select the range of bases you wish to annotate then choose Mark Selection As Feature from the Sequence menu SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 53 If you want to assign a personal annotation select Sequencher from the drop down menu called Feature Key To give the feature a name type your text into the Feature Name box You can use the default display style or you can assign a Feature Color and Feature Style such as inverted case or underlined from the dialog box You can also use the Display radio buttons to determine whether single stranded DNA is displayed or RNA protein translation or the complement will be shown For more information see Chapter 17 Motifs and Features If yo
267. utton A dialog window to assign the name and location of your export appears You can type a name for the table in this window Click the Save button to save your table as a tabbed text file Although the Selected Rows radio button is checked you can change your selection to include all rows by clicking the Entire Table radio button see Figure 13 31 Figure 13 31 Translated Variance Table Reports dialog window Selected Rows Translated Yariance Table Report C Entire Table ei Selected Rows Cancel Copy as Tex SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 122 You can copy your Translated Variance Table into another document or application that accepts tabbed text To paste the contents of your Translated Variance Table Report click on the Copy as Text button Removing data from a table before export If you need to reduce the amount of data in your table you can remove selected sample sequences Make your selection by using Shift click for a continuous range or Ctrl click for a discontinuous range Go to the Edit menu and choose the Remove From Table command Your selection will be removed You can then export the remaining data as usual Note This command does not remove sequences from the underlying contig SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 123 14 The Summary Report In this chapter we explain how to us
268. vely forensic Further note that this type of editing does not change Reference Sequences Split After Selection You can split a sequence fragment into two pieces Put the cursor where you want to split the sequence Go to the Sequence menu and choose Split After Selection Sequencher will ask you if you want to split the sequence after the last currently selected base Click on the Split button to proceed Sequencher will then split the sequence in two and label the new fragments for you You can also apply the Split After Selection command to an entire contig by simply making your selection in the consensus sequence and then choosing Split After Selection from the Sequence menu Auto Match Sometimes you may wish to edit a number of sequences simultaneously perhaps to match a high quality segment of sequence Sequencher allows you to perform this type of mass edit on a range of bases using the Auto Match command SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 91 First select the base or range of bases to which the other sequences should match then go to the Edit menu and choose Auto Match This command will edit and highlight every overlapping base so that it will agree with the selected sequence To reverse this procedure go to the Edit menu and click on Undo Create new sequence from a consensus You can create a new sequence from the consensus sequence by going to the Cont
269. very time you select the Show Chromatograms button Sequencher will open the windows in your layout positions Figure 21 4 Remember Window Layout Show Ambiguity Helper Chromatogram Ctrl T Ambiguity Key Codes Genetic Code Motif Definitions Specify Restriction Enzymes Specify Vector Insertion Sites Project Window Bases Variance Table Remember Window Layout User Preferences Ctrl U You can also use the Remember Window Layout command to define default positions for the Contig Editor and Contig Chromatogram and Variance Table or Translated Variance Table when you are in the Review mode First you must organize the windows Then go to the Window menu and choose the command called Remember Window Layout Select the submenu option called Single Variance Table Review if you have one Variance Table open and Double Variance Table Review if both are open Every SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 171 time you select the Review button from the Variance Table Sequencher will open the windows in your layout positions User Preferences You can specify your preferences for many program settings in Sequencher To set these go to the Window menu and choose User Preferences The User Preferences window contains a hierarchical list of preference settings The main categories of settings are General Display and Input Output Click the triangle sign next to a cate
270. w Feature Jv Display Feature in Editors Feature Location Fag km Which strand Complement Single Strand Only Complement g Add Remove RNA v Protein Translation To alter default styles go to the Featur None Blue v cyan Green Magenta Red Yellow Invert Case Underline Editing an existing feature To edit an existing feature go to the Sequence menu and choose Edit Features To display the Feature Editor click on the name of the feature in the feature list Use the Display Style pull down menu to specify new display attributes You can also change the Feature Key edit the Feature Name and the Feature Location For example you could change a Sequencher annotation into a GenBank feature If you want to remove a feature select its name from the list and click on the Remove button When you have finished entering or editing your features click Done The features will appear in the Sequence Editor if you have checked the Display Features in Editors checkbox and Display Features in the View menu Quick feature creation To create a feature quickly select a region in the sequence or contig editor and then go to the Sequence menu and choose Mark Selection As Feature Sequencher displays a dialog box called Mark Selection As Feature that lets you assign a Feature Key name the feature and specify display attributes Figure 17 7 When you have finished assigning the feature attributes
271. w pixels The adjustments made are to aid you in reviewing the original data they will not be saved to the file when you save the project SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 136 16 Restriction Maps In this chapter we discuss restriction maps and how to work with them We discuss choosing Restriction Map display options selecting and changing selected enzymes saving new or modified enzymes and copying the map Displaying a Restriction Map You can view a Restriction Map for a sequence or a contig by selecting its icon in the Project Window and opening an editor Go to the button bar of the editor and click on the Cut Map button Alternately from the editor go the View menu and choose Bases Map Overview then choose Restriction Map from the submenu Figure 16 1 shows a restriction map display The number in parentheses shows the cut location Figure 16 1 A single line restriction map ae 425 Drall 44 K pal 431 Drall 704 S 45 Gaattt 117 Drall 476 b 781 Apal 48 k 123 Es 234 We 328 me 496 fe 631 Apal 707 fe 800 fe 1 029 Mapping all cutsites In this type of restriction map clicking the name of an enzyme highlights all the locations where that enzyme cuts Restriction Map display options How to select enzymes To change the restriction map display click the Options button in the button bar or go the View menu
272. which uses portions of sequences s name to determine against which other sequences it will be compared against to construct a contig Program that analyses fluorescent trace intensities from automated sequencer and assigns bases together with a probability of error The tool region in a Sequencher window The button bar is located just below the title bar A format for describing sequence assemblies A control that lets you toggle a parameter Click the box to turn it on or off When a checkbox is active it is displayed with an X in the box Press and immediately release the mouse button A number associated with each base call and which defines the likelihood that a base call is incorrect The most common scale is from 1 60 where 60 represents a 1 10 chance of a wrong call and 20 represents a 1 10 chance See also Quality score 206 Name Contig Cursor Dialog box Delimiter Double click Drag Elevator button Enzyme Exemplar Field Handle Hierarchical menu Modal dialog box Motif Mouse MSF SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved Definition Contiguous alignment of a set of sequences to make up the sequence of bases from a longer piece of DNA The indicator on a computer screen that shows the currently active location Cursors have various shapes indicating different purposes and capabilities Any of several sp
273. x Stanford Extended Code Code Base s Meaning A A Adenine C C Cytosine G G Guanine T T Thymine U U Uracil RNA P AorG Purine Q CorT or U Pyrimidine K Gor T or U Keto J AorC M GorC L Aor T or U B C G T or U not A D G A T or U not C H A C T or U not G V A C orG not T SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 194 Code Base s Meaning 5 A or Another Base Probably A 3 C or Another Base Probably C 6 G or Another Base Probably G 4 T or Another Base Probably T 9 A or Nothing Maybe A 7 C or Nothing Maybe C 0 G or Nothing Maybe G 8 T or Nothing Maybe T N A C G T or U Any SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 195 27 Appendix Default Feature Styles The following table displays the defaults settings for the colors and styles of new features New features are those you create in an existing sequence or those described in an imported sequences with a GenBank style Feature Table You can change the default settings in the Feature Motif preference pane For more information refer to Chapter 21 Customizing Sequencher and User Preferences Feature Display Color Style Second Strand A misc_feature 1 misc_difference y Red a conflict y Red b unsure y Red c old_sequence y Red d variation y Red Underlined e modified_base y Red f mutation y Red g allele y
274. y clicking anywhere in the Comments box and typing the information To store more information than the Comments box allows go to the Edit menu and choose Edit Comments Sequencher opens a small text editing window that stores about 250 characters of text for each sequence After you have added text with Edits Comments a small I for Info appears in the upper left corner of the icon If you want to remove the information in the Comments box select the text and then go to the Edit menu and choose Contents from the Clear option The Project Window button bar The button bar contains buttons for frequently used commands It appears just below the top of the Project Window You can use the smaller left hand button which has an up arrow on it to hide the button bar Click on the button and the button bar will hide behind the title bar When the button bar is hidden you can still see the bottom of the left hand button under the title bar It will show a down pointing arrowhead Click on the arrowhead and the button bar will reappear If you click on the Assembly Parameters button this will display the Assembly Parameters window Clicking on the AbN button will enable the Assemble by Name function Assemble Automatically Assemble Interactively and Assemble to Reference are used to assemble sequences you have selected If AbN is enabled then the Assemble Automatically and Assemble to Reference buttons will have different labe
275. y necessary to check and edit your contigs Sequencher provides a powerful interface between the user and the data so that you can analyze and edit sequences and contigs according to your own specifications You can move directly from one ambiguity or disagreement to the next to analyze and override base calling errors eliminating discrepancies and ambiguities Sequencher allows you to choose from a number of consensus calculations and continually recalculates the consensus of the contig as you edit the sequences Overview of the contig The contig overview window When you first open a contig by double clicking on it you will see an overview that shows how the sequences have been assembled Figure 11 1 There is also a button bar at the top of the overview window The schematic provides you with a wealth of useful information enabling you to assess and manage your data A horizontal line represents each fragment in the contig The line will be green and solid if the fragment is in the forward orientation and red and dotted if SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 75 it is in the reverse orientation These orientations may change as more data is added to the contig unless you have used a Reference Sequence in your assembly In this case the Reference Sequence determines the contig orientation Below the fragments you will see the coverage map and below the coverage map you will see the s
276. y of related genes with alternative splicing Assembling with Dirty Data Go to the Assembly Parameters dialog box and select the Dirty Data radio button There are two sliders for changing minimum match values To set the proportion of bases that have to match use the Minimum Match Percentage slider To set the minimum number of bases that must overlap use the Minimum Overlap slider Change the settings by positioning your cursor on the slider holding the mouse button down and dragging the slider left or right The value set will automatically update as you move the slider For example if you attempt to assemble a 17 base sequence that matches perfectly to a selected 1000 base sequence with the minimum overlap set at 20 bases Sequencher s default minimum overlap the two fragments will not assemble You must first reduce minimum overlap to 17 bases or lower and then assemble the fragments Assembling with Clean Data When you select the Clean Data algorithm an additional option called Maximum Loop Out Size is available This has an elevator button for adjusting size value Figure 9 2 below This parameter is the largest number of consecutively mismatched bases that are acceptable in a potential overlap Change the setting by positioning your cursor on the appropriate arrowhead and holding the mouse button down The maximum value which can be set is 6 To set the proportion of bases that have to match use the Minimum Match Percentage
277. y stores data on the disk Exporting data You can save your work in a file format other than Sequencher You can export the entire file or a subset of the project Click on the File menu and choose an Export submenu option A dialog box will allow you to specify the file format in which you want to export your selection Figure 5 4 For example if you choose Selection as Subproject you can specify CAF format for which you can set a number of options Fasta concatenated and older versions of Sequencher This is especially helpful if you want to share data with colleagues working with older versions of Sequencher Note It is important to remember that any version specific features you may have used will be lost if you save to an earlier version of Sequencher See Chapter 19 Exporting Data for more information Figure 5 4 Export sequences format menu Export Sequences Export Location C Documents and Browse Settings Administrator Desktop _Browse Forma gesi v Options Iw Use default names Cancel Exiting the program When you have finished working with Sequencher make sure you have saved your project Then go to the Sequencher menu and choose Quit Sequencher SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 30 6 Preparing Your Data for Assembly In this chapter we introduce you to Sequencher s powerful tools for trimming poor quality data or vector contamination from
278. y to use interface Contextual menus Most windows in Sequencher offer contextual menus You can invoke these by clicking within the window you are reading while holding down the right hand mouse button If you select one or more items and then click the right hand mouse button you will see other menus Note If you have a mouse with only one button you may be able to mimic the effect of a right hand button with the use of a modifier key SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 6 Keyboard shortcuts Sequencher has a number of keyboard shortcuts for major menu items and other important features and allows you to define some of your own See Appendix 23 Keyboard Shortcuts for a complete listing See the Chapter 21 on Customizing Sequencher and User Preferences for more information about how to define your keyboard shortcuts SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 3 The Project Window In this chapter we will introduce you to Sequencher s Project Window We will also show you how to get started with Sequencher by creating a new project or opening an existing one This chapter also describes how you view the constituent sequences or contigs in your project and how to annotate information in your project for later editing and record keeping Sequencher concepts About the Project Window The concept of the project is ce
279. your desktop you will open a new Project Window containing your sequences If Sequencher is already open the sequences will be imported into the open project Note Sequencher always shows a newly imported file as selected so if you do not want to use it right away de select it by clicking elsewhere in the project Copy and paste One of the easiest ways to move data between Sequencher and other programs is to use the clipboard s Copy and Paste commands First launch both Sequencher and the other program Then select and copy the bases of interest in the other program Go to the Sequence menu and choose Create New Sequence Type a name for your sequence into the Sequence s name box and then click the OK button Now go to the Edit menu and choose Paste When you close the window you will see a dialog box with two buttons If you have finished entering your data click the button Record As Experimental Data Otherwise click the button Not Yet Finished Sequencher then removes the window SEQUENCHER User Manual 1991 2011 Gene Codes Corporation Inc All rights reserved 17 How to import using menu commands Importing sequences Sequencher imports sequences in a variety of text and chromatogram formats Some of the most popular are ABI MegaBase CEQ SCF GenBank EMBL DDBJ and plain text files To import a sequence first bring the Project Window to the front by clicking on it Under File menu choose Import Choose Sequenc
Download Pdf Manuals
Related Search
Related Contents
Ksix B0713SC01 screen protector SQL Query for PostgreSQL STRIPE OVERNIGHT Advanced satellite SP - SA series Manual de servicio 医政研発第03 300 ー 8号" 平成ー 9年3月 3 0 日 Samsung SyncMaster 460MX-2 Tenda D301 router Optimus SYSTEM 748 User's Manual タバコは女性の天敵!! 労働安全衛生 マネジメントシステムにおける 産業保健スタッフの役割 Copyright © All rights reserved.
Failed to retrieve file