Bioinformatics for your classroom
![Page 1: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/1.jpg)
Bioinformatics for your classroom
Seth Bordenstein
Department of Biological Sciences
Vanderbilt University
NCBI
BLAST
![Page 2: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/2.jpg)
Advantages
1. No programming skills needed
2. Requires only familiarity with a personal computer and internet browser
3. Customizable and free
![Page 3: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/3.jpg)
Bioinformatics is like using ‘Google’ for DNA sequences
![Page 4: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/4.jpg)
National Center for Biotechnology Information (NCBI)
http://www.ncbi.nlm.nih.gov
![Page 5: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/5.jpg)
Growth of NCBI - GenBank
[Figure: sequence records (millions) and total base pairs (billions) by year, ’83 to ’06]
Release 148: 45.2 million records, 49.4 billion nucleotides
Average doubling time ≈ 14 months
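To give students a feel for what a 14-month doubling time means, the short sketch below projects the Release 148 record count forward. It assumes growth stays perfectly exponential at the quoted rate, which is a simplification for classroom arithmetic rather than a forecast; the function name `projected_size` is made up for this example.

```python
def projected_size(current, months_ahead, doubling_months=14.0):
    """Project a quantity forward, assuming it doubles every `doubling_months`."""
    return current * 2 ** (months_ahead / doubling_months)

# Figures from the slide: Release 148 held 45.2 million records.
records_release_148 = 45.2e6

# Assumption: the ~14-month doubling time continues unchanged.
for months in (14, 28, 42):
    projected = projected_size(records_release_148, months)
    print(f"+{months} months: about {projected / 1e6:.1f} million records")
```

A useful discussion question is whether any real database can keep doubling on this schedule, and what would slow it down.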
![Page 6: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/6.jpg)
DNA → RNA → protein → phenotype
Databases: DNA sequences and genomes; cDNA and ESTs; protein sequence databases
Bioinformatics is NOT just information technology. It can teach the central dogma of molecular biology.
![Page 7: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/7.jpg)
Target database: Adjustable using the pull-down menu
![Page 8: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/8.jpg)
![Page 9: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/9.jpg)
![Page 10: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/10.jpg)
![Page 11: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/11.jpg)
A Traditional GenBank Record
LOCUS       AY182241 1931 bp mRNA linear PLN 04-MAY-2004
DEFINITION  Malus x domestica (E,E)-alpha-farnesene synthase (AFS1) mRNA, complete cds.
ACCESSION   AY182241
VERSION     AY182241.2 GI:32265057
KEYWORDS    .
SOURCE      Malus x domestica (cultivated apple)
  ORGANISM  Malus x domestica
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; rosids; eurosids I; Rosales; Rosaceae; Maloideae; Malus.
REFERENCE   1 (bases 1 to 1931)
  AUTHORS   Pechous,S.W. and Whitaker,B.D.
  TITLE     Cloning and functional expression of an (E,E)-alpha-farnesene synthase cDNA from peel tissue of apple fruit
  JOURNAL   Planta 219, 84-94 (2004)
REFERENCE   2 (bases 1 to 1931)
  AUTHORS   Pechous,S.W. and Whitaker,B.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-NOV-2002) PSI-Produce Quality and Safety Lab, USDA-ARS, 10300 Baltimore Ave. Bldg. 002, Rm. 205, Beltsville, MD 20705, USA
REFERENCE   3 (bases 1 to 1931)
  AUTHORS   Pechous,S.W. and Whitaker,B.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (25-JUN-2003) PSI-Produce Quality and Safety Lab, USDA-ARS, 10300 Baltimore Ave. Bldg. 002, Rm. 205, Beltsville, MD 20705, USA
  REMARK    Sequence update by submitter
COMMENT     On Jun 26, 2003 this sequence version replaced gi:27804758.
FEATURES             Location/Qualifiers
     source          1..1931
                     /organism="Malus x domestica"
                     /mol_type="mRNA"
                     /cultivar="'Law Rome'"
                     /db_xref="taxon:3750"
                     /tissue_type="peel"
     gene            1..1931
                     /gene="AFS1"
     CDS             54..1784
                     /gene="AFS1"
                     /note="terpene synthase"
                     /codon_start=1
                     /product="(E,E)-alpha-farnesene synthase"
                     /protein_id="AAO22848.2"
                     /db_xref="GI:32265058"
                     /translation="MEFRVHLQADNEQKIFQNQMKPEPEASYLINQRRSANYKPNIWK
                     NDFLDQSLISKYDGDEYRKLSEKLIEEVKIYISAETMDLVAKLELIDSVRKLGLANLF
                     EKEIKEALDSIAAIESDNLGTRDDLYGTALHFKILRQHGYKVSQDIFGRFMDEKGTLE
                     DFLHKNEDLLYNISLIVRLNNDLGTSAAEQERGDSPSSIVCYMREVNASEETARKNIK
                     GMIDNAWKKVNGKCFTTNQVPFLSSFMNNATNMARVAHSLYKDGDGFGDQEKGPRTHI
                     LSLLFQPLVN"
ORIGIN
        1 ttcttgtatc ccaaacatct cgagcttctt gtacaccaaa ttaggtattc actatggaat
       61 tcagagttca cttgcaagct gataatgagc agaaaatttt tcaaaaccag atgaaacccg
      121 aacctgaagc ctcttacttg attaatcaaa gacggtctgc aaattacaag ccaaatattt
      181 ggaagaacga tttcctagat caatctctta tcagcaaata cgatggagat gagtatcgga
      241 agctgtctga gaagttaata gaagaagtta agatttatat atctgctgaa acaatggatt
//
The Flatfile Format
Sections: Header, Feature Table, Sequence
![Page 12: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/12.jpg)
The Header
![Page 13: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/13.jpg)
Header: Locus Line
LOCUS AY182241 1931 bp mRNA linear PLN 04-MAY-2004
Locus name: AY182241
Length: 1931 bp
Molecule type: mRNA
Division: PLN
Modification date: 04-MAY-2004
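For classes that do want a small taste of scripting, the fields of the locus line can be pulled apart with a few lines of Python. This is a simplified sketch that splits on whitespace and assumes the exact eight-token layout shown on the slide; real GenBank LOCUS lines follow a column-based layout and can differ, and the function name `parse_locus_line` is made up for this example.

```python
def parse_locus_line(line):
    """Split a LOCUS line into named fields (simplified, whitespace-based)."""
    fields = line.split()
    # Assumption: exactly the layout seen on the slide,
    # LOCUS <name> <length> bp <molecule> <topology> <division> <date>
    if len(fields) != 8 or fields[0] != "LOCUS" or fields[3] != "bp":
        raise ValueError("unexpected LOCUS layout")
    return {
        "locus_name": fields[1],
        "length_bp": int(fields[2]),
        "molecule_type": fields[4],
        "topology": fields[5],
        "division": fields[6],
        "modification_date": fields[7],
    }

record = parse_locus_line("LOCUS AY182241 1931 bp mRNA linear PLN 04-MAY-2004")
for name, value in record.items():
    print(f"{name}: {value}")
```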
![Page 14: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/14.jpg)
Header: Database Identifiers
ACCESSION AY182241
VERSION AY182241.2 GI:32265057
Accession
• Stable
• Reportable
• Universal
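The VERSION line pairs the stable accession with a numeric suffix; in this record the COMMENT field notes that the current sequence version replaced an earlier one. A minimal sketch of separating the two parts follows. The helper name `split_version` is hypothetical, and the code assumes the identifier has the simple `ACCESSION.N` shape seen here.

```python
def split_version(identifier):
    """Split 'ACCESSION.VERSION' into (accession, version number)."""
    accession, dot, version = identifier.partition(".")
    # Assumption: a single dot followed by digits, as in AY182241.2
    if not dot or not version.isdigit():
        raise ValueError(f"not an accession.version identifier: {identifier!r}")
    return accession, int(version)

accession, version = split_version("AY182241.2")
print(f"accession {accession}, sequence version {version}")
```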
![Page 15: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/15.jpg)
Header: Organism
SOURCE Malus x domestica (cultivated apple)
ORGANISM Malus x domestica
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; rosids; eurosids I; Rosales; Rosaceae; Maloideae; Malus.
NCBI-controlled taxonomy
![Page 16: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/16.jpg)
The Feature Table
FEATURES             Location/Qualifiers
     source          1..1931
                     /organism="Malus x domestica"
                     /mol_type="mRNA"
                     /cultivar="'Law Rome'"
                     /db_xref="taxon:3750"
                     /tissue_type="peel"
     gene            1..1931
                     /gene="AFS1"
     CDS             54..1784
                     /gene="AFS1"
                     /note="terpene synthase"
                     /codon_start=1
                     /product="(E,E)-alpha-farnesene synthase"
                     /protein_id="AAO22848.2"
                     /db_xref="GI:32265058"
                     /translation="MEFRVHLQADNEQKIFQNQMKPEPEASYLINQRRSANYKPNIWK
                     NDFLDQSLISKYDGDEYRKLSEKLIEEVKIYISAETMDLVAKLELIDSVRKLGLANLF
                     EKEIKEALDSIAAIESDNLGTRDDLYGTALHFKILRQHGYKVSQDIFGRFMDEKGTLE
                     NHHFAHLKGMLELFEASNLGFEGEDILDEAKASLTLALRDSGHICYPDSNLSRDVVHS
                     LELPSHRRVQWFDVKWQINAYEKDICRVNATLLELAKLNFNVVQAQLQKNLREASRWW
                     ANLGIADNLKFARDRLVECFACAVGVAFEPEHSSFRICLTKVINLVLIIDDVYDIYGS
                     EEELKHFTNAVDRWDSRETEQLPECMKMCFQVLYNTTCEIAREIEEENGWNQVLPQLT
                     KVWADFCKALLVEAEWYNKSHIPTLEEYLRNGCISSSVSVLLVHSFFSITHEGTKEMA
                     DFLHKNEDLLYNISLIVRLNNDLGTSAAEQERGDSPSSIVCYMREVNASEETARKNIK
                     GMIDNAWKKVNGKCFTTNQVPFLSSFMNNATNMARVAHSLYKDGDGFGDQEKGPRTHI
                     LSLLFQPLVN"
Coding sequence (CDS 54..1784): start (atg), stop (tag)
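The CDS coordinates can be checked by hand or with a short script: position 54 should begin the start codon and position 1784 should end the stop codon. The sketch below uses only the two sequence lines from the record that cover those positions (bases 1 to 60 and 1741 to 1800); the helper names are invented for this illustration, and positions are 1-based as in GenBank.

```python
# Sequence lines copied from the ORIGIN block of AY182241.
LINE_1 = "ttcttgtatc ccaaacatct cgagcttctt gtacaccaaa ttaggtattc actatggaat"     # bases 1-60
LINE_1741 = "ggacccacat cctgtcttta ctattccaac ctcttgtaaa ctagtactca tatagtttga"  # bases 1741-1800

def bases(line):
    """Remove the spacing between the 10-base blocks."""
    return line.replace(" ", "")

def codon_at(fragment, fragment_start, position):
    """Return the 3 bases starting at a 1-based `position`.

    `fragment_start` is the 1-based position of the fragment's first base.
    """
    offset = position - fragment_start
    return fragment[offset:offset + 3]

cds_start, cds_end = 54, 1784  # from the feature table: CDS 54..1784

start_codon = codon_at(bases(LINE_1), 1, cds_start)
stop_codon = codon_at(bases(LINE_1741), 1741, cds_end - 2)
cds_length = cds_end - cds_start + 1
codons = cds_length // 3

print("start codon:", start_codon)
print("stop codon:", stop_codon)
print("CDS length:", cds_length, "bases =", codons, "codons (including the stop)")
```

Because the CDS length is a multiple of three, the reading frame runs cleanly from the start codon to the stop codon.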
![Page 17: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/17.jpg)
The Sequence: What do you do with it?
ORIGIN
        1 ttcttgtatc ccaaacatct cgagcttctt gtacaccaaa ttaggtattc actatggaat
       61 tcagagttca cttgcaagct gataatgagc agaaaatttt tcaaaaccag atgaaacccg
      121 aacctgaagc ctcttacttg attaatcaaa gacggtctgc aaattacaag ccaaatattt
      181 ggaagaacga tttcctagat caatctctta tcagcaaata cgatggagat gagtatcgga
[...]
     1741 ggacccacat cctgtcttta ctattccaac ctcttgtaaa ctagtactca tatagtttga
     1801 aataaatagc agcaaaagtt tgcggttcag ttcgtcatgg ataaattaat ctttacagtt
     1861 tgtaacgttg ttgccaaaga ttatgaataa aaagttgtag tttgtcgttt aaaaaaaaaa
     1921 aaaaaaaaaa a
//
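One common first step is to strip the position numbers and spacing from the ORIGIN block so the bare sequence can be pasted into a search box or saved as FASTA. The following is a rough sketch of that clean-up, not an official NCBI tool: the function names are hypothetical, and the excerpt holds only the first 120 bases of the record.

```python
def origin_to_sequence(origin_text):
    """Keep only the nucleotide blocks from a GenBank ORIGIN section."""
    tokens = origin_text.replace("//", " ").split()
    # Drop the ORIGIN keyword and the position numbers; keep base blocks.
    return "".join(t.lower() for t in tokens if set(t.lower()) <= set("acgtn"))

def to_fasta(name, sequence, width=60):
    """Format a sequence as FASTA text with `width` bases per line."""
    lines = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return ">" + name + "\n" + "\n".join(lines)

# Excerpt only: the first two sequence lines of AY182241.
ORIGIN_EXCERPT = """ORIGIN
        1 ttcttgtatc ccaaacatct cgagcttctt gtacaccaaa ttaggtattc actatggaat
       61 tcagagttca cttgcaagct gataatgagc agaaaatttt tcaaaaccag atgaaacccg
"""

sequence = origin_to_sequence(ORIGIN_EXCERPT)
print(to_fasta("AY182241.2 (first 120 bases)", sequence))
```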
![Page 18: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/18.jpg)
BLAST: Query a database for sequences similar to an input sequence.
Compare new genes to old ones
Compare genes from different species or hosts
Investigate the transcriptome (cDNAs)
Identify possible functions based on similarities to known sequences
[Figure: a query sequence aligned against a database sequence, with a gap in the alignment]
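The idea of "similar" can be made concrete by scoring an alignment. The sketch below computes percent identity for two sequences that are already aligned, with `-` marking a gap. It is not how BLAST searches a database (BLAST uses a faster heuristic to find and extend local matches); it only illustrates the kind of number a match report shows. The toy sequences and the function name are invented for this example.

```python
def percent_identity(aligned_query, aligned_subject, gap="-"):
    """Percent of alignment columns where both sequences carry the same base."""
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_query:
        raise ValueError("empty alignment")
    matches = sum(
        1
        for q, s in zip(aligned_query, aligned_subject)
        if q == s and q != gap
    )
    return 100.0 * matches / len(aligned_query)

# Toy alignment (hypothetical sequences): one gap and one mismatch.
query = "GATTACAGG"
subject = "GAT-ACAGC"
print(f"{percent_identity(query, subject):.1f}% identity")
```

Students can edit the two strings and predict the score before running the code.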
![Page 19: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/19.jpg)
What are the broad goals of this lab?
To provide an introduction to bioinformatics with a focus on NCBI
To introduce you to searching for articles, sequences, scientists (perhaps yourself ;))
To use the most powerful and reliable method to determine evolutionary relationships between genes
To combine your Wolbachia research with computational biology
![Page 20: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/20.jpg)
What are the specific goals of this lab?
To look for brand new Wolbachia (W) strains
To make a phylogenetic tree of W
To ultimately compare the W tree to an insect phylogeny to infer lateral vs. vertical transmission of your W strains
To contribute to a national sequence database on the genetic diversity of the W 16S rRNA gene
![Page 21: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/21.jpg)
Outcomes: A New Wolbachia Species?
![Page 22: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/22.jpg)
[Figure: insect phylogeny shown beside the top 5 Wolbachia BLAST matches; each of the five match sequences is marked 100%]
![Page 23: Bioinformatics for your classroom](https://reader033.fdocuments.us/reader033/viewer/2022051821/56815bcc550346895dc9c2de/html5/thumbnails/23.jpg)
Let’s Begin Our Bioinformatic Exercise
Lab 5