Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts...
Transcript of Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts...
![Page 1: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/1.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
Modern Biology in Two Lectures Modern Biology in Two Lectures (Part II)(Part II)
Gil AlterovitzGil Alterovitz
![Page 2: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/2.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
Course AdministrationCourse Administration
HandoutsHandoutsOpen Courseware formOpen Courseware form-- please turn in before leaving please turn in before leaving classclassMatlab formMatlab form-- for free copy of Matlab for students in for free copy of Matlab for students in class for use in 6.092/HST.480 course. You can also class for use in 6.092/HST.480 course. You can also use server Matlab or lab cluster.use server Matlab or lab cluster.Background sheetBackground sheet-- complete and turn in by end of complete and turn in by end of class so we can put you on course email list.class so we can put you on course email list.
Homework 1 (Due next Thurs.)Homework 1 (Due next Thurs.)See assignments section in course site.See assignments section in course site.
![Page 3: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/3.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
BackgroundBackgroundStudent Department Affiliation
0
5
10
15
20
25
30
35
40
45
50
6 HST 2 7
Department
Perc
ent
Student Subject Area Strengths
0
5
10
15
20
25
30
35
40
45
Bio Comp Sci Engineering
Rel
ativ
e S
treng
th- N
orm
aliz
ed to
100
![Page 4: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/4.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
TodayToday
Introduction, Part IIIntroduction, Part II-- Gil AlterovitzGil AlterovitzReview Part IReview Part ISplicingSplicingAlternative SplicingAlternative SplicingPostPost--Translational ModificationsTranslational Modifications
Sequence AnalysisSequence Analysis-- Manolis KellisManolis Kellis
![Page 5: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/5.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
GenesGenes to Proteinsto ProteinsTranscription Translation
DNA: “Lifetime Plan” Protein: MachinesmRNA: “Task List”MWTRFDSALPRSTPSTAKLVMPOILLLLEEEDTYESAQYKTWLMVCSDETTTE5’ATCTACAGATCAGCTACGACGCGACGAT
TTAGCAGCAGCGACGCGACAGCAGCTAGTGACGATAGCACATAGTTAGCACAGAGCAGACACAGACAGCACAGCGACAGCGACGACG-3’
5’AUCUACAGAUCAGCUACGACGCGACGAUUUAGCAGCAGCGACGCGACAGCAGCUAGUGACGAUAGCACAUAGUUAGCACAGAGCAGACACAGACAGCACAGCGACAGCGACGACG-3’
Figure by MIT OCWFigure by MIT OCW
IdentificationPost translation modificationSplicing variantsRelative expression levels
Relative ExpressionLevelsDNA Sequencing
Source: HPCGG
mRNA
Protein
Ribosome
![Page 6: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/6.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
GenesGenes
Intergenic Sequence
Gene 1
Gene 2
Communication analogy: start, message, stop.
Source: Ehsan Afkhami
![Page 7: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/7.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
Stereo Rack AnalogyStereo Rack Analogy
Amplifier
![Page 8: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/8.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
Alternative SplicingAlternative Splicing
Altered translation
Enhancer regulatory element
*Stabilizing domain
*
5' UTR
Change in protein expression levels Altered protein structure and function
mRNA instability
Change in protein expression levels
3' UTRProtein coding sequence
AAAAAA A
AAAAAA A AAAAAA A
Change in protein isoform ratios
Figure by MIT OCW
![Page 9: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/9.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
Sequence OrderingSequence OrderingCoding Strand
(Codons) 5' > > > - - - - - - T T C - - - - - - > > > 3'
Template Strand (Anti-codons) 3' < < < - - - - - - A A G - - - - - - < < < 5'
RNA Message (Codons) 5' > > > - - - - - - U U C - - - - - - > > > 3'
Protein Amino Acid Amino > > > Phenylalanine > > > Carboxy
DNA
DNATranscription
Transcriptional control
Epigenetic factorsFeedback loops
RNA mRNA Protein
Post-transcriptionalcontrol (eg, alternative splicing,
alternative polyadenylation,RNA editing)
Translational and degradationcontrols
Translational frameshifting
Activity controls(eg, post-translational modifications, degradation, compartmentalisation)
Catalytic activity,association, stability, half-life,
localisation, activity, etc)
Effects
>200 known post-translational modifications (eg, phosphorylation, glycosylation lipid attachment, peptide cleavage)
Processing Translation
Figure by MIT OCW
![Page 10: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/10.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
PostPost--translational Modificationstranslational ModificationsRESID ID Name SequenceSpec Weight Keyword Feature Enzyme
AA0039O4'-phospho-L-
tyrosineL-tyrosine
Fc=243.15, Fp=243.0296, Cc=79.98 Cp=79.9663,
Phospho-protein
PIR:Binding site: phosphate (Tyr) (covalent)PIR:Binding site: phosphate (Tyr) (covalent) (by ...)
SP:MOD_RES PHOSPHORYLATIONSP:MOD_RES PHOSPHORYLATION (AUTO-)
protein-tyrosine kinase
(EC 2.7.1.112)
339 modifications in RESID Database HH
H
N
HH
HH
H
O
OH
Phosphate
Carboxyl group
Amino group
Tyrosine side chain
OH
PO O
Figure by MIT OCW
![Page 11: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/11.jpg)
Bioinformatics: Trends, Bioinformatics: Trends, Tools, and DatabasesTools, and Databases
What kind of problems need to be solved?What kind of problems need to be solved?How have previous problems in the field been How have previous problems in the field been
approached?approached?
![Page 12: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/12.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
Databases Needed to Store Databases Needed to Store Growing List of Sequence DataGrowing List of Sequence Data
Entrez Human Nucleotide Sequences
0
1000000
2000000
3000000
4000000
5000000
6000000
7000000
8000000
1993 1995 1997 1999 2001 2003
Year
Num
ber o
f Seq
uenc
es
Entrez Human Protein Sequences
0
50000
100000
150000
200000
250000
1993 1995 1997 1999 2001 2003
Year
Num
ber o
f Seq
uenc
es
Entrez Human Nucleotide Sequences Entrez Human Protein Sequences
* Alterovitz, G., Afkhami, E. & Ramoni, M. in Focus on Robotics and Intelligent Systems Research, ed. Columbus, F. Nova Science Publishers, Inc., New York, 2005 (In press).
![Page 13: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/13.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
Paradigm Shifts in BioinformaticsParadigm Shifts in BioinformaticsSequencing Sequencing (1980’s to early 1990’s)(1980’s to early 1990’s)
DNA/RNA/Protein Sequence Analysis/sequence storageDNA/RNA/Protein Sequence Analysis/sequence storage33--D Protein Structure PredictionD Protein Structure Prediction (Mid(Mid--1980’s1980’s--late late 1990’s)1990’s)
Databases of Protein structuresDatabases of Protein structuresDNA/RNA Microarray Expression ExperimentsDNA/RNA Microarray Expression Experiments (Mid(Mid--1990’s to 2000’s)1990’s to 2000’s)
Databases of expression dataDatabases of expression dataProtein interaction experimentsProtein interaction experiments (Early 2000’s to (Early 2000’s to Present) Present)
Databases with pairwise interactionsDatabases with pairwise interactionsMass Spec proteomic pattern experimentsMass Spec proteomic pattern experiments (Early (Early 2000’s to Present)2000’s to Present)
Databases with mass spec, protein identifications, proteomic Databases with mass spec, protein identifications, proteomic patternspatterns
Integration of multiple modalities (Ongoing)Integration of multiple modalities (Ongoing)
![Page 14: Modern Biology in Two Lectures (Part II) · Science & Technology Course Administration Handouts Open Courseware form- please turn in before leaving class Matlab form- for free copy](https://reader036.fdocuments.us/reader036/viewer/2022081614/5fc0790b2b03ca145834ff57/html5/thumbnails/14.jpg)
HarvardHarvard--MITMITDivision of Health Division of Health Science & TechnologyScience & Technology
Human Genome ProjectHuman Genome Project~ 99% of human genome has been sequenced (2004). ~ 99% of human genome has been sequenced (2004). Nature 431: 931Nature 431: 931--945. 945. Error rate: ~1 event per 100,000 basesError rate: ~1 event per 100,000 basesNumber of proteinNumber of protein--coding genes: 20,000coding genes: 20,000--25,00025,000Number of proteinNumber of protein--coding genes in worm: ~18,000 coding genes in worm: ~18,000 Genes comprise only about 2% of the human genome.Genes comprise only about 2% of the human genome.
The rest consists of nonThe rest consists of non--coding regions: functions coding regions: functions may include providing chromosomal structural may include providing chromosomal structural integrity and gene regulation.integrity and gene regulation.