Algorithms in Computational Biology (236522) Fall 2005-6 Lecture #1
1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours:...
-
date post
20-Dec-2015 -
Category
Documents
-
view
218 -
download
1
Transcript of 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours:...
![Page 1: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/1.jpg)
1
Algorithms in Computational Biology (236522) Spring 2006
Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356)
TA: Itai Sharon Office hours: Tuesday 2:30-3:20 (Taub 621, Tel 4946)
Lecture: Monday 12:30-2:20, Taub 4
Tutorial: Tuesday 1:30-2:20, Taub 7
![Page 2: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/2.jpg)
1
Computational BiologyComputational biology is the application of computational tools and techniques to (primarily) molecular biology. It enables new ways of study in life sciences, allowing analytic and predictive methodologies that support and enhance laboratory work. It is a multidisciplinary area of study that combines Biology, Computer Science, and Statistics.
Computational biology is also called Bioinformatics, although many practitioners define Bioinformatics somewhat narrower by restricting the field to molecular Biology only.
![Page 3: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/3.jpg)
1
Examples of Areas of Interest• Understanding the structure of genomes (detecting
genes, regulatory elements, variations)• Deciphering structure and function of proteins• Discovery of cellular “procedures” (pathways)• Indentifying disease-causing genes• Building the tree of life• More ..
![Page 4: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/4.jpg)
1
Exponential growth of biological information: growth of sequences, structures, and literature.
![Page 5: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/5.jpg)
1
Course’s goals
The focus of this course is the set of algorithms, tools and models used today to analyse molecular biological data, recover and discover hidden information.
![Page 6: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/6.jpg)
1
Course PrerequisitesComputer Science and Probability Background
• Data structure 1 (cs234218)• Algorithms 1 (cs234247)• Probability (any course)
Or permission from instructor
Biology background• Formally: none (to allow CS studnets to take this course)• Recommended: Molecular Biology 1 (especially for those in
the Bioinformatics track), or a similar Biology course, and/or a serious desire to complement your knowledge in Biology by reading the appropriate material (see the course home).
![Page 7: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/7.jpg)
1
Requirements & Grades• 40% homework, in five or six assignments. Homework is
obligatory.
• 60% test. Must pass 55 for the homework’s grade to count
• Exam date: 17.7.06
![Page 8: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/8.jpg)
1
Syllabus
• Introduction, biological background (0.5 weeks)
• Gene detection and function prediction– Pairwise alignment (2 weeks)
– Multiple sequence alignment (2 weeks)
– Profile and Hidden Markov Models (2 weeks)
• Motif detection (1 week)
• Phylogenetic trees (2 weeks)
• Expression data analysis, pathways (2.5 weeks)
• Protein structure analysis (2 weeks)
![Page 9: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/9.jpg)
1
Bibliography• Biological Sequence Analysis, R.Durbin et al. , Cambridge
University Press, 1998
• Introduction to Molecular Biology, J. Setubal, J. Meidanis, PWS publishing Company, 1997
• Misc papers
• Some slides adopted from courses taught by Nir Friedman (Hebrew U), Dan Geiger and Shlomo Moran
• Course home: webcourse.cs.technion.ac.il/~cs236522
![Page 10: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/10.jpg)
1
Biological Background
First home work assignment: Read the first chapter (pages 1-30) of Setubal et al., 1997. (copies are available in the Taub building library, and in the central library). Answer the questions of the first assignment in the course site.
![Page 11: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/11.jpg)
1
Course starts..
![Page 12: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/12.jpg)
1
Human GenomeMost human cells contain
46 chromosomes:
• 2 sex chromosomes (X,Y):
XY – in males.
XX – in females.
• 22 pairs of chromosomes named autosomes.
![Page 13: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/13.jpg)
1
DNA OrganizationS
ourc
e: A
lber
ts e
t al
![Page 14: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/14.jpg)
1
The Double HelixS
ourc
e: A
lber
ts e
t al
![Page 15: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/15.jpg)
1
DNA ComponentsFour nucleotide types:
• Adenine• Guanine• Cytosine• Thymine
![Page 16: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/16.jpg)
1
Base pairsHydrogen bonds (electrostatic connection):
• A-T• C-G
![Page 17: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/17.jpg)
1
![Page 18: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/18.jpg)
1
Genome Sizes• E.Coli (bacteria) 4.6 x 106 bases• Yeast (simple fungi) 15 x 106 bases• Smallest human chromosome 50 x 106 bases• Entire human genome 3 x 109 bases
![Page 19: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/19.jpg)
1
Genetic Information
• Genome – the collection of genetic information.
• Chromosomes – storage units of genes.
• Gene – basic unit of genetic information. They determine the inherited characters.
![Page 20: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/20.jpg)
1
GenesThe DNA strings include:• Coding regions (“genes”)
– E. coli has ~4,000 genes – Yeast has ~6,000 genes– C. Elegans has ~13,000 genes– Humans have ~32,000 genes
• Control regions – These typically are adjacent to the genes– They determine when a gene should be “expressed”
• “Junk” DNA (unknown function - ~90% of the DNA in human’s chromosomes)
![Page 21: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/21.jpg)
1
The cell
All cells of an organism contain the same DNA content (and the same genes) yet there is a variety of cell types.
![Page 22: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/22.jpg)
1
Example: Tissues in Stomach
How is this variety encoded and expressed ?
![Page 23: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/23.jpg)
1
Central Dogma
Transcription
mRNA
Translation
ProteinGene
cells express different subset of the genesIn different tissues and under different conditions
שעתוק תרגום
![Page 24: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/24.jpg)
1
Central dogma
![Page 25: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/25.jpg)
1
Transcription• Coding sequences can be transcribed to
RNA
• RNA – Similar to DNA, slightly different nucleotides:
different backbone– Uracil (U) instead of Thymine (T)
Sou
rce:
Mat
hew
s &
van
Hol
de
![Page 26: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/26.jpg)
1
![Page 27: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/27.jpg)
1
Transcription: Junk DNA, RNA Editing, Alternative Splicing
Exons hold information, they are more stable during evolution.This process takes place in the nucleus. The mRNA molecules diffuse through the nucleus membrane to the outer cell plasma.
1. Transcribe to RNA2. Eliminate introns3. Splice (connect) exons* Alternative splicing exists
![Page 28: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/28.jpg)
1
RNA roles• Messenger RNA (mRNA)
– Encodes protein sequences. Each three nucleotide acids translate to an amino acid (the protein building block).
• Transfer RNA (tRNA)– Decodes the mRNA molecules to amino-acids. It connects
to the mRNA with one side and holds the appropriate amino acid on its other side.
• Ribosomal RNA (rRNA) – Part of the ribosome, a machine for translating mRNA to
proteins. It catalyzes (like enzymes) the reaction that attaches the hanging amino acid from the tRNA to the amino acid chain being created.
• ...
![Page 29: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/29.jpg)
1
Central dogma
![Page 30: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/30.jpg)
1
ProteinsMade of 20
Amino acids
![Page 31: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/31.jpg)
1
Translation
• Translation is mediated by the ribosome• Ribosome is a complex of protein & rRNA
molecules• The ribosome attaches to the mRNA at a
translation initiation site• Then ribosome moves along the mRNA
sequence and in the process constructs a sequence of amino acids (polypeptide) which is released and folds into a protein.
![Page 32: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/32.jpg)
1
![Page 33: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/33.jpg)
1
Helper molecules: tRNA
![Page 34: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/34.jpg)
1
The Genetic Code
![Page 35: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/35.jpg)
1
![Page 36: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/36.jpg)
1
![Page 37: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/37.jpg)
1
![Page 38: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/38.jpg)
1
Protein Structure
• Proteins are poly-peptides of 70-3000 amino-acids
• This structure is (mostly) determined by the sequence of amino-acids that make up the protein
![Page 39: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/39.jpg)
1
Various structures with different functions
• Structural framework (keratin, collagen)• Transport and storage of small molecules
(hemoglobin)• Transmit information (hormones, receptors)• Antibodies• Blood clotting factors• Enzymes
Protein structures
![Page 40: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/40.jpg)
1
Protein-Protein interactions
![Page 41: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/41.jpg)
1
Pathways
![Page 42: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/42.jpg)
1
![Page 43: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/43.jpg)
1
![Page 44: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/44.jpg)
1
Evolution
• Related organisms have similar DNA– Similarity in sequences of proteins– Similarity in organization of genes along the
chromosomes
• Evolution plays a major role in biology– Many mechanisms are shared across a wide
range of organisms– During the course of evolution existing
components are adapted for new functions
![Page 45: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/45.jpg)
1
Evolution
Evolution of new organisms is driven by
• Diversity– Different individuals carry different variants of
the same basic blue print
• Mutations– The DNA sequence can be changed due to
single base changes, deletion/insertion of DNA segments, etc.
• Selection bias
![Page 46: 1 Algorithms in Computational Biology (236522) Spring 2006 Lecturer: Golan Yona Office hours: Wednesday or Thursday 2-3pm (Taub 632, Tel 4356) TA: Itai.](https://reader034.fdocuments.us/reader034/viewer/2022042702/56649d455503460f94a224ad/html5/thumbnails/46.jpg)
1
The Tree of Life
Sou
rce:
Alb
erts
et
al