Post on 08-Jan-2016
description
1
Bioinformatics,Bioinformatics,Computational BiologyComputational Biology — An Introduction — An Introduction
2
“…the most wondrous map ever produced by mankind” — Bill Clinton
3
4
DNA
5
The difference between you & chimp is ~1.24% The difference between you and Maggie is ~0.1%
Post Genome Era Why small variation, BIG DIFFERENCE?
6
Genetics:
From DNA topopulation
Source: gsk
7
8
Introduction – Gene History
1865 Mendel: The basic unit of inheritance is a gene.
Mendel’s work was forgotten until 1900s.1944 The gene was known to be made of
DNA (Deoxyribonucleic Acid).1953 James Watson and Francis Crick :
Double helical structure of DNA.
( 雙股螺旋 )
9
Introduction – Gene History (Cont.)
1990 The Human Genome Project ( 人類基 因體計畫 ) started.
1995 The first free-living organism to be sequenced : haemophilus influenzae
( 流行性感冒嗜血桿菌 )1998 CELERA joined the gene research.2000 The human DNA sequence draft was completed (published in 2001).
10
動物細胞 ( 細胞核、細胞質、細胞膜 )DNA 位於細胞核內之「核仁」
11
DNA Sequence
12
DNA Length
The total length of the human DNA is about 3109 (30 億 ) base pairs.1% ~ 1.5% of DNA sequence is useful.# of human genes: 30,000~40,000 Conclusion from the human genome proj
ect Expected # is 100,000 originally.
13
14
DNA Double Helix ( 雙股螺旋)
15
DNA/RNA 核甘酸分子核甘酸 (Nucleotide) 包含:- 五碳糖 ( 去氧核糖 , deoxyribose)- 磷酸基 (phosphate group)- 四種含氮鹼基之一 (A 、 G 、 C 、 T/U)
16
Backbone of DNA and RNA
17
Watson-Crick Base Pairs
18
DNA Double Helix ( 雙股螺旋)
19
From DNA to RNA to Protein
20
Biochemical Context of Genomics and Proteomics
DNA
mRNA
Proteins
Cell functions
Genome “Genomics”
Proteome“Proteomics”
21
What is Bioinformatics?
Deduction of knowledge by computer analysis of biological data
See 988000 pages on this issue on the WWWInformation stored in the genetic code (DNA), protein sequencesProtein 3D structures, chromosome structureProtein interaction, transcription factor, motifMicro array gene expression, functional MRI, 2D-gelExperimental resultsPatient statisticsScientific literatureAnalysis tools
22
Computational Biology & Bioinformatics
Computational Biology
Biological Hypothesis
Formal Specifications
AlgorithmsRaw Data
Information
Bioinformatics End with Experiments
___
23
Key Strategy for Analysis
Information
Consensus
Clustering
Distance Measurement
Data
FESS
Evolution
Functions
StructuresSequences
In Computer Sciences In Biology
24
Key Strategy for System Biology Experiment Computer Aided Design Specification, Simulation and Reverse Engineering
22
Reverse Engineering StrategyHypothesis
Simulation Results
Candidate Set
Match 實際Microarray 輸出結果
Believe it or not是否唯一吻合
重新假設
再作 Distinguishable實驗
是
否
y
n
25
Problems on Different Levels
26
Some Problems in Bioinformatics
Sequence comparison Longest common subsequence Edit distance Similarity Multiple sequence alignment
Fragment assembly of DNA sequences Shortest common superstring
Physical mapping Double digest problem Consecutive ones problem
Evolutionary treesMolecular structure prediction
Protein folding
27
Bioinformatics and Computer Science
Algorithm: all computing problems.Image processing: 3D images of RNA folds or protein.Database: massive database and retrieval.Distributed system and parallel processing: massive storage and accelerating computation.
28
Conclusion
Biology easily has 500 years of exciting problems to work on.
-- Donald E. Knuth
Nano
Cognition
Biology
Informatics !
Go working for Integrating
30
Reference – Journals
Bioinfomatics (SCI)Bulletin of Mathematical Biology (SCI)Computer Applications in the BiosciencesJournal of Computational Biology (SCI expanded)Journal of Mathematical Biology (SCI)Journal of Molecular Biology (SCI)Nucleic Acids Research (SCI)Gene (SCI)Science (SCI)
31
Reference – Web Sites
BioWeb http://bioweb.uwlax.edu/MIT Biology Hypertextbook http://esg-www.mit.edu:8001/esgbio/Bioinformatics Related Journals http://www.iscb.org/journals.htmlNCBI http://www.ncbi.nlm.nih.gov/