![Page 1: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/1.jpg)
Brief Overview of SOC’s Computational Biology Lab
http://www.comp.nus.edu.sg/~cbl
10 May 2011
![Page 2: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/2.jpg)
Computational Biology
• Aims
– Improve understanding of molecular circuits
– Deliver better diagnosis and treatment of diseases
• Combinatoric Algo: Ken Sung, Leong Hon Wai
• DB & Knowledge Discovery: Tan Kian Lee, Wong Limsoon, Anthony Tung, Lee Mong Li, Wynne Hsu
• Modeling & Simulation: P.S. Thiagarajan, David Hsu
Copyright © 2011 by Limsoon Wong
![Page 3: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/3.jpg)
Recent Honours
• Ken Sung
– 2008 NUS Young Researcher Award: Contribution to research in algorithms & computational biology
– 2006 Singapore National Science Award: Paired End diTag sequencing technology
• Limsoon Wong
– 2006 Singapore Youth Award Medal of Commendation: Sustained contributions to science & technology
– 2003 Far Eastern Economic Review Asian Innovation Gold Award: A simple test for childhood leukaemia
![Page 4: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/4.jpg)
DREAM Challenge 2007
• 5 bioinformatics challenges
• Participants must predict the answer using bioinformatics methods
• We participated in 2 challenges and were the best performers in both
• Challenge 1: BCL6 target genes finding
– Lee et al., Ann NY Acad Sci, 2009
• Challenge 2: PPI subnetwork prediction
– Chua et al., Ann NY Acad Sci, 2009
![Page 5: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/5.jpg)
Recent Professional Activities
• Journals edited:
– Info Systems, JBCB, Bioinformatics, DDT, TCBB
• Books/Proceedings edited:
– Algo in Bioinfo, BIBM2010, RECOMB’08, 3rd Regulatory Genomics
• Annually involved in ~10 bioinformatics conf prog & org committees
– RECOMB, ISMB, CSB, GIW, APBC, ACM BCB, IEEE BIBM, …
• Annually publish ~30 papers
– Bioinformatics, JCB, BMC, JBCB, PLoS CB, EMBO, NAR, Cell, JBC, TKDE, VLDBJ, SIAM J Comput, Clin Cancer Res, JPR, …
• Annually give ~10 keynotes & invited talks in conferences
![Page 6: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/6.jpg)
Recent Conferences Hosted
• International Conferences
– 18th Intl Conf on Genome Informatics (GIW2007)
– 2nd Intl Symp on Languages in Biology and Medicine (LBM2007)
– 6th Assoc of Asian Societies for Bioinformatics Symp (AASBi2007)
– 12th Intl Conf on Research in Computational Molecular Biology (RECOMB2008)
• Regional Workshops
– 1st Japan-Singapore Workshop on Computational Systems Biology (2008)
– 8th Korea-Singapore Workshop on Bioinformatics & NLP (KSW2008)
– 3rd Japan-Singapore Workshop on Computational Systems Biology (2011)
– 2nd IPM-NUS Workshop on Computational Biology (2011)
![Page 7: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/7.jpg)
Main Courses Taught
• CS2220 Introduction to Computational Biology
– Understand bioinformatics problems; interpretational skills
• CS3225 Combinatorial Methods in Bioinformatics
• CS4220 Knowledge Discovery Methods in Bioinformatics
– Clustering; classification; association rules; SVM; HMM; Mining of seq, trees, & graphs
• CS5238 Advanced Combinatorial Methods in Bioinformatics
– Seq alignment, whole-genome alignment, suffix tree, seq indexing, motif finding, RNA sec struct prediction, phylogeny reconstruction
• CS6280 Computational Systems Biology
– Dynamics of biochemical and signaling networks; modeling, simulating, & analyzing them
• ~15 students a year in NUS undergrad comp bio prog
![Page 8: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/8.jpg)
Recent Comp Bio PhD Students
http://www.comp.nus.edu.sg/~cbl/theses.html
• 2005: 3 PhDs awarded
• 2006: 4 PhDs awarded
• 2007: 4 PhDs awarded
• 2008: 8 PhDs awarded
• 2009: 4 PhDs awarded
• 2010: 3 PhDs awarded
• 2011: 3-5 PhDs expected
• Charlie Lee (2010)
– Bioinformatics applications for virology research
– RF at A*STAR GIS
• Mengling Feng (2010)
– Frequent pattern space maintenance: Theories and algos
– RF at A*STAR I2R
• Donny Soh (2010)
– Understanding pathways
– RF at A*STAR I2R
• Hugo Willy (2011)
– Interaction motif inference from biomolecular interactions
– RF at NUS SOC
• Bing Liu (2011)
– Probabilistic approx and analysis techniques for biopathway models
– RF at NUS SOC
• Brandon Ooi (2011)
– Molecular and computational approaches to understanding keloid scarring
– Acad staff at Republic Poly
![Page 9: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/9.jpg)
Research Directions
![Page 10: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/10.jpg)
Recently Funded Projects
• Supporting diagnostic data mining via exploratory hypothesis testing and analysis, $655k, SERC PSF
• Construction of reliable protein interactomes for infectious diseases, $850k, MOE T2
• Decomposition and Composition of Large Signalling Pathway Models with Emphasis on Parameter Estimation, $487k, MOE AcRF T2
• Extracting biological signals from second generation sequencing, $888k, MOE T2
• Lipidomics – Novel Tools and Applications, $1m of $8m, NRF CRP
• Total new funding
– $2.88m + $1m of $8m
![Page 11: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/11.jpg)
Systems Biology Modeling and Simulation
![Page 12: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/12.jpg)
Protein Structure, Folding, & Motion
![Page 13: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/13.jpg)
Protein Function & Protein Complex Prediction Using PPI Networks
![Page 14: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/14.jpg)
Cloud Computing for GWAS
• Cloud-based epistasis computing model for large-scale GWAS
– Efficient
– Flexible
– Scalable
– Practical
• eCEO can do pairwise testing of 500k SNPs in <9 hrs using 40 nodes
Wang et al, Bioinformatics, 27:1045-1051, 2011
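To put the eCEO figure in perspective, here is a back-of-envelope sketch of how many SNP pairs an exhaustive pairwise scan involves and what per-node throughput the quoted time implies. It is only arithmetic, not the eCEO algorithm; the function names are hypothetical, and it assumes each unordered pair is tested exactly once and the work is spread evenly over the nodes.

```python
from math import comb


def pairwise_tests(num_snps: int) -> int:
    """Number of unordered SNP pairs in an exhaustive pairwise scan."""
    return comb(num_snps, 2)


def implied_rate_per_node(num_snps: int, hours: float, nodes: int) -> float:
    """Pairs per second per node needed to finish within `hours`.

    Assumes (for illustration only) a perfectly even split of pairs
    across nodes and no communication or I/O overhead.
    """
    seconds = hours * 3600
    return pairwise_tests(num_snps) / (seconds * nodes)


pairs = pairwise_tests(500_000)
rate = implied_rate_per_node(500_000, hours=9, nodes=40)
print(f"{pairs:,} pairs; at least {rate:,.0f} pairs/s per node")
```

Under these assumptions, 500k SNPs give about 1.25 × 10^11 pairs, so finishing in under 9 hours on 40 nodes means each node handles on the order of 10^5 pairs per second.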
![Page 15: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/15.jpg)
Next-Gen Sequencing & Transcriptomics
![Page 16: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/16.jpg)
Network-Based Gene Expression & Proteomic Profile Analysis
![Page 17: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/17.jpg)
Fast DNA Alignment
• BLAST is one of the best methods for identifying approx matches in a large seq db
• However, it is a heuristic. It can miss answers
• We introduce meaningful alignment based on compressed suffix tree
• New DNA alignment method that does not miss answers and is as fast as BLAST
[Figure: example suffix tree with edge labels over a, c, g, $ and leaves numbered 1 to 7]
Lam et al, Bioinformatics, 24:791-797, 2008
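The idea that an index over all suffixes cannot miss a match can be illustrated with a minimal sketch. This is not the method of Lam et al: it swaps in a plain, uncompressed suffix array for the compressed suffix tree, handles exact matching only, and builds the index naively. The example string `acacag$` and the function names are assumptions chosen for illustration.

```python
def build_suffix_array(text: str) -> list[int]:
    """Start positions of all suffixes of `text`, in lexicographic order.

    Naive construction (sorts full suffixes); fine for a toy example,
    far too slow for genome-scale text.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_all(text: str, sa: list[int], pattern: str) -> list[int]:
    """Every start position where `pattern` occurs exactly in `text`.

    Suffixes that begin with `pattern` form one contiguous block of the
    suffix array, so two binary searches locate all occurrences.
    """
    m = len(pattern)
    lo, hi = 0, len(sa)
    while lo < hi:  # first suffix whose m-char prefix is >= pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start, hi = lo, len(sa)
    while lo < hi:  # first suffix whose m-char prefix is > pattern
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(sa[start:lo])


text = "acacag$"  # '$' marks the end of the sequence
sa = build_suffix_array(text)
print(find_all(text, sa, "ca"))  # 0-based start positions of "ca"
```

Because the search is over a complete index rather than a seed-and-extend heuristic, every exact occurrence is reported; extending this to approximate alignment is where the published method does its real work.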
![Page 18: Brief Overview of SOC’s Computational Biology ...cbl/soc-cbl.pdf · – RF at NUS SOC Bi Li (2011) • Mengling Feng (2010) – Frequent pattern space • Bing Liu (2011) – Probabilistic](https://reader034.fdocuments.us/reader034/viewer/2022050203/5f56ba79c9b4b528d76164e9/html5/thumbnails/18.jpg)
Any Questions?
Contact: Professor Wong Limsoon
http://www.comp.nus.edu.sg/~wongls