Whole-genome bisulfite sequencing of bovine gametes and in ...
Introduction to Bioinformatics of Bisulfite Sequencing...
Transcript of Introduction to Bioinformatics of Bisulfite Sequencing...
![Page 1: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/1.jpg)
©2018 MFMER | slide-1
Introduction to Bioinformatics of Bisulfite Sequencing Methylation DataGarrett Jenkinson, PhDLead Informatics SpecialistBiomedical Statistics and InformaticsDepartment of Health Sciences Research
![Page 2: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/2.jpg)
©2018 MFMER | slide-2
Genetics versus Epigenetics• Which is more different at cellular level phenotypically?
• Your heart cells from your brain cells• A monkey’s heart cells from your heart cells
• How about the old nature versus nurture question?• Two healthy but unrelated peoples’ livers• Identical twins’ livers when only one is alcoholic
• Regulation of gene expression can be as critical as underlying genomic sequence
• A gene can be turned off by regulation which can be functionally the same as obliteration of the genomic sequence
• Not all regulation is epigenetic and causality rarely understood…don’t be oversold
2
![Page 3: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/3.jpg)
©2018 MFMER | slide-3
• DNA methylation is crucial if you want to understand:
• Developmental biology, stem cells, differentiation
• Carcinogenesis, imprinting disorders• Aging, environmental exposures
Motivation
3
![Page 4: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/4.jpg)
©2018 MFMER | slide-4
Biology of DNA methylation in mammals• Covalent addition of a methyl group to the 5’
carbon of cytosine residues (5mC)• Predominantly at CG dinucleotides (CpG sites)
4
![Page 5: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/5.jpg)
©2018 MFMER | slide-5
CpG sites
• The “p” represents phosphate backbone to distinguish between CpG and C—G hydrogen bonding between the strands of DNA
• Only positions in human genome with known mechanisms for epigenetic inheritance past cell division (DNMT enzymes)
• Dense regions of CpG sites referred to as CpG islands which are flanked by shores, shelves and then CpG depleted open seas
• Methylated islands in promoters linked to repressed gene expression
• Methylation has complicated relationships to chromatin structure and gene expression
• Mechanistic understanding of DNA methylation in gene regulation is incomplete
5
![Page 6: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/6.jpg)
©2018 MFMER | slide-6
Agouti Mouse Model• Genetically identical, phenotype differences
driven by difference in methylation at agouti gene
• Expose pregnant mice to bisphenol A (BPA in plastic products)
• Disproportionate number of yellow, obese progeny than would normally be expected
• DNA methylation at the agouti gene sites is decreased (hypomethlyated)
• Need sequencing methods to probe the state of DNA methylation
6
![Page 7: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/7.jpg)
©2018 MFMER | slide-7
Detailed View of Bisulfite Sequencing
https://software.broadinstitute.org/software/igv/sites/cancerinformatics.org.igv/files/SL_IGV_bisulfiteflow2.png
7
![Page 8: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/8.jpg)
©2018 MFMER | slide-8
QC and Alignment of BS-seq data• Need specialized algorithms/tools to deal with
“heavily mutated” BS-seq data• trimgalore! is a package that wraps cutadapt
and allows for the trimming of low quality bases and adapters from sequencing reads
• Bismark is a bisulfite-aware aligner using bowtie2
• Can also produce QC and methylation summarization information
8
![Page 9: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/9.jpg)
©2018 MFMER | slide-9
Post-alignment Data in IGV
9
![Page 10: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/10.jpg)
©2018 MFMER | slide-10
Common BS-seq methods• WGBS completely unfocused
• Comprehensive ~13 million CpG sites profiled• Gold standard• ~$5K per sample
• RRBS, 1% of genome with 1.5 million CpGs• Most common BS-seq• Restriction enzymes chop DNA and results in
enrichment for CGIs• ~$500 per sample
• “Capture” protocols (e.g., EPIC TruSeq), 3 million CpGs• Least common• Looks more like “focused” WGBS
10
![Page 11: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/11.jpg)
“Raw” Data
11
![Page 12: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/12.jpg)
©2018 MFMER | slide-12
Methylation status not as “fixed” as genetic• Populations of genetically homogeneous cells
can and do differ in methylation• Maternal and paternal alleles can and do differ
(e.g., imprinting)• At a given time, each cell’s DNA is either
methylated (1) or unmethylated (0), but state can change during life of cell
• End result: we talk of probability that a CpG site is methylated in a given tissue/sequencing run
12
![Page 13: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/13.jpg)
MarginalEstimation
Xn = 1 if nth site methylatedXn = 0 if unmethylated
Pn(1) = Pr[Xn=1]
13
![Page 14: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/14.jpg)
814
𝑝!(1)
MarginalEstimation
Xn = 1 if nth site methylatedXn = 0 if unmethylated
Pn(1) = Pr[Xn=1]
14
![Page 15: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/15.jpg)
814
MarginalEstimation
714
𝑝!(1)
Xn = 1 if nth site methylatedXn = 0 if unmethylated
Pn(1) = Pr[Xn=1]
15
![Page 16: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/16.jpg)
814
MarginalEstimation
714
715
𝑝!(1)
Xn = 1 if nth site methylatedXn = 0 if unmethylated
Pn(1) = Pr[Xn=1]
16
![Page 17: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/17.jpg)
814
MarginalEstimation
714
715
814
𝑝!(1)
Xn = 1 if nth site methylatedXn = 0 if unmethylated
Pn(1) = Pr[Xn=1]
17
![Page 18: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/18.jpg)
814
MarginalEstimation
714
715
814
1114
𝑝!(1)
Xn = 1 if nth site methylatedXn = 0 if unmethylated
Pn(1) = Pr[Xn=1]
18
![Page 19: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/19.jpg)
814
MarginalEstimation
714
715
814
1114
1014
𝑝!(1)
Xn = 1 if nth site methylatedXn = 0 if unmethylated
Pn(1) = Pr[Xn=1]
19
![Page 20: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/20.jpg)
814
MarginalEstimation
714
715
814
1114
1014
1115
𝑝!(1)
Xn = 1 if nth site methylatedXn = 0 if unmethylated
Pn(1) = Pr[Xn=1]
20
![Page 21: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/21.jpg)
814
MarginalEstimation
714
715
814
1114
1014
1115
1115
𝑝!(1)
Xn = 1 if nth site methylatedXn = 0 if unmethylated
Pn(1) = Pr[Xn=1]
21
![Page 22: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/22.jpg)
814
MarginalEstimation
714
715
814
1114
1014
1115
1115
1013
𝑝!(1)
Xn = 1 if nth site methylatedXn = 0 if unmethylated
Pn(1) = Pr[Xn=1]
22
![Page 23: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/23.jpg)
814
MarginalEstimation
714
715
814
1114
1014
1115
1115
1013
1013
𝑝!(1)
Xn = 1 if nth site methylatedXn = 0 if unmethylated
Pn(1) = Pr[Xn=1]
23
![Page 24: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/24.jpg)
814
SmoothedMarginals
Use smoothing to improve marginal estimates
714
715
814
1114
1014
1115
1115
1013
1013raw estimates:
smoothed estimates:
24
![Page 25: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/25.jpg)
814
SmoothedMarginals
714
715
814
1114
1014
1115
1115
1013
1013raw estimates:
smoothed estimates:
1528
25
![Page 26: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/26.jpg)
814
SmoothedMarginals
714
715
814
1114
1014
1115
1115
1013
1013raw estimates:
smoothed estimates:
1528
323630
26
![Page 27: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/27.jpg)
814
SmoothedMarginals
714
715
814
1114
1014
1115
1115
1013
1013raw estimates:
smoothed estimates:
1528
323630
323630
27
![Page 28: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/28.jpg)
814
SmoothedMarginals
714
715
814
1114
1014
1115
1115
1013
1013raw estimates:
smoothed estimates:
1528
323630
323630
383630
28
![Page 29: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/29.jpg)
814
SmoothedMarginals
714
715
814
1114
1014
1115
1115
1013
1013raw estimates:
smoothed estimates:
1528
323630
323630
383630
2942
29
![Page 30: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/30.jpg)
814
SmoothedMarginals
714
715
814
1114
1014
1115
1115
1013
1013raw estimates:
smoothed estimates:
1528
323630
323630
383630
2942
6790
30
![Page 31: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/31.jpg)
814
SmoothedMarginals
714
715
814
1114
1014
1115
1115
1013
1013raw estimates:
smoothed estimates:
1528
323630
323630
383630
2942
6790
229315
31
![Page 32: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/32.jpg)
814
SmoothedMarginals
714
715
814
1114
1014
1115
1115
1013
1013raw estimates:
smoothed estimates:
1528
323630
323630
383630
2942
6790
229315
436585
32
![Page 33: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/33.jpg)
814
SmoothedMarginals
714
715
814
1114
1014
1115
1115
1013
1013raw estimates:
smoothed estimates:
1528
323630
323630
383630
2942
6790
229315
436585
443585
33
![Page 34: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/34.jpg)
814
SmoothedMarginals
714
715
814
1114
1014
1115
1115
1013
1013raw estimates:
smoothed estimates:
1528
323630
323630
383630
2942
6790
229315
436585
443585
1013
34
![Page 35: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/35.jpg)
©2018 MFMER | slide-35
Methylation is a coordinated phenomena• Rarely do we care about methylation for a
single CpG site…often care about entire island’s coordinated behavior
• To the extent people care about single sites, it is due to the highly correlated/coordinated behaviors of site with neighbors
• “Marginal” view of methylation as a probability at each site is inadequate to capture the richness and diversity of the underlying biology
35
![Page 36: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/36.jpg)
©2018 MFMER | slide-36
Stochasticity: Epipolymorphism/Entropy
Landan et al. Epigenetic polymorphism and the stochastic formation of differentially methylated regions in normal and cancerous tissues. Nat. Gen. 2012
36
![Page 37: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/37.jpg)
©2018 MFMER | slide-37
Joint Probability Distributions• Need to talk about probabilities of patterns of
CpG sites• From such probabilities, any other quantity of
interest is available• Epipolymorphism• Entropy
• Now possible to detect not just hypo- or hyper-methylation changes in the mean, but any difference in methylation behavior
37
![Page 38: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/38.jpg)
EmpiricalEstimation
38
![Page 39: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/39.jpg)
EmpiricalEstimation
39
![Page 40: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/40.jpg)
Pattern Probability = 1/10
Pattern Probability = 2/10
Pattern Probability = 1/10
Pattern Probability = 1/10
Pattern Probability = 1/10
Pattern Probability = 1/10
Pattern Probability = 3/10
The 1017 other patterns are assigned zero probability40
![Page 41: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/41.jpg)
Ising model• Each read is a single-cell measurement
even in bulk sequencing• Means and nearest-neighbor correlations
frequently observed • 1D Ising model is MaxEnt model
consistent with these quantities• Well studied model in statistical physics
with many existing computational techniques/results
• Provides full joint distribution
![Page 42: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/42.jpg)
Ising model performance
• Empirical and marginal methods under- and over- estimate heterogeneity
• Ising is accurate even in low data
![Page 43: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/43.jpg)
Ising model specification
• All patterns have non-zero probability• General model requires estimation of an and cn
parameters; (2N-1) << 2N
• Improve performance further by imposing parametric structure based on the biology
𝑃 𝐱 =1𝑍exp −𝑈 𝐱 𝑍 =#
xexp −𝑈 𝐱
![Page 44: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/44.jpg)
Normalized Methylation Entropy
• Rigorously quantifies stochasticity in DNA methylation using Shannon entropy• Another degree-of-freedom compared to standard
mean analyses• Shown to have discriminatory power in aging, carcinogenesisand stem cell differentiation
![Page 45: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/45.jpg)
Jensen Shannon distance
![Page 46: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/46.jpg)
©2018 MFMER | slide-46
Information Theoretic Bioinformatics Software• informME is an information theoretic package
designed to implement the Ising model, NME, JSD
• Available as a thoroughly used/tested matlab/C++ code base, with bash wrappers and SLURM/SGE submission scripts
• Or recently informME.jl is released as a trial package in julia language requiring no licensing or complex pipelines
46
![Page 47: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/47.jpg)
©2018 MFMER | slide-47
Example Application
47
![Page 48: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/48.jpg)
©2018 MFMER | slide-48
Highlights of DNA methylation in twins study• Twin astronauts with similar past flight
experience studied in detail during longest American spaceflight in history
• Surprising result that space twin globally had less DNA-methylation variability than ground twin; hypotheses why?
48
![Page 49: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/49.jpg)
©2018 MFMER | slide-49
Focal changes in DNA methylation• Less surprising results when looking for focal
genes with DNA methylation differences:• Regulation of ossification, and cellular
response to ultraviolet-B (UV-B), platelet aggregation
• Somatostatin signaling pathway and regulation of superoxide anion generation
• Response to platelet-derived growth factor (PDGF) and T cell differentiation and activation pathways
49
![Page 50: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/50.jpg)
©2018 MFMER | slide-50
Example Detailed Analysis of NOTCH3
50
![Page 51: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/51.jpg)
©2018 MFMER | slide-51
Papers for more detail or applications
51
![Page 52: Introduction to Bioinformatics of Bisulfite Sequencing ...publish.illinois.edu/compgenomicscourse/files/2020/06/Jenkinson_UI… · Introduction to Bioinformatics of Bisulfite Sequencing](https://reader035.fdocuments.us/reader035/viewer/2022062506/5f7289b22c454f58731478ee/html5/thumbnails/52.jpg)
©2018 MFMER | slide-52
Questions?
52