HapMap PROJECT
description
Transcript of HapMap PROJECT
![Page 1: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/1.jpg)
HapMap PROJECT
Basics
![Page 2: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/2.jpg)
HapMap
• The International HapMap Project is analyzing DNA from populations with African, Asian, and European ancestry
![Page 3: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/3.jpg)
Multiple Populations
• The DNA samples for the HapMap have come from a total of 270 people. – The Yoruba people of Ibadan, Nigeria, provided 30
sets of samples from two parents and an adult child (each such set is called a trio).
– In Japan, 45 unrelated individuals from the Tokyo area provided samples.
– In China, 45 unrelated individuals from Beijing provided samples.
– Thirty U.S. trios provided samples, which were collected in 1980 from U.S. residents with northern and western European ancestry by the Centre d'Etude du Polymorphisme Humain (CEPH).
![Page 4: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/4.jpg)
Methods
• The blood samples are being converted into cell lines, DNA extracted.
• The samples and cell lines are not linked to any individual in the populations studied. However, the samples and cell lines are identified as coming from one of the four populations participating in the study, which raises ethical issues associated with conducting genetic research in named populations.
![Page 5: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/5.jpg)
SNP Nomenclature
• http://snp500cancer.nci.nih.gov/terms_snp_region.cfm
![Page 6: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/6.jpg)
Hardy Weinberg Test
• http://innateimmunity.net/IIPGA2/Bioinformatics/exacthweform
![Page 7: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/7.jpg)
IIPGA
![Page 8: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/8.jpg)
Exact HWE
![Page 9: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/9.jpg)
Fishers Exact Test
![Page 10: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/10.jpg)
Fishers Exact Test
![Page 11: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/11.jpg)
Homework
http://www.hsph.harvard.edu/bioinfocore/Documents/Talk%20slides/Bioinfo_training_August_10_05_tutorial_Niu_T.pdf
![Page 12: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/12.jpg)
SNPcutterhttp://bioinfo.bsd.uchicago.edu/SNP_cutter.htm
![Page 13: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/13.jpg)
SNP and Cancer
• A SNP is defined as a genomic locus where two or more alternative bases occur with appreciable frequency (>1%).
• Occurs every several hundred bases.
• Whole genome SNP analysis is possible.
![Page 14: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/14.jpg)
Applications
• Direct Association Analysis:– Test association between putative functional
variants and disease risk.• Evaluation of nonsynonymous SNPs or regulatory
polymorphisms = functional SNPs.• Problem: there are not that many functional SNPs.• Uncharacterized de novo mutations???
![Page 15: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/15.jpg)
Examples
• 2 MMP9 nonsynonymous SNPs associated with risk of lung cancer with metastasis (Hu et al. 2005b)
• Coding polymorphisms within UGT1A7 predict response of colorectal patients to capecitabine (Carlini et al. 2005).
• Functional MTHFR mutations linked to several different cancers.
![Page 16: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/16.jpg)
Direct Association
• Candidate gene or genomic region.– Linkage analysis– Expression array analysis– Knowledge of development and physiology– Comparative genomics
![Page 17: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/17.jpg)
Tools
• PANTHER database- evolutionary analysis of coding SNPs.
• SNPEffect-estimate likelihood that a particular SNP is causing a functional effect.
• SNPSeek->90 000 coding SNPs in the exons of known genes
• SNP500Cancer – identification, validation, and characterization of polymorphisms.
![Page 18: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/18.jpg)
PANTHERhttp://www.pantherdb.org/tools/csnpScoreForm.jsp
![Page 19: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/19.jpg)
PANTHERhttp://www.pantherdb.org/tools/csnpScoreForm.jsp
![Page 20: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/20.jpg)
PANTHER
![Page 21: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/21.jpg)
ABCA1
![Page 22: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/22.jpg)
PolyPhen
• http://genetics.bwh.harvard.edu/pph/
![Page 23: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/23.jpg)
PolyPhen
• http://genetics.bwh.harvard.edu/pph/
![Page 24: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/24.jpg)
SNPEffecthttp://snpeffect.vib.be/search.php
![Page 25: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/25.jpg)
SNPSeek
![Page 26: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/26.jpg)
Search for BRCA1
![Page 27: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/27.jpg)
Search for BRCA1
![Page 28: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/28.jpg)
Search for BRCA1
![Page 29: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/29.jpg)
Search for BRCA1
![Page 30: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/30.jpg)
SNP500 Cancer Database
![Page 31: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/31.jpg)
SNP500 vs HDP
![Page 32: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/32.jpg)
Test if SNP500 and HDP differ
![Page 33: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/33.jpg)
Do subpopulations differ?
![Page 34: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/34.jpg)
Compare Caucasion vs African
![Page 35: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/35.jpg)
Compare Caucasian vs Hispanic
![Page 36: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/36.jpg)
Test whether in HWE
![Page 37: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/37.jpg)
HWE
![Page 38: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/38.jpg)
HWE
![Page 39: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/39.jpg)
TDT
![Page 40: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/40.jpg)
TDT
![Page 41: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/41.jpg)
HapMap
• Polymorphisms identified by HapMap are likely to be neural in phenotypic effect but can inform on nearby alleles that might play a role in disease.
![Page 42: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/42.jpg)
Haplotype
• SNP alleles tend to be correlated together in a predictable way-known as haplotype.– The linear, LD ordered arrangement of alleles
on a chromosome
• The correlation between SNPs is mediated by linkage disequilibrium (LD).– LD exists when alleles at distinctive loci occur
together more frequently than expected given the known allele frequencies and recombination fraction between the loci.
![Page 43: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/43.jpg)
Disease allele and haplotypes
• In the presence of LD, polymorphisms that are in physical proximity to a causal polymorphism will show a difference between cases and controls.
![Page 44: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/44.jpg)
HapMap
• Three phases:I, II, III• I: completed in October 2005-genotyping
of 1M SNPs at average spacing of 5kb. An additional SNP finding in 48 samples from original populations across 10 specific 500kb ENCODE regions (represent a genome wide rage of evolutionary conservation and gene density). Later this was extended to 269 samples.
![Page 45: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/45.jpg)
HapMap
• Phase II: 269 samples, 2.9 M SNPs were genotyped, a total of 3.9 M.
• Phase III: other populations will be added.
![Page 46: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/46.jpg)
Results of Phase I and Phase II
• Intensity of SNP data across ENCODE regions 1SNP/279 bp.
• Intensity of phase II Hapmap 1SNP/kb
![Page 47: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/47.jpg)
Robust measures of LD
• D’ and r2 are the two major measures of LD.
• D’, if two SNPs have not been separated by recombination during the history of the sample D’ is 1.
• R2 is the correlation between two SNPs; when two SNPs always observed together r2 is 1. Generally is a better measure.
![Page 48: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/48.jpg)
![Page 49: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/49.jpg)
![Page 50: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/50.jpg)
![Page 51: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/51.jpg)
![Page 52: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/52.jpg)
Linkage Studies
• Family-based approaches to identify a disease gene.
• A disease gene segregates in a family, genomic markers in close proximity to the disease will segregate in the same manner due to lack of recombination.– Identify families with disease; genotype each
individual.– Compare the marker allele and disease distributions
within the family. Assign a LOD score.
![Page 53: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/53.jpg)
Linkage studies
• Genome wide scans for linkage analysis performed using several hundred microsatellites at a 10cM density throughout genome.
• SNP-based linkage studies use a panel of 10000 SNPs.
![Page 54: HapMap PROJECT](https://reader034.fdocuments.us/reader034/viewer/2022051821/56815a72550346895dc7d748/html5/thumbnails/54.jpg)
Examples
• Multiple sclerosis
• Neonatal diabetes
• Familial glucocorticoid deficiency.