Methods and challenges in the analysis of admixed human genomes

27
Methods and challenges in the analysis of admixed human genomes Simon Gravel Stanford University

description

Methods and challenges in the analysis of admixed human genomes. Simon Gravel Stanford University. An individual is admixed if its ancestors from G generations ago belong to distinct groups. . Map from National Geographic. Admixed populations are underrepresented in medical genetics. - PowerPoint PPT Presentation

Transcript of Methods and challenges in the analysis of admixed human genomes

Page 1: Methods and challenges in the analysis of admixed human genomes

Methods and challenges in the analysis of admixed human genomes

Simon GravelStanford University

Page 2: Methods and challenges in the analysis of admixed human genomes

Map from National Geographic

An individual is admixed if its ancestors from G generations ago belong to distinct groups.

Page 3: Methods and challenges in the analysis of admixed human genomes

Admixed populations are underrepresented in medical genetics

96% of participants in genome-wide association studies are of European origin, because of • Geography/politics• Europeans are a model organism• “More statistically uniform”

Page 4: Methods and challenges in the analysis of admixed human genomes

Goals

• Develop detailed quantitative models of human evolution

• Understand the impact of gene flow on genetic diversity patterns

• Empower medical association studies

Page 5: Methods and challenges in the analysis of admixed human genomes

Everyone is admixed

Gravel et al, PNAS 108, 11983 (2011)

Yoruba-European

Page 6: Methods and challenges in the analysis of admixed human genomes

Global population structure

Henn*, Gravel*, et al, Hum. Mol. Genet. 19, R221 (2010)

Page 7: Methods and challenges in the analysis of admixed human genomes

Wright-Fisher Simulations

Page 8: Methods and challenges in the analysis of admixed human genomes

Variance in ancestry across individuals:simulations

Page 9: Methods and challenges in the analysis of admixed human genomes

Variance in ancestry across individuals:Models

Page 10: Methods and challenges in the analysis of admixed human genomes

Variance in ancestry across individuals:Models

Page 11: Methods and challenges in the analysis of admixed human genomes

Variance in ancestry across individuals:Models

Page 12: Methods and challenges in the analysis of admixed human genomes

Variance in ancestry across individuals:Models

Page 13: Methods and challenges in the analysis of admixed human genomes

Ancestry Varies Along The Genome

• Local ancestry inferred from phased HapMap3 SNPs

• 88% Native American, 11% European, and 1.5% African segments

Mexican-American

AfricanEuropeanNative AmericanUnassignedAssembly gap

Kidd*, Gravel* et al (in Review)

Page 14: Methods and challenges in the analysis of admixed human genomes

Ancestry Varies Along The GenomeASW1 MXL1African-American Mexican

Kidd*, Gravel* et al (in Review)

Page 15: Methods and challenges in the analysis of admixed human genomes

Modeling ancestry tracts using a Markov model

Each recombination occurs independently, giving rise to a Poisson process.(Work in genetic units)

T1

Gravel S, Genetics 191, 607 (2012)

Page 16: Methods and challenges in the analysis of admixed human genomes

More complex demographic histories can be modeled via multiple-state Markov model

T1

T2

The entire demographic history contained in the transition matrix

Gravel S, Genetics 191, 607 (2012)

Page 17: Methods and challenges in the analysis of admixed human genomes

Markov model vs simulation

Gravel S, Genetics 191, 607 (2012)

Page 18: Methods and challenges in the analysis of admixed human genomes

Evidence for continuous gene flowAfrican-Americans Mexican-Americans

Kidd*, Gravel* et al (in Review)

18 GA 15 GA

Gravel S, Genetics 191, 607 (2012)

Page 19: Methods and challenges in the analysis of admixed human genomes

Summary

• Proposed detailed models of linkage patterns in admixed populations

• Found evidence for continuous gene flow in many populations

• We can learn about recent admixture, and older demography of highly admixed populations

• Results are consistent with historical records

Page 20: Methods and challenges in the analysis of admixed human genomes

Challenges

• Finer scale admixture:– Local ancestry assignment difficult/impossible– Large number of source populations

• Population genetics– Expand single-population, panmictic models to

models with population structure• Medical genetics – Facilitate inclusion of all individuals in medical

association studies

Page 21: Methods and challenges in the analysis of admixed human genomes

Thanks to

• Stanford– Carlos Bustamante– Jake Byrnes– Brenna Henn– Jeff Kidd– Andres Moreno– Patricia Ortiz-Tello– Fouad Zakharia

• NHLBI exome sequencing project

• 1000 Genomes Project

Page 22: Methods and challenges in the analysis of admixed human genomes

Ancestry statistics

• Global Ancestry• Tract Lengths• X-vs-Autosomes• Variance across individuals• Local ancestry fluctuations• Demography of source populations

Kidd*, Gravel* et al (in Review)

Page 23: Methods and challenges in the analysis of admixed human genomes

Modeling the admixture process

Kidd*, Gravel* et al (in Review)

Individuals

Position (Mb)

Page 24: Methods and challenges in the analysis of admixed human genomes

Demography estimates using coalescent approach

Kidd*, Gravel* et al (in Review)2000

Effective population size (x104)

Thousands of years ago

Page 25: Methods and challenges in the analysis of admixed human genomes

25

West African

European

African Americans (ASW)

Analyzing admixed populations

Kidd*, Gravel* et al (in Review)

Europeans

Native Americans

Yorubans

African-Americans

Mexicans

Puerto Ricans

Page 26: Methods and challenges in the analysis of admixed human genomes

Hernan Cortes [1519] North America

Slave Trade [1500-1860]

Spanish Central America

Comparing with slave trade records

Page 27: Methods and challenges in the analysis of admixed human genomes

The simplest model of admixture

T1

time