What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic...

34
What should we study ? Levels of genetic variability - intrapopulational Population structure - interpopulational Geographic distribution of genetic diversity Taxonomic uncertainties – taxonomic and systematic studies Number of species – taxonomic and ecological approaches

Transcript of What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic...

Page 1: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

What should we study ?

• Levels of genetic variability - intrapopulational• Population structure - interpopulational• Geographic distribution of genetic diversity

• Taxonomic uncertainties – taxonomic and systematic studies

• Number of species – taxonomic and ecological approaches

Page 2: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Intrapopulational measures

Page 3: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Why Genetic Diversity

• Genetic diversity is important because it is the raw material on which selection can act, and thus species can respond to selective pressure.

• Majority of low frequency alleles exist in heterozygous states, and there if they are deleterious, their action may be fully or partially masked.

Page 4: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Why Genetic Diversity

• Genetic diversity also plays a role in determining IUCN categories.

• The lower the genetic diversity, the higher the perceived risk of threat.

Page 5: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Measuring Genetic Diversity

• Measures of genetic diversity depend on the data analyzed.

• One set of measures focuses on heterozygositymeasures and is based on diploid, co-dominant markers.

• Other set of measures focuses on allelic information, and or unphased diploid data.

Page 6: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

• Some indexes implemented in Arlequin

Measures of Genetic diversity

Page 7: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Molecular Markers• Sequence data

• Single Nucleotide Polymorphism (SNP) data

• Microsatellite data

• Allozyme data

• Amplified Fragment Lengths Polymorphism (AFLP) data

• Randomly Amplified Polymorphic DNA (RAPD) data

• Hybridization data

• Chromosomal pattern data

Page 8: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Sequence data

Page 9: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Sequence data• Differences in haplotypes are due to point mutations

(transition or transversion types), due to insertions or due to deletions.

• In diploid organisms, differences are also due to recombination.

• Molecular models of evolution dealing with point mutations are very well studied.

Page 10: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Microsatellite data

Page 11: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Microsatellite data

Page 12: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Microsatellite data

Page 13: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Microsatellite data

Template strand

+1 repeat -1 repeat

Slippage

Misalignment

Growing strand

Page 14: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Microsatellite data• Differences in haplotypes are due to unequal crossing

over, or due to slippage in strand replication.

• This class of markers is co-dominant, i.e. heterozygous and both homozygous classes of individuals can be distinguished.

• Fast rate of molecular evolution.

• Models of molecular evolution are not well known.

Page 15: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Allozyme data

Page 16: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Allozyme data• Properties of allozyme data are very similar to

microsatellite data.

Page 17: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

RFLP

Page 18: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

RFLP data• Differences in haplotypes are due to point mutations

(transition or transversion types), due to insertions or due to deletions.

• In diploid organisms, differences are also due to recombination.

• This class of markers is dominant, i.e. heterozygous and homozygous dominant individuals cannot be distinguished.

Page 19: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Chromosomal data

Page 20: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Best Markers• Theoretically the best markers are sequence markers.

• If there is sufficient variation – sufficient sequence length.

• If the differences can be phased.

• And because we have the best models of molecular evolution for these markers.

Page 21: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

HaplotypesSample 1 AAAAASample 2 AAAAASample 3 AGAAASample 4 AGAAASample 5 AGAAGSample 6 AGAAGSample 7 GGAAASample 8 GGAAASample 9 GGGAASample 10 GGGAASample 11 GGGGASample 12 GGGGA

Page 22: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Measuring Genetic DiversitySample 1 AGAACTTCTGSample 2 AGAACTTCTGSample 3 AGAACTTCTGSample 4 AAAA TTTTTGSample 5 AAAA TTTTTGSample 6 AAAATCTTTG

Number of segregating sites– Is the total number of mutations observed in the dataset.

Page 23: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Measuring Genetic DiversitySample 1 AGAACTTCTGSample 2 AGAACTTCTGSample 3 AGAACTTCTGSample 4 AAAA TTTTTGSample 5 AAAA TTTTTGSample 6 AAAATCTTTG

Gene Diversity –Is equivalent to expected heterozygosity for diploid data. It is defined as the probability that any two randomly selected sequences will be different.

Page 24: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Measuring Genetic DiversitySample 1 AGAACTTCTGSample 2 AGAACTTCTGSample 3 AGAACTTCTGSample 4 AAAA TTTTTGSample 5 AAAA TTTTTGSample 6 AAAATCTTTG

Mean number of pairwise differences –Mean number of differences between all pairs of haplotypes in the sample.d = mutational difference, p = allele frequency, k = allele number, n = sample size

Page 25: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Measuring Genetic DiversitySample 1 AGAACTTCTGSample 2 AGAACTTCTGSample 3 AGAACTTCTGSample 4 AAAA TTTTTGSample 5 AAAA TTTTTGSample 6 AAAATCTTTG

NucleotideDiversity –It is computed as the probability that two randomly chosen homologous sites are different.d = mutational difference, p = allele frequency, k = allele number, L = number of loci (allele number)

Page 26: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Measuring Genetic Diversity

• Theta = θ = 4Nµ = 4Nm = 4N(µ+m)• For haploid markers θ = 2Nµ = 2Nm = 2N(µ+m)• The all important population genetic parameter.• It is based on the number of alleles or the number of

different nucleotides in a given sample.• It quantifies genetic diversity of a given population.

Page 27: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Theta (θ) Hom

• The expected homozygosity (Zouros, 1979; Chakraborty and Weiss (1991) in a population at equilibrium between drift and mutation.

• Sensitive to small sample and allele sizes

• For microsat data

Page 28: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Theta (θ) S

• Estimated from the infinite-site equilibrium relationship (Watterson, 1975) between the number of segregating sites (S), the sample size (n) and θ for a sample of non-recombining DNA.

Page 29: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Theta (θ) k

• Estimated from the infinite-allele equilibrium relationship (Ewens, 1972) between the expected number of alleles (k), the sample size (n) and θ.

• 95% confidence limits are calculated as

Sterling number (expansion factor of a factorialFalling factorial

Page 30: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Theta (θ) πˆ

• Estimated from the infinite-site equilibrium (Tajima, 1983) relationship between the mean number of pair-wise differences (πˆ) and theta (θ ).

Page 31: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Why so many θ measures

• Not all methods are suitable for all types of data.• Ultimately all methods should result in the same

estimates of theta.• Differences in estimates can be interpreted as

violations of assumptions, and each method is sensitive to different assumptions.

Page 32: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Tajima’s D

• Tajima’s (1989) D test quantifies the discordance between the estimate of theta from number of segregating sites and from average pair-wise sequence divergence.

Page 33: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Fu’s Fs

• Fu’s (1997) Fs measures the probability of observing a certain number of haplotypes given particular value of θ

Page 34: What should we study - UFSCarevolucao/TGE/Lect02.pdf · Microsoft PowerPoint - Lect 02 Genetic Diversity.ppt Author: Tomas Hrbek Created Date: 3/8/2008 9:21:16 AM ...

Differences in θ measures

• Have selective interpretations.• Have demographic interpretations.