How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus,...

44
How complex can it be? The bread wheat genome and its implications for wheat intolerance Manuel Spannagl PGSB, March, 29 th 2019

Transcript of How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus,...

Page 1: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

How complex can it be? The bread wheat genome and its implications for wheat intolerance

Manuel SpannaglPGSB, March, 29th 2019

Page 2: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Outline

o Introduction – what makes the wheat genome so special aka complex?

o Previous wheat WGS/survey sequences

o The bread wheat reference genome sequence

o The Wheat prolamin map

o What’s next?

Page 3: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:
Page 4: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:
Page 5: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Wheat (yield) is threatened by climate change

Page 6: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Wheat is not good for everyone

Celiac disease

Wheat allergyWheat insensitivities

Baker‘s asthma

Page 7: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

>60% of world’s food:

WheatRice SoyMaize

Page 8: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Filling the yield gap – a challenge for agricultural research

• Staple food source for 30% of the world’s population• ~20% of human’s daily consumed calories• 620 million tonnes harvested (2012)• US$200 billion trade volume

2050

2006

+69%Demand in food calories

Page 9: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

The genome of bread wheat (T. aestivum) is …

…large and complex!• allohexaploid: three “diploid‐like” (2n = 6x = 42)• ~17Gb genome size; > 80% repeat content• Highly similar genomes (>97% identity)

TaAA TaBB TaDD

TaAA

TaBB

TaDD

Page 10: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

The genome of bread wheat (T. aestivum) is …

…large and complex!• allohexaploid: three “diploid‐like” (2n = 6x = 42)• ~17Gb genome size; > 80% repeat content• Highly similar genomes (>97% identity)

TaAA TaBB TaDD

TaAA

TaBB

TaDD

Page 11: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

The genome of bread wheat (T. aestivum) is …

…large and complex!• allohexaploid: three “diploid‐like” (2n = 6x = 42)• ~17Gb genome size; > 80% repeat content• Highly similar genomes (>97% identity)

TaAA TaBB TaDD

TaAA

TaBB

TaDD

Page 12: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

The genome of bread wheat (T. aestivum) is …

…large and complex!• allohexaploid: three “diploid‐like” (2n = 6x = 42)• ~17Gb genome size; > 80% repeat content• Highly similar genomes (>97% identity)

TaAA TaBB TaDD

TaAA

TaBB

TaDD

Page 13: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Sequencing complex cereal genome ‐ strategies

c/o Eversole

Page 14: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Why is the assembly so complicated?

Page 15: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Wheat genome sequencing efforts

• 2014: Chromosome‐sorted WGS assembly

• ~100,000 gene modelsidentified (highly similarA,B,D homeologous)

• BUT: highly fragmented, little inter‐genicsequence available

• Newer assembly by:

Page 16: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:
Page 17: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Sequence Assembly Breakthrough!

Novel algorithm for (genome) sequence assembly Uses standard Illumina short reads (at high coverage) Some specific insert library sizes needed Long scaffolds even in presence of high repetitity and large genome sizes

Fast: 14 days for hexaploid wheat genome (17 Gb!)

Page 18: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

• Pseudo‐chromosomeassembly

• ~108,000 HC genemodels

• ...still with gaps andmissing sequence

Page 19: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Finally!! 

A true referencegenome sequence for

bread wheat

Page 20: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

The genes of bread wheat

= ~108,000 gene models

Page 21: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

LTR-retrotransposons

unassigned LTR-retrotransposons

Gypsy LTR-retrotransposons DNA-transposons

Copia LTR-retrotransposons Genes (cds)

Genes = "needles in the haystack"

Page 22: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:
Page 23: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Distinct chromosomal compartments:

Zone 1: the „quite“ neighborhood: centromer/housekeepingZone 2: the „industrial area“: middle partsZone 3: the „innovation quarter“: chromosomeends/high recombination

Page 24: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Genomic distribution of genes

Page 25: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

What is it good for?

Page 26: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:
Page 27: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

First comprehensive map of wheat prolamins

• Within the IWGSC RefSeq v1 annotation, 731 proteins were manually corrected, including 135 proteins that were added as a completely new sequence

• Expressed everywhere e.g.  in roots, leaves, spike, pollen or grain

IWGSC, Science 2018

Page 28: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

The wheat prolamin superfamily

Pfam domains – Prolamin clan CL0482GliadinTryp‐alpha‐amylLTP2Hydrophob_seedProlamin‐like

HMW gluteninsDomainless protein groups

omega gliadinssmall cysteine‐rich proteins

Juhasz et al., Science Advances, 2018

Page 29: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

A genome

Wheat allergy

Celiac disease

Baker’s asthma

The classic prolaminsas immune reactive proteinsJuhasz et al., Science Advances, 2018

Page 30: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Juhàsz et al. 2018, Sci. Adv.

Creation of gene family dataset:• Protein sequences screened for

known domains using HMMER3• protein sequences were clustered by

similarity NCBI blast+• “gene families” were constructed in

the form of orthologous groups usingOrthoFinder

• All protein sequences with a HMWglutenin or a Prolamin clan domainwere extracted 1783 sequences

The Prolamin Clan in Grasses

Page 31: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Hydrophob seed:Cell growth, Root development, defense

LTP_2, Tryp_alpha_amy:Cell growth, SenescenceProlamin-like :Fertilization

Gliadin, Tryp_alpha_amy:Nutrient storage, defense

Juhàsz et al. 2018, Sci. Adv.

Page 32: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Juhàsz et al. 2018, Sci. Adv.

Omega gliadins

HMW glutenins

Page 33: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Seed allergen expression in different environmentsJuhasz et al., Science Advances, 2018

Chinese SpringBjarneBerserk

Low:  15/10°C day/nightNormal: 20/16°C day/nightHigh: 26/20°C day/night

High temperatureIncreasing effect on the celiac disease associated gene expressions –gliadins and glutenins

Low temperatureIncreasing effect on baker’s asthma and food allergy genes

Page 34: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

What‘s next??

Page 35: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

One reference genome is not enough!

Page 36: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Chinese Spring“Gold Standard”

10 Genomes Core and Pan Genomes

100 Genomes

1,000 Genomes

10,000 Genomes

Structural / PAV

Haplotypes

SNPs / Alleles

Page 37: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

• 2 Canadian spring wheat (CDC Landmark, CDC Stanley)• 1 USA (Jagger)  ‐ J. Poland, G. Muehlbauer• 1 German winter wheat variety (Julius) – N. Stein, K. Mayer• 1 Swiss winter wheat variety (Arina) – B. Keller• 2 Australian varieties (Mace, Lancer) – P. Langridge• 1 Japanese variety (Norin61) – K. Shimizu• 2 Chinese varieties (Kenong 9204*) HongQing Ling• Syngenta – (SY Mattis)

Cadenza, Paragon, Kronos, Robigus, Claire

The Ten+ Genomes Project

NRGeneRefSeqAssemblies

W2RAP Assemblies

Page 38: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Genomic Diversity Analysis:‐ 3,819 samples 

(mostly hexaploid)‐ 1,779 SNP markers from 

the 35K and 90K arrays, distributed across all chromosomes

‐ PCA transformed and clustered using k‐means

Harnessing the diversity from global breeding programs 

Arina Julius

MaceWeebilChinese Spring Paragon

Lancer

RobigusClaire

CDC Landmark Jagger

CDC Stanley

Cadenza

Page 39: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Tetraploid wheats – the wild and the domesticated

wild emmer wheatT. turgidum ssp. dicoccoides

• A and B subgenome, genome size ~10 Gb• plus diploid wheats and ancestors + Rye & Spelt

Durum (pasta) wheatT. turgidum ssp. durum 

Page 40: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Perspectives...what else is it good for?

Gene and marker discovery!

Predictive Breeding/genomic selection

Accessing variation 

Reduced genetic variation in modern breeding lines; 

Only 10% of genetic variation in wheat has been captured in 

modern varieties

Comparative Triticeae Genomics – the non‐recombining part of 

the genome, gene regulation/networks

Towards a less allergenic wheat…

Page 41: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Meet the team…

www.wheatgenome.org

PGSBDaniel Lang

Heidrun GundlachThomas LuxIris Fischer

Michael SeidelVerena PradeNadia Kamal

Georg HabererSven Twardziok

Klaus Mayer

Murdoch UniversityAngela Juhasz

Rudi Appels

Norwegian Academy of LifesciencesTetiana Belova

Odd-Arne Olsen

Page 42: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

acknowledgementsAcknowledgements Wheat

Page 43: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Acknowledgements WheatIPKNils SteinMartin Mascher

IPK BITSebastian BeierUwe Scholz

IWGSC: Kellye Eversole, Jane RogersBayer: Catherine FeuilletUniv. of Zurich: Thomas Wicker & Beat KellerUniv. of Udine: Michele Morgante et al.KWS: V KorzunCNRGV: H Berges,A BellecHaifa: A Korol, Z FrenkelKSU: Jesse PolandTGAC: Mario Caccamo et al.USASK: Curtis PozniakU.Tel Aviv: Assaf DistelfeldJohn Innes Center: Cristobal Uauy et alEarlham Institute: Anthony Hall et al.

PGSBHeidrun GundlachThomas LuxIris FischerDaniel LangVerena PradeGeorg HabererMichael SeidelSven TwardziokKlaus Mayer

IEB OlomoucHana SimkovaJaroslav Dolezel

INRAFred ChouletEtienne PauxMichael Alauxet. al.

Murdoch UniversityAngela JuhaszRudi Appels

Norwegian Academy ofLifesciencesTetiana BelovaOdd-Arne Olsen

WheatScan partnersKatharina Scherf

Page 44: How complex can it be? The bread wheat genome and its ... · Cadenza, Paragon, Kronos, Robigus, Claire The Ten+ Genomes Project NRGene RefSeq ... Bayer: Catherine Feuillet Univ. ofZurich:

Thank you for your attention!