CNV detection - Introduction and detection in NGS data


Page 1

CNV detection

G. Demidov

Introduction to the problem

Mechanisms of variation

What is a CNV/CNA

Importance of CNV/CNA detection

Current methods

NGS-based methods: Problems

NGS-based methods: Review of approaches

WGS vs. WES vs. targeted enrichment

Paired-end mapping (WGS)

Split-read-based methods (WGS)

Read-depth methods

Read-depth methods (WGS)

Read-depth methods (WES and hybridization-based panels)

Read-depth methods (Amplicon Capture)

De novo assembly (WGS)

B-Allele Frequency

CNV detection
Introduction and detection in NGS data

G. Demidov (1,2)

1 Genomic and Epigenomic Variation in Disease group, Centre for Genomic Regulation

2 Universitat Pompeu Fabra

NGSchool2016

Page 2

Outline

Introduction to the problem
  Mechanisms of variation
  What is a CNV/CNA
  Importance of CNV/CNA detection
  Current methods
  NGS-based methods: Problems

NGS-based methods: Review of approaches
  WGS vs. WES vs. targeted enrichment
  Paired-end mapping (WGS)
  Split-read-based methods (WGS)
  Read-depth methods
  Read-depth methods (WGS)
  Read-depth methods (WES and hybridization-based panels)
  Read-depth methods (Amplicon Capture)
  De novo assembly (WGS)
  B-Allele Frequency

Page 3

Disclaimer

- There is no “silver bullet” for CNV detection.

- Successful variant detection is only possible with a proper understanding of the situation and of your needs.

- There is a huge pool of methods for CNV detection, but the best and most reliable means of choosing among them and verifying their results is your own knowledge and common sense.

Page 5

Structural variation (SV)

Page 6

Structural variation (SV)

Structural variants comprise unbalanced copy-number variations ≥ 50 bp, including deletions, insertions and duplications, as well as balanced variants such as inversions and translocations.
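The definition above can be sketched in code. This is a minimal illustration and not part of the original slides: the `SV` record, its field names and both helper functions are hypothetical. Only the 50 bp threshold and the balanced/unbalanced split come from the slide, and applying the size threshold to unbalanced events only is an assumption based on how the sentence is worded.

```python
from dataclasses import dataclass

# Event types named on the slide, split by whether they change copy number.
UNBALANCED = {"deletion", "insertion", "duplication"}
BALANCED = {"inversion", "translocation"}
MIN_SV_SIZE = 50  # bp, threshold quoted on the slide


@dataclass
class SV:  # hypothetical record, for illustration only
    kind: str
    size: int  # affected length in bp


def is_structural_variant(v: SV) -> bool:
    """Assumption: the 50 bp threshold is applied to unbalanced events only."""
    if v.kind in BALANCED:
        return True
    return v.kind in UNBALANCED and v.size >= MIN_SV_SIZE


def changes_copy_number(v: SV) -> bool:
    """True for SVs that add or remove sequence (the CNV-like subset)."""
    return is_structural_variant(v) and v.kind in UNBALANCED
```

A 300 bp deletion passes both checks, a 10 bp deletion passes neither, and an inversion is a structural variant that does not change copy number.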

Page 7

Molecular mechanisms leading to structural variant formation

Recurrent structural variants often result from non-allelic homologous recombination (NAHR), which involves recombination between long, highly similar low-copy-number repeats.

Page 8

Molecular mechanisms leading to structural variant formation

Homologous chromosomes are shown in blue and red, and sister chromatids are depicted in the same colour. Low-copy repeats (LCRs, SDs) are shown as white and black arrows.

Genome destabilization by homologous recombination in the germ line, Sasaki et al., Nature Reviews Molecular Cell Biology 11, 182-195 (March 2010)

Page 9

Molecular mechanisms leading to structural variant formation

Genomic insertions can arise from mobile element insertions, i.e. transposable elements inserted by retrotransposition.

Page 10

Molecular mechanisms leading to structural variant formation

DNA-replication-associated template-switching events, involving the fork-stalling and template switching (FoSTeS) and microhomology-mediated break-induced replication (MMBIR) mechanisms.

Page 11

Molecular mechanisms leading to structural variant formation

Chromoanagenesis and cancer: mechanisms and consequences of localized, complex chromosomal rearrangements, Holland et al., Nature Medicine 18, 2012.

Page 12

Molecular mechanisms leading to structural variant formation

Non-homologous end joining (NHEJ) is a process that repairs DNA double-strand breaks in the absence of extensive sequence homology and is often accompanied by the addition or deletion of several nucleotides in the form of a 'repair scar'.

Page 13

Molecular mechanisms leading to structural variant formation

NHEJ is an emergency repair mechanism which involves a “repair or die” chance.
(Chris, biology.stackexchange.com)

Page 14

Molecular mechanisms leading to structural variant formation

Chromothripsis is a phenomenon that seems to involve chromosome shattering leading to numerous breakpoints, followed by error-prone DNA repair. It is observed in both cancer and congenital diseases.

Page 15

Molecular mechanisms leading to structural variant formation

Cancer: When catastrophe strikes a cell. Tubio and Estivill, Nature 470, 476-477 (24 February 2011)

Page 17

Aneuploidy

Aneuploidy is not a CNV.

1. (45, X) - Turner syndrome.

2. In uniparental disomy, both copies of a chromosome come from the same parent (with no contribution from the other parent).

3. Trisomy 21, trisomy 18, trisomy 13 - Down, Edwards and Patau syndromes. Also (47, XXX), (47, XXY), (47, XYY).

4. XXXX, XXYY, XXXXX, XXXXY and XYYYY.

Page 18

What is a CNV/CNA: Different definitions

Page 19

What is a CNV/CNA: Different definitions

- Copy number alterations and copy number aberrations are synonyms.

- Copy number alterations/aberrations (CNAs) are changes in copy number that have arisen in somatic tissue (for example, just in a tumor); copy number variations (CNVs) originate from changes in copy number in germline cells (and are thus in all cells of the organism).

- However, some articles seem to use copy number alterations/aberrations (CNAs) as the term encompassing both germline and somatic copy number changes, and use somatic copy number alterations/aberrations (SCNAs) as the term for somatic copy number changes.

Page 20

Segmental Duplication

Segmental duplications are hotspot regions of the genome in which copy number variations are enriched about fourfold. They range from 1 to 400 kb in length and occur at more than one site within the genome.

- Segmental duplications (SDs), also called low-copy repeats, are large continuous stretches of DNA that map to multiple locations in the genome and share > 90% nucleotide similarity with each other.

- These hotspot regions have an increased rate of chromosomal rearrangement.

- The higher frequencies of SDs within the human population suggest that they are shared duplications that have been fixed in the population rather than recurrent structural mutations.
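One way to make the "> 90% nucleotide similarity" and "1 to 400 kb" criteria concrete, as a rough sketch only: given two copies that are already aligned, compute percent identity over non-gap columns and check the aligned length. The function names, the use of pre-aligned strings and the convention of ignoring gap columns are all assumptions made for illustration; actual SD annotation typically works from genome-wide alignments rather than a single pair of strings.

```python
def aligned_columns(aln_a: str, aln_b: str):
    """Return column pairs where neither sequence has a gap ('-')."""
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    return [(a, b) for a, b in zip(aln_a.upper(), aln_b.upper())
            if a != "-" and b != "-"]


def percent_identity(aln_a: str, aln_b: str) -> float:
    """Percent of non-gap columns with identical bases (one common convention)."""
    cols = aligned_columns(aln_a, aln_b)
    if not cols:
        return 0.0
    return 100.0 * sum(a == b for a, b in cols) / len(cols)


def looks_like_sd(aln_a: str, aln_b: str,
                  min_identity: float = 90.0,              # "> 90%" on the slide
                  min_len: int = 1_000,                    # 1 kb
                  max_len: int = 400_000) -> bool:         # 400 kb
    """Check the slide's two criteria on a pair of aligned copies."""
    n = len(aligned_columns(aln_a, aln_b))
    return min_len <= n <= max_len and percent_identity(aln_a, aln_b) > min_identity
```

Two 2 kb copies that differ at 5% of positions would pass this check, while copies differing at 20% of positions, or copies shorter than 1 kb, would not.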

Page 21

What is a CNV/CNA: When CNV and indel become different

- CNVs: “a segment of DNA that is 1 kb or larger and is present at a variable copy number in comparison with a reference genome”. However, the cutoff of 1 kb is completely arbitrary.

- Based on a functional definition, it may be better to choose an average exon size (∼ 100 bp) as a parameter for defining CNV.

- Recent observations in the Watson and Venter genomes clearly indicate that the CNV size distributions show a marked enrichment in the range of 300 to 350 bp owing to the known retrotransposition-based Alu polymorphisms.
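A small sketch of why the choice of cutoff matters: the same Alu-sized event is labelled differently under the two cutoffs mentioned above. The function name, the dictionary keys and the example size of 320 bp are illustrative assumptions; only the 1 kb and ~100 bp cutoffs and the 300 to 350 bp range come from the slide.

```python
# Size cutoffs (bp) mentioned on the slide; the labels are ours.
CUTOFFS = {"conventional_1kb": 1000, "average_exon": 100}


def call_class(size_bp: int, cutoff_bp: int) -> str:
    """Label a copy-number event as 'CNV' or 'indel' under a given size cutoff."""
    return "CNV" if size_bp >= cutoff_bp else "indel"


alu_like = 320  # bp, hypothetical event inside the 300-350 bp enrichment range
labels = {name: call_class(alu_like, cutoff) for name, cutoff in CUTOFFS.items()}
print(labels)  # {'conventional_1kb': 'indel', 'average_exon': 'CNV'}
```

Under the conventional 1 kb definition the whole Alu-driven peak in the size distribution falls outside the CNV class, which is the practical consequence of the cutoff being arbitrary.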

Page 23

Importance of CNV/CNA detection

Structural variants account for 1.2% of the variation among human genomes, while single nucleotide polymorphisms (SNPs) represent 0.1% (Pang et al., 2010).

Page 24

Are there a lot of CNVs?

(Zarrei et al, Nature, 2015)

Page 25

Are CNVs disease-causing?

- Most CNVs are benign variants that will not directly cause disease.

- Some CNVs found in the general population can be millions of bases in size, affecting numerous genes, yet they have no observable consequence.

- It was estimated that 4.8–9.5% of the genome contributes to CNV, and approximately 100 genes were found that can be completely deleted without producing apparent phenotypic consequences (Zarrei et al, Nature, 2015).

- There are several instances where CNVs that affect critical developmental genes do cause disease. For example, recent reviews have listed 17 conditions of the nervous system alone, including Parkinson's disease and Alzheimer's disease, that can result from CNV. (http://www.gene-quantification.de/cnv-faq.pdf)
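To give a feel for the scale of the 4.8–9.5% estimate, the percentages can be converted into megabases. This is back-of-the-envelope arithmetic only: the genome size of about 3.1 Gb is an assumption that is not stated on the slide, and the function name is made up for the example.

```python
GENOME_SIZE_BP = 3.1e9  # assumption: approximate haploid human genome size


def percent_to_mb(percent: float, genome_bp: float = GENOME_SIZE_BP) -> float:
    """Convert a percentage of the genome into megabases."""
    return percent / 100.0 * genome_bp / 1e6


low_mb = percent_to_mb(4.8)   # lower bound of the estimate
high_mb = percent_to_mb(9.5)  # upper bound of the estimate
print(f"CNV-contributing sequence: roughly {low_mb:.1f} to {high_mb:.1f} Mb")
```

Under this assumed genome size the estimate corresponds to somewhere between about 150 and 300 Mb of sequence.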

Page 26

Are CNVs distributed uniformly?

(Zarrei et al, Nature, 2015)

Page 27

Are CNVs population-specific?

As with all types of genetic variation, CNVs can vary in frequency and occurrence between populations, telling us something of our shared history. As a result of our recent common origin in Africa, the vast majority of copy-number variation, around 89%, is shared among the diverse human populations studied.

Page 28

Importance of CNV/CNA detection: Mendelian disease

Page 29

Importance of CNV/CNA detection: Complex traits

Four examples of CNVs associated with complex traits:

1. a 20-kb deletion upstream of the IRGM gene with Crohn's disease,

2. a 45-kb deletion upstream of NEGR1 with body mass index,

3. a 32-kb deletion that removes two late-cornified envelope genes with psoriasis,

4. a 117-kb deletion of UGT2B17 with osteoporosis.

(Conrad et al, 2013, Nature)

Other well-known connections: Parkinson's disease, Alzheimer's disease, mental retardation, autism, schizophrenia, etc.

Page 30

Importance of CNV/CNA detection: Cancer

Whole-exome sequencing of breast cancer, malignant peripheral nerve sheath tumor and neurofibroma from a patient with neurofibromatosis type 1, John R. McPherson et al., Cancer Medicine, 2015.

Page 31

Importance of CNV/CNA detection: Evolution

Purifying Selection

CNVs are preferentially located outside of genes and ultraconserved elements in the human genome, and a significantly lower proportion of deletions than duplications overlaps with disease-related genes and RefSeq genes.
(Zhang et al, Annu Rev Genomics Hum Genet., 2009)

Page 32

Importance of CNV/CNA detection: Evolution

Gene duplication and Positive Selection

Gene duplication has long been thought to be a central mechanism driving long-term evolutionary changes. Selection has also been shown to shape the architecture of segmental duplications during human genome evolution. CNVs encompassing functional genes can be evolutionarily favored because of their adaptive benefits.

Zhang et al, Annu Rev Genomics Hum Genet., 2009.

Page 33: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

CNA/CNV databases

I Several databases, e.g., the Database of Genomic Variants archive (DGVa), which reports structural variation identified in healthy control samples, have been created to collect SV data (Lappalainen et al., 2013).

I Public data resources have been developed with the purpose of supporting the interpretation of clinically relevant variants, e.g., dbVar, or collecting known disease genes (Online Mendelian Inheritance in Man, OMIM) hit by SVs.

Page 34: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Page 35: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

What does it mean to detect a CNV?

Before choosing a tool for CNV detection, the researcher should understand what he/she wants to detect:

I exons with CN change

I regions with CN change (not necessarily protein-coding)

I panel of genes of interest with CN change

I etc.

Page 36: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

What does it mean to detect a CNV?

One can be interested in:

I population-level CNVs

I rare CNVs

I CNVs/CNAs that happen only in a subclone of the sequenced cells (cancer or prenatal diagnostics)
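
Why subclonal events are harder to detect can be illustrated with a small calculation. The sketch below assumes a diploid baseline, a simple two-population mixture of cells, and no noise or bias; the function name `expected_depth_ratio` and its parameters are illustrative and not taken from any tool.

```python
import math


def expected_depth_ratio(copy_number, cell_fraction, baseline_ploidy=2):
    """Expected read-depth ratio (sample / diploid reference) for a CNV/CNA
    carried by `cell_fraction` of the sequenced cells.

    Assumption: the remaining cells carry `baseline_ploidy` copies and
    depth is proportional to the average copy number across all cells.
    """
    if not 0.0 <= cell_fraction <= 1.0:
        raise ValueError("cell_fraction must be between 0 and 1")
    mean_cn = cell_fraction * copy_number + (1.0 - cell_fraction) * baseline_ploidy
    return mean_cn / baseline_ploidy


# A one-copy deletion present in all, half, or a fifth of the cells.
for fraction in (1.0, 0.5, 0.2):
    ratio = expected_depth_ratio(copy_number=1, cell_fraction=fraction)
    print(f"deletion in {fraction:.0%} of cells: "
          f"ratio={ratio:.2f}, log2={math.log2(ratio):.2f}")
```

Under these assumptions the signal shrinks in proportion to the fraction of cells carrying the event, which is why subclonal CNAs need higher coverage or longer segments to be called.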

Page 37: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

What does it mean to detect a CNV?

The goal can be:

I to identify the CN of genes

I to find breakpoints of CNVs

I to find just some regions with CNVs for further analysis (e.g., tumor purity and clonal structure)

I to identify CNVs associated with traits (for example, level of RNA expression)

I etc.

Page 38: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Locus-specific CNV detection

I MLPA

I fusion amplicon formation

I qPCR, dPCR

I FISH (fluorescence in situ hybridization)

I paralog-ratio testing

I molecular copy number counting

I RFLP (restriction fragment length polymorphism) followed by Southern blot analysis

I long-range PCR

I Sanger sequencing of the fragment of interest

Page 39: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Locus-specific CNV detection

Common features:

I reliable

I take a lot of time and money

I often allow detection of the CNV breakpoints

Page 40: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Genome-wide CNV detection

I NanoString nCounter (a lot of genes at once, but not whole genome)

I aCGH and SNP arrays

“The Agilent Human Genome CGH Microarray is a dual color array containing 60-mer oligonucleotide probes, Distinct Biological Features: 963,029, Probe Spacing: 2.1 KB overall median probe spacing (1.8 KB in Refseq genes)”

“The SNP array platform includes ∼900000 SNP probes and 900000 non-SNP oligonucleotide probes at an average distance of 0.7 Kb”

I NGS-based methods
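
The probe spacings quoted above translate directly into how many probes can support a call of a given size. A rough calculation, assuming evenly spaced probes (real array designs are not uniform); the function name `expected_probes` is illustrative:

```python
def expected_probes(event_size_bp, probe_spacing_bp):
    """Approximate number of probes falling inside an event, assuming
    evenly spaced probes (an idealisation of a real array design)."""
    return event_size_bp / probe_spacing_bp


# Spacings quoted above: 2.1 kb (aCGH, overall median) and 0.7 kb (SNP array, average).
for name, spacing in (("aCGH", 2100), ("SNP array", 700)):
    for size in (10_000, 100_000):
        n = expected_probes(size, spacing)
        print(f"{name}: {size // 1000} kb event ~ {n:.0f} probes")
```

Small events are covered by only a handful of probes, which is one reason array-based calls are usually restricted to larger segments.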

Page 41: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Array-based CNV detection

Common features:

I quite cheap

I has comparatively low resolution

I difficult to find exact breakpoints

Page 42: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Resolution

I aCGH: from 100 kilobases (according to Wikipedia)

I 25-50 kb according to the Affymetrix website

OncoScan FFPE Assay Kit: 50-100 kb copy number resolution in ∼900 cancer genes, 300 kb genome-wide copy number resolution outside of the cancer genes

In practice: “To reduce the number of false positives, parameters were set to consider only imbalances > 75 Kb encompassing at least 80 probe sets.” Bernardini, 2010, Eur J Hum Genet.

I NGS-based methods can potentially detect events of 1 kb and even smaller. NGS and SNP-based arrays have more trouble detecting duplications than aCGH.
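
The filtering rule quoted from Bernardini et al. (imbalances larger than 75 kb that encompass at least 80 probe sets) can be written as a simple predicate. This is only a sketch: the `CnvCall` record and the example calls are hypothetical and do not come from the cited study.

```python
from dataclasses import dataclass


@dataclass
class CnvCall:
    chrom: str
    start: int     # start coordinate, bp
    end: int       # end coordinate, bp (exclusive)
    n_probes: int  # number of probe sets supporting the call

    @property
    def size(self):
        return self.end - self.start


def passes_filter(call, min_size_bp=75_000, min_probes=80):
    """Keep only imbalances larger than `min_size_bp` that are covered by
    at least `min_probes` probe sets (thresholds from the quoted rule)."""
    return call.size > min_size_bp and call.n_probes >= min_probes


# Made-up calls, for illustration only.
calls = [
    CnvCall("chr1", 1_000_000, 1_050_000, 120),  # too small
    CnvCall("chr2", 5_000_000, 5_200_000, 40),   # too few probe sets
    CnvCall("chr7", 2_000_000, 2_300_000, 150),  # passes both criteria
]
kept = [c for c in calls if passes_filter(c)]
print([c.chrom for c in kept])
```

Such a filter trades sensitivity for specificity: the practical resolution becomes the size threshold, not the nominal probe spacing.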

Page 43: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Resolution: Important remark

April 2008 Roche NimbleGen

Page 44: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Resolution: Important remark

Structural variations of DNA greater than 1 kilobase in size account for most bases that vary among human genomes, but are still relatively under-ascertained. Here we use tiling oligonucleotide microarrays, comprising 42 million probes, to generate a comprehensive map of 11,700 copy number variations (CNVs) greater than 443 base pairs, of which most (8,599) have been validated independently.

Origins and functional impact of copy number variation in the human genome, Conrad et al, Nature, 2010.

Page 45: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Resolution: Important remark

I Experimental strategy to discover CNVs greater than 500 base pairs (bp) in individuals with European or West African ancestry

I 20 NimbleGen arrays (median spacing of 56 bp)

I 800 comparative genome hybridization (CGH) experiments with female lymphoblastoid cell-line DNA competed against a common male European reference sample (NA10851)

I The female test DNAs comprised 19 CEU European HapMap individuals, 20 YRI (Yoruba, Nigeria) West Africans, and a Polymorphism Discovery Resource individual (NA15510)

I It was estimated that 40 samples would provide 95% power to sample variants with minor allele frequencies of 5% in either population

Origins and functional impact of copy number variation in the human genome, Conrad et al, Nature, 2010.

Page 46: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Resolution: Important remark II, Venter’s genome

Towards a comprehensive structural variation map of an individual human genome, Pang et al, 2010, Genome Biology.

Page 47: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Page 48: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

NGS-based methods: Problems

“Copy-number variants (CNVs) are considerably more difficult to find – at least using NGS. Why? In a word, length. Today's NGS technologies produce millions upon millions of sequence reads, but they’re mostly relatively short, measuring a few hundred bases in size. It's difficult using such data to piece together the subtle structural variations that distinguish one individual from another, simply because individual reads often are too short to span the variant regions of the genome.” - Jeffrey M. Perkel, biocompare.com

Page 49: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

NGS-based methods: General Problems

I Low coverage and high level of noise

I Events that look like CNVs but are not (e.g., alignment artifacts)

I Batch effects

I Poor quality of extracted DNA

I Errors in the preparation of the experiment

I Low-complexity regions
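
The link between low coverage and noise can be made concrete with a back-of-the-envelope calculation. The sketch below assumes that read counts per window follow a Poisson distribution; real data are usually more dispersed, so this is an optimistic lower bound on the noise. Function names and the example parameters are illustrative.

```python
import math


def reads_per_window(coverage, window_bp, read_len_bp):
    """Expected number of reads starting in a window at a given mean coverage."""
    return coverage * window_bp / read_len_bp


def poisson_cv(expected_reads):
    """Coefficient of variation (sd / mean) of a Poisson-distributed count."""
    return 1.0 / math.sqrt(expected_reads)


for coverage in (1, 5, 30):
    n = reads_per_window(coverage, window_bp=1000, read_len_bp=100)
    print(f"{coverage}x, 1 kb window: ~{n:.0f} reads, CV ~ {poisson_cv(n):.2f}")
```

A heterozygous deletion changes the expected count by 50%, so at 1x coverage with 1 kb windows (about 10 reads, CV about 0.32) a single window is barely informative and several adjacent windows have to agree.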

Page 50: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

NGS-based methods: Problem in CNA

Page 51: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

NGS-based methods: Problem in CNA

Page 52: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

NGS-based methods: Problem in CNA

Page 53: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

NGS-based methods: Problem in CNA

Page 54: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Does sequencing technology influence the CNV detection protocol?

Page 55: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Page 56: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Targeted enrichment

Page 57: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Hybridization vs AmpliSeq

Q: What type of biases may arise?

(from http://www.mdpi.com)

Page 58: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Design of the panel

Q: What types of bias may arise due to different designs?

Page 59: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Pooling strategy

Q: What types of bias may arise due to different designs?

(from BioWatch PCR Assays: Building Confidence, Ensuring Reliability; Abbreviated Version (2015))

Page 60: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Overview of the approaches

Deletion (A), novel sequence insertion (B), inversion (C), and tandem duplication (D) in read count (RC), read-pair (RP), split-read (SR), and de novo assembly (AS) methods.

Tattini et al, Front. Bioeng. Biotechnol., 25 June 2015

Page 61: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Page 62: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Read-pair

I Two different strategies have been used in PEM-based tools to detect SVs/CNVs, namely the clustering approach and the model-based approach.

I The difference is that the clustering approach employs a predefined distance to identify discordant read pairs, while the model-based approach adopts a probability test to discover unusual distances between read pairs in comparison to the distance distribution in the genome.
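
The contrast between the two strategies can be sketched on a toy example. This is a simplification under stated assumptions: real model-based tools test sets of pairs spanning a locus against the library's insert-size distribution, whereas here a per-pair z-score stands in for the probability test. All function names, thresholds and distances below are made up for illustration.

```python
import statistics


def discordant_by_cutoff(insert_sizes, min_size, max_size):
    """Clustering-style rule: a pair is discordant if its mapped distance
    falls outside a predefined [min_size, max_size] interval."""
    return [not (min_size <= s <= max_size) for s in insert_sizes]


def discordant_by_model(insert_sizes, library_mean, library_sd, z_threshold=3.0):
    """Model-style rule: compare each mapped distance with the library's
    insert-size distribution (summarised here by its mean and sd)."""
    return [abs(s - library_mean) / library_sd > z_threshold for s in insert_sizes]


# Made-up mapped distances (bp) of pairs assumed to be concordant.
concordant_sample = [290, 310, 305, 295, 300, 315, 285, 300]
mean = statistics.mean(concordant_sample)
sd = statistics.stdev(concordant_sample)

# The last two pairs mimic pairs spanning a deletion (they map too far apart).
observed = [298, 312, 1450, 1520]
print(discordant_by_cutoff(observed, min_size=200, max_size=400))
print(discordant_by_model(observed, mean, sd))
```

The fixed cutoff is simple but must be chosen per library; the distribution-based rule adapts to the observed insert sizes and can, in principle, pick up smaller shifts.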

Page 63: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Read-pair: Conclusion

I This method is quite reliable.

I It cannot detect exact copy numbers.

I RP algorithms cannot detect the signatures of novel sequence insertions larger than the average insert size.
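
The insert-size limit on novel insertions follows from simple arithmetic. A minimal sketch, assuming both reads must lie entirely in the flanking reference sequence and ignoring partially mapped or soft-clipped reads; function names and numbers are illustrative.

```python
def mapped_span_over_insertion(fragment_len, insertion_len):
    """Distance between the outer ends of a read pair on the reference when
    the sequenced fragment contains an insertion absent from the reference."""
    return fragment_len - insertion_len


def pair_can_span(fragment_len, read_len, insertion_len):
    """True if both reads can sit entirely in flanking reference sequence."""
    return mapped_span_over_insertion(fragment_len, insertion_len) >= 2 * read_len


# 400 bp fragments sequenced with 2 x 100 bp reads.
for insertion in (150, 250, 500):
    print(insertion, pair_can_span(fragment_len=400, read_len=100,
                                   insertion_len=insertion))
```

Pairs spanning a small insertion map closer together than expected; once the insertion approaches the fragment length, no pair can flank it and the read-pair signature disappears.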

Page 64: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Page 65: CNV detection - Introduction and detection in NGS dataCNV detection G. Demidov Introduction to the problem Mechanisms of variation What is a CNV/CNA Importance of CNV/CNA detection

Split-read-based methods (WGS)

I SR methods were conceived for Sanger sequencing reads.

I SR methods start from read pairs in which one read from each pair is aligned to the reference genome uniquely while the other one fails to map or only partially maps to the genome.

I Those unmapped or partially mapped reads potentially provide accurate breakpoints at the single base pair level for SVs/CNVs. SR methods split the incompletely mapped reads into multiple fragments. The first and last fragments of each split read are then aligned to the reference genome independently.
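
The splitting step can be sketched on a toy reference. The example assumes exact, error-free matching with Python string search instead of a real aligner, and considers only deletions; the sequences and function names are made up for illustration.

```python
def find_unique(reference, fragment):
    """Return the position of `fragment` in `reference`, or None if it is
    absent or occurs more than once (non-unique regions are not usable)."""
    first = reference.find(fragment)
    if first == -1 or reference.find(fragment, first + 1) != -1:
        return None
    return first


def split_read_deletion(reference, read, min_anchor=8):
    """Try every split point of the read. Return (deleted_start, deleted_end)
    in reference coordinates (0-based, end exclusive) for the first split
    whose prefix and suffix both map uniquely with a gap in between."""
    for cut in range(min_anchor, len(read) - min_anchor + 1):
        left, right = read[:cut], read[cut:]
        left_pos = find_unique(reference, left)
        right_pos = find_unique(reference, right)
        if left_pos is None or right_pos is None:
            continue
        left_end = left_pos + len(left)
        if right_pos > left_end:
            return left_end, right_pos
    return None


reference = (
    "TTGACCTA"         # bases 0-7, left flank
    "GATTACAGCTGA"     # bases 8-19, covered by the first half of the read
    "CCGGAACCTTGGAAC"  # bases 20-34, deleted in the sample
    "TCGCATGTCAAG"     # bases 35-46, covered by the second half of the read
    "CTA"              # bases 47-49, right flank
)
# A read from a sample in which bases 20-34 are deleted.
read = reference[8:20] + reference[35:47]

print(split_read_deletion(reference, read))  # (20, 35)
```

The read does not map end to end, but its two halves map uniquely on either side of the deleted segment, which gives the breakpoints at base-pair resolution. In repetitive sequence `find_unique` returns None, which mirrors the restriction of SR methods to unique regions.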

Split-read-based methods (WGS): Conclusion

- The SR-based approach relies heavily on read length and is applicable only to unique regions of the reference genome.

- It provides very high resolution (1 bp), but it depends strongly on coverage, and the split reads do not necessarily reveal the CNV/SV (depending on probe preparation).

- It cannot determine exact copy numbers.

Read-depth methods

- The underlying hypothesis of RD-based methods is that the depth of coverage in a genomic region is correlated with the copy number of that region; e.g., a copy-number gain should show a higher read depth than expected.

- Compared to PEM and SR-based tools, RD-based methods can estimate exact copy numbers, which the former cannot because PEM/SR methods use only position information.

- RD-based methods can detect large insertions and CNVs in complex genomic regions, which are difficult to detect using PEM and SR methods.
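
The depth-to-copy-number hypothesis can be written down in a few lines. This is a minimal sketch, assuming a diploid baseline and an already normalized expected depth; the function name `estimate_copy_number` and its parameters are illustrative, not part of any specific tool.

```python
def estimate_copy_number(observed_depth, expected_diploid_depth, ploidy=2):
    """Estimate an integer copy number from read depth.

    Assumes depth scales linearly with copy number, so a region at the
    baseline depth has `ploidy` copies.
    """
    if expected_diploid_depth <= 0:
        raise ValueError("expected depth must be positive")
    ratio = observed_depth / expected_diploid_depth
    return round(ploidy * ratio)


# With an expected diploid depth of 30x:
print(estimate_copy_number(45, 30))  # 3 (single-copy gain)
print(estimate_copy_number(15, 30))  # 1 (heterozygous deletion)
print(estimate_copy_number(31, 30))  # 2 (no change)
```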

Read-depth methods

- Generally, RD-based tools can be classified into three categories depending on the study design: single samples, paired case/control samples (sometimes trios), and large populations of samples.

Q: How does copy number detection differ between these cases?

Read-depth methods

- Basically, RD-based methods follow a four-step procedure to discover CNVs: mapping, normalization, estimation of copy number, and segmentation.

Q: How would you do the normalization? Which important factors need to be taken into account?
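
One factor that is commonly corrected for is GC content, since read depth tends to vary with the GC fraction of a window. The sketch below is one possible answer, not the method of any particular tool: it rescales each window by the median depth of windows with similar GC content. The name `gc_normalize` and the parameter `n_bins` are assumptions made for illustration.

```python
from collections import defaultdict
from statistics import median


def gc_normalize(depths, gc_fractions, n_bins=10):
    """Rescale window depths so that all GC bins share the same median.

    depths:       read depth per window
    gc_fractions: GC fraction (0..1) of each window, same order as depths
    n_bins:       number of equal-width GC bins
    """
    overall = median(depths)

    def gc_bin(gc):
        # Clamp so that gc == 1.0 falls into the last bin.
        return min(int(gc * n_bins), n_bins - 1)

    by_bin = defaultdict(list)
    for depth, gc in zip(depths, gc_fractions):
        by_bin[gc_bin(gc)].append(depth)
    bin_median = {b: median(values) for b, values in by_bin.items()}

    normalized = []
    for depth, gc in zip(depths, gc_fractions):
        m = bin_median[gc_bin(gc)]
        normalized.append(depth * overall / m if m > 0 else 0.0)
    return normalized


# Two low-GC windows and two high-GC windows with doubled raw depth.
depths = [20, 22, 40, 44]
gc = [0.32, 0.33, 0.52, 0.53]
print(gc_normalize(depths, gc))
```

After the correction the high-GC windows no longer look like gains relative to the low-GC windows. Mappability and, for captured data, target efficiency would need similar treatment.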

Circular Binary Segmentation and HMM
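
To make the HMM idea concrete, here is a minimal Viterbi sketch over per-window log2 ratios. It is not Circular Binary Segmentation and not the model of any named tool. The three states, their expected log2 ratios (copy numbers 1, 2 and 3 against a diploid baseline), the noise level `sigma`, and the `stay_prob` transition parameter are all assumptions chosen for illustration.

```python
import math

# Expected log2(depth ratio) per hidden state: log2(1/2), log2(2/2), log2(3/2).
STATES = {"loss": -1.0, "neutral": 0.0, "gain": 0.585}


def viterbi_states(log_ratios, sigma=0.25, stay_prob=0.95):
    """Return the most likely state name for each window."""
    names = list(STATES)
    n = len(names)
    stay = math.log(stay_prob)
    switch = math.log((1.0 - stay_prob) / (n - 1))

    def emit(x, mean):
        # Gaussian log-likelihood up to a constant shared by all states.
        return -((x - mean) ** 2) / (2.0 * sigma ** 2)

    # Uniform prior over states for the first window.
    score = [emit(log_ratios[0], STATES[s]) for s in names]
    back = []
    for x in log_ratios[1:]:
        new_score, pointers = [], []
        for j, s in enumerate(names):
            candidates = [score[i] + (stay if i == j else switch) for i in range(n)]
            best_i = max(range(n), key=lambda i: candidates[i])
            pointers.append(best_i)
            new_score.append(candidates[best_i] + emit(x, STATES[s]))
        score = new_score
        back.append(pointers)

    # Trace back from the best final state.
    state = max(range(n), key=lambda i: score[i])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return [names[i] for i in path]


ratios = [0.02, -0.05, 0.03, 0.60, 0.55, 0.62, 0.58, 0.01, -0.02]
print(viterbi_states(ratios))
```

Consecutive windows with the same state form one segment. The transition penalty is what keeps a single noisy window from being called as its own CNV.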

Others

- Mean-shift-based approaches

- Shifting level model

- Poisson modelling

- Many other, more exotic algorithms

Read-depth methods (WGS)

- Generally, RD-based tools define non-overlapping genomic windows, calculate read depths for these windows, and estimate a copy number for each of them.

Q: How would you do the segmentation into windows?
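
The simplest choice is fixed-size windows. The sketch below counts read start positions per window for one chromosome; it assumes 0-based positions that have already been extracted from the alignments, and the name `window_read_counts` is illustrative only.

```python
def window_read_counts(read_starts, chrom_length, window_size):
    """Count reads per non-overlapping, fixed-size window.

    read_starts:  0-based start positions of mapped reads on one chromosome
    chrom_length: chromosome length in bp
    window_size:  window width in bp (the last window may be shorter)
    """
    n_windows = (chrom_length + window_size - 1) // window_size
    counts = [0] * n_windows
    for pos in read_starts:
        if 0 <= pos < chrom_length:
            counts[pos // window_size] += 1
    return counts


print(window_read_counts([0, 5, 99, 100, 250, 299], chrom_length=300, window_size=100))
# [3, 1, 2]
```

Smaller windows give finer resolution but noisier counts, so the window size is usually tied to the sequencing depth.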

WES-based CNV detection

- The full spectrum of CNVs and breakpoints may not be completely characterized.

- Cross-chromosome events may not be detected.

- In contrast to WGS, WES data have higher depth in the targeted regions, which favors more accurate CNV calling with an RD-based approach.

WES-based CNV detection

- Because capture efficiency differs between targets, the depth in different genomic regions may vary substantially; this must be accounted for in downstream CNV calling.

- Because capture efficiency is inconsistent, some regions may be poorly sequenced, so WES data require pre-processing.

- The assumption of normally distributed read depth may no longer hold because of these biases.

- Because the targeted regions are discontinuous, most CNV breakpoints cannot be detected.
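
A common way to cancel target-specific capture efficiency is to compare each exon with the same exon in a set of reference samples. The sketch below is one such scheme under simplifying assumptions (median scaling per sample, median across references per exon); the name `exon_log2_ratios` and the toy depths are hypothetical.

```python
import math
from statistics import median


def exon_log2_ratios(sample_depths, reference_panel):
    """Log2 ratio of a sample's per-exon depth against a reference panel.

    sample_depths:   depth per exon for the test sample
    reference_panel: list of per-exon depth lists, one per reference sample
    Each sample is first divided by its own median depth (assumed > 0) so
    that samples sequenced to different depths become comparable.
    Exons with no usable signal get None instead of a ratio.
    """
    def scale(depths):
        m = median(depths)
        return [d / m for d in depths]

    sample = scale(sample_depths)
    refs = [scale(r) for r in reference_panel]

    ratios = []
    for i, s in enumerate(sample):
        ref_i = median(r[i] for r in refs)
        if s <= 0 or ref_i <= 0:
            ratios.append(None)  # poorly captured exon: no call
        else:
            ratios.append(math.log2(s / ref_i))
    return ratios


# Exon 2 is captured ten times less efficiently than exon 1 in every sample;
# exon 3 has half the expected depth in the test sample only.
panel = [[100, 10, 200, 50], [200, 20, 400, 100], [150, 15, 300, 75]]
print(exon_log2_ratios([100, 10, 100, 50], panel))
```

The poorly captured exon gets a ratio near 0 because it is equally low in the references, while the exon with half the expected depth gets a ratio near -1.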

WES-based CNV detection: Conclusion

- Cheap

- Not reliable

Amplicon Capture

- Amplicon sequencing data show different biases than WES data.

- The protocols used to prepare amplicon libraries yield high depth of coverage at the expense of coverage homogeneity.

Q: PCR duplicates are typically removed before CNV detection in WES data. Should they be removed from AmpliSeq data?

De novo assembly

- Assembly-based methods first reconstruct contigs by assembling overlapping short reads. Comparing the assembled contigs to the reference genome then identifies the genomic regions with discordant copy numbers.

- This direct assembly of short reads without using a reference is called de novo assembly.

- Assembly can also use a reference genome as a guide to improve its computational efficiency and contig quality.

De novo assembly: Conclusion

- Time-consuming

- Requires quite high coverage

- Has problems with non-unique positions in the genome

B-Allele Frequency

- Used mainly in cancer genomics