Cloning and sequencing of a gene encoding acidophilic amylase ...
Gene Technology - 2020 06 - DNA Sequencing 1szilagyl.web.elte.hu/Gene Technology-2020/PDF/06... ·...
Transcript of Gene Technology - 2020 06 - DNA Sequencing 1szilagyl.web.elte.hu/Gene Technology-2020/PDF/06... ·...
-
Gene Technology - 2020 106 - DNA Sequencing
-
Peptide sequencingEdman degradation: Edman, P. et al.; (1950). "Method for determination
of the amino acid sequence in peptides". Acta Chem. Scand. 4: 283–293.
sequential labeling and cleavage of N-terminal residue withoutdisrupting peptide bond between other a.a. residues
Gene Technology - 2020 06 - DNA Sequencing 2
procedure can then be repeated again to identify the next amino acidstep-by step yield 90-95 %problems with long streches ofidentical a.a.-smaximum length: 30-50 a.a.
This approach is not suitable forsequencing nucleic acids
https://doi.org/10.3891/acta.chem.scand.04-0283
-
Peptide sequencer
Gene Technology - 2020 06 - DNA Sequencing 3
-
First generation DNA sequencing
• 3’ or 5’ P32 labeled template, strand separation
• partial, specific modification of bases,
G dimethylsulfate (N7-methylation)
Pu formic acid
Py hydrazin
C hydrazin + NaCl
cleavage of phosphodiester bond at modified bases: hot piperidin
separation of fragments by PAGE
• SV40 (5243 bp)
4Gene Technology - 2020 06 - DNA Sequencing
Does not requries previous sequence information!!!
-
Maxam & Gilbert method
5Gene Technology - 2020 06 - DNA Sequencing
-
Maxam & Gilbert method
6Gene Technology - 2020 06 - DNA Sequencing
-
ENZYMATIC (dideoxy) METHODFrederick Sanger 13 August 1918 – 19 November 2013 1951-52 determination of amino acid sequence of insulin
Nobel Prize in chemistry 1958: for his work on the structure of proteins, especially that of insulin
1977 DNA sequencingNobel Prize in chemistry 1980: Walter Gilbert and Sanger shared
half of the chemistry prize "for their contributions concerning the determination of base sequences in nucleic acids"
7Gene Technology - 2020 06 - DNA Sequencing
-
ENZYMATIC (dideoxy) METHOD
template + primer + DNA polymerase + dNTP/ddNTP ( 1 %)template: single or double stranded DNA, PCR product labeling: primer (5’, [-32P]ATP)
during polymerization ([-35S]dATP)Enzyme: Sequenase (T 7 pol), Taq pol (processivity is important)polymerization: 4 parallel reactions (four ddNTPs)electrophoresis: denaturing PAGE (3 kV, 7M urea, 70 °C)
the product is double strandedoriginal template strand interferes separation
autoradiographymanual evaluation
8Gene Technology - 2020 06 - DNA Sequencing
-
ENZYMATIC (dideoxy) METHOD
„Manual” sequencing
Gene Technology - 2020 906 - DNA Sequencing
-
ENZYMATIC (dideoxy) METHOD
Gene Technology - 2020 1006 - DNA Sequencing
-
Automated fluorescent sequencing
based on enzymatic method
labeling: primer or dNTP
ddNTP 4 different colors: BigDye
polymerization: cyclic sequencing
(linear template amplification)
1 primer, AmpliTaq polymerase
electrophoresis: capillary electrophoresis
synthetic polymer gel (400-700 bases)
online detection (laser)
automatic digital read-out (free software: Chromas)
ABI Prism 310, 373, 377 (PE Biosystems)
integrated sequencing systems (10 kb-1 Mb/day)
11Gene Technology - 2020 06 - DNA Sequencing
-
Gene Technology - 2020 1206 - DNA Sequencing
-
Automated fluorescent sequencing
13Gene Technology - 2020 06 - DNA Sequencing
-
14CHROMASGene Technology - 2020 06 - DNA Sequencing
-
Whole genome sequencing
• Chromosome walking / primer walking
• Requires cloning long, contiguous DNA Gene Technology - 2020 1506 - DNA Sequencing
-
By Commins, J., Toft, C., Fares, M. A. - "Computational Biology Methods and Their Application to the Comparative Genomics of Endocellular Symbiotic Bacteria of Insects." Biol. Procedures Online (2009).
Whole genome shotgun sequencing
Gene Technology - 2020 1606 - DNA Sequencing
-
Assembly of the reads
Gene Technology - 2020 1706 - DNA Sequencing
-
By Commins, J., Toft, C., Fares, M. A. - "Computational Biology Methods and Their Application to the Comparative Genomics of Endocellular Symbiotic Bacteria of Insects." Biol. Procedures Online (2009).
Hierachical shotgun sequencing
Gene Technology - 2020 1806 - DNA Sequencing
-
By Becchamm - Own work, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=17509620
Assemblig the tiling pathend-reads: short sequences at the 5’ and 3’ ends of a DNA fragment
which are unique enough that they (theoretically) exist together only once in a genome
contig: a continuous sequence of DNA assembled from overlappingframents
scaffold:
tiling path:
Gene Technology - 2020 1906 - DNA Sequencing
-
Gene Technology - 2020 2006 - DNA Sequencing
-
Next generation sequencing
Gene Technology - 2020 2106 - DNA Sequencing
-
22
Margulies2005-454 Supplement
Gene Technology - 2020 06 - DNA Sequencing
http://szilagyl.web.elte.hu/SH-2020/Classical%20papers/margulies2005-454.pdfhttp://szilagyl.web.elte.hu/SH-2020/Classical%20papers/Supplement%20for%20454.pdf
-
Steps of the sequencing process• Generation of DNA fragments (nebulizer)
fragment size range: 50-900 bp (mean: 325 bp) fragments blunt-ended and phosphorylated (polishing)T4 DNA polymerase, E. coli DNA polymerase (Klenow fragment),T4 polynucleotide kinase
• Ligation of adapters – binding to beads and sequencing oligogeneration single strands with different sequences at the endsone end allows annealing to complementary oligos immobilized on beadsother end binds sequencing oligo
• Emulsion PCR for amplification on the beads
• Sequencing by synthesis in picoliter volume well
• Lumimetric detection of pyrophosphate release
23Gene Technology - 2020 06 - DNA Sequencing
-
24
• DNA Capture Beads
• N-hydroxysuccinimide ester (NHS)-activated
Sepharose HP
• (5’-Amine-hexa-ethyleneglycol spacers
CCATCTGTTGCGTGCGTGTC-3’)
• hybridization at limiting dilution (A seq.)
only ONE copy of DNA/bead
• Amplification in emulsion (ePCR)
• 0.625 µM forward (complemenary to B)
(5’ - CGTTTCCCCTGTGTGCCTTG-3’)
• 0.039 µM reverse (complemenary to A)
(5’-CCATCTGTTGCGTGCGTGTC-3’)
• „asymmertic” PCR – reverse oligo bound to beads!
ONE MILLION copies/bead
Ligation of different adapters to the ends of fragment
A: CCATCTGTTGCTGCGTGTCCCATCTGTTCCCTCCCTGTCTCAG
B: /5BioTEG/CCTTTCCCCTGTGTGCCTTGCCTATCCCCTGTTGCGTGTCTCAG
Gene Technology - 2020 06 - DNA Sequencing
Avidin / Streptavidin
-
Steps of the sequencing process• Generation of DNA fragments (nebulizer)
fragment size range: 50-900 bp (mean: 325 bp)
fragments blunt-ended and phosphorylated (polishing) T4 DNA polymerase, E. coli DNA polymerase (Klenow fragment) T4 polynucleotide kinase
• Ligation of adapters – binding to beads and sequencing oligogeneration single strands with different sequences at the endsone end allows annealing to complementary oligos immobilized on beads
other end binds sequencing oligo
• Emulsion PCR for amplification on the beads
• Sequencing by synthesis in picoliter volume well
• Lumimetric detection of pyrophosphate release
25Gene Technology - 2020 06 - DNA Sequencing
-
Emulsion PCR• Amplification mix contains
– dNTP, buffer, forward and reverse primer
– Hi-Fi Taq polymerase, thermostable pyrophosphatase
– 1.5 M DNA capture beads – containing single template chain
• Formation of water in oil emulsion
• PCR reaction: amplification 40 cycle
• Breakage of the emulsion
• Second-strand removal, enrichment of beads
Gene Technology - 2020 06 - DNA Sequencing 26
-
Preparation for sequencing
• 44 mm in diameter
• 55 mm in depth
• 75 pl
• 480 wells/mm2
• 1.6 million wells
27
DNA capture beads (25 mm) containingannealed sequencing primer + Bst DNA Polymerase Large Fragment + SSB protein
Dynal enzyme beadsUltraGlow Luciferase and Bst ATP sulfurylasewere prepared as biotin carboxyl carrier protein (BCCP) fusions
Gene Technology - 2020 06 - DNA Sequencing
PicoTiterPlate
-
Pyrosequencing• Bases (TACG) are flown sequentially and always in the same
order (100 times for a large FLX run) across the PicoTiterPlateduring a sequencing run
• An incorporated nucleotide complementary to the template strand generates pyrophosphate.
• A coupled enzyme reaction transforms PPi to ATP
• Firefly luciferase is a light-emitting enzyme catalyses the oxidation of firefly luciferin, requiring oxygen and ATP.
• The light signal is recorded by the CCD camera
• The signal strength is proportional to the number of nucleotide incorporated
Gene Technology - 2020 06 - DNA Sequencing 28
-
Gene Technology - 2020 06 - DNA Sequencing 29
-
Principle of pyrosequencing
Gene Technology - 2020 06 - DNA Sequencing 30
Apyrase added after each cycle
dATP is also a substrate for luciferase
-
Adventages of Roche 454
• long read lengths (700 bp)
• fast operation
• high accuracy
Disadvantages of Roche 454
• error rate increases with theincrease of the length ofpolybase
• high cost
• low throughput
• low scalability
Gene Technology - 2020 06 - DNA Sequencing 31
production discontinued in October 2016
-
Life Technologies - Ion torrentsequencing
32
Ion torrent
Gene Technology - 2020 06 - DNA Sequencing
../Videos/Ion Torrent next-gen sequencing technology - YouTube.mp4
-
Life Technologies - Ion torrentsequencing
• Sample preparation similar to 454 – based on PCR amplification
• Sequencing by synthesis – sequential additon of dNTP-s
• detection of released H+ - CMOS transformed into miniaturepH meter
33
Ion torrent
Gene Technology - 2020 06 - DNA Sequencing
../Videos/Ion Torrent next-gen sequencing technology - YouTube.mp4
-
Sequencing by ligation – AppliedBiosystems
Gene Technology - 2020 06 - DNA Sequencing 34
-
Sequencing by ligation – Applied BiosystemsLigation of adapter sequences
Hybridization to beads
Emulsion PCR
Remove empty beads
Beads are attached to glass plate
Gene Technology - 2020 3506 - DNA Sequencing
-
Sequencing by ligationComponents of the sequencing reactions1. Template (PCR amplified on the beads)
2. Primers complementary to P1
3. 8 nt long probes with dye at 5’ end
4. Ligase
Structure of the probes (16 altogether)
Color coding of 16 probes
Gene Technology - 2020 3606 - DNA Sequencing
-
Sequencing by ligationComponents of the sequencing reactions1. Template (PCR amplified on the beads)
2. Primers complementary to P1
3. 8 nt long probes with dye at 5’ end
4. Ligase
Structure of the probes (16 altogether)
Color coding of 16 probes
Gene Technology - 2020 3706 - DNA Sequencing
-
Steps in sequencing by ligation
1. Primer binds to templatestrands
2. Probe hybridization and ligation
3. Fluorescence measurement
4. Dye end (3) nucleotidescleaved (phosphorothioate; silver or mercury salt)
5. Steps 1-4 repeated 6+ times
6. The synthesized strandswashed away
Process completed 5 times, each time newprimer - offset by -1 base
Gene Technology - 2020 3806 - DNA Sequencing
-
Sequencing by ligation
After 5 ligation cycle we have information on every 5th base!
Repeat the process 4 times, with primers offset by 1 base:
Gene Technology - 2020 3906 - DNA Sequencing
-
Data analysis
We know that the last base of P1 adapter is TThe first base of probe ligated in the 2nd roud (n-1) must be complementary to T = A
The first read of probe n-1 is orange
If first base A and color is orange: second base of the probe is G
The first base of the template is C
A base and a color define the next base in the sequence!SoLiD
Gene Technology - 2020 4006 - DNA Sequencing
../Videos/SOLiD DNA Sequencing.mp4
-
Pacific Biosciences - SMRT
Gene Technology - 2020 06 - DNA Sequencing 41
-
Pacific Biosciences - SMRT
• Single molecule real time sequencing
• Phospholinked fluorescent nucleotides – polymerase cleavesoff label
• Zero Mode Waveguide – nanophotonic visualization chamber– illuminated volume 20 zeptoliter (10-21)
• single polymerase molecule bound to the bottom ofthechamber - real time sequence reading
• Possibility to detect methylated bases
42
SMRT
Gene Technology - 2020 06 - DNA Sequencing
../Videos/Single Molecule Real Time Sequencing - Pacific Biosciences - YouTube.mp4
-
Pacific Biosciences - SMRT
43Gene Technology - 2020 06 - DNA Sequencing
-
Epigenetic analysis by SMRT
Gene Technology - 2020 4406 - DNA Sequencing
-
Helicos - tSMS
45Gene Technology - 2020 06 - DNA Sequencing
-
Helicos - tSMS• True single molecule sequencing – no PCR amplification
• Immobilized oligoT on chip
• polyA tailing of fragmented (~100 -200 base) DNA – last A fluorescent
• Polymerization using virtual terminator nucleotides
• Paralell reading billions ofsequences
• One day – 1000 $ genome
• Bankruptcy in 2012
46Gene Technology - 2020 06 - DNA Sequencing
-
• Q1 pyrophosphatase???
• A1
• Q2 how to get rid of empty beads?
• A2
• DROPLET DIGITAL PCR VIRTUAL SYMPOSIUM, 5 NOVEMBER 2020
• Live from 11:00-17:00 CETBio-Rad’s Droplet Digital PCR System provides ultrasensitive and absolutenucleic acid quantification. This breakthrough technology is particularlyuseful for low-abundance targets, targets in complex backgrounds, allelicvariants (SNPs), and for monitoring subtle changes in target levels thatcannot be detected with real-time PCR.
• https://bio-rad.vfairs.com/en/#droplet-digital
Gene Technology - 2020 06 - DNA Sequencing 47
https://bio-rad.vfairs.com/en/
-
Illumina sequencing
Illumina
Gene Technology - 2020 4806 - DNA Sequencing
../Videos/Illumina Sequencing by Synthesis (Now in 3D).mp4
-
Illumina sequencing
Ligation of different adaptors to endsof DNA fragments (150-200 bp)
Attachment to flowcell
Bridge amplification
Cluster formation
Sequencing with virtualterminator nucleotides
In one cycle 4 bases are addedtogether
After each cycle detection at4 channels
Illumina
Gene Technology - 2020 4906 - DNA Sequencing
../Videos/Illumina Sequencing by Synthesis (Now in 3D).mp4
-
Base reading in Illumina sequencer
Gene Technology - 2020 5006 - DNA Sequencing
4-channel base detection
-
Simplified base detection
• Rather than a separate dye for each base, 2-channel SBS uses a mix of dyes. Images are taken of each DNA cluster using red and green wavelength filter bands
• Clusters seen in red or green images are interpreted as C and T bases, respectively. Clusters observed in both red and green images are flagged as A bases (appearing as yellow clusters), while unlabeled clusters are identified as G bases.
Gene Technology - 2020 06 - DNA Sequencing 51
-
Illumina
Gene Technology - 2020 5206 - DNA Sequencing
-
Gene Technology - 2020 06 - DNA Sequencing 53
-
Oxford Nanopore Technologies
Nanopore Sequencing
Gene Technology - 2020 5406 - DNA Sequencing
-
Introduction to nanopore sensing
A nanopore: a nano-scale hole.
• Biological: a pore-forming protein (e.g. α-Hemolysin) in a membrane (e.g. lipid bilayer)
• Solid-state: in synthetic materials ( e.g. silicon nitride or graphene)
• Hybrid: formed by a pore-forming protein set in synthetic material
Gene Technology - 2020 5506 - DNA Sequencing
-
Nanopore sensing
•Disruption in current detected when analyte passes through the pore or near its aperture.
•Characteristic disruption indentifies the molecule in question.
Ionic current passed through membrane by setting a voltage across the membrane.
Gene Technology - 2020 5606 - DNA Sequencing
-
Nanopore DNA sequencing
• Strand sequencing:
– Sequencing in real-time as the intact DNA polymer passes through the nanopore.
• Exonuclease sequencing:
– Individual nucleotides pass through the nanoporeby the aid of processive exonuclease.
Oxford
Gene Technology - 2020 5706 - DNA Sequencing
../Videos/Movies - News - Oxford Nanopore Technologies.mp4
-
Oxford Nanopore
Gene Technology - 2020 06 - DNA Sequencing 58
-
Strand Sequencing
Snapshot from movie at http://www.nanoporetech.comGene Technology - 2020 5906 - DNA Sequencing
-
Electron-based read out
Four different magnitudes of disruption which can be classified as C, G, A or T
Modified base, e.g. methylatedcytosine, can be directly distinguished from the four standard bases
Gene Technology - 2020 6006 - DNA Sequencing
-
Strand Sequencing
Snapshot from movie at http://www.nanoporetech.com
▪ Hairpin structure:
▪Sense and anti-sense sequencing
▪Advantages in Data Analysis
Gene Technology - 2020 6106 - DNA Sequencing
-
Exonuclease Sequencing
Snapshot from movie at http://www.nanoporetech.comGene Technology - 2020 6206 - DNA Sequencing
-
Exonuclease Sequencing
Snapshot from movie at http://www.nanoporetech.com
▪ Adapter molecule (cyclodextrin):
• Accuracy averaging 99.8%
• Identification of meC
Gene Technology - 2020 6306 - DNA Sequencing
-
Peptide sequencingEdman degradation: Edman, P. et al.; (1950). "Method for determination
of the amino acid sequence in peptides". Acta Chem. Scand. 4: 283–293.
sequential labeling and cleavage of N-terminal residue withoutdisrupting peptide bond between other a.a. residues
Gene Technology - 2020 06 - DNA Sequencing 64
procedure can then be repeated again to identify the next amino acidstep-by step yield 90-95 %problems with long streches ofidentical a.a.-smaximum length: 30-50 a.a.
This approach is not suitable forsequencing nucleic acids
https://doi.org/10.3891/acta.chem.scand.04-0283
-
Working strategy
• GridION system
– Uses single-use, self-contained cartridge.
– Can be used as a single instrument: Node
– Can be used in a cluster, connected through network.
– Low power and space required.
Gene Technology - 2020 6506 - DNA Sequencing
-
Working strategy
• MinION: a miniaturised sensing instrument
– Portable.
– Field-deployable.
– Requires minimal sample prep.
– Compatible with blood serum, plasma and whole blood.
MinION
Gene Technology - 2020 6606 - DNA Sequencing
../Videos/MinION_ A Portable, Real-Time DNA_RNA Sequencing Device.mp4
-
Workflow versatility
• No fixed run time
– Can be run one or more nodes for minutes or days.
– Data analysis takes place in real time.
– Longer run enables collecting more data points.
• Run until... sufficient data
– The GridION system enables users to run an experiment until sufficient data has been collected to reach a predetermined experimental endpoint.
Gene Technology - 2020 6706 - DNA Sequencing
-
Advantages over present sequencing technologies
• No need for expensive and time-consuming mate pair library construction.
• Real-time sequencing strategy.
• No strand amplification needed.
• No bias due to sequencing amplification.
• Low cost: trying to fulfil the target of $1000 per human genome.
• Lager read size: read size is limited only by preparation.
• No requirement for large amounts of high-performance disk storage.
• Large-scale structural variation can be detected at lower depth of coverage.
• Enable long-range haplotyping.
Gene Technology - 2020 6806 - DNA Sequencing
-
Gene Technology - 2020 6906 - DNA Sequencing
-
Single cell sequencing
Gene Technology - 2020 06 - DNA Sequencing 70