Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar...

31
Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun , and Ion Mandoiu University of Connecticut
  • date post

    19-Dec-2015
  • Category

    Documents

  • view

    218
  • download

    0

Transcript of Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar...

Page 1: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Primer Selection Methods for Detection of Genomic

Inversionsand Deletions via PAMP

Bhaskar DasGupta,University of Illinois at Chicago

Jin Jun, and Ion MandoiuUniversity of Connecticut

Page 2: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Outline Introduction

Anchored Deletion Detection

Inversion Detection

Conclusions

Page 3: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Genomic Structural Variation Deletions Inversions

Translocations, insertions, fissions, fussions…

Page 4: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Primer Approximation Multiplex PCR (PAMP) Introduced by [Liu&Carson 2007] Experimental technique for detecting large-scale

cancer genome lesions such as inversions and deletions from heterogeneous samples containing a mixture of cancer and normal cells

Can be used for Tracking how genetic breakpoints are generated during

cancer development Monitoring the status of cancer progression with a highly

sensitive assays

Page 5: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

PAMP detailsA. Large number of multiplex

PCR primers selected s.t. There is no PCR amplification in

the absence of genomic lesions A genomic lesion brings one or

more pairs of primers in the proximity of each other with high probability, resulting in PCR amplification

B. Amplification products are hybridized to a microarray to identify the pair(s) of primers that yield amplification Liu&Carson 2007

Page 6: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Outline Introduction

Anchored Deletion Detection

Inversion Detection

Conclusions

Page 7: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Anchored Deletion Detection Assume that the deletion spans a known genomic

location (anchored deletions)

[Bashir et al. 2007] proposed ILP formulations and simulated annealing algorithms for PAMP primer selection for anchored deletions

Page 8: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Criteria for Primer Selection Standard criteria for multiplex PCR primer

selection Melting temperature, Tm

Lack of hairpin secondary structure, and No dimerization between pairs of primers

Single pair of dimerizing primers is sufficient to negate the amplification [Bashir et al. 2007]

Page 9: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Optimization Objective Multiplex PCR primer set selection

Minimize number of primers and/or multiplex PCR reactions needed to amplify a given set of discrete amplification targets

PAMP primer set selection Minimize the probability that an unknown

genomic lesion fails to be detected by the assay

Page 10: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

PCR Amplification Efficiency Model

0-1 Step model (used in our simulations)

1

0L L+1 Distance between

two primers

PCR amplificationsuccess probability

1

0L Distance between

two primers

PCR amplificationsuccess probability

Exponential decay in amplification efficiency above a certain product length

Page 11: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

pl,r: probability of having a lesion with endpoints, l and r where

Simple model: uniform distribution pl,r=h if r-l>D, 0 otherwise

Function of distance pl,r=f(r-l) e.g. a peak at r-l=d

Function of hotspots High probability around

hotspots e.g. two (pairs of) hotspots

Probabilistic Models for Lesion Location

r

l

D

xmin

r

l

r

l

Hotspots

Hot-spots

r-l=dxmax

pl .rxmin lrxmax 1

h

Page 12: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

PAMP Primer Selection Problem for Anchored Deletion Detection (PAMP-DEL) Given:

Sets of forward and reverse candidate primers, {p1,p2,…,pm} and {q1,q2,…,qn}

Set E of primer pairs that form dimers Maximum multiplexing degrees Nf and Nr, and

amplification length upper-bound L Find: Subset P’ of at most Nf forward and at most

Nr reverse primers such that 1. P’ does not include any pair of primers in E2. P’ minimizes the failure probability

where f(P’;l,r) = 1 if P’ fails to yield a PCR product when the deletion with endpoints (l,r) is present in the sample, and f(P’;l,r) = 0 otherwise.

maxmin ,

,,;'xrlx

rlprlPf

Page 13: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

ILP Formulation for PAMP-DEL

xi’ xi

yj

yj’

(l-1-xi’ )+(yj’ -r-1) = L

5’

5’

3’

3’pi’ pi qj’qj

l

r

xi’ yj’

Deletionanchorl1 r1

l1

r1

maxmin ,

,,;'xrlx

rlprlPf

Failure

f(P’;l,r)=1

(l1-1-xi’ )+(yj’ -r1-1) > L

Page 14: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

ILP Formulation for PAMP-DEL

xi’ xi

yj

yj’

5’

5’

3’

3’pi’ pi qj’qj

l

r

xi’ yj’l2 r2

l2

r2

(l2-1-xi’ )+(yj’ -r2-1) ≤ L Success

f(P’;l,r)=0

0/1 variables fi (ri) to indicate when pi (respectively qi) is selected in P’, fi,j (ri,j) to indicate that pi and pj (respectively qi and qj) are

consecutive primers in P’, ei,i‘,j,j‘ to indicate that both (pi, pi’) and (qj, qj’) are pairs of are

consecutive primers in P’

Deletionanchor

(l-1-xi’ )+(yj’ -r-1) = L

Page 15: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Failure probability

Compatibility constraints

ILP Formulation for PAMP-DEL (2)

Path connecting constraintsNo dimerization constraints

p0pm+1pi pj pk

f0,i fi,j fj,k fi,m+1

. . . . . .

::

::

Max. multiplex degreeconstraints

Page 16: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

PAMP-1SDEL One-sided version of PAMP-DEL in which one of the

deletion endpoints is known in advance Introduced by [Bhasir et al. 2007]

Assume we know the left deletion endpoint Let x1<x2<…<xn be the hybridization positions for the

reverse candidate primers q1,…, qn

Ci,j: probability that a deletion whose right endpoint falls between xi and xj does not result in PCR amplification

ri, ri,j: 0/1 decision variables similar to those in PAMP-DEL ILP

Page 17: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

PAMP-1SDEL ILP

Page 18: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Comparison to Bashir et al. Formulation PAMP-DEL formulation in Bashir et al.

Each primer responsible for covering L/2 bases Covered area by adjacent primers u, v:

dimerization0 L 2L 2.5L 3L

Forward primers l1 l2

Unconveredarea

L/2Forward primers + l1

L/2Forward primers + l2

Failure prob.

1/2

0

}2

||,0max{L

ll vu

Page 19: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

PAMP-DEL Heuristics ITERATIVE-1SDEL

Iteratively solve PAMP-1SDEL with fixed primers from previous PAMP-1SDEL

Fixed Nf (Nr) at each step

INCREMENTAL-1SDEL ITERATIVE-1SDEL but with incremental

multiplexing degrees E.g. k/2k·Nf, (k+1)/2k·Nf, … , Nf

where k is the number of steps

Page 20: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Comparison of PAMP-DEL Heuristics

m=n=Nf=Nr=15, xmax-xmin=5Kb, L=2Kb, 5 random instances

PAMP-DEL ILP can handle only very small problem Both ITERATED-1SDEL and INCREMENTAL-1SDEL solutions

are very close to optimal for low dimerization rates For larger dimerization rates INCREMENTAL-1SDEL detection

probability is still close to optimal

Page 21: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

INCREMENTAL-1SDEL Scalability

L=20Kb, 5 random instances

Page 22: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Outline Introduction

Anchored Deletion Detection

Inversion Detection

Conclusions

Page 23: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Inversion Detection

Page 24: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

PAMP Primer Selection Problem for Inversion Detection (PAMP-INV) Given:

Set P of candidate primers Set E of dimerizing candidate primer pairs Maximum multiplexing degree N and amplification length

upper-bound L

Find: a subset P’ of P such that 1. |P’| ≤ N2. P’ does not include any pair of primers in E3. P’ minimizes the failure probability

where f(P’;l,r)=1 if P’ fails to yield a PCR product when the inversion with endpoints (l,r) is present in the sample, and f(P’;l,r)=0 otherwise.

maxmin

,,;'xrlx

rlprlPf

Page 25: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

ILP Formulation for PAMP-INV

xi xi’

xj

xj’

(l-1-xi)+(r-xj) = L5’

5’

3’

3’pi pi’

pj’pj

l

r l r

l

r

(l-1-xi )+(r-xj) ≤ L Success

f(P';l,r)=0

0/1 variables ei =1 iff pi is selected in P’, ei,j =1 iff pi and pj are consecutive primers in P’, ei,i‘,j,j‘ =1 iff (pi, pi’) and (pj, pj’) are pairs of are consecutive

primers in P’

5’

5’

3’

3’pi

pi’ pj’pj

xixj

f(P';l',r')=1

Page 26: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

ILP Formulation for PAMP-INV (2)

Page 27: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Detection Probability and Runtime for PAMP-INV ILP

PAMP-INV ILP can be solved to optimality within a few hours Runtime is relatively robust to changes in dimerization rate,

candidate primer density, and constraints on multiplexing degree.

xmax-xmin =100Kb

L=20Kb 5 random

instances

Page 28: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Effect of Inversion Length and Dimerization Rate

xmax-xmin=100Kb, L=20Kb, n=30, dimerization rate r between

0 and 20% and N=20

Detection probability is relatively insensitive to Length of Inversion

Page 29: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Outline Introduction

Anchored Deletion Detection

Inversion Detection

Conclusions

Page 30: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Summary ILP formulations for PAMP primer selection

Anchored deletion detection (PAMP-DEL) 1-sided anchored deletion detection (PAMP-1SDEL) Inversion detection (PAMP-INV) Practical runtime for mid-sized PAMP-INV ILP, highly

scalable PAMP-1SDEL ILP

Heuristics for PAMP-DEL based on PAMP-1SDEL ILP Near optimal solutions with highly scalable runtime

Page 31: Primer Selection Methods for Detection of Genomic Inversions and Deletions via PAMP Bhaskar DasGupta, University of Illinois at Chicago Jin Jun, and Ion.

Questions?