Single-cell RNA sequencing of paclitaxel-treated Abstract ... · Single-cell RNA Sequencing...

1
Abstract # 1229 San Antonio Breast Cancer SymposiumDecember 4-8, 2012 Five3 genomics Cancer treatments act on a population of cells, each of which may experience dierent individual responses to treatment. Such dierential response will result in resistance to treatment even if a majority of cancerous cells are eliminated. To examine dierential cell response, we simultaneously profiled the gene expression and mutation spectrum of individual cells from the MDAMB231 cell line using next generation sequencing of isolated RNA. A total of 23 transcriptomes were characterized from paclitaxel-treated and paclitaxel-surviving cells. We found significant dierent changes in mutation rates between paclitaxel treated cells, with a dose-dependent increase in single nucleotide changes in RNA in paclitaxel-treated cells. Cells undergoing exposure to paclitaxel also showed higher pathway activity in SRC, as well as an integrin switch from ITGB1 to ITGB3. In contrast, cells that survived a high dose of paclitaxel showed an insignificant number of single nucleotide changes, suggesting that these cells either evaded initial paclitaxel exposure or were better able to repair the eects of paclitaxel exposure. Despite the RNA sequence similarity between surviving and untreated cells, there were changes in gene expression and pathway activities including higher PI3K activity. Paclitaxel-surviving cells also showed activation of pathways associated with higher proliferation. Abstract Taxol Treatment Scheme Single-cell RNA Expression and Pathway Analysis This presentation is the intellectual property of the author/presenter. Contact charlie@five3genomics.com for permission to reprint and/or distribute. Single-cell RNA sequencing of paclitaxel-treated breast cancer cell lines to find individual cell response Charles J Vaske 1 , Wendy Lee 2 , Stephen C Benz 1 , John Z Sanborn 1 , Fernando Lopez-Diaz 3 , Beverly M Emerson 3, and Nader Pourmand 2 1 Five3 Genomics, LLC, Santa Cruz, CA, United States, 95060; 2 Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, United States, 95064 and 3 Salk Institute, La Jolla, CA, United States, 92037 MDA-MB-231-Tx100S Propagated cells for future studies 100 µm 100 µm 100 µm Fresh media 5 0 3 6 20 days Drug Free (DF) Ptx [1-100 nM] MTWT F S S 1 15 10 15 MDA-MB-231 x3 500,000 cells/plate 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 ROC for highcoverage DBSNP loci 1Sensitivity Specificity Sp: 0.729, Sn: 0.712 Score: 113 Group A Group B Difference Small variant calling Genes mutated in surviving cells Single-cell RNA Sequencing Coverage Aggregate RNA sequence coverage across known genes shows coverage across entire gene length. The top plot shows the total number of reads that cover any gene starting at the 5’ end of genes. The bottom blot shows aggregate coverage starting from the 3’ end of genes. There is a small 3’ bias towards reads, but solid coverage even within the 5’ end of genes. 0 500 1000 1500 2000 0e+00 3e+06 6e+06 5' aligned distance from 5' end (bp) read depth 0 500 1000 1500 2000 0e+00 3e+06 6e+06 3' aligned distance from 3' end (bp) read depth CDK4 in untreated-4 DNA (population) RNA Cosmic CDK4 in untreated-3 DNA (population) RNA Cosmic ATM in survivor-68 ATM in untreated-3 ATM in survivor-63 Individual cell RNA sequence coverage shows which sections of mRNAs are amplified in the single cell preparation. Typical islands are 500+ bp in length. The coding length of CDK4 is 303 amino acids, while ATM has a coding length of 3056 amino acids. The coverage of RNA in individual cells, or DNA from a population of cells, is shown for each gene. At each location in the coverage plot, the fraction of non- reference bases is indicated by colored lines. The exon structure and PFAM domains are shown at the bottom, above mutations in the Cosmic database. 5 10 15 20 25 30 5000 10000 20000 Exon coverage at >= 50x (megabases) High dose High dose survivors Low dose Untreated High-confidence non reference variants Using a Bayesian variant caller originally developed for DNA and tumor-normal pairs, we called variants in single-sample mode on both the RNA and on a 10x whole genome DNA sequencing of the tumor population. Using the DNA calls at DBSNP loci as the gold standard, we assessed the sensitivity and specificity of the RNA calls at high-coverage (30x) loci, finding ~70% sensitivity and specificity, disregarding the confounding eects of RNA editing. Using the Paradigm pathway analysis method, dierential pathway activities were extracted from the three treatment groups. Paradigm was able to find statistically significant dierential sub-pathways in each condition comparison, something that challenged traditional RNA-seq dierential expression methods such as DEseq. 342 genes were found to have non-silent mutations in at least one of the surviving cells, with no observed variants in the untreated cells’ RNA or in population’s genomic DNA. Examining genes mutated in at least four of the surviving cells found functional motifs for tubulin related proteins, DNA-binding proteins, and proteins involved in inter-cellular signaling. STAT1 (dimer) (complex) CHUK IFNAR2:p-JAK1:STAT2 (complex) CRK-2 CRK STAT2 IFNG IL12RB2 FOXO3 HLA-DRA Antigen/IgE/Fc epsilon R1/LYN/SYK (complex) FCER1G LAT-2 TCR/CD3/MHC II/CD4/LCK/ZAP-70 (complex) FER TCR/CD3/MHC II/CD4/LCK (complex) TCR/CD3/MHC II/CD4 (complex) PRKCE RET51/GFRalpha1/GDNF (complex) PRKCB Src Family Kinases (family) Typical PKCs (family) LYN PDPK1 BTK PI3K (complex) Splice change, not modeled correctly Untreated has higher kinase signaling T-Cell Receptor Signaling Untreated vs. Survivors T-Cell Receptor Signaling Cyclin B/CDC2 (complex) RET51/GFRalpha1/GDNF (complex) STAT6 (dimer) (complex) ITGB3 G2/M transition of mitotic cell cycle (abstract) RET9/GFRalpha1/GDNF (complex) GFRA1 COL1A2 XPO1 Chromosomal passenger complex (complex) CD3G CCL5 SIN3/HDAC complex (complex) CD40/CD40L (trimer)/TRAF2 (trimer) (complex) B-Lymphocyte (abstract) ZBTB17 SAP18 JNK family (family) SKP2 HSPA1A CENPA NEK2 FOXM1 AURKB LAT-2 TCR/CD3/MHC II/CD4/LCK/ZAP-70 (complex) CD247 Lose Integrin Beta 3 Survivors have high proliferation Survivors lose B-lymphocyte-like signature RET tyrosine kinase receptor Transcription factor, discovered via MYC Treated vs. Survivors JNK family (family) ZBTB17 B-Lymphocyte (abstract) CCL5 SRC JUN-FOS (family) JUN/JUN-FOS (complex) JUN EGFR/EGFR/EGF/EGF (complex) L1CAM ACTA1 JUN (dimer) (complex) Fibronectin/alpha4/beta1 Integrin (complex) Integrin alpha4beta1 (VLA-4) (complex) alpha4/beta1 Integrin (complex) ITGB1 alpha1/beta1 Integrin (complex) alpha9/beta1 Integrin (complex) alpha8/beta1 Integrin (complex) AR/T-DHT (complex) HDAC7 Caspase 3 (tetramer) (complex) IL4/IL4R/JAK1/IL2R gamma/JAK3 (complex) IRF4 alpha2/beta1 Integrin (complex) STAT6 (dimer) (complex) Integrin alphaIIb beta3 (complex) ITGB3 alphaIIb/beta3 Integrin (complex) alphav/beta3 Integrin (complex) Insulin Receptor/Insulin (complex) INSR DOK1 AR Integrin Beta Switch B-lymphocyte-like signature in dosed cells Integrin Beta1 Untreated vs. Treated Treated cells show a switch from Integrin Beta 1 to Integrin Beta 3, however Beta 3 is lost in the survivors. Survivors show increased G2/M transition as well as generally higher proliferation, concordant with a resumption of cell cycle after stalling. We find highly dierential mutation rates within the dierent treatment conditions, and a dosage eect between low and high doses of paclitaxel. Surviving cells show a variant rate similar to the untreated cells, indicating hds unt ld hd 0 200 400 600 800 1000 1200 Variants / Megabase that survivors either repaired mutations, or experienced fewer mutations initially. MAP1A - Microtubule-associated protein 1A TTLL5 - Tubulin tyrosine ligase-like family, member 5 TBCD - Tubulin folding cofactor CCT4 - Chaperonin containing TCP1, subunit 4 (delta) CHD1L - Chromodomain helicase DNA binding protein 1-like TOX - Thymocyte selection-associated high mobility group box NFRKB - Nuclear factor related to kappaB binding protein CPA3 - Carboxypeptidase A3 (mast cell) ELFN2 - Extracellular leucine-rich repeat and fibronectin type III CTNND1 - Catenin (cadherin-associated protein), delta 1 Tubulin DNA Inter-cell

Transcript of Single-cell RNA sequencing of paclitaxel-treated Abstract ... · Single-cell RNA Sequencing...

Page 1: Single-cell RNA sequencing of paclitaxel-treated Abstract ... · Single-cell RNA Sequencing Coverage Aggregate RNA sequence coverage across known genes shows coverage across entire

Abstract #1229

San Antonio Breast Cancer Symposium– December 4-8, 2012Five3

g e n o m i c s

Cancer treatments act on a population of cells, each of which may experience different individual responses to treatment. Such differential response will result in resistance to treatment even if a majority of cancerous cells are eliminated. To examine differential cell response, we simultaneously profiled the gene expression and mutation spectrum of individual cells from the MDAMB231 cell line using next generation sequencing of isolated RNA. A total of 23 transcriptomes were characterized from paclitaxel-treated and paclitaxel-surviving cells. We found significant different changes in mutation rates between paclitaxel treated cells, with a dose-dependent increase in single nucleotide changes in RNA in paclitaxel-treated cells. Cells undergoing exposure to paclitaxel also showed higher pathway activity in SRC, as well as an integrin switch from ITGB1 to ITGB3. In contrast, cells that survived a high dose of paclitaxel showed an insignificant number of single nucleotide changes, suggesting that these cells either evaded initial paclitaxel exposure or were better able to repair the effects of paclitaxel exposure. Despite the RNA sequence similarity between surviving and untreated cells, there were changes in gene expression and pathway activities including higher PI3K activity. Paclitaxel-surviving cells also showed activation of pathways associated with higher proliferation.

Abstract Taxol Treatment Scheme

Single-cell RNA Expression and Pathway Analysis

This presentation is the intellectual property of the author/presenter. Contact [email protected] for permission to reprint and/or distribute.

Single-cell RNA sequencing of paclitaxel-treated breast cancer cell lines to find individual cell response

Charles J Vaske1, Wendy Lee2, Stephen C Benz1, John Z Sanborn1, Fernando Lopez-Diaz3, Beverly M Emerson3, and Nader Pourmand2

1Five3 Genomics, LLC, Santa Cruz, CA, United States, 95060; 2Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA, United States, 95064 and

3Salk Institute, La Jolla, CA, United States, 92037

MDA-MB-231-Tx100S

Propagated cells for future studies

100  µm

100  µm100  µm

Fresh media

50 3 6 20days

Drug Free (DF)Ptx

[1-100 nM]

M T W T F SS1 15

10 15

MDA-MB-231

x3500,000 cells/plate

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.2

0.4

0.6

0.8

1.0

ROC for high−coverage DBSNP loci

1−SensitivitySp

ecifi

city

Sp: 0.729, Sn: 0.712Score: 113

Group A

Group B

Difference

Small variant calling

Genes mutated in surviving cells

Single-cell RNA Sequencing Coverage

Aggregate RNA sequence coverage across known genes shows coverage across entire gene length. The top plot shows the total number of reads that cover any gene starting at the 5’ end of genes. The bottom blot shows aggregate coverage starting from the 3’ end of genes. There is a small 3’ bias towards reads, but solid coverage even within the 5’ end of genes.

0 500 1000 1500 2000

0e+0

03e

+06

6e+0

6

5' aligned

distance from 5' end (bp)

read

dep

th

0 500 1000 1500 2000

0e+0

03e

+06

6e+0

6

3' aligned

distance from 3' end (bp)

read

dep

th

CDK4 in untreated-4

DNA(population)

RNA

Cosmic

CDK4 in untreated-3

DNA(population)

RNA

Cosmic

ATM in survivor-68

ATM in untreated-3

ATM in survivor-63

Individual cell RNA sequence coverage shows which sections of mRNAs are amplified in the single cell preparation. Typical islands are 500+ bp in length. The coding length of CDK4 is 303 amino acids, while ATM has a coding length of 3056 amino acids.

The coverage of RNA in individual cells, or DNA from a population of cells, is shown for each gene. At each location in the coverage plot, the fraction of non-reference bases is indicated by colored lines. The exon structure and PFAM domains are shown at the bottom, above mutations in the Cosmic database.

●●

5 10 15 20 25 30

5000

1000

020

000

Exon coverage at >= 50x (megabases)

Hig

h−co

nfid

ence

non−r

efer

ence

SN

Ps ● High doseHigh dose survivorsLow doseUntreated

Hig

h-co

nfide

nce

non

refe

renc

e va

riant

s

Using a Bayesian variant caller originally developed for DNA and tumor-normal pairs, we called variants in single-sample mode on both the RNA and on a 10x whole genome DNA sequencing of the tumor population. Using the DNA calls at DBSNP loci as the gold standard, we assessed the sensitivity and specificity of the RNA calls at high-coverage (30x) loci, finding ~70% sensitivity and specificity, disregarding the confounding effects of RNA editing.

Using the Paradigm pathway analysis method, differential pathway activities were extracted from the three treatment groups. Paradigm was able to find statistically significant differential sub-pathways in each condition comparison, something that challenged traditional RNA-seq differential expression methods such as DEseq.

342 genes were found to have non-silent mutations in at least one of the surviving cells, with no observed variants in the untreated cells’ RNA or in population’s genomic DNA. Examining genes mutated in at least four of the surviving cells found functional motifs for tubulin related proteins, DNA-binding proteins, and proteins involved in inter-cellular signaling.

STAT1 (dimer) (complex)

CHUKIFNAR2:p-JAK1:STAT2 (complex)

CRK-2CRK

STAT2

IFNG

IL12RB2

FOXO3HLA-DRA

Antigen/IgE/Fc epsilon

R1/LYN/SYK (complex)

FCER1GLAT-2

TCR/CD3/MHC II/CD4/LCK/ZAP-70

(complex)

FER

TCR/CD3/MHC II/CD4/LCK (complex)

TCR/CD3/MHC II/CD4

(complex)

PRKCE

RET51/GFRalpha1/GDNF (complex)

PRKCBSrc Family Kinases (family)

Typical PKCs (family)

LYN PDPK1BTKPI3K (complex)

Splice change, not modeled correctly

Untreated has higher kinase signaling

T-Cell Receptor Signaling

Untreated vs. Survivors

T-Cell Receptor Signaling

Cyclin B/CDC2 (complex)

RET51/GFRalpha1/GDNF (complex)

STAT6 (dimer) (complex) ITGB3G2/M transition

of mitotic cell cycle (abstract)

RET9/GFRalpha1/GDNF (complex)

GFRA1COL1A2XPO1

Chromosomal passenger complex

(complex)

CD3G

CCL5SIN3/HDAC complex

(complex)

CD40/CD40L (trimer)/TRAF2

(trimer) (complex)

B-Lymphocyte (abstract)

ZBTB17

SAP18JNK family

(family)

SKP2

HSPA1A

CENPA

NEK2

FOXM1

AURKB

LAT-2

TCR/CD3/MHC II/CD4/LCK/ZAP-70

(complex)

CD247

Lose Integrin Beta 3

Survivors have high proliferation

Survivors lose B-lymphocyte-like signature

RET tyrosine kinase receptor

Transcription factor, discovered

via MYC

Treated vs. Survivors

JNK family (family)

ZBTB17

B-Lymphocyte (abstract)

CCL5

SRC

JUN-FOS (family)

JUN/JUN-FOS (complex)

JUN

EGFR/EGFR/EGF/EGF (complex)

L1CAM

ACTA1

JUN (dimer) (complex)

Fibronectin/alpha4/beta1 Integrin

(complex)

Integrin alpha4beta1

(VLA-4) (complex)

alpha4/beta1 Integrin

(complex)

ITGB1alpha1/beta1 Integrin

(complex)

alpha9/beta1 Integrin

(complex)

alpha8/beta1 Integrin

(complex)

AR/T-DHT (complex)

HDAC7

Caspase 3 (tetramer) (complex)

IL4/IL4R/JAK1/IL2R gamma/JAK3

(complex)

IRF4

alpha2/beta1 Integrin

(complex)

STAT6 (dimer) (complex)

Integrin alphaIIb beta3 (complex)

ITGB3

alphaIIb/beta3 Integrin

(complex)

alphav/beta3 Integrin

(complex)

Insulin Receptor/Insulin

(complex)

INSR

DOK1

AR

Integrin Beta Switch

B-lymphocyte-like signature in dosed cells

Integrin Beta1

Untreated vs. Treated

Treated cells show a switch from Integrin Beta 1 to Integrin Beta 3, however Beta 3 is lost in the survivors. Survivors show increased G2/M transition as well as generally higher proliferation, concordant with a resumption of cell cycle after stalling.

We find highly differential mutation rates within the different treatment conditions, and a dosage effect between low and high doses of paclitaxel. Surviving cells show a variant rate similar to the untreated cells, indicating

hdsunt

ldhd

SNP / Megabase

0 200 400 600 800 1000 1200

Variants / Megabase

that survivors either repaired mutations, or experienced fewer mutations initially. • MAP1A - Microtubule-associated protein 1A

• TTLL5 - Tubulin tyrosine ligase-like family, member 5

• TBCD - Tubulin folding cofactor

• CCT4 - Chaperonin containing TCP1, subunit 4 (delta)

• CHD1L - Chromodomain helicase DNA binding protein 1-like

• TOX - Thymocyte selection-associated high mobility group box• NFRKB - Nuclear factor related to kappaB binding protein• CPA3 - Carboxypeptidase A3 (mast cell)• ELFN2 - Extracellular leucine-rich repeat and fibronectin type III• CTNND1 - Catenin (cadherin-associated protein), delta 1

Tubu

linD

NA

Inte

r-ce

ll