Running CAW with AWS Batch
Hopefully my last talk about CAW ever...

Maxime Garcia @gau @MaxUlysse https://maxulysse.github.io/
2018-01-31
SciLifeLab NGI / BarnTumörBanken

Definition

1/8

What is CAW?

http://opensource.scilifelab.se/projects/caw/

• Nextflow pipeline
• Developed at NGI
• In collaboration with NBIS
• Support of The Swedish Pediatric Tumor Biobank

2/8

What does CAW do?

• WGS analysis (Tumor/Normal pair or Germline)
• Handles both GRCh37 and GRCh38
• Based on GATK best practices for processing FASTQ files
• SNPs, SNVs and indels
  • MuTect1, MuTect2, Strelka, and GATK HaplotypeCaller
• Structural variants
  • Manta
• Heterogeneity, ploidy and CNVs
  • ASCAT
• Containers (portable, reproducible)
  • Docker or Singularity

3/8

Where to use CAW?

• Any POSIX-compatible system
  • Rackham
  • Bianca
  • Irma
• AWS Batch

4/8

CAW with AWS Batch

• A single command line

    nextflow run main.nf -profile awsbatch -w s3://caw-test-results/work \
        --genome smallGRCh37 --sample s3://caw-test-data/tsv/tiny-s3.tsv \
        --outDir s3://caw-test-results/Results

https://maxulysse.github.io/2017/11/16/Running-CAW-with-AWS-Batch/
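
To reuse the command above with other buckets, it can be wrapped in a small script. This is only a sketch, not part of CAW: the variable names (DATA_BUCKET, RESULTS_BUCKET, GENOME), the DRY_RUN switch, and the helper functions are hypothetical, and the defaults are simply the values shown on the slide. By default it only prints the command so it can be checked before anything is submitted to AWS Batch.

```shell
#!/bin/sh
# Sketch: rebuild the CAW AWS Batch command from a few variables.
# All variable and function names are illustrative assumptions.
set -eu

# Defaults are the test buckets and genome from the slide; override as needed.
DATA_BUCKET="${DATA_BUCKET:-s3://caw-test-data}"
RESULTS_BUCKET="${RESULTS_BUCKET:-s3://caw-test-results}"
GENOME="${GENOME:-smallGRCh37}"

# Print the command when DRY_RUN is 1 (the default here), otherwise run it.
run_or_print() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    printf '%s\n' "$*"
  else
    "$@"
  fi
}

# Same options as the single command line on the slide.
caw_awsbatch() {
  run_or_print nextflow run main.nf -profile awsbatch \
    -w "${RESULTS_BUCKET}/work" \
    --genome "${GENOME}" \
    --sample "${DATA_BUCKET}/tsv/tiny-s3.tsv" \
    --outDir "${RESULTS_BUCKET}/Results"
}

caw_awsbatch
```

Setting DRY_RUN=0 in the environment would actually launch the run, assuming Nextflow is installed and the awsbatch profile and AWS credentials are already configured.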

5/8

Going further

• Run a full-size test sample
• Gather reports
• Get a pricing estimate
• Wait for Amazon to finally come to Stockholm

6/8

The List of People Involved

Sebastian DiLorenzo    Markus Mayrhofer
Jesper Eisfeldt        Monica Nistèr
Phil Ewels             Björn Nystedt
Maxime Garcia          Pall Olason
Szilveszter Juhos      Markus Ringnér
Max Käller             Pelin Sahlén
Malin Larsson          Johanna Sandgren
Marcel Martin          Teresita Díaz De Ståhl

7/8

Where to find us?

• We are on the SciLifeLab Slack: #cancer-pipeline
• We have a Gitter channel: https://gitter.im/SciLifeLab/CAW
• Our code is hosted on GitHub: https://github.com/SciLifeLab/CAW

8/8

Any questions?