Nov 2014 ouellette_windsor_icgc_final
-
Upload
ontario-institute-for-cancer-research -
Category
Science
-
view
544 -
download
2
description
Transcript of Nov 2014 ouellette_windsor_icgc_final
![Page 1: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/1.jpg)
A project status for the International Cancer Genome
Consortium (ICGC).
November 21th 2014
B.F. Francis Ouellette [email protected]
• Senior Scientists & Associate Director,
Informatics and Biocomputing, Ontario Institute for
Cancer Research, Toronto, ON
• Associate Professor, Department of Cell and Systems Biology,
University of Toronto, Toronto, ON.
@bffo on
![Page 2: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/2.jpg)
2
You are free to:
Copy, share, adapt, or re-mix;
Photograph, film, or broadcast;
Blog, live-blog, or post video of;
This presentation. Provided that:
You attribute the work to its author and respect the rights
and licenses associated with its components.
Slide Concept by Cameron Neylon, who has waived all copyright and related or neighbouring rights. This slide only ccZero.
Social Media Icons adapted with permission from originals by Christopher Ross. Original images are available under GPL at;
http://www.thisismyurl.com/free-downloads/15-free-speech-bubble-icons-for-popular-websites
![Page 3: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/3.jpg)
3
But first, a little about me …
… an unfinished story!
![Page 4: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/4.jpg)
4http://goo.gl/v8G57
![Page 5: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/5.jpg)
5http://goo.gl/WKLNr
![Page 6: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/6.jpg)
http://goo.gl/dJIur
![Page 7: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/7.jpg)
http://goo.gl/LwVOZ
![Page 8: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/8.jpg)
http://goo.gl/kTGkG
![Page 9: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/9.jpg)
http://goo.gl/sO5Na
![Page 10: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/10.jpg)
http://goo.gl/LwVOZ
![Page 11: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/11.jpg)
http://goo.gl/QI6aL
![Page 12: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/12.jpg)
http://goo.gl/mYHFO
![Page 13: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/13.jpg)
http://goo.gl/Jc5TK
![Page 14: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/14.jpg)
![Page 15: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/15.jpg)
15
(from the National Centre for Biotechnology Information)
from the National Centre for Biotechnology Information
![Page 16: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/16.jpg)
16
from the National Centre for Biotechnology Information
![Page 17: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/17.jpg)
17
from the National Centre for Biotechnology Information
PANIC
![Page 18: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/18.jpg)
18
![Page 19: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/19.jpg)
19
PANIC
![Page 20: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/20.jpg)
20
PANIC
![Page 21: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/21.jpg)
1998
2001
2004
![Page 22: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/22.jpg)
22
![Page 23: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/23.jpg)
![Page 24: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/24.jpg)
![Page 25: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/25.jpg)
1999
![Page 26: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/26.jpg)
![Page 27: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/27.jpg)
![Page 28: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/28.jpg)
![Page 29: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/29.jpg)
![Page 30: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/30.jpg)
30http://goo.gl/ZKzMV
![Page 31: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/31.jpg)
31http://goo.gl/ZKzMV
![Page 32: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/32.jpg)
32
![Page 33: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/33.jpg)
33
International Cancer Genome Consortium: icgc.org
![Page 34: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/34.jpg)
34
http://www.csb.utoronto.ca/
![Page 35: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/35.jpg)
35
http://bioinformatics.ca/
![Page 36: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/36.jpg)
36
![Page 37: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/37.jpg)
37
http://bioinformatics.ca/workshops/2014
![Page 39: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/39.jpg)
39
CancerA Disease of the Genome
Challenge in Treating Cancer:
Every tumor is different
Every cancer patient is different
![Page 40: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/40.jpg)
40
Johns Hopkins
> 18,000 genes analyzed for mutations
11 breast and 11 colon tumors
L.D. Wood et al, Science, Oct. 2007
Wellcome Trust Sanger Institute
518 genes analyzed for mutations
210 tumors of various types
C. Greenman et al, Nature, Mar. 2007
TCGA (NIH)
Multiple technologies
brain (glioblastoma multiforme), lung (squamous carcinoma), and ovarian (serous cystadenocarcinoma).
F.S. Collins & A.D. Barker, Sci. Am, Mar. 2007
Large-Scale Studies of Cancer Genomes
![Page 41: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/41.jpg)
41
Heterogeneity within and across tumor types
High rate of abnormalities (driver vs
passenger)
Sample quality matters
Consent and controlled data access is
complicated
Lessons learned
![Page 42: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/42.jpg)
42
International Cancer Genome Consortium
• Collect ~500 tumour/normal pairs from each of 50 different major
cancer types;
• Comprehensive genome analysis of each T/N pair:
– Genome
– Transcriptome
– Methylome
– Clinical data
• Make the data available to the research community & public.
Identify
genome
changes
…GATTATTCCAGGTAT… …GATTATTGCAGGTAT… …GATTATTGCAGGTAT…
![Page 43: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/43.jpg)
43
Rationale for the ICGC
• The scope is huge, such that no country can do it all.
• Coordinated cancer genome initiatives will reduce
duplication of effort for common and easy to acquire
tumor samples and and ensure complete studies for many
less frequent forms of cancer.
• Standardization and uniform quality measures across
studies will enable the merging of datasets, increasing
power to detect additional targets.
• The spectrum of many cancers varies across the
world for many tumor types, because of environmental,
genetic and other causes.
• The ICGC will accelerate the dissemination of genomic
and analytical methods across participating sites, and
the user community
![Page 44: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/44.jpg)
44
International Cancer Genome Consortium
(ICGC)Goals
• Catalogue genomic abnormalities in tumors in 50 different cancer types and/or subtypes of clinical and societal importance across the globe
• Generate complementary catalogues of transcriptomic and epigenomic datasets from the same tumors
• Make the data available to research community rapidly with minimal restrictions to accelerate research into the causes and control of cancer
50 tumor types and/or subtypes
500 tumors + 500 controls per subtype
50,000 Human Genome Projects!
Nature (2010) 464:993
![Page 45: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/45.jpg)
45
ICGC
Goals, Structure,
Policies & Guidelines
http://goo.gl/sPGLQN
![Page 46: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/46.jpg)
46
Primary Goal: coordinate efforts to
reach goals (50 tumours)
![Page 47: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/47.jpg)
47
http://docs.icgc.org/dcc-data-element-specifications
![Page 48: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/48.jpg)
48
Primary Goal: be comprehensive
http://goo.gl/BE7KH1
![Page 49: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/49.jpg)
49
Analysis Data Types
• Germline variants (SNPs)
• Simple Somatic Mutations (SSM)
• Copy Number Alterations (CNA)
• Structural Variants (SV)
• Gene Expression (micro-arrays and RNASeq)
• miRNA Expression (RNASeq)
• Epigenomics (Arrays and Methylation)
• Splicing Variation (RNASeq)
• Protein Expression (Arrays)
![Page 50: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/50.jpg)
50
Primary Goal: generate highest quality
http://goo.gl/FXCvi9
![Page 51: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/51.jpg)
51
![Page 52: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/52.jpg)
52
Primary Goal: available to all
![Page 53: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/53.jpg)
53
Primary Goal: available to all
![Page 54: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/54.jpg)
54
• Detailed Phenotype and Outcome data
Region of residence
Risk factors
Examination
Surgery
Radiation
Sample
Slide
Specific histological features
Analyte
Aliquot
Donor notes
• Gene Expression (probe-level data)
• Raw genotype calls
• Gene-sample identifier links
• Genome sequence files
ICGC Controlled
Access Datasets
• Cancer Pathology
Histologic type or subtype
Histologic nuclear grade
• Patient/Person
Gender, Age range,
Vital status, Survival time
Relapse type, Status at follow-up
• Gene Expression (normalized)
• DNA methylation
•Computed Copy Number and
Loss of Heterozygosity
• Newly discovered somatic variants
ICGC OA
Datasets
http://goo.gl/w4mrV
![Page 55: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/55.jpg)
55
Secondary Goal: coordinate
work to benefit productivity
http://goo.gl/K5mHC3
![Page 56: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/56.jpg)
56
https://icgc.org/icgc/committees-and-working-groups
![Page 57: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/57.jpg)
57
Secondary Goal: disseminate knowledge
http://goo.gl/ObcZXy
![Page 58: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/58.jpg)
58
ICGC
Goals, Structure,
Policies & Guidelines
http://goo.gl/sPGLQN
![Page 59: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/59.jpg)
59
Policy
ICGC membership implies compliance with Core
Bioethical Elements for samples used in ICGC
Cancer Projects:
http://goo.gl/TFrCmK
http://goo.gl/nYx6YG
![Page 60: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/60.jpg)
60
POLICY:
The members of the International Cancer Genomics
Consortium (ICGC) are committed to the principle of
rapid data release to the scientific community.
http://goo.gl/TFrCmK
![Page 61: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/61.jpg)
61
Publication Policy
• The individual research groups in
the ICGC are free to publish the
results of their own efforts in
independent publications at any
time (subject, of course, to any
policies of any collaborations in
which they may be participating).
![Page 62: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/62.jpg)
62
Moratorium: http://www.icgc.org/icgc/goals-structure-policies-guidelines/e3-publication-policy
![Page 63: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/63.jpg)
63
Publication Policy
![Page 64: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/64.jpg)
64
Where do you find that information?
• We actually make it hard to find, but we are
working on that! (this is an example of where ICGC
would like to do what TCGA does!)
• http://cancergenome.nih.gov/publications/publicatio
nguidelines
![Page 65: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/65.jpg)
65
Where do you find that information?
For ICGC data:
• Need to find the policy!• http://icgc.org/icgc/goals-structure-policies-
guidelines/e3-publication-policy
• Find text:
• Find date: in README on FTP file
• This is bad, we know it, and we are fixing it!
• In doubt, contact us: [email protected]
![Page 66: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/66.jpg)
66
Policy on Intellectual Property
• All ICGC members agree not to make claims to
possible IP derived from primary data (including
somatic mutations) and to not pursue IP
protections that would prevent or block access to
or use of any element of ICGC data or conclusions
drawn directly from those data.
http://goo.gl/TCMXCl
![Page 67: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/67.jpg)
67
ICGC Map – May 201472 projects launched
![Page 68: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/68.jpg)
68
DCC ActivitiesDCC activities are split between two groups:
• Software Development
– DCC portal
– Submission tool
• Biocuration (which also includes Content
Management)
– Data level management
– Submitter “handling”
– Coordination with secretariat
– User support
http://dcc.icgc.org/team68
![Page 69: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/69.jpg)
69
Data
ValidationValidationValidation(dictionary)
Validation(across fields)
Validation(across fields)
Validation(across fields)
indexing
Happy Users
http://goo.gl/1EcyR
![Page 70: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/70.jpg)
70
http://docs.icgc.org/methods
![Page 71: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/71.jpg)
71
http://docs.icgc.org/dcc-data-element-specifications
![Page 72: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/72.jpg)
72
ICGC Biocuration
• Helping submitters get their data to ICGC
• Progress reporting (data audit)
• Quality checks (coverage, correctness, etc.)
• Helping users get to the data
• Validate and check (and recheck) metadata on public
repositories
• Test and integrate with other public repositories via
standard data formats, ontologies.
• Documentation, documentation, and more documentation
• Training
72
![Page 73: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/73.jpg)
73
ICGC datasets to date
ICGC Data Portal Cumulative Donor Count for Member Projects
2000
4000
6000
8000
10,000
12,000
14,000
0
Number of
Donors
Release 7
Release 8
Release 9
Release 10
Release 11
Release 12Release 13
Release 14
Release 15
Release 16Release 17
![Page 74: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/74.jpg)
•Cancer types: 50
•Body sites: 18
•Donors: 12,232
•Specimens: 24, 661
•Simple somatic mutations: 9,871,477
•Mutated genes: 57,526
ICGC dataset version 17
Sept 11th 2014
![Page 75: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/75.jpg)
75
Clinical Data Completeness
Donor interval of last followup
Donor Tumour stage at diagnosis
Donor Tumour staging system at diagnosis
Donor diagnosis ICG10
DonorFields
Donor survival time
Donor Tumour stage at diagnosis supplemental
Donor relapse interval
Donor age at last followup
Donor relapse type
Donor age at diagnosis
Disease status last followup
Donor region of residence
Donor sex
Donor ID
Donor vital status
Average Percentage Completeness
Overall Donor Clinical Data Completeness
![Page 76: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/76.jpg)
76
Clinical Data Completeness
Donor interval of last followup
Donor Tumour stage at diagnosis
Donor Tumour staging system at diagnosis
Donor diagnosis ICG10
DonorFields
Donor survival time
Donor Tumour stage at diagnosis supplemental
Donor relapse interval
Donor age at last followup
Donor relapse type
Donor age at diagnosis
Disease status last followup
Donor region of residence
Donor sex
Donor ID
Donor vital status
Average Percentage Completeness
Overall Donor Clinical Data Completeness
![Page 77: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/77.jpg)
77
Clinical Data Completeness
Overall Specimen Clinical Data Completeness
Level of cellularity
Percentage cellularity
Digital Image of Stained Section
Tumour Stage Supplemental
Tumour Stage
Tumour Stage System
Tumour Grade Supplemental
Tumour Grade
Tumour Grading System
Tumour Histological Type
Specimen available
Specimen Biobank ID
Specimen Biobank
Tumour confirmed
Specimen storage other
Specimen storage
Specimen processing other
Specimen processing
Specimen donor treatment type
Specimen Interval
Specimen type
Specimen type other
Specimen ID
Donor ID
Specimen donor treatment type other
SpecimenFields
0 20 40 60 80
Average Percentage Completeness
10 30 50 70 90 100
![Page 78: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/78.jpg)
78
Clinical Data Completeness
Overall Specimen Clinical Data Completeness
Level of cellularity
Percentage cellularity
Digital Image of Stained Section
Tumour Stage Supplemental
Tumour Stage
Tumour Stage System
Tumour Grade Supplemental
Tumour Grade
Tumour Grading System
Tumour Histological Type
Specimen available
Specimen Biobank ID
Specimen Biobank
Tumour confirmed
Specimen storage other
Specimen storage
Specimen processing other
Specimen processing
Specimen donor treatment type
Specimen Interval
Specimen type
Specimen type other
Specimen ID
Donor ID
Specimen donor treatment type other
SpecimenFields
0 20 40 60 80
Average Percentage Completeness
10 30 50 70 90 100
![Page 79: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/79.jpg)
79
DACO
ICGC
cgHUB
EGA
TCGA
BAM
Open
OpenBA
M
Germ
Line
+ EGA id
BA
MBA
M
ERA
![Page 80: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/80.jpg)
ICGCBAM/FASTQ
TCGABAM/FASTQ
ICGCOpen
Data
(includes
TCGA
Open Data)
COSMICOpen
Data
![Page 81: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/81.jpg)
81
Raw Data Availability at EGA by Project and Data Type
• https://www.ebi.ac.uk/ega/organisations/EGAO00000000024
![Page 82: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/82.jpg)
82
![Page 83: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/83.jpg)
83
![Page 84: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/84.jpg)
84
![Page 85: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/85.jpg)
85
Select “Bladder Cancer – China”
![Page 86: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/86.jpg)
86
Select “Pancreatic cancer – Canada”
![Page 87: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/87.jpg)
87
… But where is the data?
![Page 88: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/88.jpg)
88
![Page 89: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/89.jpg)
89
http://dcc.icgc.org/
![Page 90: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/90.jpg)
90
![Page 91: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/91.jpg)
91
![Page 92: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/92.jpg)
92
Highlights of the new portal: dcc.icgc.org
• Faceted searches capabilities for variants, genes and
donors
– Interactive data exploration fast and easy
• Mutation aggregation & counts across donors and cancers
– # of pancreatic cancers donors with mutation KRAS G12D
• Standardized gene consequence across all projects
• Genome browser
• Data doewnload
• Protein domains
• Links to repositories
![Page 93: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/93.jpg)
93
KRAS search
![Page 94: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/94.jpg)
94
• Summary
• Cancer type distribution
• Other links (Cosmic, Entrez, etc)
• Mutation profile in protein
• Domains
• Genomic Context
• Mutation profile
• Most common mutations
![Page 95: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/95.jpg)
95
http://dcc.icgc.org/genes/ENSG00000133703
![Page 96: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/96.jpg)
96
![Page 97: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/97.jpg)
97
![Page 98: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/98.jpg)
98
![Page 99: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/99.jpg)
99
Donor• Donor ID
• Primary site
• Cancer Project
• Gender
• Tumor Stage
• Vital Status
• Disease Status
• Release type
• Age at diagnosis
• Available data types
• Analysis types
![Page 100: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/100.jpg)
100
Genes
![Page 101: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/101.jpg)
101
Mutations• Consequences
• Type
• Platform
• Verification status
![Page 102: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/102.jpg)
102
Exporting data
![Page 103: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/103.jpg)
103
Exporting data
![Page 104: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/104.jpg)
104
![Page 105: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/105.jpg)
105
Exporting data
![Page 106: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/106.jpg)
106
Can do bulk download of the data …
![Page 107: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/107.jpg)
107
BIGDATA
ValidationValidationRAW
DATA
MetaDATA
Interpreted data
✔
✔
✔
✔
✔
![Page 108: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/108.jpg)
108
DACO
ICGC
dbGaP
EGA
TCGA
BAM
Open
Open
ERA
BA
M
Germ
Line
+ EGA id
BA
MBA
M
![Page 109: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/109.jpg)
109
ICGC Data Categories
ICGC Open Access Datasets ICGC Controlled Access Datasets
Cancer Pathology
Histologic type or subtype
Histologic nuclear grade
Donor
Gender
Age range
RNA expression (normalized)
DNA methylation
Genotype frequencies
Somatic mutations (SNV,
CNV and Structural
Rearrangement)
Detailed Phenotype and Outcome Data
Patient demography
Risk factors
Examination
Surgery/Drugs/Radiation
Sample/Slide
Specific histological features
Protocol
Analyte/Aliquot
Gene Expression (probe-level data)
Raw genotype calls (germline)
Gene-sample identifier links
Genome sequence files
Most of the data in the portal is publically available without restriction. However,
access to some data, like the germline mutations, requires authorization by the Data
Access Compliance Office (DACO)
![Page 110: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/110.jpg)
![Page 111: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/111.jpg)
http://icgc.org/daco
![Page 112: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/112.jpg)
112
• Detailed Phenotype and Outcome data
Region of residence
Risk factors
Examination
Surgery
Radiation
Sample
Slide
Specific histological features
Analyte
Aliquot
Donor notes
• Gene Expression (probe-level data)
• Raw genotype calls
• Gene-sample identifier links
• Genome sequence files
ICGC Controlled
Access Datasets
• Cancer Pathology
Histologic type or subtype
Histologic nuclear grade
• Patient/Person
Gender, Age range,
Vital status, Survival time
Relapse type, Status at follow-up
• Gene Expression (normalized)
• DNA methylation
•Computed Copy Number and
Loss of Heterozygosity
• Newly discovered somatic variants
ICGC OA
Datasets
http://goo.gl/w4mrV
![Page 113: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/113.jpg)
Identify
yourselfFill out detail form which
includes:
• Contact and Project
Information
•Information Technology
details and procedures
for keeping data secure
•Data Access Agreement
All of these
documents are
put into a PDF
file that you
print and get your
institution to sign
off on your behalf
![Page 114: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/114.jpg)
![Page 115: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/115.jpg)
‹#›
![Page 116: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/116.jpg)
‹#›
![Page 117: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/117.jpg)
‹#›
![Page 118: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/118.jpg)
‹#›
![Page 119: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/119.jpg)
‹#›
![Page 120: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/120.jpg)
‹#›
DACO approved projects:
> 160 groups - 75% academic
(> 870 people)
![Page 121: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/121.jpg)
121121
Nature 409:452
Bioinformatics Citizenship: What it means,
and what does it cost?
![Page 122: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/122.jpg)
122
Important messages:
• The ICGC portal is evolving and getting better all
the time
• Lots of data provided by the ICGC
• Important to be good citizens of the scientific world
• The idea behind all of this is to provide tools to
help cure cancer
• Need to respect policies and guidelines
• There is help out there, and user feedback is
*always* welcome.
![Page 123: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/123.jpg)
123
DCC Software
Developer
Vincent Ferretti
Daniel Chang
Anthony Cros
Jerry Lam
Brian O'Connor
Bob Tiernay
Stuart Watt
Shane Wilson
Junjun Zhang
Acknowledgments
ICGC Project leaders at the OICR:
Tom Hudson
John McPherson
Lincoln Stein
Jared Simpson
Paul Boutros
Vincent Ferretti
Francis Ouellette
Jennifer Jennings
Ouellette Lab
Michelle Brazas
Emilie Chautard
Nina Palikuca
Zhibin Lu
Web Dev
Joseph Yamada
Angela Chao
Daniel Gross
Kamen Wu
Kim Cullion
Miyuki Fukuma
Wen Xu
Pipeline Development
& Evaluation
Morgan Taschuk
Michael Laszloffy
Peter Ruzanov
ICGC DCC Biocuration
Hardeep Nahal
Marc Perry
http://oicr.on.ca http://icgc.org
… and all the patients and their
families that that are putting their
hopes into our work!
Research IT/Systems
David Sutton,
Bob Gibson
Sam Maclennan
David Magda
Rob Naccarato
Brian Ott
Gino Yearwood
EGA
Justin Paschall
Jeff Almeida-King
Ilkka Lappalainen
Jordi Rambla De Argila
Marc Sitges Puy
![Page 124: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/124.jpg)
124Informatics and Biocomputing at the OICR
![Page 125: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/125.jpg)
125
http://oicr.on.ca/careers
![Page 126: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/126.jpg)
126
![Page 127: Nov 2014 ouellette_windsor_icgc_final](https://reader034.fdocuments.us/reader034/viewer/2022042715/5594435f1a28ab06308b46da/html5/thumbnails/127.jpg)
127
http://icgc.org
http://dcc.icgc.org
http://docs.icgc.org
@bffo
Video tutorial: https://vimeo.com/75522669