Transforming Business driven to Technology driven organizations
Transforming Science Through Data -driven Discovery ... · Transforming Science Through Data-Driven...
Transcript of Transforming Science Through Data -driven Discovery ... · Transforming Science Through Data-Driven...
TransformingScienceThroughData-drivenDiscovery
CyVerseOverviewNationalAcademiesSpecialTopicsSummerInstitute
onQuantitativeBiologyJasonWilliams– Lead,CyVerse– Education,Outreach,Training
[email protected] [@JasonWilliamsNY]
DownloadSlidesandFollowAlong
cyverse-qubes.readthedocs.io
CyVerseEvolution
iPlant2008EmpoweringaNewPlant
Biology
iPlant2013CyberinfrastructureforLife
Science
CyVerse2016TransformingScienceThroughData-Driven
Discovery
WearefundedbytheNationalScienceFoundation
• Weareyourcolleaguesandcollaborators!• $100Millionininvestment• Freelyavailabletothecommunity• Spurnational/internationalcollaboration• CiteCyVerse:
CyVerse.org/acknowledge-cite-cyverse
DBI-0735191andDBI-1265383
CyVerseEvolution
CyVerseEvolution
CyVerse2016TransformingScienceThroughData-Driven
Discovery
Vision:Transformingsciencethroughdata-drivendiscovery
Mission:Design,develop,deploy,andexpandanationalcyberinfrastructure forlifescienceresearch,andtrainscientistsinitsuse
Morethan30Kusers,PBofdata,andhundredsofpublications,courses,anddiscoveries
WhatisCyberinfrastructure?
•Datastorage•Software•High-performancecomputing•People
organizedintosystemsthatsolveproblemsofsizeandscopethatwouldnototherwisebesolvable.
WhatisCyberinfrastructure?
Platforms,tools,datasets Storageandcompute Trainingandsupport
CyVersesupportsalldomainsoflifescience
Plant/MicrobialAnimal BiomedicalEcological/Climate
CyVerseisbuiltforData
CyVerseproductstackReadytousePlatforms
FoundationalCapabilities
EstablishedCIComponents
ExtensibleServices
EaseofU
se
HowwasCyVersebuilt?
Genomicdataandanalysis:• Referenceguidedassembly• Denovoassembly• RNA-Seq(expression;gene/isoformdiscovery)• Variantcalling• Genome/Transcriptomeannotation• ChIP-Seq/Integrationofepigeneticinformation• Multiplesequencingplatforms• Newandevolvingtechnologies
CyVerseCommunityPriorities
Genomicdata
Environmentaldata
Phenotypedata
Phylogenetic Inferences
EcologicalModels
EvolutionaryModels
AssociationStudies
PathwayAnalysis
Predictiveandsynthetic
Knowledgegathering
Retrodictiveinsights
CyVerseCommunityPriorities
CyVerse is a collaborative virtual organization
CyVerseInstitutions
CyVerseUK
• WestrivetobetheCILegoblocks• Danish'leggodt'- 'playwell’• Alsotranslatesas'Iputtogether'inLatin• IfasolutionisnotavailableyoucancraftyourownusingCyVerseCIcomponents
CyVerseProducts
Data Store
ü Initial100GBallocation– TBallocationsavailable
ü Automaticdatabackup
ü Easyupload/downloadandsharing
Theresourcesyouneedtoshareandmanagedatawithyourlab,colleaguesandcommunity
Discovery EnvironmentHundredsofbioinformaticsAppsinaneasy-to-useinterface
ü Aplatform thatcanrunalmostanybioinformaticsapplication
ü Seamlesslyintegratedwithdataandhighperformancecomputing
ü Userextensible– addyourownapplications
AtmosphereCloudcomputingforthelifesciences
ü Simple:One-clickaccesstomorethan200virtualmachineimages
ü Flexible:Fullycustomizeyoursoftwaresetup
ü Powerful:IntegratedwithiPlantcomputinganddataresources
Science APIsFullycustomizeiPlant resources
ü Science-as-a-serviceplatform
ü Defineyourowncompute,andstorageresources(localandiPlant)
ü Buildyourownappstoreofscientificcodesandworkflows
DNA SubwayEducationalworkflowsforGenomes,DNABarcoding,RNA-Seq
ü Commonlyusedbioinformaticstoolsinstreamlinedworkflows
ü Teachimportantconceptsinbiologyandbioinformatics
ü Inquiry-basedexperimentsfornoveldiscoveryandpublicationofdata
BisqueImageanalysis,management,andmetadata
ü Secureimagestorage,analysis,anddatamanagement
ü Integrateexistingapplicationsorcreatenewones
ü CustomvisualizationandimagehandlingroutinesandAPIs
TransformingScienceThrough Data-drivenDiscovery
ParkerAntinNiravMerchant
EricLyons
MattVaughn DoreenWareDaveMicklos
CyVerseissupported bytheNationalScienceFoundation underGrantNo.DBI-0735191andDBI-1265383.
CyVerseExecutiveTeam