From Deadly E. coli to Endangered Polar Bear: GigaScience Provides First Citable Data
doi:10.5524/100008
doi:10.5524/100001
Scott Edmunds, PhDAlexandra Basford, PhD
The “big-data” era
Flickr cc: opensourcewaySource: E Lander/Broad
Traditional Form of Data-Release
Data Production
Data Analysis
Peer Review
PublicationPublication
Months/Years
Incentives/creditCredit where credit is overdue:“One option would be to provide researchers who release data to public repositories with a means of accreditation.”“An ability to search the literature for all online papers that used a particular data set would enable appropriate attribution for those who share. “Nature Biotechnology 27, 579 (2009)
Prepublication data sharing (Toronto International Data Release Workshop)“Data producers benefit from creating a citable reference, as it can later be used to reflect impact of the data sets.” Nature 461, 168-170 (2009)
A new open-access open-data journal and database
www.gigasciencejournal.comPublished by BGI in
partnership with BioMed Central
… “big and sharable”
Innovative article publishing and data hosting
Launching today…
Why DOI®s?
– Clear method for data tracking and data citation, allowing: • Increased the searchability (and use) of data • Credit for data production, making it clear who produced the data
and when• The ability to track and receive feedback on data usage• Credit to original authors for their data’s use • A data citation metric potentially rivaling and complementary to
the impact factor• The potential to publish papers relating to a dataset, while making
the data available and receiving credit for it earlier
To maximize its utility to the research community and aid those fighting the current epidemic, genomic data is released here into the public domain under a CC0 license. Until the publication of research papers on the assembly and whole-genome analysis of this isolate we would ask you to cite this dataset as:
Li, D; Xi, F; Zhao, M; Liang, Y; Chen, W; Cao, S; Xu, R; Wang, G; Wang, J; Zhang, Z; Li, Y; Cui, Y; Chang, C; Cui, C; Luo, Y; Qin, J; Li, S; Li, J; Peng, Y; Pu, F; Sun, Y; Chen,Y; Zong, Y; Ma, X; Yang, X; Cen, Z; Zhao, X; Chen, F; Yin, X; Song,Y ; Rohde, H; Li, Y; Wang, J; Wang, J and the Escherichia coli O104:H4 TY-2482 isolate genome sequencing consortium (2011) Genomic data from Escherichia coli O104:H4 isolate TY-2482. BGI Shenzhen. doi:10.5524/100001 http://dx.doi.org/10.5524/100001
Our first DOI®:
To the extent possible under law, BGI Shenzhen has waived all copyright and related or neighboring rights to Genomic Data from the 2011 E. coli outbreak. This work is published from: China.
G10K Genomes Get DOI®s
doi:10.5524/100004
About The Journal Open access, online journal optimized for the publication of all types of
biological studies that use or create large-scale data sets. New journal format that combines standard manuscript publication
with an extensive database developed to host all associated data. Evolving data repository beginning with 'omic’-type data and growing to
support other biological large-scale data, such as imaging data, cohort data, metadata, and others. Editorial interaction with the different biological communities to
determine the best means of hosting and accessing their type of data. Integrated tools to promote more widespread access, viewing, and
analysis of the stored data. BGI Cloud Computing resources for handling and analyzing large-scale
data. All Data given a DOI to allow ease of finding and citing datasets, as well
as for citation tracking.
www.gigasciencejournal.com
Questions?
Editor-in-Chief: Laurie Goodman, PhDEditor: Scott Edmunds, PhDAssistant Editor: Alexandra Basford, PhD
Launching today…
Follow GigaScience on Twitter @GigaScience
Contact: [email protected]
Top Related