Scott Edmunds. Data Dissemination: Difficulties, Data Citation, DOIs and GigaScience (“Mo Data Mo Problems”)
The Ecoresponsive Genome of Daphnia pulex. Colbourne et al., Science, 4 February 2011: 200 Mb genome, 30,907 genes. Duplicated genes most responsive to ecological challenges.
Daphnia Genome Consortium. wFleabase: Mar 2006. Genome release: July 2007. Genome published: Feb 2011. >58 companion papers: https://daphnia.cgb.indiana.edu/Publications
Difficulties (image: Flickr cc, opensourceway)
[Figure: sequencing cost ($ per Mbp) compared with Moore’s Law; ~100,000X for sequencing. Source: E Lander/Broad]
[Figure: sequencing output compared with data storage (Moore’s/Kryder’s Law)]
[Figure: sequencing output compared with data publication. Dissemination?]
Potential sequencing capacity: 1 Illumina HiSeq 2000 (+TruSeq upgrade) = 600 Gb/run (12 days). × 128 HiSeq = 6 Tb/day = >2 Pb/year = ~2000 human genomes/day
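The capacity figures on this slide can be reproduced with a few lines of arithmetic. The sketch below uses only the numbers quoted on the slide, plus one assumption that is not on the slide: a human genome size of roughly 3.2 Gb. Under that assumption the "~2000 human genomes/day" figure appears to count raw bases equal to one genome length (about 1x coverage each); sequencing each genome to greater depth would give proportionally fewer genomes per day.

```python
# Figures taken from the slide.
GB_PER_RUN = 600      # Gb per HiSeq 2000 run (with TruSeq upgrade)
DAYS_PER_RUN = 12     # days per run
MACHINES = 128        # number of HiSeq instruments

# Assumption (not on the slide): approximate human genome size in Gb.
HUMAN_GENOME_GB = 3.2


def capacity(gb_per_run=GB_PER_RUN, days_per_run=DAYS_PER_RUN, machines=MACHINES):
    """Return (Tb per day, Pb per year, 1x genome-equivalents per day)."""
    gb_per_day = gb_per_run / days_per_run * machines   # 50 Gb/day per machine
    tb_per_day = gb_per_day / 1000
    pb_per_year = tb_per_day * 365 / 1000
    genomes_per_day = gb_per_day / HUMAN_GENOME_GB
    return tb_per_day, pb_per_year, genomes_per_day


if __name__ == "__main__":
    tb_day, pb_year, genomes = capacity()
    print(f"{tb_day:.1f} Tb/day, {pb_year:.2f} Pb/year, "
          f"~{genomes:.0f} genome-equivalents/day")
```

The result is about 6.4 Tb/day and 2.3 Pb/year, consistent with the rounded "6 Tb/day" and ">2 Pb/year" on the slide.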
SRA Closure
Incentives/credit. Credit where credit is overdue: “One option would be to provide researchers who release data to public repositories with a means of accreditation.” “An ability to search the literature for all online papers that used a particular data set would enable appropriate attribution for those who share.” Nature Biotechnology 27, 579 (2009). Prepublication data sharing (Toronto International Data Release Workshop): “Data producers benefit from creating a citable reference, as it can later be used to reflect impact of the data sets.” Nature 461, 168-170 (2009).
Data citation: DataCite and DOIs. Digital Object Identifiers (DOIs) offer a solution:
Researchers, authors, publishers know how to use them
Put datasets on the same playing field as articles
Dataset: Yancheva et al (2007). Analyses on sediment of Lake Maar. PANGAEA. doi:10.1594/PANGAEA.587840
Data citation: DataCite and DOIs. >1 million DOIs since Dec 2009. Central metadata repository to link with WoS/ISI: finally can track and credit use!
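Part of what makes a DOI usable for tracking is that it is a plain string with a fixed shape: a prefix identifying the registrant, then a slash, then a suffix chosen by the registrant. A minimal sketch of splitting the PANGAEA DOI above and turning it into a resolvable link follows. The function names and the simplified pattern are illustrative assumptions, and the pattern does not cover every valid DOI form.

```python
import re
from urllib.parse import quote

# Simplified pattern (an assumption for illustration): optional "doi:" label,
# then "10.<registrant code>/<suffix>".
DOI_PATTERN = re.compile(r"^(?:doi:)?\s*(10\.\d{4,9})/(\S+)$", re.IGNORECASE)


def parse_doi(text):
    """Split a DOI string into (prefix, suffix); raise ValueError if malformed."""
    match = DOI_PATTERN.match(text.strip())
    if match is None:
        raise ValueError(f"not a DOI: {text!r}")
    return match.group(1), match.group(2)


def resolver_url(text, base="https://doi.org/"):
    """Build a resolver link for a DOI, percent-encoding unsafe suffix characters."""
    prefix, suffix = parse_doi(text)
    return f"{base}{prefix}/{quote(suffix, safe='/')}"


if __name__ == "__main__":
    print(parse_doi("doi:10.1594/PANGAEA.587840"))
    print(resolver_url("doi:10.1594/PANGAEA.587840"))
```

Because the prefix identifies who registered the DOI and the suffix identifies the item, the same string serves as a citation element in a reference list and as a stable link.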
Coming soon… Large-Scale Data Journal/Database
In conjunction with:
Editor-in-Chief: Laurie Goodman, PhD
Editor: Scott Edmunds, PhD
Assistant Editor: Alexandra Basford, PhD
www.gigasciencejournal.com
Criteria and Focus of Journal/Database:
Utility/Usability
Standards/Searchability/Scale/Sharing
Data publishing/DOI
www.gigasciencejournal.com
Use of Data = Importance + Usability. Easier to assess; subjective? www.gigasciencejournal.com
Reproducibility/Reuse

  • 22. Integrated tools to promote more widespread access, viewing, and analysis of data.
  • 23. Encourage and aid use of workflow systems for methods (e.g. submission of Galaxy XML files). www.gigasciencejournal.com
  • 25. All supporting data must be publicly available.
  • 26. Ask for MIBBI compliance and use of reporting checklists.
  • 27. Part of the Biosharing network. www.gigasciencejournal.com
  • 29. Data hosting will follow standard funding agency and community guidelines.
  • 30. DOI assignment available for submitted data to allow ease of finding and citing datasets, as well as for citation tracking. www.gigasciencejournal.com
  • 31. Our first DOI: To maximize its utility to the research community and aid those fighting the current epidemic, genomic data is released here into the public domain under a CC0 license. Until the publication of research papers on the assembly and whole-genome analysis of this isolate we would ask you to cite this dataset as: Li, D; Xi, F; Zhao, M; Liang, Y; Chen, W; Cao, S; Xu, R; Wang, G; Wang, J; Zhang, Z; Li, Y; Cui, Y; Chang, C; Cui, C; Luo, Y; Qin, J; Li, S; Li, J; Peng, Y; Pu, F; Sun, Y; Chen, Y; Zong, Y; Ma, X; Yang, X; Cen, Z; Zhao, X; Chen, F; Yin, X; Song, Y; Rohde, H; Li, Y; Wang, J; Wang, J and the Escherichia coli O104:H4 TY-2482 isolate genome sequencing consortium (2011) Genomic data from Escherichia coli O104:H4 isolate TY-2482. BGI Shenzhen. doi:10.5524/100001 http://dx.doi.org/10.5524/100001 To the extent possible under law, BGI Shenzhen has waived all copyright and related or neighboring rights to Genomic Data from the 2011 E. coli outbreak. This work is published from: China.
  • 32. E. coli #crowdsourcing: the first tweenome?
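The dataset citation given for doi:10.5524/100001 follows a regular pattern: creators, year, title, publisher, identifier. As a rough sketch of how such a citation could be assembled from structured metadata, the example below uses a hypothetical `DatasetCitation` class (not part of any GigaScience or DataCite tooling) and, for brevity, only the first three creators from the slide.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DatasetCitation:
    """Hypothetical container for the fields seen in the slide's citation."""
    creators: List[str]
    year: int
    title: str
    publisher: str
    doi: str

    def format(self) -> str:
        # Pattern taken from the slide: Creators (Year) Title. Publisher. doi:...
        authors = "; ".join(self.creators)
        return (f"{authors} ({self.year}) {self.title}. "
                f"{self.publisher}. doi:{self.doi}")


if __name__ == "__main__":
    citation = DatasetCitation(
        creators=["Li, D", "Xi, F", "Zhao, M"],  # abbreviated list, for illustration only
        year=2011,
        title="Genomic data from Escherichia coli O104:H4 isolate TY-2482",
        publisher="BGI Shenzhen",
        doi="10.5524/100001",
    )
    print(citation.format())
```

Keeping the fields separate rather than storing one free-text string makes it easier to deposit the same metadata in a central repository and to match later citations back to the dataset.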