Vicky Schneider, Andrew Pask, Denis O’Meally,
Philippa Griffin, Jeff Christiansen, Mike
Charleston, Dominique Gorse, Andrew Treloar,
Jason Williams, Rebecca Johnson and Andrew Lonie
With comments from Paul Flicek, Michelle Barker,
Susanna Sansone, Dave Burt
An Oz Mammals
Bioinformatics and
Data Resource
Raw
• Whole-genome sequencing reads
• Exon-capture sequencing reads
• RADseq/GBS/exon-capture sequencing reads
Processed
• Genome assemblies
• Gene alignments
• Phylogenetic trees
• Variant calls
• Transcriptome data
• Annotations
• Sequence alignments
• Phylogenetic trees
• Microsatellite datasets
• Cytological data (images?)
• Phenotype information
• ….....
From Australian Alps on Flickr: https://www.flickr.com/photos/australianalps/6954940609 CC BY-NC-ND 2.0
OMG Project
data
● Hugely valuable for
○ Understanding our natural heritage
○ Tackling evolutionary and ecological questions
○ Placental mammal research including human
biomedical research
● Uniquely Australian
● Often irreplaceable samples and data
We are leading the world in generating these data
Could also be leading in sharing the data!
+
Data Life Cycle
framework
visualising
Metadata (contextual information about the data) is key to making this work
e.g.
Sample
- species
- tissue type
- collection location
- museum ID
Experiment
- sample processing method
- technology used
- settings
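
The sample and experiment fields listed above could be captured as simple structured records. The following is a minimal sketch, not the project's actual schema: the class names, the `missing_fields` helper and the example values are all hypothetical, and only the field names come from the slide.

```python
from dataclasses import dataclass, fields
from typing import List, Optional


@dataclass
class Sample:
    # Field names follow the slide; every value is optional until supplied.
    species: Optional[str] = None
    tissue_type: Optional[str] = None
    collection_location: Optional[str] = None
    museum_id: Optional[str] = None


@dataclass
class Experiment:
    # Each experiment points back to the sample it was run on.
    sample: Sample
    sample_processing_method: Optional[str] = None
    technology_used: Optional[str] = None
    settings: Optional[str] = None


def missing_fields(record) -> List[str]:
    """Return the names of fields that are still empty on a record."""
    return [f.name for f in fields(record)
            if getattr(record, f.name) in (None, "")]


# Hypothetical example values, for illustration only.
sample = Sample(species="Tachyglossus aculeatus", tissue_type="liver")
experiment = Experiment(sample=sample, technology_used="whole-genome sequencing")

print(missing_fields(sample))      # ['collection_location', 'museum_id']
print(missing_fields(experiment))  # ['sample_processing_method', 'settings']
```

A check like `missing_fields` is one way a resource could prompt submitters for incomplete metadata before data are shared.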
• A place to store and share data and metadata for the OMG project
• A place to store and share existing Oz mammal datasets
• A place to share data processing/analysis workflows
• A place to access data processing, analysis and visualisation tools
(with appropriate compute resources)
• Integration with external tools, e.g. Atlas of Living Australia
What is not covered by the OMG project?
• 5 strains x 2 growth conditions of 2 bacterial species
• Genomic, transcriptomic, metabolomic and proteomic profiles
Select datasets using drop-down menu of metadata values:
e.g.
• ‘all raw transcriptomic datasets from E. coli grown in blood media’
• ‘all datasets from bacterial samples collected from patients in NSW before 2010’
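
Selections like the two examples above amount to filtering a dataset catalogue on metadata values. The sketch below illustrates the idea only: the catalogue entries, the key names (`data_type`, `level`, `growth_medium`, `source`, `state`, `year_collected`) and the helper functions are assumptions made for this example, not the platform's real data model or API.

```python
from typing import Dict, List

Dataset = Dict[str, object]

# Hypothetical catalogue entries; all values are invented for illustration.
DATASETS: List[Dataset] = [
    {"id": "ds1", "species": "E. coli", "data_type": "transcriptomic",
     "level": "raw", "growth_medium": "blood", "source": "patient",
     "state": "NSW", "year_collected": 2008},
    {"id": "ds2", "species": "E. coli", "data_type": "transcriptomic",
     "level": "processed", "growth_medium": "blood", "source": "patient",
     "state": "NSW", "year_collected": 2012},
    {"id": "ds3", "species": "species B", "data_type": "genomic",
     "level": "raw", "growth_medium": "broth", "source": "patient",
     "state": "VIC", "year_collected": 2009},
    {"id": "ds4", "species": "E. coli", "data_type": "proteomic",
     "level": "raw", "growth_medium": "blood", "source": "patient",
     "state": "NSW", "year_collected": 2005},
]


def select(datasets: List[Dataset], **criteria: object) -> List[Dataset]:
    """Keep datasets whose metadata equals every given key/value pair."""
    return [d for d in datasets
            if all(d.get(key) == value for key, value in criteria.items())]


def collected_before(datasets: List[Dataset], year: int) -> List[Dataset]:
    """Keep datasets with a recorded collection year earlier than `year`."""
    return [d for d in datasets
            if isinstance(d.get("year_collected"), int)
            and d["year_collected"] < year]


# 'all raw transcriptomic datasets from E. coli grown in blood media'
raw_ecoli = select(DATASETS, species="E. coli", data_type="transcriptomic",
                   level="raw", growth_medium="blood")

# 'all datasets from bacterial samples collected from patients in NSW before 2010'
nsw_early = collected_before(select(DATASETS, source="patient", state="NSW"), 2010)

print([d["id"] for d in raw_ecoli])  # ['ds1']
print([d["id"] for d in nsw_early])  # ['ds1', 'ds4']
```

Queries of this kind only work when every dataset carries the same metadata keys with consistent values, which is why complete, standardised metadata matters.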
Send to local desktops, HPC systems for analysis
Log in
Process/analyse/visualise in common cloud-based environment with pre-installed software tools
Submit data and metadata to international repository
• Large collaborative project funded by Research Data Services (RDS), linked
with the BPA Antibiotic-Resistant Pathogens Project
• Project members from VicNode, QCIF, Melbourne Bioinformatics (formerly
VLSCI), Intersect
• This shows there is expertise in Australia in developing this kind of resource:
• data storage
• research data management
• delivering analysis tools in a common cloud environment
• linking across storage/management/analysis layers
• Many of the pieces can be reused/adapted for different research projects
Existing Oz Mammal data resources
• For within-project collaboration
• Focus on data sharing (storage), community genome annotation
• Datasets mostly unpublished as yet
Tools:
• File downloads
• JBrowse
• BLAST
• Apollo
http://copo-project.org/
Aims to provide an easy-to-use interface for researchers to access interoperable
• Metadata annotation services
• Data repository services
• Data analysis services
• Data publishing services
www.cyverse.org
What could a well-funded Oz Mammals Data and Bioinformatics Resource do?
• Capture Oz Mammal data and resources that already exist
• long-term, secure data storage
• Integrate new OMG data and metadata
• Enable data sharing within OMG project (and collaborators)
• Provide access to Oz Mammal data for the world!
• Access to data processing, analysis and visualisation tools in one place
• Integrate external tools, e.g. Atlas of Living Australia
• Enable sharing of processing/analysis workflows within the project
• Enable sharing via submission to appropriate international repositories
• encourage best-practice data formats
• encourage complete, rich metadata that complies with repository and community standards
• Use and build on existing platforms like the OMICS platform
• Long-term hosting and maintenance
Current way forward
• Drafting a proposal aimed at ANDS/NeCTAR/RDS
• No funding scheme yet - but possibly later this year
• Engaging with European Bioinformatics Institute (Ensembl Vertebrates),
ISA-Tools, Cyverse for potential collaborations and advice
• Aligns with broader digital infrastructure strategy currently being mapped
at national level
Timescale
• Year 1-2: scoping requirements, building, ongoing testing
• Year 2-3: building, release, outreach/training, improvement
Expertise required
• Research Software Engineering
• Business Analyst expertise + domain knowledge
• Biocuration
• Input on Bioinformatics Needs
• Input on User Experience Design
• Input on Training/Outreach
• Project Management
For comparison
• COPO: 4 FTE for 3 years
• Cyverse: 35 FTE for 5 years (US$100 million over 10 years)
Matt Francey on Flickr: https://www.flickr.com/photos/howfardad/31879952075
CC BY-NC 2.0
Your thoughts?
• Are there OMG project needs not covered in this
list?
• Any other Oz Mammal portals/resources to be
aware of / consider incorporating?
• What do you see as the highest priority in data
management / accessing compute resources /
sharing and storing data for the OMG:
• Currently?
• A year from now?

Más contenido relacionado

La actualidad más candente

How to share useful data
How to share useful dataHow to share useful data
How to share useful dataPeter McQuilton
 
Funders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeFunders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeCarly Strasser
 
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015Carly Strasser
 
Data citation metrics : best practice to enable new metrics for research data
Data citation metrics : best practice to enable new metrics for research dataData citation metrics : best practice to enable new metrics for research data
Data citation metrics : best practice to enable new metrics for research dataLe_GFII
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpCarole Goble
 
Ausplots Training - Session 1
Ausplots Training - Session 1Ausplots Training - Session 1
Ausplots Training - Session 1bensparrowau
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Ola Spjuth
 
Information systems on fish and marine genetic resources
Information systems on fish and marine genetic resourcesInformation systems on fish and marine genetic resources
Information systems on fish and marine genetic resourcesapaari
 
A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...
A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...
A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...ariadnenetwork
 
Research Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSIResearch Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSIDaniel S. Katz
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudOla Spjuth
 
Data Matters for AGU Early Career Conference
Data Matters for AGU Early Career ConferenceData Matters for AGU Early Career Conference
Data Matters for AGU Early Career ConferenceCarly Strasser
 
NSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meetingNSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meetingDaniel S. Katz
 
OpenNeuro: a free online platform for sharing and analysis of neuroimaging data
OpenNeuro: a free online platform for sharing and analysis of neuroimaging dataOpenNeuro: a free online platform for sharing and analysis of neuroimaging data
OpenNeuro: a free online platform for sharing and analysis of neuroimaging dataKrzysztof Gorgolewski
 
L&P Eric Celeste - SHARE
L&P Eric Celeste -  SHAREL&P Eric Celeste -  SHARE
L&P Eric Celeste - SHARECASRAI
 
Avoiding the tower of babel - The Role of Data Description Standards in Biome...
Avoiding the tower of babel - The Role of Data Description Standards in Biome...Avoiding the tower of babel - The Role of Data Description Standards in Biome...
Avoiding the tower of babel - The Role of Data Description Standards in Biome...Krzysztof Gorgolewski
 
Spark Summit EU talk by Erwin Datema and Roeland van Ham
Spark Summit EU talk by Erwin Datema and Roeland van HamSpark Summit EU talk by Erwin Datema and Roeland van Ham
Spark Summit EU talk by Erwin Datema and Roeland van HamSpark Summit
 
Reproducibility and replicability: a practical approach
Reproducibility and replicability: a practical approachReproducibility and replicability: a practical approach
Reproducibility and replicability: a practical approachKrzysztof Gorgolewski
 

La actualidad más candente (20)

How to share useful data
How to share useful dataHow to share useful data
How to share useful data
 
Funders and Publishers: Agents of Change
Funders and Publishers: Agents of ChangeFunders and Publishers: Agents of Change
Funders and Publishers: Agents of Change
 
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
AIBS Bioinformatics Workforce Needs Workshop, Dec 2015
 
Data citation metrics : best practice to enable new metrics for research data
Data citation metrics : best practice to enable new metrics for research dataData citation metrics : best practice to enable new metrics for research data
Data citation metrics : best practice to enable new metrics for research data
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
 
Ausplots Training - Session 1
Ausplots Training - Session 1Ausplots Training - Session 1
Ausplots Training - Session 1
 
The CATE Project
The CATE ProjectThe CATE Project
The CATE Project
 
sDiv_IJSCM-part_2
sDiv_IJSCM-part_2sDiv_IJSCM-part_2
sDiv_IJSCM-part_2
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
 
Information systems on fish and marine genetic resources
Information systems on fish and marine genetic resourcesInformation systems on fish and marine genetic resources
Information systems on fish and marine genetic resources
 
A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...
A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...
A First Attempt at Describing, Disseminating and Reusing Methodological Knowl...
 
Research Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSIResearch Software Sustainability: WSSSPE & URSSI
Research Software Sustainability: WSSSPE & URSSI
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
Data Matters for AGU Early Career Conference
Data Matters for AGU Early Career ConferenceData Matters for AGU Early Career Conference
Data Matters for AGU Early Career Conference
 
NSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meetingNSF SI2 program discussion at 2014 SI2 PI meeting
NSF SI2 program discussion at 2014 SI2 PI meeting
 
OpenNeuro: a free online platform for sharing and analysis of neuroimaging data
OpenNeuro: a free online platform for sharing and analysis of neuroimaging dataOpenNeuro: a free online platform for sharing and analysis of neuroimaging data
OpenNeuro: a free online platform for sharing and analysis of neuroimaging data
 
L&P Eric Celeste - SHARE
L&P Eric Celeste -  SHAREL&P Eric Celeste -  SHARE
L&P Eric Celeste - SHARE
 
Avoiding the tower of babel - The Role of Data Description Standards in Biome...
Avoiding the tower of babel - The Role of Data Description Standards in Biome...Avoiding the tower of babel - The Role of Data Description Standards in Biome...
Avoiding the tower of babel - The Role of Data Description Standards in Biome...
 
Spark Summit EU talk by Erwin Datema and Roeland van Ham
Spark Summit EU talk by Erwin Datema and Roeland van HamSpark Summit EU talk by Erwin Datema and Roeland van Ham
Spark Summit EU talk by Erwin Datema and Roeland van Ham
 
Reproducibility and replicability: a practical approach
Reproducibility and replicability: a practical approachReproducibility and replicability: a practical approach
Reproducibility and replicability: a practical approach
 

Similar a An Oz Mammals Bioinformatics and Data Resource

Datat and donuts: how to write a data management plan
Datat and donuts: how to write a data management planDatat and donuts: how to write a data management plan
Datat and donuts: how to write a data management planC. Tobin Magle
 
NSF Software @ ApacheConNA
NSF Software @ ApacheConNANSF Software @ ApacheConNA
NSF Software @ ApacheConNADaniel S. Katz
 
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET
 
The pulse of cloud computing with bioinformatics as an example
The pulse of cloud computing with bioinformatics as an exampleThe pulse of cloud computing with bioinformatics as an example
The pulse of cloud computing with bioinformatics as an exampleEnis Afgan
 
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyICZN
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015Fiona Nielsen
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 Scott Edmunds
 
EMBL Australia Bioinformatics Resource BioInfoSummer 2016
EMBL Australia Bioinformatics Resource BioInfoSummer 2016EMBL Australia Bioinformatics Resource BioInfoSummer 2016
EMBL Australia Bioinformatics Resource BioInfoSummer 2016Philippa Griffin
 
e-infrastructural needs to support informatics
e-infrastructural needs to support informaticse-infrastructural needs to support informatics
e-infrastructural needs to support informaticsDavid Wallom
 
Community Standards and Tools for Biodiversity Science at NIEHD
Community Standards and Tools for Biodiversity Science at NIEHDCommunity Standards and Tools for Biodiversity Science at NIEHD
Community Standards and Tools for Biodiversity Science at NIEHDrlwalls2008
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...Sarah Anna Stewart
 
De-centralized but global: Redesigning biodiversity data aggregation for impr...
De-centralized but global: Redesigning biodiversity data aggregation for impr...De-centralized but global: Redesigning biodiversity data aggregation for impr...
De-centralized but global: Redesigning biodiversity data aggregation for impr...taxonbytes
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Susanna-Assunta Sansone
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesLouise Corti
 
Data and Donuts: How to write a data management plan
Data and Donuts: How to write a data management planData and Donuts: How to write a data management plan
Data and Donuts: How to write a data management planC. Tobin Magle
 
Elixir at de.nbi meeting
Elixir at de.nbi meetingElixir at de.nbi meeting
Elixir at de.nbi meetingNiklas Blomberg
 
BioDBCore: Current Status and Next Developments
BioDBCore: Current Status and Next DevelopmentsBioDBCore: Current Status and Next Developments
BioDBCore: Current Status and Next DevelopmentsPascale Gaudet
 
Data publishing at the UQ Library
Data publishing at the UQ LibraryData publishing at the UQ Library
Data publishing at the UQ LibraryARDC
 

Similar a An Oz Mammals Bioinformatics and Data Resource (20)

Datat and donuts: how to write a data management plan
Datat and donuts: how to write a data management planDatat and donuts: how to write a data management plan
Datat and donuts: how to write a data management plan
 
NSF Software @ ApacheConNA
NSF Software @ ApacheConNANSF Software @ ApacheConNA
NSF Software @ ApacheConNA
 
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
dkNET Office Hours - "Are You Ready for 2023: New NIH Data Management and Sha...
 
The pulse of cloud computing with bioinformatics as an example
The pulse of cloud computing with bioinformatics as an exampleThe pulse of cloud computing with bioinformatics as an example
The pulse of cloud computing with bioinformatics as an example
 
COPO - Collaborative Open Plant Omics, by Rob Davey
COPO - Collaborative Open Plant Omics, by Rob DaveyCOPO - Collaborative Open Plant Omics, by Rob Davey
COPO - Collaborative Open Plant Omics, by Rob Davey
 
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to TaxonomyJim Woolley - Name Registration: One Less Impediment to Taxonomy
Jim Woolley - Name Registration: One Less Impediment to Taxonomy
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015
 
HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9 HKU Data Curation MLIM7350 Class 9
HKU Data Curation MLIM7350 Class 9
 
EMBL Australia Bioinformatics Resource BioInfoSummer 2016
EMBL Australia Bioinformatics Resource BioInfoSummer 2016EMBL Australia Bioinformatics Resource BioInfoSummer 2016
EMBL Australia Bioinformatics Resource BioInfoSummer 2016
 
e-infrastructural needs to support informatics
e-infrastructural needs to support informaticse-infrastructural needs to support informatics
e-infrastructural needs to support informatics
 
Community Standards and Tools for Biodiversity Science at NIEHD
Community Standards and Tools for Biodiversity Science at NIEHDCommunity Standards and Tools for Biodiversity Science at NIEHD
Community Standards and Tools for Biodiversity Science at NIEHD
 
Sgci iwsg-a-10-10-16
Sgci iwsg-a-10-10-16Sgci iwsg-a-10-10-16
Sgci iwsg-a-10-10-16
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 
De-centralized but global: Redesigning biodiversity data aggregation for impr...
De-centralized but global: Redesigning biodiversity data aggregation for impr...De-centralized but global: Redesigning biodiversity data aggregation for impr...
De-centralized but global: Redesigning biodiversity data aggregation for impr...
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014
 
Engaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciencesEngaging with students and researchers: the case of the social sciences
Engaging with students and researchers: the case of the social sciences
 
Data and Donuts: How to write a data management plan
Data and Donuts: How to write a data management planData and Donuts: How to write a data management plan
Data and Donuts: How to write a data management plan
 
Elixir at de.nbi meeting
Elixir at de.nbi meetingElixir at de.nbi meeting
Elixir at de.nbi meeting
 
BioDBCore: Current Status and Next Developments
BioDBCore: Current Status and Next DevelopmentsBioDBCore: Current Status and Next Developments
BioDBCore: Current Status and Next Developments
 
Data publishing at the UQ Library
Data publishing at the UQ LibraryData publishing at the UQ Library
Data publishing at the UQ Library
 

Último

SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Caco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionCaco-2 cell permeability assay for drug absorption
Caco-2 cell permeability assay for drug absorptionPriyansha Singh
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Types of different blotting techniques.pptx
Types of different blotting techniques.pptxTypes of different blotting techniques.pptx
Types of different blotting techniques.pptxkhadijarafiq2012
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Work, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE PhysicsWork, Energy and Power for class 10 ICSE Physics
Work, Energy and Power for class 10 ICSE Physicsvishikhakeshava1
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 

Último (20)

SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx

An Oz Mammals Bioinformatics and Data Resource

  • 1. An Oz Mammals Bioinformatics and Data Resource
      Vicky Schneider, Andrew Pask, Denis O’Meally, Philippa Griffin, Jeff Christiansen, Mike Charleston, Dominique Gorse, Andrew Treloar, Jason Williams, Rebecca Johnson and Andrew Lonie
      With comments from Paul Flicek, Michelle Barker, Susanna Sansone, Dave Burt
  • 2. Raw
      • Whole-genome sequencing reads
      • Exon-capture sequencing reads
      • RADseq/GBS/exon-capture sequencing reads
    Processed
      • Genome assemblies
      • Gene alignments
      • Phylogenetic trees
      • Variant calls
      • Transcriptome data
      • Annotations
      • Sequence alignments
      • Phylogenetic trees
      • Microsatellite datasets
      • Cytological data (images?)
      • Phenotype information
      • …
    From Australian Alps on Flickr: https://www.flickr.com/photos/australianalps/6954940609 CC BY-NC-ND 2.0
  • 3. OMG Project data
      • Hugely valuable for
          ○ Understanding our natural heritage
          ○ Tackling evolutionary and ecological questions
          ○ Placental mammal research including human biomedical research
      • Uniquely Australian
      • Often irreplaceable samples and data
    We are leading the world in generating these data. Could also be leading in sharing the data!
    From Australian Alps on Flickr: https://www.flickr.com/photos/australianalps/6954940609 CC BY-NC-ND 2.0
  • 5. Data Life Cycle framework (figure; surviving label: "visualising")
    Metadata (contextual information about the data) is key to making this work, e.g.
      • Sample: species, tissue type, collection location, museum ID
      • Experiment: sample processing method, technology used, settings
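The sample and experiment metadata listed above could be captured in a structured record along these lines. This is only a minimal sketch: the class names, field names and example values are hypothetical and do not come from the OMG project or any repository schema.

```python
from dataclasses import dataclass, asdict
from typing import List, Optional


@dataclass
class SampleMetadata:
    species: str
    tissue_type: str
    collection_location: str
    museum_id: Optional[str] = None  # assumption: not every sample has a museum ID


@dataclass
class ExperimentMetadata:
    sample_processing_method: str
    technology_used: str
    settings: str


@dataclass
class DatasetRecord:
    sample: SampleMetadata
    experiment: ExperimentMetadata

    def missing_fields(self) -> List[str]:
        """Return dotted names of metadata fields that are empty or None."""
        missing = []
        for section, values in asdict(self).items():
            for name, value in values.items():
                if value in (None, ""):
                    missing.append(f"{section}.{name}")
        return missing


# Hypothetical example values, invented purely for illustration.
record = DatasetRecord(
    sample=SampleMetadata(
        species="Vombatus ursinus",
        tissue_type="liver",
        collection_location="",
    ),
    experiment=ExperimentMetadata(
        sample_processing_method="DNA extraction",
        technology_used="whole-genome sequencing",
        settings="paired-end",
    ),
)

print(record.missing_fields())
```

A completeness check like `missing_fields()` is one simple way a resource might prompt submitters for the contextual information before data are shared.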
  • 6. What is not covered by the OMG project?
      • A place to store and share data and metadata for the OMG project
      • A place to store and share existing Oz mammal datasets
      • A place to share data processing/analysis workflows
      • A place to access data processing, analysis and visualisation tools (with appropriate compute resources)
      • Integration with external tools, e.g. Atlas of Living Australia
  • 7. • 5 strains x 2 growth conditions of 2 bacterial species • Genomic, transcriptomic, metabolomic and proteomic profiles
  • 9. • Log in
      • Select datasets using drop-down menu of metadata values, e.g.
          ○ ‘all raw transcriptomic datasets from E. coli grown in blood media’
          ○ ‘all datasets from bacterial samples collected from patients in NSW before 2010’
      • Process/analyse/visualise in common cloud-based environment with pre-installed software tools
      • Send to local desktops, HPC systems for analysis
      • Submit data and metadata to international repository
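Selecting datasets by metadata values, as in the two example queries above, amounts to filtering a catalogue on field values. The sketch below assumes a catalogue held as a list of dictionaries; the function name, field names and catalogue entries are all hypothetical and are not taken from the platform described in the slides.

```python
def select_datasets(records, **criteria):
    """Keep records whose metadata satisfies every criterion.

    Each criterion is either an exact value or a predicate function
    that receives the field value and returns True or False.
    """
    def matches(record):
        for field, wanted in criteria.items():
            value = record.get(field)
            if callable(wanted):
                if value is None or not wanted(value):
                    return False
            elif value != wanted:
                return False
        return True

    return [r for r in records if matches(r)]


# Hypothetical catalogue entries, invented for illustration only.
catalogue = [
    {"id": "ds1", "species": "E. coli", "data_type": "transcriptomic",
     "level": "raw", "growth_medium": "blood", "state": "NSW", "collection_year": 2008},
    {"id": "ds2", "species": "E. coli", "data_type": "genomic",
     "level": "raw", "growth_medium": "blood", "state": "NSW", "collection_year": 2012},
    {"id": "ds3", "species": "species B", "data_type": "transcriptomic",
     "level": "processed", "growth_medium": "serum", "state": "VIC", "collection_year": 2009},
    {"id": "ds4", "species": "E. coli", "data_type": "transcriptomic",
     "level": "raw", "growth_medium": "serum", "state": "NSW", "collection_year": 2005},
]

# 'all raw transcriptomic datasets from E. coli grown in blood media'
raw_blood = select_datasets(
    catalogue, species="E. coli", data_type="transcriptomic",
    level="raw", growth_medium="blood",
)

# 'all datasets from samples collected in NSW before 2010'
nsw_pre_2010 = select_datasets(
    catalogue, state="NSW", collection_year=lambda year: year < 2010,
)

print([r["id"] for r in raw_blood])
print([r["id"] for r in nsw_pre_2010])
```

Queries like these only work when every dataset carries the relevant metadata fields, which is why consistent metadata capture matters for a drop-down selection interface.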
  • 10. • Large collaborative project funded by Research Data Services (RDS), linked with the BPA Antibiotic-Resistant Pathogens Project
      • Project members from VicNode, QCIF, Melbourne Bioinformatics (formerly VLSCI), Intersect
      • -> There is expertise in Australia in developing this kind of resource:
          ○ data storage
          ○ research data management
          ○ delivering analysis tools in a common cloud environment
          ○ linking across storage/management/analysis layers
      • Many of the pieces can be reused/adapted for different research projects
  • 11. Existing Oz Mammal data resources
      • For within-project collaboration
      • Focus on data sharing, (storage), community genome annotation
      • Datasets mostly unpublished as yet
    Tools:
      • File downloads
      • JBrowse
      • BLAST
      • Apollo
  • 12. http://copo-project.org/
    Aims to provide an easy-to-use interface for researchers to access interoperable
      • Metadata annotation services
      • Data repository services
      • Data analysis services
      • Data publishing services
  • 17. What could a well-funded Oz Mammals Data and Bioinformatics Resource do?
      • Capture Oz Mammal data and resources that already exist
          ○ long-term, secure data storage
      • Integrate new OMG data and metadata
      • Enable data sharing within OMG project (and collaborators)
      • Provide access to Oz Mammal data for the world!
      • Access to data processing, analysis and visualisation tools in one place
      • Integrate external tools, e.g. Atlas of Living Australia
      • Enable sharing of processing/analysis workflows within the project
      • Enable sharing via submission to appropriate international repositories
          ○ encourage best-practice data formats
          ○ encourage complete, rich metadata that complies with repository and community standards
      • Use and build on existing platforms like the OMICS platform
      • Long-term hosting and maintenance
  • 19. Current way forward
      • Drafting a proposal aimed at ANDS/NeCTAR/RDS
      • No funding scheme yet, but possibly later this year
      • Engaging with European Bioinformatics Institute (Ensembl Vertebrates), ISA-Tools, CyVerse for potential collaborations and advice
      • Aligns with broader digital infrastructure strategy currently being mapped at national level
  • 20. Timescale
      • Year 1-2: scoping requirements, building, ongoing testing
      • Year 2-3: building, release, outreach/training, improvement
    Expertise required
      • Research Software Engineering
      • Business Analyst expertise + domain knowledge
      • Biocuration
      • Input on Bioinformatics Needs
      • Input on User Experience Design
      • Input on Training/Outreach
      • Project Management
    For comparison
      • COPO: 4 FTE for 3 years
      • CyVerse: 35 FTE for 5 years (US$100 million over 10 years)
    Matt Francey on Flickr: https://www.flickr.com/photos/howfardad/31879952075 CC BY-NC 2.0
  • 21. Your thoughts?
      • Are there OMG project needs not covered in this list?
      • Any other Oz Mammal portals/resources to be aware of / consider incorporating?
      • What do you see as the highest priority in data management / accessing compute resources / sharing and storing data for the OMG:
          ○ Currently?
          ○ A year from now?