SlideShare una empresa de Scribd logo
1 de 21
Data Ecosystems that Accelerate
Scientific Research• Moderator:
– Allison Proffitt
• Speakers:
– Adam Marko
• Data Visibility and Protection at the Scale of Life Sciences
– Adam Kraut
• Building Data Ecosystems for Accelerated Scientific Discovery
– Fernanda Foertter
• With the Power of AI Comes Great Responsibility
– Jonathan Stokes
• A Deep Learning Approach to Antibiotic Discovery
• Panel Discussion
Data Management and Visibility
at the Scale of Life Sciences
Adam Marko
Scientific Solutions Lead
April 2020
About me
• 10+ Years in Research IT
• Drug discovery, AgBio, NGS Diagnostics,
Research IT consulting
• Every organization had storage and data
management issues
• Help customers protect and understand
their life science data
Agenda
• Challenges extracting value from data
• Igneous core technology
• Use cases
– Data Search and Visibility
– Data Protection
– Data Movement
• How Igneous can help
5
Maximizing the Value of your Data
Can’t protect dataCan’t find my data
Data protection becomes
difficult above 500TB,
especially to the cloud
Backup requires using vendor
specific replication,breaking at
scale
Multiple instrument and storage
silos, requires datacenter space
End users spend 20%+ of their
time looking for data
Simple search does not exist for
research file data
Cannot index data at petabyte
scale fast enough to keep index
relevant
Lost Time
Decreased productivity
Data remains undiscovered
Vendor lock-in
Decreased Productivity
Increased Risk
These challenges exist now, and only get worse as research scales
Can’t move data
Legacy tools are vendor specific or
one-and-done, constrained to NAS
and expensive
Open-source is manual and labor
intensive, slowing collaboration
New workflows require data
next to compute. Compute is
more distributed than ever
Increased costs
Slowing collaboration
Increased Risk
6
Igneous SaaS Solutions Built for Scale
Can’t protect dataCan’t find my data Can’t move data
● Trillion file index
● 400K files per second
● Multithreaded with Dynamic Load Balancing
● Latency Monitoring
● Data compression in flight
● Installed as a VM
DataDiscover DataProtect DataFlow
● Won’t break at any scale
● Faster than any other solution
● Takes advantage of full network bandwidth
● User jobs unaffected by data movement
● Cost effective use of cloud
● Get up and running quick/not a project
Igneous was built for challenging file environments
Finding your data wherever it lives
DataDiscover
What does finding your data mean?
Live Views to customize file visualization
Search for the data that’s important
Share with collaborators
Enabling visibility across NAS systems
Users draw new insights from the data
Time is saved
Research is accelerated
Live views of data
View of data by extensions
Live View
Extension Filter
Capacity & Count
(Match Rate)
Live views of data
View of data owned by Groups
Live View
Group
Search in the live view
Search for specific keywords to narrow results in the live view
Search
Results
Share with collaborators
Share live views with researchers, executives, data-owners for further exploration
Share
Protecting and Moving your Data
DataProtect
DataFlow
Robust protection and movement at scale
Daily Backup to protect raw data and results
Meet SLAs cost-effectively
High Performance File Movement without babysitting
Building confidence with users
Peace of mind for researchers
Native file transfer anywhere
Common Research Infrastructure
HPC Cluster
Analysis and Storage
Devices
Researcher A
Researcher B
NAS 1 NAS 2
HPC Cluster
Analysis and Storage
Devices
Data Generation Rates Continue with no
Backup or Visibility
NAS Storage
is full and not
backed up
NAS 1 NAS 2
Researcher A
No scalable visibility
across file systems
Researcher B
No scalable visibility
across file systems
ls
grep
find
df
rsync/scp
Data
generation
rates grow
HPC Cluster
Analysis and Storage
Devices
Creating Silos and Lost Data
Personal cloud
account
Prosumer
NAS
USB HDD
NAS Storage
is full and not
backed up
NAS 1 NAS 2
Researcher A
No scalable visibility
across file systems
Researcher B
No scalable visibility
across file systems
ls
grep
find
df
No understanding of file type, users, file
age across all NAS systems
rsync/scp
Data
generation
rates grow
Igneous can help protect
Backup daily and
archive forever
Centrally Managed
Cloud Account
Igneous VM
Devices
Researcher A
Researcher B
ls
grep
find
df
Analysis and Storage
HPC Cluster
NAS 1 NAS 2
Igneous can help protect,find
Backup daily and
archive forever
Centrally Managed
Cloud Account
Igneous VM
Devices
Researcher A
Visibility into
directories
Researcher B
Shares views of data
ls
grep
find
df
Analysis and Storage
HPC Cluster
NAS 1 NAS 2
Visibility into file type, users, file age
across all NAS systems
Igneous can help protect,find,move
Backup daily and
archive forever
Centrally Managed
Cloud Compute
and Storage
Igneous VM
Devices
Researcher A
Visibility into
directories
Researcher B
Shares views of data
ls
grep
find
df
Analysis and Storage
HPC Cluster
NAS 1 NAS 2
Visibility into file type, users, file age
across all NAS systems
Native File
Transfer
We’re here to help you get started
Free DataDiscover until September
• Sign up at igneous.io
• Delivered as a service, installed and running in under an
hour
• marko@igneous.io
Thank you, and on to our next presenters
DataDiscover DataProtect DataFlow

Más contenido relacionado

La actualidad más candente

Is Hadoop a Necessity for Data Science
Is Hadoop a Necessity for Data ScienceIs Hadoop a Necessity for Data Science
Is Hadoop a Necessity for Data ScienceEdureka!
 
Altman RDAP11 Policy-based Data Management
Altman RDAP11 Policy-based Data ManagementAltman RDAP11 Policy-based Data Management
Altman RDAP11 Policy-based Data ManagementASIS&T
 
Smith RDAP11 NSF Data Management Plan Case Studies
Smith RDAP11 NSF Data Management Plan Case StudiesSmith RDAP11 NSF Data Management Plan Case Studies
Smith RDAP11 NSF Data Management Plan Case StudiesASIS&T
 
Global registries initiative frumkin omodei
Global registries initiative frumkin omodeiGlobal registries initiative frumkin omodei
Global registries initiative frumkin omodeiASIS&T
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystemVarsha Khodiyar
 
Big data analytics - Introduction to Big Data and Hadoop
Big data analytics - Introduction to Big Data and HadoopBig data analytics - Introduction to Big Data and Hadoop
Big data analytics - Introduction to Big Data and HadoopSamiraChandan
 
LIBER Webinar: Are the FAIR Data Principles really fair?
LIBER Webinar: Are the FAIR Data Principles really fair?LIBER Webinar: Are the FAIR Data Principles really fair?
LIBER Webinar: Are the FAIR Data Principles really fair?LIBER Europe
 
No Free Lunch: Metadata in the life sciences
No Free Lunch:  Metadata in the life sciencesNo Free Lunch:  Metadata in the life sciences
No Free Lunch: Metadata in the life sciencesChris Dwan
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational ScienceChelle Gentemann
 
“Filling the digital preservation gap” an update from the Jisc Research Data ...
“Filling the digital preservation gap”an update from the Jisc Research Data ...“Filling the digital preservation gap”an update from the Jisc Research Data ...
“Filling the digital preservation gap” an update from the Jisc Research Data ...Jenny Mitcham
 
Data curation
Data curationData curation
Data curationealtmyer
 
DataTags, The Tags Toolset, and Dataverse Integration
DataTags, The Tags Toolset, and Dataverse IntegrationDataTags, The Tags Toolset, and Dataverse Integration
DataTags, The Tags Toolset, and Dataverse IntegrationMichael Bar-Sinai
 
Metadata stores systems in use 20180322
Metadata stores systems in use 20180322Metadata stores systems in use 20180322
Metadata stores systems in use 20180322Keith Russell
 
D4Science Data Infrastructure - Facilitator for a FAIR Data Management
D4Science Data Infrastructure - Facilitator for a FAIR Data ManagementD4Science Data Infrastructure - Facilitator for a FAIR Data Management
D4Science Data Infrastructure - Facilitator for a FAIR Data ManagementBlue BRIDGE
 

La actualidad más candente (20)

Is Hadoop a Necessity for Data Science
Is Hadoop a Necessity for Data ScienceIs Hadoop a Necessity for Data Science
Is Hadoop a Necessity for Data Science
 
Altman RDAP11 Policy-based Data Management
Altman RDAP11 Policy-based Data ManagementAltman RDAP11 Policy-based Data Management
Altman RDAP11 Policy-based Data Management
 
Smith RDAP11 NSF Data Management Plan Case Studies
Smith RDAP11 NSF Data Management Plan Case StudiesSmith RDAP11 NSF Data Management Plan Case Studies
Smith RDAP11 NSF Data Management Plan Case Studies
 
Global registries initiative frumkin omodei
Global registries initiative frumkin omodeiGlobal registries initiative frumkin omodei
Global registries initiative frumkin omodei
 
Data sharing as part of the research ecosystem
Data sharing as part of the research ecosystemData sharing as part of the research ecosystem
Data sharing as part of the research ecosystem
 
Big data analytics - Introduction to Big Data and Hadoop
Big data analytics - Introduction to Big Data and HadoopBig data analytics - Introduction to Big Data and Hadoop
Big data analytics - Introduction to Big Data and Hadoop
 
LIBER Webinar: Are the FAIR Data Principles really fair?
LIBER Webinar: Are the FAIR Data Principles really fair?LIBER Webinar: Are the FAIR Data Principles really fair?
LIBER Webinar: Are the FAIR Data Principles really fair?
 
No Free Lunch: Metadata in the life sciences
No Free Lunch:  Metadata in the life sciencesNo Free Lunch:  Metadata in the life sciences
No Free Lunch: Metadata in the life sciences
 
Data Management Planning for Researchers - An Introduction - 2015-02-18 - Un...
Data Management Planning for Researchers -  An Introduction - 2015-02-18 - Un...Data Management Planning for Researchers -  An Introduction - 2015-02-18 - Un...
Data Management Planning for Researchers - An Introduction - 2015-02-18 - Un...
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational Science
 
“Filling the digital preservation gap” an update from the Jisc Research Data ...
“Filling the digital preservation gap”an update from the Jisc Research Data ...“Filling the digital preservation gap”an update from the Jisc Research Data ...
“Filling the digital preservation gap” an update from the Jisc Research Data ...
 
Data Management Planning for Researchers - An Introduction - 2015-11-04 - Un...
 Data Management Planning for Researchers - An Introduction - 2015-11-04 - Un... Data Management Planning for Researchers - An Introduction - 2015-11-04 - Un...
Data Management Planning for Researchers - An Introduction - 2015-11-04 - Un...
 
Data curation
Data curationData curation
Data curation
 
DataTags, The Tags Toolset, and Dataverse Integration
DataTags, The Tags Toolset, and Dataverse IntegrationDataTags, The Tags Toolset, and Dataverse Integration
DataTags, The Tags Toolset, and Dataverse Integration
 
Metadata stores systems in use 20180322
Metadata stores systems in use 20180322Metadata stores systems in use 20180322
Metadata stores systems in use 20180322
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 
D4Science Data Infrastructure - Facilitator for a FAIR Data Management
D4Science Data Infrastructure - Facilitator for a FAIR Data ManagementD4Science Data Infrastructure - Facilitator for a FAIR Data Management
D4Science Data Infrastructure - Facilitator for a FAIR Data Management
 
White Manipulating Metadata to Enhance Access
White Manipulating Metadata to Enhance AccessWhite Manipulating Metadata to Enhance Access
White Manipulating Metadata to Enhance Access
 
Data Management Planning for Researchers - 2016-02-08 - University of Oxford
Data Management Planning for Researchers - 2016-02-08 - University of OxfordData Management Planning for Researchers - 2016-02-08 - University of Oxford
Data Management Planning for Researchers - 2016-02-08 - University of Oxford
 
Writing a Research Data Management Plan - 2016-11-09 - University of Oxford
Writing a Research Data Management Plan - 2016-11-09 - University of OxfordWriting a Research Data Management Plan - 2016-11-09 - University of Oxford
Writing a Research Data Management Plan - 2016-11-09 - University of Oxford
 

Similar a Data Visibility and Protection at the Scale of Life Sciences

Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Sarah Anna Stewart
 
Data management for TA's
Data management for TA'sData management for TA's
Data management for TA'saaroncollie
 
Research Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsResearch Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsAaron Collie
 
Managing the research life cycle
Managing the research life cycleManaging the research life cycle
Managing the research life cycleSherry Lake
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersRebekah Cummings
 
Analytics with unified file and object
Analytics with unified file and object Analytics with unified file and object
Analytics with unified file and object Sandeep Patil
 
Policy-based Data Management
Policy-based Data Management Policy-based Data Management
Policy-based Data Management Gary Wilhelm
 
John morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptxJohn morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptxARDC
 
OU Library Research Support webinar: Working with research data
OU Library Research Support webinar: Working with research dataOU Library Research Support webinar: Working with research data
OU Library Research Support webinar: Working with research dataIzzyChad
 
Data management plans
Data management plansData management plans
Data management plansBrad Houston
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data ManagementJamie Bisset
 
Webinar: What Your Object Storage Vendor Isn’t Telling You About NFS Support
Webinar: What Your Object Storage Vendor Isn’t Telling You About NFS SupportWebinar: What Your Object Storage Vendor Isn’t Telling You About NFS Support
Webinar: What Your Object Storage Vendor Isn’t Telling You About NFS SupportStorage Switzerland
 
Intelligent Cloud Enablement
Intelligent Cloud EnablementIntelligent Cloud Enablement
Intelligent Cloud EnablementDocuLynx
 

Similar a Data Visibility and Protection at the Scale of Life Sciences (20)

Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
 
Data management for TA's
Data management for TA'sData management for TA's
Data management for TA's
 
Research Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering StudentsResearch Data Management Fundamentals for MSU Engineering Students
Research Data Management Fundamentals for MSU Engineering Students
 
Managing the research life cycle
Managing the research life cycleManaging the research life cycle
Managing the research life cycle
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
 
Sept 24 NISO Virtual Conference: Library Data in the Cloud
Sept 24 NISO Virtual Conference: Library Data in the CloudSept 24 NISO Virtual Conference: Library Data in the Cloud
Sept 24 NISO Virtual Conference: Library Data in the Cloud
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Analytics with unified file and object
Analytics with unified file and object Analytics with unified file and object
Analytics with unified file and object
 
Resources for Research Data Managers - 2014-05-28 - University of Oxford
Resources for Research Data Managers - 2014-05-28 - University of OxfordResources for Research Data Managers - 2014-05-28 - University of Oxford
Resources for Research Data Managers - 2014-05-28 - University of Oxford
 
Policy-based Data Management
Policy-based Data Management Policy-based Data Management
Policy-based Data Management
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
John morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptxJohn morrissey c3 dis fair working data.pptx
John morrissey c3 dis fair working data.pptx
 
OU Library Research Support webinar: Working with research data
OU Library Research Support webinar: Working with research dataOU Library Research Support webinar: Working with research data
OU Library Research Support webinar: Working with research data
 
Introduction to RDM for trainee physicians
Introduction to RDM for trainee physiciansIntroduction to RDM for trainee physicians
Introduction to RDM for trainee physicians
 
Data management plans
Data management plansData management plans
Data management plans
 
Managing your research data
Managing your research dataManaging your research data
Managing your research data
 
Research Data Management
Research Data ManagementResearch Data Management
Research Data Management
 
Webinar: What Your Object Storage Vendor Isn’t Telling You About NFS Support
Webinar: What Your Object Storage Vendor Isn’t Telling You About NFS SupportWebinar: What Your Object Storage Vendor Isn’t Telling You About NFS Support
Webinar: What Your Object Storage Vendor Isn’t Telling You About NFS Support
 
Intelligent Cloud Enablement
Intelligent Cloud EnablementIntelligent Cloud Enablement
Intelligent Cloud Enablement
 

Último

module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptxAlMamun560346
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Silpa
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Mohammad Khajehpour
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)AkefAfaneh2
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Joonhun Lee
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxRizalinePalanog2
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑Damini Dixit
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Servicenishacall1
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformationAreesha Ahmad
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Servicemonikaservice1
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Monika Rani
 

Último (20)

module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
Feature-aligned N-BEATS with Sinkhorn divergence (ICLR '24)
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptxSCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
SCIENCE-4-QUARTER4-WEEK-4-PPT-1 (1).pptx
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 

Data Visibility and Protection at the Scale of Life Sciences

  • 1. Data Ecosystems that Accelerate Scientific Research• Moderator: – Allison Proffitt • Speakers: – Adam Marko • Data Visibility and Protection at the Scale of Life Sciences – Adam Kraut • Building Data Ecosystems for Accelerated Scientific Discovery – Fernanda Foertter • With the Power of AI Comes Great Responsibility – Jonathan Stokes • A Deep Learning Approach to Antibiotic Discovery • Panel Discussion
  • 2. Data Management and Visibility at the Scale of Life Sciences Adam Marko Scientific Solutions Lead April 2020
  • 3. About me • 10+ Years in Research IT • Drug discovery, AgBio, NGS Diagnostics, Research IT consulting • Every organization had storage and data management issues • Help customers protect and understand their life science data
  • 4. Agenda • Challenges extracting value from data • Igneous core technology • Use cases – Data Search and Visibility – Data Protection – Data Movement • How Igneous can help
  • 5. 5 Maximizing the Value of your Data Can’t protect dataCan’t find my data Data protection becomes difficult above 500TB, especially to the cloud Backup requires using vendor specific replication,breaking at scale Multiple instrument and storage silos, requires datacenter space End users spend 20%+ of their time looking for data Simple search does not exist for research file data Cannot index data at petabyte scale fast enough to keep index relevant Lost Time Decreased productivity Data remains undiscovered Vendor lock-in Decreased Productivity Increased Risk These challenges exist now, and only get worse as research scales Can’t move data Legacy tools are vendor specific or one-and-done, constrained to NAS and expensive Open-source is manual and labor intensive, slowing collaboration New workflows require data next to compute. Compute is more distributed than ever Increased costs Slowing collaboration Increased Risk
  • 6. 6 Igneous SaaS Solutions Built for Scale Can’t protect dataCan’t find my data Can’t move data ● Trillion file index ● 400K files per second ● Multithreaded with Dynamic Load Balancing ● Latency Monitoring ● Data compression in flight ● Installed as a VM DataDiscover DataProtect DataFlow ● Won’t break at any scale ● Faster than any other solution ● Takes advantage of full network bandwidth ● User jobs unaffected by data movement ● Cost effective use of cloud ● Get up and running quick/not a project Igneous was built for challenging file environments
  • 7. Finding your data wherever it lives DataDiscover
  • 8. What does finding your data mean? Live Views to customize file visualization Search for the data that’s important Share with collaborators Enabling visibility across NAS systems Users draw new insights from the data Time is saved Research is accelerated
  • 9. Live views of data View of data by extensions Live View Extension Filter Capacity & Count (Match Rate)
  • 10. Live views of data View of data owned by Groups Live View Group
  • 11. Search in the live view Search for specific keywords to narrow results in the live view Search Results
  • 12. Share with collaborators Share live views with researchers, executives, data-owners for further exploration Share
  • 13. Protecting and Moving your Data DataProtect DataFlow
  • 14. Robust protection and movement at scale Daily Backup to protect raw data and results Meet SLAs cost-effectively High Performance File Movement without babysitting Building confidence with users Peace of mind for researchers Native file transfer anywhere
  • 15. Common Research Infrastructure HPC Cluster Analysis and Storage Devices Researcher A Researcher B NAS 1 NAS 2
  • 16. HPC Cluster Analysis and Storage Devices Data Generation Rates Continue with no Backup or Visibility NAS Storage is full and not backed up NAS 1 NAS 2 Researcher A No scalable visibility across file systems Researcher B No scalable visibility across file systems ls grep find df rsync/scp Data generation rates grow
  • 17. HPC Cluster Analysis and Storage Devices Creating Silos and Lost Data Personal cloud account Prosumer NAS USB HDD NAS Storage is full and not backed up NAS 1 NAS 2 Researcher A No scalable visibility across file systems Researcher B No scalable visibility across file systems ls grep find df No understanding of file type, users, file age across all NAS systems rsync/scp Data generation rates grow
  • 18. Igneous can help protect Backup daily and archive forever Centrally Managed Cloud Account Igneous VM Devices Researcher A Researcher B ls grep find df Analysis and Storage HPC Cluster NAS 1 NAS 2
  • 19. Igneous can help protect,find Backup daily and archive forever Centrally Managed Cloud Account Igneous VM Devices Researcher A Visibility into directories Researcher B Shares views of data ls grep find df Analysis and Storage HPC Cluster NAS 1 NAS 2 Visibility into file type, users, file age across all NAS systems
  • 20. Igneous can help protect,find,move Backup daily and archive forever Centrally Managed Cloud Compute and Storage Igneous VM Devices Researcher A Visibility into directories Researcher B Shares views of data ls grep find df Analysis and Storage HPC Cluster NAS 1 NAS 2 Visibility into file type, users, file age across all NAS systems Native File Transfer
  • 21. We’re here to help you get started Free DataDiscover until September • Sign up at igneous.io • Delivered as a service, installed and running in under an hour • marko@igneous.io Thank you, and on to our next presenters DataDiscover DataProtect DataFlow