Data generation in the life sciences continues at a rapid pace. There are always risks of data loss, including hardware failures, inability of staff to access data centers, and user error. During challenging times like these, understanding and protecting your data can save lives. Join us to see how you can protect and visualize your files at the scale of Life Sciences, with integrated search, restore, and visibility.
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Data Visibility and Protection at the Scale of Life Sciences
1. Data Ecosystems that Accelerate
Scientific Research• Moderator:
– Allison Proffitt
• Speakers:
– Adam Marko
• Data Visibility and Protection at the Scale of Life Sciences
– Adam Kraut
• Building Data Ecosystems for Accelerated Scientific Discovery
– Fernanda Foertter
• With the Power of AI Comes Great Responsibility
– Jonathan Stokes
• A Deep Learning Approach to Antibiotic Discovery
• Panel Discussion
2. Data Management and Visibility
at the Scale of Life Sciences
Adam Marko
Scientific Solutions Lead
April 2020
3. About me
• 10+ Years in Research IT
• Drug discovery, AgBio, NGS Diagnostics,
Research IT consulting
• Every organization had storage and data
management issues
• Help customers protect and understand
their life science data
4. Agenda
• Challenges extracting value from data
• Igneous core technology
• Use cases
– Data Search and Visibility
– Data Protection
– Data Movement
• How Igneous can help
5. 5
Maximizing the Value of your Data
Can’t protect dataCan’t find my data
Data protection becomes
difficult above 500TB,
especially to the cloud
Backup requires using vendor
specific replication,breaking at
scale
Multiple instrument and storage
silos, requires datacenter space
End users spend 20%+ of their
time looking for data
Simple search does not exist for
research file data
Cannot index data at petabyte
scale fast enough to keep index
relevant
Lost Time
Decreased productivity
Data remains undiscovered
Vendor lock-in
Decreased Productivity
Increased Risk
These challenges exist now, and only get worse as research scales
Can’t move data
Legacy tools are vendor specific or
one-and-done, constrained to NAS
and expensive
Open-source is manual and labor
intensive, slowing collaboration
New workflows require data
next to compute. Compute is
more distributed than ever
Increased costs
Slowing collaboration
Increased Risk
6. 6
Igneous SaaS Solutions Built for Scale
Can’t protect dataCan’t find my data Can’t move data
● Trillion file index
● 400K files per second
● Multithreaded with Dynamic Load Balancing
● Latency Monitoring
● Data compression in flight
● Installed as a VM
DataDiscover DataProtect DataFlow
● Won’t break at any scale
● Faster than any other solution
● Takes advantage of full network bandwidth
● User jobs unaffected by data movement
● Cost effective use of cloud
● Get up and running quick/not a project
Igneous was built for challenging file environments
8. What does finding your data mean?
Live Views to customize file visualization
Search for the data that’s important
Share with collaborators
Enabling visibility across NAS systems
Users draw new insights from the data
Time is saved
Research is accelerated
9. Live views of data
View of data by extensions
Live View
Extension Filter
Capacity & Count
(Match Rate)
10. Live views of data
View of data owned by Groups
Live View
Group
11. Search in the live view
Search for specific keywords to narrow results in the live view
Search
Results
14. Robust protection and movement at scale
Daily Backup to protect raw data and results
Meet SLAs cost-effectively
High Performance File Movement without babysitting
Building confidence with users
Peace of mind for researchers
Native file transfer anywhere
16. HPC Cluster
Analysis and Storage
Devices
Data Generation Rates Continue with no
Backup or Visibility
NAS Storage
is full and not
backed up
NAS 1 NAS 2
Researcher A
No scalable visibility
across file systems
Researcher B
No scalable visibility
across file systems
ls
grep
find
df
rsync/scp
Data
generation
rates grow
17. HPC Cluster
Analysis and Storage
Devices
Creating Silos and Lost Data
Personal cloud
account
Prosumer
NAS
USB HDD
NAS Storage
is full and not
backed up
NAS 1 NAS 2
Researcher A
No scalable visibility
across file systems
Researcher B
No scalable visibility
across file systems
ls
grep
find
df
No understanding of file type, users, file
age across all NAS systems
rsync/scp
Data
generation
rates grow
18. Igneous can help protect
Backup daily and
archive forever
Centrally Managed
Cloud Account
Igneous VM
Devices
Researcher A
Researcher B
ls
grep
find
df
Analysis and Storage
HPC Cluster
NAS 1 NAS 2
19. Igneous can help protect,find
Backup daily and
archive forever
Centrally Managed
Cloud Account
Igneous VM
Devices
Researcher A
Visibility into
directories
Researcher B
Shares views of data
ls
grep
find
df
Analysis and Storage
HPC Cluster
NAS 1 NAS 2
Visibility into file type, users, file age
across all NAS systems
20. Igneous can help protect,find,move
Backup daily and
archive forever
Centrally Managed
Cloud Compute
and Storage
Igneous VM
Devices
Researcher A
Visibility into
directories
Researcher B
Shares views of data
ls
grep
find
df
Analysis and Storage
HPC Cluster
NAS 1 NAS 2
Visibility into file type, users, file age
across all NAS systems
Native File
Transfer
21. We’re here to help you get started
Free DataDiscover until September
• Sign up at igneous.io
• Delivered as a service, installed and running in under an
hour
• marko@igneous.io
Thank you, and on to our next presenters
DataDiscover DataProtect DataFlow