SlideShare una empresa de Scribd logo
1 de 35
Descargar para leer sin conexión
Challenges in Life Sciences data management and
cloud enabled collaboration

Barry Wark, Ph.D.
Founder and President, Physion



barry@physion.us
Twitter @barryjwark
Barry Wark

              PyXG




              Voyeur

             Symphony
The nature of scientific research has changed, challenging
        the fundamentals of the scientific method

Life scientists need solutions that help them bridge local
                needs with global resources

                Think globally, act locally
The nature of scientific research has changed
fundamentally

   Biology is a context dependent system. Studying
       context dependence requires lots of data.
‣Data volume                                     ‣ Analytical tools
 • High-content screening: desktop confocal        • Central computing resources, elastic
   can image 25,000 samples per day                  provisioning

 • Human genome $5000, and falling fast            • Open source software democratizes
                                                     contribution and distribution
 • IonWorks Barracuda® can perform 6,000
   whole-cell patch clamp experiments per hour   ‣Teams
‣Data variety                                      • Experimental and analytical specialization

 • “Coherent” data sets (e.g. Sage, Personal       • Research cores and consortia
   Genome Project)
                                                   • Distributed across organizations and
 • Behavior, anatomy, physiology, genomics           institutions
   experiments on the same subject
Pipelined data flow through computational resources




 Researcher
                       Analyst           Result/Report
  dataset
Data that is not easily pipelined doesn’t get
incorporated



  Researcher
   dataset
                  Not scalable

  Researcher
                         Analyst                Result/Report
   dataset



  Researcher
   dataset
Analysis provenance that transits individual
researchers is hard to track
  Researcher
   dataset



  Researcher
                         Analyst               Result/Report
   dataset



  Researcher
   dataset




  Researcher
                          Analyst              Result/Report
   dataset
Comprehensive data management must span the
entire data lifecycle

                                      Enterprise SDMS
Complexity/Cost




                                                        Analytical tools




                                ELN
                                                                              OSF
                  Paper notebook                                             Figshare


                  Acquisition                                     Analysis

                                Data lifecycle stage
Comprehensive data management must span the
entire data lifecycle

                                      Enterprise SDMS
Complexity/Cost




                                                        Analytical tools


                                       Ovation


                                ELN
                                                                              OSF
                  Paper notebook                                             Figshare


                  Acquisition                                     Analysis

                                Data lifecycle stage
Ovation’s data model describes science

Ovation is built to represent the language of science. Scientific data, regardless of
discipline, fits this model.

        Analogous example shows that representing music in the appropriate language of the domain
                                  provides an appropriate data model




       Music, in the language of the domain expert.     Computer representation in the language of
             May include margin notes, etc.             the domain expert (including “margin notes”
                                                        from composer, conductor, etc.). Any genre
                                                                of music is representable.

              Lab notebook representation                          Ovation representation             11
Ubiquitous data model is the correct granularity for
knowledge transfer

Ovation’s data model is more granular than an ELN. Instead of loosing information
during conversion to (and from) a report format such as a Word document or PDF,
Ovation allows data to be transferred in the natural language and granularity of
science.



                                            Information lost in transfer




Analogous example shows that transferring data via a “report” (a sound recording) produces an information bottleneck




                                            Data transferred directly




                     Seamless collaboration and data transfer removes information bottlenecks                      12
Common data model enables collaboration

Interoperability across institutional boundaries is easier with Ovation than other
solutions. Unlike ad-hoc or customized data management systems, every Ovation
customer uses the same data model.




   Individual                                                       Global
                                Collaborators
  researcher                                                      community


                        Data transfer via Ovation data model



                                                                                     13
The Ovation data model for subject definition



                          Protocol                 Project
      Subject


                            Epoch                Experiment
                           Procedure




      Subject
                {
                    species : Drosophila melanogaster,
                    father : 79326326-9CC0-4770-8DC6-3695113C7A64,
                    mother : A2D40CFF-3016-41AE-AC67-BB09A7D8D9E1
                }
The Ovation data model for measurements

                         Project

           Subject

Protocol                Experiment    EquipmentSetup


           Epoch
           Procedure



                       Measurement
                        DataElement
The Ovation data model for analysis provenance



  AnalysisRecord
                   Optionally named


                      AnalysisRecord   Named
                                               DataElement


  Measurement      Optionally named
    DataElement
  Measurement
    DataElement
  Measurement
    DataElement
Ovation architecture


                     http://ovaiton.io
              ACL                        ACL

                                               Object
 Cloudant
                                               storage




CouchDB        CouchDB                          Local file cache
  hidden            visible
Ovation uses eventual consistency


Ovation chooses availability and partition tolerance
over consistency


 (so you can work from the coffee shop)



 X2
  1    Y1           X2’
                     1     Y1                 X1       Y1

 Client 1           Client 2                   Cloud
Ovation uses eventual consistency

This means conflicting edits can be made by
disconnected clients

Append-only (mostly) and user-isolated
changes at the edge of the object graph
minimize these conflicts.


 X2
  1    Y1          X2’
                    1     Y1          X2     X2’
                                              1    Y1

 Client 1           Client 2                  Cloud
Ovation uses eventual consistency

Ovation requires users to resolve conflicts that
they have authority to decide during sync.




 X2
  3    Y1           X2’
                     3     Y1          X2     X2’
                                               3      Y1

 Client 1           Client 2                      Cloud
Ovation Scientific Data Management System®
• Comprehensive data management

  • Multi-modality

  • Multi-user annotation

  • Analysis provenance

• Seamless user experience

  • Double-click installation

  • Integration with existing tools: Matlab, Python,
    R, Java

  • Guide to success

• Effective collaboration

  • Distributed and co-located experts

  • Data ownership maintained

  • Cloud-based replication and archiving
Integrated analysis workflow


 Analysis pipelines that begin with a search, facilitate
       automatic incorporation of new results


Acquire          Organize                   Search                 Analyze

          %% Run a simple query
          iterator = context.query('Epoch', ' ...criteria... ');

          while(iterator.hasNext())
              currEpoch = itrator.next();
              ...analyze currEpoch...
          end
Integrated analysis workflow




  Acquire                         Organize

                                                                         Search                           Analyze

  Acquire                         Organize


Replication technology allows Ovation to replicate a subset of the database for data locality within a computational cluster.

            Execute workflows on a local or cloud cluster
context = NewDataStoreCoordinator('username', password).getContext();
epochs = context.query(context.getQuery('query-name'));


%% analysis parameters
params = struct();
params.MaxLag = 1000; % time window for cross-correlation function
params.ResponseDelayPts = 0; % exclude at end of modulated light
params.MinAnalysisEpochs = 3;
params.FrequencyCutoff = 500;
params.FlushData = 1;

%% ANALYZE AND COLLECT RESULTS
         ====> ORIGINAL ANALYSIS CODE HERE <====
%% save analysis record for this figure
ar = project.insertAnalysisRecord('Figure 1’, epochs,
'AnalysisFunction.m', params, svnRevision, svnURL);
ar.setUserDescription('Manuscript - Figure 1');
ar.addTag(<manuscript>);
ar.addOutput('Figure 1a’, './Figure1a.pdf');
ar.addResource('Figure 1b’, './Figure1b.pdf');
Share data in context

                                  DerivedResponse
            Trial
                                 name: spikes
                                 parameters: {…}
                                 code: spikes.m




 Stimulus           Response




                               ovation:///f694d05a-131b-4644-aa7c-f6e8934e60c0/




                                                                                  DerivedResponse
                                                               Trial
                                                                                  name: spikes
                                                                                  parameters: {…}
                                                                                  code: spikes.m




                                                    Stimulus           Response
Share data in context


 Project                                                               Source




                       Experiment        Experiment
 Device

                       Trial Group


                                                                            DerivedResponse
               Trial                 Trial               Trial
                                                                            name: spikes
                                                                            parameters: {…}
                                                                            code: spikes.m




                                              Stimulus           Response
    Stimulus            Response
Ovation enables researchers to extract more
      knowledge from existing data
      • Lab’s lifetime work was enough data to answer fundamental questions about signal
        and noise in the early visual system
      • Data was locked in individual’s ad-hoc data management
      • Ovation enabled meta-analysis of this existing data
        • New graduate students start with the old data, not new experiments et al. • Arrestin Competition
 (38):11867–11879                                                        Doan


psin is pro-
 d for each
e transduc-
convert the
nge in cur-
mptions, we
␣ and ␥0/␴
 the single-
  GRK1ϩ/Ϫ,                 “Ovation has changed the way we do science…”
                                                                     —Fred Rieke
able 2). Be-
Our vision: living data sets


          Data




                               Data




          Data
Our vision: living data sets


                          Data




                          Data




                          Data
ovation.io

• Store and archive all your data         • Make your data available wherever you
                                            need it

  • Safe, secure, highly reliable cloud
    storage                                 • Replicate and synchronize data to
                                              multiple devices

  • “Offline” archiving
                                          • Benefit from our scalable cloud-based
                                            architecture
• Collaborate locally and globally

                                            • Pay for what you use
  • Share selected data with designated
    users or the public
                                            • Simple monthly fee
Data replication with ovation.io
Neuron
   Inference in Visual Adaptation


                 Collaboration with ovation.io

                                                                                                             >sp|P63252|1-427
                                                                                                             MGSVRTNRYSIVSSEEDGMKLATMAVANGFG
                                                                                                             NGKSKVHTRQQCRSRFVKKDGHCNVQFIN
                                                                                                             VGEKGQRYLADIFTTCVDIRWRWMLVIFCLA
                                                                                                             FVLSWLFFGCVFWLIALLHGDLDASKEGK
                                                                                                             ACVSEVNSFTAAFLFSIETQTTIGYGFRCVT
                                                                                                             DECPIAVFMVVFQSIVGCIIDAFIIGAVM
                                                                                                             AKMAKPKKRNETLVFSHNAVIAMRDGKLCLM
                                                                                                             WRVGNLRKSHLVEAHVRAQLLKSRITSEG
                                                                                                             EYIPLDQIDINVGFDSGIDRIFLVSPITIVH
                                                                                                             EIDEDSPLYDLSKQDIDNADFEIVVILEG
                                                                                                             MVEATAMTTQCRSSYLANEILWGHRYEPVLF
                                                                                                             EEKHYYKVDYSRFHKTYEVPNTPLCSARD
                                                                                                             LAEKKYILSNANSFCYENEVALTSKEEDDSE
                                                                                                             NGVPESTSTDTPPDIDLHNQASVPLEPRP
                                                                                                             LRRESEI


an Increase in Temporal Contrast Depends on the Period between Contrast Switches
  RGC (holding potential 10 mV) in response to a single switch in stimulus contrast (6%–36%,
n (A) and 32 s in (B).
als as in (A) and (B). Exponential fits to the response following an increase in contrast are shown in red.

   Figure 1. The Time Course of Adaptation following an Increase in Temporal Contrast Depends on the Period between Contrast Switches
 nt (mean ± SEM) of the exponential fit to the response following an increase in contrast (6%–36%) for
 OFF) as a function of stimulus switching period.

   (A and B) Inhibitory synaptic current to an OFF-transient RGC (holding potential 10 mV) in response to a single switch in stimulus contrast (6%–36%,
 Meister, 2002; nonrectified, the r.m.s. current was fit with the same function.
   mean $400 R*/rod/s; red). The switching period was 16 s in (A) and 32 s in (B).
ynamics of the The exponential amplitude A and baseline c did not change
   (C and D) significantly as a function of the switching period approximately 100 trials as in (A) and (B). Exponential fits to the response following an increase in contrast are shown in red.
                Mean synaptic currents from (not shown).
                  Figure 1E shows the population average time constant as
   (E) Population-averaged (n z 10 for each period) time constant (mean ± SEM) of the exponential fit to the response following an increase in contrast (6%–36%) for
                a function of period. The average effective time constant of
                adaptation scales approximately linearly across a broad range
 stall RGC types (ON, OFF-sustained, OFF-transient, and ON-OFF) as a function of stimulus switching period.
                of switching periods ($8–32 s). The observed scaling fails for
 ion depend on         short periods but extends to the longest period (T = 32 s) that
  eriodic switch       we could measure reliably. A similar relationship was observed
scribed below,         when comparing the time constant of an exponential fit to only
 se in contrast        the first 8 s of 8, 16, and 32 s periods (not shown). Thus the effect
   et al., 2001; Smirnakis et al., 1997; Baccus and Meister, 2002;
ptic currents in
                       is not simply the result of fitting an exponential to a nonexponen-
                       tial response over varying time windows. These results indicate
                                                                                                                            nonrectified, the r.m.s. current was fit with the same function.
   Kim and Rieke, 2001). Here we focus on the dynamics of the
 a stimulus that
period of 16 s
                       that a fixed first-order process does not govern the dynamics
                       of contrast adaptation in mouse retina. Instead, the adapting
                                                                                                                            The exponential amplitude A and baseline c did not change
   slow component of adaptation.
d across trials
trast stimulus,
                       machinery has access to multiple timescales.
                       Dynamics of Adaptation to Luminance
                                                                                                                            significantly as a function of the switching period (not shown).
 synaptic input
 urse of several
                       To test the generality of multiple-timescale dynamics of adapta-
                       tion, we measured responses to periodic changes in mean light
                                                                                                                              Figure 1E shows the population average time constant as
   Contrast and Luminance Adaptation
slow relaxation        intensity (luminance). As for contrast adaptation, the dynamics of
                                                                                                                            a function of period. The average effective time constant of
Analysis provenance that transits individual
researchers is hard to track
  Researcher
   dataset



  Researcher
                         Analyst               Result/report
   dataset



  Researcher
   dataset




  Researcher
                          Analyst              Result/report
   dataset
Ovation enables integration of non-pipeline data, and
comprehensive analysis provenance
  Researcher
   dataset



  Researcher
                        Analyst            Result/report
   dataset



  Researcher
   dataset




  Researcher
                         Analyst           Result/report
   dataset
Getting started with Ovation

                                      ✓Signup
                                      ✓Download
                                      ✓Get started




http://ovation.io   info@ovation.io       @ovation_io

Más contenido relacionado

Último

Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxnegromaestrong
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfChris Hunter
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docxPoojaSen20
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Shubhangi Sonawane
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 

Último (20)

Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Making and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdfMaking and Justifying Mathematical Decisions.pdf
Making and Justifying Mathematical Decisions.pdf
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 

Destacado

AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Applitools
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at WorkGetSmarter
 

Destacado (20)

AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 

Uw e sciences institute april 2013

  • 1. Challenges in Life Sciences data management and cloud enabled collaboration Barry Wark, Ph.D. Founder and President, Physion barry@physion.us Twitter @barryjwark
  • 2. Barry Wark PyXG Voyeur Symphony
  • 3. The nature of scientific research has changed, challenging the fundamentals of the scientific method Life scientists need solutions that help them bridge local needs with global resources Think globally, act locally
  • 4.
  • 5. The nature of scientific research has changed fundamentally Biology is a context dependent system. Studying context dependence requires lots of data. ‣Data volume ‣ Analytical tools • High-content screening: desktop confocal • Central computing resources, elastic can image 25,000 samples per day provisioning • Human genome $5000, and falling fast • Open source software democratizes contribution and distribution • IonWorks Barracuda® can perform 6,000 whole-cell patch clamp experiments per hour ‣Teams ‣Data variety • Experimental and analytical specialization • “Coherent” data sets (e.g. Sage, Personal • Research cores and consortia Genome Project) • Distributed across organizations and • Behavior, anatomy, physiology, genomics institutions experiments on the same subject
  • 6. Pipelined data flow through computational resources Researcher Analyst Result/Report dataset
  • 7. Data that is not easily pipelined doesn’t get incorporated Researcher dataset Not scalable Researcher Analyst Result/Report dataset Researcher dataset
  • 8. Analysis provenance that transits individual researchers is hard to track Researcher dataset Researcher Analyst Result/Report dataset Researcher dataset Researcher Analyst Result/Report dataset
  • 9. Comprehensive data management must span the entire data lifecycle Enterprise SDMS Complexity/Cost Analytical tools ELN OSF Paper notebook Figshare Acquisition Analysis Data lifecycle stage
  • 10. Comprehensive data management must span the entire data lifecycle Enterprise SDMS Complexity/Cost Analytical tools Ovation ELN OSF Paper notebook Figshare Acquisition Analysis Data lifecycle stage
  • 11. Ovation’s data model describes science Ovation is built to represent the language of science. Scientific data, regardless of discipline, fits this model. Analogous example shows that representing music in the appropriate language of the domain provides an appropriate data model Music, in the language of the domain expert. Computer representation in the language of May include margin notes, etc. the domain expert (including “margin notes” from composer, conductor, etc.). Any genre of music is representable. Lab notebook representation Ovation representation 11
  • 12. Ubiquitous data model is the correct granularity for knowledge transfer Ovation’s data model is more granular than an ELN. Instead of loosing information during conversion to (and from) a report format such as a Word document or PDF, Ovation allows data to be transferred in the natural language and granularity of science. Information lost in transfer Analogous example shows that transferring data via a “report” (a sound recording) produces an information bottleneck Data transferred directly Seamless collaboration and data transfer removes information bottlenecks 12
  • 13. Common data model enables collaboration Interoperability across institutional boundaries is easier with Ovation than other solutions. Unlike ad-hoc or customized data management systems, every Ovation customer uses the same data model. Individual Global Collaborators researcher community Data transfer via Ovation data model 13
  • 14. The Ovation data model for subject definition Protocol Project Subject Epoch Experiment Procedure Subject { species : Drosophila melanogaster, father : 79326326-9CC0-4770-8DC6-3695113C7A64, mother : A2D40CFF-3016-41AE-AC67-BB09A7D8D9E1 }
  • 15. The Ovation data model for measurements Project Subject Protocol Experiment EquipmentSetup Epoch Procedure Measurement DataElement
  • 16. The Ovation data model for analysis provenance AnalysisRecord Optionally named AnalysisRecord Named DataElement Measurement Optionally named DataElement Measurement DataElement Measurement DataElement
  • 17. Ovation architecture http://ovaiton.io ACL ACL Object Cloudant storage CouchDB CouchDB Local file cache hidden visible
  • 18. Ovation uses eventual consistency Ovation chooses availability and partition tolerance over consistency (so you can work from the coffee shop) X2 1 Y1 X2’ 1 Y1 X1 Y1 Client 1 Client 2 Cloud
  • 19. Ovation uses eventual consistency This means conflicting edits can be made by disconnected clients Append-only (mostly) and user-isolated changes at the edge of the object graph minimize these conflicts. X2 1 Y1 X2’ 1 Y1 X2 X2’ 1 Y1 Client 1 Client 2 Cloud
  • 20. Ovation uses eventual consistency Ovation requires users to resolve conflicts that they have authority to decide during sync. X2 3 Y1 X2’ 3 Y1 X2 X2’ 3 Y1 Client 1 Client 2 Cloud
  • 21. Ovation Scientific Data Management System® • Comprehensive data management • Multi-modality • Multi-user annotation • Analysis provenance • Seamless user experience • Double-click installation • Integration with existing tools: Matlab, Python, R, Java • Guide to success • Effective collaboration • Distributed and co-located experts • Data ownership maintained • Cloud-based replication and archiving
  • 22. Integrated analysis workflow Analysis pipelines that begin with a search, facilitate automatic incorporation of new results Acquire Organize Search Analyze %% Run a simple query iterator = context.query('Epoch', ' ...criteria... '); while(iterator.hasNext()) currEpoch = itrator.next(); ...analyze currEpoch... end
  • 23. Integrated analysis workflow Acquire Organize Search Analyze Acquire Organize Replication technology allows Ovation to replicate a subset of the database for data locality within a computational cluster. Execute workflows on a local or cloud cluster
  • 24. context = NewDataStoreCoordinator('username', password).getContext(); epochs = context.query(context.getQuery('query-name')); %% analysis parameters params = struct(); params.MaxLag = 1000; % time window for cross-correlation function params.ResponseDelayPts = 0; % exclude at end of modulated light params.MinAnalysisEpochs = 3; params.FrequencyCutoff = 500; params.FlushData = 1; %% ANALYZE AND COLLECT RESULTS ====> ORIGINAL ANALYSIS CODE HERE <==== %% save analysis record for this figure ar = project.insertAnalysisRecord('Figure 1’, epochs, 'AnalysisFunction.m', params, svnRevision, svnURL); ar.setUserDescription('Manuscript - Figure 1'); ar.addTag(<manuscript>); ar.addOutput('Figure 1a’, './Figure1a.pdf'); ar.addResource('Figure 1b’, './Figure1b.pdf');
  • 25. Share data in context DerivedResponse Trial name: spikes parameters: {…} code: spikes.m Stimulus Response ovation:///f694d05a-131b-4644-aa7c-f6e8934e60c0/ DerivedResponse Trial name: spikes parameters: {…} code: spikes.m Stimulus Response
  • 26. Share data in context Project Source Experiment Experiment Device Trial Group DerivedResponse Trial Trial Trial name: spikes parameters: {…} code: spikes.m Stimulus Response Stimulus Response
  • 27. Ovation enables researchers to extract more knowledge from existing data • Lab’s lifetime work was enough data to answer fundamental questions about signal and noise in the early visual system • Data was locked in individual’s ad-hoc data management • Ovation enabled meta-analysis of this existing data • New graduate students start with the old data, not new experiments et al. • Arrestin Competition (38):11867–11879 Doan psin is pro- d for each e transduc- convert the nge in cur- mptions, we ␣ and ␥0/␴ the single- GRK1ϩ/Ϫ, “Ovation has changed the way we do science…” —Fred Rieke able 2). Be-
  • 28. Our vision: living data sets Data Data Data
  • 29. Our vision: living data sets Data Data Data
  • 30. ovation.io • Store and archive all your data • Make your data available wherever you need it • Safe, secure, highly reliable cloud storage • Replicate and synchronize data to multiple devices • “Offline” archiving • Benefit from our scalable cloud-based architecture • Collaborate locally and globally • Pay for what you use • Share selected data with designated users or the public • Simple monthly fee
  • 31. Data replication with ovation.io
  • 32. Neuron Inference in Visual Adaptation Collaboration with ovation.io >sp|P63252|1-427 MGSVRTNRYSIVSSEEDGMKLATMAVANGFG NGKSKVHTRQQCRSRFVKKDGHCNVQFIN VGEKGQRYLADIFTTCVDIRWRWMLVIFCLA FVLSWLFFGCVFWLIALLHGDLDASKEGK ACVSEVNSFTAAFLFSIETQTTIGYGFRCVT DECPIAVFMVVFQSIVGCIIDAFIIGAVM AKMAKPKKRNETLVFSHNAVIAMRDGKLCLM WRVGNLRKSHLVEAHVRAQLLKSRITSEG EYIPLDQIDINVGFDSGIDRIFLVSPITIVH EIDEDSPLYDLSKQDIDNADFEIVVILEG MVEATAMTTQCRSSYLANEILWGHRYEPVLF EEKHYYKVDYSRFHKTYEVPNTPLCSARD LAEKKYILSNANSFCYENEVALTSKEEDDSE NGVPESTSTDTPPDIDLHNQASVPLEPRP LRRESEI an Increase in Temporal Contrast Depends on the Period between Contrast Switches RGC (holding potential 10 mV) in response to a single switch in stimulus contrast (6%–36%, n (A) and 32 s in (B). als as in (A) and (B). Exponential fits to the response following an increase in contrast are shown in red. Figure 1. The Time Course of Adaptation following an Increase in Temporal Contrast Depends on the Period between Contrast Switches nt (mean ± SEM) of the exponential fit to the response following an increase in contrast (6%–36%) for OFF) as a function of stimulus switching period. (A and B) Inhibitory synaptic current to an OFF-transient RGC (holding potential 10 mV) in response to a single switch in stimulus contrast (6%–36%, Meister, 2002; nonrectified, the r.m.s. current was fit with the same function. mean $400 R*/rod/s; red). The switching period was 16 s in (A) and 32 s in (B). ynamics of the The exponential amplitude A and baseline c did not change (C and D) significantly as a function of the switching period approximately 100 trials as in (A) and (B). Exponential fits to the response following an increase in contrast are shown in red. Mean synaptic currents from (not shown). Figure 1E shows the population average time constant as (E) Population-averaged (n z 10 for each period) time constant (mean ± SEM) of the exponential fit to the response following an increase in contrast (6%–36%) for a function of period. The average effective time constant of adaptation scales approximately linearly across a broad range stall RGC types (ON, OFF-sustained, OFF-transient, and ON-OFF) as a function of stimulus switching period. of switching periods ($8–32 s). The observed scaling fails for ion depend on short periods but extends to the longest period (T = 32 s) that eriodic switch we could measure reliably. A similar relationship was observed scribed below, when comparing the time constant of an exponential fit to only se in contrast the first 8 s of 8, 16, and 32 s periods (not shown). Thus the effect et al., 2001; Smirnakis et al., 1997; Baccus and Meister, 2002; ptic currents in is not simply the result of fitting an exponential to a nonexponen- tial response over varying time windows. These results indicate nonrectified, the r.m.s. current was fit with the same function. Kim and Rieke, 2001). Here we focus on the dynamics of the a stimulus that period of 16 s that a fixed first-order process does not govern the dynamics of contrast adaptation in mouse retina. Instead, the adapting The exponential amplitude A and baseline c did not change slow component of adaptation. d across trials trast stimulus, machinery has access to multiple timescales. Dynamics of Adaptation to Luminance significantly as a function of the switching period (not shown). synaptic input urse of several To test the generality of multiple-timescale dynamics of adapta- tion, we measured responses to periodic changes in mean light Figure 1E shows the population average time constant as Contrast and Luminance Adaptation slow relaxation intensity (luminance). As for contrast adaptation, the dynamics of a function of period. The average effective time constant of
  • 33. Analysis provenance that transits individual researchers is hard to track Researcher dataset Researcher Analyst Result/report dataset Researcher dataset Researcher Analyst Result/report dataset
  • 34. Ovation enables integration of non-pipeline data, and comprehensive analysis provenance Researcher dataset Researcher Analyst Result/report dataset Researcher dataset Researcher Analyst Result/report dataset
  • 35. Getting started with Ovation ✓Signup ✓Download ✓Get started http://ovation.io info@ovation.io @ovation_io