SlideShare una empresa de Scribd logo
1 de 32
Descargar para leer sin conexión
ELN Architecture


                        Simon Coles
            President & CTO, Amphora Research Systems
So...

    •   You’re on holiday one day
    •   Doing your normal thing
    •   And then you get the call...
    •   they want an ELN!




    http://www.amphora-research.com/
2
http://www.amphora-research.com/
3
ELN architecture

    •   Hopefully
         •   I am not going to self-destruct
         •   Your project won’t be as exciting

    •   Your task is to
         •   Deliver a state-of-the-art ELN system
         •   In tight timescales
         •   With limited budget
         •   In the real world
         •   That the users like
         •   And will serve you for many years




    http://www.amphora-research.com/
4
Introduction

    •   About me
         •   Started working with ELNs in ‘96
         •   President & Co-founder of Amphora
         •   IT background
    •   First ELN was enterprise-scale ELN for Kodak
         •   Worldwide, 1,000’s of users, diverse user base
         •   Completely Electronic Records (no paper)
    •   After a long & windy road
         •   New products, lots more deployments, many industries
         •   Certain amount of realism about ELN implementation
         •   Provide Patent Evidence Creation & Preservation
             Systems
         •   Work with a wide variety of “ELN” systems etc.
         •   Now based in the US & UK


    http://www.amphora-research.com/
5
This presentation

    •   You can download a copy of this presentation from
        our web site




    http://www.amphora-research.com/
6
Why does architecture matter?

    •   A good architecture can help
         •   Integrate “Best of breed” tools with existing investments
         •   Allow you to split the project into manageable pieces
         •   Ensure you don’t get “captured” by the vendor
         •   Help your system withstand the ravages of time
         •   Keep your TCO down
    •   A bad architecture will hurt
         •   Reliability, Scalability problems
         •   Reduce your options going forward
         •   Force you into “Big bang” project

    •   Some random thoughts on architecture




    http://www.amphora-research.com/
7
ELN architecture

    •   Major issues
         •   Diversity & Flexibility
         •   Project size/Justification/ROI
         •   Creating & Preserving Evidence for Patents
         •   Need for long term access to ELN contents
         •   Scalability
         •   Web-based systems
         •   How your network can help you
    •   Trends
         •   Integration methods
         •   Open Source
         •   In the lab
         •   Ones to watch



    http://www.amphora-research.com/
8
Diversity & Flexibility

    •   “Science” covers a wide variety of activity
    •   Each of these is served by its own industry
    •   Improvements in each area needs to happen at its
        own pace
    •   Things change
         •   Different techniques
         •   New data types
         •   Another R&D centre
         •   New devices for use in the lab
    •   The very essence of “Research” is to change the
        way you work
    •   How do we design an ELN which can
        accommodate these changes?


    http://www.amphora-research.com/
9
Dealing with change

     •   Build on other projects & integrate
          •   if it can be done within another project, then do so
          •   Keeps your life simpler and more focused, clear aims
          •   Those other projects can proceed according to the
              rhythm and needs of the specific area
     •   Where possible employ loose coupling between
         systems
          •   Message passing reduces implementation complexity
          •   SOAP/OLE/XML etc.




     http://www.amphora-research.com/
10
Loosely-Coupled
                Systems Keep You
                      Sane


     http://www.amphora-research.com/
11
Project size/Justification/ROI

     •   Two approaches
     •   Either attempt to justify the whole ELN in one go
         (“Big bang”)
     •   Or Phased
          •   Divide the project into phases
                •    Each involves a smaller investment (risk)
                •    With a corresponding payoff
          •   Move forward at a pace that’s comfortable for the
              business




     http://www.amphora-research.com/
12
Phased ELNs

     •   Historically this was very difficult to do with ELNs
          •   Record keeping
          •   Integration with other systems
     •   Needs to be designed into the project (& product)
         from the start
          •   Patent evidence creation/preservation system
          •   Generic science-neutral platform (can often be your
              existing IT infrastructure)
          •   Integrate/collaborate with discipline-specific software
     •   When you can do it, makes a huge difference
          •   Can start at a departmental level if needed
          •   Asking the business to take a small risk each time




     http://www.amphora-research.com/
13
Creating & Preserving Evidence for Patents


     •   Specialized area with very specific (and unique)
         considerations
     •   Best done separately from science-specific ELN
         tools
          •   Hard to reconcile requirements of science and records
              in one system
          •   You’ll often have a number of science-focused systems,
              yet want only one Patent evidence system
          •   Run by a small group of people who know they’ll end
              up in court
          •   Reduce risks & discovery costs
     •   You can have an “Electronic” notebook for the
         scientist and still create a paper record



     http://www.amphora-research.com/
14
Paper or Electronic?

      •            The choice often comes down to
                    •   Comfort
                    •   Practicality
                    •   Cost


                                                           Paper
     System Cost




                                                           Electronic




                   10             100        500    1000




          http://www.amphora-research.com/
15
Long term access to ELN content

     •   Partly this is records management issue
     •   But there’s a heavy technical component
          •   What format you store your data in
          •   How you store your data
          •   Metadata

     •   You need to make Open Data formats part of your
         purchasing requirements




     http://www.amphora-research.com/
16
“Good” (open) file formats

     •   Publicly documented
     •   Legally unencumbered
          •   No patents, copyright concerns etc.
          •   Any patents or copyright must be in the public domain
     •   Ideally, self documenting (XML is a good start)
     •   Degrade gracefully
          •   If you can’t the data, at least you can see a picture
     •   Based on more open, primitive formats where
         possible
     •   At least two implementations of readers, one of
         which is Open Source
     •   Widely used (W3C or IETF standards are good
         signs)


     http://www.amphora-research.com/
17
Data formats for the long term

     •   Good
          •   For text: Plain ASCII, Unicode, HTML, possibly RTF
          •   For graphics: PNG, SVG
          •   For structured data: XML
          •   To preserve appearance: PDF
     •   Worry about
          •   Storing files in databases
                •    The database file format is probably undocumented
                •    Store objects on the file system and use the
                     database to point to them
          •   Anything that is proprietary - there’s no excuse for it,
              and it dramatically increases your risk
          •   Binary files generally
          •   Mixing content in files (e.g. embedding XML in PDF)
          •   Proprietary digital signatures

     http://www.amphora-research.com/
18
IP concerns & data formats

     •   Companies have always used Proprietary Data
         Formats as a competitive weapon
     •   Companies are waking up to the use of IP tools
         (licenses, patents, copyrights) to reinforce their
         control over data formats
     •   Just because a format is published doesn’t mean it
         is open
          •   The Microsoft Office XML formats are a particularly
              bad example
                •    Right now it looks positively radioactive
                •    They’re being very careful what they say which
                     indicates to me they’re planning something
                •    http://www.groklaw.net/article.php?
                     story=20050330133833843
                •    (see section: 4. Dissecting Microsoft’s “Patent License”)

     http://www.amphora-research.com/
19
Standards

     •   There are so many to choose from!
     •   Two key ways of generating “Standards”
          •   De Facto - dominant supplier/format
          •   De Jure - committee based
     •   Who gets to “bless” a standard?
     •   What makes a “good standard”
          •   De Jure process has difficulty keeping up with the real
              world
          •   De Facto process has risk of lock-in
     •   Pragmatic approach
          •   Expect your suppliers to use open file formats
          •   If there is an acceptable standard, use it
          •   Make sure you are using the right kind of format for
              each purpose


     http://www.amphora-research.com/
20
Records considerations

     •   Not all the “Stuff” that’s generated during the
         research process is the same
          •   Some of if needs to be kept for a long time
          •   Some is only useful for the moment
          •   Some will be benefit anyone
          •   Some is only really useful for the person who created it
              (using specialized tools)
     •   Some material is suitable for long term
         preservation, some isn’t
     •   You can go crazy getting into this in too much
         detail
     •   But you also need to make sure your tools and
         processes do allow you to manage the data/
         records you’re creating

     http://www.amphora-research.com/
21
Scalability

     •   Geographical space
          •   In wide area networks, latency becomes the most
              noticeable issue
          •   Over multiple timezones, acceptable “Maintenance
              Windows” disappear
     •   More data
          •   Number of data items
          •   Size of individual data items
     •   Number of users
          •   Larger populations generally mean more disparate
              requirements
          •   How many people will get upset if the system goes
              down



     http://www.amphora-research.com/
22
Latency

     •   The science-specific “Deep” systems
          •   Often highly interactive
                •    Lots of round trips to the server for data etc.
                •    This is what makes them cool
          •   You can’t beat the speed of light (and network
              hardware add significant latency)
          •   Therefore need to have a server close to the end user
          •   Federation will give you a single overview
     •   “Broad” systems have different usage
         characteristics
          •   Very much like a normal web site, latency is much less
              of a problem
          •   Very easy to have one system for worldwide use, even
              for large companies
          •   Building large systems quite easy

     http://www.amphora-research.com/
23
Web-based systems

     •   “Web based” has become a bit of a marketing tool
          •   Generally thin clients offer a lower TCO
          •   And hence IT like them
     •   In practice, most science-supporting ELN front
         ends will be delivered as a “thick” client
          •   There’s a reason it’s called a browser
          •   Wrapping an OLE object in IE is still “thick”
     •   However, “Ajax” systems like GMail and Google
         Maps show just what you can do with a web-based
         system
     •   Web based systems should expose a sensbiel URL
         interface



     http://www.amphora-research.com/
24
How your network can help you

     •   There’s a whole load of useful network services
         and Interfaces that large companies have
     •   Useful ones
          •   Single Sign On
          •   LDAP
          •   Printer/Fileserver etc.
          •   Security/Status monitoring etc.
     •   Beware of Central Digital Signature Infrastructure
          •   Mixing vulnerabilities - leaves you open to accidents
          •   Often not designed for long term use




     http://www.amphora-research.com/
25
ELN architecture

     •   Major issues
          •   Diversity & Flexibility
          •   Project size/Justification/ROI
          •   Creating & Preserving Evidence for Patents
          •   Need for long term access to ELN contents
          •   Scale
          •   Web-based systems
     •   Trends
          •   Integration methods
          •   Open Source
          •   In the lab
          •   Ones to watch




     http://www.amphora-research.com/
26
Integration methods

     •   RPC-like mechanisms
          •   Service Oriented Architecture
          •   SOAP
          •   REST
     •   Text file passing (files, email, etc.)
     •   URL launching
          •   Often overlooked, but very powerful

     •   What’s important
          •   Loose-coupling
          •   Open, lightweight systems
          •   Consistent, stable keys
          •   Stable URL (& domain) space



     http://www.amphora-research.com/
27
Open Source

     •   Definitely one to watch
     •   Not the “Free” lunch you might think, but a
         pragmatic business too
     •   Examples
          •   Linux
          •   Postgres
          •   JBoss,Tomcat etc.
          •   Ghostscript

     •   Open Source is part of everyone’s infrastructure
     •   Make sure you can run your systems on a variety of
         platforms



     http://www.amphora-research.com/
28
Why?

     •   Good for records
          •   Gives you top-to-bottom control
     •   Good for TCO
          •   We’re finding the Open Source infrastructure easier to
              setup and reliable than proprietary alternatives
     •   Enables a better solution
          •   Transparent systems mean you can do things the
              original designers didn't think of
          •   This is especially important for ELNs




     http://www.amphora-research.com/
29
Data point

     •   This is just our experience offering people
         alternatives for the server portion
     •   2000 - “What's Open Source? What’s Linux?”
     •   2001 - No way!
     •   2002 - some pilots underway, some acceptance
     •   2003 - majority of installations are Open Source
         infrastructure
     •   2005 - we’re wondering where Windows is
     •   We’re not abandoning proprietary infrastructure
          •   But it is clear that Open Source is getting serious
              consideration
          •   Seeing a migration away from proprietary infrastructure
              to Open Source



     http://www.amphora-research.com/
30
In the lab

     •   ELN use in the lab is a hard problem
     •   Tablets, Laptops, Palmtops etc. doesn’t seem to be
         working
     •   What does seem to work
          •   Small form-factor PCs on the bench
          •   Remote Desktop & Citrix




     http://www.amphora-research.com/
31
Ones to watch

     •   Technology
          •   XML generally
          •   Web Services
          •   Bluetooth and WiFi
          •   RSS
          •   OpenOffice
          •   Jabber (as computer messaging and IM framework)
     •   Trends
          •   File format nasties
          •   DMCA and other copyright legislation




     http://www.amphora-research.com/
32

Más contenido relacionado

Destacado

Electronic Lab Notebooks
Electronic Lab NotebooksElectronic Lab Notebooks
Electronic Lab NotebooksKristin Briney
 
Most important features when choosing an electronic lab notebook
Most important features when choosing an electronic lab notebookMost important features when choosing an electronic lab notebook
Most important features when choosing an electronic lab notebooksciNote LLC
 
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)Kristin Briney
 
SLAS 2017 - "Multiple Research Platforms: One Single Data Sharing Portal"
SLAS 2017 - "Multiple Research Platforms:  One Single Data Sharing Portal"SLAS 2017 - "Multiple Research Platforms:  One Single Data Sharing Portal"
SLAS 2017 - "Multiple Research Platforms: One Single Data Sharing Portal"CSols, Inc.
 
Standardized Representations of ELN Reactions for Categorization and Duplicat...
Standardized Representations of ELN Reactions for Categorization and Duplicat...Standardized Representations of ELN Reactions for Categorization and Duplicat...
Standardized Representations of ELN Reactions for Categorization and Duplicat...NextMove Software
 

Destacado (6)

Electronic Lab Notebooks
Electronic Lab NotebooksElectronic Lab Notebooks
Electronic Lab Notebooks
 
Most important features when choosing an electronic lab notebook
Most important features when choosing an electronic lab notebookMost important features when choosing an electronic lab notebook
Most important features when choosing an electronic lab notebook
 
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
Lab Notebooks as Data Management (SLA Winter Virtual Conference 2012)
 
SLAS 2017 - "Multiple Research Platforms: One Single Data Sharing Portal"
SLAS 2017 - "Multiple Research Platforms:  One Single Data Sharing Portal"SLAS 2017 - "Multiple Research Platforms:  One Single Data Sharing Portal"
SLAS 2017 - "Multiple Research Platforms: One Single Data Sharing Portal"
 
How NOSQL Paid off for Telenor
How NOSQL Paid off for TelenorHow NOSQL Paid off for Telenor
How NOSQL Paid off for Telenor
 
Standardized Representations of ELN Reactions for Categorization and Duplicat...
Standardized Representations of ELN Reactions for Categorization and Duplicat...Standardized Representations of ELN Reactions for Categorization and Duplicat...
Standardized Representations of ELN Reactions for Categorization and Duplicat...
 

Similar a 2005 04 05 SRI ELN Architecture

Electronic Laboratory Notebooks
Electronic Laboratory NotebooksElectronic Laboratory Notebooks
Electronic Laboratory NotebooksKristin Briney
 
Technical standards & the RDTF Vision: some considerations
Technical standards & the RDTF Vision: some considerationsTechnical standards & the RDTF Vision: some considerations
Technical standards & the RDTF Vision: some considerationsPaul Walk
 
Electronic Signatures
Electronic SignaturesElectronic Signatures
Electronic SignaturesSimon Coles
 
2008 06 17 IP Protection
2008 06 17 IP Protection2008 06 17 IP Protection
2008 06 17 IP ProtectionSimon Coles
 
2010 AIRI Petabyte Challenge - View From The Trenches
2010 AIRI Petabyte Challenge - View From The Trenches2010 AIRI Petabyte Challenge - View From The Trenches
2010 AIRI Petabyte Challenge - View From The TrenchesGeorge Ang
 
2008 04 01 ELNs, Chemistry, Biology
2008 04 01 ELNs, Chemistry, Biology2008 04 01 ELNs, Chemistry, Biology
2008 04 01 ELNs, Chemistry, BiologySimon Coles
 
But we're already open source! Why would I want to bring my code to Apache?
But we're already open source! Why would I want to bring my code to Apache?But we're already open source! Why would I want to bring my code to Apache?
But we're already open source! Why would I want to bring my code to Apache?gagravarr
 
Integrating ECM (WebCenter Content) with your Enterprise! 5 Tips to Try, 5 Tr...
Integrating ECM (WebCenter Content) with your Enterprise! 5 Tips to Try, 5 Tr...Integrating ECM (WebCenter Content) with your Enterprise! 5 Tips to Try, 5 Tr...
Integrating ECM (WebCenter Content) with your Enterprise! 5 Tips to Try, 5 Tr...Brian Huff
 
Managing eResources at Universities
Managing eResources at UniversitiesManaging eResources at Universities
Managing eResources at UniversitiesPK Mishra
 
2008 03 11 ELNs in different industries
2008 03 11 ELNs in different industries2008 03 11 ELNs in different industries
2008 03 11 ELNs in different industriesSimon Coles
 
Design for Scale / Surge 2010
Design for Scale / Surge 2010Design for Scale / Surge 2010
Design for Scale / Surge 2010Christopher Brown
 
HPCC Systems Engineering Summit: Community Use Case: Because Who Has Time for...
HPCC Systems Engineering Summit: Community Use Case: Because Who Has Time for...HPCC Systems Engineering Summit: Community Use Case: Because Who Has Time for...
HPCC Systems Engineering Summit: Community Use Case: Because Who Has Time for...HPCC Systems
 
Evaluating & Selecting the Right Cytometer for your Lab
Evaluating & Selecting the Right Cytometer for your LabEvaluating & Selecting the Right Cytometer for your Lab
Evaluating & Selecting the Right Cytometer for your LabRyan Duggan
 
How to use ORM
How to use ORMHow to use ORM
How to use ORMciconf
 
Querix 4 gl app analyzer 2016 journey to the center of your 4gl application
Querix 4 gl app analyzer 2016 journey to the center of your 4gl applicationQuerix 4 gl app analyzer 2016 journey to the center of your 4gl application
Querix 4 gl app analyzer 2016 journey to the center of your 4gl applicationBeGooden-IT Consulting
 
But We're Already Open Source! Why Would I Want To Bring My Code To Apache?
But We're Already Open Source! Why Would I Want To Bring My Code To Apache?But We're Already Open Source! Why Would I Want To Bring My Code To Apache?
But We're Already Open Source! Why Would I Want To Bring My Code To Apache?gagravarr
 
Software design with Domain-driven design
Software design with Domain-driven design Software design with Domain-driven design
Software design with Domain-driven design Allan Mangune
 
The Economies of Scaling Software
The Economies of Scaling SoftwareThe Economies of Scaling Software
The Economies of Scaling SoftwareAbdelmonaim Remani
 
The economies of scaling software - Abdel Remani
The economies of scaling software - Abdel RemaniThe economies of scaling software - Abdel Remani
The economies of scaling software - Abdel Remanijaxconf
 
Lipstick on a Pig: Integrated Library Systems
Lipstick on a Pig: Integrated Library SystemsLipstick on a Pig: Integrated Library Systems
Lipstick on a Pig: Integrated Library SystemsDorothea Salo
 

Similar a 2005 04 05 SRI ELN Architecture (20)

Electronic Laboratory Notebooks
Electronic Laboratory NotebooksElectronic Laboratory Notebooks
Electronic Laboratory Notebooks
 
Technical standards & the RDTF Vision: some considerations
Technical standards & the RDTF Vision: some considerationsTechnical standards & the RDTF Vision: some considerations
Technical standards & the RDTF Vision: some considerations
 
Electronic Signatures
Electronic SignaturesElectronic Signatures
Electronic Signatures
 
2008 06 17 IP Protection
2008 06 17 IP Protection2008 06 17 IP Protection
2008 06 17 IP Protection
 
2010 AIRI Petabyte Challenge - View From The Trenches
2010 AIRI Petabyte Challenge - View From The Trenches2010 AIRI Petabyte Challenge - View From The Trenches
2010 AIRI Petabyte Challenge - View From The Trenches
 
2008 04 01 ELNs, Chemistry, Biology
2008 04 01 ELNs, Chemistry, Biology2008 04 01 ELNs, Chemistry, Biology
2008 04 01 ELNs, Chemistry, Biology
 
But we're already open source! Why would I want to bring my code to Apache?
But we're already open source! Why would I want to bring my code to Apache?But we're already open source! Why would I want to bring my code to Apache?
But we're already open source! Why would I want to bring my code to Apache?
 
Integrating ECM (WebCenter Content) with your Enterprise! 5 Tips to Try, 5 Tr...
Integrating ECM (WebCenter Content) with your Enterprise! 5 Tips to Try, 5 Tr...Integrating ECM (WebCenter Content) with your Enterprise! 5 Tips to Try, 5 Tr...
Integrating ECM (WebCenter Content) with your Enterprise! 5 Tips to Try, 5 Tr...
 
Managing eResources at Universities
Managing eResources at UniversitiesManaging eResources at Universities
Managing eResources at Universities
 
2008 03 11 ELNs in different industries
2008 03 11 ELNs in different industries2008 03 11 ELNs in different industries
2008 03 11 ELNs in different industries
 
Design for Scale / Surge 2010
Design for Scale / Surge 2010Design for Scale / Surge 2010
Design for Scale / Surge 2010
 
HPCC Systems Engineering Summit: Community Use Case: Because Who Has Time for...
HPCC Systems Engineering Summit: Community Use Case: Because Who Has Time for...HPCC Systems Engineering Summit: Community Use Case: Because Who Has Time for...
HPCC Systems Engineering Summit: Community Use Case: Because Who Has Time for...
 
Evaluating & Selecting the Right Cytometer for your Lab
Evaluating & Selecting the Right Cytometer for your LabEvaluating & Selecting the Right Cytometer for your Lab
Evaluating & Selecting the Right Cytometer for your Lab
 
How to use ORM
How to use ORMHow to use ORM
How to use ORM
 
Querix 4 gl app analyzer 2016 journey to the center of your 4gl application
Querix 4 gl app analyzer 2016 journey to the center of your 4gl applicationQuerix 4 gl app analyzer 2016 journey to the center of your 4gl application
Querix 4 gl app analyzer 2016 journey to the center of your 4gl application
 
But We're Already Open Source! Why Would I Want To Bring My Code To Apache?
But We're Already Open Source! Why Would I Want To Bring My Code To Apache?But We're Already Open Source! Why Would I Want To Bring My Code To Apache?
But We're Already Open Source! Why Would I Want To Bring My Code To Apache?
 
Software design with Domain-driven design
Software design with Domain-driven design Software design with Domain-driven design
Software design with Domain-driven design
 
The Economies of Scaling Software
The Economies of Scaling SoftwareThe Economies of Scaling Software
The Economies of Scaling Software
 
The economies of scaling software - Abdel Remani
The economies of scaling software - Abdel RemaniThe economies of scaling software - Abdel Remani
The economies of scaling software - Abdel Remani
 
Lipstick on a Pig: Integrated Library Systems
Lipstick on a Pig: Integrated Library SystemsLipstick on a Pig: Integrated Library Systems
Lipstick on a Pig: Integrated Library Systems
 

Más de Simon Coles

2015 06-06 Clean Business
2015 06-06 Clean Business2015 06-06 Clean Business
2015 06-06 Clean BusinessSimon Coles
 
Clean Language in Software
Clean Language in SoftwareClean Language in Software
Clean Language in SoftwareSimon Coles
 
2010 01 27 Chairman Opening Remarks
2010 01 27 Chairman Opening Remarks2010 01 27 Chairman Opening Remarks
2010 01 27 Chairman Opening RemarksSimon Coles
 
2008 01 30 ELNs and Compliance
2008 01 30 ELNs and Compliance2008 01 30 ELNs and Compliance
2008 01 30 ELNs and ComplianceSimon Coles
 
2007 09 28 ELNs as Patent Evidence Systems
2007 09 28 ELNs as Patent Evidence Systems2007 09 28 ELNs as Patent Evidence Systems
2007 09 28 ELNs as Patent Evidence SystemsSimon Coles
 
2007 09 26 ELN Working Routine
2007 09 26 ELN Working Routine2007 09 26 ELN Working Routine
2007 09 26 ELN Working RoutineSimon Coles
 
2006 09 26 Beyond Chemistry
2006 09 26 Beyond Chemistry2006 09 26 Beyond Chemistry
2006 09 26 Beyond ChemistrySimon Coles
 
Reconciling Scientists and Lawyers
Reconciling Scientists and LawyersReconciling Scientists and Lawyers
Reconciling Scientists and LawyersSimon Coles
 
2009 04 21 Lessons Learned Eln Implementations
2009 04 21 Lessons Learned Eln Implementations2009 04 21 Lessons Learned Eln Implementations
2009 04 21 Lessons Learned Eln ImplementationsSimon Coles
 

Más de Simon Coles (9)

2015 06-06 Clean Business
2015 06-06 Clean Business2015 06-06 Clean Business
2015 06-06 Clean Business
 
Clean Language in Software
Clean Language in SoftwareClean Language in Software
Clean Language in Software
 
2010 01 27 Chairman Opening Remarks
2010 01 27 Chairman Opening Remarks2010 01 27 Chairman Opening Remarks
2010 01 27 Chairman Opening Remarks
 
2008 01 30 ELNs and Compliance
2008 01 30 ELNs and Compliance2008 01 30 ELNs and Compliance
2008 01 30 ELNs and Compliance
 
2007 09 28 ELNs as Patent Evidence Systems
2007 09 28 ELNs as Patent Evidence Systems2007 09 28 ELNs as Patent Evidence Systems
2007 09 28 ELNs as Patent Evidence Systems
 
2007 09 26 ELN Working Routine
2007 09 26 ELN Working Routine2007 09 26 ELN Working Routine
2007 09 26 ELN Working Routine
 
2006 09 26 Beyond Chemistry
2006 09 26 Beyond Chemistry2006 09 26 Beyond Chemistry
2006 09 26 Beyond Chemistry
 
Reconciling Scientists and Lawyers
Reconciling Scientists and LawyersReconciling Scientists and Lawyers
Reconciling Scientists and Lawyers
 
2009 04 21 Lessons Learned Eln Implementations
2009 04 21 Lessons Learned Eln Implementations2009 04 21 Lessons Learned Eln Implementations
2009 04 21 Lessons Learned Eln Implementations
 

Último

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 

Último (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 

2005 04 05 SRI ELN Architecture

  • 1. ELN Architecture Simon Coles President & CTO, Amphora Research Systems
  • 2. So... • You’re on holiday one day • Doing your normal thing • And then you get the call... • they want an ELN! http://www.amphora-research.com/ 2
  • 4. ELN architecture • Hopefully • I am not going to self-destruct • Your project won’t be as exciting • Your task is to • Deliver a state-of-the-art ELN system • In tight timescales • With limited budget • In the real world • That the users like • And will serve you for many years http://www.amphora-research.com/ 4
  • 5. Introduction • About me • Started working with ELNs in ‘96 • President & Co-founder of Amphora • IT background • First ELN was enterprise-scale ELN for Kodak • Worldwide, 1,000’s of users, diverse user base • Completely Electronic Records (no paper) • After a long & windy road • New products, lots more deployments, many industries • Certain amount of realism about ELN implementation • Provide Patent Evidence Creation & Preservation Systems • Work with a wide variety of “ELN” systems etc. • Now based in the US & UK http://www.amphora-research.com/ 5
  • 6. This presentation • You can download a copy of this presentation from our web site http://www.amphora-research.com/ 6
  • 7. Why does architecture matter? • A good architecture can help • Integrate “Best of breed” tools with existing investments • Allow you to split the project into manageable pieces • Ensure you don’t get “captured” by the vendor • Help your system withstand the ravages of time • Keep your TCO down • A bad architecture will hurt • Reliability, Scalability problems • Reduce your options going forward • Force you into “Big bang” project • Some random thoughts on architecture http://www.amphora-research.com/ 7
  • 8. ELN architecture • Major issues • Diversity & Flexibility • Project size/Justification/ROI • Creating & Preserving Evidence for Patents • Need for long term access to ELN contents • Scalability • Web-based systems • How your network can help you • Trends • Integration methods • Open Source • In the lab • Ones to watch http://www.amphora-research.com/ 8
  • 9. Diversity & Flexibility • “Science” covers a wide variety of activity • Each of these is served by its own industry • Improvements in each area needs to happen at its own pace • Things change • Different techniques • New data types • Another R&D centre • New devices for use in the lab • The very essence of “Research” is to change the way you work • How do we design an ELN which can accommodate these changes? http://www.amphora-research.com/ 9
  • 10. Dealing with change • Build on other projects & integrate • if it can be done within another project, then do so • Keeps your life simpler and more focused, clear aims • Those other projects can proceed according to the rhythm and needs of the specific area • Where possible employ loose coupling between systems • Message passing reduces implementation complexity • SOAP/OLE/XML etc. http://www.amphora-research.com/ 10
  • 11. Loosely-Coupled Systems Keep You Sane http://www.amphora-research.com/ 11
  • 12. Project size/Justification/ROI • Two approaches • Either attempt to justify the whole ELN in one go (“Big bang”) • Or Phased • Divide the project into phases • Each involves a smaller investment (risk) • With a corresponding payoff • Move forward at a pace that’s comfortable for the business http://www.amphora-research.com/ 12
  • 13. Phased ELNs • Historically this was very difficult to do with ELNs • Record keeping • Integration with other systems • Needs to be designed into the project (& product) from the start • Patent evidence creation/preservation system • Generic science-neutral platform (can often be your existing IT infrastructure) • Integrate/collaborate with discipline-specific software • When you can do it, makes a huge difference • Can start at a departmental level if needed • Asking the business to take a small risk each time http://www.amphora-research.com/ 13
  • 14. Creating & Preserving Evidence for Patents • Specialized area with very specific (and unique) considerations • Best done separately from science-specific ELN tools • Hard to reconcile requirements of science and records in one system • You’ll often have a number of science-focused systems, yet want only one Patent evidence system • Run by a small group of people who know they’ll end up in court • Reduce risks & discovery costs • You can have an “Electronic” notebook for the scientist and still create a paper record http://www.amphora-research.com/ 14
  • 15. Paper or Electronic? • The choice often comes down to • Comfort • Practicality • Cost Paper System Cost Electronic 10 100 500 1000 http://www.amphora-research.com/ 15
  • 16. Long term access to ELN content • Partly this is records management issue • But there’s a heavy technical component • What format you store your data in • How you store your data • Metadata • You need to make Open Data formats part of your purchasing requirements http://www.amphora-research.com/ 16
  • 17. “Good” (open) file formats • Publicly documented • Legally unencumbered • No patents, copyright concerns etc. • Any patents or copyright must be in the public domain • Ideally, self documenting (XML is a good start) • Degrade gracefully • If you can’t the data, at least you can see a picture • Based on more open, primitive formats where possible • At least two implementations of readers, one of which is Open Source • Widely used (W3C or IETF standards are good signs) http://www.amphora-research.com/ 17
  • 18. Data formats for the long term • Good • For text: Plain ASCII, Unicode, HTML, possibly RTF • For graphics: PNG, SVG • For structured data: XML • To preserve appearance: PDF • Worry about • Storing files in databases • The database file format is probably undocumented • Store objects on the file system and use the database to point to them • Anything that is proprietary - there’s no excuse for it, and it dramatically increases your risk • Binary files generally • Mixing content in files (e.g. embedding XML in PDF) • Proprietary digital signatures http://www.amphora-research.com/ 18
  • 19. IP concerns & data formats • Companies have always used Proprietary Data Formats as a competitive weapon • Companies are waking up to the use of IP tools (licenses, patents, copyrights) to reinforce their control over data formats • Just because a format is published doesn’t mean it is open • The Microsoft Office XML formats are a particularly bad example • Right now it looks positively radioactive • They’re being very careful what they say which indicates to me they’re planning something • http://www.groklaw.net/article.php? story=20050330133833843 • (see section: 4. Dissecting Microsoft’s “Patent License”) http://www.amphora-research.com/ 19
  • 20. Standards • There are so many to choose from! • Two key ways of generating “Standards” • De Facto - dominant supplier/format • De Jure - committee based • Who gets to “bless” a standard? • What makes a “good standard” • De Jure process has difficulty keeping up with the real world • De Facto process has risk of lock-in • Pragmatic approach • Expect your suppliers to use open file formats • If there is an acceptable standard, use it • Make sure you are using the right kind of format for each purpose http://www.amphora-research.com/ 20
  • 21. Records considerations • Not all the “Stuff” that’s generated during the research process is the same • Some of if needs to be kept for a long time • Some is only useful for the moment • Some will be benefit anyone • Some is only really useful for the person who created it (using specialized tools) • Some material is suitable for long term preservation, some isn’t • You can go crazy getting into this in too much detail • But you also need to make sure your tools and processes do allow you to manage the data/ records you’re creating http://www.amphora-research.com/ 21
  • 22. Scalability • Geographical space • In wide area networks, latency becomes the most noticeable issue • Over multiple timezones, acceptable “Maintenance Windows” disappear • More data • Number of data items • Size of individual data items • Number of users • Larger populations generally mean more disparate requirements • How many people will get upset if the system goes down http://www.amphora-research.com/ 22
  • 23. Latency • The science-specific “Deep” systems • Often highly interactive • Lots of round trips to the server for data etc. • This is what makes them cool • You can’t beat the speed of light (and network hardware add significant latency) • Therefore need to have a server close to the end user • Federation will give you a single overview • “Broad” systems have different usage characteristics • Very much like a normal web site, latency is much less of a problem • Very easy to have one system for worldwide use, even for large companies • Building large systems quite easy http://www.amphora-research.com/ 23
  • 24. Web-based systems • “Web based” has become a bit of a marketing tool • Generally thin clients offer a lower TCO • And hence IT like them • In practice, most science-supporting ELN front ends will be delivered as a “thick” client • There’s a reason it’s called a browser • Wrapping an OLE object in IE is still “thick” • However, “Ajax” systems like GMail and Google Maps show just what you can do with a web-based system • Web based systems should expose a sensbiel URL interface http://www.amphora-research.com/ 24
  • 25. How your network can help you • There’s a whole load of useful network services and Interfaces that large companies have • Useful ones • Single Sign On • LDAP • Printer/Fileserver etc. • Security/Status monitoring etc. • Beware of Central Digital Signature Infrastructure • Mixing vulnerabilities - leaves you open to accidents • Often not designed for long term use http://www.amphora-research.com/ 25
  • 26. ELN architecture • Major issues • Diversity & Flexibility • Project size/Justification/ROI • Creating & Preserving Evidence for Patents • Need for long term access to ELN contents • Scale • Web-based systems • Trends • Integration methods • Open Source • In the lab • Ones to watch http://www.amphora-research.com/ 26
  • 27. Integration methods • RPC-like mechanisms • Service Oriented Architecture • SOAP • REST • Text file passing (files, email, etc.) • URL launching • Often overlooked, but very powerful • What’s important • Loose-coupling • Open, lightweight systems • Consistent, stable keys • Stable URL (& domain) space http://www.amphora-research.com/ 27
  • 28. Open Source • Definitely one to watch • Not the “Free” lunch you might think, but a pragmatic business too • Examples • Linux • Postgres • JBoss,Tomcat etc. • Ghostscript • Open Source is part of everyone’s infrastructure • Make sure you can run your systems on a variety of platforms http://www.amphora-research.com/ 28
  • 29. Why? • Good for records • Gives you top-to-bottom control • Good for TCO • We’re finding the Open Source infrastructure easier to setup and reliable than proprietary alternatives • Enables a better solution • Transparent systems mean you can do things the original designers didn't think of • This is especially important for ELNs http://www.amphora-research.com/ 29
  • 30. Data point • This is just our experience offering people alternatives for the server portion • 2000 - “What's Open Source? What’s Linux?” • 2001 - No way! • 2002 - some pilots underway, some acceptance • 2003 - majority of installations are Open Source infrastructure • 2005 - we’re wondering where Windows is • We’re not abandoning proprietary infrastructure • But it is clear that Open Source is getting serious consideration • Seeing a migration away from proprietary infrastructure to Open Source http://www.amphora-research.com/ 30
  • 31. In the lab • ELN use in the lab is a hard problem • Tablets, Laptops, Palmtops etc. doesn’t seem to be working • What does seem to work • Small form-factor PCs on the bench • Remote Desktop & Citrix http://www.amphora-research.com/ 31
  • 32. Ones to watch • Technology • XML generally • Web Services • Bluetooth and WiFi • RSS • OpenOffice • Jabber (as computer messaging and IM framework) • Trends • File format nasties • DMCA and other copyright legislation http://www.amphora-research.com/ 32