What can Bioinformaticians
  learn from YouTube?
Data
New project. New schema.
EMBL: 20 different data
      formats
“A biologist would rather share their
toothbrush than their (gene) names”
                              Mike Ashburner
YouTube loves to share
100 million views per day
114 apps
<rdf:RDF xmlns="http://www.affymetrix.com/community/publications/affymetrix/tmsplice#"
<Gene rdf:about="#1110002A21Rik">
    <chr>chr1</chr>
    <hasVariant rdf:parseType="Resource">
        <representedBy rdf:resource="#gi13385627"/>
    </hasVariant>
    <hasVariant rdf:parseType="Resource">
        <representedBy rdf:resource="#gi18043402"/>
    </hasVariant>
    <strand>+</strand>
</Gene>
RDF, OWL,
SPARQL, GRDDL, WTF?
Lower case ‘s’, lower case ‘w’


The semantic web, not
 The Semantic Web
<tr>
<th class="two-column">Gene</th>
<td class="two-column"><table width="100%" cellpadding="4">
<tr>
<td><strong>
<a href="http://www.gene.ucl.ac.uk/cgi-bin/nomenclature/get_data.pl?hgnc_id=1101">BRCA2</a></div></strong> (HGNC Symbol)
</td>
<td>
<span class="small"> To view all Ensembl genes linked to the name
<a href="/Homo_sapiens/featureview?type=Gene;id=BRCA2">click here</a>.</span>
</td>
</tr>
</table><p>
This gene is a member of the Human CCDS set: <a href="http://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi?REQUEST=CCDS&amp;DATA=CCDS9344">CCDS9344</a>
</p>
</td>
</tr>
<tr class="hgene">
<th class="two-column">Gene</th>
<td class="two-column"><table width="100%" cellpadding="4">
<tr>
<td><strong>
<a href="http://www.gene.ucl.ac.uk/cgi-bin/nomenclature/get_data.pl?hgnc_id=1101" rel="hgnc_name">BRCA2</a></div></strong>
(HGNC Symbol)
</td>
<td>
<span class="small"> To view all Ensembl genes linked to the name
<a href="/Homo_sapiens/featureview?type=Gene;id=BRCA2" rel="gene_list">click here</a>.</span>
</td>
</tr>
</table><p>
This gene is a member of the Human CCDS set: <a href="http://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi?REQUEST=CCDS&amp;DATA=CCDS9344" rel="ccds">CCDS9344</a>
</p>
</td>
</tr>
Can our web site
  be our API?
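The annotated markup above suggests one way a page could double as an API: a client reads the `rel` attributes instead of guessing at table layout. A minimal sketch, assuming Python's standard `html.parser` and an abbreviated copy of the slide's markup; the class and function names are hypothetical.

```python
from html.parser import HTMLParser


class RelLinkParser(HTMLParser):
    """Collect rel, href and link text for every <a> that carries a rel attribute."""

    def __init__(self):
        super().__init__()
        self.links = []        # finished links, in document order
        self._current = None   # link currently being read, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and "rel" in attrs:
            self._current = {"rel": attrs["rel"], "href": attrs.get("href"), "text": ""}

    def handle_data(self, data):
        if self._current is not None:
            self._current["text"] += data

    def handle_endtag(self, tag):
        if tag == "a" and self._current is not None:
            self._current["text"] = self._current["text"].strip()
            self.links.append(self._current)
            self._current = None


def extract_rel_links(html):
    """Return a dict mapping each rel value to its link record."""
    parser = RelLinkParser()
    parser.feed(html)
    parser.close()
    return {link["rel"]: link for link in parser.links}


# Abbreviated copy of the annotated gene page markup from the slide.
PAGE = """
<tr class="hgene">
<td><strong>
<a href="http://www.gene.ucl.ac.uk/cgi-bin/nomenclature/get_data.pl?hgnc_id=1101" rel="hgnc_name">BRCA2</a></strong> (HGNC Symbol)</td>
<td><a href="/Homo_sapiens/featureview?type=Gene;id=BRCA2" rel="gene_list">click here</a></td>
<td><a href="http://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi?REQUEST=CCDS&amp;DATA=CCDS9344" rel="ccds">CCDS9344</a></td>
</tr>
"""

links = extract_rel_links(PAGE)
print(links["hgnc_name"]["text"])
print(links["ccds"]["href"])
```

The client depends only on the `rel` names, so the page layout can change without breaking it.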
http://www.ensembl.org/Homo_sapiens/geneview?gene=ENSG00000139618



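use Bio::EnsEMBL::Registry;
my $registry     = 'Bio::EnsEMBL::Registry';   # assumes the registry is already loaded
my $gene_adaptor = $registry->get_adaptor( 'Human', 'Core', 'Gene' );

# Fetch the gene through the adaptor, not as a bare function call
my $gene = $gene_adaptor->fetch_by_stable_id( 'ENSG00000139618' );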
More data on our sites
than through the API
   (we’re not the only ones)
RSS
iCal
XML
Representational state transfer


    RESTful service
psd-production/projects
GET


psd-production/projects

        RETRIEVE
<projects>
  <project>
    <id type="integer">8</id>
    <created-at type="datetime">2007-10-22T09:43:30+01:00</created-at>
    <family-id type="integer">3</family-id>
    <name>Test BAC</name>
    <updated-at type="datetime">2007-10-22T09:43:30+01:00</updated-at>
    <user-id type="integer">1</user-id>
    <workspace-id type="integer"/>
  </project>
</projects>
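A response like this needs no special client library. As a rough sketch, assuming the XML shape shown on the slide and Python's standard `xml.etree.ElementTree`, the `type` attributes can drive the conversion to native values; the function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from datetime import datetime

# The response body as shown on the slide.
RESPONSE = """<projects>
  <project>
    <id type="integer">8</id>
    <created-at type="datetime">2007-10-22T09:43:30+01:00</created-at>
    <family-id type="integer">3</family-id>
    <name>Test BAC</name>
    <updated-at type="datetime">2007-10-22T09:43:30+01:00</updated-at>
    <user-id type="integer">1</user-id>
    <workspace-id type="integer"/>
  </project>
</projects>"""


def convert(element):
    """Convert one field element to a Python value using its type attribute."""
    text = element.text
    if text is None or text.strip() == "":
        return None                       # empty element, e.g. <workspace-id/>
    kind = element.get("type")
    if kind == "integer":
        return int(text)
    if kind == "datetime":
        return datetime.fromisoformat(text)
    return text                           # untyped fields stay as strings


def parse_projects(xml_text):
    """Return one dict per <project>, keyed by field name."""
    root = ET.fromstring(xml_text)
    return [{field.tag: convert(field) for field in project}
            for project in root.findall("project")]


projects = parse_projects(RESPONSE)
print(projects[0]["name"], projects[0]["id"])
```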
POST


psd-production/projects

         CREATE
http://psd-production/projects/67
PUT


http://psd-production/projects/67

             UPDATE
DELETE


http://psd-production/projects/67

             DESTROY
No installation
     No setup
No fancy protocols
All you need is curl
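The verb-to-action mapping from the preceding slides can be sketched in a few lines. This is an illustration only: it builds requests with Python's standard `urllib.request` without sending them, since `psd-production` is an internal host. PUT is used for the update step as the conventional verb for that action; whether the service expects PUT or POST there is an assumption, and the `ACTIONS` table and `build_request` helper are hypothetical names.

```python
from urllib.request import Request

BASE = "http://psd-production/projects"   # host taken from the slides

# Action -> (HTTP method, whether the URL must name one project)
ACTIONS = {
    "retrieve": ("GET", False),
    "create":   ("POST", False),
    "update":   ("PUT", True),     # assumption: conventional verb for updates
    "destroy":  ("DELETE", True),
}


def build_request(action, project_id=None, body=None):
    """Build (but do not send) the HTTP request for one action."""
    method, needs_id = ACTIONS[action]
    if needs_id and project_id is None:
        raise ValueError(f"{action} needs a project id")
    url = f"{BASE}/{project_id}" if project_id is not None else BASE
    headers = {"Content-Type": "application/xml"} if body else {}
    return Request(url, data=body, headers=headers, method=method)


req = build_request("destroy", project_id=67)
print(req.get_method(), req.full_url)
```

Passing a request to `urllib.request.urlopen` would send it; the same four calls map directly onto `curl -X` invocations.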
Perl API
Ruby API
Tools
Workflows
Tools
It’s all about the workflow
Trace archive vs SSAHA
Workflows are memes
Users add value
YouTube knows memes
Not invented here!
Reproducibility
Go with the flow
Quickly define workflows
Quickly reuse services
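One way to read these two slides: a workflow is just an ordered chain of small services, and the same service can appear in many chains. A minimal sketch under that assumption; the three services are hypothetical local stand-ins for what would normally be remote tools.

```python
from functools import reduce


def workflow(*services):
    """Chain services so each one receives the previous one's output."""
    def run(data):
        return reduce(lambda value, service: service(value), services, data)
    return run


# Hypothetical stand-in services; real ones would call remote tools.
def clean(seq):
    """Strip whitespace and upper-case a DNA sequence."""
    return "".join(seq.split()).upper()


def reverse_complement(seq):
    pairs = {"A": "T", "T": "A", "C": "G", "G": "C"}
    return "".join(pairs[base] for base in reversed(seq))


def gc_content(seq):
    return (seq.count("G") + seq.count("C")) / len(seq)


# Two workflows that reuse the same `clean` service.
revcomp_flow = workflow(clean, reverse_complement)
gc_flow = workflow(clean, gc_content)

print(revcomp_flow("atg c"))
print(gc_flow("atg c"))
```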
Diagram: Data, Explore, Service, Service, Data
Reuse workflows
YouTube for workflows
           +
   Yahoo! Pipes for
    biological data
           =
 Never having to write
another BLAST parser
Design
Stop hacking
Program to interfaces
“The interface is a contract
between data provider and
     data consumer”
                  Lincoln Stein
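The contract idea can be illustrated with a small typed interface. This is a sketch rather than anything from the talk: `GeneSource`, `InMemoryGenes` and `describe` are hypothetical names, and `typing.Protocol` is one of several ways to express such a contract in Python.

```python
from typing import Protocol


class GeneSource(Protocol):
    """The contract: every data provider must offer this method."""

    def fetch_symbol(self, stable_id: str) -> str: ...


class InMemoryGenes:
    """One provider; a web- or database-backed one could replace it."""

    def __init__(self, records):
        self._records = dict(records)

    def fetch_symbol(self, stable_id: str) -> str:
        return self._records[stable_id]


def describe(source: GeneSource, stable_id: str) -> str:
    """Consumer code written against the contract, not a concrete class."""
    return f"{stable_id} is {source.fetch_symbol(stable_id)}"


genes = InMemoryGenes({"ENSG00000139618": "BRCA2"})
print(describe(genes, "ENSG00000139618"))
```

Because `describe` depends only on the method named in the contract, the provider can be swapped without touching the consumer.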
Design for reuse
Code for maintenance
Foster “accidental
  development”
  114 YouTube apps
However...
Designing for reuse is hard
With great power comes
 great responsibility
Available, Reliable, Accessible
Discoverable:
where is your web site?
Design is for humans
YouTube is ‘only’ an
 online video site
A good UI outweighs
   smart features
“Monolithic solutions
    always fail”
           Graham Cameron
Loose coupling rules
Don’t reinvent Eclipse
Thank you
GREENISGOOD.CO.UK

Slides uploaded by Matt Wood
