Outlining some ideas...
      Darren Oakley - WTSI
        do2@sanger.ac.uk
Ideas for what?


WP4 - Web Portal
How we can all work together
WP4
Web Portal
WP4 objectives
Create a site to display current repository
information
Create DAS tracks to display this information in
its genomic context
Create a Biomart. The Biomart will serve DAS
tracks, provide query web services, and link to
other Biomarts (including EnsMart), greatly
enhancing the search capability and future utility
of the repository
The idea...
De-centralize the data, everyone who
wants in on the portal: use Biomart!
 Standardized
 Web services and DAS out of the box
This makes the data open to all
We promise not to take over the world
The idea...

2 interfaces:
 New MartView interface (advanced search) - Damian
 Google-like search (simple search - “MartSearch”) - Us
The idea...
Turn the portal into a Biomart mashup!
  “In web development, a mashup is a Web
  application that combines data from one or more
  sources into a single integrated tool. The term
  Mashup implies easy, fast integration,
  frequently done by access to open APIs and data
  sources to produce results that were not the
  original reason for producing the raw source
  data” - Wikipedia
Implementation
100% Javascript-driven user interface
The user goes to the portal and enters a search
term; this is fired against a cloud of
Biomarts, and a coherent response
comes back
No complex controller logic (it shouldn’t
need any)
Javascript?!? Aaargh!!
 The old days...
   Browser incompatibilities, clunky performance
 Now...
   Javascript is fast!
      Chrome, Firefox 3.1, Safari 4, IE 8
   Libraries take care of the cross-browser issues
Obligatory Architecture Drawings
Plan A
 HTTP request to MartSearch
 MartSearch sends a single Martservice XML query
 Biomart-based federation joins the marts
Drawbacks
 You can only federate across 2 marts
 Search times can vary greatly with federation
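
A Plan A request could look roughly like the sketch below, which builds one Martservice XML query containing two Dataset elements so that Biomart performs the join itself. The dataset, filter and attribute names are hypothetical placeholders, not names taken from the real marts, and the helper functions are illustrative only.

```javascript
// Escape the characters that are unsafe inside an XML attribute value.
function escapeXml(value) {
  return String(value)
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;')
    .replace(/"/g, '&quot;');
}

// Render one <Dataset> element from { name, filters: {k: v}, attributes: [...] }.
function datasetXml(dataset) {
  const filters = Object.entries(dataset.filters || {}).map(
    ([name, value]) => `<Filter name="${escapeXml(name)}" value="${escapeXml(value)}"/>`
  );
  const attributes = (dataset.attributes || []).map(
    (name) => `<Attribute name="${escapeXml(name)}"/>`
  );
  return `<Dataset name="${escapeXml(dataset.name)}" interface="default">` +
    filters.join('') + attributes.join('') + '</Dataset>';
}

// Plan A: a single query, at most two datasets, joined by Biomart.
function buildFederatedQuery(datasets) {
  if (datasets.length < 1 || datasets.length > 2) {
    throw new Error('A federated Martservice query takes one or two datasets');
  }
  return '<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE Query>' +
    '<Query virtualSchemaName="default" formatter="TSV" header="0" uniqueRows="1">' +
    datasets.map(datasetXml).join('') + '</Query>';
}

// Hypothetical dataset and field names, for illustration only.
const federatedXml = buildFederatedQuery([
  { name: 'genes', filters: { marker_symbol: 'GeneA' }, attributes: ['marker_symbol', 'mgi_accession_id'] },
  { name: 'products', attributes: ['status'] },
]);
console.log(federatedXml);
```

The two-dataset check mirrors the limit noted on the slide; a third mart would need a second round trip.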
Plan B
 HTTP request to MartSearch
 MartSearch sends a Martservice XML query to each mart
 and performs the federation on the fly
Drawbacks
 Searching on more than one attribute
 requires many XML requests per mart
 No way to page results
 No way of doing OR queries
 No way of doing loose text queries
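
The "federation on the fly" step of Plan B amounts to a client-side join on a shared linking ID. Below is a minimal sketch, assuming each mart returns tab-separated rows and that all marts expose the same ID column; the column names and the canned responses are made up for illustration.

```javascript
// Parse a tab-separated Martservice response into row objects,
// using the attribute names that were requested, in order.
function parseTsv(text, columns) {
  return text
    .split('\n')
    .filter((line) => line.trim() !== '')
    .map((line) => {
      const cells = line.split('\t');
      const row = {};
      columns.forEach((column, i) => { row[column] = cells[i] || ''; });
      return row;
    });
}

// Group the rows from every mart under their shared linking ID.
function federate(resultsByMart, linkingId) {
  const merged = new Map();
  for (const [martName, rows] of Object.entries(resultsByMart)) {
    for (const row of rows) {
      const key = row[linkingId];
      if (!key) continue; // rows without the linking ID cannot be joined
      if (!merged.has(key)) merged.set(key, { [linkingId]: key });
      const entry = merged.get(key);
      if (!entry[martName]) entry[martName] = [];
      entry[martName].push(row);
    }
  }
  return Array.from(merged.values());
}

// Canned responses standing in for two separate Martservice calls.
const geneRows = parseTsv('MGI:0000001\tGeneA\nMGI:0000002\tGeneB\n', ['mgi_accession_id', 'marker_symbol']);
const productRows = parseTsv('MGI:0000001\tES cells\n', ['mgi_accession_id', 'product']);
const joined = federate({ genes: geneRows, products: productRows }, 'mgi_accession_id');
console.log(JSON.stringify(joined, null, 2));
```

Because the join happens only after every mart has answered, there is no cheap way to fetch just one page of results, which is one of the drawbacks listed above.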
Plan C
 0  Index the searchable fields from the biomarts
 1  HTTP request to MartSearch: send query to Lucene-based
    search index and retrieve paged list of genes and linking IDs
 2  Martservice XML query to each mart
Advantages
 FAST search results
 Can do loose text and OR queries
 Pagination
 Solr takes care of the federation for you
Drawbacks
 One more software stack to accommodate
 Need to re-build index after mart rebuild
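
Step 1 of Plan C can be sketched as building a paged Solr query and pulling the linking IDs out of the JSON response. The sketch uses Solr's standard q, start, rows and wt parameters; the base URL and the field names are assumptions for illustration, not the portal's real configuration.

```javascript
// Build a Solr select URL for one page of results.
// `page` is 1-based; Solr itself expects a 0-based `start` offset.
function solrPageUrl(baseUrl, query, page, pageSize) {
  const params = {
    q: query,
    start: (page - 1) * pageSize,
    rows: pageSize,
    wt: 'json',
  };
  const queryString = Object.entries(params)
    .map(([key, value]) => `${key}=${encodeURIComponent(value)}`)
    .join('&');
  return `${baseUrl}/select?${queryString}`;
}

// Pull the linking IDs out of a Solr JSON response so that step 2
// only asks each mart about the genes on the current page.
function linkingIds(solrResponse, idField) {
  return solrResponse.response.docs.map((doc) => doc[idField]);
}

// Hypothetical index URL and field names; note the OR and wildcard query.
const pageUrl = solrPageUrl('http://localhost:8983/solr', 'marker_symbol:Cbx* OR synonym:Cbx*', 3, 10);
console.log(pageUrl);

// A trimmed-down response in the shape Solr returns for wt=json.
const sampleResponse = {
  response: { numFound: 42, start: 20, docs: [{ mgi_accession_id: 'MGI:0000001' }] },
};
console.log(linkingIds(sampleResponse, 'mgi_accession_id'));
```

The IDs returned for a page can then be passed as filter values in the per-mart Martservice queries of step 2.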
Demo
http://www.i-dcc.org/dev/martsearch/
Home
Search
Refined searches
Fast, flexible searching
Customizable
  Add and remove data sources from the display
  Restrict the data coming back from each source
Extensible
  Adding in new data sources should be easy
  Custom templates for every data source
Open
  Anyone can access the data and index (via services)
  Anyone can get the code
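
One way to picture the customizable and extensible points above is a small registry with one entry per data source. This is a hypothetical sketch, not the MartSearch configuration format: the source names, the `enabled` flag, the attribute lists and the plain-function templates (standing in for real template files) are all assumptions.

```javascript
// Hypothetical registry: one entry per data source.
const dataSources = [
  {
    name: 'genes',
    enabled: true,                              // shown in the display
    attributes: ['marker_symbol', 'chromosome'], // data allowed back from this source
    template: (row) => `${row.marker_symbol} (chr ${row.chromosome})`,
  },
  {
    name: 'products',
    enabled: false,                             // removed from the display
    attributes: ['product'],
    template: (row) => `Product: ${row.product}`,
  },
];

// Render rows for every enabled source, keeping only its configured attributes.
function renderSources(sources, rowsBySource) {
  return sources
    .filter((source) => source.enabled)
    .map((source) => {
      const rows = rowsBySource[source.name] || [];
      const lines = rows.map((row) => {
        const restricted = {};
        source.attributes.forEach((attribute) => { restricted[attribute] = row[attribute]; });
        return source.template(restricted);
      });
      return { source: source.name, lines };
    });
}

// Made-up rows for illustration.
const rendered = renderSources(dataSources, {
  genes: [{ marker_symbol: 'GeneA', chromosome: '4', internal_note: 'not shown' }],
  products: [{ product: 'ES cells' }],
});
console.log(JSON.stringify(rendered));
```

Adding a new data source then means adding one more entry with its own template, without touching the rendering code.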
How it works...
Apache Solr
(http://lucene.apache.org/solr)
 Enterprise-grade search server built
 upon Lucene
 Web service driven
 Represents each search object as
 a document
Document XML
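
The XML shown on this slide is not preserved in the text. As a rough stand-in, the sketch below builds a document in Solr's XML update format (add / doc / field), with array values written as repeated field elements. The field names and values are hypothetical, and field names are assumed to be safe identifiers that need no escaping.

```javascript
// Escape text for use inside an XML element.
function escapeText(value) {
  return String(value)
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;');
}

// Turn a plain object into a Solr <doc>. Array values become repeated
// <field> elements, which is how the XML format expresses multi-valued fields.
function solrDoc(record) {
  const fields = [];
  for (const [name, value] of Object.entries(record)) {
    const values = Array.isArray(value) ? value : [value];
    for (const v of values) {
      fields.push(`<field name="${name}">${escapeText(v)}</field>`);
    }
  }
  return `<doc>${fields.join('')}</doc>`;
}

// Wrap any number of documents in a single <add> update message.
function solrAdd(records) {
  return `<add>${records.map(solrDoc).join('')}</add>`;
}

// Hypothetical gene document: one searchable object, one Solr document.
const addXml = solrAdd([
  { mgi_accession_id: 'MGI:0000001', marker_symbol: 'GeneA', synonym: ['GnA', 'G&A'] },
]);
console.log(addXml);
```

Posting such a message to the Solr update handler, followed by a commit, is what step 0 (indexing the searchable fields) would do after each mart rebuild.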
How it works...
jQuery (http://jquery.com)
jQuery UI (http://jqueryui.com)
EJS (http://embeddedjs.com)
ActiveRecord.js
(http://activerecordjs.org)
Jamal (http://jamal-mvc.com)
Moving forward...
Make (and/or integrate) more marts
  MGI, Komp-DCC, Eurexpress, GXD, EuroPhenome
Portal branding, design, colour, layout
How to represent the data
  Dictated by the type of user...
     Who are our users and what do
     they want from us?!?!?
Get the code!
http://github.com/dazoakley/martsearch/
Working together
Typical scenario
Each group says...
 I’ll take this task - will send you the
 results when it’s ready


If we’re (very) lucky, we get something
sort of coherent in the end
We can be better
   than this!
What we should do...
Open source code on a public repository
  Github, Google Code, Sourceforge
  Or even one of our own - as long as it's public
Shared bug tracking / support and wiki
  Github (wiki) + Lighthouse (bug tracking)
  Google Code / Sourceforge
  Host an instance of Redmine or Trac
Get the code!
http://github.com/dazoakley/martsearch/

Outlining some ideas... - I-DCC Kick Off Meeting
