Araport integrates JBrowse visualization software from GMOD. In order to support diverse sets of locally and remotely sourced tracks, the “ComboTrackSelector” JBrowse plugin was developed to enable the capability to partition metadata rich tracks in the “Faceted” selector while using the default “Hierarchical” selector for everything else.
A dynamic sequence viewer add-on, “SeqLighter”, was developed using the BioJS framework (http://biojs.net/), configured offer end-users with the capability to view the genomic sequence underlying the gene models (genic regions plus customizable flanking regions), highlight sub-features (like UTRs, exons, introns, start/stop codons) and export the annotated output in various formats (SVG, PNG, JPEG).
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
JBrowse within the Arabidopsis Information Portal - PAG XXIII
1. araport.org@araport
JBrowse within the Arabidopsis
Information Portal
Vivek Krishnakumar
J. Craig Venter Institute
12/14/2015
GMOD
PAG XXIII, San Diego, CA
3. araport.org@araport
About Araport
• Objectives
– Develop community web interface
• sustainable, fundable and community-extensible
• hosts analysis modules, visualization tools, user data
spaces
– Practice data federation
• integrate diverse data sets from distributed sources
• consume and expose data via RESTful web services
– Maintain “gold standard” Col-0 annotation
• assemble tissue-specific transcripts from publicly
available RNA-seq datasets
• incorporate novel coding and non-coding genes
5. araport.org@araport
Araport Architecture
External programsPortal (www.araport.org)
API (api.araport.org)
Agave Core
meta data
user profile
ADAMA
service manage
service enroll
a b c d e f
CGI
Computing
Storage
Databases
ThaleMine JBrowse
Authentication, metering, logging, versioning, HTTPS, CORS
a b c d e f
Apps
Jobs
Systems
CGI
InterMine
Others
CoGe
SOAP
CGI
REST
Science
Apps
7. araport.org@araport
Tracks at Araport
• Combination of local and remotely located datasets
• Local tracks generated from flat-files converted to JSON (In
the near future, will be served by InterMine JBrowse web
services)
– Assembly, annotation, expression, similarity and variation
related data
– Normally associated with only a single unit of metadata
(e.g. citation or text describing how data was generated)
• Remote tracks are federated from CoGe via their
RESTful API infrastructure
– Epigenomics datasets collated from the EPIC project
– Each dataset is associated with rich set of metadata
tag/value pairs (such as experiment name, technique,
conditions, etc.)
9. araport.org@araport
Configuring and enabling the plugin
ComboTrackSelector
Hierarchical selector Faceted selector
• Uses the standard
configuration file,
trackList.json
• Usually generated
programmatically using
loading scripts like
flatfile-to-json.pl,
etc.
• Adding below plugin config
stanza enables dual
selectors:
"plugins" : {
"location" : "./plugins/ComboTrackSelector",
"name" : "ComboTrackSelector"
},
• Relies on a new
configuration file,
trackList2.json
• Requires supporting
comma-separated
configuration file with
metadata about the data
• Both files above are
either hand curated or
programmatically
generated
11. araport.org@araport
Sequence viewer in JBrowse
• JBrowse feature details windows shows
region sequence and subfeatures
separately
• Members of the Arabidopsis community
were used to working with TAIR
SeqViewer and GBrowse2. But, Araport
promotes usage of JBrowse over
GBrowse2
• Users of our portal were missing this
functionality at Araport and made requests
during our portal “Preview” phase
12. araport.org@araport
Developing a sequence viewer for
JBrowse
• Since JBrowse is client side application,
widget would need to be lightweight JS for
ease of integration
• BioJS (http://biojs.net) library of reusable
JS components was the perfect fit
• Wide variety of components, not limited to:
– Sequence, Chromosome karyotype
– Tree, KEGGViewer
– Cytoscape, Interaction Table
13. araport.org@araport
Sequence viewer widget
for JBrowse
SeqLighter
• Can be configured as a
right click menu option for
any track providing gene
structure features
"menuTemplate" : [
…
{
”label" : "View Sequence",
"iconClass" : "dijitIconDatabase",
"action": "contentDialog",
"content": "function(track,feature,div ){ return
SequenceViewer(track,feature,div)}"
},
…
• Allows toggling annotation
of various gene features
(exon, intron, start/stop,
UTR)
• Flanking region sequences
can be added (0.5 to 4 KB)
• Results can be exported to
SVG, PNG and JPG
15. araport.org@araport
Tidbit (1)
Generate screenshots programmatically
• JBrowse does not currently have feature
analogous to gbrowse_img to generate an
image of desired region and set of tracks
• It is possible however to disable
nav/tracklist/ruler elements from the view, by
setting appropriate URL parameters
• Example:
http://path/to/jbrowse/?tracks=A,B,C&nav=0&
tracklist=0&overview=0
16. araport.org@araport
• Possible to do so, with server side JavaScript
technology, nodejs (http://nodejs.org) and a headless
webkit browser, PhantomJS (http://phantomjs.org)
• Wrapped in command line toolkit, pageres
• Install easily via Node Package manager, npm
$ npm install pageres
• Run simple command to generate PNG screenshots
$ pageres <url> <resolution>
$ pageres –d 15
"http://path/to/jbrowse/?tracks=A,B,C&nav=0&tracklist=0&o
verview=0" 1024x768
Tidbit (1)
Generate screenshots programmatically
17. araport.org@araport
Tidbit (2)
Streaming data files from iPlant
• iPlant Data Store (iDS) provides cloud
storage to researchers (initial allocation
100GB)
• Several methods can be used to transfer data
to and from iDS
https://pods.iplantcollaborative.org/wiki/display/DS/Storing+and+Accessing+Your+Data+in+the+Data+Store
– Discovery Environment web interface
http://de.iplantc.org
– iDrop GUI toolkit
– iCommands command line interface
• Instructions on wiki page describing method
used to generate link(s) to BAM/BAI, VCF,
GFF file on the iDS
https://pods.iplantcollaborative.org/wiki/display/DEmanual/Sending+Genome+Files+to+the+Genome+Browser
19. araport.org@araport
Acknowledgements
• JCVI Team Members
– Engineers: Maria Kim
– Analysts: Chia-Yi Cheng, Benjamin Rosen
• Awesome dev community!
• Matt Bomhoff, Eric Lyons from CoGe
• iPlant Collaborative
• Funding Agencies
20. araport.org@araport
Chris Town, PI
Lisa McDonald
Education and
Outreach
Coordinator
Chris Nelson
Project
Manager
Jason Miller, Co-PI
JCVI Technical Lead
Erik Ferlanti
Software Engineer
Vivek Krishnakumar
Bioinf. Engineer
Svetlana Karamycheva
Bioinf Engineer
Eva Huala
Project lead,
TAIR
Bob Muller
Technical lead,
TAIR
Gos Micklem,
co-PI
Sergio Contrino
Software
Engineer
Matt Vaughn
co-PI Steve Mock
Advanced
Computing
Interfaces
Rion Dooley,
Web and Cloud
Services
Matt Hanlon,
Web and
Mobile
Applications
Maria Kim
Bioinf
Engineer
Ben Rosen
Bioinf
Analyst
Joe Stubbs,
API Developer
Platform
Walter Moreira
API Developer
Federation
Chris
Jordan
Database
Manager
Eleanor
Pence
Intern
Chia-Yi
Cheng
Bioinf
Analyst
Seth Schobel
Bioinf. Engineer
Araport
Team
Irina Belyaeva
Software Engineer