SlideShare una empresa de Scribd logo
1 de 27
Descargar para leer sin conexión
From Big Data to Insights:
Opportunities and Challenges
for TEI in Genomics
Orit Shaer, Ali Mazalek, Brygg Ullmer, Miriam K. Konkel
Outline
Introduction to genomics/motivation
Design challenges
Case studies
Opportunities for TEI
Going forward
Genomics
“While the work is a challenge, making genetics
interactive is potentially as
transformative as the move from batch
processing to time sharing”
-Bafna V. et al. Communications of the ACM Jan 2013
Project flow:
Genome Sequencing Project
Sequencing
Centers
High-
throughput
Sequencing
Draft Sequence
Finished Sequence
Sequence Archiving
Genome Annotation
DNA
Sequence
Protein
Prediction
Pathways
Comparative
Analysis
Target Selection
Schkolne, Ishii, and Schroder 2004.
TEI for Scientists
Gillet et al. 2005Brooks et al. 1990
Project GROPE
Tabard, A., et. al 2011. eLabBench.
Challenges
Scale
Heterogeneous Data
Diverse Audience
Scale
Filesystem @ Broad Inst.: 13+PB
One run of an Illumina HiSeq 2500:
6 billion paired-end sequences
(600 gigabases, or 120Gb/day)
Thousand Genomes project:
692 collaborators
110 institutions
>15 groups in (bi-)weekly
conference calls
Blue Waters cluster:
>380K CPU cores
+ >3K GPUs
Heterogeneous Data
Diverse Audience
Genomic Scientists
Citizen Scientist General Public
Future Scientists
How can TEI systems be designed to
• Empower citizens to make informed health decisions?
• Communicate scientific data to communities?
• Enhance learning of complex concepts?
• Support experts interacting with big data?
Challenges
Scale
Heterogeneous Data
Diverse Audience
Case Studies
Tabletop Genome Browsing
& Primer Design
Tangible-targeted
Computational Genomics
Tangibles For Visualizing
Systems Biology
Locate
Learn Retrieve
Annotate
Compare
48.4%
1.0%2.4%
46.6%
1.6%
Human genome: understanding ca. 2012
Mobile elements
Processed pseudogenes
Tandem repeats & low
complexity DNA
Dark matter
Protein & RNA coding
regions
Composition of other primate genomes is very similar
Tangibles-targeted computational genomics
Example projects: rhesus, orangutan, human, marmoset genomes
• Often multi-institution, multi-person efforts
– Above articles: ~250, 100 co-authors
• Often long duration (e.g., 4-6 years before first publication)
• Iterative fusion of computational and “wet bench” analyses
• Some analyses “big CPU” (e.g., 200 cpu cores for weeks);
others, “big RAM” (200+GB RAM)
Tangible Visualization:
persistent representations
of people, projects, activities…
Interactions 2012.07: Entangling space, form, light, time, computational STEAM,
and cultural artifacts
CS3: Systems Biology Modeling
Lessons learned
TEI can facilitate immediate, visible, and easily reversible manipulations
• How to design TEI for open-ended creative inquiries?
Tangible representations can facilitate multi-stage workflows
• Important for execution and tracking of complex analyses
• Need parametrized, annotatable representations of complex large datasets
TEI could facilitate collaboration for distributed and co-located teams
• Large interdisciplinary teams and distributed work are common in this area
• Users can jointly manipulate assumptions and see consequences
Tangible tools can support understanding and discovery
• Provide access to different pieces of the problem (data, reactions)
• Help users forms accurate mental models through tangible/embodied manipulation
Opportunities for TEI Engagement
Understanding Complex Problems
Visualizing Biological Data
Enabling Large Collaborations
Supporting Diverse Audiences
Managing Varied Timescales
Understanding Complex Problems
Enabling Large Collaborations
Managing Varied Timescales
Powers of 10,000:
• Milliseconds
• Minutes
• Months
• Millenia
Entangling Space, Form, Light, Time, Computational STEAM, and Cultural Artifacts
Examples
• Many genome projects: 5+ years
• Sequencing Lincoln’s DNA: under
active discussion since 1991
• Most of us sequenced within decade?
materially impacting all our descendants
Going forward
• Some aspects w/ broad TEI, computational science synergies
• How to visualize and engage data, activity, progress spanning
many systems, people, places, timescales?
• What representational forms, device ecologies, most
appropriate for large, abstract data?
• Facilitating engagement with big data in ways that highlight
connections between multiple forms of evidence
• Some aspects specific to genomics
• 2023: anticipate most of us in room + many thousands of
species having genomes fully or partially sequenced
• Commonalities, distinctions in engagements by scientists,
students, street people, senators, senior citizens, solicitors, …
THANKS!
Orit Shaer: oshaer@wellesley.edu
Ali Mazalek: mazalek@gatech.edu
Brygg Ullmer: ullmer@lsu.edu
Miriam Konkel: konkel@lsu.edu
Consuelo Valdes (Wellesley College) and Andy Wu (Georgia Tech).
This work has been partially funded by NSF IIS-1017693, DRL-
097394084, and CNS-1126739.

Más contenido relacionado

La actualidad más candente

Lessons learned from recent very large-scale disasters in the world
Lessons learned from recent very large-scale disasters in the worldLessons learned from recent very large-scale disasters in the world
Lessons learned from recent very large-scale disasters in the worldGlobal Risk Forum GRFDavos
 
20170410 CENTRA2 meeting - AirBox
20170410 CENTRA2 meeting - AirBox20170410 CENTRA2 meeting - AirBox
20170410 CENTRA2 meeting - AirBoxLing-Jyh Chen
 
Common Ground: a policy framework for open access to research data
Common Ground: a  policy framework for open access to research dataCommon Ground: a  policy framework for open access to research data
Common Ground: a policy framework for open access to research dataLIBER Europe
 
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
(Em)Powering Science: High-Performance Infrastructure in Biomedical ScienceAri Berman
 
Advancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsAdvancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsTao Xie
 
Bioinformatics workflows and study design
Bioinformatics workflows and study designBioinformatics workflows and study design
Bioinformatics workflows and study designElanaFertig
 
Labx-Internship at Fermilab
Labx-Internship at FermilabLabx-Internship at Fermilab
Labx-Internship at FermilabAlfred John
 

La actualidad más candente (8)

Lessons learned from recent very large-scale disasters in the world
Lessons learned from recent very large-scale disasters in the worldLessons learned from recent very large-scale disasters in the world
Lessons learned from recent very large-scale disasters in the world
 
20170410 CENTRA2 meeting - AirBox
20170410 CENTRA2 meeting - AirBox20170410 CENTRA2 meeting - AirBox
20170410 CENTRA2 meeting - AirBox
 
Concept on e-Research
Concept on e-ResearchConcept on e-Research
Concept on e-Research
 
Common Ground: a policy framework for open access to research data
Common Ground: a  policy framework for open access to research dataCommon Ground: a  policy framework for open access to research data
Common Ground: a policy framework for open access to research data
 
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
(Em)Powering Science: High-Performance Infrastructure in Biomedical Science
 
Advancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software AnalyticsAdvancing Foundation and Practice of Software Analytics
Advancing Foundation and Practice of Software Analytics
 
Bioinformatics workflows and study design
Bioinformatics workflows and study designBioinformatics workflows and study design
Bioinformatics workflows and study design
 
Labx-Internship at Fermilab
Labx-Internship at FermilabLabx-Internship at Fermilab
Labx-Internship at Fermilab
 

Destacado

2016-09-03-saveMLAK ウィキチュートリアル
2016-09-03-saveMLAK ウィキチュートリアル2016-09-03-saveMLAK ウィキチュートリアル
2016-09-03-saveMLAK ウィキチュートリアルYuka Egusa
 
ArteMuse - Museums & the Web 2011
ArteMuse - Museums & the Web 2011ArteMuse - Museums & the Web 2011
ArteMuse - Museums & the Web 2011Consuelo Valdes
 
Anat 02 metabolismo power
Anat 02 metabolismo powerAnat 02 metabolismo power
Anat 02 metabolismo powerAna Molina
 
Liberact conference 2013 Gnome Surfer & Moclo Planner
Liberact conference 2013 Gnome Surfer & Moclo PlannerLiberact conference 2013 Gnome Surfer & Moclo Planner
Liberact conference 2013 Gnome Surfer & Moclo PlannerConsuelo Valdes
 
Typical characteristics of IT gradutes
Typical characteristics of IT gradutesTypical characteristics of IT gradutes
Typical characteristics of IT gradutesMohammad Salim
 
The role of it in education
The role of it in educationThe role of it in education
The role of it in educationMohammad Salim
 
BU - Wellesely iGEM 2011 World Finals
BU - Wellesely iGEM 2011 World FinalsBU - Wellesely iGEM 2011 World Finals
BU - Wellesely iGEM 2011 World FinalsConsuelo Valdes
 
Anat 01 introducción power
Anat 01 introducción powerAnat 01 introducción power
Anat 01 introducción powerAna Molina
 
2016-09-11-c4ljp2016-勉強会のすすめ
2016-09-11-c4ljp2016-勉強会のすすめ2016-09-11-c4ljp2016-勉強会のすすめ
2016-09-11-c4ljp2016-勉強会のすすめYuka Egusa
 
Introduction To Web Accessibility
Introduction To Web AccessibilityIntroduction To Web Accessibility
Introduction To Web AccessibilitySteven Swafford
 

Destacado (16)

2016-09-03-saveMLAK ウィキチュートリアル
2016-09-03-saveMLAK ウィキチュートリアル2016-09-03-saveMLAK ウィキチュートリアル
2016-09-03-saveMLAK ウィキチュートリアル
 
Blue team final pres
Blue team final presBlue team final pres
Blue team final pres
 
ArteMuse - Museums & the Web 2011
ArteMuse - Museums & the Web 2011ArteMuse - Museums & the Web 2011
ArteMuse - Museums & the Web 2011
 
Male Shopping Experience Final
Male Shopping Experience FinalMale Shopping Experience Final
Male Shopping Experience Final
 
What Makes A Developer
What Makes A DeveloperWhat Makes A Developer
What Makes A Developer
 
Anat 02 metabolismo power
Anat 02 metabolismo powerAnat 02 metabolismo power
Anat 02 metabolismo power
 
Liberact conference 2013 Gnome Surfer & Moclo Planner
Liberact conference 2013 Gnome Surfer & Moclo PlannerLiberact conference 2013 Gnome Surfer & Moclo Planner
Liberact conference 2013 Gnome Surfer & Moclo Planner
 
Typical characteristics of IT gradutes
Typical characteristics of IT gradutesTypical characteristics of IT gradutes
Typical characteristics of IT gradutes
 
The role of it in education
The role of it in educationThe role of it in education
The role of it in education
 
BU - Wellesely iGEM 2011 World Finals
BU - Wellesely iGEM 2011 World FinalsBU - Wellesely iGEM 2011 World Finals
BU - Wellesely iGEM 2011 World Finals
 
Male shop
Male shopMale shop
Male shop
 
Green Touch - ITS 12
Green Touch - ITS 12Green Touch - ITS 12
Green Touch - ITS 12
 
Anat 01 introducción power
Anat 01 introducción powerAnat 01 introducción power
Anat 01 introducción power
 
2016-09-11-c4ljp2016-勉強会のすすめ
2016-09-11-c4ljp2016-勉強会のすすめ2016-09-11-c4ljp2016-勉強会のすすめ
2016-09-11-c4ljp2016-勉強会のすすめ
 
SMPN 1 BDG_9-11
SMPN 1 BDG_9-11SMPN 1 BDG_9-11
SMPN 1 BDG_9-11
 
Introduction To Web Accessibility
Introduction To Web AccessibilityIntroduction To Web Accessibility
Introduction To Web Accessibility
 

Similar a Big Data and Tangibles - TEI 13

Share and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
Share and analyze geonomic data at scale by Andy Petrella and Xavier TordoirShare and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
Share and analyze geonomic data at scale by Andy Petrella and Xavier TordoirSpark Summit
 
Docker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce HoffDocker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce HoffDocker, Inc.
 
Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Hilmar Lapp
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...Carole Goble
 
openSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesopenSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesBastian Greshake
 
The biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveThe biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveVince Smith
 
Multi task learning stepping away from narrow expert models 7.11.18
Multi task learning stepping away from narrow expert models 7.11.18Multi task learning stepping away from narrow expert models 7.11.18
Multi task learning stepping away from narrow expert models 7.11.18Cloudera, Inc.
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)James Hendler
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...GigaScience, BGI Hong Kong
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityTERN Australia
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) CommonsJames Hendler
 
Spark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scaleSpark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scaleAndy Petrella
 
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016Jisc
 
2016 09 cxo forum
2016 09 cxo forum2016 09 cxo forum
2016 09 cxo forumChris Dwan
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...hsuleslie
 
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017Deborah McGuinness
 
Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince Smith
 

Similar a Big Data and Tangibles - TEI 13 (20)

Share and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
Share and analyze geonomic data at scale by Andy Petrella and Xavier TordoirShare and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
Share and analyze geonomic data at scale by Andy Petrella and Xavier Tordoir
 
Docker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce HoffDocker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce Hoff
 
Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?Open science, open-source, and open data: Collaboration as an emergent property?
Open science, open-source, and open data: Collaboration as an emergent property?
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
 
Sgci iwsg-a-10-10-16
Sgci iwsg-a-10-10-16Sgci iwsg-a-10-10-16
Sgci iwsg-a-10-10-16
 
openSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesopenSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association Studies
 
The biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspectiveThe biodiversity informatics landscape: a systematics perspective
The biodiversity informatics landscape: a systematics perspective
 
Multi task learning stepping away from narrow expert models 7.11.18
Multi task learning stepping away from narrow expert models 7.11.18Multi task learning stepping away from narrow expert models 7.11.18
Multi task learning stepping away from narrow expert models 7.11.18
 
Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)Tragedy of the Data Commons (ODSC-East, 2021)
Tragedy of the Data Commons (ODSC-East, 2021)
 
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
Scott Edmunds: GigaScience - a journal or a database? Lessons learned from th...
 
Australia's Environmental Predictive Capability
Australia's Environmental Predictive CapabilityAustralia's Environmental Predictive Capability
Australia's Environmental Predictive Capability
 
Tragedy of the (Data) Commons
Tragedy of the (Data) CommonsTragedy of the (Data) Commons
Tragedy of the (Data) Commons
 
Spark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scaleSpark Summit Europe: Share and analyse genomic data at scale
Spark Summit Europe: Share and analyse genomic data at scale
 
2015 04-18-wilson cg
2015 04-18-wilson cg2015 04-18-wilson cg
2015 04-18-wilson cg
 
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
The fourth paradigm: data intensive scientific discovery - Jisc Digifest 2016
 
2016 09 cxo forum
2016 09 cxo forum2016 09 cxo forum
2016 09 cxo forum
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
 
Öppen data och forskningens genomslag
Öppen data och forskningens genomslagÖppen data och forskningens genomslag
Öppen data och forskningens genomslag
 
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
Ontologies For the Modern Age - McGuinness' Keynote at ISWC 2017
 
Vince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notextVince smith-delivering biodiversity knowledge in the information age-notext
Vince smith-delivering biodiversity knowledge in the information age-notext
 

Último

The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 

Último (20)

The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 

Big Data and Tangibles - TEI 13

  • 1. From Big Data to Insights: Opportunities and Challenges for TEI in Genomics Orit Shaer, Ali Mazalek, Brygg Ullmer, Miriam K. Konkel
  • 2. Outline Introduction to genomics/motivation Design challenges Case studies Opportunities for TEI Going forward
  • 3. Genomics “While the work is a challenge, making genetics interactive is potentially as transformative as the move from batch processing to time sharing” -Bafna V. et al. Communications of the ACM Jan 2013
  • 4. Project flow: Genome Sequencing Project Sequencing Centers High- throughput Sequencing Draft Sequence Finished Sequence Sequence Archiving Genome Annotation DNA Sequence Protein Prediction Pathways Comparative Analysis Target Selection
  • 5. Schkolne, Ishii, and Schroder 2004. TEI for Scientists Gillet et al. 2005Brooks et al. 1990 Project GROPE Tabard, A., et. al 2011. eLabBench.
  • 7. Scale Filesystem @ Broad Inst.: 13+PB One run of an Illumina HiSeq 2500: 6 billion paired-end sequences (600 gigabases, or 120Gb/day) Thousand Genomes project: 692 collaborators 110 institutions >15 groups in (bi-)weekly conference calls Blue Waters cluster: >380K CPU cores + >3K GPUs
  • 9. Diverse Audience Genomic Scientists Citizen Scientist General Public Future Scientists
  • 10. How can TEI systems be designed to • Empower citizens to make informed health decisions? • Communicate scientific data to communities? • Enhance learning of complex concepts? • Support experts interacting with big data?
  • 12. Case Studies Tabletop Genome Browsing & Primer Design Tangible-targeted Computational Genomics Tangibles For Visualizing Systems Biology
  • 14.
  • 15. 48.4% 1.0%2.4% 46.6% 1.6% Human genome: understanding ca. 2012 Mobile elements Processed pseudogenes Tandem repeats & low complexity DNA Dark matter Protein & RNA coding regions Composition of other primate genomes is very similar Tangibles-targeted computational genomics
  • 16. Example projects: rhesus, orangutan, human, marmoset genomes • Often multi-institution, multi-person efforts – Above articles: ~250, 100 co-authors • Often long duration (e.g., 4-6 years before first publication) • Iterative fusion of computational and “wet bench” analyses • Some analyses “big CPU” (e.g., 200 cpu cores for weeks); others, “big RAM” (200+GB RAM)
  • 17. Tangible Visualization: persistent representations of people, projects, activities… Interactions 2012.07: Entangling space, form, light, time, computational STEAM, and cultural artifacts
  • 19.
  • 20.
  • 21. Lessons learned TEI can facilitate immediate, visible, and easily reversible manipulations • How to design TEI for open-ended creative inquiries? Tangible representations can facilitate multi-stage workflows • Important for execution and tracking of complex analyses • Need parametrized, annotatable representations of complex large datasets TEI could facilitate collaboration for distributed and co-located teams • Large interdisciplinary teams and distributed work are common in this area • Users can jointly manipulate assumptions and see consequences Tangible tools can support understanding and discovery • Provide access to different pieces of the problem (data, reactions) • Help users forms accurate mental models through tangible/embodied manipulation
  • 22. Opportunities for TEI Engagement Understanding Complex Problems Visualizing Biological Data Enabling Large Collaborations Supporting Diverse Audiences Managing Varied Timescales
  • 25. Managing Varied Timescales Powers of 10,000: • Milliseconds • Minutes • Months • Millenia Entangling Space, Form, Light, Time, Computational STEAM, and Cultural Artifacts Examples • Many genome projects: 5+ years • Sequencing Lincoln’s DNA: under active discussion since 1991 • Most of us sequenced within decade? materially impacting all our descendants
  • 26. Going forward • Some aspects w/ broad TEI, computational science synergies • How to visualize and engage data, activity, progress spanning many systems, people, places, timescales? • What representational forms, device ecologies, most appropriate for large, abstract data? • Facilitating engagement with big data in ways that highlight connections between multiple forms of evidence • Some aspects specific to genomics • 2023: anticipate most of us in room + many thousands of species having genomes fully or partially sequenced • Commonalities, distinctions in engagements by scientists, students, street people, senators, senior citizens, solicitors, …
  • 27. THANKS! Orit Shaer: oshaer@wellesley.edu Ali Mazalek: mazalek@gatech.edu Brygg Ullmer: ullmer@lsu.edu Miriam Konkel: konkel@lsu.edu Consuelo Valdes (Wellesley College) and Andy Wu (Georgia Tech). This work has been partially funded by NSF IIS-1017693, DRL- 097394084, and CNS-1126739.