SlideShare una empresa de Scribd logo
1 de 45
Descargar para leer sin conexión
Copyright © 2014 Intridea Inc. All rights reserved.
Visualizing Big Data on Small Devices
Tom Zeng
Director of Engineering
tom@intridea.com
@tomzeng
www.linkedin.com/in/tomzeng
Copyright © 2014 Intridea Inc. All rights reserved.
Agenda
Introduction
Front End - HTML5/Bootstrap, Backbone/CoffeeScript, D3, MapBox
Backend - Rails, MongoDB
Big Data Processing - Hadoop, Hive, Pig
Showcase - Mobile and Data Visualization Related Projects
Q & A
Copyright © 2014 Intridea Inc. All rights reserved.
Intridea - Rails, UX/Data Visualization, Mobile, Big Data, e-commerce
American Bible Society (ABS http://www.americanbible.org/) - partners with Bible publishers
· Provides API access to 539 Bible versions in 242 languages
· The usage of the APIs is tracked at the verse level, along with ip location, timestamp, and duration
· 530 million view logs/year(’12-’13 data), 1.5 mil/day, each view log has packed about 12 bible views
· Amounts to 5-6 billion Bible views each year
ABS asked Intridea to build the dashboard app Scripture Analytics (http://www.scriptureanalytics.com)
Introduction
Copyright © 2014 Intridea Inc. All rights reserved.
BY WORKING REMOTELY
9,816 Hours Saved Annually
ACROSS THE US & OVERSEAS
30+ Employees
FOUNDED & STARTED IN 2007
Washington D.C.
We Make
! 🌎 #
ON GITHUB
Open Source Software
$
Copyright © 2014 Intridea Inc. All rights reserved.
Major Open Source Contributions
OmniAuth is a flexible authentication
system utilizing Rack middleware.
OmniAuth
An opinionated micro-framework for
creating REST-like APIs in Ruby.
Grape
Hashie is a simple collection of useful
Hash extensions.
Hashie
A Ruby wrapper for the OAuth 2.0
protocol.
oauth2
A symbol font that makes it easy to
create a map of the U.S. with HTML/CSS.
Stately
A generic swappable back-end for JSON
handling.
Multi_JSON
Mission control dashboard for your
distributed teams.
Houston
github.com/intridea
Copyright © 2014 Intridea Inc. All rights reserved.
INTRIDEA
Simplified and modernized application
experience for ADP
Application engineering for
BusinessWeek.com
World’s first 100% web based Point of
Sale system
Copyright © 2014 Intridea Inc. All rights reserved.
Where people go to make a difference
with their investment capital.
Simplifying ADP’s core business:
Payroll
Engineering for the most-trafficked
wedding planning solution
Copyright © 2014 Intridea Inc. All rights reserved.
Copyright © 2014 Intridea Inc. All rights reserved.
ABS Scripture Analytics Query Requirements
Visualizations
Copyright © 2014 Intridea Inc. All rights reserved.
Public and private dashboards
visualizing Bible reading
across the Internet
Up to the minute dashboards showing what Bible
verses are being read when, and where, all over
the globe.
Copyright © 2014 Intridea Inc. All rights reserved.
Mobile
Desktop
Tablet
Responsive Web App
www.scriptureanalytics.com
Copyright © 2014 Intridea Inc. All rights reserved.
Front End User Interface
Single Page Application using Backbone.js
CoffeeScript (Ruby like, Jasmine in CoffeeScript similar to RSpec)

D3 for Data Visualization
Twitter Bootstrap for Responsive UI
Packery for Responsive Layout - http://packery.metafizzy.co/
Mapbox for Map Rendering - https://www.mapbox.com/
Copyright © 2014 Intridea Inc. All rights reserved.
Backend Servers/Services
Rails on Ruby Application mostly as the API server
MongoDB as the data store/cache
Mongoid for Active Record like queries
MongoDB Aggregation Framework for complex queries
Pulling data periodically from S3 to populate the Mongo database
Local R&D Hadoop and Mongo clusters for data exploration
Copyright © 2014 Intridea Inc. All rights reserved.
MongoDB
Document oriented, schema free, JSON format
Very high data read and write throughput
Rich query capabilities (aggregation framework), flexible indexes
Scale with auto-sharded replica sets
Map/Reduce in JavaScript
Copyright © 2014 Intridea Inc. All rights reserved.
Hadoop/Pig/Hive/Impala
Hadoop cluster (AWS Elastic Map/Reduce on-demand) to process and store data in S3
Pig to parse, transform, geo-code data
Hive to query data and generate aggregated JSON reports
Impala is similar to Hive (but much fast than the older version of Hive), used for ETL
!
Copyright © 2014 Intridea Inc. All rights reserved.
Elastic Map/Reduce Hadoop Cluster - On Demand Processing
Copyright © 2014 Intridea Inc. All rights reserved.
Elastic Map/Reduce Hadoop Cluster - Terminated when done
Copyright © 2014 Intridea Inc. All rights reserved.
Cloudera CDH4 - on local 10-node cluster
Copyright © 2014 Intridea Inc. All rights reserved.
Cloudera CDH4 - Streaming Data into Hive Table
Copyright © 2014 Intridea Inc. All rights reserved.
Pig Sample Query
Copyright © 2014 Intridea Inc. All rights reserved.
Hive Sample Query
Copyright © 2014 Intridea Inc. All rights reserved.
Hive Query Results - Bible views by City
Copyright © 2014 Intridea Inc. All rights reserved.
Hive Query Results - Most popular verse before Mother’s day
28
Her children show their appreciation, and her husband praises her.
Copyright © 2014 Intridea Inc. All rights reserved.
Hive Query Results - Most popular verse on Mother’s day
28
Her children show their appreciation, and her husband praises her.
Copyright © 2014 Intridea Inc. All rights reserved.
Hive Query Results - Most popular verse after Mother’s day
28
Her children show their appreciation, and her husband praises her.
Copyright © 2014 Intridea Inc. All rights reserved.
MongoDB Aggregation Framework Example
http://docs.mongodb.org/manual/core/aggregation-pipeline/
Copyright © 2014 Intridea Inc. All rights reserved.
MongoDB Aggregation Framework Example
Copyright © 2014 Intridea Inc. All rights reserved.
ABS Data Processing using Hadoop and MongoDB
Copyright © 2014 Intridea Inc. All rights reserved.
Analyzing Twitter using Hadoop and MongoDB
Copyright © 2014 Intridea Inc. All rights reserved.
Mobile and Data Visualization Project Showcase
ADP
BLiNQ
PEW Templeton - Global Religious Futures
Cato Institute - HumanProgress
!
!
Copyright © 2014 Intridea Inc. All rights reserved.
Redefining ADP’s
touch and desktop
experiences
ADP processes one out of every six
paychecks in the United States. We’re
bringing payroll into the decade of touch.
Copyright © 2014 Intridea Inc. All rights reserved.
ADP TLM
Copyright © 2014 Intridea Inc. All rights reserved.
ADP HCR
Copyright © 2014 Intridea Inc. All rights reserved.
INTRIDEA
Copyright © 2014 Intridea Inc. All rights reserved.
Dashboard insights for 600 of
the world’s largest advertisers
Analytics, planning, and flight management for social
advertising campaigns and brand engagement.
Copyright © 2014 Intridea Inc. All rights reserved.
INTRIDEA
INTRIDEA
Copyright © 2014 Intridea Inc. All rights reserved.
INTRIDEA
INTRIDEA
Copyright © 2014 Intridea Inc. All rights reserved.
INTRIDEA
INTRIDEA
Copyright © 2014 Intridea Inc. All rights reserved.
Analyzing religious
change and its impact
on societies around the
world
Interactive website exploring the patterns
and trends in religions across the globe
Copyright © 2014 Intridea Inc. All rights reserved.
PEW Global Research
PEW Global Research
Copyright © 2014 Intridea Inc. All rights reserved.
PEW Global Research
PEW Global Research
Copyright © 2014 Intridea Inc. All rights reserved.
PEW Global Research
PEW Global Research
Copyright © 2014 Intridea Inc. All rights reserved.
Human advancement to
a higher stage
Human Progress seeks to document changes in
living standards in the past and present while
explaining and exploring the best ways to improve
conditions for people.
Copyright © 2014 Intridea Inc. All rights reserved.
INTRIDEA
humanprogress.org
Gracias
Merci 
ありがとう
Danke 
谢谢
Thank You
Copyright © 2014 Intridea Inc. All rights reserved.
Tom Zeng
Director of Engineering
tom@intridea.com
@tomzeng
www.linkedin.com/in/tomzeng

Más contenido relacionado

Similar a RubyNation Visualizing Big Data on Small Devices

Growth hacking in the age of Data
Growth hacking in the age of DataGrowth hacking in the age of Data
Growth hacking in the age of DataDaniel Saito
 
2014 Big_Data_Forum_Pivotal
2014 Big_Data_Forum_Pivotal2014 Big_Data_Forum_Pivotal
2014 Big_Data_Forum_PivotalCOMPUTEX TAIPEI
 
Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry  Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry Persontyle
 
Data Modeling in the API Economy
Data Modeling in the API EconomyData Modeling in the API Economy
Data Modeling in the API EconomyTed Epstein
 
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REXHadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REXModern Data Stack France
 
From Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessFrom Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessNG DATA
 
Rediscover Software Development Edward Hieatt Web Summit 2014
Rediscover Software Development Edward Hieatt Web Summit 2014Rediscover Software Development Edward Hieatt Web Summit 2014
Rediscover Software Development Edward Hieatt Web Summit 2014VMware Tanzu
 
DevOps, CD and [Data] Microservices
DevOps, CD and [Data] MicroservicesDevOps, CD and [Data] Microservices
DevOps, CD and [Data] MicroservicesFred Melo
 
OpenStackDC and Cloud Foundry Meetup -
OpenStackDC and Cloud Foundry Meetup -OpenStackDC and Cloud Foundry Meetup -
OpenStackDC and Cloud Foundry Meetup -Joshua McKenty
 
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1Hortonworks
 
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder HortonworksThe Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder HortonworksData Con LA
 
Spark and Hadoop Perfect Togeher by Arun Murthy
Spark and Hadoop Perfect Togeher by Arun MurthySpark and Hadoop Perfect Togeher by Arun Murthy
Spark and Hadoop Perfect Togeher by Arun MurthySpark Summit
 
Spark Summit EMEA - Arun Murthy's Keynote
Spark Summit EMEA - Arun Murthy's KeynoteSpark Summit EMEA - Arun Murthy's Keynote
Spark Summit EMEA - Arun Murthy's KeynoteHortonworks
 
Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges"
Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges" Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges"
Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges" Dataconomy Media
 
Enterprise Apache Hadoop: State of the Union
Enterprise Apache Hadoop: State of the UnionEnterprise Apache Hadoop: State of the Union
Enterprise Apache Hadoop: State of the UnionHortonworks
 
Top 10 Open Source Technologies In 2018 | Trending Technologies 2018 | Edureka
Top 10 Open Source Technologies In 2018 | Trending Technologies 2018 | EdurekaTop 10 Open Source Technologies In 2018 | Trending Technologies 2018 | Edureka
Top 10 Open Source Technologies In 2018 | Trending Technologies 2018 | EdurekaEdureka!
 
Data Science Perspective and DS demo
Data Science Perspective and DS demo Data Science Perspective and DS demo
Data Science Perspective and DS demo PivotalOpenSourceHub
 
Webinar - How SpagoBI 5 faces Big Data challenges to generate new business op...
Webinar - How SpagoBI 5 faces Big Data challenges to generate new business op...Webinar - How SpagoBI 5 faces Big Data challenges to generate new business op...
Webinar - How SpagoBI 5 faces Big Data challenges to generate new business op...SpagoWorld
 
Big Data Use Cases | Hadoop Tutorial for Beginners | Hadoop Training | Edureka
Big Data Use Cases | Hadoop Tutorial for Beginners | Hadoop Training | EdurekaBig Data Use Cases | Hadoop Tutorial for Beginners | Hadoop Training | Edureka
Big Data Use Cases | Hadoop Tutorial for Beginners | Hadoop Training | EdurekaEdureka!
 

Similar a RubyNation Visualizing Big Data on Small Devices (20)

Growth hacking in the age of Data
Growth hacking in the age of DataGrowth hacking in the age of Data
Growth hacking in the age of Data
 
2014 Big_Data_Forum_Pivotal
2014 Big_Data_Forum_Pivotal2014 Big_Data_Forum_Pivotal
2014 Big_Data_Forum_Pivotal
 
Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry  Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry
 
Data Modeling in the API Economy
Data Modeling in the API EconomyData Modeling in the API Economy
Data Modeling in the API Economy
 
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REXHadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
Hadoop User Group 29Jan2015 Apache Flink / Haven / CapGemnini REX
 
From Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your businessFrom Data to Data Driven - Applications that will change your business
From Data to Data Driven - Applications that will change your business
 
Big Data
Big DataBig Data
Big Data
 
Rediscover Software Development Edward Hieatt Web Summit 2014
Rediscover Software Development Edward Hieatt Web Summit 2014Rediscover Software Development Edward Hieatt Web Summit 2014
Rediscover Software Development Edward Hieatt Web Summit 2014
 
DevOps, CD and [Data] Microservices
DevOps, CD and [Data] MicroservicesDevOps, CD and [Data] Microservices
DevOps, CD and [Data] Microservices
 
OpenStackDC and Cloud Foundry Meetup -
OpenStackDC and Cloud Foundry Meetup -OpenStackDC and Cloud Foundry Meetup -
OpenStackDC and Cloud Foundry Meetup -
 
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
 
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder HortonworksThe Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
 
Spark and Hadoop Perfect Togeher by Arun Murthy
Spark and Hadoop Perfect Togeher by Arun MurthySpark and Hadoop Perfect Togeher by Arun Murthy
Spark and Hadoop Perfect Togeher by Arun Murthy
 
Spark Summit EMEA - Arun Murthy's Keynote
Spark Summit EMEA - Arun Murthy's KeynoteSpark Summit EMEA - Arun Murthy's Keynote
Spark Summit EMEA - Arun Murthy's Keynote
 
Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges"
Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges" Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges"
Moustafa Soliman "HP Vertica- Solving Facebook Big Data challenges"
 
Enterprise Apache Hadoop: State of the Union
Enterprise Apache Hadoop: State of the UnionEnterprise Apache Hadoop: State of the Union
Enterprise Apache Hadoop: State of the Union
 
Top 10 Open Source Technologies In 2018 | Trending Technologies 2018 | Edureka
Top 10 Open Source Technologies In 2018 | Trending Technologies 2018 | EdurekaTop 10 Open Source Technologies In 2018 | Trending Technologies 2018 | Edureka
Top 10 Open Source Technologies In 2018 | Trending Technologies 2018 | Edureka
 
Data Science Perspective and DS demo
Data Science Perspective and DS demo Data Science Perspective and DS demo
Data Science Perspective and DS demo
 
Webinar - How SpagoBI 5 faces Big Data challenges to generate new business op...
Webinar - How SpagoBI 5 faces Big Data challenges to generate new business op...Webinar - How SpagoBI 5 faces Big Data challenges to generate new business op...
Webinar - How SpagoBI 5 faces Big Data challenges to generate new business op...
 
Big Data Use Cases | Hadoop Tutorial for Beginners | Hadoop Training | Edureka
Big Data Use Cases | Hadoop Tutorial for Beginners | Hadoop Training | EdurekaBig Data Use Cases | Hadoop Tutorial for Beginners | Hadoop Training | Edureka
Big Data Use Cases | Hadoop Tutorial for Beginners | Hadoop Training | Edureka
 

Último

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 

Último (20)

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 

RubyNation Visualizing Big Data on Small Devices

  • 1. Copyright © 2014 Intridea Inc. All rights reserved. Visualizing Big Data on Small Devices Tom Zeng Director of Engineering tom@intridea.com @tomzeng www.linkedin.com/in/tomzeng
  • 2. Copyright © 2014 Intridea Inc. All rights reserved. Agenda Introduction Front End - HTML5/Bootstrap, Backbone/CoffeeScript, D3, MapBox Backend - Rails, MongoDB Big Data Processing - Hadoop, Hive, Pig Showcase - Mobile and Data Visualization Related Projects Q & A
  • 3. Copyright © 2014 Intridea Inc. All rights reserved. Intridea - Rails, UX/Data Visualization, Mobile, Big Data, e-commerce American Bible Society (ABS http://www.americanbible.org/) - partners with Bible publishers · Provides API access to 539 Bible versions in 242 languages · The usage of the APIs is tracked at the verse level, along with ip location, timestamp, and duration · 530 million view logs/year(’12-’13 data), 1.5 mil/day, each view log has packed about 12 bible views · Amounts to 5-6 billion Bible views each year ABS asked Intridea to build the dashboard app Scripture Analytics (http://www.scriptureanalytics.com) Introduction
  • 4. Copyright © 2014 Intridea Inc. All rights reserved. BY WORKING REMOTELY 9,816 Hours Saved Annually ACROSS THE US & OVERSEAS 30+ Employees FOUNDED & STARTED IN 2007 Washington D.C. We Make ! 🌎 # ON GITHUB Open Source Software $
  • 5. Copyright © 2014 Intridea Inc. All rights reserved. Major Open Source Contributions OmniAuth is a flexible authentication system utilizing Rack middleware. OmniAuth An opinionated micro-framework for creating REST-like APIs in Ruby. Grape Hashie is a simple collection of useful Hash extensions. Hashie A Ruby wrapper for the OAuth 2.0 protocol. oauth2 A symbol font that makes it easy to create a map of the U.S. with HTML/CSS. Stately A generic swappable back-end for JSON handling. Multi_JSON Mission control dashboard for your distributed teams. Houston github.com/intridea
  • 6. Copyright © 2014 Intridea Inc. All rights reserved. INTRIDEA Simplified and modernized application experience for ADP Application engineering for BusinessWeek.com World’s first 100% web based Point of Sale system
  • 7. Copyright © 2014 Intridea Inc. All rights reserved. Where people go to make a difference with their investment capital. Simplifying ADP’s core business: Payroll Engineering for the most-trafficked wedding planning solution
  • 8. Copyright © 2014 Intridea Inc. All rights reserved.
  • 9. Copyright © 2014 Intridea Inc. All rights reserved. ABS Scripture Analytics Query Requirements Visualizations
  • 10. Copyright © 2014 Intridea Inc. All rights reserved. Public and private dashboards visualizing Bible reading across the Internet Up to the minute dashboards showing what Bible verses are being read when, and where, all over the globe.
  • 11. Copyright © 2014 Intridea Inc. All rights reserved. Mobile Desktop Tablet Responsive Web App www.scriptureanalytics.com
  • 12. Copyright © 2014 Intridea Inc. All rights reserved. Front End User Interface Single Page Application using Backbone.js CoffeeScript (Ruby like, Jasmine in CoffeeScript similar to RSpec)
 D3 for Data Visualization Twitter Bootstrap for Responsive UI Packery for Responsive Layout - http://packery.metafizzy.co/ Mapbox for Map Rendering - https://www.mapbox.com/
  • 13. Copyright © 2014 Intridea Inc. All rights reserved. Backend Servers/Services Rails on Ruby Application mostly as the API server MongoDB as the data store/cache Mongoid for Active Record like queries MongoDB Aggregation Framework for complex queries Pulling data periodically from S3 to populate the Mongo database Local R&D Hadoop and Mongo clusters for data exploration
  • 14. Copyright © 2014 Intridea Inc. All rights reserved. MongoDB Document oriented, schema free, JSON format Very high data read and write throughput Rich query capabilities (aggregation framework), flexible indexes Scale with auto-sharded replica sets Map/Reduce in JavaScript
  • 15. Copyright © 2014 Intridea Inc. All rights reserved. Hadoop/Pig/Hive/Impala Hadoop cluster (AWS Elastic Map/Reduce on-demand) to process and store data in S3 Pig to parse, transform, geo-code data Hive to query data and generate aggregated JSON reports Impala is similar to Hive (but much fast than the older version of Hive), used for ETL !
  • 16. Copyright © 2014 Intridea Inc. All rights reserved. Elastic Map/Reduce Hadoop Cluster - On Demand Processing
  • 17. Copyright © 2014 Intridea Inc. All rights reserved. Elastic Map/Reduce Hadoop Cluster - Terminated when done
  • 18. Copyright © 2014 Intridea Inc. All rights reserved. Cloudera CDH4 - on local 10-node cluster
  • 19. Copyright © 2014 Intridea Inc. All rights reserved. Cloudera CDH4 - Streaming Data into Hive Table
  • 20. Copyright © 2014 Intridea Inc. All rights reserved. Pig Sample Query
  • 21. Copyright © 2014 Intridea Inc. All rights reserved. Hive Sample Query
  • 22. Copyright © 2014 Intridea Inc. All rights reserved. Hive Query Results - Bible views by City
  • 23. Copyright © 2014 Intridea Inc. All rights reserved. Hive Query Results - Most popular verse before Mother’s day 28 Her children show their appreciation, and her husband praises her.
  • 24. Copyright © 2014 Intridea Inc. All rights reserved. Hive Query Results - Most popular verse on Mother’s day 28 Her children show their appreciation, and her husband praises her.
  • 25. Copyright © 2014 Intridea Inc. All rights reserved. Hive Query Results - Most popular verse after Mother’s day 28 Her children show their appreciation, and her husband praises her.
  • 26. Copyright © 2014 Intridea Inc. All rights reserved. MongoDB Aggregation Framework Example http://docs.mongodb.org/manual/core/aggregation-pipeline/
  • 27. Copyright © 2014 Intridea Inc. All rights reserved. MongoDB Aggregation Framework Example
  • 28. Copyright © 2014 Intridea Inc. All rights reserved. ABS Data Processing using Hadoop and MongoDB
  • 29. Copyright © 2014 Intridea Inc. All rights reserved. Analyzing Twitter using Hadoop and MongoDB
  • 30. Copyright © 2014 Intridea Inc. All rights reserved. Mobile and Data Visualization Project Showcase ADP BLiNQ PEW Templeton - Global Religious Futures Cato Institute - HumanProgress ! !
  • 31. Copyright © 2014 Intridea Inc. All rights reserved. Redefining ADP’s touch and desktop experiences ADP processes one out of every six paychecks in the United States. We’re bringing payroll into the decade of touch.
  • 32. Copyright © 2014 Intridea Inc. All rights reserved. ADP TLM
  • 33. Copyright © 2014 Intridea Inc. All rights reserved. ADP HCR
  • 34. Copyright © 2014 Intridea Inc. All rights reserved. INTRIDEA
  • 35. Copyright © 2014 Intridea Inc. All rights reserved. Dashboard insights for 600 of the world’s largest advertisers Analytics, planning, and flight management for social advertising campaigns and brand engagement.
  • 36. Copyright © 2014 Intridea Inc. All rights reserved. INTRIDEA INTRIDEA
  • 37. Copyright © 2014 Intridea Inc. All rights reserved. INTRIDEA INTRIDEA
  • 38. Copyright © 2014 Intridea Inc. All rights reserved. INTRIDEA INTRIDEA
  • 39. Copyright © 2014 Intridea Inc. All rights reserved. Analyzing religious change and its impact on societies around the world Interactive website exploring the patterns and trends in religions across the globe
  • 40. Copyright © 2014 Intridea Inc. All rights reserved. PEW Global Research PEW Global Research
  • 41. Copyright © 2014 Intridea Inc. All rights reserved. PEW Global Research PEW Global Research
  • 42. Copyright © 2014 Intridea Inc. All rights reserved. PEW Global Research PEW Global Research
  • 43. Copyright © 2014 Intridea Inc. All rights reserved. Human advancement to a higher stage Human Progress seeks to document changes in living standards in the past and present while explaining and exploring the best ways to improve conditions for people.
  • 44. Copyright © 2014 Intridea Inc. All rights reserved. INTRIDEA humanprogress.org
  • 45. Gracias Merci ありがとう Danke 谢谢 Thank You Copyright © 2014 Intridea Inc. All rights reserved. Tom Zeng Director of Engineering tom@intridea.com @tomzeng www.linkedin.com/in/tomzeng