SlideShare a Scribd company logo
1 of 17
Industry specific cover image




Hadoop in the Public Sector
Tom Plunkett
June 13, 2012
Tom Plunkett
• Big Data Technical Lead for Oracle Public Sector
• Led a Team that won a MapReduce Research
  Project from the Office of the Secretary of Defense
  in 2009
• Designed and taught a graduate level course on
  MapReduce for Virginia Tech’s Computer Science
  Department in 2010
• Author of a book on Oracle’s Big Data Products
  (publication date in 2013)



                     Copyright © 2012 Oracle. All rights reserved.
                                                                     2
Hadoop in the Public Sector

• Obama Administration Big Data Initiative
• U.S. Intelligence Agencies
• U.S. Department of Defense
• U.S. Civilian Departments and Agencies
• Other governmental entities




                  Copyright © 2012 Oracle. All rights reserved.
                                                                  3
Obama Administration’s Big Data Initiative

• Office of Science and
  Technology Policy
  announced on March
  29th, 2012 an additional
  $200 million per year in
  funding for Big Data
  Research
• Department of Defense
  currently spending over
  $250 million a year on
  Big Data Research



                      Copyright © 2012 Oracle. All rights reserved.
                                                                      4
Several Projects Received $25 Million a
Year from the Obama Big Data Initiative

• DARPA XDATA
• NSF and NIH Core Techniques for Advancing Big
  Data Science and Engineering
• NIH Thousand Genomes Project
• Department of Energy Scalable Data Management,
  Analysis and Visualization Institute
• US Geological Survey Big Data for Earth Science




                   Copyright © 2012 Oracle. All rights reserved.
                                                                   5
DARPA XDATA

• $25 million per year for four years for data
  processing and visualization research
• Developed Software Code will be licensed
  under Apache or similar license
• Proposals under the Broad Agency
  Announcement were due May 30
• http://www.darpa.mil/NewsEvents/Releases/
  2012/03/29.aspx


                   Copyright © 2012 Oracle. All rights reserved.
                                                                   6
NSF and NIH: Core Techniques
for Advancing Big Data
• $25 million available for funding research
• Up to $1 million per year for five years for
  data processing or visualization research;
  proposals are due June 13 at 5pm
• Up to $250k per year for three years for data
  processing or visualization research;
  proposals are due July 11 at 5pm
• http://www.nsf.gov/news/news_summ.jsp?c
  ntn_id=123607

                   Copyright © 2012 Oracle. All rights reserved.
                                                                   7
NIH 1000 Genomes Project
• 200 TB public
  dataset hosted on
  Amazon S3 for free
• Dataset access is
  free, Researchers
  pay for AWS
  computational
  resources
• http://www.nih.gov/n
  ews/health/mar2012/
  nhgri-29.htm
                  Copyright © 2012 Oracle. All rights reserved.
                                                                  8
Department of Energy Scalable Data
Management, Analysis and Visualization
Institute
• Received $25 million in funding for this year
• The SciDAC (Scientific Discovery through
  Advanced Computing) Institute of Scalable Data
  Management, Analysis and Visualization is funded
  by the DOE Office of Science through the Office of
  Advanced Scientific Computing Research.
• Research in Data Management, Data Analysis,
  Visualization, and Scientific Software Tools
• http://sdav-scidac.org/



                     Copyright © 2012 Oracle. All rights reserved.
                                                                     9
US Geological Survey




• The Powell Center of the US Geological Survey funds Big
  Data research for Earth System Science
• The Powell Center provides annual research awards in the
  area of Earth System Science
• Proposals for FY13 were due on April 30
• http://powellcenter.usgs.gov/

                       Copyright © 2012 Oracle. All rights reserved.
                                                                       10
U.S. Intelligence Agencies
• NSA publicly
  announced it
  was using
  Hadoop at a
  NDU
  conference
  on July 15,
  2009
• NSA donated
  Accumulo
  code in 2011

                 Copyright © 2012 Oracle. All rights reserved.
                                                                 11
U.S. Army Distributed Common
   Ground System (DCGS-A)
• Can predict
  likely IED sites
  based on
  logistic routes
  and past
  attacks.
• http://ctovision.
  com/2011/04/th
  e-cloud-and-
  physical-
  security/

                      Copyright © 2012 Oracle. All rights reserved.
                                                                      12
U.S. Naval Air V-22 Osprey
• US Naval Air Condition Based Maintenance (CBM+) for the
  V-22 Osprey
• http://www.acq.osd.mil/log/mpp/cbm+/Briefings/CBM+_and
  _CAMEO_Overview06Jul2011.pdf
• Photo source: wikipedia




                       Copyright © 2012 Oracle. All rights reserved.
                                                                       13
U.S. Civilian Departments and Agencies
• Department of Homeland Security (DHS)
• Department of Energy(DOE)
• Department of Veteran Affairs
• Health and Human Services (HHS)
• Food and Drug Administration (FDA)
• National Archives and Records Admin
• NASA
• National Institute of Health (NIH)
• National Science Foundation (NSF)
• http://www.whitehouse.gov/sites/default/files/microsi
 tes/ostp/big_data_fact_sheet_final.pdf


                     Copyright © 2012 Oracle. All rights reserved.
                                                                     14
Other Governmental entities

• State and Local governments are looking at
  Hadoop to solve problems involving Health,
  Public Safety, and Traffic
• Foreign Governmental agencies are using
  Hadoop in similar ways to their U.S.
  counterparts (intelligence agencies, etc.)
• Tennessee Valley Authority uses HDFS to
  store power utility information
• Many additional examples

                 Copyright © 2012 Oracle. All rights reserved.
                                                                 15
Questions?

• Obama Administration’s Big Data Initiative
• U.S. Intelligence Agencies
• U.S. Department of Defense
• U.S. Civilian Departments and Agencies
• Other governmental entities




                  Copyright © 2012 Oracle. All rights reserved.
                                                                  16
Oracle’s Big Data Platform



    Big Data                Database /                                      Business
                          Data Warehouse                                   Intelligence




                InfiniBand                                    InfiniBand



                                                                                    Real-Time
                                                                                    Decisions




  Acquire   Organize & Discover                               Analyze           Decide




                   Copyright © 2012 Oracle. All rights reserved.

More Related Content

What's hot

Large Scale Search, Discovery and Analytics in Action
Large Scale Search, Discovery and Analytics in ActionLarge Scale Search, Discovery and Analytics in Action
Large Scale Search, Discovery and Analytics in Action
Grant Ingersoll
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECAProject
 

What's hot (20)

Public Data and Data Mining Competitions - What are Lessons?
Public Data and Data Mining Competitions - What are Lessons?Public Data and Data Mining Competitions - What are Lessons?
Public Data and Data Mining Competitions - What are Lessons?
 
Massive-Scale Analytics Applied to Real-World Problems
Massive-Scale Analytics Applied to Real-World ProblemsMassive-Scale Analytics Applied to Real-World Problems
Massive-Scale Analytics Applied to Real-World Problems
 
Using Open Source Technologies to Spatially Enable Aceh
Using Open Source Technologies to Spatially Enable AcehUsing Open Source Technologies to Spatially Enable Aceh
Using Open Source Technologies to Spatially Enable Aceh
 
Department of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data DashboardsDepartment of Commerce App Challenge: Big Data Dashboards
Department of Commerce App Challenge: Big Data Dashboards
 
Analytics Education in the era of Big Data
Analytics Education in the era of Big DataAnalytics Education in the era of Big Data
Analytics Education in the era of Big Data
 
Introduction of Data Science
Introduction of Data ScienceIntroduction of Data Science
Introduction of Data Science
 
Big Data in small words
Big Data in small wordsBig Data in small words
Big Data in small words
 
#BigDataCanarias: "Big Data & Career Paths"
#BigDataCanarias: "Big Data & Career Paths"#BigDataCanarias: "Big Data & Career Paths"
#BigDataCanarias: "Big Data & Career Paths"
 
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
ExLibris National Library Meeting @ IFLA-Helsinki - Aug 15th 2012
 
Crowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
Crowdsourcing Approaches to Big Data Curation - Rio Big Data MeetupCrowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
Crowdsourcing Approaches to Big Data Curation - Rio Big Data Meetup
 
2013 bio it world
2013 bio it world2013 bio it world
2013 bio it world
 
Data science e machine learning
Data science e machine learningData science e machine learning
Data science e machine learning
 
SPONSORED WORKSHOP by Amplidata from Structure:Data 2012:
SPONSORED WORKSHOP by Amplidata from Structure:Data 2012:  SPONSORED WORKSHOP by Amplidata from Structure:Data 2012:
SPONSORED WORKSHOP by Amplidata from Structure:Data 2012:
 
The Evolution of Data Science
The Evolution of Data ScienceThe Evolution of Data Science
The Evolution of Data Science
 
Large Scale Search, Discovery and Analytics in Action
Large Scale Search, Discovery and Analytics in ActionLarge Scale Search, Discovery and Analytics in Action
Large Scale Search, Discovery and Analytics in Action
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
 
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
Towards Lightweight Cyber-Physical Energy Systems using Linked Data, the Web ...
 
New Trends and Directions in Data Science - MIT Information Quality Conferenc...
New Trends and Directions in Data Science - MIT Information Quality Conferenc...New Trends and Directions in Data Science - MIT Information Quality Conferenc...
New Trends and Directions in Data Science - MIT Information Quality Conferenc...
 
Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visuali...
Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visuali...Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visuali...
Maurice Bouwhuis (SARA/Vancis) - Hoe big data te begrijpen door ze te visuali...
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
 

Similar to Hadoop in Public Sector

Similar to Hadoop in Public Sector (20)

EPA OEI Linked Data Process
EPA OEI Linked Data ProcessEPA OEI Linked Data Process
EPA OEI Linked Data Process
 
MarkLogic Semantic use cases
MarkLogic Semantic use cases MarkLogic Semantic use cases
MarkLogic Semantic use cases
 
2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning Workshop2012 Fall Data Management Planning Workshop
2012 Fall Data Management Planning Workshop
 
Research, the Cloud, and the IRB
Research, the Cloud, and the IRBResearch, the Cloud, and the IRB
Research, the Cloud, and the IRB
 
GBIF BIFA mentoring, Day 5a Data management, July 2016
GBIF BIFA mentoring, Day 5a Data management, July 2016GBIF BIFA mentoring, Day 5a Data management, July 2016
GBIF BIFA mentoring, Day 5a Data management, July 2016
 
Big Data Content Organization, Discovery, and Management
Big Data Content Organization, Discovery, and ManagementBig Data Content Organization, Discovery, and Management
Big Data Content Organization, Discovery, and Management
 
Hadoop World 2011: The Hadoop Award for Government Excellence - Bob Gourley -...
Hadoop World 2011: The Hadoop Award for Government Excellence - Bob Gourley -...Hadoop World 2011: The Hadoop Award for Government Excellence - Bob Gourley -...
Hadoop World 2011: The Hadoop Award for Government Excellence - Bob Gourley -...
 
DataCyte - The Future of Data Storage & Retrieval
DataCyte - The Future of Data Storage & RetrievalDataCyte - The Future of Data Storage & Retrieval
DataCyte - The Future of Data Storage & Retrieval
 
Oracle fusion soa operations and configuration
Oracle fusion soa  operations and configurationOracle fusion soa  operations and configuration
Oracle fusion soa operations and configuration
 
Oracle fusion soa operations and configuration
Oracle fusion soa  operations and configurationOracle fusion soa  operations and configuration
Oracle fusion soa operations and configuration
 
Oracle fusion soa operations and configuration
Oracle fusion soa  operations and configurationOracle fusion soa  operations and configuration
Oracle fusion soa operations and configuration
 
Oracle fusion soa operations and configuration
Oracle fusion soa  operations and configurationOracle fusion soa  operations and configuration
Oracle fusion soa operations and configuration
 
Oracle fusion soa operations and configuration
Oracle fusion soa  operations and configurationOracle fusion soa  operations and configuration
Oracle fusion soa operations and configuration
 
Data.gov for KM Latin America
Data.gov for KM Latin AmericaData.gov for KM Latin America
Data.gov for KM Latin America
 
Ottawa NIEM SOA Open Data Event
Ottawa NIEM SOA Open Data EventOttawa NIEM SOA Open Data Event
Ottawa NIEM SOA Open Data Event
 
Big data oracle_introduccion
Big data oracle_introduccionBig data oracle_introduccion
Big data oracle_introduccion
 
LOD2 Webinar Series: Virtuoso 7
LOD2 Webinar Series: Virtuoso 7LOD2 Webinar Series: Virtuoso 7
LOD2 Webinar Series: Virtuoso 7
 
Geospatial Data Insfrastructures, Cybercartography and Open Data: The Need f...
Geospatial Data Insfrastructures, Cybercartography and Open Data:  The Need f...Geospatial Data Insfrastructures, Cybercartography and Open Data:  The Need f...
Geospatial Data Insfrastructures, Cybercartography and Open Data: The Need f...
 
Geospatial Data Insfrastructures, Cybercartography and Open Data: The Need f...
Geospatial Data Insfrastructures, Cybercartography and Open Data:  The Need f...Geospatial Data Insfrastructures, Cybercartography and Open Data:  The Need f...
Geospatial Data Insfrastructures, Cybercartography and Open Data: The Need f...
 
Big Data in Media
Big Data in MediaBig Data in Media
Big Data in Media
 

More from DataWorks Summit

HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
DataWorks Summit
 

More from DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 

Recently uploaded

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 

Hadoop in Public Sector

  • 1. Industry specific cover image Hadoop in the Public Sector Tom Plunkett June 13, 2012
  • 2. Tom Plunkett • Big Data Technical Lead for Oracle Public Sector • Led a Team that won a MapReduce Research Project from the Office of the Secretary of Defense in 2009 • Designed and taught a graduate level course on MapReduce for Virginia Tech’s Computer Science Department in 2010 • Author of a book on Oracle’s Big Data Products (publication date in 2013) Copyright © 2012 Oracle. All rights reserved. 2
  • 3. Hadoop in the Public Sector • Obama Administration Big Data Initiative • U.S. Intelligence Agencies • U.S. Department of Defense • U.S. Civilian Departments and Agencies • Other governmental entities Copyright © 2012 Oracle. All rights reserved. 3
  • 4. Obama Administration’s Big Data Initiative • Office of Science and Technology Policy announced on March 29th, 2012 an additional $200 million per year in funding for Big Data Research • Department of Defense currently spending over $250 million a year on Big Data Research Copyright © 2012 Oracle. All rights reserved. 4
  • 5. Several Projects Received $25 Million a Year from the Obama Big Data Initiative • DARPA XDATA • NSF and NIH Core Techniques for Advancing Big Data Science and Engineering • NIH Thousand Genomes Project • Department of Energy Scalable Data Management, Analysis and Visualization Institute • US Geological Survey Big Data for Earth Science Copyright © 2012 Oracle. All rights reserved. 5
  • 6. DARPA XDATA • $25 million per year for four years for data processing and visualization research • Developed Software Code will be licensed under Apache or similar license • Proposals under the Broad Agency Announcement were due May 30 • http://www.darpa.mil/NewsEvents/Releases/ 2012/03/29.aspx Copyright © 2012 Oracle. All rights reserved. 6
  • 7. NSF and NIH: Core Techniques for Advancing Big Data • $25 million available for funding research • Up to $1 million per year for five years for data processing or visualization research; proposals are due June 13 at 5pm • Up to $250k per year for three years for data processing or visualization research; proposals are due July 11 at 5pm • http://www.nsf.gov/news/news_summ.jsp?c ntn_id=123607 Copyright © 2012 Oracle. All rights reserved. 7
  • 8. NIH 1000 Genomes Project • 200 TB public dataset hosted on Amazon S3 for free • Dataset access is free, Researchers pay for AWS computational resources • http://www.nih.gov/n ews/health/mar2012/ nhgri-29.htm Copyright © 2012 Oracle. All rights reserved. 8
  • 9. Department of Energy Scalable Data Management, Analysis and Visualization Institute • Received $25 million in funding for this year • The SciDAC (Scientific Discovery through Advanced Computing) Institute of Scalable Data Management, Analysis and Visualization is funded by the DOE Office of Science through the Office of Advanced Scientific Computing Research. • Research in Data Management, Data Analysis, Visualization, and Scientific Software Tools • http://sdav-scidac.org/ Copyright © 2012 Oracle. All rights reserved. 9
  • 10. US Geological Survey • The Powell Center of the US Geological Survey funds Big Data research for Earth System Science • The Powell Center provides annual research awards in the area of Earth System Science • Proposals for FY13 were due on April 30 • http://powellcenter.usgs.gov/ Copyright © 2012 Oracle. All rights reserved. 10
  • 11. U.S. Intelligence Agencies • NSA publicly announced it was using Hadoop at a NDU conference on July 15, 2009 • NSA donated Accumulo code in 2011 Copyright © 2012 Oracle. All rights reserved. 11
  • 12. U.S. Army Distributed Common Ground System (DCGS-A) • Can predict likely IED sites based on logistic routes and past attacks. • http://ctovision. com/2011/04/th e-cloud-and- physical- security/ Copyright © 2012 Oracle. All rights reserved. 12
  • 13. U.S. Naval Air V-22 Osprey • US Naval Air Condition Based Maintenance (CBM+) for the V-22 Osprey • http://www.acq.osd.mil/log/mpp/cbm+/Briefings/CBM+_and _CAMEO_Overview06Jul2011.pdf • Photo source: wikipedia Copyright © 2012 Oracle. All rights reserved. 13
  • 14. U.S. Civilian Departments and Agencies • Department of Homeland Security (DHS) • Department of Energy(DOE) • Department of Veteran Affairs • Health and Human Services (HHS) • Food and Drug Administration (FDA) • National Archives and Records Admin • NASA • National Institute of Health (NIH) • National Science Foundation (NSF) • http://www.whitehouse.gov/sites/default/files/microsi tes/ostp/big_data_fact_sheet_final.pdf Copyright © 2012 Oracle. All rights reserved. 14
  • 15. Other Governmental entities • State and Local governments are looking at Hadoop to solve problems involving Health, Public Safety, and Traffic • Foreign Governmental agencies are using Hadoop in similar ways to their U.S. counterparts (intelligence agencies, etc.) • Tennessee Valley Authority uses HDFS to store power utility information • Many additional examples Copyright © 2012 Oracle. All rights reserved. 15
  • 16. Questions? • Obama Administration’s Big Data Initiative • U.S. Intelligence Agencies • U.S. Department of Defense • U.S. Civilian Departments and Agencies • Other governmental entities Copyright © 2012 Oracle. All rights reserved. 16
  • 17. Oracle’s Big Data Platform Big Data Database / Business Data Warehouse Intelligence InfiniBand InfiniBand Real-Time Decisions Acquire Organize & Discover Analyze Decide Copyright © 2012 Oracle. All rights reserved.