SlideShare a Scribd company logo
1 of 46
Download to read offline
Big Data Considerations
Geospatial Intelligence Middle East,
May 2013

Steven Ramage
Head of Ordnance Survey International
Geospatial Intelligence Middle East 2013
Recently the Military GIS and Intelligence communities have
gained a better understanding of the incredible increase of “Cloud”
empowered applications, the challenges and opportunities of Big
Data, the importance of social media, the availability of improved
applications, and the dramatic improvement in quality and
availability of remote sensing data. This, and the increased speed
of GIS applications and the integration of a full-motion video
analysis product, empowers military forces and national security
agencies to exploit and analyze full motion video from UAVs and
other airborne vehicles.
http://tinyurl.com/cd8z6y5
http://www.computerweekly.com/feature/OrdnanceSurvey-gets-to-grips-with-geospatial-big-data
“ Ordnance Survey has all but completed a five-year IT
improvement programme to enhance its operations. That
programme – with Oracle as the main IT partner – has already
transformed those operations into an enterprise grid computing
system that pulls 17 databases into one Oracle spatial
database management platform. The platform supports all
geospatial data types and models. The system combines open
source Linux with Oracle’s grid computing architecture, which
makes it possible to coordinate large numbers of low-cost servers
and corresponding storage so they operate like one large
computer. ”
Big Data challenges
•

Big thinking (value)

•

Big strategy (necessary)

•

Big governance (stewardship)

•

Big access (sharing)

•

Big cooperation (supply chain)

•

Big privacy (security)

•

Big quality (QA/QC)

•

Big people (skills training)
What is big data?

Shutterstock
http://hortonworks.com/blog/big-data-defined/
April 4th, 2013 Russell Jurney

• Wikipedia defines as problems posed by the awkwardness of
legacy tools in supporting massive datasets: what is a massive
dataset? Megabytes  Yottabytes.
• Collection of data sets so large and complex that it becomes
difficult to process using on-hand database management tools
or traditional data processing applications.
• There is a ‘Big Data’ opportunity: transformative economics.
Big Data is the opportunity space created by new open source,
distributed systems from the consumer internet space.
The big data environment
• Volume
• Data at rest; levels increasing
• Velocity
• Data in motion; speed at which it
transits enterprises and entire
industries is faster than ever
• Variety
• Data in many forms; hundreds of
millions of web pages, emails and
unstructured data, such as Word documents and
PDFs, as well as a nearly infinite number of events
and information from every enterprise data centres
• Value
• Do you need it?
Facebook now has 50 billion photographs
The big data environment
• It uses local storage to be fast but inexpensive
• It uses clusters of commodity hardware to be inexpensive
• It uses free software to be inexpensive
• It is open source to build from community learning
• Cheap storage means logging enormous volumes of data to
many disks is easy. Processing this data is less so. Distributed
systems which have the above four properties are disruptive
because they are approximately 100 times cheaper than
other systems for processing large volumes of data, and
because they deliver high I/O performance.
The big data environment
• Apache Hadoop is one such system. Hadoop ties together a
cluster of commodity machines with local storage using free and
open source software to store and process vast amounts of data
at a fraction of the cost of other systems [Example: Esri/spatialframework-for-hadoop, GitHub: social network for programmers]
• SAN Storage $2-10/GB  Local Storage $0.05/GB
The big data environment
• Capture every shred of data in the cheapest place possible
• Provide access to this data across the organization
• Mine the data for value
• “To undergo the transformative processes that unabridged
access to data provides, enabling bigger, better, faster more
profound insight than ever before”. Blogger
Most data isn’t big and businesses are wasting
money pretending it is:
www.qz.com/81661/most-data
• How many of us need to undertake operations that rank every
web page that exists?
• What processing tasks cannot be handled on a single computer
or even a laptop? [Megabyte to Gigabyte range]
• Weren’t you doing data analysis before data became big?

• Do you have the requirement or capability to check
correlations or patterns that you can act on if you have
even more data?
• False positives. Vincent Granville wrote ‘The curse of big data’,
even if a dataset includes 1000 items there are many millions of
correlations, a few will be extremely high just by chance.
• Getting more into the field of data science (stats, quality, etc.)
Mapping the global Twitter heartbeat:
the geography of Twitter
http://firstmonday.org/ojs/index.php/fm/article/vi
ew/4366/3654
• In 2012, supercomputing manufacturer Silicon Graphics
International (SGI), the University of Illinois and social media
data vendor GNIP collaborated to create the “Global Twitter
Heartbeat” project (http://www.sgi.com/go/twitter) in order to
map global emotion expressed on Twitter in real-time.
• GNIP provided access to the Twitter Decahose, which consists
of 10 percent of all tweets sent globally each day.
• SGI provided access to one of its new UV2000 supercomputers
with 256 processors and 4TB of RAM running the Linux
operating system.
Twitter
From 12:01AM 23 October
2012 through to11:59PM 30
November 2012

Twitter Decahose from GNIP
streamed 1,535,929,521
tweets from 71,273,997
unique users, averaging 38
million tweets from 13.7
million users each day.
Use the location of social
media posts for emergency
warning, real-time local
situation reporting, etc.
Big data perspective on mapping the
geography of Twitter
• iPhones and Blackberries yield an additional 1% of all tweets
being georeferenced
• However, they’ve been missed by previous studies because
• They store their geographic information in the textual
Location field rather than the machine-readable Geo
metadata field
• In the big data era we need to look at the data itself, not just
assume it follows the manual.
Kalev Leetaru, University of Illinois on CrisisMappers
http://www.CrisisMappers.net
Why do we need big data?

Shutterstock
Analytics plus geospatial data is changing the
way we get insights (hidden patterns)
• Geospatial analytics gives you the ability to ask “where”
questions of business data
Where did it
happen?

Where will it
happen?

Where is it
happening?

Source: Teradata
Analytics plus geospatial data is changing the
way we get insights
•
•
•
•

Where are my customers?
Where are my competitors?
How far will customers travel to a branch or store?
Which of my competitor’s customers can I draw to a
branch or store?
• Which customers live close to a branch or store?
• Where can I increase profitability?
• How can I mitigate financial risk from flooding?
Is there a ‘problem with crowdsourcing
intelligence’?
DefenceIQ, May 2013 Thomas Chappelow
http://www.defenceiq.com/defence-technology/articles/the-problemwith-crowdsourcing-intelligence-in-syr/
• blogging, tweeting, mapping and photographing every single
detail…creating an unprecedented mountain of information that
can be farmed for actionable intelligence
• lack of traditional sources to rely on, the global intelligence
community has to look elsewhere for information…
crowdsourcing appears a juicy prospect – until it goes wrong
• Provenance, verification and trust
• Just as important for HUMINT as GEOINT
Big data cycle

Shutterstock
Some experience gained
Ordnance Survey today
• Ordnance Survey is 222 years old
• Civilian organisation since 1983; 1100 staff
• Independent Government Department and Executive Agency
reporting directly to a Government Minister
• Trading Fund since April 1999
• Annual Report for 2011/12: Revenue of £141.8m, profit before
exceptional items of £31.9m, dividend £17.2m
• Southampton headquarters with 26 field offices in Great Britain
The size of the task
Topographic Layer
(approximate volumes)
1:1250 Scale = 17 000 km2
1:2500 Scale = 158 000 km2
1:10 000 Scale = 66 000 km2
Over one million units of change per year.

Address Layer
27.5 million geocoded postal addresses,
with 500 000 changes per year.
Transport Network Layer
5.37 million kms of roads, 3.97million links,
885 881 route instructions –
over 20 000 changes per month.
Updating the Ordnance Survey database
Wide Range of Customers and Markets
A database to connect via real world information
• Every object represented in OS MasterMap has a unique
Reference identifier called a TOID. These TOIDs can be used to
connect other information and are linked to other core references
OS MasterMap current layers
Ordnance Survey
and
IBM Netezza

Shutterstock
Using IBM Netezza for high performance
geospatial analytics

Stress
Data Queries
Testing our
Data
Storytelling with
New Insights Location Data
Netezza and geospatial analytics
•
•
•
•
•
•

In-database geospatial analytic functions
Native understanding of geospatial data
High performance out of the box
Scales to terabytes of data
No indexes or aggregates to manage
Open, standards-based interface and data model

Analyse all data in a single appliance
Stress
Testing our
Data
Stress testing our data – Volume of data
? ? ?
? Data
Queries

? ? ?
?

?
Data queries – Volume of data
Data queries – Volume of data

We analysed 41 million
records in 19 hours.
We could not run this
query in the past.
New Insights
New insights – Volume and variety of data
Storytelling
with
Location
Data
Storytelling with location data
Big Data – Linked Data
•

As Ordnance Survey approaches the end of the transformation of its
operations, it is preparing its data to exploit the myriad
interconnections that can exist between physical entities in what has
been described as the “Internet of Things”. This web of
interconnections between disparate objects and ideas is made
possible through linked data technology.

•

Linked data assigns a unique tag – a three-fact, uniform resource
identifier known as a triple – to each thing of interest. For example,
population data can be linked to socio-economic statistics for a
given town.

•

Linked Data Web, currently estimated to include more than 30 billion
triples, with some 20% of those having geographic content.
Joining up Government
Hyperlocal example
‘Find me all GPs in my ward, bus stops within a 500 metre
radius of those GPs, but exclude bus stops in areas of high
crime’.
Environment
Transport

Health

Council

Crime

Education

Weather
Business
Big Data challenges
•

Big thinking (value)

•

Big strategy (necessary)

•

Big governance (stewardship)

•

Big access (sharing)

•

Big cooperation (supply chain)

•

Big privacy (security)

•

Big quality (QA/QC)

•

Big people (skills training)
Ordnance Survey International: advisory services
•

Strategic review and assessment

•

Capacity and capability building

•

Knowledge transfer and training

•

Value of geographic information

•

Technology direction – 3D, quality,
open standards and much more

•

National authoritative mapping

•

National address infrastructure

•

National geodetic infrastructure

•

National spatial data infrastructure
Ordnance Survey International
Thank you for your attention. For further information contact:

Steven Ramage, Head of Ordnance Survey International
steven.ramage@ordnancesurvey.co.uk

More Related Content

What's hot

Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolutionitnewsafrica
 
Data minig with Big data analysis
Data minig with Big data analysisData minig with Big data analysis
Data minig with Big data analysisPoonam Kshirsagar
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...Geoffrey Fox
 
Big Data
Big DataBig Data
Big DataNGDATA
 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its ChallengesKathirvel Ayyaswamy
 
Big Data in Action : Operations, Analytics and more
Big Data in Action : Operations, Analytics and moreBig Data in Action : Operations, Analytics and more
Big Data in Action : Operations, Analytics and moreSoftweb Solutions
 
Introduction to Big Data
Introduction to Big Data Introduction to Big Data
Introduction to Big Data Srinath Perera
 
Big data analytics 1
Big data analytics 1Big data analytics 1
Big data analytics 1gauravsc36
 
Mining Big Data in Real Time
Mining Big Data in Real TimeMining Big Data in Real Time
Mining Big Data in Real TimeAlbert Bifet
 
Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018Leanne Hwee
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Hritika Raj
 
A Big Data Timeline
A Big Data TimelineA Big Data Timeline
A Big Data TimelineBig Cloud
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapSrinath Perera
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big datakk1718
 

What's hot (20)

Big Data Evolution
Big Data EvolutionBig Data Evolution
Big Data Evolution
 
Data mining on big data
Data mining on big dataData mining on big data
Data mining on big data
 
NewMR 2016 presents: 9 Big Applications of Big Data
NewMR 2016 presents: 9 Big Applications of Big DataNewMR 2016 presents: 9 Big Applications of Big Data
NewMR 2016 presents: 9 Big Applications of Big Data
 
Data minig with Big data analysis
Data minig with Big data analysisData minig with Big data analysis
Data minig with Big data analysis
 
Bigdata analytics
Bigdata analyticsBigdata analytics
Bigdata analytics
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
 
Big data ankita1
Big data ankita1Big data ankita1
Big data ankita1
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Big Data
Big DataBig Data
Big Data
 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its Challenges
 
Big data mining
Big data miningBig data mining
Big data mining
 
Big Data in Action : Operations, Analytics and more
Big Data in Action : Operations, Analytics and moreBig Data in Action : Operations, Analytics and more
Big Data in Action : Operations, Analytics and more
 
Introduction to Big Data
Introduction to Big Data Introduction to Big Data
Introduction to Big Data
 
Big data analytics 1
Big data analytics 1Big data analytics 1
Big data analytics 1
 
Mining Big Data in Real Time
Mining Big Data in Real TimeMining Big Data in Real Time
Mining Big Data in Real Time
 
Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018
 
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
Big data PPT prepared by Hritika Raj (Shivalik college of engg.)
 
A Big Data Timeline
A Big Data TimelineA Big Data Timeline
A Big Data Timeline
 
Big Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and RoadmapBig Data Analytics Strategy and Roadmap
Big Data Analytics Strategy and Roadmap
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 

Similar to Geospatial Intelligence Middle East 2013_Big Data_Steven Ramage

Similar to Geospatial Intelligence Middle East 2013_Big Data_Steven Ramage (20)

Big Data et eGovernment
Big Data et eGovernmentBig Data et eGovernment
Big Data et eGovernment
 
Big Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar SemwalBig Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar Semwal
 
Understanding big data
Understanding big dataUnderstanding big data
Understanding big data
 
Big data and Internet
Big data and InternetBig data and Internet
Big data and Internet
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data Management
 
Bigdatappt 140225061440-phpapp01
Bigdatappt 140225061440-phpapp01Bigdatappt 140225061440-phpapp01
Bigdatappt 140225061440-phpapp01
 
Content1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docxContent1. Introduction2. What is Big Data3. Characte.docx
Content1. Introduction2. What is Big Data3. Characte.docx
 
Special issues on big data
Special issues on big dataSpecial issues on big data
Special issues on big data
 
Kartikey tripathi
Kartikey tripathiKartikey tripathi
Kartikey tripathi
 
Big_Data_ppt[1] (1).pptx
Big_Data_ppt[1] (1).pptxBig_Data_ppt[1] (1).pptx
Big_Data_ppt[1] (1).pptx
 
Bigdata " new level"
Bigdata " new level"Bigdata " new level"
Bigdata " new level"
 
Big data
Big dataBig data
Big data
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
ppt final.pptx
ppt final.pptxppt final.pptx
ppt final.pptx
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big data and analytics
Big data and analyticsBig data and analytics
Big data and analytics
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
 

More from Steven Ramage

Steven Ramage GEO keynote RCMRD International Conference Aug 2021
Steven Ramage GEO keynote RCMRD International Conference Aug 2021Steven Ramage GEO keynote RCMRD International Conference Aug 2021
Steven Ramage GEO keynote RCMRD International Conference Aug 2021Steven Ramage
 
Ramage GEO ISPRS July 2021
Ramage GEO ISPRS July 2021Ramage GEO ISPRS July 2021
Ramage GEO ISPRS July 2021Steven Ramage
 
Steven Ramage GEO_World Bank OLC
Steven Ramage GEO_World Bank OLCSteven Ramage GEO_World Bank OLC
Steven Ramage GEO_World Bank OLCSteven Ramage
 
S Ramage GEO REAP UR2020
S Ramage GEO REAP UR2020S Ramage GEO REAP UR2020
S Ramage GEO REAP UR2020Steven Ramage
 
GEO update July 2020
GEO update July 2020GEO update July 2020
GEO update July 2020Steven Ramage
 
GEO101 FOSS4G August 2019
GEO101 FOSS4G August 2019GEO101 FOSS4G August 2019
GEO101 FOSS4G August 2019Steven Ramage
 
Ramage EO4SDG keynote August 2019
Ramage EO4SDG keynote August 2019Ramage EO4SDG keynote August 2019
Ramage EO4SDG keynote August 2019Steven Ramage
 
GEO Data Technology Workshop Ramage April 2019
GEO Data Technology Workshop Ramage April 2019GEO Data Technology Workshop Ramage April 2019
GEO Data Technology Workshop Ramage April 2019Steven Ramage
 
S Ramage WEF Davos 2019
S Ramage WEF Davos 2019S Ramage WEF Davos 2019
S Ramage WEF Davos 2019Steven Ramage
 
S Ramage GEO FOSS$G2018
S Ramage GEO FOSS$G2018S Ramage GEO FOSS$G2018
S Ramage GEO FOSS$G2018Steven Ramage
 
S Ramage GEO UN-GGIM8 August 2018
S Ramage GEO UN-GGIM8 August 2018S Ramage GEO UN-GGIM8 August 2018
S Ramage GEO UN-GGIM8 August 2018Steven Ramage
 
CUF Prague June 2018 Ramage GEO
CUF Prague June 2018 Ramage GEOCUF Prague June 2018 Ramage GEO
CUF Prague June 2018 Ramage GEOSteven Ramage
 
Ramage Gi4DM Keynote March 2018
Ramage Gi4DM Keynote March 2018Ramage Gi4DM Keynote March 2018
Ramage Gi4DM Keynote March 2018Steven Ramage
 
Ramage GEO-CRADLE March 2018 Istanbul
Ramage GEO-CRADLE March 2018 IstanbulRamage GEO-CRADLE March 2018 Istanbul
Ramage GEO-CRADLE March 2018 IstanbulSteven Ramage
 
Ramage GEO World Urban Forum KL 2018
Ramage GEO World Urban Forum KL 2018Ramage GEO World Urban Forum KL 2018
Ramage GEO World Urban Forum KL 2018Steven Ramage
 
S Ramage GEO_global forum disaster resilience_Nov2017
S Ramage GEO_global forum disaster resilience_Nov2017S Ramage GEO_global forum disaster resilience_Nov2017
S Ramage GEO_global forum disaster resilience_Nov2017Steven Ramage
 
S Ramage GEO UN-GGIM HLF Mexico Nov 2017
S Ramage GEO UN-GGIM HLF Mexico Nov 2017S Ramage GEO UN-GGIM HLF Mexico Nov 2017
S Ramage GEO UN-GGIM HLF Mexico Nov 2017Steven Ramage
 
S Ramage GEO_EFGS Nov 2017
S Ramage GEO_EFGS Nov 2017S Ramage GEO_EFGS Nov 2017
S Ramage GEO_EFGS Nov 2017Steven Ramage
 
S Ramage GEO_IEOS Nov 2017
S Ramage GEO_IEOS Nov 2017S Ramage GEO_IEOS Nov 2017
S Ramage GEO_IEOS Nov 2017Steven Ramage
 

More from Steven Ramage (20)

Steven Ramage GEO keynote RCMRD International Conference Aug 2021
Steven Ramage GEO keynote RCMRD International Conference Aug 2021Steven Ramage GEO keynote RCMRD International Conference Aug 2021
Steven Ramage GEO keynote RCMRD International Conference Aug 2021
 
Ramage GEO ISPRS July 2021
Ramage GEO ISPRS July 2021Ramage GEO ISPRS July 2021
Ramage GEO ISPRS July 2021
 
Steven Ramage GEO_World Bank OLC
Steven Ramage GEO_World Bank OLCSteven Ramage GEO_World Bank OLC
Steven Ramage GEO_World Bank OLC
 
S Ramage GEO REAP UR2020
S Ramage GEO REAP UR2020S Ramage GEO REAP UR2020
S Ramage GEO REAP UR2020
 
GEO update July 2020
GEO update July 2020GEO update July 2020
GEO update July 2020
 
GEO101 FOSS4G August 2019
GEO101 FOSS4G August 2019GEO101 FOSS4G August 2019
GEO101 FOSS4G August 2019
 
Ramage EO4SDG keynote August 2019
Ramage EO4SDG keynote August 2019Ramage EO4SDG keynote August 2019
Ramage EO4SDG keynote August 2019
 
GEO Data Technology Workshop Ramage April 2019
GEO Data Technology Workshop Ramage April 2019GEO Data Technology Workshop Ramage April 2019
GEO Data Technology Workshop Ramage April 2019
 
S Ramage WEF Davos 2019
S Ramage WEF Davos 2019S Ramage WEF Davos 2019
S Ramage WEF Davos 2019
 
S Ramage GEO FOSS$G2018
S Ramage GEO FOSS$G2018S Ramage GEO FOSS$G2018
S Ramage GEO FOSS$G2018
 
S Ramage GEO UN-GGIM8 August 2018
S Ramage GEO UN-GGIM8 August 2018S Ramage GEO UN-GGIM8 August 2018
S Ramage GEO UN-GGIM8 August 2018
 
GEO Symposium 2018
GEO Symposium 2018GEO Symposium 2018
GEO Symposium 2018
 
CUF Prague June 2018 Ramage GEO
CUF Prague June 2018 Ramage GEOCUF Prague June 2018 Ramage GEO
CUF Prague June 2018 Ramage GEO
 
Ramage Gi4DM Keynote March 2018
Ramage Gi4DM Keynote March 2018Ramage Gi4DM Keynote March 2018
Ramage Gi4DM Keynote March 2018
 
Ramage GEO-CRADLE March 2018 Istanbul
Ramage GEO-CRADLE March 2018 IstanbulRamage GEO-CRADLE March 2018 Istanbul
Ramage GEO-CRADLE March 2018 Istanbul
 
Ramage GEO World Urban Forum KL 2018
Ramage GEO World Urban Forum KL 2018Ramage GEO World Urban Forum KL 2018
Ramage GEO World Urban Forum KL 2018
 
S Ramage GEO_global forum disaster resilience_Nov2017
S Ramage GEO_global forum disaster resilience_Nov2017S Ramage GEO_global forum disaster resilience_Nov2017
S Ramage GEO_global forum disaster resilience_Nov2017
 
S Ramage GEO UN-GGIM HLF Mexico Nov 2017
S Ramage GEO UN-GGIM HLF Mexico Nov 2017S Ramage GEO UN-GGIM HLF Mexico Nov 2017
S Ramage GEO UN-GGIM HLF Mexico Nov 2017
 
S Ramage GEO_EFGS Nov 2017
S Ramage GEO_EFGS Nov 2017S Ramage GEO_EFGS Nov 2017
S Ramage GEO_EFGS Nov 2017
 
S Ramage GEO_IEOS Nov 2017
S Ramage GEO_IEOS Nov 2017S Ramage GEO_IEOS Nov 2017
S Ramage GEO_IEOS Nov 2017
 

Recently uploaded

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 

Recently uploaded (20)

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 

Geospatial Intelligence Middle East 2013_Big Data_Steven Ramage

  • 1. Big Data Considerations Geospatial Intelligence Middle East, May 2013 Steven Ramage Head of Ordnance Survey International
  • 2. Geospatial Intelligence Middle East 2013 Recently the Military GIS and Intelligence communities have gained a better understanding of the incredible increase of “Cloud” empowered applications, the challenges and opportunities of Big Data, the importance of social media, the availability of improved applications, and the dramatic improvement in quality and availability of remote sensing data. This, and the increased speed of GIS applications and the integration of a full-motion video analysis product, empowers military forces and national security agencies to exploit and analyze full motion video from UAVs and other airborne vehicles. http://tinyurl.com/cd8z6y5
  • 3. http://www.computerweekly.com/feature/OrdnanceSurvey-gets-to-grips-with-geospatial-big-data “ Ordnance Survey has all but completed a five-year IT improvement programme to enhance its operations. That programme – with Oracle as the main IT partner – has already transformed those operations into an enterprise grid computing system that pulls 17 databases into one Oracle spatial database management platform. The platform supports all geospatial data types and models. The system combines open source Linux with Oracle’s grid computing architecture, which makes it possible to coordinate large numbers of low-cost servers and corresponding storage so they operate like one large computer. ”
  • 4. Big Data challenges • Big thinking (value) • Big strategy (necessary) • Big governance (stewardship) • Big access (sharing) • Big cooperation (supply chain) • Big privacy (security) • Big quality (QA/QC) • Big people (skills training)
  • 5. What is big data? Shutterstock
  • 6. http://hortonworks.com/blog/big-data-defined/ April 4th, 2013 Russell Jurney • Wikipedia defines as problems posed by the awkwardness of legacy tools in supporting massive datasets: what is a massive dataset? Megabytes  Yottabytes. • Collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications. • There is a ‘Big Data’ opportunity: transformative economics. Big Data is the opportunity space created by new open source, distributed systems from the consumer internet space.
  • 7. The big data environment • Volume • Data at rest; levels increasing • Velocity • Data in motion; speed at which it transits enterprises and entire industries is faster than ever • Variety • Data in many forms; hundreds of millions of web pages, emails and unstructured data, such as Word documents and PDFs, as well as a nearly infinite number of events and information from every enterprise data centres • Value • Do you need it?
  • 8. Facebook now has 50 billion photographs
  • 9. The big data environment • It uses local storage to be fast but inexpensive • It uses clusters of commodity hardware to be inexpensive • It uses free software to be inexpensive • It is open source to build from community learning • Cheap storage means logging enormous volumes of data to many disks is easy. Processing this data is less so. Distributed systems which have the above four properties are disruptive because they are approximately 100 times cheaper than other systems for processing large volumes of data, and because they deliver high I/O performance.
  • 10. The big data environment • Apache Hadoop is one such system. Hadoop ties together a cluster of commodity machines with local storage using free and open source software to store and process vast amounts of data at a fraction of the cost of other systems [Example: Esri/spatialframework-for-hadoop, GitHub: social network for programmers] • SAN Storage $2-10/GB  Local Storage $0.05/GB
  • 11. The big data environment • Capture every shred of data in the cheapest place possible • Provide access to this data across the organization • Mine the data for value • “To undergo the transformative processes that unabridged access to data provides, enabling bigger, better, faster more profound insight than ever before”. Blogger
  • 12. Most data isn’t big and businesses are wasting money pretending it is: www.qz.com/81661/most-data • How many of us need to undertake operations that rank every web page that exists? • What processing tasks cannot be handled on a single computer or even a laptop? [Megabyte to Gigabyte range] • Weren’t you doing data analysis before data became big? • Do you have the requirement or capability to check correlations or patterns that you can act on if you have even more data? • False positives. Vincent Granville wrote ‘The curse of big data’, even if a dataset includes 1000 items there are many millions of correlations, a few will be extremely high just by chance. • Getting more into the field of data science (stats, quality, etc.)
  • 13. Mapping the global Twitter heartbeat: the geography of Twitter http://firstmonday.org/ojs/index.php/fm/article/vi ew/4366/3654 • In 2012, supercomputing manufacturer Silicon Graphics International (SGI), the University of Illinois and social media data vendor GNIP collaborated to create the “Global Twitter Heartbeat” project (http://www.sgi.com/go/twitter) in order to map global emotion expressed on Twitter in real-time. • GNIP provided access to the Twitter Decahose, which consists of 10 percent of all tweets sent globally each day. • SGI provided access to one of its new UV2000 supercomputers with 256 processors and 4TB of RAM running the Linux operating system.
  • 14. Twitter From 12:01AM 23 October 2012 through to11:59PM 30 November 2012 Twitter Decahose from GNIP streamed 1,535,929,521 tweets from 71,273,997 unique users, averaging 38 million tweets from 13.7 million users each day. Use the location of social media posts for emergency warning, real-time local situation reporting, etc.
  • 15. Big data perspective on mapping the geography of Twitter • iPhones and Blackberries yield an additional 1% of all tweets being georeferenced • However, they’ve been missed by previous studies because • They store their geographic information in the textual Location field rather than the machine-readable Geo metadata field • In the big data era we need to look at the data itself, not just assume it follows the manual. Kalev Leetaru, University of Illinois on CrisisMappers http://www.CrisisMappers.net
  • 16. Why do we need big data? Shutterstock
  • 17. Analytics plus geospatial data is changing the way we get insights (hidden patterns) • Geospatial analytics gives you the ability to ask “where” questions of business data Where did it happen? Where will it happen? Where is it happening? Source: Teradata
  • 18. Analytics plus geospatial data is changing the way we get insights • • • • Where are my customers? Where are my competitors? How far will customers travel to a branch or store? Which of my competitor’s customers can I draw to a branch or store? • Which customers live close to a branch or store? • Where can I increase profitability? • How can I mitigate financial risk from flooding?
  • 19. Is there a ‘problem with crowdsourcing intelligence’? DefenceIQ, May 2013 Thomas Chappelow http://www.defenceiq.com/defence-technology/articles/the-problemwith-crowdsourcing-intelligence-in-syr/ • blogging, tweeting, mapping and photographing every single detail…creating an unprecedented mountain of information that can be farmed for actionable intelligence • lack of traditional sources to rely on, the global intelligence community has to look elsewhere for information… crowdsourcing appears a juicy prospect – until it goes wrong • Provenance, verification and trust • Just as important for HUMINT as GEOINT
  • 22. Ordnance Survey today • Ordnance Survey is 222 years old • Civilian organisation since 1983; 1100 staff • Independent Government Department and Executive Agency reporting directly to a Government Minister • Trading Fund since April 1999 • Annual Report for 2011/12: Revenue of £141.8m, profit before exceptional items of £31.9m, dividend £17.2m • Southampton headquarters with 26 field offices in Great Britain
  • 23. The size of the task Topographic Layer (approximate volumes) 1:1250 Scale = 17 000 km2 1:2500 Scale = 158 000 km2 1:10 000 Scale = 66 000 km2 Over one million units of change per year. Address Layer 27.5 million geocoded postal addresses, with 500 000 changes per year. Transport Network Layer 5.37 million kms of roads, 3.97million links, 885 881 route instructions – over 20 000 changes per month.
  • 24. Updating the Ordnance Survey database
  • 25. Wide Range of Customers and Markets
  • 26.
  • 27. A database to connect via real world information • Every object represented in OS MasterMap has a unique Reference identifier called a TOID. These TOIDs can be used to connect other information and are linked to other core references
  • 30. Using IBM Netezza for high performance geospatial analytics Stress Data Queries Testing our Data Storytelling with New Insights Location Data
  • 31. Netezza and geospatial analytics • • • • • • In-database geospatial analytic functions Native understanding of geospatial data High performance out of the box Scales to terabytes of data No indexes or aggregates to manage Open, standards-based interface and data model Analyse all data in a single appliance
  • 33. Stress testing our data – Volume of data
  • 34. ? ? ? ? Data Queries ? ? ? ? ?
  • 35. Data queries – Volume of data
  • 36. Data queries – Volume of data We analysed 41 million records in 19 hours. We could not run this query in the past.
  • 38. New insights – Volume and variety of data
  • 41. Big Data – Linked Data • As Ordnance Survey approaches the end of the transformation of its operations, it is preparing its data to exploit the myriad interconnections that can exist between physical entities in what has been described as the “Internet of Things”. This web of interconnections between disparate objects and ideas is made possible through linked data technology. • Linked data assigns a unique tag – a three-fact, uniform resource identifier known as a triple – to each thing of interest. For example, population data can be linked to socio-economic statistics for a given town. • Linked Data Web, currently estimated to include more than 30 billion triples, with some 20% of those having geographic content.
  • 43. Hyperlocal example ‘Find me all GPs in my ward, bus stops within a 500 metre radius of those GPs, but exclude bus stops in areas of high crime’. Environment Transport Health Council Crime Education Weather Business
  • 44. Big Data challenges • Big thinking (value) • Big strategy (necessary) • Big governance (stewardship) • Big access (sharing) • Big cooperation (supply chain) • Big privacy (security) • Big quality (QA/QC) • Big people (skills training)
  • 45. Ordnance Survey International: advisory services • Strategic review and assessment • Capacity and capability building • Knowledge transfer and training • Value of geographic information • Technology direction – 3D, quality, open standards and much more • National authoritative mapping • National address infrastructure • National geodetic infrastructure • National spatial data infrastructure
  • 46. Ordnance Survey International Thank you for your attention. For further information contact: Steven Ramage, Head of Ordnance Survey International steven.ramage@ordnancesurvey.co.uk