www.vestforsk.no
Connecting and Exploiting Big Data
Rajendra Akerkar
rak@vestforsk.no
• Whilst big data may represent a step forward
in business intelligence and analytics, we see
added value in linking and utilizing big data for
business benefit.
• Only once we bring together numerous data
sources to provide a single reference point can
we start to derive new value. Until then, we
risk only creating new data silos.
Source: Bloor Group
Hype around Big Data 
Today, the difference between success and failure is the ability to monetize a
new class of data. It’s ironic that, despite billions of dollars spent on business
intelligence systems, we are still data‐bankrupt.
– Roman Stanek, Founder and CEO of GoodData
What is Big Data?
Data that is too big, moves too fast, or doesn’t fit the structures of your database
architecture
3 Dimensions: Variety, Velocity, and Volume
[Figure: sources of big data: social, videos & photos, mobile GPS, email]
The rise and rise of Big Data
Share of the digital universe by India and China
Definition(s) of “big data”
Big Data is a term encompassing the use of
techniques to capture, process, analyse and
visualize potentially large datasets in a
reasonable timeframe not accessible to
standard IT technologies.
By extension, the platform, tools and
software used for this purpose are collectively
called ‘Big Data technologies’
(Networked European Software and Service Initiative, 2012). 
Attributes 
Venue
Veracity
Vocabulary
What happens if the raw data you are 
injecting into your system is 
incomplete or formatted incorrectly 
from the get‐go?
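One way to make the veracity question concrete is to check each record at the point of ingestion. The following is a minimal sketch, not a method from the slides: the field names (`user_id`, `timestamp`, `lat`, `lon`), the rules, and the sample records are all assumptions chosen for illustration.

```python
from datetime import datetime

# Hypothetical schema: required field -> parser that raises on bad input.
REQUIRED_FIELDS = {
    "user_id": str,
    "timestamp": datetime.fromisoformat,
    "lat": float,
    "lon": float,
}


def validate(record):
    """Return a list of problems found in one raw record (empty list = clean)."""
    problems = []
    for field, parse in REQUIRED_FIELDS.items():
        value = record.get(field)
        if value is None or value == "":
            problems.append(f"missing {field}")
            continue
        try:
            parse(value)
        except (TypeError, ValueError):
            problems.append(f"badly formatted {field}: {value!r}")
    return problems


def split_by_veracity(records):
    """Separate clean records from rejected ones, keeping the reasons."""
    clean, rejected = [], []
    for record in records:
        problems = validate(record)
        if problems:
            rejected.append((record, problems))
        else:
            clean.append(record)
    return clean, rejected


# Made-up sample input: one clean record, one bad date format, one missing value.
raw = [
    {"user_id": "u1", "timestamp": "2013-05-01T10:00:00", "lat": "61.45", "lon": "5.85"},
    {"user_id": "u2", "timestamp": "01/05/2013", "lat": "61.45", "lon": "5.85"},
    {"user_id": "u3", "timestamp": "2013-05-01T10:05:00", "lat": "", "lon": "5.85"},
]
clean, rejected = split_by_veracity(raw)
```

Keeping the rejected records together with their reasons, rather than silently dropping them, makes it possible to see how much of the incoming data is unusable and why.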
The potential of Big Data
• Data contains information of enormous
business value
• Extract those insights and make far better
decisions
• ...but is data indeed that valuable?
Correlation versus Causation versus “What’s great for the job”
Oncologists might benefit from seeing the similarities among cells in a 
biopsy, but targeting certain markers doesn’t guarantee you can cure 
someone’s cancer.
Source: Columbia University
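The gap between correlation and causation can be shown with a small simulation. This is a sketch on synthetic numbers only, not medical data: the variable names (`hidden`, `marker`, `outcome`) and the noise level are assumptions. A hidden factor drives both the marker and the outcome, so the two correlate strongly, yet setting the marker directly does nothing to the outcome.

```python
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


rng = random.Random(0)  # fixed seed so the sketch is repeatable
n = 2000

# A hidden factor drives both the marker and the outcome (synthetic data).
hidden = [rng.gauss(0, 1) for _ in range(n)]
marker = [h + 0.3 * rng.gauss(0, 1) for h in hidden]
outcome = [h + 0.3 * rng.gauss(0, 1) for h in hidden]
observed = pearson(marker, outcome)  # strong correlation

# "Targeting" the marker: set it independently of the hidden factor.
# The outcome still depends only on the hidden factor, so the link vanishes.
forced_marker = [rng.gauss(0, 1) for _ in range(n)]
after_intervention = pearson(forced_marker, outcome)  # close to zero
```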
Advanced (intelligent) data analytics
[Figure: evolution of analytics, with stages labelled Conventional, In-Database analytics, Hadoop, and Advanced analytics]
• Needs human intervention
• Latency, compression and speed
• Coverage is vital rather than thoroughness
• Data can be Tbytes to Pbytes
• Enhances the system performance by scale-out
• Statistical data and data mining
Conventional
• Fully automated; thoroughness is required
• Restricted on kinds of data
• Transaction management
• Volumes of data
Big Data Future
• New insight of multi-structured data
• Real-time big data analytics
• Process information in-memory, in-time, in-place
• Enhanced speed with low latency
• Semantic technologies
[Figure: layered view with the labels Information, Applications, and Infrastructure (cohesive infrastructure); components shown: conventional analytics, advanced (intelligent) analytics using NLP and semantic technologies, unstructured-data batch processing with Hadoop, and in-database analytics]
Semantics of big data
• Shifting from “data of action” to “data of intention”.
• The future of big data will be to use it as a tool to
discover new segments and audiences, and to invent new
products.
Why Linked Data?
• Big data tends to be unstructured, so metadata
becomes important
  - For example, location data can help make sense of the
    data because it gives it a particular structure.
• Here, linked data offers significant advantages in
tying together different records to provide a view of
the bigger picture.
• Linked data treats the Web as a giant database that can be
mined to link to data, rather than to document‐based
resources.
Linked Data ‐ Paradigm
• Use URIs as names for things.
• Use HTTP URIs so that people can look up
those names.
• When someone looks up a URI, provide useful
information.
• Include links to other URIs, so that they can
discover more things.
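The four rules above can be sketched without any network access. In this illustration a dictionary stands in for the Web; every URI, label, and property name (`worksAt`, `locatedIn`) is hypothetical, and a real deployment would serve RDF over HTTP rather than Python dictionaries.

```python
# A stand-in for the Web: each HTTP URI names a thing (rules 1 and 2) and maps
# to the description a server would return for it. All entries are made up.
WEB = {
    "http://example.org/id/alice": {
        "label": "Alice",
        "links": {"worksAt": "http://example.org/id/acme"},
    },
    "http://example.org/id/acme": {
        "label": "Acme Research",
        "links": {"locatedIn": "http://example.org/id/sogndal"},
    },
    "http://example.org/id/sogndal": {
        "label": "Sogndal",
        "links": {},
    },
}


def look_up(uri):
    """Rule 3: looking up a URI yields useful information (or None)."""
    return WEB.get(uri)


def discover(start_uri):
    """Rule 4: follow links to other URIs to discover more things."""
    seen, queue = [], [start_uri]
    while queue:
        uri = queue.pop(0)
        if uri in seen:
            continue
        description = look_up(uri)
        if description is None:
            continue  # nothing is published at this URI
        seen.append(uri)
        queue.extend(description["links"].values())
    return [WEB[uri]["label"] for uri in seen]


found = discover("http://example.org/id/alice")
# found == ["Alice", "Acme Research", "Sogndal"]
```

Starting from a single name, a client reaches the organisation and the place only because each description links onward. Without rule 4 every lookup would be a dead end.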
Linked data – the next evolutionary stage for the
database?
• Through linked data, data in unstructured databases
can be linked to data in traditional data stores without
changing existing schemas
• Using linked data provides a highly scalable solution,
based on the same principles as unstructured data, but
with the linked data structured around triples
• An agent!
  - Mapping and interconnecting, indexing and feeding real‐time
    information from a variety of sources
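A minimal sketch of the triple idea, assuming a made-up relational row and a made-up social media post: each source is mapped to (subject, predicate, object) triples without touching its own schema, and a shared URI is what links them. The namespace, field names, and the tiny `match` helper are illustrative assumptions, not a real triple store API.

```python
# Source 1: a row from a traditional relational table (schema left unchanged).
customer_row = {"customer_id": 42, "name": "Kari", "city": "Bergen"}

# Source 2: an unstructured record, for example a social media post.
post = {"id": "p7", "text": "Loving the new phone!", "author_ref": 42}

BASE = "http://example.org/"  # hypothetical namespace


def customer_triples(row):
    subject = f"{BASE}customer/{row['customer_id']}"
    return [
        (subject, f"{BASE}name", row["name"]),
        (subject, f"{BASE}city", row["city"]),
    ]


def post_triples(p):
    subject = f"{BASE}post/{p['id']}"
    return [
        (subject, f"{BASE}text", p["text"]),
        # The link: the post points at the customer's URI.
        (subject, f"{BASE}author", f"{BASE}customer/{p['author_ref']}"),
    ]


graph = customer_triples(customer_row) + post_triples(post)


def match(triples, s=None, p=None, o=None):
    """Return triples matching the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


# In which city does each post's author live? This joins both sources.
cities = [
    city
    for _post_uri, _, author in match(graph, p=f"{BASE}author")
    for _, _, city in match(graph, s=author, p=f"{BASE}city")
]
# cities == ["Bergen"]
```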
Implementing big data and linked data 
• Data integrity
• Understanding of the data
• Integration
• Data quality
• Data storage and replication
• Data migration
• Data security
But linked data is no remedy...
If rigorous controls are not applied to the
metamodel, it becomes yet another unstructured data
source, making the problem worse rather than
better!
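What "rigorous controls" might look like in the simplest case is sketched below: incoming triples are checked against a small controlled vocabulary before they are accepted. The metamodel, predicates, and sample triples are hypothetical; real systems would typically rely on schema or constraint languages rather than a hand-written check.

```python
# Hypothetical metamodel: allowed predicates and the type each object must have.
METAMODEL = {
    "http://example.org/name": str,
    "http://example.org/age": int,
}


def check_triple(triple, metamodel=METAMODEL):
    """Return None if the triple conforms, else a short reason for rejecting it."""
    subject, predicate, obj = triple
    if not subject.startswith("http://"):
        return "subject is not an HTTP URI"
    expected = metamodel.get(predicate)
    if expected is None:
        return f"unknown predicate {predicate}"
    if not isinstance(obj, expected):
        return f"object of {predicate} should be {expected.__name__}"
    return None


# Made-up input: one conforming triple, one wrong type, one unknown predicate.
incoming = [
    ("http://example.org/customer/42", "http://example.org/name", "Kari"),
    ("http://example.org/customer/42", "http://example.org/age", "thirty"),
    ("http://example.org/customer/42", "http://example.org/nickname", "K"),
]
accepted = [t for t in incoming if check_triple(t) is None]
reasons = [r for r in map(check_triple, incoming) if r is not None]
```

Without such a gate, the second and third triples would enter the graph and every consumer would have to guess what they mean.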
Publisher: Taylor & Francis Group/CRC Press
http://www.taylorandfrancis.com/books/details/9781466578371/ 
Read this book!
