This document discusses the importance of connecting and exploiting big data through linking various data sources. It argues that simply creating new large data silos does not provide much value, and that true value is derived when numerous sources are brought together to create a single reference point. The document also discusses how linked data provides advantages over traditional big data approaches by structuring data around semantic triples that can interconnect different records and provide a bigger picture view. However, it cautions that linked data also requires rigorous controls and understanding to avoid simply creating another unstructured data source.
2. www.vestforsk.no
Whilst big data may represent a step forward
in business intelligence and analytics, we see
added value in linking and utilizing big data for
business benefit.
Once we bring together numerous data
sources to provide a single reference point can
we start to derive new value. Until then, we
only risk creating new data silos.
4. www.vestforsk.no
Hype around Big Data
Today, the difference between success and failure is the ability to monetize a
new class of data. It’s ironic that, despite billions of dollars spent on business
intelligence systems, we are still data‐bankrupt.
– Roman Stanek, Founder and CEO of Good Data
8. www.vestforsk.no
Definition(s) of “big data”
Big Data is a term encompassing the use of
techniques to capture, process, analyse and
visualize potentially large datasets in a
reasonable timeframe not accessible to
standard IT technologies.
By extension, the platform, tools and
software used for this purpose are collectively
called ‘Big Data technologies’
(Networked European Software and Service Initiative, 2012).
13. www.vestforsk.no
Advanced (intelligent) data analytics
In-Database
analytics
Conventional
Advanced
analytics
Hadoop
Evolution
• Needs human intervention
• Latency, compression and speed
• Coverage is vital rather than thoroughness
• Data can be Tbytes Pbytes
• Enhances the system performance by scale-out
• Statistical data and data mining
Conventional
• Fully automated thoroughness
is required
• Restricted on kinds of data
• Transaction management
• Volumes of data
Big Data Future
• New insight of multi-structured data
• Real-time big data analytics
• Process information in-memory, In-time, in-place
• Enhanced speed with low latency
• Semantic technologies
Conventional Advanced
(intelligent)
Analytics – NLP
and semantic
technologies
Unstructured data
batch processing -
Hadoop
In-Database
analytics
Information
Applications
Infrastructure
Cohesive
Infrastructure