SlideShare una empresa de Scribd logo
1 de 22
Descargar para leer sin conexión
Your Trusted
                                    Web Presence Partner




          BIG DATA
    Semantic Data
© 2011 Avalon Consulting, LLC   1
Your Speaker

• Kurt Cagle is an Information Architect for
  Avalon Consulting.
• Author of 18 books on XML, Web
  Development and the Semantic Web
• Managing Editor of XMLToday.org
• Email: caglek@avalonconsult.com




© 2011 Avalon Consulting, LLC
Perspectives
• In 1945, the cost to acquire a byte of data was high
  ~ $1/kB in 2011 USD.
• By 1960, that same dollar could get 1000 times as
  much data. This is another expression of Moore's
  Law.
• At 15 years per 1000x increase, the cost to acquire
  a kByte in 2011 is ~ $0.000000001/kB.
• By 2050, this will be
  $0.00000000000000000000001/kB, or
  10,000,000,000,000,000,000,000 kB/$.


© 2011 Avalon Consulting, LLC
Record Data
• The fundamental unit of data is the record.
• A record has an identity, a unique code (within its
  context) that differentiates it from all other
  records.
• A record has zero or more properties that describe
  specific characteristics of the record.
• Some of those properties may be pointers to other
  records that have a given identity.
• The combination of properties and identity also
  share a semantic cohesiveness.

© 2011 Avalon Consulting, LLC
From Record to Resource
• A resource is an abstract entity that is both
  unique and addressable.
• A representation of that resource is a (potentially
  structured) bag of properties that describe
  characteristics of that resource.
• If that resource is part of a collection, then the
  representation of that resource is a record in that
  collection.
• Note that the record is NOT the resource – it is
  only a description of the state of the resource at
  the moment that it is queried.

© 2011 Avalon Consulting, LLC
Addressibility
• A resource is addressible if, for a given collection,
  there is a key that can be used to retrieve a
  representation of the resource from the collection.
• A collection in turn, can be thought of as the
  context or namespace of all addresses within that
  collection.
• Conversely, if there is no collection for which a
  key exists to retrieve a resource then it is not a
  resource.



© 2011 Avalon Consulting, LLC
Resources and Time
• Most resources are state machines – they change
  their (internal) state over time.
• Depending upon the requested representation
  format, a record may be
   – the most recent representation of a resource,
   – the delta of changes from the last request,
   – or may be a log of all changes.
• Through 2000 or so, most resources changed
  slowly. That is no longer necessarily true.
• This means that all resources are services.

© 2011 Avalon Consulting, LLC
BIG DATA and REST
• REST – Representational State Transfer – is
  becoming the primary services representation.
• The period from 2005 to 2020 will be concerned
  with making previously non-addressable data
  RESTful.
• REST imputes CRUD (Create, Read, Update,
  Delete) semantics to addressable networks.
• REST also implies that collections are resources.




© 2011 Avalon Consulting, LLC
Representations
• A collection processor (aka, a server) is an
  abstraction layer between internal and external
  data representations.
• Any time a request is made, there is almost always
  a transformation that maps between an internal
  entity and the requested output.
• Representations do not have to be (and most
  times are not) in the same format as the
  underlying resource – a resource internally shown
  as XML can be rendered as HTML, JSON, zipped
  files, graphics, PDF, etc.

© 2011 Avalon Consulting, LLC
Collections
• A collection establishes a constrained set of
  resources.
• A collection can also be thought of as a category,
  with each resource key as a term in the category's
  taxonomy.
• A resource may belong to more than one
  collection.
• Collections determine available representations
  for resources.



© 2011 Avalon Consulting, LLC
Collections and Search
• A search is the invocation of a parametric
  function on a collection that returns a set of
  resource representations (typically with links to
  more extensive representations).
• It is possible for a collection to have multiple
  search functions bound to that collection – in that
  regard a search function may itself be a resource
  in a different collection.
• Search is the bridge between addressable RESTful
  retrieval and imperative web services.


© 2011 Avalon Consulting, LLC
Resources and Data Models
• A resource is an abstraction of a physical entity or
  process, and as such it is, itself, conformant to a
  data model.
• A representation of a resource is a transformation
  of that resource within the context of its
  collection.
• This means that relationships that are only
  inferred within the internal model may be made
  explicit within the representation.



© 2011 Avalon Consulting, LLC
BIG DATA Stage 1: Hadoop
• Hadoop can take context poor or legacy
  structured data and create from it
  contextual richer data records, which can in
  turn inform the development of resources.
• While Hadoop can be used for performing
  queries, these will be high latency searches
  compared to most other systems.
• Hadoop's real value comes from its ability
  to process data into more structurally
  queryable or manipulatible forms.
© 2011 Avalon Consulting, LLC
BIG DATA Stage 2: SQL
• SQL relational databases work best with
  single dimensional views based upon
  primary/foreign key relationships, ideal for
  many data models that have relatively rigid
  structure but somewhat richer semantics.
• SQL is still ideal for many forms of real
  world data processing, but generally has
  both non-standardized streaming
  mechanisms and constrained procedure
  semantics.
© 2011 Avalon Consulting, LLC
BIG DATA Stage 3: Hash Databases

• MongoDB and similar tools extend hash
  tables to representing more complex data
  models, as the combination of hashes and
  sequences can readily represent objects that
  have the same core namespace context.
• These are becoming popular as producers
  and consumers of JSON, and typically
  employ a JavaScript stack for most
  transactions.
© 2011 Avalon Consulting, LLC
BIG DATA Stage 4: XML Databases

• XML Databases store XML representations
  of objects, which best encode narrative
  structures, hierarchical entities that cross
  namespace boundaries, and provide very
  sophisticated tools for building RESTful
  web application services.
• XML Databases are optimal for documents
  and hybrid document/data structures.
• XML Databases are standardizing upon
  XQuery as the common stack language.
© 2011 Avalon Consulting, LLC
BIG DATA Stage 5: RDF Triple Stores

• Triple stores encode relationships between
  resources via RDF n-tuples for query by
  SPARQL. Triple stores work best for
  working with distributed data where
  relationships between resources are as or
  more important than the actual contents of
  the resources themselves.




© 2011 Avalon Consulting, LLC
Complementary Technologies

• There is a tendency to see NoSQL
  technologies as competitive. This is both
  wrong and dangerous.
• The technologies have evolved to handle
  different phases of data acquisition,
  processing and search, going from data
  with poor semantics and rigid structure to
  data with rich semantics and flexible
  structure.
© 2011 Avalon Consulting, LLC
Generation 5 Data Stores

• The next wave of databases will likely incorporate
  two or more generation 4 technologies – such as
  combining an XML or JSON database with an
  RDF triple store to be able to handle inferences
  about stored content, or by adding Hadoop
  processing support to a SQL database capable of
  providing RESTful representations in XML or
  JSON of internal views built from SQL tables.



© 2011 Avalon Consulting, LLC
Query Unification

• The central challenge of the next twenty
  years will be in finding the commonalities
  across these various platforms in order to
  build a twenty first century SQL.
• This will require a deeper understanding of
  the characteristics of data algebras, which
  provides the underpinnings for the most
  optimal expression of data structures.

© 2011 Avalon Consulting, LLC
XQuery?

• XQuery may be a good candidate to handle
  this unification, for several reasons:
     – Solid extensibility model
     – SQL-like syntax, but capable of handling
         RESTful services inbound and outbound
     – XQuery 3.0 supports maps, which are
         hash/sequence constructs ideal for encoding
         JSON or MongoDB structures.
     – Works well in a distributed context.
     – Can integrate Sparql or SQL scripts readily.
     – Standards based, and written by SQL author
© 2011 Avalon Consulting, LLC
Contact Us

                         Avalon Consulting, LLC

Dallas Office - HQ Washington, DC Office
5600 Tennyson Parkway 527 Maple Avenue East
Suite 230        Suite 200
Plano, TX 75024      Vienna, VA 22180
469-424-3449         703-635-3302



                                www.avalonconsult.com
                                info@avalonconsult.com


© 2011 Avalon Consulting, LLC             22

Más contenido relacionado

La actualidad más candente

Introduction to Web Services
Introduction to Web ServicesIntroduction to Web Services
Introduction to Web ServicesJeffrey Anderson
 
The semantic web
The semantic web The semantic web
The semantic web ap
 
Semantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational DatabasesSemantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational DatabasesCambridge Semantics
 
Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)Anja Jentzsch
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked DataEUCLID project
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic WebMarin Dimitrov
 
Resource Oriented Architectures: The Future of Data API?
Resource Oriented Architectures: The Future of Data API?Resource Oriented Architectures: The Future of Data API?
Resource Oriented Architectures: The Future of Data API?Victor Olex
 
Big Linked Data - Creating Training Curricula
Big Linked Data - Creating Training CurriculaBig Linked Data - Creating Training Curricula
Big Linked Data - Creating Training CurriculaEUCLID project
 
An Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAn Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAnkur Biswas
 

La actualidad más candente (11)

Introduction to Web Services
Introduction to Web ServicesIntroduction to Web Services
Introduction to Web Services
 
The semantic web
The semantic web The semantic web
The semantic web
 
Semantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational DatabasesSemantic Graph Databases: The Evolution of Relational Databases
Semantic Graph Databases: The Evolution of Relational Databases
 
Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)Link Sets And Why They Are Important (EDF2012)
Link Sets And Why They Are Important (EDF2012)
 
Semantic web
Semantic webSemantic web
Semantic web
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Resource Oriented Architectures: The Future of Data API?
Resource Oriented Architectures: The Future of Data API?Resource Oriented Architectures: The Future of Data API?
Resource Oriented Architectures: The Future of Data API?
 
Big Linked Data - Creating Training Curricula
Big Linked Data - Creating Training CurriculaBig Linked Data - Creating Training Curricula
Big Linked Data - Creating Training Curricula
 
Semantic web
Semantic webSemantic web
Semantic web
 
An Introduction to Semantic Web Technology
An Introduction to Semantic Web TechnologyAn Introduction to Semantic Web Technology
An Introduction to Semantic Web Technology
 

Destacado

Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...AIST
 
Big data e xposed from big data to smart data
Big data e xposed   from big data to smart dataBig data e xposed   from big data to smart data
Big data e xposed from big data to smart dataMotty Cohen
 
Big Data to SMART Data : Process Scenario
Big Data to SMART Data : Process ScenarioBig Data to SMART Data : Process Scenario
Big Data to SMART Data : Process ScenarioCHAKER ALLAOUI
 
Turning Big Data Into Smart Data: A Yashi Presentation
Turning Big Data Into Smart Data: A Yashi PresentationTurning Big Data Into Smart Data: A Yashi Presentation
Turning Big Data Into Smart Data: A Yashi PresentationYashi
 
From Big Data to Smart Data
From Big Data to Smart DataFrom Big Data to Smart Data
From Big Data to Smart DataMarin Dimitrov
 
From Big to Smart Data - Smart Data Innovation Lab Overview
From Big to Smart Data - Smart Data Innovation Lab OverviewFrom Big to Smart Data - Smart Data Innovation Lab Overview
From Big to Smart Data - Smart Data Innovation Lab OverviewPlamen Kiradjiev
 
Busienss intelligence in banking sector
Busienss intelligence in banking sectorBusienss intelligence in banking sector
Busienss intelligence in banking sectorCSC
 
Business Intelligence In Financial Industry
Business Intelligence In Financial IndustryBusiness Intelligence In Financial Industry
Business Intelligence In Financial IndustryKartik Mehta
 

Destacado (11)

Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
 
Big data e xposed from big data to smart data
Big data e xposed   from big data to smart dataBig data e xposed   from big data to smart data
Big data e xposed from big data to smart data
 
SMART Seminar Series: "From Big Data to Smart data"
SMART Seminar Series: "From Big Data to Smart data"SMART Seminar Series: "From Big Data to Smart data"
SMART Seminar Series: "From Big Data to Smart data"
 
Big Data to SMART Data : Process Scenario
Big Data to SMART Data : Process ScenarioBig Data to SMART Data : Process Scenario
Big Data to SMART Data : Process Scenario
 
From Big Data to Smart Data
From Big Data to Smart DataFrom Big Data to Smart Data
From Big Data to Smart Data
 
Turning Big Data Into Smart Data: A Yashi Presentation
Turning Big Data Into Smart Data: A Yashi PresentationTurning Big Data Into Smart Data: A Yashi Presentation
Turning Big Data Into Smart Data: A Yashi Presentation
 
From Big Data to Smart Data
From Big Data to Smart DataFrom Big Data to Smart Data
From Big Data to Smart Data
 
From Big to Smart Data - Smart Data Innovation Lab Overview
From Big to Smart Data - Smart Data Innovation Lab OverviewFrom Big to Smart Data - Smart Data Innovation Lab Overview
From Big to Smart Data - Smart Data Innovation Lab Overview
 
Busienss intelligence in banking sector
Busienss intelligence in banking sectorBusienss intelligence in banking sector
Busienss intelligence in banking sector
 
Business Intelligence In Financial Industry
Business Intelligence In Financial IndustryBusiness Intelligence In Financial Industry
Business Intelligence In Financial Industry
 
Business Intelligence Banking
Business Intelligence   BankingBusiness Intelligence   Banking
Business Intelligence Banking
 

Similar a Thu 1400 cagle_kurt_color

Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries) Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries) robin fay
 
Web 3 final(1)
Web 3 final(1)Web 3 final(1)
Web 3 final(1)Venky Dood
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchChimezie Ogbuji
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of librariesRegan Harper
 
Linked data MLA 2015
Linked data MLA 2015Linked data MLA 2015
Linked data MLA 2015Cason Snow
 
Linked Data MLA 2015
Linked Data MLA 2015Linked Data MLA 2015
Linked Data MLA 2015Cason Snow
 
Semantic - Based Querying Using Ontology in Relational Database of Library Ma...
Semantic - Based Querying Using Ontology in Relational Database of Library Ma...Semantic - Based Querying Using Ontology in Relational Database of Library Ma...
Semantic - Based Querying Using Ontology in Relational Database of Library Ma...dannyijwest
 
In Memory Database Essay
In Memory Database EssayIn Memory Database Essay
In Memory Database EssayTammy Moncrief
 
RELATIONAL MODEL OF DATABASES AND OTHER CONCEPTS OF DATABASES​
RELATIONAL MODEL OF DATABASES AND OTHER CONCEPTS OF DATABASES​RELATIONAL MODEL OF DATABASES AND OTHER CONCEPTS OF DATABASES​
RELATIONAL MODEL OF DATABASES AND OTHER CONCEPTS OF DATABASES​EdwinJacob5
 
RDFa Semantic Web
RDFa Semantic WebRDFa Semantic Web
RDFa Semantic WebRob Paok
 
A review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebA review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebSimon Price
 
Lee Iverson - How does the web connect content?
Lee Iverson - How does the web connect content?Lee Iverson - How does the web connect content?
Lee Iverson - How does the web connect content?Museums Computer Group
 
Selecting the right database type for your knowledge management needs.
Selecting the right database type for your knowledge management needs.Selecting the right database type for your knowledge management needs.
Selecting the right database type for your knowledge management needs.Synaptica, LLC
 

Similar a Thu 1400 cagle_kurt_color (20)

Linked data 20171106
Linked data 20171106Linked data 20171106
Linked data 20171106
 
unit 1.pptx
unit 1.pptxunit 1.pptx
unit 1.pptx
 
Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries) Intro to the semantic web (for libraries)
Intro to the semantic web (for libraries)
 
51095137-Semantic-WEB.ppt
51095137-Semantic-WEB.ppt51095137-Semantic-WEB.ppt
51095137-Semantic-WEB.ppt
 
Web 3 final(1)
Web 3 final(1)Web 3 final(1)
Web 3 final(1)
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes research
 
Linked data and the future of libraries
Linked data and the future of librariesLinked data and the future of libraries
Linked data and the future of libraries
 
Linked data MLA 2015
Linked data MLA 2015Linked data MLA 2015
Linked data MLA 2015
 
Linked Data MLA 2015
Linked Data MLA 2015Linked Data MLA 2015
Linked Data MLA 2015
 
Datastores
DatastoresDatastores
Datastores
 
Semantic - Based Querying Using Ontology in Relational Database of Library Ma...
Semantic - Based Querying Using Ontology in Relational Database of Library Ma...Semantic - Based Querying Using Ontology in Relational Database of Library Ma...
Semantic - Based Querying Using Ontology in Relational Database of Library Ma...
 
UNIT-2.pptx
UNIT-2.pptxUNIT-2.pptx
UNIT-2.pptx
 
Presentation1
Presentation1Presentation1
Presentation1
 
In Memory Database Essay
In Memory Database EssayIn Memory Database Essay
In Memory Database Essay
 
RELATIONAL MODEL OF DATABASES AND OTHER CONCEPTS OF DATABASES​
RELATIONAL MODEL OF DATABASES AND OTHER CONCEPTS OF DATABASES​RELATIONAL MODEL OF DATABASES AND OTHER CONCEPTS OF DATABASES​
RELATIONAL MODEL OF DATABASES AND OTHER CONCEPTS OF DATABASES​
 
RDFa Semantic Web
RDFa Semantic WebRDFa Semantic Web
RDFa Semantic Web
 
A review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebA review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic Web
 
Lee Iverson - How does the web connect content?
Lee Iverson - How does the web connect content?Lee Iverson - How does the web connect content?
Lee Iverson - How does the web connect content?
 
Semantic Web and Linked Open Data
Semantic Web and Linked Open DataSemantic Web and Linked Open Data
Semantic Web and Linked Open Data
 
Selecting the right database type for your knowledge management needs.
Selecting the right database type for your knowledge management needs.Selecting the right database type for your knowledge management needs.
Selecting the right database type for your knowledge management needs.
 

Más de DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 

Más de DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 

Último

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 

Último (20)

The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 

Thu 1400 cagle_kurt_color

  • 1. Your Trusted Web Presence Partner BIG DATA Semantic Data © 2011 Avalon Consulting, LLC 1
  • 2. Your Speaker • Kurt Cagle is an Information Architect for Avalon Consulting. • Author of 18 books on XML, Web Development and the Semantic Web • Managing Editor of XMLToday.org • Email: caglek@avalonconsult.com © 2011 Avalon Consulting, LLC
  • 3. Perspectives • In 1945, the cost to acquire a byte of data was high ~ $1/kB in 2011 USD. • By 1960, that same dollar could get 1000 times as much data. This is another expression of Moore's Law. • At 15 years per 1000x increase, the cost to acquire a kByte in 2011 is ~ $0.000000001/kB. • By 2050, this will be $0.00000000000000000000001/kB, or 10,000,000,000,000,000,000,000 kB/$. © 2011 Avalon Consulting, LLC
  • 4. Record Data • The fundamental unit of data is the record. • A record has an identity, a unique code (within its context) that differentiates it from all other records. • A record has zero or more properties that describe specific characteristics of the record. • Some of those properties may be pointers to other records that have a given identity. • The combination of properties and identity also share a semantic cohesiveness. © 2011 Avalon Consulting, LLC
  • 5. From Record to Resource • A resource is an abstract entity that is both unique and addressable. • A representation of that resource is a (potentially structured) bag of properties that describe characteristics of that resource. • If that resource is part of a collection, then the representation of that resource is a record in that collection. • Note that the record is NOT the resource – it is only a description of the state of the resource at the moment that it is queried. © 2011 Avalon Consulting, LLC
  • 6. Addressibility • A resource is addressible if, for a given collection, there is a key that can be used to retrieve a representation of the resource from the collection. • A collection in turn, can be thought of as the context or namespace of all addresses within that collection. • Conversely, if there is no collection for which a key exists to retrieve a resource then it is not a resource. © 2011 Avalon Consulting, LLC
  • 7. Resources and Time • Most resources are state machines – they change their (internal) state over time. • Depending upon the requested representation format, a record may be – the most recent representation of a resource, – the delta of changes from the last request, – or may be a log of all changes. • Through 2000 or so, most resources changed slowly. That is no longer necessarily true. • This means that all resources are services. © 2011 Avalon Consulting, LLC
  • 8. BIG DATA and REST • REST – Representational State Transfer – is becoming the primary services representation. • The period from 2005 to 2020 will be concerned with making previously non-addressable data RESTful. • REST imputes CRUD (Create, Read, Update, Delete) semantics to addressable networks. • REST also implies that collections are resources. © 2011 Avalon Consulting, LLC
  • 9. Representations • A collection processor (aka, a server) is an abstraction layer between internal and external data representations. • Any time a request is made, there is almost always a transformation that maps between an internal entity and the requested output. • Representations do not have to be (and most times are not) in the same format as the underlying resource – a resource internally shown as XML can be rendered as HTML, JSON, zipped files, graphics, PDF, etc. © 2011 Avalon Consulting, LLC
  • 10. Collections • A collection establishes a constrained set of resources. • A collection can also be thought of as a category, with each resource key as a term in the category's taxonomy. • A resource may belong to more than one collection. • Collections determine available representations for resources. © 2011 Avalon Consulting, LLC
  • 11. Collections and Search • A search is the invocation of a parametric function on a collection that returns a set of resource representations (typically with links to more extensive representations). • It is possible for a collection to have multiple search functions bound to that collection – in that regard a search function may itself be a resource in a different collection. • Search is the bridge between addressable RESTful retrieval and imperative web services. © 2011 Avalon Consulting, LLC
  • 12. Resources and Data Models • A resource is an abstraction of a physical entity or process, and as such it is, itself, conformant to a data model. • A representation of a resource is a transformation of that resource within the context of its collection. • This means that relationships that are only inferred within the internal model may be made explicit within the representation. © 2011 Avalon Consulting, LLC
  • 13. BIG DATA Stage 1: Hadoop • Hadoop can take context poor or legacy structured data and create from it contextual richer data records, which can in turn inform the development of resources. • While Hadoop can be used for performing queries, these will be high latency searches compared to most other systems. • Hadoop's real value comes from its ability to process data into more structurally queryable or manipulatible forms. © 2011 Avalon Consulting, LLC
  • 14. BIG DATA Stage 2: SQL • SQL relational databases work best with single dimensional views based upon primary/foreign key relationships, ideal for many data models that have relatively rigid structure but somewhat richer semantics. • SQL is still ideal for many forms of real world data processing, but generally has both non-standardized streaming mechanisms and constrained procedure semantics. © 2011 Avalon Consulting, LLC
  • 15. BIG DATA Stage 3: Hash Databases • MongoDB and similar tools extend hash tables to representing more complex data models, as the combination of hashes and sequences can readily represent objects that have the same core namespace context. • These are becoming popular as producers and consumers of JSON, and typically employ a JavaScript stack for most transactions. © 2011 Avalon Consulting, LLC
  • 16. BIG DATA Stage 4: XML Databases • XML Databases store XML representations of objects, which best encode narrative structures, hierarchical entities that cross namespace boundaries, and provide very sophisticated tools for building RESTful web application services. • XML Databases are optimal for documents and hybrid document/data structures. • XML Databases are standardizing upon XQuery as the common stack language. © 2011 Avalon Consulting, LLC
  • 17. BIG DATA Stage 5: RDF Triple Stores • Triple stores encode relationships between resources via RDF n-tuples for query by SPARQL. Triple stores work best for working with distributed data where relationships between resources are as or more important than the actual contents of the resources themselves. © 2011 Avalon Consulting, LLC
  • 18. Complementary Technologies • There is a tendency to see NoSQL technologies as competitive. This is both wrong and dangerous. • The technologies have evolved to handle different phases of data acquisition, processing and search, going from data with poor semantics and rigid structure to data with rich semantics and flexible structure. © 2011 Avalon Consulting, LLC
  • 19. Generation 5 Data Stores • The next wave of databases will likely incorporate two or more generation 4 technologies – such as combining an XML or JSON database with an RDF triple store to be able to handle inferences about stored content, or by adding Hadoop processing support to a SQL database capable of providing RESTful representations in XML or JSON of internal views built from SQL tables. © 2011 Avalon Consulting, LLC
  • 20. Query Unification • The central challenge of the next twenty years will be in finding the commonalities across these various platforms in order to build a twenty first century SQL. • This will require a deeper understanding of the characteristics of data algebras, which provides the underpinnings for the most optimal expression of data structures. © 2011 Avalon Consulting, LLC
  • 21. XQuery? • XQuery may be a good candidate to handle this unification, for several reasons: – Solid extensibility model – SQL-like syntax, but capable of handling RESTful services inbound and outbound – XQuery 3.0 supports maps, which are hash/sequence constructs ideal for encoding JSON or MongoDB structures. – Works well in a distributed context. – Can integrate Sparql or SQL scripts readily. – Standards based, and written by SQL author © 2011 Avalon Consulting, LLC
  • 22. Contact Us Avalon Consulting, LLC Dallas Office - HQ Washington, DC Office 5600 Tennyson Parkway 527 Maple Avenue East Suite 230 Suite 200 Plano, TX 75024 Vienna, VA 22180 469-424-3449 703-635-3302 www.avalonconsult.com info@avalonconsult.com © 2011 Avalon Consulting, LLC 22