SlideShare una empresa de Scribd logo
1 de 29
1
More Meaning. Better Results.
1
Building the Inform Semantic Publishing Ecosystem:
from Author to Audience
Marc Hadfield
VP, Research & Development
marc@inform.com
2
Marc Hadfield
• Semantic Technology, Computer Science
• Inform Technologies (Head of R&D)
‣ Semantic Technologies applied to Content Analysis & Distribution
• Alitora Systems (Co-Founder / CTO)
‣ Life Science Semantic Technology, Research, Big Data Analytics, Semantic HPC
‣ Life Science Natural Language Processing
• Columbia Genome Center
‣ NLP applied to Life Science Research Articles
• LCconnect (CTO)
‣ Letter-of-Credit Exchange
2
3
Semantics in Publishing…
3
• Ongoing Theme at ISWC 2010…
‣ NY Times
‣ Facebook (OpenGraph)
‣ Elsevier
‣ BBC
4
What is Inform?
4
• Inform is a content enrichment solution designed to increase consumer
engagement, page views and revenue.
• We provide a hosted Semantic Web Service for content publishers that:
1. Reads your article before you publish it
2. Turns main topics and entities (people, places, companies, organizations) into links
3. Provides feeds of related web content when you publish it
• New Direction: Optimizing Content Distribution via Direct Channels
• Web users moving away from destination web sites, but still want the destination web
site content.
• Companies utilizing Inform include:
Connecting your content
55
Audio, Video & Blogs
from the Web
Articles from
the Web
Content from Inform
Your Affiliates’ Content
Your Content
Affiliated
Content
Your
Content
Licensed
Content
Google Street View Topic 0.90
Google Company 1.00
Ireland Place 0.70
Norway Place 0.70
South Africa Place 0.70
Sweden Place 0.70
Brian McClendon Person 0.80
Mountain View, California Place 0.60
Wi-Fi Topic 0.50
6
Related Content Widgets
6
7
Inform Topic Pages, Micro Sites
7
8
My Job: Building the Semantic Platform…
8
• “Silo”-ed Semantic Technology  Semantic Web
‣ Aligned with Wikipedia, Leverage Linked Data for Mash-Ups
‣ RDFa, SKOS, Semantic SEO
• Semantic / NLP Engine
‣ Improve Features, Quality
• Semantic Data Infrastructure
‣ Scalable Infrastructure
• Semantic Data Analysis
‣ Algorithms (Topology of Graphs), Inference
‣ “PageRank” on semantic data
• Personalization, Usage Analysis
• Micro Sites
‣ Clusters of Topics, Generating Rich Content Experience
• Distributing to Social Platforms
‣ i.e. Facebook
9
Inform: Author to Audience
9
10
Leverage Inform Taxonomy
10
1111
Author 
‣ Content Creation Services
‣ Semantic Data Repository
‣ Semantic Data Analysis
‣ Content Selection Algorithms
‣ Webservices
‣ Content Distribution Services
 Audience
Inside the
Semantic
System
Architecture
12
Content Creation
12
• Article Creation Tool (ACT)
‣ Author Tools
‣ Embed in CMS, Tumblr / Wordpress Plugin
• Publisher Portal
‣ Editorial Tool
‣ Content Feeds
• Web Crawl
• Summarizer
‣ Create smart “blurbs” to advertise article
• LinkedData
‣ Freebase, Wikipedia, DBPedia, et cetera.
13
ACT Tool
13
14
ACT Tool
14
15
ACT Tool, Tumblr, Wordpress
15
16
Publisher Portal
16
17
Summarizer
17
18
Semantic Data Repository
18
• Data Master / Data Node
‣ Federated Semantic Data Managers
‣ SPARQL Triplestore (scalable cluster)
‣ Semantic Search
‣ Search Indexes (Semi-Structured and Full-Text Search)
‣ Lucene/Siren (Sindice)
‣ Facets, Frequency Counts
‣ Cache (In-Memory)
‣ Blob Store (Voldemort)
‣ Listener to Activity (Flume)
‣ User Activity (clicks)
‣ Content Activity (content updates)
‣ Near Real-Time Trends, Analysis
‣ Compute Algorithms (Stored Procedures in Groovy)
‣ Long Term Content Archive (offline)
19
Semantic Data Analysis
19
• Natural Language Processing
‣ Rules & Machine Learning, Training
‣ 500K articles per day, 4,000 unique sites
‣ Text Extraction, Section/Sentence Extraction
‣ Tokenization, Part-of-Speech, Noun/Verb Phrases
‣ Entity Extraction, Entity Normalization
‣ Topic Extraction, Summarization, Clustering
• User Activity
‣ User Model (Personalization)
• Semantic Inference
‣ F-Logic, Multi-Domain
‣ Linked Data Mash-Ups
• Semantic Graph Topology
‣ Entity / Property Importance Metrics, Ranking, “PageRank”
‣ Which triples in LinkedData are interesting?
20
Content Selection Algorithms
20
• Model of User, Personalization
‣ Social Networks provide Context
• Semantic Analysis of Content
• Algorithms
‣ Maximize Relevancy / Relatedness (Meets Editorial Criteria)
‣ Maximize Click-Through
‣ Cute Kitten vs. Engagement Issue
‣ Maximize Monetization
Goal: Content Exchange
21
Webservices
21
• REST
‣ Outputs RDF / JSON Data
• Natural Language Processing
‣ Article to Semantic MetaData
• Related Content
‣ Inputs: Content, Personalization, Algorithm
‣ Articles
‣ Semantic Mash-Ups
‣ Topics
‣ Entities
• Semantic Query, Site Search
• Storage, Content Repository
22
Content Distribution Services
22
• Customer Destinations (Traditional Business)
‣ Deep Integration
• Publisher Widgets
‣ Levels of Lightweight Integration
‣ Example: Related-Content-Widget in JavaScript
• Inform.com
‣ Topic Pages
• Micro Sites
‣ Several Thousand Owned-and-Operated Domains/Sites, Topic Driven
• Social Networks
‣ Facebook
Tools:
• Semantic SEO
‣ RDFa, SKOS
23
Semantic MetaData, RDFa
23
http://inspector.sindice.com
24
Facebook App
24
25
Using Facebook OpenGraph
25
Relevancy Algorithm:
Combine:
•Trending / Popular Topics
•Trending / Popular Articles
•Personalization “Liked” Topics
•Personalization “Liked” Articles
•User Profiles (“Users like you…”)
26
Facebook “Liked” Topics
26
27
Facebook Article Stream
27
28
Inform: Author to Audience via Semantics
28
29
Thanks for your attention!
29
Questions?
Contact Information:
Marc Hadfield
marc@inform.com

Más contenido relacionado

La actualidad más candente

Structured data: Where did that come from & why are Google asking for it
Structured data: Where did that come from & why are Google asking for itStructured data: Where did that come from & why are Google asking for it
Structured data: Where did that come from & why are Google asking for itRichard Wallis
 
Fried data summit data quality data analytics together
Fried data summit data quality data analytics togetherFried data summit data quality data analytics together
Fried data summit data quality data analytics togetherJeff Fried
 
A Real-World Implementation of Linked Data
A Real-World Implementation of Linked DataA Real-World Implementation of Linked Data
A Real-World Implementation of Linked DataDimitri van Hees
 
How to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointHow to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointJoris Poelmans
 
How LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data VisualizationHow LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data VisualizationChi-Yi Kuan
 
Focused Crawling for Structured Data
Focused Crawling for Structured DataFocused Crawling for Structured Data
Focused Crawling for Structured DataRobert Meusel
 
Schema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowSchema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowRichard Wallis
 
The SAS Search Journey: Using AI to Move from Google to Lucidworks - Alex Fl...
The SAS Search Journey:  Using AI to Move from Google to Lucidworks - Alex Fl...The SAS Search Journey:  Using AI to Move from Google to Lucidworks - Alex Fl...
The SAS Search Journey: Using AI to Move from Google to Lucidworks - Alex Fl...Lucidworks
 
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
Neo4j   graphs in the real world - graph days d.c. - april 14, 2015Neo4j   graphs in the real world - graph days d.c. - april 14, 2015
Neo4j graphs in the real world - graph days d.c. - april 14, 2015Neo4j
 
DWCNZ - Content Types: Love Them or Lose It
DWCNZ - Content Types: Love Them or Lose ItDWCNZ - Content Types: Love Them or Lose It
DWCNZ - Content Types: Love Them or Lose ItMarc D Anderson
 
Real-time big data analytics based on product recommendations case study
Real-time big data analytics based on product recommendations case studyReal-time big data analytics based on product recommendations case study
Real-time big data analytics based on product recommendations case studydeep.bi
 
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data SourcesKESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data SourcesLinked Enterprise Date Services
 
Understanding voice of the member via text mining
Understanding voice of the member via text miningUnderstanding voice of the member via text mining
Understanding voice of the member via text miningChi-Yi Kuan
 
Instant Security and User Management in Spring Boot
Instant Security and User Management in Spring BootInstant Security and User Management in Spring Boot
Instant Security and User Management in Spring BootRemy Champion
 
S4: The Self-Service Semantic Suite
S4: The Self-Service Semantic SuiteS4: The Self-Service Semantic Suite
S4: The Self-Service Semantic SuiteMarin Dimitrov
 
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...Yongzheng (Tiger) Zhang
 
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...Linked Enterprise Date Services
 
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLCouchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLDATAVERSITY
 
O365Con18 - Invest in Search - Matthew McDermott
O365Con18 - Invest in Search - Matthew McDermottO365Con18 - Invest in Search - Matthew McDermott
O365Con18 - Invest in Search - Matthew McDermottNCCOMMS
 
Semantically Enabled Personal Information Management with Cluug.com
Semantically Enabled Personal Information Management with Cluug.comSemantically Enabled Personal Information Management with Cluug.com
Semantically Enabled Personal Information Management with Cluug.comBernhard Schandl
 

La actualidad más candente (20)

Structured data: Where did that come from & why are Google asking for it
Structured data: Where did that come from & why are Google asking for itStructured data: Where did that come from & why are Google asking for it
Structured data: Where did that come from & why are Google asking for it
 
Fried data summit data quality data analytics together
Fried data summit data quality data analytics togetherFried data summit data quality data analytics together
Fried data summit data quality data analytics together
 
A Real-World Implementation of Linked Data
A Real-World Implementation of Linked DataA Real-World Implementation of Linked Data
A Real-World Implementation of Linked Data
 
How to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePointHow to build your own Delve: combining machine learning, big data and SharePoint
How to build your own Delve: combining machine learning, big data and SharePoint
 
How LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data VisualizationHow LinkedIn Democratizes Big Data Visualization
How LinkedIn Democratizes Big Data Visualization
 
Focused Crawling for Structured Data
Focused Crawling for Structured DataFocused Crawling for Structured Data
Focused Crawling for Structured Data
 
Schema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & HowSchema.org Structured data the What, Why, & How
Schema.org Structured data the What, Why, & How
 
The SAS Search Journey: Using AI to Move from Google to Lucidworks - Alex Fl...
The SAS Search Journey:  Using AI to Move from Google to Lucidworks - Alex Fl...The SAS Search Journey:  Using AI to Move from Google to Lucidworks - Alex Fl...
The SAS Search Journey: Using AI to Move from Google to Lucidworks - Alex Fl...
 
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
Neo4j   graphs in the real world - graph days d.c. - april 14, 2015Neo4j   graphs in the real world - graph days d.c. - april 14, 2015
Neo4j graphs in the real world - graph days d.c. - april 14, 2015
 
DWCNZ - Content Types: Love Them or Lose It
DWCNZ - Content Types: Love Them or Lose ItDWCNZ - Content Types: Love Them or Lose It
DWCNZ - Content Types: Love Them or Lose It
 
Real-time big data analytics based on product recommendations case study
Real-time big data analytics based on product recommendations case studyReal-time big data analytics based on product recommendations case study
Real-time big data analytics based on product recommendations case study
 
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data SourcesKESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
KESeDa: Knowledge Extraction from Heterogeneous Semi-Structured Data Sources
 
Understanding voice of the member via text mining
Understanding voice of the member via text miningUnderstanding voice of the member via text mining
Understanding voice of the member via text mining
 
Instant Security and User Management in Spring Boot
Instant Security and User Management in Spring BootInstant Security and User Management in Spring Boot
Instant Security and User Management in Spring Boot
 
S4: The Self-Service Semantic Suite
S4: The Self-Service Semantic SuiteS4: The Self-Service Semantic Suite
S4: The Self-Service Semantic Suite
 
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
Understanding Voice of Members via Text Mining – How Linkedin Built a Text An...
 
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
Semantically integrated Enterprise Data Lakes and Co-Evolution of Public / Pr...
 
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQLCouchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
Couchbase and Apache Kafka - Bridging the gap between RDBMS and NoSQL
 
O365Con18 - Invest in Search - Matthew McDermott
O365Con18 - Invest in Search - Matthew McDermottO365Con18 - Invest in Search - Matthew McDermott
O365Con18 - Invest in Search - Matthew McDermott
 
Semantically Enabled Personal Information Management with Cluug.com
Semantically Enabled Personal Information Management with Cluug.comSemantically Enabled Personal Information Management with Cluug.com
Semantically Enabled Personal Information Management with Cluug.com
 

Similar a Building the Inform Semantic Publishing Ecosystem: from Author to Audience

2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v12017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1Don Miller
 
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case StudyOracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case StudyMark Rittman
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us? Andrea Volpini
 
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...Mark Rittman
 
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 ConferenceStructuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 ConferenceNick Inglis
 
Climbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations WebinarClimbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations WebinarConcept Searching, Inc
 
Big problems Big data, simple AWS solution
Big problems Big data, simple AWS solutionBig problems Big data, simple AWS solution
Big problems Big data, simple AWS solutionJean-Claude Sotto
 
MLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a ServiceMLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a ServiceKarl Seiler
 
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...Dr. Haxel Consult
 
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Open Analytics
 
Open Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenOpen Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenChristopher Whitaker
 
Social Media Data Collection & Analysis
Social Media Data Collection & AnalysisSocial Media Data Collection & Analysis
Social Media Data Collection & AnalysisScott Sanders
 
Big problems Big Data, simple solutions
Big problems Big Data, simple solutionsBig problems Big Data, simple solutions
Big problems Big Data, simple solutionsClaudio Pontili
 
The Next Web of Linked Data
The Next Web of Linked DataThe Next Web of Linked Data
The Next Web of Linked DataJay Myers
 
How to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationHow to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationPerficient, Inc.
 
Rapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopRapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopPeter Skomoroch
 

Similar a Building the Inform Semantic Publishing Ecosystem: from Author to Audience (20)

Semantics and Machine Learning
Semantics and Machine LearningSemantics and Machine Learning
Semantics and Machine Learning
 
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v12017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
 
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case StudyOracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
Oracle Big Data Spatial & Graph 
Social Media Analysis - Case Study
 
Webinar: The Slippery Slope of Migrating to SharePoint Online or On-Premise
Webinar: The Slippery Slope of Migrating to SharePoint Online or On-PremiseWebinar: The Slippery Slope of Migrating to SharePoint Online or On-Premise
Webinar: The Slippery Slope of Migrating to SharePoint Online or On-Premise
 
What do we want computers to do for us?
What do we want computers to do for us? What do we want computers to do for us?
What do we want computers to do for us?
 
Webinar: Slippery Slope of SharePoint Migrations
Webinar: Slippery Slope of SharePoint Migrations Webinar: Slippery Slope of SharePoint Migrations
Webinar: Slippery Slope of SharePoint Migrations
 
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
 
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 ConferenceStructuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
Structuring Serendipitous Collaboration - Nick Inglis at Collab365 Conference
 
Climbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations WebinarClimbing the Slippery Slope of SharePoint Migrations Webinar
Climbing the Slippery Slope of SharePoint Migrations Webinar
 
Big problems Big data, simple AWS solution
Big problems Big data, simple AWS solutionBig problems Big data, simple AWS solution
Big problems Big data, simple AWS solution
 
MLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a ServiceMLaaS - Machine Learning as a Service
MLaaS - Machine Learning as a Service
 
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
II-SDV 2017: Approaches of Web Information Analysis in a Day to Day Work Envi...
 
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
 
Open Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe OlsenOpen Data Summit Presentation by Joe Olsen
Open Data Summit Presentation by Joe Olsen
 
Social Media Data Collection & Analysis
Social Media Data Collection & AnalysisSocial Media Data Collection & Analysis
Social Media Data Collection & Analysis
 
Big problems Big Data, simple solutions
Big problems Big Data, simple solutionsBig problems Big Data, simple solutions
Big problems Big Data, simple solutions
 
The Next Web of Linked Data
The Next Web of Linked DataThe Next Web of Linked Data
The Next Web of Linked Data
 
Semantic Web For Dummies
Semantic Web For DummiesSemantic Web For Dummies
Semantic Web For Dummies
 
How to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data VisualizationHow to Empower Your Business Users with Oracle Data Visualization
How to Empower Your Business Users with Oracle Data Visualization
 
Rapid Data Exploration With Hadoop
Rapid Data Exploration With HadoopRapid Data Exploration With Hadoop
Rapid Data Exploration With Hadoop
 

Más de Vital.AI

Optimizing the
 Data Supply Chain
 for Data Science
Optimizing the
 Data Supply Chain
 for Data ScienceOptimizing the
 Data Supply Chain
 for Data Science
Optimizing the
 Data Supply Chain
 for Data ScienceVital.AI
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital.AI
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data ModelingVital.AI
 
Vital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI
 
Natural Language Processing & Semantic Models in an Imperfect World
Natural Language Processing & Semantic Modelsin an Imperfect WorldNatural Language Processing & Semantic Modelsin an Imperfect World
Natural Language Processing & Semantic Models in an Imperfect WorldVital.AI
 
Inform: Targeting the Interest Graph
Inform: Targeting the Interest GraphInform: Targeting the Interest Graph
Inform: Targeting the Interest GraphVital.AI
 

Más de Vital.AI (6)

Optimizing the
 Data Supply Chain
 for Data Science
Optimizing the
 Data Supply Chain
 for Data ScienceOptimizing the
 Data Supply Chain
 for Data Science
Optimizing the
 Data Supply Chain
 for Data Science
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
 
Vital AI: Big Data Modeling
Vital AI: Big Data ModelingVital AI: Big Data Modeling
Vital AI: Big Data Modeling
 
Vital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent AppsVital.AI Creating Intelligent Apps
Vital.AI Creating Intelligent Apps
 
Natural Language Processing & Semantic Models in an Imperfect World
Natural Language Processing & Semantic Modelsin an Imperfect WorldNatural Language Processing & Semantic Modelsin an Imperfect World
Natural Language Processing & Semantic Models in an Imperfect World
 
Inform: Targeting the Interest Graph
Inform: Targeting the Interest GraphInform: Targeting the Interest Graph
Inform: Targeting the Interest Graph
 

Último

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 

Último (20)

Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 

Building the Inform Semantic Publishing Ecosystem: from Author to Audience

  • 1. 1 More Meaning. Better Results. 1 Building the Inform Semantic Publishing Ecosystem: from Author to Audience Marc Hadfield VP, Research & Development marc@inform.com
  • 2. 2 Marc Hadfield • Semantic Technology, Computer Science • Inform Technologies (Head of R&D) ‣ Semantic Technologies applied to Content Analysis & Distribution • Alitora Systems (Co-Founder / CTO) ‣ Life Science Semantic Technology, Research, Big Data Analytics, Semantic HPC ‣ Life Science Natural Language Processing • Columbia Genome Center ‣ NLP applied to Life Science Research Articles • LCconnect (CTO) ‣ Letter-of-Credit Exchange 2
  • 3. 3 Semantics in Publishing… 3 • Ongoing Theme at ISWC 2010… ‣ NY Times ‣ Facebook (OpenGraph) ‣ Elsevier ‣ BBC
  • 4. 4 What is Inform? 4 • Inform is a content enrichment solution designed to increase consumer engagement, page views and revenue. • We provide a hosted Semantic Web Service for content publishers that: 1. Reads your article before you publish it 2. Turns main topics and entities (people, places, companies, organizations) into links 3. Provides feeds of related web content when you publish it • New Direction: Optimizing Content Distribution via Direct Channels • Web users moving away from destination web sites, but still want the destination web site content. • Companies utilizing Inform include:
  • 5. Connecting your content 55 Audio, Video & Blogs from the Web Articles from the Web Content from Inform Your Affiliates’ Content Your Content Affiliated Content Your Content Licensed Content Google Street View Topic 0.90 Google Company 1.00 Ireland Place 0.70 Norway Place 0.70 South Africa Place 0.70 Sweden Place 0.70 Brian McClendon Person 0.80 Mountain View, California Place 0.60 Wi-Fi Topic 0.50
  • 7. 7 Inform Topic Pages, Micro Sites 7
  • 8. 8 My Job: Building the Semantic Platform… 8 • “Silo”-ed Semantic Technology  Semantic Web ‣ Aligned with Wikipedia, Leverage Linked Data for Mash-Ups ‣ RDFa, SKOS, Semantic SEO • Semantic / NLP Engine ‣ Improve Features, Quality • Semantic Data Infrastructure ‣ Scalable Infrastructure • Semantic Data Analysis ‣ Algorithms (Topology of Graphs), Inference ‣ “PageRank” on semantic data • Personalization, Usage Analysis • Micro Sites ‣ Clusters of Topics, Generating Rich Content Experience • Distributing to Social Platforms ‣ i.e. Facebook
  • 9. 9 Inform: Author to Audience 9
  • 11. 1111 Author  ‣ Content Creation Services ‣ Semantic Data Repository ‣ Semantic Data Analysis ‣ Content Selection Algorithms ‣ Webservices ‣ Content Distribution Services  Audience Inside the Semantic System Architecture
  • 12. 12 Content Creation 12 • Article Creation Tool (ACT) ‣ Author Tools ‣ Embed in CMS, Tumblr / Wordpress Plugin • Publisher Portal ‣ Editorial Tool ‣ Content Feeds • Web Crawl • Summarizer ‣ Create smart “blurbs” to advertise article • LinkedData ‣ Freebase, Wikipedia, DBPedia, et cetera.
  • 15. 15 ACT Tool, Tumblr, Wordpress 15
  • 18. 18 Semantic Data Repository 18 • Data Master / Data Node ‣ Federated Semantic Data Managers ‣ SPARQL Triplestore (scalable cluster) ‣ Semantic Search ‣ Search Indexes (Semi-Structured and Full-Text Search) ‣ Lucene/Siren (Sindice) ‣ Facets, Frequency Counts ‣ Cache (In-Memory) ‣ Blob Store (Voldemort) ‣ Listener to Activity (Flume) ‣ User Activity (clicks) ‣ Content Activity (content updates) ‣ Near Real-Time Trends, Analysis ‣ Compute Algorithms (Stored Procedures in Groovy) ‣ Long Term Content Archive (offline)
  • 19. 19 Semantic Data Analysis 19 • Natural Language Processing ‣ Rules & Machine Learning, Training ‣ 500K articles per day, 4,000 unique sites ‣ Text Extraction, Section/Sentence Extraction ‣ Tokenization, Part-of-Speech, Noun/Verb Phrases ‣ Entity Extraction, Entity Normalization ‣ Topic Extraction, Summarization, Clustering • User Activity ‣ User Model (Personalization) • Semantic Inference ‣ F-Logic, Multi-Domain ‣ Linked Data Mash-Ups • Semantic Graph Topology ‣ Entity / Property Importance Metrics, Ranking, “PageRank” ‣ Which triples in LinkedData are interesting?
  • 20. 20 Content Selection Algorithms 20 • Model of User, Personalization ‣ Social Networks provide Context • Semantic Analysis of Content • Algorithms ‣ Maximize Relevancy / Relatedness (Meets Editorial Criteria) ‣ Maximize Click-Through ‣ Cute Kitten vs. Engagement Issue ‣ Maximize Monetization Goal: Content Exchange
  • 21. 21 Webservices 21 • REST ‣ Outputs RDF / JSON Data • Natural Language Processing ‣ Article to Semantic MetaData • Related Content ‣ Inputs: Content, Personalization, Algorithm ‣ Articles ‣ Semantic Mash-Ups ‣ Topics ‣ Entities • Semantic Query, Site Search • Storage, Content Repository
  • 22. 22 Content Distribution Services 22 • Customer Destinations (Traditional Business) ‣ Deep Integration • Publisher Widgets ‣ Levels of Lightweight Integration ‣ Example: Related-Content-Widget in JavaScript • Inform.com ‣ Topic Pages • Micro Sites ‣ Several Thousand Owned-and-Operated Domains/Sites, Topic Driven • Social Networks ‣ Facebook Tools: • Semantic SEO ‣ RDFa, SKOS
  • 25. 25 Using Facebook OpenGraph 25 Relevancy Algorithm: Combine: •Trending / Popular Topics •Trending / Popular Articles •Personalization “Liked” Topics •Personalization “Liked” Articles •User Profiles (“Users like you…”)
  • 28. 28 Inform: Author to Audience via Semantics 28
  • 29. 29 Thanks for your attention! 29 Questions? Contact Information: Marc Hadfield marc@inform.com