SlideShare una empresa de Scribd logo
1 de 19
Descargar para leer sin conexión
Calais
Thomson Reuters Calais Initiative
Overview
• Going to discuss five basic topics


  – What is Calais?
  – Why we’re doing it & what our goals are
  – How it works / What’s under the hood?
  – A few examples
  – Where it’s headed
Calais…


• Calais extracts smart metadata from unstructured
  text and links that metadata to the Linked Data
  cloud.
Calais progress to date
• Launched in late January, 2008
• 9,500 developers have joined
  OpenCalais.com
• 1-3 million content ‘transactions’ per day
• Delivered four major update releases
• Free (as in free) for commercial or non-
  commercial use
5
                                                3                    Which provides
                                   Metadata
                                                                     information and
             1                    returned to
                                                                       other Linked
                                    the user
Unstructur                                                             Data pointers
                                   with keys
 ed Text




                                                                 4
                                                       Keys
                                                     provide
                                                    access to
                                                    the Calais
                              2                       Linked
                   Calais
                                                    Data cloud
                                                                                                        6
                  extracts
                  entities,                                                        To a range of open
                                                                                   and partner Linked
                 facts and
                                                                                      data assets,
                   events
                                                                                        including
                                                                                   Thomson Reuters
Quick Demo
 You can find the Calais Viewer demonstration tool here:
 http://viewer.opencalais.com (Note that the Calais Viewer is not the
 Calais service. It is merely a demonstration of how the service works.)
  – Copy and paste the text of a business news article from AP, Dow Jones
    or Reuters.com into the viewer, and press submit. The article is sent to
    the Calais engine which tags the content and returns it, marked-up.
  – The tags appear on the left hand rail, and you can click on the plus (+)
    sign to see the tags expand.
  – Since we are now on Calais 4.0, you can also use the viewer to see the
    Linked Data assets related to the tags Calais returns.
       • Click on a company name on the left hand rail to find a Calais summary page
         featuring a basic description for that company, as well as a number of links.
       • Follow those links to see the other data entries on that company that are
         available for public use in the Linked Data Cloud.
  – For example, here is the Calais summary page for IBM:
    http://d.opencalais.com/er/company/ralg-tr1r/9e3f6c34-aa6b-3a3b-b221-
    a07aa7933633.html
  – And here is the summary page for IBM in DBPedia (the Wikipedia
    translated into computer language): http://dbpedia.org/page/IBM
Why & What


 1. Derive semantic metadata from textual assets
 2. Use that semantic metadata to create entry points into
    the linked data ecosystem
 3. Provide a simple mechanism for the sharing of semantic
    metadata about textual content assets
 4. And just why are you doing this…
1: Semantics from Text: The Text Problem

                      • People consume text
                      • Most of it isn’t semantically enabled
                      • Most of it won’t be semantically
                        enabled
                      • This isn’t about standards –
                        microfromats vs RDFa vs.
                        whatever.


                      • Why: Latency, cost and short shelf-
                        life
1: Semantics from Text: The Text Problem
                                                                         • Target areas
                                                                           where:
Years
                                                                           – The economics
                                                         Great
                                                         Novels
                                                                             don’t support
                                               Scient.
        Shelf Life



                                                                             metadata
                                                Pubs
                                                                             creation
                                      Legacy
                                                                           – The value of
                                       News
                                                                             metadata is
                               New
                               Gen
                                                                             potentially high
                               News
                                                                           – The value of
Seconds               Tweets
                                                                             aggregated
                                                                             metadata is
                                      Latency
                        ds




                                                                             potentially
                     on




                                                                    rs
                                                                             extremely high
                                                                     a
                   c




                                                                  Ye
                Se
2: Getting from Text to the Linked Data Ecosystem
The Linked Data Cloud
3: Semantic Metadata Transport Layer
                        • I’m a content producer.
                          We’ve loaded the car
                          with rich semantic
                          metadata


                           – I’m sharing it within my
                             four walls
                           – How do I transport it to
                             my consumers?
                           – RSS / Atom, XML,
                             Proprietary data feeds,
                             Content API’s
4: Why We’re Doing It


• Two simple answers:


  – Hyper-evolution of capabilities – better, faster, stronger


  – The walled garden content world
How it Works – Under the Hood of Calais
How it Works – Under the Hood of Calais

                                                               Document
                                                                 Level
                                                               Metadata
                                                  Metadata
                               Reference         Management
                              Data Assets
                                                              Entity Level
                                                              Linked Data
                                                                 and …
                 Stat Tools
                              Disambig.
                ClearForest
   Calais Web
                                            RD
                               Engine
                NLP Engine
    Service
                                            F

                Rule   Lexi
                Base   cons
                                                   Output
                                                 Formatting
Where From Here?
• We’ve seen examples of first generation uses.
• Where does this go in the future?
• Beyond the document
  – Social Resume analysis
  – Museum Content Coalitions
  – Knowledge Management Applications
  – Investigative Journalism*
Investigative Journalism



FOIA       Calais Web   Company:Contract
Contract    Service     Company:Affiliation
Document
s

                                              Big Fuzzy Graph

News       Calais Web   Company:Person
            Service     FamilyRelation
What’s in the Pipeline?
• 2009 (this is a fuzzy list)


  – Person disambiguation @ domain level?
  – Other disambiguation
  – Continued expansion of URI’s (entities & events)
  – Calais as hub
  – Exposure of the IDE?
  – User managed lexicons
  – Languages
  – Opt-in SPARQL Endpoint?
• www.opencalais.com

  – Gallery – code and applications examples
  – Forums
  – Documentation


• Twitter @opencalais, Facebook Group

Más contenido relacionado

Similar a Open Calais For SF And LA Meetups

OpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry CouncilOpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry CouncilKrista Thomas
 
Web 3 0 Krista Thomas 1 26 10
Web 3 0 Krista Thomas 1 26 10Web 3 0 Krista Thomas 1 26 10
Web 3 0 Krista Thomas 1 26 10Krista Thomas
 
Cloud Computing through FCAPS Managed Services in a Virtualized Data Center
Cloud Computing through FCAPS Managed Services in a Virtualized Data CenterCloud Computing through FCAPS Managed Services in a Virtualized Data Center
Cloud Computing through FCAPS Managed Services in a Virtualized Data Centervsarathy
 
Computing for Human Experience and Wellness
Computing for Human Experience and WellnessComputing for Human Experience and Wellness
Computing for Human Experience and WellnessAmit Sheth
 
Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10keirdo1
 
Cutting Big Data Down to Size with AMD and Dell
Cutting Big Data Down to Size with AMD and DellCutting Big Data Down to Size with AMD and Dell
Cutting Big Data Down to Size with AMD and DellAMD
 
제1회 Korea Community Day 발표자료 Bigdata
제1회 Korea Community Day 발표자료 Bigdata 제1회 Korea Community Day 발표자료 Bigdata
제1회 Korea Community Day 발표자료 Bigdata Gruter
 
Django and Neo4j - Domain modeling that kicks ass
Django and Neo4j - Domain modeling that kicks assDjango and Neo4j - Domain modeling that kicks ass
Django and Neo4j - Domain modeling that kicks assTobias Lindaaker
 
NOSQLEU - Graph Databases and Neo4j
NOSQLEU - Graph Databases and Neo4jNOSQLEU - Graph Databases and Neo4j
NOSQLEU - Graph Databases and Neo4jTobias Lindaaker
 
Mark logic Corporate Overview
Mark logic Corporate OverviewMark logic Corporate Overview
Mark logic Corporate OverviewTony Agresta
 
Metadata: Increasing Value in Digital Content Competition Flyer
Metadata: Increasing Value in Digital Content Competition FlyerMetadata: Increasing Value in Digital Content Competition Flyer
Metadata: Increasing Value in Digital Content Competition FlyerChinwag
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonJeffrey T. Pollock
 
[Day 3] Building Sustainable Communities
[Day 3] Building Sustainable Communities[Day 3] Building Sustainable Communities
[Day 3] Building Sustainable Communitiescsi2009
 
Soeren okfn greece meetup
Soeren okfn greece meetupSoeren okfn greece meetup
Soeren okfn greece meetupOKFN-GR
 
Massive Data Analytics and the Cloud
Massive Data Analytics and the CloudMassive Data Analytics and the Cloud
Massive Data Analytics and the CloudBooz Allen Hamilton
 
100615 htap network_brussels
100615 htap network_brussels100615 htap network_brussels
100615 htap network_brusselsRudolf Husar
 
NOSQL Overview Lightning Talk (Scalability Geekcruise 2009)
NOSQL Overview Lightning Talk (Scalability Geekcruise 2009)NOSQL Overview Lightning Talk (Scalability Geekcruise 2009)
NOSQL Overview Lightning Talk (Scalability Geekcruise 2009)Emil Eifrem
 
Conférence Open Data par où commencer ? "How to achieve interoperability?" E....
Conférence Open Data par où commencer ? "How to achieve interoperability?" E....Conférence Open Data par où commencer ? "How to achieve interoperability?" E....
Conférence Open Data par où commencer ? "How to achieve interoperability?" E....Aline Custodio
 

Similar a Open Calais For SF And LA Meetups (20)

OpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry CouncilOpenCalais At The San Diego Software Industry Council
OpenCalais At The San Diego Software Industry Council
 
Web 3 0 Krista Thomas 1 26 10
Web 3 0 Krista Thomas 1 26 10Web 3 0 Krista Thomas 1 26 10
Web 3 0 Krista Thomas 1 26 10
 
Cloud Computing through FCAPS Managed Services in a Virtualized Data Center
Cloud Computing through FCAPS Managed Services in a Virtualized Data CenterCloud Computing through FCAPS Managed Services in a Virtualized Data Center
Cloud Computing through FCAPS Managed Services in a Virtualized Data Center
 
Computing for Human Experience and Wellness
Computing for Human Experience and WellnessComputing for Human Experience and Wellness
Computing for Human Experience and Wellness
 
Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10
 
Cutting Big Data Down to Size with AMD and Dell
Cutting Big Data Down to Size with AMD and DellCutting Big Data Down to Size with AMD and Dell
Cutting Big Data Down to Size with AMD and Dell
 
제1회 Korea Community Day 발표자료 Bigdata
제1회 Korea Community Day 발표자료 Bigdata 제1회 Korea Community Day 발표자료 Bigdata
제1회 Korea Community Day 발표자료 Bigdata
 
Django and Neo4j - Domain modeling that kicks ass
Django and Neo4j - Domain modeling that kicks assDjango and Neo4j - Domain modeling that kicks ass
Django and Neo4j - Domain modeling that kicks ass
 
NOSQLEU - Graph Databases and Neo4j
NOSQLEU - Graph Databases and Neo4jNOSQLEU - Graph Databases and Neo4j
NOSQLEU - Graph Databases and Neo4j
 
Mark logic Corporate Overview
Mark logic Corporate OverviewMark logic Corporate Overview
Mark logic Corporate Overview
 
InfoWorld
InfoWorldInfoWorld
InfoWorld
 
Metadata: Increasing Value in Digital Content Competition Flyer
Metadata: Increasing Value in Digital Content Competition FlyerMetadata: Increasing Value in Digital Content Competition Flyer
Metadata: Increasing Value in Digital Content Competition Flyer
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
 
[Day 3] Building Sustainable Communities
[Day 3] Building Sustainable Communities[Day 3] Building Sustainable Communities
[Day 3] Building Sustainable Communities
 
Soeren okfn greece meetup
Soeren okfn greece meetupSoeren okfn greece meetup
Soeren okfn greece meetup
 
Massive Data Analytics and the Cloud
Massive Data Analytics and the CloudMassive Data Analytics and the Cloud
Massive Data Analytics and the Cloud
 
100615 htap network_brussels
100615 htap network_brussels100615 htap network_brussels
100615 htap network_brussels
 
NOSQL Overview Lightning Talk (Scalability Geekcruise 2009)
NOSQL Overview Lightning Talk (Scalability Geekcruise 2009)NOSQL Overview Lightning Talk (Scalability Geekcruise 2009)
NOSQL Overview Lightning Talk (Scalability Geekcruise 2009)
 
Conférence Open Data par où commencer ? "How to achieve interoperability?" E....
Conférence Open Data par où commencer ? "How to achieve interoperability?" E....Conférence Open Data par où commencer ? "How to achieve interoperability?" E....
Conférence Open Data par où commencer ? "How to achieve interoperability?" E....
 
Transformanz
TransformanzTransformanz
Transformanz
 

Más de Krista Thomas

The OpenCalais Workshop at WeMedia 2010
The OpenCalais Workshop at WeMedia 2010The OpenCalais Workshop at WeMedia 2010
The OpenCalais Workshop at WeMedia 2010Krista Thomas
 
Open Calais Workshop at WeMedia 2010
Open Calais Workshop at WeMedia 2010Open Calais Workshop at WeMedia 2010
Open Calais Workshop at WeMedia 2010Krista Thomas
 
Web 3 0 Krista Thomas 1 26 10
Web 3 0 Krista Thomas 1 26 10Web 3 0 Krista Thomas 1 26 10
Web 3 0 Krista Thomas 1 26 10Krista Thomas
 
Simple OpenCalais Whitepaper
Simple OpenCalais WhitepaperSimple OpenCalais Whitepaper
Simple OpenCalais WhitepaperKrista Thomas
 
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09Krista Thomas
 
Open Calais @ Transparent Text
Open Calais @ Transparent TextOpen Calais @ Transparent Text
Open Calais @ Transparent TextKrista Thomas
 
Tague Semtech Keynote 2009
Tague Semtech Keynote 2009Tague Semtech Keynote 2009
Tague Semtech Keynote 2009Krista Thomas
 
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009Krista Thomas
 
Intro To The Calais Web Service @ OpenCalais.com
Intro To The Calais Web Service @ OpenCalais.comIntro To The Calais Web Service @ OpenCalais.com
Intro To The Calais Web Service @ OpenCalais.comKrista Thomas
 
Open Calais Release 4.0
Open Calais Release 4.0Open Calais Release 4.0
Open Calais Release 4.0Krista Thomas
 
Calais @ the SD Forum
Calais @ the SD ForumCalais @ the SD Forum
Calais @ the SD ForumKrista Thomas
 
Calais @ the Palo Alto Semantic Web Meetup
Calais @ the Palo Alto Semantic Web MeetupCalais @ the Palo Alto Semantic Web Meetup
Calais @ the Palo Alto Semantic Web MeetupKrista Thomas
 
Final Calais For ONA
Final Calais For ONAFinal Calais For ONA
Final Calais For ONAKrista Thomas
 

Más de Krista Thomas (14)

Ad.ly Introduction
Ad.ly IntroductionAd.ly Introduction
Ad.ly Introduction
 
The OpenCalais Workshop at WeMedia 2010
The OpenCalais Workshop at WeMedia 2010The OpenCalais Workshop at WeMedia 2010
The OpenCalais Workshop at WeMedia 2010
 
Open Calais Workshop at WeMedia 2010
Open Calais Workshop at WeMedia 2010Open Calais Workshop at WeMedia 2010
Open Calais Workshop at WeMedia 2010
 
Web 3 0 Krista Thomas 1 26 10
Web 3 0 Krista Thomas 1 26 10Web 3 0 Krista Thomas 1 26 10
Web 3 0 Krista Thomas 1 26 10
 
Simple OpenCalais Whitepaper
Simple OpenCalais WhitepaperSimple OpenCalais Whitepaper
Simple OpenCalais Whitepaper
 
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
OpenCalais @ UC Berkeley Media Technology Summit 9/29/09
 
Open Calais @ Transparent Text
Open Calais @ Transparent TextOpen Calais @ Transparent Text
Open Calais @ Transparent Text
 
Tague Semtech Keynote 2009
Tague Semtech Keynote 2009Tague Semtech Keynote 2009
Tague Semtech Keynote 2009
 
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
Phase2 OpenPublish Presentation SF SemWeb Meetup, April 28, 2009
 
Intro To The Calais Web Service @ OpenCalais.com
Intro To The Calais Web Service @ OpenCalais.comIntro To The Calais Web Service @ OpenCalais.com
Intro To The Calais Web Service @ OpenCalais.com
 
Open Calais Release 4.0
Open Calais Release 4.0Open Calais Release 4.0
Open Calais Release 4.0
 
Calais @ the SD Forum
Calais @ the SD ForumCalais @ the SD Forum
Calais @ the SD Forum
 
Calais @ the Palo Alto Semantic Web Meetup
Calais @ the Palo Alto Semantic Web MeetupCalais @ the Palo Alto Semantic Web Meetup
Calais @ the Palo Alto Semantic Web Meetup
 
Final Calais For ONA
Final Calais For ONAFinal Calais For ONA
Final Calais For ONA
 

Último

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 

Último (20)

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

Open Calais For SF And LA Meetups

  • 2. Overview • Going to discuss five basic topics – What is Calais? – Why we’re doing it & what our goals are – How it works / What’s under the hood? – A few examples – Where it’s headed
  • 3. Calais… • Calais extracts smart metadata from unstructured text and links that metadata to the Linked Data cloud.
  • 4. Calais progress to date • Launched in late January, 2008 • 9,500 developers have joined OpenCalais.com • 1-3 million content ‘transactions’ per day • Delivered four major update releases • Free (as in free) for commercial or non- commercial use
  • 5. 5 3 Which provides Metadata information and 1 returned to other Linked the user Unstructur Data pointers with keys ed Text 4 Keys provide access to the Calais 2 Linked Calais Data cloud 6 extracts entities, To a range of open and partner Linked facts and data assets, events including Thomson Reuters
  • 6. Quick Demo You can find the Calais Viewer demonstration tool here: http://viewer.opencalais.com (Note that the Calais Viewer is not the Calais service. It is merely a demonstration of how the service works.) – Copy and paste the text of a business news article from AP, Dow Jones or Reuters.com into the viewer, and press submit. The article is sent to the Calais engine which tags the content and returns it, marked-up. – The tags appear on the left hand rail, and you can click on the plus (+) sign to see the tags expand. – Since we are now on Calais 4.0, you can also use the viewer to see the Linked Data assets related to the tags Calais returns. • Click on a company name on the left hand rail to find a Calais summary page featuring a basic description for that company, as well as a number of links. • Follow those links to see the other data entries on that company that are available for public use in the Linked Data Cloud. – For example, here is the Calais summary page for IBM: http://d.opencalais.com/er/company/ralg-tr1r/9e3f6c34-aa6b-3a3b-b221- a07aa7933633.html – And here is the summary page for IBM in DBPedia (the Wikipedia translated into computer language): http://dbpedia.org/page/IBM
  • 7. Why & What 1. Derive semantic metadata from textual assets 2. Use that semantic metadata to create entry points into the linked data ecosystem 3. Provide a simple mechanism for the sharing of semantic metadata about textual content assets 4. And just why are you doing this…
  • 8. 1: Semantics from Text: The Text Problem • People consume text • Most of it isn’t semantically enabled • Most of it won’t be semantically enabled • This isn’t about standards – microfromats vs RDFa vs. whatever. • Why: Latency, cost and short shelf- life
  • 9. 1: Semantics from Text: The Text Problem • Target areas where: Years – The economics Great Novels don’t support Scient. Shelf Life metadata Pubs creation Legacy – The value of News metadata is New Gen potentially high News – The value of Seconds Tweets aggregated metadata is Latency ds potentially on rs extremely high a c Ye Se
  • 10. 2: Getting from Text to the Linked Data Ecosystem
  • 12. 3: Semantic Metadata Transport Layer • I’m a content producer. We’ve loaded the car with rich semantic metadata – I’m sharing it within my four walls – How do I transport it to my consumers? – RSS / Atom, XML, Proprietary data feeds, Content API’s
  • 13. 4: Why We’re Doing It • Two simple answers: – Hyper-evolution of capabilities – better, faster, stronger – The walled garden content world
  • 14. How it Works – Under the Hood of Calais
  • 15. How it Works – Under the Hood of Calais Document Level Metadata Metadata Reference Management Data Assets Entity Level Linked Data and … Stat Tools Disambig. ClearForest Calais Web RD Engine NLP Engine Service F Rule Lexi Base cons Output Formatting
  • 16. Where From Here? • We’ve seen examples of first generation uses. • Where does this go in the future? • Beyond the document – Social Resume analysis – Museum Content Coalitions – Knowledge Management Applications – Investigative Journalism*
  • 17. Investigative Journalism FOIA Calais Web Company:Contract Contract Service Company:Affiliation Document s Big Fuzzy Graph News Calais Web Company:Person Service FamilyRelation
  • 18. What’s in the Pipeline? • 2009 (this is a fuzzy list) – Person disambiguation @ domain level? – Other disambiguation – Continued expansion of URI’s (entities & events) – Calais as hub – Exposure of the IDE? – User managed lexicons – Languages – Opt-in SPARQL Endpoint?
  • 19. • www.opencalais.com – Gallery – code and applications examples – Forums – Documentation • Twitter @opencalais, Facebook Group