rNews
Embedded Data For The News Industry

Hello!

§ Stuart Myles – @smyles
 Lead of the IPTC Semantic Web WG &
 Deputy Director of Schema Standards,
 The Associated Press

§ Evan Sandhaus – @kansandhaus
 Lead Architect, Semantic Platforms,
 The New York Times Company

§ Andreas Gebhard – @agebhard
 Managing Editor,
 Getty Images

...And 50 Others

Story components which are obvious to a person…

[Figure: labels STORY and PHOTO]

...are not so obvious to a machine.

[Figure: labels STORY and PHOTO]

The Problem of Structured Data
§ Modern Web Sites Built with 3 Tier Architecture
  • Data Tier: Database Where Content Lives.
  • Presentation Tier: HTML Document that is sent to user.
  • Logic Tier: Software that reads from the Data Tier and outputs the Presentation Tier.

[Diagram: Display Tier, Logic Tier, Data Tier]

The Problem Of Structured Data: Continued

Data Tier:

Label      Type     Value
id         number   1248069162607
Headline   text     New Web Code Draws Concern...
Byline     text     By TANZINA VEGA
Date       date     20101010
Body       text     In the next few years, a powerful...
Length     number   1123
Tag        text     Privacy
Tag        text     Computers and the Internet
Tag        text     Web Browsers

Display Tier (output of the Logic Tier):

<html>
  <head>
    <title>
      New Web Code Draws Concern...
    </title>
  </head>
  <body>
    <div>
      New Web Code Draws Concern...
    </div>
    <div>
      By TANZINA VEGA
    </div>
    <div>
      October 10, 2010
    </div>
    <div>
      In the next few years, a powerful...
    </div>
  </body>
</html>

§ Content very well structured on Data Tier, but all of this structure is lost in translation to presentation tier.

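The loss of structure described on this slide can be sketched in a few lines of Python. This is only an illustration: the `record` dictionary mirrors the Data Tier table, while `format_date` and `render_page` are hypothetical stand-ins for a Logic Tier, not code from the presentation.

```python
from datetime import datetime
from html import escape

# Hypothetical Data Tier record, mirroring the table on the slide.
record = {
    "id": 1248069162607,
    "headline": "New Web Code Draws Concern...",
    "byline": "By TANZINA VEGA",
    "date": "20101010",
    "body": "In the next few years, a powerful...",
    "length": 1123,
    "tags": ["Privacy", "Computers and the Internet", "Web Browsers"],
}


def format_date(yyyymmdd):
    """Turn the stored date (e.g. 20101010) into display text."""
    d = datetime.strptime(yyyymmdd, "%Y%m%d")
    return f"{d:%B} {d.day}, {d.year}"


def render_page(rec):
    """Hypothetical Logic Tier: emit display markup for one record.

    Field labels, the id, the length and the tags never reach the output.
    """
    visible = [rec["headline"], rec["byline"], format_date(rec["date"]), rec["body"]]
    divs = "\n".join(f"    <div>{escape(text)}</div>" for text in visible)
    return (
        "<html>\n"
        f"  <head><title>{escape(rec['headline'])}</title></head>\n"
        "  <body>\n"
        f"{divs}\n"
        "  </body>\n"
        "</html>"
    )


page = render_page(record)
print(page)
```

Every value ends up in an anonymous `<div>`, so nothing in the output says which text is the byline and which is the date.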
The Problem Of Structured Data: Continued

Display Tier = ?

<html>
  <head>
    <title>
      New Web Code Draws Concern...
    </title>
  </head>
  <body>
    <div>
      New Web Code Draws Concern...
    </div>
    <div>
      By TANZINA VEGA
    </div>
    <div>
      October 10, 2010
    </div>
    <div>
      In the next few years, a powerful...
    </div>
  </body>
</html>

§ Search engines, social networks, aggregators and other sites only see the Display Tier, and cannot leverage the underlying structure of the data.

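To make the consumer's point of view concrete, here is a minimal sketch using Python's standard `html.parser`. The `TextCollector` class and the `DISPLAY_TIER` constant are illustrative names, not part of rNews; the markup is the Display Tier example from the slide.

```python
from html.parser import HTMLParser

# The Display Tier markup from the slide, as a third party would receive it.
DISPLAY_TIER = """
<html>
  <head><title>New Web Code Draws Concern...</title></head>
  <body>
    <div>New Web Code Draws Concern...</div>
    <div>By TANZINA VEGA</div>
    <div>October 10, 2010</div>
    <div>In the next few years, a powerful...</div>
  </body>
</html>
"""


class TextCollector(HTMLParser):
    """Collect (enclosing tag, text) pairs, which is all this markup offers."""

    def __init__(self):
        super().__init__()
        self.stack = []
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        self.stack.append(tag)

    def handle_endtag(self, tag):
        if self.stack and self.stack[-1] == tag:
            self.stack.pop()

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.chunks.append((self.stack[-1] if self.stack else None, text))


collector = TextCollector()
collector.feed(DISPLAY_TIER)
for tag, text in collector.chunks:
    print(tag, "|", text)
```

Apart from `title`, every chunk is labeled only `div`. A consumer would have to guess, for instance from capitalization or position, which one is the byline.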
Semantic Markup Standards

Microformats   RDFa          Microdata      JSON
§ First        § Official    § Unofficial   § Official
§ Simple       § Complex     § Flexible     § Developers
§ Rigid        § OpenGraph   § Schema.org   § External

rNews

rNews Defined

rNews is a data model for embedding machine-readable publishing metadata in web documents and a set of suggested implementations.

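rNews is a data model

[Diagram: the rNews data model, centered on NewsItem]
  Classes shown: PostalAddress, Person, Location, GeoCoordinates, Organization, Concept, Comment, NewsItem, ImageObject, VideoObject, AudioObject, Article
  Property labels shown:
  • address (between PostalAddress and each of Person, Location, Organization)
  • geoCoordinates (between Location and GeoCoordinates)
  • next to Person: creator, editor, contributor, provider, copyrightHolder, accountablePerson
  • next to Organization: creator, editor, contributor, provider, copyrightHolder, sourceOrganization
  • next to Concept: name, about, mentions
  • next to Comment: comment
  • next to ImageObject, VideoObject, AudioObject: associatedMedia
  • next to Article: associatedArticle
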
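One way to read the data model is as a set of linked record types. The sketch below uses Python dataclasses with class and property names taken from the diagram. The field types, the `url` and `headline` fields, and the choice of which properties to include are assumptions made for illustration, not the rNews specification.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Union


@dataclass
class Person:
    name: str  # assumption: the diagram shows "name" only near Concept


@dataclass
class Organization:
    name: str


@dataclass
class Concept:
    name: str


@dataclass
class ImageObject:
    url: str  # assumption: field chosen for illustration
    creator: Optional[Person] = None
    sourceOrganization: Optional[Organization] = None


@dataclass
class NewsItem:
    headline: str  # assumption: taken from the "Headline, Byline, Tags" slide
    creator: List[Union[Person, Organization]] = field(default_factory=list)
    about: List[Concept] = field(default_factory=list)
    mentions: List[Concept] = field(default_factory=list)
    associatedMedia: List[ImageObject] = field(default_factory=list)


# Values taken from the working example used later in the deck.
item = NewsItem(
    headline="Allies Are Split on Goal and Exit Strategy in Libya",
    creator=[Person(name="STEVEN LEE MYERS")],
    about=[Concept(name="Qaddafi, Muammar el-")],
    associatedMedia=[
        ImageObject(
            url="img/libya_sample_reuters.jpg",
            creator=Person(name="Goran Tomasevic"),
            sourceOrganization=Organization(name="Reuters"),
        )
    ],
)
print(item.creator[0].name)
```

Unlike the anonymous `<div>` markup, each value here is reachable through a named property, which is the kind of structure rNews aims to carry into the page.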
for embedding machine-readable publishing metadata in web documents

Headline
Byline
Tags
Creator
...

and a set of suggested implementations

RDFa        Microdata    JSON
Today       Very Soon    Maybe?

rNews - Working Example

<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html>
<head></head>
<body>
  <div>
    <div>
      <div>Allies Are Split...</div>
      <div>NATO Takes Command</div>
      <div>
        <img src="img/libya_sample_reuters.jpg"/>
        <div>Credit: Goran Tomasevic/Reuters</div>
        <div>Rebel fighters take...</div>
      </div>
      <div>By STEVEN LEE MYERS</div>
      <div>WASHINGTON | March 24, 2011</div>
      <div>
        <p>Having largely succeeded...</p>
      </div>
      <div>
        <p><a href="http://www.nytimes.com/content/help/rights/copyright/copyright-notice.html">
          © Copyright 2011
        </a><span>The New York Times Company</span></p>
        <p><a href="http://www.nytimes.com/ref/membercenter/help/agree.html">
          Disclaimer
        </a></p>
      </div>
    </div>
    <div>
      <div>
        <div>Section</div>
        <div>World</div>
      </div>
      <div>Tags</div>
      <div>
        <div>
          <div>People</div>
          <div>Qaddafi, Muammar el-</div>
        </div>
      </div>
      <div>
        <div>Discussion (3)</div>
        <div>
          <div>So the question is..."</div>
          <div>
          <a href="http://timespeople.nytimes.com/view/user/27242827/activities.html">Chuck</a></div>
          <div>March 25th, 2011 8:27 am</div>
        </div>
      </div>
    </div>
  </div>
</body>
</html>

HTML 5 Microdata
<!DOCTYPE HTML>
<html itemscope itemtype="http://schema.org/NewsArticle">
<head>
  <style type="text/css">@import url(css/iptc_times2.css);</style>
  <meta itemprop="dateCreated" content="2011-03-23"/>
  <meta itemprop="description" content="The questions about the command..."/>
  <meta itemprop="inLanguage" content="en-US"/>
  <meta itemprop="thumbnailUrl" content="http://graphics8.nytimes.com/images/common/icons/t_wb_75.gif"/>
  <meta itemprop="genre" content="Current"/>
  <meta itemprop="id" content="1248069687395"/>
  <meta itemprop="version" content="2"/>
  <meta itemprop="publishingPrinciples" content="http://www.nytco.com/press/ethics.html"/>
  <meta itemprop="wordCount" content="879"/>
</head>
<body>
  <div style="height:900px" class="article">
    <div class="a_column">
      <div itemprop="headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>
      <div itemprop="alternativeHeadline" class="rider">NATO Takes Command</div>
      <div itemprop="associatedMedia" itemscope itemtype="http://schema.org/ImageObject">
        <img itemprop="URL" class="image" src="img/libya_sample_reuters.jpg"/>
        <div class="image_credit">Credit:
          <span itemprop="creator" itemscope itemtype="http://schema.org/Person">
            <span itemprop="name">Goran Tomasevic</span>
          </span>
          /
          <span itemprop="sourceOrganization" itemscope itemtype="http://schema.org/Organization">
            <span itemprop="name">Reuters</span>
            <meta itemprop="tickerSymbol" content="NYSE TRI"/>
          </span>
        </div>

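A consumer can read such `itemprop` attributes back out of the page. The sketch below is a deliberately simplified reader built on Python's standard `html.parser`. It returns a flat list of (property, value) pairs and ignores `itemscope` nesting, so it is not a conforming Microdata parser. The class name `ItempropReader` and the `SAMPLE` constant are illustrative, and `SAMPLE` is a shortened copy of the slide's markup.

```python
from html.parser import HTMLParser

SAMPLE = """
<html itemscope itemtype="http://schema.org/NewsArticle">
<head>
  <meta itemprop="dateCreated" content="2011-03-23"/>
  <meta itemprop="wordCount" content="879"/>
</head>
<body>
  <div itemprop="headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>
  <div itemprop="associatedMedia" itemscope itemtype="http://schema.org/ImageObject">
    <span itemprop="creator" itemscope itemtype="http://schema.org/Person">
      <span itemprop="name">Goran Tomasevic</span>
    </span>
  </div>
</body>
</html>
"""


class ItempropReader(HTMLParser):
    """Collect flat (itemprop, value) pairs; item scoping is ignored."""

    def __init__(self):
        super().__init__()
        self.pairs = []
        self._pending = None  # itemprop waiting for its text content

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        prop = a.get("itemprop")
        if prop is None:
            return
        if "content" in a:
            # <meta itemprop=... content=...> carries the value directly.
            self.pairs.append((prop, a["content"]))
        elif "itemscope" not in a and tag != "img":
            # Plain element: the value is its text content.
            self._pending = prop

    def handle_data(self, data):
        text = data.strip()
        if self._pending and text:
            self.pairs.append((self._pending, text))
            self._pending = None

    def handle_endtag(self, tag):
        self._pending = None


reader = ItempropReader()
reader.feed(SAMPLE)
for prop, value in reader.pairs:
    print(f"{prop}: {value}")
```

Because scoping is dropped, the `name` pair is not tied to the `creator` item it belongs to. A real consumer would need a full Microdata parser to rebuild the nested items.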
RDFa
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd">
<html xmlns:rnews="http://dec.iptc.org/rnews/0.1/">
<head>
  <style type="text/css">@import url(css/iptc_times2.css);</style>
</head>
<body>
  <div class="article" style="height:623px">
    <div class="a_column">
      <div property="rnews:headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>
      <div class="rider">NATO Takes Command</div>
      <div class="main_image">
        <img class="image" src="img/libya_sample_reuters.jpg"/>
        <div class="image_credit">Credit: Goran Tomasevic/Reuters</div>
        <div class="image_caption">
          Rebel fighters take cover during a shelling near Ajdabiyah, Libya on Thursday.
        </div>
      </div>
      <div rel="rnews:createdBy" class="byline">By
        <span about="http://demo.iptc.org/per/steven_lee_myers" typeof="rnews:Person">
          <span property="rnews:name">STEVEN LEE MYERS</span>
        </span>
      </div>
      <div class="publication_date">
        <span property="rnews:dateline">WASHINGTON</span>
        |
        <span property="rnews:dateCreated" content="2011-03-24">March 24, 2011</span>
      </div>

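In the RDFa version, property names such as `rnews:headline` are compact URIs whose prefix is declared by `xmlns:rnews` on the `<html>` element. The sketch below expands those prefixes and collects literal values, preferring a `content` attribute over the visible text. It is a simplified illustration built on Python's standard `html.parser`, not a conforming RDFa processor: it ignores `about`, `rel` and `typeof`, so subjects are not tracked. The class name `RdfaPropertyReader` and the `SAMPLE` constant are illustrative.

```python
from html.parser import HTMLParser

SAMPLE = """
<html xmlns:rnews="http://dec.iptc.org/rnews/0.1/">
<body>
  <div property="rnews:headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>
  <div rel="rnews:createdBy" class="byline">By
    <span about="http://demo.iptc.org/per/steven_lee_myers" typeof="rnews:Person">
      <span property="rnews:name">STEVEN LEE MYERS</span>
    </span>
  </div>
  <span property="rnews:dateline">WASHINGTON</span>
  |
  <span property="rnews:dateCreated" content="2011-03-24">March 24, 2011</span>
</body>
</html>
"""


class RdfaPropertyReader(HTMLParser):
    """Collect (expanded property IRI, literal value) pairs."""

    def __init__(self):
        super().__init__()
        self.prefixes = {}
        self.values = []
        self._pending = None  # property waiting for its text content

    def expand(self, curie):
        """Replace a declared prefix with its namespace IRI."""
        prefix, _, local = curie.partition(":")
        return self.prefixes.get(prefix, prefix + ":") + local

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        for key, value in a.items():
            if key.startswith("xmlns:"):
                self.prefixes[key[len("xmlns:"):]] = value
        prop = a.get("property")
        if prop is None:
            return
        if "content" in a:
            # Machine-readable value overrides the human-readable text.
            self.values.append((self.expand(prop), a["content"]))
        else:
            self._pending = prop

    def handle_data(self, data):
        text = data.strip()
        if self._pending and text:
            self.values.append((self.expand(self._pending), text))
            self._pending = None

    def handle_endtag(self, tag):
        self._pending = None


reader = RdfaPropertyReader()
reader.feed(SAMPLE)
for iri, value in reader.values:
    print(iri, "=", value)
```

The `dateCreated` entry shows why the `content` attribute matters: the page displays "March 24, 2011" to readers while machines receive `2011-03-24`.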
The Way To rNews

2010
       § June: Genesis of rNews - Evan at SemTech 2010
       § November 5 - Rome: chartered
       § internal discussions about NYT draft
2011
       § March 9 - Dubai: rNews 0.1
       § lots of feedback, changes and additions
       § June 9 - Berlin: rNews 0.5
       § June 28: rNews 0.6
       § September 6: rNews 0.7 [aligned w/ schema.org]
       § October 7 - Vienna: rNews 1.0

Engaging Our Community




Feedback we incorporated...

§ In Person
  • 3 Meetups: New York, Berlin, London
  • Over a dozen one-on-one meetings with leading media and technology companies.
§ Online
  • rNews.org forum
  • Numerous Blog Posts
§ In The Standards Community
  • W3C Community Group
  • Media Standards Trust

Feedback we incorporated...

[Diagram: Location and GeoCoordinates. Property labels on one side: latitude, longitude, altitude. Property labels on the other side: point, circle, elevation, polygon, box, line.]

Feedback we incorporated...

[Diagram: Person and NewsItem, connected by the property label editor]

rNews Benefits
Or Why You Should Care About rNews

Benefit #1: Better Links

[Figure: comparison labeled "With Structured Data" and "No Structured Data"]

Benefit #2: Better Analytics

JavaScript can extract richer news metadata
Analytics per item, not just per page

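The slide names JavaScript as the extraction tool. To show what per-item analytics could look like once metadata has been extracted, here is a small Python sketch of the aggregation step. The `PAGE_VIEWS` data and the `views_per_value` helper are hypothetical; only the property names `creator` and `about` come from the rNews data model.

```python
from collections import Counter

# Hypothetical log: one entry per page view, holding the metadata that
# was extracted from the viewed page.
PAGE_VIEWS = [
    {"creator": "STEVEN LEE MYERS", "about": ["Libya", "NATO"]},
    {"creator": "STEVEN LEE MYERS", "about": ["Libya"]},
    {"creator": "TANZINA VEGA", "about": ["Privacy", "Web Browsers"]},
]


def views_per_value(page_views, prop):
    """Count page views for each value of one metadata property."""
    counts = Counter()
    for view in page_views:
        values = view.get(prop, [])
        if isinstance(values, str):
            values = [values]  # single-valued property
        counts.update(values)
    return counts


print(views_per_value(PAGE_VIEWS, "creator"))
print(views_per_value(PAGE_VIEWS, "about"))
```

With only URLs to go on, the same log could be grouped by page but not by author or subject.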
Benefit #3: Better Ad Placement

Leverage metadata, not just text
Avoid unfortunate juxtapositions

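As a rough illustration of placing ads from metadata rather than page text, the sketch below checks an ad category against the subjects an article is `about`. Everything in it is hypothetical: the rule table `AVOID`, the category and subject names, and the `ad_allowed` helper are invented for the example and come from neither rNews nor any ad system.

```python
# Hypothetical rule table: for each ad category, article subjects next to
# which the ad should not appear.
AVOID = {
    "airline": {"Aviation Accidents"},
    "cruise": {"Shipwrecks"},
}


def ad_allowed(ad_category, article_subjects, rules=AVOID):
    """Return True unless the article is about a subject the ad should avoid."""
    blocked = rules.get(ad_category, set())
    return blocked.isdisjoint(article_subjects)


print(ad_allowed("airline", {"Aviation Accidents", "Travel"}))
print(ad_allowed("airline", {"Privacy"}))
```

The check relies on explicit subject metadata, so it does not depend on particular words happening to appear in the article body.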
rNews as a news API

Level the Playing Field
Encourage Open Innovation

How Can You Help Us Get to rNews 1.0?

§ Check out the rNews 0.7 spec
§ Mark up some pages using rNews
§ Extract rNews properties using your favourite distiller
§ Dream up The Next Metadata Killer App™

Let us know what you think
Let us know how we can help
@smyles • @agebhard • @kansandhaus

rNews
Thank You

On our way to IPTC rNews 1.0 - Inception and Design of a Standard

  • 1. rNews Embedded Data For The News Industry 1
  • 2. Hello! § Stuart Myles – @smyles Lead of the IPTC Semantic Web WG & Deputy Director of Schema Standards, The Associated Press § Evan Sandhaus – @kansandhaus Lead Architect, Semantic Platforms, The New York Times Company § Andreas Gebhard – @agebhard Managing Editor, Getty Images 2
  • 3.
  • 5.
  • 6. Story components which are obvious to a person… STORY PHOTO
  • 7. ...are not so obvious to a machine. STORY PHOTO
  • 8. The Problem of Structured Data § Modern Web Sites Built with 3 Tier Display Architecture • Data Tier: Database Tier Where Content Lives. • Presentation Tier: HTML Document that is sent to user. • Logic Tier: Software Logic that reads from the Data Tier and outputs the Tier Presentation Tier. Data Tier 8
  • 9. The Problem Of Structured Data: Continued Data Tier Logic Tier Display Tier Label Type Value <html> <head> id number 1248069162607 <title> Headline text New Web Code Draws Concern... New Web Code Draws Concern... </title> Byline text By TANZINA VEGA </head> Date date 20101010 <body> <div> Body text In the next few years, a powerful... New Web Code Draws Concern... Length number 1123 </div> <div> Tag text Privacy By TANZINA VEGA Tag text Computers and the Internet </div> <div> Tag text Web Browsers October 10, 2010 </div> <div> In the next few years, a powerful... </div> </body> § Content very well structured on Data </html> Tier, but all of this structure is lost in translation to presentation tier. 9
  • 10. The Problem Of Structured Data: Continued
      Display Tier:
        <html>
          <head>
            <title>
              New Web Code Draws Concern...
            </title>
          </head>
          <body>
            <div>
              New Web Code Draws Concern...
            </div>
            <div>
              By TANZINA VEGA
            </div>
            <div>
              October 10, 2010
            </div>
            <div>
              In the next few years, a powerful...
            </div>
          </body>
        </html>
      § Search engines, social networks, aggregators and other sites only see the Display Tier, and cannot leverage the underlying structure of the data.
  • 11. Semantic Markup Standards
        Microformats   RDFa        Microdata    JSON
        § First        § Official  § Unofficial § Official
        § Simple       § Complex   § Flexible   § Developers
        § Rigid        § OpenGraph § Schema.org § External
  • 12. rNews
  • 13. rNews Defined: rNews is a data model for embedding machine-readable publishing metadata in web documents and a set of suggested implementations.
  • 14. rNews is a data model
      [Diagram of the rNews data model. Classes: NewsItem, Article, ImageObject, VideoObject, AudioObject, Comment, Concept, Person, Organization, Location, PostalAddress, GeoCoordinates. Properties: address, geo, name, creator, editor, contributor, provider, copyrightHolder, accountablePerson, sourceOrganization, about, mentions, comment, associatedMedia, associatedArticle.]
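As a rough illustration of what "data model" means on slide 14, a small subset of the diagram can be written as Python dataclasses. The class and property names come from the slides; which property hangs off which class, the Python types, and the `url` and `headline` fields (borrowed from the Microdata example later in the deck) are assumptions made for this sketch, not the rNews specification.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Person:
    name: str


@dataclass
class Organization:
    name: str


@dataclass
class ImageObject:
    url: str  # assumed field, mirroring the image URL in the Microdata example
    creator: Optional[Person] = None
    sourceOrganization: Optional[Organization] = None


@dataclass
class Article:
    headline: str  # assumed field, mirroring the headline in the examples
    creator: List[Person] = field(default_factory=list)
    associatedMedia: List[ImageObject] = field(default_factory=list)


# Values taken from the working example in the deck.
article = Article(
    headline="Allies Are Split on Goal and Exit Strategy in Libya",
    creator=[Person(name="STEVEN LEE MYERS")],
    associatedMedia=[
        ImageObject(
            url="img/libya_sample_reuters.jpg",
            creator=Person(name="Goran Tomasevic"),
            sourceOrganization=Organization(name="Reuters"),
        )
    ],
)
print(article.associatedMedia[0].sourceOrganization.name)
```

The point of the sketch is only that every value has a named slot and a typed relationship to the article, which is what the embedded markup then carries into the page.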
  • 15. for embedding machine-readable publishing metadata in web documents: Headline, Byline, Tags, Creator, ...
  • 16. and a set of suggested implementations: RDFa (today), Microdata (very soon), JSON (maybe?)
  • 17. rNews - Working Example
  • 18. Working example: plain HTML without embedded metadata
        <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
        <html>
        <head></head>
        <body>
          <div>
            <div>
              <div>Allies Are Split...</div>
              <div>NATO Takes Command</div>
              <div>
                <img src="img/libya_sample_reuters.jpg"/>
                <div>Credit: Goran Tomasevic/Reuters</div>
                <div>Rebel fighters take...</div>
              </div>
              <div>By STEVEN LEE MYERS</div>
              <div>WASHINGTON | March 24, 2011</div>
              <div>
                <p>Having largely succeeded...</p>
              </div>
              <div>
                <p><a href="http://www.nytimes.com/content/help/rights/copyright/copyright-notice.html">
                  © Copyright 2011
                </a><span>The New York Times Company</span></p>
                <p><a href="http://www.nytimes.com/ref/membercenter/help/agree.html">
                  Disclaimer
                </a></p>
              </div>
            </div>
            <div>
              <div>
                <div>Section</div>
                <div>World</div>
              </div>
              <div>Tags</div>
              <div>
                <div>
                  <div>People</div>
                  <div>Qaddafi, Muammar el-</div>
                </div>
              </div>
              <div>
                <div>Discussion (3)</div>
                <div>
                  <div>So the question is..."</div>
                  <div>
                    <a href="http://timespeople.nytimes.com/view/user/27242827/activities.html">Chuck</a></div>
                  <div>March 25th, 2011 8:27 am</div>
                </div>
              </div>
            </div>
          </div>
        </body>
        </html>
  • 19. HTML 5 Microdata
        <!DOCTYPE HTML>
        <html itemscope itemtype="http://schema.org/NewsArticle">
        <head>
          <style type="text/css">@import url(css/iptc_times2.css);</style>
          <meta itemprop="dateCreated" content="2011-03-23"/>
          <meta itemprop="description" content="The questions about the command..."/>
          <meta itemprop="inLanguage" content="en-US"/>
          <meta itemprop="thumbnailUrl" content="http://graphics8.nytimes.com/images/common/icons/t_wb_75.gif"/>
          <meta itemprop="genre" content="Current"/>
          <meta itemprop="id" content="1248069687395"/>
          <meta itemprop="version" content="2"/>
          <meta itemprop="publishingPrinciples" content="http://www.nytco.com/press/ethics.html"/>
          <meta itemprop="wordCount" content="879"/>
        </head>
        <body>
          <div style="height:900px" class="article">
            <div class="a_column">
              <div itemprop="headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>
              <div itemprop="alternativeHeadline" class="rider">NATO Takes Command</div>
              <div itemprop="associatedMedia" itemscope itemtype="http://schema.org/ImageObject">
                <img itemprop="URL" class="image" src="img/libya_sample_reuters.jpg"/>
                <div class="image_credit">Credit:
                  <span itemprop="creator" itemscope itemtype="http://schema.org/Person">
                    <span itemprop="name">Goran Tomasevic</span>
                  </span>
                  /
                  <span itemprop="sourceOrganization" itemscope itemtype="http://schema.org/Organization">
                    <span itemprop="name">Reuters</span>
                    <meta itemprop="tickerSymbol" content="NYSE TRI"/>
                  </span>
                </div>
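To give a feel for what a consumer gains from the Microdata markup on slide 19, here is a minimal Python sketch that pulls `itemprop` values back out of a shortened copy of that markup using only the standard library's `html.parser`. The `ItempropCollector` class is a hypothetical helper written for this illustration. It collects a flat list and deliberately ignores item nesting, so it is far simpler than the real Microdata extraction algorithm.

```python
from html.parser import HTMLParser


class ItempropCollector(HTMLParser):
    """Collect (itemprop, value) pairs in document order (flat, no nesting)."""

    def __init__(self):
        super().__init__()
        self.props = []
        self._pending = None  # itemprop still waiting for its text content

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        name = a.get("itemprop")
        if name is None:
            return
        if "itemscope" in a:
            # Nested item: record its type instead of any text.
            self.props.append((name, a.get("itemtype") or ""))
            self._pending = None
        elif tag == "meta":
            self.props.append((name, a.get("content") or ""))
        else:
            self._pending = name

    def handle_data(self, data):
        text = data.strip()
        if self._pending and text:
            self.props.append((self._pending, text))
            self._pending = None


# Shortened copy of the slide's markup.
sample = """
<html itemscope itemtype="http://schema.org/NewsArticle">
<head>
  <meta itemprop="dateCreated" content="2011-03-23"/>
  <meta itemprop="wordCount" content="879"/>
</head>
<body>
  <div itemprop="headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>
  <div itemprop="alternativeHeadline" class="rider">NATO Takes Command</div>
</body>
</html>
"""

collector = ItempropCollector()
collector.feed(sample)
props = collector.props
for name, value in props:
    print(f"{name}: {value}")
```

Unlike the plain HTML on the previous slide, every extracted value arrives with its label attached, so no guessing about which `<div>` is the headline is needed.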
  • 20. RDFa
        <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd">
        <html xmlns:rnews="http://dec.iptc.org/rnews/0.1/">
        <head>
          <style type="text/css">@import url(css/iptc_times2.css);</style>
        </head>
        <body>
          <div class="article" style="height:623px">
            <div class="a_column">
              <div property="rnews:headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>
              <div class="rider">NATO Takes Command</div>
              <div class="main_image">
                <img class="image" src="img/libya_sample_reuters.jpg"/>
                <div class="image_credit">Credit: Goran Tomasevic/Reuters</div>
                <div class="image_caption">
                  Rebel fighters take cover during a shelling near Ajdabiyah, Libya on Thursday.
                </div>
              </div>
              <div rel="rnews:createdBy" class="byline">By
                <span about="http://demo.iptc.org/per/steven_lee_myers" typeof="rnews:Person">
                  <span property="rnews:name">STEVEN LEE MYERS</span>
                </span>
              </div>
              <div class="publication_date">
                <span property="rnews:dateline">WASHINGTON</span>
                |
                <span property="rnews:dateCreated" content="2011-03-24">March 24, 2011</span>
              </div>
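The RDFa variant on slide 20 can be read in a similar way. The sketch below, again using only Python's standard `html.parser`, gathers the `property="rnews:..."` attributes from a fragment of the slide's markup, preferring the `content` attribute when one is present. `RdfaPropertyCollector` is a hypothetical name, and this is not a conforming RDFa processor: it ignores `rel`, `about` and `typeof`, so it yields flat key/value pairs rather than triples.

```python
from html.parser import HTMLParser


class RdfaPropertyCollector(HTMLParser):
    """Collect values of property="<prefix>..." attributes into a dict."""

    def __init__(self, prefix="rnews:"):
        super().__init__()
        self.prefix = prefix
        self.values = {}
        self._pending = None  # property still waiting for its text content

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        prop = a.get("property") or ""
        if not prop.startswith(self.prefix):
            return
        key = prop[len(self.prefix):]
        if a.get("content") is not None:
            # A machine-readable value overrides the human-readable text.
            self.values[key] = a["content"]
            self._pending = None
        else:
            self._pending = key

    def handle_data(self, data):
        text = data.strip()
        if self._pending and text:
            self.values[self._pending] = text
            self._pending = None


# Fragment of the slide's markup.
fragment = """
<div property="rnews:headline" class="headline">Allies Are Split on Goal and Exit Strategy in Libya</div>
<div rel="rnews:createdBy" class="byline">By
  <span about="http://demo.iptc.org/per/steven_lee_myers" typeof="rnews:Person">
    <span property="rnews:name">STEVEN LEE MYERS</span>
  </span>
</div>
<div class="publication_date">
  <span property="rnews:dateline">WASHINGTON</span>
  |
  <span property="rnews:dateCreated" content="2011-03-24">March 24, 2011</span>
</div>
"""

rdfa = RdfaPropertyCollector()
rdfa.feed(fragment)
values = rdfa.values
print(values)
```

Note how `dateCreated` comes back as the ISO-style `2011-03-24` from the `content` attribute, while the reader still sees "March 24, 2011" on the page.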
  • 21. The Way to rNews
  • 22. The Way To rNews
      2010
        § June: Genesis of rNews - Evan at SemTech 2010
        § November 5 - Rome: chartered
        § internal discussions about NYT draft
      2011
        § March 9 - Dubai: rNews 0.1
        § lots of feedback, changes and additions
        § June 9 - Berlin: rNews 0.5
        § June 28: rNews 0.6
        § September 6: rNews 0.7 [aligned w/ schema.org]
        § October 7 - Vienna: rNews 1.0
  • 27. Feedback we incorporated...
      § In Person
        • 3 Meetups: New York, Berlin, London
        • Over a dozen one-on-one meetings with leading media and technology companies.
      § Online
        • Rnews.org forum
        • Numerous blog posts
      § In the Standards Community
        • W3C Community Group
        • Media Standards Trust
  • 28. Feedback we incorporated... [Diagram: Location and GeoCoordinates, with the properties point, circle, polygon, box, line, latitude, longitude, elevation, altitude]
  • 29. Feedback we incorporated... [Diagram: NewsItem linked to Person by the editor property]
  • 30. rNews Benefits Or Why You Should Care About rNews
  • 31. Benefit #1: Better Links [Comparison: With Structured Data vs. No Structured Data]
  • 32. Benefit #2: Better Analytics
      § JavaScript can extract richer news metadata
      § Analytics per item, not just per page
  • 33. Benefit #3: Better Ad Placement
      § Leverage metadata, not just text
      § Avoid unfortunate juxtapositions
  • 34. rNews as a news API
      § Level the Playing Field
      § Encourage Open Innovation
  • 36. How Can You Help Us Get to rNews 1.0?
      § Check out the rNews 0.7 spec
      § Mark up some pages using rNews
      § Extract rNews properties using your favourite distiller
      § Dream up The Next Metadata Killer App™
      Let us know what you think. Let us know how we can help.
      @smyles • @agebhard • @kansandhaus