SlideShare una empresa de Scribd logo
1 de 39
hoard.it : Stealing your data
Or... “Where is your online value?”
Or... “Originality sucks”
Dan Zambonini
www.boxuk.com

Museums and the Web 2009, Indianapolis, April 16
WARNING
WARNING
1. I am playing Devil’s Advocate

2. These are‘thoughts in progress’
Introduction
1. The hoard.it project

2. Museums and the Web:
   where’s the value?
Introduction
1. The hoard.it project

2. Museums and the Web:
   where’s the value?
2.5 - 15%
2.5 - 15%
Cross-Collections Projects

  “Search through the cultural collections of Europe”



            “explore and comment on collections”


     “find and explore digital collections from museums”


                   “Discover cultural objects, collections”
Why is this a Problem?
1. Some duplication of effort
  • £25,000 - £100,000 to put collections online
  • £1,500 - £6,500 per cross-collection project
2. Potential end-user confusion
3. Usually only include larger institutions
4. Is there really a need?
Our Approach
• Use data that already exists
   • No cost/duplication of effort
• No input or changes from museums
   • Lightweight, open to all
• Re-expose the data programmatically
   • Enable easy re-use
How it works
Screen-Scraper + Spider
How it works
Screen-Scraper + Spider
How it works
Screen-Scraper + Spider
Difficulties and Limitations
•   Must have collections online
•   Must have a consistent template
•   Slow; not real-time
•   Technical variations (encoding, standards)
•   Rudimentary: Flash/Forms a barrier
Difficulties: Normalization
•   Dates
    •   circa 19th century, 1960s, 2008-01, 1Jan ’52, 2000 BC, 30s, April 4 1934,
        04-76, 1783-25-04, 10-11-64, about 200 AD, Victorian, 1100-1150, ...

    •   http://feeds.boxuk.com/convert/date/


•   Location
    •   Points of interest, cities, towns, countries, administrative regions, political
        regions, ancient names, continents, postal codes, co-ordinates, ...

    •   http://developer.yahoo.com/geo/
The Data
   Virtual Museum of Canada!

     Carnegie Museum of Art!

          Smithsonian NASM!

 National Museum of Australia!

      National Portrait Gallery!

        Imperial War Museum!

National Museums of Scotland!

                     Ingenious!

  Museum of London: E20CL!

               British Museum!

  Victoria and Albert Museum!

   National Maritime Museum!

                  Powerhouse!

             Science Museum!

             24 Hour Museum!

            Freebase: Events!

    Wikipedia: List of Painters!

                                   0!   2000!   4000!   6000!   8000!   10000!   12000!   14000!   16000!
The Data
   Virtual Museum of Canada!

     Carnegie Museum of Art!

          Smithsonian NASM!

 National Museum of Australia!

      National Portrait Gallery!

        Imperial War Museum!

National Museums of Scotland!

                     Ingenious!

  Museum of London: E20CL!

               British Museum!

  Victoria and Albert Museum!

   National Maritime Museum!

                  Powerhouse!

             Science Museum!

             24 Hour Museum!

            Freebase: Events!

    Wikipedia: List of Painters!

                                   0!   2000!   4000!   6000!   8000!   10000!   12000!   14000!   16000!


                                                                            70,000 objects
The Data
 • URL            100%
 • Identifier     95%
 • Title          100%
 • Description    70%
 • Image          85%
 • Creator        50%
 • Created Date   75%
 • Copyright      50%
 • Dimensions     45%
 • Subject        65%
 • Location       45%
 • Materials      65%
Data Mining - Location
                                       65%   Europe
                                       15%   Asia
                                       14%   North America
                                       4%    Oceania




Percentage of objects from the same continent as museum:

• North America: 85%
• Europe:        75%
• Oceania:       65%
% of objects by continent of origin!




             0!
                  10!
                        20!
                                  30!
                                          40!
                                                  50!
                                                          60!
                                                                     70!
                                                                           80!
                                                                                 90!
        -1000!
         -900!
         -800!
         -700!
         -600!
         -500!
         -400!
         -300!
         -200!
         -100!
            0!
          100!
          200!
          300!
          400!
          500!




Year!
          600!
          700!
          800!
          900!
         1000!
         1100!
         1200!
         1300!
         1400!
         1500!
         1600!
         1700!
         1800!
         1900!
         2000!
                                 Asia!
                                 Africa!
                                 Europe!
                                 Oceania!
                                 North America!
                                 South America!
                                                                                       Data Mining - Date/Location
% of objects by material!




                      0!
                           5!
                                10!
                                                15!
                                                                  20!
                                                                            25!
                                                                                  30!
                                                                                        35!
                                                                                              40!
              0!
         10
              0!
         20
              0!
         30
              0!
         40
              0!
         50
              0!
         60
              0!
         70
              0!
         80
              0!
         90
              0!
        10
          00
               !




Year!
        11
           0  0!
        12
             00
                  !
        13
             00
                  !
        14
             00
                  !
        15
          00
               !
        16
             00
                  !
        17
             00
                  !
        18
             00
                  !
        19
             00
                  !
        20
             00
                  !
                                                          Clay!

                                                  Gold!

                                      Silver!
                                                                   Stone!
                                                                                                    Data Mining - Date/Material
How it has been used
•   Experiments: http://hoard.it/labs/




•   UK Museums on the
    Web 2008 Hack Day


•   Who knows...?
                                         Photo courtesy of Brian Kelly
How it has been used
Next steps...
Next steps...


 ABSOLUTELY
  NOTHING
Do you offer anything?
dbPedia, Freebase
What can you offer?
•   Expertise
•   Media
•   The Physical Space
•   Reputation and Trust
•   Audience
•   Voice, Exposure and Influence
What’s changed?
“...not all information should flow everywhere; only the
meaningful should be transmitted.

But in the network economy only signals in real time (or
close to it) are truly meaningful.

Examine the speed of knowledge in your system. How
can it be brought closer to real time? If this requires the
cooperation of subcontractors, distant partners, and far-
flung customers, so much the better.”

Kevin Kelly
http://www.kk.org/newrules/blog/2009/04/if-you-are-not-in-real-time-yo.php
What’s changed?


                  !quot;#$%#$&
!quot;#$%&




                  '($(&
                  )%*+,-%.&




          '()%&
What’s changed?
What’s changed?

 EXECUTION
    not
   IDEAS
What’s changed?

              !quot;#$%&'()
              *+#,)




                      !quot;#$%&'(
                      )*#+%$%&'(
                      ,--.**%+%$&'(
                      /0.(1%20&(3.#"4.*(
                      5.*%26(
UK Newspaper Example
                                ,-./012345quot;
                                 #!quot;
                                  +quot;
                                  *quot;
                F44:G2.:=quot;                        6278925:quot;
                                  )quot;
                                  (quot;
                                  'quot;
                                                                               H2-1Iquot;JKL.8==quot;
                                  "
                                                                               H2-1Iquot;A2-1quot;
                                  %quot;
                                                                               H2-1Iquot;A-..4.quot;
                                  $quot;
                                                                               H2-1Iquot;CM2.quot;
                                  #quot;
                                                                               H2-1Iquot;>8187.2LBquot;
                                  !quot;
D5-E08quot;D=8.=quot;                                                 ;2/8<44:quot;;25=quot;
                                                                               ;-525/-21quot;>-G8=quot;
                                                                               >B8quot;N02.O-25quot;
                                                                               >B8quot;P5O8L85O85Mquot;
                                                                               >B8quot;C05quot;
                                                                               >B8quot;>-G8=quot;



         9CCquot;C0<=/.-<8.=quot;                         >?-@8.quot;;4114?8.=quot;




                             A85345=quot;-5quot;$&quot;B.=quot;
For example
•   Let your patrons collaborate
•   Let your patrons run your space
•   Give local communities a voice
•   Provide advice and guidance
•   Collect & distribute niche knowledge
•   ...


•   You know better than I do.
What has to change?
•   A focus on proven user needs
•   Re-usable services, not more data
•   Smaller projects
•   Iterative approaches
•   A real commitment to the web platform
•   (At least some) In-house development
How do we get there?
•   Should web projects generate revenue?
•   Don’t be afraid of re-inventing the wheel
•   Demand all projects use/expose APIs that
    are easy (REST not SOAP/OAI) and publicized
•   Show early, show often
•   Annoy funding bodies to support more,
    smaller, longer (i.e. iterative) ‘boring’ projects,
    and less ‘big, audacious’ projects.
Summary
•   We stole your data...
•   But then so are lots of other people...
•   So produce value elsewhere.


•   Ideas are harmful: do what’s proven...
•   But do it brilliantly.
•   And to do that, we need change.
Thank you
      www.boxuk.com


      dan@boxuk.com


    twitter.com/zambonini
Thank you
      www.boxuk.com


      dan@boxuk.com


    twitter.com/zambonini

Más contenido relacionado

Destacado

Intranet y sus beneficios
Intranet y sus beneficiosIntranet y sus beneficios
Intranet y sus beneficiosAndrewwcc
 
Matéria "Viagem ao Interior"
Matéria "Viagem ao Interior"Matéria "Viagem ao Interior"
Matéria "Viagem ao Interior"Asyst News
 
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing ...
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing  ...Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing  ...
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing ...museums and the web
 
Saudi waste (recycling, energy, composte, water) solution
Saudi waste (recycling, energy, composte, water) solutionSaudi waste (recycling, energy, composte, water) solution
Saudi waste (recycling, energy, composte, water) solutionBrandon Dooley
 
Digital Storytelling - Palestra Santa Maria (19/11/16)
Digital Storytelling - Palestra Santa Maria (19/11/16)Digital Storytelling - Palestra Santa Maria (19/11/16)
Digital Storytelling - Palestra Santa Maria (19/11/16)Penso Ideias
 
Practica colas (if, else)
Practica colas (if, else)Practica colas (if, else)
Practica colas (if, else)Eli Diaz
 
Kittim silva-manual-practico-de-homilitica(1)
Kittim silva-manual-practico-de-homilitica(1)Kittim silva-manual-practico-de-homilitica(1)
Kittim silva-manual-practico-de-homilitica(1)miguelmunguia
 
Inspiring Shopper Behaviours
Inspiring Shopper BehavioursInspiring Shopper Behaviours
Inspiring Shopper BehavioursOgilvy Consulting
 

Destacado (11)

Intranet y sus beneficios
Intranet y sus beneficiosIntranet y sus beneficios
Intranet y sus beneficios
 
12 san francisco museum of modern art
12 san francisco museum of modern art12 san francisco museum of modern art
12 san francisco museum of modern art
 
Matéria "Viagem ao Interior"
Matéria "Viagem ao Interior"Matéria "Viagem ao Interior"
Matéria "Viagem ao Interior"
 
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing ...
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing  ...Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing  ...
Rob Stein, Charles Moad, Ed Bachta, Museums and Cloud Computing ...
 
Saudi waste (recycling, energy, composte, water) solution
Saudi waste (recycling, energy, composte, water) solutionSaudi waste (recycling, energy, composte, water) solution
Saudi waste (recycling, energy, composte, water) solution
 
Digital Storytelling - Palestra Santa Maria (19/11/16)
Digital Storytelling - Palestra Santa Maria (19/11/16)Digital Storytelling - Palestra Santa Maria (19/11/16)
Digital Storytelling - Palestra Santa Maria (19/11/16)
 
PicNic no Monet
PicNic no MonetPicNic no Monet
PicNic no Monet
 
Practica colas (if, else)
Practica colas (if, else)Practica colas (if, else)
Practica colas (if, else)
 
Kittim silva-manual-practico-de-homilitica(1)
Kittim silva-manual-practico-de-homilitica(1)Kittim silva-manual-practico-de-homilitica(1)
Kittim silva-manual-practico-de-homilitica(1)
 
Proyecto Final
Proyecto FinalProyecto Final
Proyecto Final
 
Inspiring Shopper Behaviours
Inspiring Shopper BehavioursInspiring Shopper Behaviours
Inspiring Shopper Behaviours
 

Similar a Dan Zambonini and Mike Ellis, hoard.it: Aggregating, displaying and mining object-data without consent

RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmailRTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmailAlberto Bacchelli
 
The Future of The Web: Transmission TX2 Talk
The Future of The Web: Transmission TX2 TalkThe Future of The Web: Transmission TX2 Talk
The Future of The Web: Transmission TX2 TalkDigital Sparks
 
Cloud computing
Cloud computingCloud computing
Cloud computingtimesheet1
 
TBF 2011- Ezequiel Singer: "Google Workshop"
TBF 2011- Ezequiel Singer: "Google Workshop"TBF 2011- Ezequiel Singer: "Google Workshop"
TBF 2011- Ezequiel Singer: "Google Workshop"Karla Witte
 
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...WRI Ross Center for Sustainable Cities
 
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」Takashi Iba
 
B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009adminfbgroup
 
B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009guest3117009
 

Similar a Dan Zambonini and Mike Ellis, hoard.it: Aggregating, displaying and mining object-data without consent (8)

RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmailRTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
RTFM (Read The Factual Mails) --Augmenting Program Comprehension with REmail
 
The Future of The Web: Transmission TX2 Talk
The Future of The Web: Transmission TX2 TalkThe Future of The Web: Transmission TX2 Talk
The Future of The Web: Transmission TX2 Talk
 
Cloud computing
Cloud computingCloud computing
Cloud computing
 
TBF 2011- Ezequiel Singer: "Google Workshop"
TBF 2011- Ezequiel Singer: "Google Workshop"TBF 2011- Ezequiel Singer: "Google Workshop"
TBF 2011- Ezequiel Singer: "Google Workshop"
 
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
Transforming Fuels and Vehicle Technology - Drew Kodjak - ICCT - Transforming...
 
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
ORF2011「学びの対話ワークショップ:クリエイティブ・ラーニングと人材育成」
 
B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009
 
B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009B2 B Channel Newsletter Q4 2009
B2 B Channel Newsletter Q4 2009
 

Más de museums and the web

How to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting SiuHow to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting Siumuseums and the web
 
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...museums and the web
 
MW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museumMW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museummuseums and the web
 
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...museums and the web
 
MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications fo...
MW2011: Klavans, J.  +, Computational Linguistics in Museums: Applications fo...MW2011: Klavans, J.  +, Computational Linguistics in Museums: Applications fo...
MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications fo...museums and the web
 
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...museums and the web
 
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...museums and the web
 
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia GuideMW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia Guidemuseums and the web
 
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...museums and the web
 
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor TrackingMW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Trackingmuseums and the web
 
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...museums and the web
 
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...museums and the web
 
MW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data SculptingMW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data Sculptingmuseums and the web
 
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...museums and the web
 
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...museums and the web
 
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...museums and the web
 
MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network museums and the web
 
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...museums and the web
 
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...museums and the web
 

Más de museums and the web (20)

How to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting SiuHow to Give an Accessible Presentation - Yue-Ting Siu
How to Give an Accessible Presentation - Yue-Ting Siu
 
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
MW2011: N. Di Blas +, A “Smart” Authoring and Delivery Tool for Multichannel ...
 
MW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museumMW2011: D. Birchall + M. Henson, Gaming the museum
MW2011: D. Birchall + M. Henson, Gaming the museum
 
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
MW2011: G. Chae +, Can Social Tagging Be a Tool to Reduce the Semantic Gap be...
 
MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications fo...
MW2011: Klavans, J.  +, Computational Linguistics in Museums: Applications fo...MW2011: Klavans, J.  +, Computational Linguistics in Museums: Applications fo...
MW2011: Klavans, J. +, Computational Linguistics in Museums: Applications fo...
 
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
MW2011: L. Tallon + I. Froes, Going Mobile? Insights into the museum communit...
 
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
MW2011: D. Laursen, Guided expectations: a case study of a sound collage audi...
 
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia GuideMW2011: J. Flemming +, Launching the MFA Multimedia Guide
MW2011: J. Flemming +, Launching the MFA Multimedia Guide
 
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
MW2011: S. Fantoni, Mobile devices for orientation and way finding: the case ...
 
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor TrackingMW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
MW2011: J. Bickersteth + C. Ainsley, Mobile Phones and Visitor Tracking
 
MW2011 Best of the Web Awards
MW2011 Best of the Web AwardsMW2011 Best of the Web Awards
MW2011 Best of the Web Awards
 
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
MW2011: Quigley, S., Integration of Print and Digital Publishing Workflows at...
 
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
MW2011: Cope, A., Authority Records, Future Computers and Other Unfinished Hi...
 
MW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data SculptingMW2011: S. Kenderdine, Cultural Data Sculpting
MW2011: S. Kenderdine, Cultural Data Sculpting
 
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
MW2010: N. Proctor, The Museum Is Mobile: Cross-platform content design for a...
 
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
MW2010: J. Doyle + M. Doyle, Mixing Social Glue with Brick and Mortar: Experi...
 
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
MW2010: M. Petrie + L. Tallon, The iPhone effect?: Comparing visitors’ and mu...
 
MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network MW2010: Building an online research community: The Reciprocal Research Network
MW2010: Building an online research community: The Reciprocal Research Network
 
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
MW2010: S. Hazan et al., ATHENA: A Mechanism for Harvesting Europe's Museum H...
 
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
MW2010: D. Peacock, Putting Mallala on the map: Creating a wiki community wit...
 

Último

Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 

Último (20)

Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 

Dan Zambonini and Mike Ellis, hoard.it: Aggregating, displaying and mining object-data without consent

  • 1. hoard.it : Stealing your data Or... “Where is your online value?” Or... “Originality sucks” Dan Zambonini www.boxuk.com Museums and the Web 2009, Indianapolis, April 16
  • 3. WARNING 1. I am playing Devil’s Advocate 2. These are‘thoughts in progress’
  • 4. Introduction 1. The hoard.it project 2. Museums and the Web: where’s the value?
  • 5. Introduction 1. The hoard.it project 2. Museums and the Web: where’s the value?
  • 8. Cross-Collections Projects “Search through the cultural collections of Europe” “explore and comment on collections” “find and explore digital collections from museums” “Discover cultural objects, collections”
  • 9. Why is this a Problem? 1. Some duplication of effort • £25,000 - £100,000 to put collections online • £1,500 - £6,500 per cross-collection project 2. Potential end-user confusion 3. Usually only include larger institutions 4. Is there really a need?
  • 10. Our Approach • Use data that already exists • No cost/duplication of effort • No input or changes from museums • Lightweight, open to all • Re-expose the data programmatically • Enable easy re-use
  • 14. Difficulties and Limitations • Must have collections online • Must have a consistent template • Slow; not real-time • Technical variations (encoding, standards) • Rudimentary: Flash/Forms a barrier
  • 15. Difficulties: Normalization • Dates • circa 19th century, 1960s, 2008-01, 1Jan ’52, 2000 BC, 30s, April 4 1934, 04-76, 1783-25-04, 10-11-64, about 200 AD, Victorian, 1100-1150, ... • http://feeds.boxuk.com/convert/date/ • Location • Points of interest, cities, towns, countries, administrative regions, political regions, ancient names, continents, postal codes, co-ordinates, ... • http://developer.yahoo.com/geo/
  • 16. The Data Virtual Museum of Canada! Carnegie Museum of Art! Smithsonian NASM! National Museum of Australia! National Portrait Gallery! Imperial War Museum! National Museums of Scotland! Ingenious! Museum of London: E20CL! British Museum! Victoria and Albert Museum! National Maritime Museum! Powerhouse! Science Museum! 24 Hour Museum! Freebase: Events! Wikipedia: List of Painters! 0! 2000! 4000! 6000! 8000! 10000! 12000! 14000! 16000!
  • 17. The Data Virtual Museum of Canada! Carnegie Museum of Art! Smithsonian NASM! National Museum of Australia! National Portrait Gallery! Imperial War Museum! National Museums of Scotland! Ingenious! Museum of London: E20CL! British Museum! Victoria and Albert Museum! National Maritime Museum! Powerhouse! Science Museum! 24 Hour Museum! Freebase: Events! Wikipedia: List of Painters! 0! 2000! 4000! 6000! 8000! 10000! 12000! 14000! 16000! 70,000 objects
  • 18. The Data • URL 100% • Identifier 95% • Title 100% • Description 70% • Image 85% • Creator 50% • Created Date 75% • Copyright 50% • Dimensions 45% • Subject 65% • Location 45% • Materials 65%
  • 19. Data Mining - Location 65% Europe 15% Asia 14% North America 4% Oceania Percentage of objects from the same continent as museum: • North America: 85% • Europe: 75% • Oceania: 65%
  • 20. % of objects by continent of origin! 0! 10! 20! 30! 40! 50! 60! 70! 80! 90! -1000! -900! -800! -700! -600! -500! -400! -300! -200! -100! 0! 100! 200! 300! 400! 500! Year! 600! 700! 800! 900! 1000! 1100! 1200! 1300! 1400! 1500! 1600! 1700! 1800! 1900! 2000! Asia! Africa! Europe! Oceania! North America! South America! Data Mining - Date/Location
  • 21. % of objects by material! 0! 5! 10! 15! 20! 25! 30! 35! 40! 0! 10 0! 20 0! 30 0! 40 0! 50 0! 60 0! 70 0! 80 0! 90 0! 10 00 ! Year! 11 0 0! 12 00 ! 13 00 ! 14 00 ! 15 00 ! 16 00 ! 17 00 ! 18 00 ! 19 00 ! 20 00 ! Clay! Gold! Silver! Stone! Data Mining - Date/Material
  • 22. How it has been used • Experiments: http://hoard.it/labs/ • UK Museums on the Web 2008 Hack Day • Who knows...? Photo courtesy of Brian Kelly
  • 23. How it has been used
  • 26. Do you offer anything? dbPedia, Freebase
  • 27. What can you offer? • Expertise • Media • The Physical Space • Reputation and Trust • Audience • Voice, Exposure and Influence
  • 28. What’s changed? “...not all information should flow everywhere; only the meaningful should be transmitted. But in the network economy only signals in real time (or close to it) are truly meaningful. Examine the speed of knowledge in your system. How can it be brought closer to real time? If this requires the cooperation of subcontractors, distant partners, and far- flung customers, so much the better.” Kevin Kelly http://www.kk.org/newrules/blog/2009/04/if-you-are-not-in-real-time-yo.php
  • 29. What’s changed? !quot;#$%#$& !quot;#$%& '($(& )%*+,-%.& '()%&
  • 32. What’s changed? !quot;#$%&'() *+#,) !quot;#$%&'( )*#+%$%&'( ,--.**%+%$&'( /0.(1%20&(3.#&quot;4.*( 5.*%26(
  • 33. UK Newspaper Example ,-./012345quot; #!quot; +quot; *quot; F44:G2.:=quot; 6278925:quot; )quot; (quot; 'quot; H2-1Iquot;JKL.8==quot; &quot; H2-1Iquot;A2-1quot; %quot; H2-1Iquot;A-..4.quot; $quot; H2-1Iquot;CM2.quot; #quot; H2-1Iquot;>8187.2LBquot; !quot; D5-E08quot;D=8.=quot; ;2/8<44:quot;;25=quot; ;-525/-21quot;>-G8=quot; >B8quot;N02.O-25quot; >B8quot;P5O8L85O85Mquot; >B8quot;C05quot; >B8quot;>-G8=quot; 9CCquot;C0<=/.-<8.=quot; >?-@8.quot;;4114?8.=quot; A85345=quot;-5quot;$&quot;B.=quot;
  • 34. For example • Let your patrons collaborate • Let your patrons run your space • Give local communities a voice • Provide advice and guidance • Collect & distribute niche knowledge • ... • You know better than I do.
  • 35. What has to change? • A focus on proven user needs • Re-usable services, not more data • Smaller projects • Iterative approaches • A real commitment to the web platform • (At least some) In-house development
  • 36. How do we get there? • Should web projects generate revenue? • Don’t be afraid of re-inventing the wheel • Demand all projects use/expose APIs that are easy (REST not SOAP/OAI) and publicized • Show early, show often • Annoy funding bodies to support more, smaller, longer (i.e. iterative) ‘boring’ projects, and less ‘big, audacious’ projects.
  • 37. Summary • We stole your data... • But then so are lots of other people... • So produce value elsewhere. • Ideas are harmful: do what’s proven... • But do it brilliantly. • And to do that, we need change.
  • 38. Thank you www.boxuk.com dan@boxuk.com twitter.com/zambonini
  • 39. Thank you www.boxuk.com dan@boxuk.com twitter.com/zambonini

Notas del editor