SlideShare una empresa de Scribd logo
1 de 102
Descargar para leer sin conexión
Putting the world’s
          cultural heritage online
           with crowdsourcing
                          Frederick Zarndt
                          @cowboyMontana
                   frederick@frederickzarndt.com
             Slides @ http://bit.ly/crowdsrootstech2013

               CCS / Digital Divide Data / DL Consulting
Photo held by John Oxley Library, State Library of
Queensland. Original from Courier-mail, Brisbane,
Queensland, Australia.
Crowds
In 2004 James Surowiecki published ...




                 The Wisdom of Crowds: Why
                 the Many Are Smarter Than
                 the Few and How Collective
                 Wisdom Shapes Business,
                 Economies, Societies and
                 Nations

                              In it he says ...
... a crowd of persons that are
           diverse ...
...
      in
         d   ep
                  en
                       de
                            nt
                                 ...
... and
decentralized ...
usually make
    better
judgements or
decisions than
single persons
“Country Fair” by Grandma Moses. Original painting 1950.
“crowdsourcing”

was coined by Jeff Howe in “The rise of
 crowdsourcing” published in Wired
        magazine June 2006.
web trends for
  “crowdsourcing”
Jan-2006 to Jan-2013
• On the date of publication of Jeff Howe’s Wired
          magazine article, 1-Jun-2007, Wikipedia did not have
          an entry (list) of crowdsourcing projects*.
        • On 25-Jan-2010 Wikipedia’s list of crowdsourcing
          projects had 35 entries*.
        • On 17-Mar -2013 Wikipedia’s list of crowdsourcing
          projects had 158 entries+.

* From Internet Archives’ Wayback Machine.
+ Wikipedia contributors, "List of crowdsourcing projects," Wikipedia, The Free Encyclopedia, https://
en.wikipedia.org/wiki/List_of_crowdsourcing_projects (accessed March 17, 2013).
Crowdsourcing is the practice of obtaining
                needed services, ideas, or content by
           soliciting contributions from a large group of
                people, and especially from an online
              community, rather than from traditional
              employees or suppliers. ... [It] is different
           from ordinary outsourcing since it is a task or
            problem that is outsourced to an undefined
             public rather than a specific, named group.


Wikipedia contributors, "Crowdsourcing," Wikipedia, The Free Encyclopedia, http://en.wikipedia.org/wiki/
Crowdsourcing (accessed March 17, 2013)
crowdcollaboration    crowd*




                                  crowdsourcing
         ng
       di




               citizen science
     un
     df
ow
cr




crowdcasting            crowdvoting
what is Alexa?

•   Alexa collects and analyzes Internet data for purposes of web analytics. Web analytics is the
    measurement, collection, analysis and reporting of Internet data for the purposes of understanding
    and optimizing web usage. Alexa is now a subsidiary of Amazon.

•   Alexa was founded in 1996 by Brewster Kahle (Internet Archive) and Bruce Gilliat.

•   Alexa operations includes archiving of webpages as they are crawled. This database served as the
    basis for the creation of the Internet Archive accessible through the Wayback Machine.

•   Alexa continually crawls all publicly-available websites to create a series of snapshots of the web.

•   Alexa gathers information from a variety of sources to provide key statistics about each site on the
    web, for example, Traffic Rank, the number of PageViews, and site Speed, Bounce Rate, etc.
    This information is derived from Alexa toolbar users (~6,000,000 worldwide).
definitions

        •   A PageView is a request for a file whose type is defined as a page.

        •   A Unique Visitor is a uniquely identified client generating requests on the web
            server or viewing pages within a defined time period (i.e. day, week or month). A
            Unique Visitor counts once within the timescale.

        •   A Visit is a series of page requests from the same uniquely identified client with a
            time of no more than 30 minutes between each page request.

        •   Bounce Rate is the percentage of visits where the visitor enters and exits at the same
            page without visiting any other pages on the site in between.

        •   World | Country Rank is a function of the average daily unique visits and the number
            of unique pages requested.




definitions adapted from Wikipedia http://en.wikipedia.org/wiki/Web_analytics
crowdfunding




Kickstarter (http://www.kickstarter.com/) was 1st launched in Apr 2009. As of 17-Mar-2013
                  its Alexa Internet traffic rank is 751 (global) / 294 (USA).
  35,000+ projects successfully funded with $500,000,000+ by 3,000,000+ people.
crowdvoting




reddit (http://www.reddit.com/) was 1st launched in June 2005. As of 17-Mar-2013 its Alexa
 Internet traffic rank is 124 (global) / 54 (USA). reddit had more than 55,000,000 unique
  visitors from 175 countries who cast more than 17,000,000 votes about which stories are
                                            important.
Amazon Mechanical Turk (https://www.mturk.com) was launched Nov 2005.
As of 17-Mar-2013 its Alexa Internet traffic rank is 8,219 (global) / 3,036 (USA).
crowdsourcing




Each day 200,000,000 recaptcha’s are solved by humans around the world.
Zooniverse (https://www.zooniverse.org) was 1st launched as Galaxy Zoo July 2007.
 As of 17-Mar-2013 it has 801,682 participants worldwide. Its Alexa traffic rank is
                        271,574 (global) / 127,695 (USA).
crowdcollaboration
Wikipedia

            •     Wikipedia began 2001

            •     Now in 285 languages, 24,640,000 articles

            •     4,210,000 articles in English

            •     More than 1,000,000 articles each in German, French, Italian, and Dutch

            •     40 wikipedia languages with more than 100,000 articles

            •     112 wikipedia languages with more than 10,000 articles

            •     488,470,000 unique visitors (Jan 2013)

            •     84,848,000 active (5+ edits) contributors

            •     Alexa global traffic rank: #6 in worldwide web traffic




Statistics from Wikimedia Report Card http://reportcard.wmflabs.org
Family Search Indexing was 1st launched (beta) 2004. As of 17-Mar-2013 Family Search’s
 (https://familysearch.org/) Alexa Internet traffic rank is 4,480 (global) / 1,208 (USA).
• Started (beta) 2004

         • More than 780,000 worldwide registered volunteers from ~25
           countries index records relevant to family history

         • Approximately 100,000 active volunteers each month

         • UI in Chinese, English, German, French, Italian, Japanese,
           Korean, Portuguese, and Russian

         • Blind double-key entry with arbitration / reconciliation

         • More than 1,500,088,741 records indexed (July 2012)

         • Accuracy typically > 99.95%

Statistics from private communication with Family Search 5-Jul-2013
Project Gutenberg was 1st launched Dec 1971.
As of 17-Mar-2013 Project Gutenberg’s Alexa Internet traffic rank 5,192 (global) / 2,851 (USA).
• Started Dec 1971

• Worldwide volunteers transcribe or proofread OCR’d public
  domain books through Distributed Proofreaders

• 42,000 free ebooks completed (March 2013)

• More than 100,000 free ebooks offered by its partners and
  affiliates

• Partner / affiliated projects for Australia, Canada, Europe,
  Germany, Runeberg (Nordic literature), self-published
  contemporary authors, Consortia Center in collaboration with
  the World eBook Library, ...
As of 17-Mar-2013 the National Library of Australia’s (http://trove.nla.gov.au/) Alexa Internet traffic
   rank is 14,490 (global) / 330 (Australia). Trove gets ~75% of all National Library web traffic.
National Library of
                          Australia
           • Online since 2008
           • 7,200,000+ pages
           • Top text corrector 1,250,000 lines (June 2012)
           • 2,450,000+ lines corrected each month (average for
             1st 6 months 2012)
           • 68,908,757 lines corrected as of July 2012, up from
             42,411,468 lines corrected July 2011.
           • 63,613 total registered users (July 2012)
           • 4,146 active users (June 2012)
Statistics from private communication with the National Library of Australia Oct 2012
Courtesy of Tim Sherrat, Tinkerer-in-Chief at WraggeLabs Emporium (http://wraggelabs.com/
As of 17-Mar-2013 National Library of Finland’s (http://www.nationallibrary.fi/) Alexa Internet global
       traffic rank is 4,303,901. Its Internet traffic rank for Finland was 199 as of 2-Apr-2012.
National Library of
              Finland
• Digitalkoot is a project to improve OCR text in digitized
  newspapers -- by playing games!
• Digitalkoot is a collaboration between the National
  Library and Microtask
• Players correct OCR text by playing Myyräsillassa
  (Mole Bridge) or Myyräjahdissa (Mole Hunt)
• National Library has 4,000,000+ digitized pages
• 109,321 registered players (October 2012)
• Since February 2011 8,024,530 micro-tasks have been
  completed
As of 17-Mar-2013 UC Riverside’s Alexa Internet traffic rank is 11,782 (global) / 4,120 (USA).
                    CDNC gets ~3.30% of all UC Riverside web traffic.
California Digital
    Newspaper Collection
• CDNC began digitizing newspapers in 2005 as part of
  the Library of Congress National Digital Newspapers
  Program (NDNP)
• Newspapers digitized to article-level in addition to
  page-level as required by NDNP (same as Utah Digital
  Newspapers)
• Since 2009 hosted on Veridian at http://cdnc.ucr.edu
• Collection size 55,970 issues, 495,175 pages, 5,658,224
  articles, 498,000,000+ lines (Mar-2013)
OCR text correction

• OCR text correction added August 2011
• Corrections are done line by line
• ~578,000+ lines of text corrected Oct 2012
• ~935,398+ lines of text corrected Mar 2013
• ~2% of the collection corrected, 98% to go!
• Top corrector 327,244 lines > 2x 2nd corrector
Cambridge Public Library
    Historic Newspaper Collection


• Cambridge Historic Newspapers online since Jan 2012.
• Cambridge Massachusetts Public Library digitized local
  newspapers (http://cambridge.dlconsulting.com/)
• Newspapers digitized to article-level
• Collection size 6,346 issues, 59,070 pages, 669,406
  articles (Mar-2013)
• Collection includes 13,099 obituary cards
Why correct text?
 Here’s why ...
Raw OCR text                                    Newspaper image
Deaths. lln»rieff, Esq. of <c .. Qn.
Sunday, the till. greatly Drandrellt, of
Orms4irJi.- ~ ; ;✓ ' • * On ijfr r inn
ljjjil F iij '11 f Havodivyd,
Carnarvonshire, S ; **" *- ' « ' March
Oxford, F. Tfovmeud, Uerald. » • V .
•On Tncsdav last, Mr. Charles.
IWilinson, this 8 ; had vf thesis#,, a
week ago, which tcrminate<i'iu his
death. . / ' ■ O'i Sunday, dJst nit. at.
AsbtCnvHall, mar Lancaster,
Mr.,Geo. Worn ick, many years
house'steward hit late Once The
Hamilton and Brandon. He locked
himself h»oWn'r«wte<: soon. twelve
o'clock" that dny, and fii»-d a loaded
pistol "through Ins bead, 1 which
instantaneously killed him. Coronet's
Verdict, shot himself in a temporary fit of
Friday week,



Excerpt from The British Newspaper Archive, Chester Courant, Tuesday 6-Apr-1819, page 3.
Motivation
Graphic from Kaufmann et al. “More than fun and money. Worker Motivation in
Crowdsourcing – A Study on Mechanical Turk.”
Wisdom of crowds


                             Each person should have private information even if
          Diversity          it's just an eccentric interpretation of the known
                             facts.
                             People's opinions aren't determined by the opinions
       Independence
                             of those around them.

                             People are able to specialize and draw on local
     Decentralization
                             knowledge.

                             Some mechanism exists for turning private
        Aggregation
                             judgments into a collective decision.

James Surowiecki, The Wisdom of Crowds: Why the Many Are Smarter Than the Few and How
Collective Wisdom Shapes Business, Economies, Societies and Nations, Anchor Books, New York, 2005.
Cognitive surplus


        ... people are learning to use their free time for creative activities
        rather than consumptive ones [such as watching TV] ...

        ... the total human cognitive effort in creating all of Wikipedia in
        every language is about one hundred million hours ...

        ... Americans alone watch two hundred billion hours of TV every
        year, or enough time, if it would be devoted to projects similar to
        Wikipedia, to create about 2000 of them ...




Clay Shirky. Cognitive surplus: Creativity and generosity in a connected age. Penguin Press. New York. 2010.
Motivation
                       Genealogists and family historians

                      • The 2012 National Library of Australia’s Trove
                        status report showed that ~50% of Trove users are
                        family historians

PAPERSPAST            • National Library of New Zealand survey found that
                        ~50% of PapersPast users are genealogists
                      • A 2013 California Digital Newspaper Collection
                        survey shows that more than 65% of its users are
                        genealogists; 75% are 50 years old or older
                      • A 2012 Utah Digital Newspapers survey showed
                        that 72% of its users are genealogists*
*John Herbert and Randy Olsen. “Small town papers: Still delivering the news”. Paper given
at 2012 World Library and Information Congress. Helsinki. August 2012.
Motivation
                                          Trove users’ report


             • “I enjoy the correction - it’s a great way to learn more about past
             history and things of interest whilst doing a ‘service to the
             community’ by correcting text for the benefit of others.”

             • “I have recently retired from IT and thought that I could be of
             some assistance to the project. It benefits me and other people. It
             helps with family research.”




From Rose Holley in “Many Hands Make Light Work.” National Library of Australia March 2009.
Motivation
                                        CDNC users’ report



             “I am interested in all kinds of history. I have pursued genealogy as a
               hobby for many years. I correct text at CDNC because I see it as a
             constructive way to contribute to a worthwhile project. Because I am
                                interested in history, I enjoy it.”
                                                     Wesley, California




Personal communications with CDNC text correctors.
Motivation
                                        CDNC users’ report


                “I only correct the text on articles of local interest - nothing at state,
               national or international level, no advertisements, etc.  The objective
                    is to be able to help researchers to locate local people, places,
               organizations and events using the on-line search at CDNC.  I correct
               local news & gossip, personal items, real estate transactions, superior
                court proceedings, county and local board of supervisors meetings,
                       obituaries, birth notices, marriages, yachting news, etc.”
                                                     Ann, California




Personal communications with CDNC text correctors.
Motivation
                                        CDNC users’ report

             “I am correcting text for the Coronado Tent City Program for 1903. 
               It is important to correct any problems with personal names and
                 other information so that researchers will be able to search by
              keyword and be assured of retrieving desired results. ... type fonts
              cause a great deal of difficulty in digitizing the text and can cause
               problems for searchers.  Also, many of the guests' names at Tent
            City and Hotel Del Coronado were taken from the registration books
            and reported in the Program.  This led to many problems in spelling
            of last names and the editors were not careful to be consistent in the
             spellings.  This Program is an important resource since it provides
              an excellent picture of daily life in Tent City and captures much of
                                 the history of Coronado itself.”
                                                     Gene, California

Personal communications with CDNC text correctors.
Motivation
                                        CDNC users’ report



                     “I have always been interested in history, especially the
                 development of the American West, and nothing brings it alive
                   better than newspapers of the time. I believe them to be an
                 invaluable source of knowledge for us and future generations.”
                                                 David, United Kingdom




Personal communications with CDNC text correctors.
Motivation
                              CDNC users’ report

                 CDNC is an excellent source of information matching my
               personal interest in such topics as sea history, development of
              shipbuilding, clippers and other ships etc. ... Unfortunately, the
             quality of text ... is rather poor I’m afraid. This is why I started to
                 do all corrections necessary for myself ... and to leave the
                corrected text for use of others. .... I am not doing this very
                      regularly as this is just my hobby and pleasure.
                                           Jerzey, Poland




Personal communications with CDNC text correctors.
Ok, raw OCR newspaper text
  is bad. But how much
 difference can one person
     (me) really make?
You can make a
               difference




Graphic courtesy of TYPEinspire (http://typeinspire.com/)
User        Lines corrected   Lines corrected   User
                 1             242,965          1,456,906        1
                 2              87,515          1,385,369        2
                 3              31,318          1,010,360        3
                 4              24,144           960,230         4
                 5              23,184           847,340         5
                 6              19,240           786,147         6
                 7             18,898            657,187         7
                 8              16,875           600,513         8
                 9              11,784           582,276         9
                10               9,762           565,384        10

Statistics from Oct 2012
uncorrected OCR accuracy by
                     newspaper title
                                                           OCR character   ~OCR word
                               Title
                                                             accuracy       accuracy*

            PRP Pacific Rural Press 1871 - 1922                92.6%         68.1%

            SFC San Francisco Call 1890 - 1913                 92.6%         68.1%

            LAH Los Angeles Herald 1873 - 1910                 88.7%         54.9%

            LH Livermore Herald 1877 - 1899                    88.6%         54.6%

            DAC Daily Alta California 1841 - 1891              88.2%         53.4%

            CFJ California Farmer and Journal
                                                               86.5%         48.4%
            of Useful Sciences 1855 - 1880

            SN Sausalito News 1885 - 1922                      70.4%         17.3%


*Word   accuracy assumes average word length is 5 characters
corrected OCR accuracy by
           newspaper title
                                        OCR character   Corrected
                   Title
                                          accuracy      accuracy

PRP Pacific Rural Press 1871 - 1922         92.6%        99.3%

SFC San Francisco Call 1890 - 1913          92.6%        99.6%

LAH Los Angeles Herald 1873 - 1910          88.7%        99.1%

LH Livermore Herald 1877 - 1899             88.6%        99.9%

DAC Daily Alta California 1841 - 1891       88.2%        99.9%

CFJ California Farmer and Journal
                                            86.5%        99.8%
of Useful Sciences 1855 - 1880

SN Sausalito News 1885 - 1922               70.4%        100.0%
corrected OCR accuracy by
                       newspaper title
                               OCR character       ~OCR word    Corrected   ~Corrected word
                Title
                                 accuracy           accuracy*   accuracy       accuracy*

        PRP 1871 - 1922              92.6%             68.1%     99.3%          96.5%

        SFC 1890 - 1913              92.6%             68.1%     99.6%          98.0%

        LAH 1873 - 1910              88.7%             54.9%     99.1%          95.6%

        LH 1877 - 1899               88.6%             54.6%     99.9%          99.5%

        DAC 1841 - 1891              88.2%             53.4%     99.9%          99.5%

        CF 1855 - 1880               86.5%             48.4%     98.3%          91.8%

        SN 1885 - 1922               70.4%             17.3%     100.0%         100.0%



*Word   accuracy assumes average word length is 5 characters
correction accuracy by user

         Average uncorrected   Average corrected
  User
            text accuracy        text accuracy
   A            70.4%               100.0%
   B            87.1%               99.5%
   C            95.4%               99.5%
   D            86.5%               98.3%
   E            95.3%               100.0%
   F            91.0%               100.0%
   G            91.0%               99.8%
   H            90.5%               99.0%
   I            96.6%               99.8%
   J            94.8%               100.0%
   K           86.8%                99.3%
the long tail* of crowdsourced OCR text
                   correction

                    a probability distribution has a long tail if a larger share
                    of population rests within its tail than it would under a
                                       normal distribution

                    the most productive users represent a small fraction of
                    the total user population and ~50% of total production,
                         or, said a different way, the largest fraction but
                        individually not quite so productive users are as
                             important as the most productive users




*The phrase “long tail” was popularized by Chris Anderson in the October 2004 Wired magazine article
The Long Tail and by Clay Shirky’s February 2003 essay “Power laws, web logs, and inequality”.
OCR text correction long tails

                            3,000,000




                            2,250,000
                                           50%
300000




top corrector 242,965       1,500,000   top corrector 1,456,906
225000




           50%               750,000

150000                                                                         50%


                                   0


 75000
                                                                  NLA lines corrected by text corector



                                                    50%
     0

                        CDNC lines corrected by text corrector
Website traffic
Website traffic


           After a crowdsourcing transcription project of diaries from the American
           War Between the States, Nicole Saylor, Head of Digital Library Services
           at the University of Iowa Libraries, reported



                      “On June 9, 2011, we went from about 1000 daily
                      hits to our digital library on a really good day to
                      more than 70,000.”



Nicole Saylor interviewed by Trevor Owens. “Crowdsourcing the Civil War: Insights Interview with Nicole Saylor” blog post at http://
blogs.loc.gov/digitalpreservation/2011/12/crowdsourcing-the-civil-war-insights-interview-with-nicole-saylor/. Dec 6, 2011.
Website traffic

       Website traffic at CDNC before / after implementing
                           crowdsourcing


                     before crowdsourcing         after crowdsourcing
                                                                           change
                   11-Jun-2011 / 12-Jul-2011   11-Jun-2012 / 12-Jul-2012

    visits                 17,485                      21,488              +22.9%

unique visitors            11,381                      13,376              +17.5%

visit duration            9m 24s                       11m 7s              +18.3%

 bounce rate               51.3%                       44.5%               -6.8%

pages per visit             14.9                         11.7              -21.5%
Website traffic
Crowdsourcing
   benefits




        Public domain photo courtesy of US Navy
$
                 Economics

   Financial value of outsourced OCR text correction for
   newspapers?
   The Assumptions
• 25 to 50 characters per line in a newspaper column:
  Assume 40 characters per line (CDNC sample average)
• Outsourced text transcription or correction costs USD
  $0.35 to $1.20 per 1000 characters: Assume $0.50 per
  1000 characters
$
       Economics


$ 578,000 lines x 40 characters per line x 1/1000 x
  $0.50 = $11,560
$ 68,908,757 lines x 40 characters per line x
  1/1000 x $0.50 = $1,378,175
$
                                   Economics


               Financial value of in-house OCR text correction?
               The Assumptions
         • Correction takes 15 seconds per line
         • Cost is hourly wage plus benefits of lowest level
           employee, $10 for CDNC, $41.88* for Australia



AUD $40.38 = USD $41.88 is the actual labor value assumed by the National Library of Australia to calculate
avoided costs due to crowdsourced OCR text correction in its 2012 Trove Status Report.
$
          Economics


$ 578,000 lines x 15 seconds per line x 1/3600 hrs
  per second x $10.00 per hr = $24,083
$ 68,908,757 lines x 15 seconds per line x 1/3600 hrs
  per second x $41.88 per hr = $12,024,578
Accuracy



“His Accuracy Depends on Ours!"
Office for Emergency Management. Office of War Information.
Domestic Operations Branch. Bureau of Special Services. [Photo
held at US National Archives and Records Administration]
• Edwin Kiljin (Koninklijke Bibliotheek the Netherlands) reports
         raw OCR character accuracies of 68% for early 20th century
         newspapers
         • Rose Holley (National Library of Australia) reports raw OCR
         character accuracy varied from 71% to 98% on a sample Trove
         digitized newspapers



Edwin Kiljin. “The current state-of-art in newspaper digitization.” D-Lib Magazine. January/February 2008.
Rose Holley. “How good can it get? Analysing and improving OCR accuracy in large scale historic newspaper
digitisation programs. D-Lib Magazine. March/April 2009.
Public domain graphic courtesy of Wikimedia Commons.
Graphic is logo for Accuracy in Media (http://www.aim.org/)
Accuracy
                 Mapping texts* assesses digitization quality of digital
              newspapers by comparing the number of words recognized to
                          the total number of words scanned




* Mapping  texts is a collaboration between the University of North Texas and Stanford University aimed at
experimenting with new methods for finding and analyzing meaningful patterns embedded in massive
collections of digital newspapers.
Accuracy

How does low text accuracy affect search recall?
The Facts
• Average uncorrected OCR character accuracy of the
  CDNC sample data is ~89%
• Average length of an English word is 5 characters
• Average word accuracy is 89% x 89% x 89% x 89% x 89%
  = 55.8% - round up to 60% or 6 out of 10 words correct
Search recall no text correction




                                                                    ARND
                                                                         T
                                     ARNDT             ARNDT
                DT
             ARN             ARNDT
                                               ARNDT


                                        ARNDT            ARNDT




                                                                             ARNDT
                                       ARNDT




instances of “ARNDT” found                                     instances of “ARNDT” not found
Accuracy


The Facts
• Average corrected character accuracy of the CDNC
  sample data is ~99.4%
• Average word accuracy of CDNC corrected text is 99.4%
  x 99.4% x 99.4% x 99.4% x 99.4% = 97.0%
Search recall with text correction




                                                 ARNDT
                        ARNDT



                                        ARNDT               ARNDT

                                ARNDT
                                                 ARNDT

                                                    ARNDT
                                           ARNDT              ARNDT




                                                ARNDT




instances of “ARNDT” found                                          instances of “ARNDT” not found
Accuracy

         A search for “Arndt” at Chronicling America gives
         10,267 results*
         • If Chronicling America text accuracy is 55.8% (same as
           uncorrected CDNC sample), then 8,133 instances of
           “Arndt” were not found
         • If text accuracy is 97.0%, then 317 instances of “Arndt”
           were not found



*   Search performed 31 Oct 2012
Accuracy
Suppose the word/name is longer than 5
characters?
The Facts
• Assume that average uncorrected / corrected OCR
  character accuracy is ~89% / ~99% same as CDNC.
     Name        Name length Raw text accuracy Corrected text accuracy
     Eklund           6             49.7%               94.2%
     Kennedy          7            44.2%                 93.25
     Espinosa         8            39.4%                92.3%
    Bonaparte         9            35.0%                91.4%
    Chatterjee       10             31.2%               90.4%
Accuracy

 Chronicling America searches done 19-Mar-2013
 (6,025,474 pages from 1836 to 1922).


               Number of      Missing results with    Missing results with
 Name
             search results    raw text accuracy     corrected text accuracy

 Eklund          2,951               2,987                    182
Kennedy         360,723             455,392                  26,111
Espinosa         1,918               2,950                    160
Bonaparte       44,664               82,947                   4,203
Chatterjee         19                 42                       2
Resources




Public domain photo “A useful instruction for young sailors from the Royal
Hospital School, Greenwich” from the National Maritime Museum.
Comprehensive worldwide list of online
         newspaper archives




Wikipedia contributors, "List of online newspaper archives," Wikipedia, The Free Encyclopedia, https://
en.wikipedia.org/wiki/Wikipedia:List_of_online_newspaper_archives (accessed March 17, 2013).
Search many digital newspaper
          collections at once!




As of 17-Mar-2013 elephind (http://www.elephind.com) has indexed 930 newspapers from 11
   historical digital collections comprising 1,041,086 issues and 44,158,901 pages/articles.
Try crowdsourcing!

        Correct California newspapers at http://cdnc.ucr.edu


        Correct Australian newspapers http://trove.nla.gov.au


        Correct Cambridge MA newspapers http://bit.ly/cambridgepublic


        Correct Tennessee newspapers http://tndp.lib.utk.edu


        Correct Virginia newspapers http://virginiachronicle.com



Login with user name “crowdsatrootstech2013” or
     “crowdsatrootstech2013@gmail.com”,
             password “roots$tech”
Hãy thử crowdsourcing!
 Correct Vietnamese newspapers http://bit.ly/nationallibraryofvietnam




Попробуйте краудсорсинга!
 Or try Russian language periodicals http://bit.ly/russianperiodicals




  Kokeile crowdsourcing!
 Or try Finnish newspapers http://digi.lib.helsinki.fi/sanomalehti
Other resources

Mapping Texts at http://mappingtexts.stanford.edu/




            Wragge Labs at http://wraggelabs.com/




            Wikipedia list of crowdsourcing projects
                    https://en.wikipedia.org/wiki/
                  List_of_crowdsourcing_projects
?
             Frederick Zarndt
             @cowboyMontana
      frederick@frederickzarndt.com
Slides @ http://bit.ly/crowdsrootstech2013

CCS / Digital Divide Data / DL Consulting

                         Photo held by John Oxley Library, State Library of Queensland. Original from Courier-mail,

                         Brisbane, Queensland, Australia.
Try crowdsourcing!

        Correct California newspapers at http://cdnc.ucr.edu


        Correct Australian newspapers http://trove.nla.gov.au


        Correct Cambridge MA newspapers http://bit.ly/cambridgepublic


        Correct Tennessee newspapers http://tndp.lib.utk.edu


        Correct Virginia newspapers http://virginiachronicle.com



Login with user name “crowdsatrootstech2013” or
     “crowdsatrootstech2013@gmail.com”,
             password “roots$tech”
FYI about Trove


• If you hope to begin your text correction hobby
  with Trove’s family notices (births, deaths,
  weddings), you may have a tough go of it. As of
  17-Mar-2013, there were 768,333 family notices
  in Trove digitized newspapers; most seem to
  have already been corrected.
• Lack of text correction opportunity
  notwithstanding, now you know where to find
  768,333 family notices published in Australia
  from 1803 to 1954.
Try crowdsourcing!


         Correct British newspapers http://www.britishnewspaperarchive.co.uk/




The British Newspaper Archive is a subscription service from
brightsolid and the British Library. From now until the end of
 RootsTech you can use it at no cost with the user name and
                       password below.


 Login with user name “crowdsatrootstech2013” or
      “crowdsatrootstech2013@gmail.com”,
              password “roots$tech”
?
             Frederick Zarndt
             @cowboyMontana
      frederick@frederickzarndt.com
Slides @ http://bit.ly/crowdsrootstech2013

CCS / Digital Divide Data / DL Consulting

                         Photo held by John Oxley Library, State Library of Queensland. Original from Courier-mail,

                         Brisbane, Queensland, Australia.

Más contenido relacionado

La actualidad más candente

20120821 putting the world’s cultural heritage online with crowd sourcing sli...
20120821 putting the world’s cultural heritage online with crowd sourcing sli...20120821 putting the world’s cultural heritage online with crowd sourcing sli...
20120821 putting the world’s cultural heritage online with crowd sourcing sli...Frederick Zarndt
 
Information Visualization - Visualizing Digital Collections at Archive-It
Information Visualization - Visualizing Digital Collections at Archive-ItInformation Visualization - Visualizing Digital Collections at Archive-It
Information Visualization - Visualizing Digital Collections at Archive-ItMichele Weigle
 
Combining Social Media Storytelling With Web Archives
Combining Social Media Storytelling With Web ArchivesCombining Social Media Storytelling With Web Archives
Combining Social Media Storytelling With Web ArchivesShawn Jones
 
Library Blogging: Reaching Out with a Dynamic Web Presence
Library Blogging: Reaching Out with a Dynamic Web PresenceLibrary Blogging: Reaching Out with a Dynamic Web Presence
Library Blogging: Reaching Out with a Dynamic Web PresenceHeather Martyn
 
Documenting Ferguson: Building a community digital repository
Documenting Ferguson: Building a community digital repositoryDocumenting Ferguson: Building a community digital repository
Documenting Ferguson: Building a community digital repositoryChris Freeland
 
Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...
Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...
Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...Ken Chad Consulting Ltd
 
Establishing the Missouri Hub: A Service Hub for DPLA
Establishing the Missouri Hub: A Service Hub for DPLAEstablishing the Missouri Hub: A Service Hub for DPLA
Establishing the Missouri Hub: A Service Hub for DPLAChris Freeland
 
Digital Odyssey 2015 - Open Collections
Digital Odyssey 2015 - Open CollectionsDigital Odyssey 2015 - Open Collections
Digital Odyssey 2015 - Open CollectionsOurDigitalWorld
 
Organizing a DPLA Service Hub in Missouri
Organizing a DPLA Service Hub in MissouriOrganizing a DPLA Service Hub in Missouri
Organizing a DPLA Service Hub in MissouriChris Freeland
 
Documenting the Now: Supporting Scholarly Use & Preservation of Social Media ...
Documenting the Now: Supporting Scholarly Use & Preservation of Social Media ...Documenting the Now: Supporting Scholarly Use & Preservation of Social Media ...
Documenting the Now: Supporting Scholarly Use & Preservation of Social Media ...Chris Freeland
 
Pilots & Partnerships: University Academic Computing and University Libraries...
Pilots & Partnerships: University Academic Computing and University Libraries...Pilots & Partnerships: University Academic Computing and University Libraries...
Pilots & Partnerships: University Academic Computing and University Libraries...Chris Freeland
 
Newman Numismatic Portal Overview - Mar 2015
Newman Numismatic Portal Overview - Mar 2015Newman Numismatic Portal Overview - Mar 2015
Newman Numismatic Portal Overview - Mar 2015Chris Freeland
 
Building a Collaboration for Digital Publishing
Building a Collaboration for Digital PublishingBuilding a Collaboration for Digital Publishing
Building a Collaboration for Digital PublishingHarriett Green
 
20111114 b hyland government data and publishers
20111114   b hyland government data and publishers20111114   b hyland government data and publishers
20111114 b hyland government data and publishersBernadette Hyland-Wood
 
ISSOTL 2013: A Book By Any Other Name Would Read As Well? How E-Books Transfo...
ISSOTL 2013: A Book By Any Other Name Would Read As Well? How E-Books Transfo...ISSOTL 2013: A Book By Any Other Name Would Read As Well? How E-Books Transfo...
ISSOTL 2013: A Book By Any Other Name Would Read As Well? How E-Books Transfo...Stefanie Panke
 
Infrastructure, engagement, innovation: library directions
Infrastructure, engagement, innovation: library directionsInfrastructure, engagement, innovation: library directions
Infrastructure, engagement, innovation: library directionslisld
 
Nahl LIS Educators VWs 3-5-10 ALAFIN
Nahl LIS Educators VWs 3-5-10 ALAFINNahl LIS Educators VWs 3-5-10 ALAFIN
Nahl LIS Educators VWs 3-5-10 ALAFINDiane Nahl
 

La actualidad más candente (20)

20120821 putting the world’s cultural heritage online with crowd sourcing sli...
20120821 putting the world’s cultural heritage online with crowd sourcing sli...20120821 putting the world’s cultural heritage online with crowd sourcing sli...
20120821 putting the world’s cultural heritage online with crowd sourcing sli...
 
Information Visualization - Visualizing Digital Collections at Archive-It
Information Visualization - Visualizing Digital Collections at Archive-ItInformation Visualization - Visualizing Digital Collections at Archive-It
Information Visualization - Visualizing Digital Collections at Archive-It
 
Combining Social Media Storytelling With Web Archives
Combining Social Media Storytelling With Web ArchivesCombining Social Media Storytelling With Web Archives
Combining Social Media Storytelling With Web Archives
 
Library Blogging: Reaching Out with a Dynamic Web Presence
Library Blogging: Reaching Out with a Dynamic Web PresenceLibrary Blogging: Reaching Out with a Dynamic Web Presence
Library Blogging: Reaching Out with a Dynamic Web Presence
 
Documenting Ferguson: Building a community digital repository
Documenting Ferguson: Building a community digital repositoryDocumenting Ferguson: Building a community digital repository
Documenting Ferguson: Building a community digital repository
 
Wikipedia as a democracy
Wikipedia as a democracyWikipedia as a democracy
Wikipedia as a democracy
 
Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...
Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...
Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...
 
Establishing the Missouri Hub: A Service Hub for DPLA
Establishing the Missouri Hub: A Service Hub for DPLAEstablishing the Missouri Hub: A Service Hub for DPLA
Establishing the Missouri Hub: A Service Hub for DPLA
 
Books In A Web 2
Books In A Web 2Books In A Web 2
Books In A Web 2
 
Digital Odyssey 2015 - Open Collections
Digital Odyssey 2015 - Open CollectionsDigital Odyssey 2015 - Open Collections
Digital Odyssey 2015 - Open Collections
 
Organizing a DPLA Service Hub in Missouri
Organizing a DPLA Service Hub in MissouriOrganizing a DPLA Service Hub in Missouri
Organizing a DPLA Service Hub in Missouri
 
Documenting the Now: Supporting Scholarly Use & Preservation of Social Media ...
Documenting the Now: Supporting Scholarly Use & Preservation of Social Media ...Documenting the Now: Supporting Scholarly Use & Preservation of Social Media ...
Documenting the Now: Supporting Scholarly Use & Preservation of Social Media ...
 
Open Source & Libraries
Open Source & LibrariesOpen Source & Libraries
Open Source & Libraries
 
Pilots & Partnerships: University Academic Computing and University Libraries...
Pilots & Partnerships: University Academic Computing and University Libraries...Pilots & Partnerships: University Academic Computing and University Libraries...
Pilots & Partnerships: University Academic Computing and University Libraries...
 
Newman Numismatic Portal Overview - Mar 2015
Newman Numismatic Portal Overview - Mar 2015Newman Numismatic Portal Overview - Mar 2015
Newman Numismatic Portal Overview - Mar 2015
 
Building a Collaboration for Digital Publishing
Building a Collaboration for Digital PublishingBuilding a Collaboration for Digital Publishing
Building a Collaboration for Digital Publishing
 
20111114 b hyland government data and publishers
20111114   b hyland government data and publishers20111114   b hyland government data and publishers
20111114 b hyland government data and publishers
 
ISSOTL 2013: A Book By Any Other Name Would Read As Well? How E-Books Transfo...
ISSOTL 2013: A Book By Any Other Name Would Read As Well? How E-Books Transfo...ISSOTL 2013: A Book By Any Other Name Would Read As Well? How E-Books Transfo...
ISSOTL 2013: A Book By Any Other Name Would Read As Well? How E-Books Transfo...
 
Infrastructure, engagement, innovation: library directions
Infrastructure, engagement, innovation: library directionsInfrastructure, engagement, innovation: library directions
Infrastructure, engagement, innovation: library directions
 
Nahl LIS Educators VWs 3-5-10 ALAFIN
Nahl LIS Educators VWs 3-5-10 ALAFINNahl LIS Educators VWs 3-5-10 ALAFIN
Nahl LIS Educators VWs 3-5-10 ALAFIN
 

Destacado

20130630 What motivates library crowdsourcing volunteers? [ALA LITA]
20130630 What motivates library crowdsourcing volunteers? [ALA LITA]20130630 What motivates library crowdsourcing volunteers? [ALA LITA]
20130630 What motivates library crowdsourcing volunteers? [ALA LITA]Frederick Zarndt
 
How the british library and the singapore natinoal library support social cha...
How the british library and the singapore natinoal library support social cha...How the british library and the singapore natinoal library support social cha...
How the british library and the singapore natinoal library support social cha...Frederick Zarndt
 
Many hands make light work, the american version [charleston library conferen...
Many hands make light work, the american version [charleston library conferen...Many hands make light work, the american version [charleston library conferen...
Many hands make light work, the american version [charleston library conferen...Frederick Zarndt
 
An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...Frederick Zarndt
 
2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...
2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...
2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...Frederick Zarndt
 
Coronado public library digital newspapers workshop local partnerships [oct 2...
Coronado public library digital newspapers workshop local partnerships [oct 2...Coronado public library digital newspapers workshop local partnerships [oct 2...
Coronado public library digital newspapers workshop local partnerships [oct 2...Frederick Zarndt
 
An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...Frederick Zarndt
 

Destacado (7)

20130630 What motivates library crowdsourcing volunteers? [ALA LITA]
20130630 What motivates library crowdsourcing volunteers? [ALA LITA]20130630 What motivates library crowdsourcing volunteers? [ALA LITA]
20130630 What motivates library crowdsourcing volunteers? [ALA LITA]
 
How the british library and the singapore natinoal library support social cha...
How the british library and the singapore natinoal library support social cha...How the british library and the singapore natinoal library support social cha...
How the british library and the singapore natinoal library support social cha...
 
Many hands make light work, the american version [charleston library conferen...
Many hands make light work, the american version [charleston library conferen...Many hands make light work, the american version [charleston library conferen...
Many hands make light work, the american version [charleston library conferen...
 
An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...
 
2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...
2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...
2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...
 
Coronado public library digital newspapers workshop local partnerships [oct 2...
Coronado public library digital newspapers workshop local partnerships [oct 2...Coronado public library digital newspapers workshop local partnerships [oct 2...
Coronado public library digital newspapers workshop local partnerships [oct 2...
 
An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...An international survey of born digital legal deposit policies and practices ...
An international survey of born digital legal deposit policies and practices ...
 

Similar a 20130321 Putting the world's cultural heritage online with crowdsourcing [rootstech salt lake city]

20120821 putting the world’s cultural heritage online with crowd sourcing [na...
20120821 putting the world’s cultural heritage online with crowd sourcing [na...20120821 putting the world’s cultural heritage online with crowd sourcing [na...
20120821 putting the world’s cultural heritage online with crowd sourcing [na...Frederick Zarndt
 
Hello islandora building a digital repository nov 30, 2016 v6
Hello islandora  building a digital repository nov 30, 2016 v6Hello islandora  building a digital repository nov 30, 2016 v6
Hello islandora building a digital repository nov 30, 2016 v6eohallor
 
Digital Transformation and Data - the Wikimedia Residency at the University o...
Digital Transformation and Data - the Wikimedia Residency at the University o...Digital Transformation and Data - the Wikimedia Residency at the University o...
Digital Transformation and Data - the Wikimedia Residency at the University o...Ewan McAndrew
 
20140408 digital newspapers collections [idlc kuala lumpur]
20140408 digital newspapers collections [idlc kuala lumpur]20140408 digital newspapers collections [idlc kuala lumpur]
20140408 digital newspapers collections [idlc kuala lumpur]Frederick Zarndt
 
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...Frederick Zarndt
 
Crowdsourcing and social engagement: potential, power and freedom for librari...
Crowdsourcing and social engagement: potential, power and freedom for librari...Crowdsourcing and social engagement: potential, power and freedom for librari...
Crowdsourcing and social engagement: potential, power and freedom for librari...Rose Holley
 
20130123 Crowdsourcing [hamilton library u of hi]
20130123 Crowdsourcing [hamilton library u of hi]20130123 Crowdsourcing [hamilton library u of hi]
20130123 Crowdsourcing [hamilton library u of hi]Frederick Zarndt
 
What If You Let Citizens Build Your Website?
What If You Let Citizens Build Your Website?What If You Let Citizens Build Your Website?
What If You Let Citizens Build Your Website?GovLoop
 
Community Generated Databases for NY State History Conference 2013
Community Generated Databases for NY State History Conference 2013Community Generated Databases for NY State History Conference 2013
Community Generated Databases for NY State History Conference 2013Larry Naukam
 
Leslie Johnston Keynote, Best Practices Exchange 2011
Leslie Johnston Keynote, Best Practices Exchange 2011Leslie Johnston Keynote, Best Practices Exchange 2011
Leslie Johnston Keynote, Best Practices Exchange 2011lljohnston
 
Humanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse PlatformHumanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse PlatformUCLDH
 
LIS 653 Posters Fall 2014
LIS 653 Posters Fall 2014 LIS 653 Posters Fall 2014
LIS 653 Posters Fall 2014 PrattSILS
 
DPLA - an introduction for historians
DPLA  - an introduction for historiansDPLA  - an introduction for historians
DPLA - an introduction for historiansLarry Naukam
 
World Affairs Council, Wikipedia as global collaboration Feb 2010
World Affairs Council, Wikipedia as global collaboration  Feb 2010World Affairs Council, Wikipedia as global collaboration  Feb 2010
World Affairs Council, Wikipedia as global collaboration Feb 2010Wikimedia Foundation
 
Adding More Semantics to the Social Web
Adding More Semantics to the Social WebAdding More Semantics to the Social Web
Adding More Semantics to the Social WebJohn Breslin
 
OpenLinkedData at Information.dk - DrupalCamp CPH 2010
OpenLinkedData at Information.dk - DrupalCamp CPH 2010OpenLinkedData at Information.dk - DrupalCamp CPH 2010
OpenLinkedData at Information.dk - DrupalCamp CPH 2010johsw
 
Collaborative Networks ASTC 2014
Collaborative Networks ASTC 2014Collaborative Networks ASTC 2014
Collaborative Networks ASTC 2014Chris Dornfeld
 
The public library and wikipedia
The public library and wikipediaThe public library and wikipedia
The public library and wikipediadorohoward
 
Doctoral open day_digital_research_session_Social_Sciences_BL
Doctoral open day_digital_research_session_Social_Sciences_BLDoctoral open day_digital_research_session_Social_Sciences_BL
Doctoral open day_digital_research_session_Social_Sciences_BLAquiles Alencar Brayner
 

Similar a 20130321 Putting the world's cultural heritage online with crowdsourcing [rootstech salt lake city] (20)

20120821 putting the world’s cultural heritage online with crowd sourcing [na...
20120821 putting the world’s cultural heritage online with crowd sourcing [na...20120821 putting the world’s cultural heritage online with crowd sourcing [na...
20120821 putting the world’s cultural heritage online with crowd sourcing [na...
 
Hello islandora building a digital repository nov 30, 2016 v6
Hello islandora  building a digital repository nov 30, 2016 v6Hello islandora  building a digital repository nov 30, 2016 v6
Hello islandora building a digital repository nov 30, 2016 v6
 
Digital Transformation and Data - the Wikimedia Residency at the University o...
Digital Transformation and Data - the Wikimedia Residency at the University o...Digital Transformation and Data - the Wikimedia Residency at the University o...
Digital Transformation and Data - the Wikimedia Residency at the University o...
 
20140408 digital newspapers collections [idlc kuala lumpur]
20140408 digital newspapers collections [idlc kuala lumpur]20140408 digital newspapers collections [idlc kuala lumpur]
20140408 digital newspapers collections [idlc kuala lumpur]
 
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
 
Crowdsourcing and social engagement: potential, power and freedom for librari...
Crowdsourcing and social engagement: potential, power and freedom for librari...Crowdsourcing and social engagement: potential, power and freedom for librari...
Crowdsourcing and social engagement: potential, power and freedom for librari...
 
20130123 Crowdsourcing [hamilton library u of hi]
20130123 Crowdsourcing [hamilton library u of hi]20130123 Crowdsourcing [hamilton library u of hi]
20130123 Crowdsourcing [hamilton library u of hi]
 
What If You Let Citizens Build Your Website?
What If You Let Citizens Build Your Website?What If You Let Citizens Build Your Website?
What If You Let Citizens Build Your Website?
 
Community Generated Databases for NY State History Conference 2013
Community Generated Databases for NY State History Conference 2013Community Generated Databases for NY State History Conference 2013
Community Generated Databases for NY State History Conference 2013
 
Leslie Johnston Keynote, Best Practices Exchange 2011
Leslie Johnston Keynote, Best Practices Exchange 2011Leslie Johnston Keynote, Best Practices Exchange 2011
Leslie Johnston Keynote, Best Practices Exchange 2011
 
Aquiles imlr seminar
Aquiles imlr seminarAquiles imlr seminar
Aquiles imlr seminar
 
Humanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse PlatformHumanities Crowdsourcing on the Zooniverse Platform
Humanities Crowdsourcing on the Zooniverse Platform
 
LIS 653 Posters Fall 2014
LIS 653 Posters Fall 2014 LIS 653 Posters Fall 2014
LIS 653 Posters Fall 2014
 
DPLA - an introduction for historians
DPLA  - an introduction for historiansDPLA  - an introduction for historians
DPLA - an introduction for historians
 
World Affairs Council, Wikipedia as global collaboration Feb 2010
World Affairs Council, Wikipedia as global collaboration  Feb 2010World Affairs Council, Wikipedia as global collaboration  Feb 2010
World Affairs Council, Wikipedia as global collaboration Feb 2010
 
Adding More Semantics to the Social Web
Adding More Semantics to the Social WebAdding More Semantics to the Social Web
Adding More Semantics to the Social Web
 
OpenLinkedData at Information.dk - DrupalCamp CPH 2010
OpenLinkedData at Information.dk - DrupalCamp CPH 2010OpenLinkedData at Information.dk - DrupalCamp CPH 2010
OpenLinkedData at Information.dk - DrupalCamp CPH 2010
 
Collaborative Networks ASTC 2014
Collaborative Networks ASTC 2014Collaborative Networks ASTC 2014
Collaborative Networks ASTC 2014
 
The public library and wikipedia
The public library and wikipediaThe public library and wikipedia
The public library and wikipedia
 
Doctoral open day_digital_research_session_Social_Sciences_BL
Doctoral open day_digital_research_session_Social_Sciences_BLDoctoral open day_digital_research_session_Social_Sciences_BL
Doctoral open day_digital_research_session_Social_Sciences_BL
 

Más de Frederick Zarndt

Digitization of the Tuol Sleng Genocide Museum Archives
Digitization of the Tuol Sleng Genocide Museum ArchivesDigitization of the Tuol Sleng Genocide Museum Archives
Digitization of the Tuol Sleng Genocide Museum ArchivesFrederick Zarndt
 
2017 Born Digital Legal Deposit Policies and Practices
2017 Born Digital Legal Deposit Policies and Practices2017 Born Digital Legal Deposit Policies and Practices
2017 Born Digital Legal Deposit Policies and PracticesFrederick Zarndt
 
e-Legal Deposit Survey 2017
e-Legal Deposit Survey 2017e-Legal Deposit Survey 2017
e-Legal Deposit Survey 2017Frederick Zarndt
 
Project Management according to Great Pumpkin Principles
Project Management according to Great Pumpkin PrinciplesProject Management according to Great Pumpkin Principles
Project Management according to Great Pumpkin PrinciplesFrederick Zarndt
 
What did you say? interculture communication [20160308 phnom penh]
What did you say? interculture communication [20160308 phnom penh]What did you say? interculture communication [20160308 phnom penh]
What did you say? interculture communication [20160308 phnom penh]Frederick Zarndt
 
Coronado public library digital newspapers workshop [Oct 2016]
Coronado public library digital newspapers workshop [Oct 2016]Coronado public library digital newspapers workshop [Oct 2016]
Coronado public library digital newspapers workshop [Oct 2016]Frederick Zarndt
 
What did you say? mindful interculture communication [201608 icgse]
What did you say? mindful interculture communication [201608 icgse]What did you say? mindful interculture communication [201608 icgse]
What did you say? mindful interculture communication [201608 icgse]Frederick Zarndt
 
Here Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital NewsHere Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital NewsFrederick Zarndt
 
Here Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital NewsHere Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital NewsFrederick Zarndt
 
Rootstech 2015 finding and using digitized historical newspapers workshop [20...
Rootstech 2015 finding and using digitized historical newspapers workshop [20...Rootstech 2015 finding and using digitized historical newspapers workshop [20...
Rootstech 2015 finding and using digitized historical newspapers workshop [20...Frederick Zarndt
 
20140410 ifla digitization workshop [idlc kuala lumpur]
20140410 ifla digitization workshop [idlc kuala lumpur]20140410 ifla digitization workshop [idlc kuala lumpur]
20140410 ifla digitization workshop [idlc kuala lumpur]Frederick Zarndt
 
What did you say? Intercultural expectations, misunderstandings, and communic...
What did you say? Intercultural expectations, misunderstandings, and communic...What did you say? Intercultural expectations, misunderstandings, and communic...
What did you say? Intercultural expectations, misunderstandings, and communic...Frederick Zarndt
 
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...20140628 crowdsourcing, family history, and long tails for libraries [ala ann...
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...Frederick Zarndt
 
20131019 digital collections - if you build them will anyone visit [library 2...
20131019 digital collections - if you build them will anyone visit [library 2...20131019 digital collections - if you build them will anyone visit [library 2...
20131019 digital collections - if you build them will anyone visit [library 2...Frederick Zarndt
 
20130903 what did you say? interculture communication [hamburg]
20130903 what did you say? interculture communication [hamburg]20130903 what did you say? interculture communication [hamburg]
20130903 what did you say? interculture communication [hamburg]Frederick Zarndt
 
201308 wlic standards committee zarndt et al the alto editorial board collabo...
201308 wlic standards committee zarndt et al the alto editorial board collabo...201308 wlic standards committee zarndt et al the alto editorial board collabo...
201308 wlic standards committee zarndt et al the alto editorial board collabo...Frederick Zarndt
 
201308 wlic standards committee zarndt et al the alto editorial board collabo...
201308 wlic standards committee zarndt et al the alto editorial board collabo...201308 wlic standards committee zarndt et al the alto editorial board collabo...
201308 wlic standards committee zarndt et al the alto editorial board collabo...Frederick Zarndt
 
2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...
2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...
2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...Frederick Zarndt
 
20130629 If you build it, will they visit [ala lita lightning talk]
20130629 If you build it, will they visit [ala lita lightning talk]20130629 If you build it, will they visit [ala lita lightning talk]
20130629 If you build it, will they visit [ala lita lightning talk]Frederick Zarndt
 
20130412 Productivity of the crowd [acrl indianapolis]
20130412 Productivity of the crowd [acrl indianapolis]20130412 Productivity of the crowd [acrl indianapolis]
20130412 Productivity of the crowd [acrl indianapolis]Frederick Zarndt
 

Más de Frederick Zarndt (20)

Digitization of the Tuol Sleng Genocide Museum Archives
Digitization of the Tuol Sleng Genocide Museum ArchivesDigitization of the Tuol Sleng Genocide Museum Archives
Digitization of the Tuol Sleng Genocide Museum Archives
 
2017 Born Digital Legal Deposit Policies and Practices
2017 Born Digital Legal Deposit Policies and Practices2017 Born Digital Legal Deposit Policies and Practices
2017 Born Digital Legal Deposit Policies and Practices
 
e-Legal Deposit Survey 2017
e-Legal Deposit Survey 2017e-Legal Deposit Survey 2017
e-Legal Deposit Survey 2017
 
Project Management according to Great Pumpkin Principles
Project Management according to Great Pumpkin PrinciplesProject Management according to Great Pumpkin Principles
Project Management according to Great Pumpkin Principles
 
What did you say? interculture communication [20160308 phnom penh]
What did you say? interculture communication [20160308 phnom penh]What did you say? interculture communication [20160308 phnom penh]
What did you say? interculture communication [20160308 phnom penh]
 
Coronado public library digital newspapers workshop [Oct 2016]
Coronado public library digital newspapers workshop [Oct 2016]Coronado public library digital newspapers workshop [Oct 2016]
Coronado public library digital newspapers workshop [Oct 2016]
 
What did you say? mindful interculture communication [201608 icgse]
What did you say? mindful interculture communication [201608 icgse]What did you say? mindful interculture communication [201608 icgse]
What did you say? mindful interculture communication [201608 icgse]
 
Here Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital NewsHere Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital News
 
Here Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital NewsHere Today, Gone within a Month: The Fleeting Life of Digital News
Here Today, Gone within a Month: The Fleeting Life of Digital News
 
Rootstech 2015 finding and using digitized historical newspapers workshop [20...
Rootstech 2015 finding and using digitized historical newspapers workshop [20...Rootstech 2015 finding and using digitized historical newspapers workshop [20...
Rootstech 2015 finding and using digitized historical newspapers workshop [20...
 
20140410 ifla digitization workshop [idlc kuala lumpur]
20140410 ifla digitization workshop [idlc kuala lumpur]20140410 ifla digitization workshop [idlc kuala lumpur]
20140410 ifla digitization workshop [idlc kuala lumpur]
 
What did you say? Intercultural expectations, misunderstandings, and communic...
What did you say? Intercultural expectations, misunderstandings, and communic...What did you say? Intercultural expectations, misunderstandings, and communic...
What did you say? Intercultural expectations, misunderstandings, and communic...
 
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...20140628 crowdsourcing, family history, and long tails for libraries [ala ann...
20140628 crowdsourcing, family history, and long tails for libraries [ala ann...
 
20131019 digital collections - if you build them will anyone visit [library 2...
20131019 digital collections - if you build them will anyone visit [library 2...20131019 digital collections - if you build them will anyone visit [library 2...
20131019 digital collections - if you build them will anyone visit [library 2...
 
20130903 what did you say? interculture communication [hamburg]
20130903 what did you say? interculture communication [hamburg]20130903 what did you say? interculture communication [hamburg]
20130903 what did you say? interculture communication [hamburg]
 
201308 wlic standards committee zarndt et al the alto editorial board collabo...
201308 wlic standards committee zarndt et al the alto editorial board collabo...201308 wlic standards committee zarndt et al the alto editorial board collabo...
201308 wlic standards committee zarndt et al the alto editorial board collabo...
 
201308 wlic standards committee zarndt et al the alto editorial board collabo...
201308 wlic standards committee zarndt et al the alto editorial board collabo...201308 wlic standards committee zarndt et al the alto editorial board collabo...
201308 wlic standards committee zarndt et al the alto editorial board collabo...
 
2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...
2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...
2013 ifla satellite zarndt et al [marketing cultural heritage digital collect...
 
20130629 If you build it, will they visit [ala lita lightning talk]
20130629 If you build it, will they visit [ala lita lightning talk]20130629 If you build it, will they visit [ala lita lightning talk]
20130629 If you build it, will they visit [ala lita lightning talk]
 
20130412 Productivity of the crowd [acrl indianapolis]
20130412 Productivity of the crowd [acrl indianapolis]20130412 Productivity of the crowd [acrl indianapolis]
20130412 Productivity of the crowd [acrl indianapolis]
 

Último

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsRoshan Dwivedi
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 

Último (20)

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 

20130321 Putting the world's cultural heritage online with crowdsourcing [rootstech salt lake city]

  • 1. Putting the world’s cultural heritage online with crowdsourcing Frederick Zarndt @cowboyMontana frederick@frederickzarndt.com Slides @ http://bit.ly/crowdsrootstech2013 CCS / Digital Divide Data / DL Consulting Photo held by John Oxley Library, State Library of Queensland. Original from Courier-mail, Brisbane, Queensland, Australia.
  • 2.
  • 3.
  • 4.
  • 5.
  • 7. In 2004 James Surowiecki published ... The Wisdom of Crowds: Why the Many Are Smarter Than the Few and How Collective Wisdom Shapes Business, Economies, Societies and Nations In it he says ...
  • 8. ... a crowd of persons that are diverse ...
  • 9. ... in d ep en de nt ...
  • 11. usually make better judgements or decisions than single persons
  • 12. “Country Fair” by Grandma Moses. Original painting 1950.
  • 13. “crowdsourcing” was coined by Jeff Howe in “The rise of crowdsourcing” published in Wired magazine June 2006.
  • 14. web trends for “crowdsourcing” Jan-2006 to Jan-2013
  • 15. • On the date of publication of Jeff Howe’s Wired magazine article, 1-Jun-2007, Wikipedia did not have an entry (list) of crowdsourcing projects*. • On 25-Jan-2010 Wikipedia’s list of crowdsourcing projects had 35 entries*. • On 17-Mar -2013 Wikipedia’s list of crowdsourcing projects had 158 entries+. * From Internet Archives’ Wayback Machine. + Wikipedia contributors, "List of crowdsourcing projects," Wikipedia, The Free Encyclopedia, https:// en.wikipedia.org/wiki/List_of_crowdsourcing_projects (accessed March 17, 2013).
  • 16. Crowdsourcing is the practice of obtaining needed services, ideas, or content by soliciting contributions from a large group of people, and especially from an online community, rather than from traditional employees or suppliers. ... [It] is different from ordinary outsourcing since it is a task or problem that is outsourced to an undefined public rather than a specific, named group. Wikipedia contributors, "Crowdsourcing," Wikipedia, The Free Encyclopedia, http://en.wikipedia.org/wiki/ Crowdsourcing (accessed March 17, 2013)
  • 17. crowdcollaboration crowd* crowdsourcing ng di citizen science un df ow cr crowdcasting crowdvoting
  • 18. what is Alexa? • Alexa collects and analyzes Internet data for purposes of web analytics. Web analytics is the measurement, collection, analysis and reporting of Internet data for the purposes of understanding and optimizing web usage. Alexa is now a subsidiary of Amazon. • Alexa was founded in 1996 by Brewster Kahle (Internet Archive) and Bruce Gilliat. • Alexa operations includes archiving of webpages as they are crawled. This database served as the basis for the creation of the Internet Archive accessible through the Wayback Machine. • Alexa continually crawls all publicly-available websites to create a series of snapshots of the web. • Alexa gathers information from a variety of sources to provide key statistics about each site on the web, for example, Traffic Rank, the number of PageViews, and site Speed, Bounce Rate, etc. This information is derived from Alexa toolbar users (~6,000,000 worldwide).
  • 19. definitions • A PageView is a request for a file whose type is defined as a page. • A Unique Visitor is a uniquely identified client generating requests on the web server or viewing pages within a defined time period (i.e. day, week or month). A Unique Visitor counts once within the timescale. • A Visit is a series of page requests from the same uniquely identified client with a time of no more than 30 minutes between each page request. • Bounce Rate is the percentage of visits where the visitor enters and exits at the same page without visiting any other pages on the site in between. • World | Country Rank is a function of the average daily unique visits and the number of unique pages requested. definitions adapted from Wikipedia http://en.wikipedia.org/wiki/Web_analytics
  • 20. crowdfunding Kickstarter (http://www.kickstarter.com/) was 1st launched in Apr 2009. As of 17-Mar-2013 its Alexa Internet traffic rank is 751 (global) / 294 (USA). 35,000+ projects successfully funded with $500,000,000+ by 3,000,000+ people.
  • 21. crowdvoting reddit (http://www.reddit.com/) was 1st launched in June 2005. As of 17-Mar-2013 its Alexa Internet traffic rank is 124 (global) / 54 (USA). reddit had more than 55,000,000 unique visitors from 175 countries who cast more than 17,000,000 votes about which stories are important.
  • 22. Amazon Mechanical Turk (https://www.mturk.com) was launched Nov 2005. As of 17-Mar-2013 its Alexa Internet traffic rank is 8,219 (global) / 3,036 (USA).
  • 23. crowdsourcing Each day 200,000,000 recaptcha’s are solved by humans around the world.
  • 24. Zooniverse (https://www.zooniverse.org) was 1st launched as Galaxy Zoo July 2007. As of 17-Mar-2013 it has 801,682 participants worldwide. Its Alexa traffic rank is 271,574 (global) / 127,695 (USA).
  • 26. Wikipedia • Wikipedia began 2001 • Now in 285 languages, 24,640,000 articles • 4,210,000 articles in English • More than 1,000,000 articles each in German, French, Italian, and Dutch • 40 wikipedia languages with more than 100,000 articles • 112 wikipedia languages with more than 10,000 articles • 488,470,000 unique visitors (Jan 2013) • 84,848,000 active (5+ edits) contributors • Alexa global traffic rank: #6 in worldwide web traffic Statistics from Wikimedia Report Card http://reportcard.wmflabs.org
  • 27. Family Search Indexing was 1st launched (beta) 2004. As of 17-Mar-2013 Family Search’s (https://familysearch.org/) Alexa Internet traffic rank is 4,480 (global) / 1,208 (USA).
  • 28. • Started (beta) 2004 • More than 780,000 worldwide registered volunteers from ~25 countries index records relevant to family history • Approximately 100,000 active volunteers each month • UI in Chinese, English, German, French, Italian, Japanese, Korean, Portuguese, and Russian • Blind double-key entry with arbitration / reconciliation • More than 1,500,088,741 records indexed (July 2012) • Accuracy typically > 99.95% Statistics from private communication with Family Search 5-Jul-2013
  • 29. Project Gutenberg was 1st launched Dec 1971. As of 17-Mar-2013 Project Gutenberg’s Alexa Internet traffic rank 5,192 (global) / 2,851 (USA).
  • 30. • Started Dec 1971 • Worldwide volunteers transcribe or proofread OCR’d public domain books through Distributed Proofreaders • 42,000 free ebooks completed (March 2013) • More than 100,000 free ebooks offered by its partners and affiliates • Partner / affiliated projects for Australia, Canada, Europe, Germany, Runeberg (Nordic literature), self-published contemporary authors, Consortia Center in collaboration with the World eBook Library, ...
  • 31. As of 17-Mar-2013 the National Library of Australia’s (http://trove.nla.gov.au/) Alexa Internet traffic rank is 14,490 (global) / 330 (Australia). Trove gets ~75% of all National Library web traffic.
  • 32. National Library of Australia • Online since 2008 • 7,200,000+ pages • Top text corrector 1,250,000 lines (June 2012) • 2,450,000+ lines corrected each month (average for 1st 6 months 2012) • 68,908,757 lines corrected as of July 2012, up from 42,411,468 lines corrected July 2011. • 63,613 total registered users (July 2012) • 4,146 active users (June 2012) Statistics from private communication with the National Library of Australia Oct 2012
  • 33. Courtesy of Tim Sherrat, Tinkerer-in-Chief at WraggeLabs Emporium (http://wraggelabs.com/
  • 34. As of 17-Mar-2013 National Library of Finland’s (http://www.nationallibrary.fi/) Alexa Internet global traffic rank is 4,303,901. Its Internet traffic rank for Finland was 199 as of 2-Apr-2012.
  • 35. National Library of Finland • Digitalkoot is a project to improve OCR text in digitized newspapers -- by playing games! • Digitalkoot is a collaboration between the National Library and Microtask • Players correct OCR text by playing Myyräsillassa (Mole Bridge) or Myyräjahdissa (Mole Hunt) • National Library has 4,000,000+ digitized pages • 109,321 registered players (October 2012) • Since February 2011 8,024,530 micro-tasks have been completed
  • 36. As of 17-Mar-2013 UC Riverside’s Alexa Internet traffic rank is 11,782 (global) / 4,120 (USA). CDNC gets ~3.30% of all UC Riverside web traffic.
  • 37. California Digital Newspaper Collection • CDNC began digitizing newspapers in 2005 as part of the Library of Congress National Digital Newspapers Program (NDNP) • Newspapers digitized to article-level in addition to page-level as required by NDNP (same as Utah Digital Newspapers) • Since 2009 hosted on Veridian at http://cdnc.ucr.edu • Collection size 55,970 issues, 495,175 pages, 5,658,224 articles, 498,000,000+ lines (Mar-2013)
  • 38. OCR text correction • OCR text correction added August 2011 • Corrections are done line by line • ~578,000+ lines of text corrected Oct 2012 • ~935,398+ lines of text corrected Mar 2013 • ~2% of the collection corrected, 98% to go! • Top corrector 327,244 lines > 2x 2nd corrector
  • 39.
  • 40. Cambridge Public Library Historic Newspaper Collection • Cambridge Historic Newspapers online since Jan 2012. • Cambridge Massachusetts Public Library digitized local newspapers (http://cambridge.dlconsulting.com/) • Newspapers digitized to article-level • Collection size 6,346 issues, 59,070 pages, 669,406 articles (Mar-2013) • Collection includes 13,099 obituary cards
  • 41. Why correct text? Here’s why ...
  • 42. Raw OCR text Newspaper image Deaths. lln»rieff, Esq. of <c .. Qn. Sunday, the till. greatly Drandrellt, of Orms4irJi.- ~ ; ;✓ ' • * On ijfr r inn ljjjil F iij '11 f Havodivyd, Carnarvonshire, S ; **" *- ' « ' March Oxford, F. Tfovmeud, Uerald. » • V . •On Tncsdav last, Mr. Charles. IWilinson, this 8 ; had vf thesis#,, a week ago, which tcrminate<i'iu his death. . / ' ■ O'i Sunday, dJst nit. at. AsbtCnvHall, mar Lancaster, Mr.,Geo. Worn ick, many years house'steward hit late Once The Hamilton and Brandon. He locked himself h»oWn'r«wte<: soon. twelve o'clock" that dny, and fii»-d a loaded pistol "through Ins bead, 1 which instantaneously killed him. Coronet's Verdict, shot himself in a temporary fit of Friday week, Excerpt from The British Newspaper Archive, Chester Courant, Tuesday 6-Apr-1819, page 3.
  • 43. Motivation Graphic from Kaufmann et al. “More than fun and money. Worker Motivation in Crowdsourcing – A Study on Mechanical Turk.”
  • 44. Wisdom of crowds Each person should have private information even if Diversity it's just an eccentric interpretation of the known facts. People's opinions aren't determined by the opinions Independence of those around them. People are able to specialize and draw on local Decentralization knowledge. Some mechanism exists for turning private Aggregation judgments into a collective decision. James Surowiecki, The Wisdom of Crowds: Why the Many Are Smarter Than the Few and How Collective Wisdom Shapes Business, Economies, Societies and Nations, Anchor Books, New York, 2005.
  • 45. Cognitive surplus ... people are learning to use their free time for creative activities rather than consumptive ones [such as watching TV] ... ... the total human cognitive effort in creating all of Wikipedia in every language is about one hundred million hours ... ... Americans alone watch two hundred billion hours of TV every year, or enough time, if it would be devoted to projects similar to Wikipedia, to create about 2000 of them ... Clay Shirky. Cognitive surplus: Creativity and generosity in a connected age. Penguin Press. New York. 2010.
  • 46. Motivation Genealogists and family historians • The 2012 National Library of Australia’s Trove status report showed that ~50% of Trove users are family historians PAPERSPAST • National Library of New Zealand survey found that ~50% of PapersPast users are genealogists • A 2013 California Digital Newspaper Collection survey shows that more than 65% of its users are genealogists; 75% are 50 years old or older • A 2012 Utah Digital Newspapers survey showed that 72% of its users are genealogists* *John Herbert and Randy Olsen. “Small town papers: Still delivering the news”. Paper given at 2012 World Library and Information Congress. Helsinki. August 2012.
  • 47. Motivation Trove users’ report • “I enjoy the correction - it’s a great way to learn more about past history and things of interest whilst doing a ‘service to the community’ by correcting text for the benefit of others.” • “I have recently retired from IT and thought that I could be of some assistance to the project. It benefits me and other people. It helps with family research.” From Rose Holley in “Many Hands Make Light Work.” National Library of Australia March 2009.
  • 48. Motivation CDNC users’ report “I am interested in all kinds of history. I have pursued genealogy as a hobby for many years. I correct text at CDNC because I see it as a constructive way to contribute to a worthwhile project. Because I am interested in history, I enjoy it.” Wesley, California Personal communications with CDNC text correctors.
  • 49. Motivation CDNC users’ report “I only correct the text on articles of local interest - nothing at state, national or international level, no advertisements, etc.  The objective is to be able to help researchers to locate local people, places, organizations and events using the on-line search at CDNC.  I correct local news & gossip, personal items, real estate transactions, superior court proceedings, county and local board of supervisors meetings, obituaries, birth notices, marriages, yachting news, etc.” Ann, California Personal communications with CDNC text correctors.
  • 50. Motivation CDNC users’ report “I am correcting text for the Coronado Tent City Program for 1903.  It is important to correct any problems with personal names and other information so that researchers will be able to search by keyword and be assured of retrieving desired results. ... type fonts cause a great deal of difficulty in digitizing the text and can cause problems for searchers.  Also, many of the guests' names at Tent City and Hotel Del Coronado were taken from the registration books and reported in the Program.  This led to many problems in spelling of last names and the editors were not careful to be consistent in the spellings.  This Program is an important resource since it provides an excellent picture of daily life in Tent City and captures much of the history of Coronado itself.” Gene, California Personal communications with CDNC text correctors.
  • 51. Motivation CDNC users’ report “I have always been interested in history, especially the development of the American West, and nothing brings it alive better than newspapers of the time. I believe them to be an invaluable source of knowledge for us and future generations.” David, United Kingdom Personal communications with CDNC text correctors.
  • 52. Motivation CDNC users’ report CDNC is an excellent source of information matching my personal interest in such topics as sea history, development of shipbuilding, clippers and other ships etc. ... Unfortunately, the quality of text ... is rather poor I’m afraid. This is why I started to do all corrections necessary for myself ... and to leave the corrected text for use of others. .... I am not doing this very regularly as this is just my hobby and pleasure. Jerzey, Poland Personal communications with CDNC text correctors.
  • 53. Ok, raw OCR newspaper text is bad. But how much difference can one person (me) really make?
  • 54. You can make a difference Graphic courtesy of TYPEinspire (http://typeinspire.com/)
  • 55. User Lines corrected Lines corrected User 1 242,965 1,456,906 1 2 87,515 1,385,369 2 3 31,318 1,010,360 3 4 24,144 960,230 4 5 23,184 847,340 5 6 19,240 786,147 6 7 18,898 657,187 7 8 16,875 600,513 8 9 11,784 582,276 9 10 9,762 565,384 10 Statistics from Oct 2012
  • 56. uncorrected OCR accuracy by newspaper title OCR character ~OCR word Title accuracy accuracy* PRP Pacific Rural Press 1871 - 1922 92.6% 68.1% SFC San Francisco Call 1890 - 1913 92.6% 68.1% LAH Los Angeles Herald 1873 - 1910 88.7% 54.9% LH Livermore Herald 1877 - 1899 88.6% 54.6% DAC Daily Alta California 1841 - 1891 88.2% 53.4% CFJ California Farmer and Journal 86.5% 48.4% of Useful Sciences 1855 - 1880 SN Sausalito News 1885 - 1922 70.4% 17.3% *Word accuracy assumes average word length is 5 characters
  • 57. corrected OCR accuracy by newspaper title OCR character Corrected Title accuracy accuracy PRP Pacific Rural Press 1871 - 1922 92.6% 99.3% SFC San Francisco Call 1890 - 1913 92.6% 99.6% LAH Los Angeles Herald 1873 - 1910 88.7% 99.1% LH Livermore Herald 1877 - 1899 88.6% 99.9% DAC Daily Alta California 1841 - 1891 88.2% 99.9% CFJ California Farmer and Journal 86.5% 99.8% of Useful Sciences 1855 - 1880 SN Sausalito News 1885 - 1922 70.4% 100.0%
  • 58. corrected OCR accuracy by newspaper title OCR character ~OCR word Corrected ~Corrected word Title accuracy accuracy* accuracy accuracy* PRP 1871 - 1922 92.6% 68.1% 99.3% 96.5% SFC 1890 - 1913 92.6% 68.1% 99.6% 98.0% LAH 1873 - 1910 88.7% 54.9% 99.1% 95.6% LH 1877 - 1899 88.6% 54.6% 99.9% 99.5% DAC 1841 - 1891 88.2% 53.4% 99.9% 99.5% CF 1855 - 1880 86.5% 48.4% 98.3% 91.8% SN 1885 - 1922 70.4% 17.3% 100.0% 100.0% *Word accuracy assumes average word length is 5 characters
  • 59. correction accuracy by user Average uncorrected Average corrected User text accuracy text accuracy A 70.4% 100.0% B 87.1% 99.5% C 95.4% 99.5% D 86.5% 98.3% E 95.3% 100.0% F 91.0% 100.0% G 91.0% 99.8% H 90.5% 99.0% I 96.6% 99.8% J 94.8% 100.0% K 86.8% 99.3%
  • 60. the long tail* of crowdsourced OCR text correction a probability distribution has a long tail if a larger share of population rests within its tail than it would under a normal distribution the most productive users represent a small fraction of the total user population and ~50% of total production, or, said a different way, the largest fraction but individually not quite so productive users are as important as the most productive users *The phrase “long tail” was popularized by Chris Anderson in the October 2004 Wired magazine article The Long Tail and by Clay Shirky’s February 2003 essay “Power laws, web logs, and inequality”.
  • 61. OCR text correction long tails 3,000,000 2,250,000 50% 300000 top corrector 242,965 1,500,000 top corrector 1,456,906 225000 50% 750,000 150000 50% 0 75000 NLA lines corrected by text corector 50% 0 CDNC lines corrected by text corrector
  • 63. Website traffic After a crowdsourcing transcription project of diaries from the American War Between the States, Nicole Saylor, Head of Digital Library Services at the University of Iowa Libraries, reported “On June 9, 2011, we went from about 1000 daily hits to our digital library on a really good day to more than 70,000.” Nicole Saylor interviewed by Trevor Owens. “Crowdsourcing the Civil War: Insights Interview with Nicole Saylor” blog post at http:// blogs.loc.gov/digitalpreservation/2011/12/crowdsourcing-the-civil-war-insights-interview-with-nicole-saylor/. Dec 6, 2011.
  • 64. Website traffic Website traffic at CDNC before / after implementing crowdsourcing before crowdsourcing after crowdsourcing change 11-Jun-2011 / 12-Jul-2011 11-Jun-2012 / 12-Jul-2012 visits 17,485 21,488 +22.9% unique visitors 11,381 13,376 +17.5% visit duration 9m 24s 11m 7s +18.3% bounce rate 51.3% 44.5% -6.8% pages per visit 14.9 11.7 -21.5%
  • 66. Crowdsourcing benefits Public domain photo courtesy of US Navy
  • 67. $ Economics Financial value of outsourced OCR text correction for newspapers? The Assumptions • 25 to 50 characters per line in a newspaper column: Assume 40 characters per line (CDNC sample average) • Outsourced text transcription or correction costs USD $0.35 to $1.20 per 1000 characters: Assume $0.50 per 1000 characters
  • 68. $ Economics $ 578,000 lines x 40 characters per line x 1/1000 x $0.50 = $11,560 $ 68,908,757 lines x 40 characters per line x 1/1000 x $0.50 = $1,378,175
  • 69. $ Economics Financial value of in-house OCR text correction? The Assumptions • Correction takes 15 seconds per line • Cost is hourly wage plus benefits of lowest level employee, $10 for CDNC, $41.88* for Australia AUD $40.38 = USD $41.88 is the actual labor value assumed by the National Library of Australia to calculate avoided costs due to crowdsourced OCR text correction in its 2012 Trove Status Report.
  • 70. $ Economics $ 578,000 lines x 15 seconds per line x 1/3600 hrs per second x $10.00 per hr = $24,083 $ 68,908,757 lines x 15 seconds per line x 1/3600 hrs per second x $41.88 per hr = $12,024,578
  • 71. Accuracy “His Accuracy Depends on Ours!" Office for Emergency Management. Office of War Information. Domestic Operations Branch. Bureau of Special Services. [Photo held at US National Archives and Records Administration]
  • 72. • Edwin Kiljin (Koninklijke Bibliotheek the Netherlands) reports raw OCR character accuracies of 68% for early 20th century newspapers • Rose Holley (National Library of Australia) reports raw OCR character accuracy varied from 71% to 98% on a sample Trove digitized newspapers Edwin Kiljin. “The current state-of-art in newspaper digitization.” D-Lib Magazine. January/February 2008. Rose Holley. “How good can it get? Analysing and improving OCR accuracy in large scale historic newspaper digitisation programs. D-Lib Magazine. March/April 2009. Public domain graphic courtesy of Wikimedia Commons. Graphic is logo for Accuracy in Media (http://www.aim.org/)
  • 73. Accuracy Mapping texts* assesses digitization quality of digital newspapers by comparing the number of words recognized to the total number of words scanned * Mapping texts is a collaboration between the University of North Texas and Stanford University aimed at experimenting with new methods for finding and analyzing meaningful patterns embedded in massive collections of digital newspapers.
  • 74. Accuracy How does low text accuracy affect search recall? The Facts • Average uncorrected OCR character accuracy of the CDNC sample data is ~89% • Average length of an English word is 5 characters • Average word accuracy is 89% x 89% x 89% x 89% x 89% = 55.8% - round up to 60% or 6 out of 10 words correct
  • 75. Search recall no text correction ARND T ARNDT ARNDT DT ARN ARNDT ARNDT ARNDT ARNDT ARNDT ARNDT instances of “ARNDT” found instances of “ARNDT” not found
  • 76. Accuracy The Facts • Average corrected character accuracy of the CDNC sample data is ~99.4% • Average word accuracy of CDNC corrected text is 99.4% x 99.4% x 99.4% x 99.4% x 99.4% = 97.0%
  • 77. Search recall with text correction ARNDT ARNDT ARNDT ARNDT ARNDT ARNDT ARNDT ARNDT ARNDT ARNDT instances of “ARNDT” found instances of “ARNDT” not found
  • 78. Accuracy A search for “Arndt” at Chronicling America gives 10,267 results* • If Chronicling America text accuracy is 55.8% (same as uncorrected CDNC sample), then 8,133 instances of “Arndt” were not found • If text accuracy is 97.0%, then 317 instances of “Arndt” were not found * Search performed 31 Oct 2012
  • 79. Accuracy Suppose the word/name is longer than 5 characters? The Facts • Assume that average uncorrected / corrected OCR character accuracy is ~89% / ~99% same as CDNC. Name Name length Raw text accuracy Corrected text accuracy Eklund 6 49.7% 94.2% Kennedy 7 44.2% 93.25 Espinosa 8 39.4% 92.3% Bonaparte 9 35.0% 91.4% Chatterjee 10 31.2% 90.4%
  • 80. Accuracy Chronicling America searches done 19-Mar-2013 (6,025,474 pages from 1836 to 1922). Number of Missing results with Missing results with Name search results raw text accuracy corrected text accuracy Eklund 2,951 2,987 182 Kennedy 360,723 455,392 26,111 Espinosa 1,918 2,950 160 Bonaparte 44,664 82,947 4,203 Chatterjee 19 42 2
  • 81. Resources Public domain photo “A useful instruction for young sailors from the Royal Hospital School, Greenwich” from the National Maritime Museum.
  • 82. Comprehensive worldwide list of online newspaper archives Wikipedia contributors, "List of online newspaper archives," Wikipedia, The Free Encyclopedia, https:// en.wikipedia.org/wiki/Wikipedia:List_of_online_newspaper_archives (accessed March 17, 2013).
  • 83. Search many digital newspaper collections at once! As of 17-Mar-2013 elephind (http://www.elephind.com) has indexed 930 newspapers from 11 historical digital collections comprising 1,041,086 issues and 44,158,901 pages/articles.
  • 84. Try crowdsourcing! Correct California newspapers at http://cdnc.ucr.edu Correct Australian newspapers http://trove.nla.gov.au Correct Cambridge MA newspapers http://bit.ly/cambridgepublic Correct Tennessee newspapers http://tndp.lib.utk.edu Correct Virginia newspapers http://virginiachronicle.com Login with user name “crowdsatrootstech2013” or “crowdsatrootstech2013@gmail.com”, password “roots$tech”
  • 85. Hãy thử crowdsourcing! Correct Vietnamese newspapers http://bit.ly/nationallibraryofvietnam Попробуйте краудсорсинга! Or try Russian language periodicals http://bit.ly/russianperiodicals Kokeile crowdsourcing! Or try Finnish newspapers http://digi.lib.helsinki.fi/sanomalehti
  • 86. Other resources Mapping Texts at http://mappingtexts.stanford.edu/ Wragge Labs at http://wraggelabs.com/ Wikipedia list of crowdsourcing projects https://en.wikipedia.org/wiki/ List_of_crowdsourcing_projects
  • 87. ? Frederick Zarndt @cowboyMontana frederick@frederickzarndt.com Slides @ http://bit.ly/crowdsrootstech2013 CCS / Digital Divide Data / DL Consulting Photo held by John Oxley Library, State Library of Queensland. Original from Courier-mail, Brisbane, Queensland, Australia.
  • 88. Try crowdsourcing! Correct California newspapers at http://cdnc.ucr.edu Correct Australian newspapers http://trove.nla.gov.au Correct Cambridge MA newspapers http://bit.ly/cambridgepublic Correct Tennessee newspapers http://tndp.lib.utk.edu Correct Virginia newspapers http://virginiachronicle.com Login with user name “crowdsatrootstech2013” or “crowdsatrootstech2013@gmail.com”, password “roots$tech”
  • 89.
  • 90.
  • 91.
  • 92.
  • 93.
  • 94.
  • 95.
  • 96.
  • 97.
  • 98. FYI about Trove • If you hope to begin your text correction hobby with Trove’s family notices (births, deaths, weddings), you may have a tough go of it. As of 17-Mar-2013, there were 768,333 family notices in Trove digitized newspapers; most seem to have already been corrected. • Lack of text correction opportunity notwithstanding, now you know where to find 768,333 family notices published in Australia from 1803 to 1954.
  • 99. Try crowdsourcing! Correct British newspapers http://www.britishnewspaperarchive.co.uk/ The British Newspaper Archive is a subscription service from brightsolid and the British Library. From now until the end of RootsTech you can use it at no cost with the user name and password below. Login with user name “crowdsatrootstech2013” or “crowdsatrootstech2013@gmail.com”, password “roots$tech”
  • 100.
  • 101.
  • 102. ? Frederick Zarndt @cowboyMontana frederick@frederickzarndt.com Slides @ http://bit.ly/crowdsrootstech2013 CCS / Digital Divide Data / DL Consulting Photo held by John Oxley Library, State Library of Queensland. Original from Courier-mail, Brisbane, Queensland, Australia.