SlideShare una empresa de Scribd logo
1 de 1
Descargar para leer sin conexión
Crowdsourcing in Article Evaluation
                                                                                                                              Isabella Peters1, Stefanie Haustein1,2 & Jens Terliesner1                                                                                                                                                                                                                                           MULTIDIMENSIONAL 
                                                                                                                                                                                                                                                                                                                                                                                                                                      JOURNAL 
                                                                                                                           isabella.peters@uni-duesseldorf.de | s.haustein@fz-juelich.de | jens.terliesner@uni-duesseldorf.de                                                                                                                                                                                        JOURNAL
                                                                                                                                                                                                                                                                                                                                                                                                                   MANAGEMENT
                                                                                                                                                                                                                                                                                                                                                                                                                                    EVALUATION




                                              GENERAL IDEA OF CROWDSOURCING ARTICLE & JOURNAL EVALUATION
                                              Past: traditional  journal evaluation uses cumulated                                                                                 Present: access‐,  download‐,  subscription statistics of                                                   Future: a  multidimensional  approach which combines
                                              citation numbers of articles.                                                                                                        electronic articles should reflect usage of  articles and                                                   available usage numbers.
                                              Problem: citations do  not appropriately reflect articles‘                                                                           journals.                                                                                                   Focus: data of  STM‐social bookmarking systems (e.g., 
                                              influence on  readers  because only such  readers  were                                                                              Problem: measuring is problematic although standards                                                        CiteULike)  for measuring journal perception and  reader
                                              count, who also write articles and publish in journals.                                                                              are given. Global usage statistics are not available.                                                       perception of articles as crowdsourced alternative.


                                                                                                                                                                                                                                                                                                                                                              88.4% of all retrieved bookmarks were tagged
                                              DATA COLLECTION & TEST SETS                                                                                                                                                     matc
                                                                                                                                                                                                                                  hing
                                                                                                                                                                                                                                         via
                                                                                                                                                                                                                                             DO
                                                                                                                                                                                                                                                I                               REV MOD PHYS                                      10000
                                                                                                                                                                                                                                                                                                                                                              8,208 tags were assigned 38,241 times

                                             Test set I                                                                                                                                                 O
                                                                                                                                                                                                            I
                                                                                                                                                                                                   /D
                                                                                                                                                                                          via SN
                                              45 solid state physics journals                                                                                                           g
                                                                                                                                                                                      in / IS
                                                                                                                                                                                    ch n
                                                                                                                                                                                                                                                                                                                                  1000

                                                                                                                                                                                  ar tio




                                                                                                                                                                                                                                                                                                  frequency
                                              all publications from 2004 to 2008                                                                                               se evia
                                                                                                                                                                               ab
                                                                                                                                                                                 br
                                                                                                                                                                                                                                                                                                                                    100
                                                                                                                                       J PHYS A                              /
                                              bibliographic data for 168,109                                                         J PHYS A                     tit
                                                                                                                                                                        le

                                                                                                                                  SOFT MATTER

                                              documents from Web of Science                                                                                                                                                                                                                                                         10

                                                                                                                                                                                                                                                                                 PHYS REV E
                                                                                                                                     PHYS REV E


                                             Data collection                                                                                                                                                    CiteULike
                                                                                                                                                                                                                                                                                                                                        1
                                                                                                                                                                                                                                                                                                                                            1              10           100      1000    10000

                                              matching of articles and bookmarks                                                  REV MOD PHYS
                                                                                                                                                                                                                        tag                                                                                                                                             tags
                                                                                                                                                                                                                                                                                                                                                                                                 tagcloud: all tags assigned at least 50 times

                                              to articles in CiteULike, BibSonomy
                                              and Connotea                                                                        journals
                                                                                                                                                                                                                                                                                                                                  350                                           13,608 bookmarks were matched to 
                                                                                                                                                                                                                                                                                                                                            satoshi (322 posts)
                                             Test set II                                                                                                                                                                                                                                                                          300
                                                                                                                                                                                                                                                                                                                                                                                10,280 articles




                                                                                                                                                                                                                                                                                                   Number of bookmarks per user
                                              13,760 correct bookmarks retrieved                                                                                                         bookmarks                                                                                J PHYS A

                                                                                                                                                                                                                                                                                                                                  250
                                                                                                                                                                                                                                                                                                                                                                                2,441 unique users
                                                                                                                                                                                                                                                                                                                                            bronckobuster (238 posts)
                                              for articles of test set I                                                       Number of bookmarks retrieved                                                                                                                                                                                rice (234 posts)
                                                                                                                                                                                                                                                                                                                                                                                1,179 users posted one article
                                                                                                                                                                                                                                     bibliographic data                                                                           200

                                                                                                     BibSonomy              940
                                                                                                                                                                                                                                                                                                                                                                                75% of content is created by 21% 
                                                                                                                                                                                                                                                                                                                                  150
                                                                                                                                                                                                                                                                                                                                                                                of users
                                                                                                          CiteULike                                                                   10778                                                                                                                                       100                                           8,511 articles were only 
                                                                                                                                                                                                                                                                                                                                   50                                           bookmarked once
                                                                                                          Connotea                2042
                                                                                                                                                                                                                                                                                                                                    0
                                                                                                                                                                                                                                                                                                                                                                           Users from BibSonomy, CiteULike and Connotea




                                              RQ: DO TAGS REFLECT OTHER VIEWS ON ARTICLES THAN AUTHOR OR INTERMEDIARY KEYWORDS?
                                           Comparison of:                                                                                                                   Preprocessing and cleaning of keyword sets:                                                                                                            Results from preprocessing and cleaning:
                                                                                                                                                                             aim: to receive a linguistically homogenous keyword                                                                                                    author: ‐55.3% spelling variants
                                                                              subject headings
                                                                                                                                                                             collection                                                                                                                                             intermediary: ‐2.8% spelling variants
                                      publication
                                                                                                                                                                             all keywords: removed special characters (except                                                                                                       automatic: ‐5.3% spelling variants
                                                                     Inspec
                                                     author                            matching via DOI                                      724 articles of test set I&II   hyphens and underscores), lower case, BE to AE,                                                                                                        tags: ‐8.4% variants
                                                                                                                                                                                                      *
                                                    keywords                                                                                 contained all keyword types stemming with Porter 2
                                                      title
                                                                                                                                                                             author keywords: removed stop words and dataset                                                                                                            author: +34.1% overlap**
                                                                                                                   tags
                                                     terms
                                                                                                                                             comparison of keyword sets specific terms (e.g., imported)                                                                                                                                 intermediary: +21% overlap
                                                    abstract                                                                                 on article level via cosine     tags for comparison with title & abstract terms: split at                                                                                                  automatic: +20.6% overlap
                                                                                                               BibSonomy
                                                     terms                                                     CiteULike
                                                                                                                                             similarity coefficient          separating character (e.g., hyphen or undescore)
                                                                     Web of Science                            Connotea
                                                                                                                                                                             tags for comparison with automatic & controlled                                                                                                                                                                     ** at least one term in common
                                                                              KeyWords PlusTM
                                                                                                                                                                             keywords: deletion of separating character and blanks




                                               RESULTS OF TERM SET COMPARISON                                                                                                                                                 mean overlap tag ratio
                                                                                                                                                                                                                                tags in terms                                                                                               mean overlap term ratio
                                                                                                                                                                                                                                                                                                                                              terms in tags

                                                    mean cosine similarity
                                                    between tags and keywords




                                                                                                               tags reveal user perception of articles
                                      tags for articles of the
                                      journal J Stat Mech                                                                 crowdsourcing article & journal evaluation



                                                                                                                                                                                           Analysis over time can reveal shifts in thematic focus areas

                                                                                                                                                                                                                                                                                tags assigned to articles
                                      intermediary keywords for articles                                                                                                                                                                                                        published in J Phys
                                      of the journal J Stat Mech                                                                                                                                                                                                                Condens Matter in 2004
Mitglied der Helmholtz-Gemeinschaft




                                                                                                                                                                                tags assigned to articles published in 
                                                                                                                                                                                      J Phys Condens Matter in 2008
                                                                                                                                                                                                                                                                                                                                                                                                                        overlap: at least one
                                                                                                                                                                                                                                                                                                                                                                                                                           term in common
                                                    Good, B., Tennis, J., & Wilkinson, M. 2009. Social tagging in the life sciences: Characterizing a new metadata resource for bioinformatics. BMC Bioinformatics, 10(313). DOI= 10.1186/1471‐2105‐10‐313. 
                                                    Haustein, S. 2011. Wissenschaftliche Zeitschriften im Web 2.0. Die Analyse von Bookmarks zur Evaluation wissenschaftlicher Journale. In Proceedings of the 12th International Symposium for Information Science (Hildesheim,
                                                       Germany, March 09‐11, 2011). 148‐159.
                                                    Haustein, S., & Siebenlist, T. 2011. Applying social bookmarking data to evaluate journal usage. Journal of Informetrics, 5(3). 446‐457.  DOI= 10.1016/j.joi.2011.04.002
                                       References




                                                    Jeong, W. 2009. Is tagging effective? Overlapping ratios with other metadata fields. In Proceedings of the International Conference on Dublin Core and Metadata Applications (Seoul, Korea, October 12‐16, 2009). 31‐39. 
                                                    Lin, X., Beaudoin, J., Bul, Y., & Desai, K. 2006. Exploring characteristics of social classification. In Proceedings of the 17th Annual ASIS&T SIG/CR Classification Research Workshop (Austin, USA, November 03‐08, 2006).         1 Department of Information Science, Heinrich‐Heine‐University Düsseldorf, 
                                                    Lu, C., Park J., &Hu, X. 2010. User tags versus expert‐assigned subject terms: A comparison of LibraryThing tags and Library of Congress Subject Headings. Journal of Information Science, 36(6), 763‐779. 
                                                    Lux, M., Granitzer, M., & Kern, R. 2007. Aspects of broad folksonomies. In Proceedings of the 18th International Conference on Database and Expert Systems Applications (Regensburg, Germany, September 03‐07, 2007). 283‐287.                                                Universitätsstraße 1, 40225 Düsseldorf (Germany)
                                                    Noll, M. G., & Meinel, C. 2007. Authors vs. readers. A comparative study of document metadata and content in the WWW. In Proceedings of the 2007 ACM Symposium on Document Engineering (Winnipeg, Canada, August 28‐31,                                                          2 Central Library, Forschungszentrum Jülich,
                                                       2007). 177‐186. 
                                                    Peters, I. 2009. Folksonomies. Indexing and Retrieval in Web 2.0. De Gruyter Saur, München.
                                                    Terliesner, J., & Peters, I. 2011. Der T‐Index als Stabilitätsindikator für dokument‐spezifische Tag‐Verteilungen. In Proceedings of the 12th International Symposium for Information Science (Hildesheim, Germany, March 09‐11,
                                                                                                                                                                                                                                                                                                                                                52425 Jülich (Germany)
                                                       2011). 123‐133.
                                                    * http://snowball.tartarus.org/download.php.

Más contenido relacionado

Similar a Crowdsourcing in Article Evaluation

Evaluation Tool Rurener 30 11 09
Evaluation Tool Rurener 30 11 09Evaluation Tool Rurener 30 11 09
Evaluation Tool Rurener 30 11 09mandika
 
2011 2012 poster
2011 2012 poster2011 2012 poster
2011 2012 posterlagman1
 
Pen test press_kit_2012
Pen test press_kit_2012Pen test press_kit_2012
Pen test press_kit_2012Amiga Utomo
 
Orchestrating the Technologies and Processes of the Customer Engagement Cycle
Orchestrating the Technologies and Processes of the Customer Engagement CycleOrchestrating the Technologies and Processes of the Customer Engagement Cycle
Orchestrating the Technologies and Processes of the Customer Engagement CycleMichael Moon
 
10 Most Common Misconceptions About User Experience Design
10 Most Common Misconceptions About User Experience Design10 Most Common Misconceptions About User Experience Design
10 Most Common Misconceptions About User Experience DesignWhitney Hess
 
Web application architecture
Web application architectureWeb application architecture
Web application architectureJoshua Eckblad
 
CRM en verandermanagement- Nanne Dodde- CRMin1day
CRM en verandermanagement- Nanne Dodde- CRMin1dayCRM en verandermanagement- Nanne Dodde- CRMin1day
CRM en verandermanagement- Nanne Dodde- CRMin1day3ND B.V.
 
Health Information Technology poster presentation
Health Information Technology poster presentationHealth Information Technology poster presentation
Health Information Technology poster presentationnoelanif5
 
Bringing the Real World to ZAP @ USF.
Bringing the Real World to ZAP @ USF.Bringing the Real World to ZAP @ USF.
Bringing the Real World to ZAP @ USF.Eric Ritter
 

Similar a Crowdsourcing in Article Evaluation (11)

Evaluation Tool Rurener 30 11 09
Evaluation Tool Rurener 30 11 09Evaluation Tool Rurener 30 11 09
Evaluation Tool Rurener 30 11 09
 
2011 2012 poster
2011 2012 poster2011 2012 poster
2011 2012 poster
 
Pen test press_kit_2012
Pen test press_kit_2012Pen test press_kit_2012
Pen test press_kit_2012
 
Orchestrating the Technologies and Processes of the Customer Engagement Cycle
Orchestrating the Technologies and Processes of the Customer Engagement CycleOrchestrating the Technologies and Processes of the Customer Engagement Cycle
Orchestrating the Technologies and Processes of the Customer Engagement Cycle
 
PLE Overview
PLE OverviewPLE Overview
PLE Overview
 
10 Most Common Misconceptions About User Experience Design
10 Most Common Misconceptions About User Experience Design10 Most Common Misconceptions About User Experience Design
10 Most Common Misconceptions About User Experience Design
 
Web application architecture
Web application architectureWeb application architecture
Web application architecture
 
CRM en verandermanagement- Nanne Dodde- CRMin1day
CRM en verandermanagement- Nanne Dodde- CRMin1dayCRM en verandermanagement- Nanne Dodde- CRMin1day
CRM en verandermanagement- Nanne Dodde- CRMin1day
 
Health Information Technology poster presentation
Health Information Technology poster presentationHealth Information Technology poster presentation
Health Information Technology poster presentation
 
Fractions programme
Fractions programmeFractions programme
Fractions programme
 
Bringing the Real World to ZAP @ USF.
Bringing the Real World to ZAP @ USF.Bringing the Real World to ZAP @ USF.
Bringing the Real World to ZAP @ USF.
 

Último

ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseCeline George
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQuiz Club NITW
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptxDhatriParmar
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1GloryAnnCastre1
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfJemuel Francisco
 
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxMan or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxDhatriParmar
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...DhatriParmar
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxkarenfajardo43
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 

Último (20)

ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 Database
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
Unraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptxUnraveling Hypertext_ Analyzing  Postmodern Elements in  Literature.pptx
Unraveling Hypertext_ Analyzing Postmodern Elements in Literature.pptx
 
Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1Reading and Writing Skills 11 quarter 4 melc 1
Reading and Writing Skills 11 quarter 4 melc 1
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdfGrade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
Grade 9 Quarter 4 Dll Grade 9 Quarter 4 DLL.pdf
 
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxMan or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 

Crowdsourcing in Article Evaluation

  • 1. Crowdsourcing in Article Evaluation Isabella Peters1, Stefanie Haustein1,2 & Jens Terliesner1 MULTIDIMENSIONAL  JOURNAL  isabella.peters@uni-duesseldorf.de | s.haustein@fz-juelich.de | jens.terliesner@uni-duesseldorf.de JOURNAL MANAGEMENT EVALUATION GENERAL IDEA OF CROWDSOURCING ARTICLE & JOURNAL EVALUATION Past: traditional  journal evaluation uses cumulated Present: access‐,  download‐,  subscription statistics of  Future: a  multidimensional  approach which combines citation numbers of articles. electronic articles should reflect usage of  articles and  available usage numbers. Problem: citations do  not appropriately reflect articles‘ journals. Focus: data of  STM‐social bookmarking systems (e.g.,  influence on  readers  because only such  readers  were Problem: measuring is problematic although standards CiteULike)  for measuring journal perception and  reader count, who also write articles and publish in journals. are given. Global usage statistics are not available. perception of articles as crowdsourced alternative. 88.4% of all retrieved bookmarks were tagged DATA COLLECTION & TEST SETS matc hing via DO I REV MOD PHYS 10000 8,208 tags were assigned 38,241 times Test set I O I /D via SN 45 solid state physics journals g in / IS ch n 1000 ar tio frequency all publications from 2004 to 2008 se evia ab br 100 J PHYS A / bibliographic data for 168,109  J PHYS A tit le SOFT MATTER documents from Web of Science 10 PHYS REV E PHYS REV E Data collection CiteULike 1 1 10 100 1000 10000 matching of articles and bookmarks REV MOD PHYS tag tags tagcloud: all tags assigned at least 50 times to articles in CiteULike, BibSonomy and Connotea journals 350 13,608 bookmarks were matched to  satoshi (322 posts) Test set II 300 10,280 articles Number of bookmarks per user 13,760 correct bookmarks retrieved  bookmarks J PHYS A 250 2,441 unique users bronckobuster (238 posts) for articles of test set I Number of bookmarks retrieved rice (234 posts) 1,179 users posted one article bibliographic data 200 BibSonomy 940 75% of content is created by 21%  150 of users CiteULike 10778 100 8,511 articles were only  50 bookmarked once Connotea 2042 0 Users from BibSonomy, CiteULike and Connotea RQ: DO TAGS REFLECT OTHER VIEWS ON ARTICLES THAN AUTHOR OR INTERMEDIARY KEYWORDS? Comparison of: Preprocessing and cleaning of keyword sets: Results from preprocessing and cleaning: aim: to receive a linguistically homogenous keyword author: ‐55.3% spelling variants subject headings collection intermediary: ‐2.8% spelling variants publication all keywords: removed special characters (except automatic: ‐5.3% spelling variants Inspec author matching via DOI 724 articles of test set I&II  hyphens and underscores), lower case, BE to AE, tags: ‐8.4% variants * keywords contained all keyword types stemming with Porter 2 title author keywords: removed stop words and dataset author: +34.1% overlap** tags terms comparison of keyword sets specific terms (e.g., imported) intermediary: +21% overlap abstract on article level via cosine tags for comparison with title & abstract terms: split at  automatic: +20.6% overlap BibSonomy terms CiteULike similarity coefficient separating character (e.g., hyphen or undescore) Web of Science Connotea tags for comparison with automatic & controlled ** at least one term in common KeyWords PlusTM keywords: deletion of separating character and blanks RESULTS OF TERM SET COMPARISON mean overlap tag ratio tags in terms mean overlap term ratio terms in tags mean cosine similarity between tags and keywords tags reveal user perception of articles tags for articles of the journal J Stat Mech crowdsourcing article & journal evaluation Analysis over time can reveal shifts in thematic focus areas tags assigned to articles intermediary keywords for articles published in J Phys of the journal J Stat Mech Condens Matter in 2004 Mitglied der Helmholtz-Gemeinschaft tags assigned to articles published in  J Phys Condens Matter in 2008 overlap: at least one term in common Good, B., Tennis, J., & Wilkinson, M. 2009. Social tagging in the life sciences: Characterizing a new metadata resource for bioinformatics. BMC Bioinformatics, 10(313). DOI= 10.1186/1471‐2105‐10‐313.  Haustein, S. 2011. Wissenschaftliche Zeitschriften im Web 2.0. Die Analyse von Bookmarks zur Evaluation wissenschaftlicher Journale. In Proceedings of the 12th International Symposium for Information Science (Hildesheim, Germany, March 09‐11, 2011). 148‐159. Haustein, S., & Siebenlist, T. 2011. Applying social bookmarking data to evaluate journal usage. Journal of Informetrics, 5(3). 446‐457.  DOI= 10.1016/j.joi.2011.04.002 References Jeong, W. 2009. Is tagging effective? Overlapping ratios with other metadata fields. In Proceedings of the International Conference on Dublin Core and Metadata Applications (Seoul, Korea, October 12‐16, 2009). 31‐39.  Lin, X., Beaudoin, J., Bul, Y., & Desai, K. 2006. Exploring characteristics of social classification. In Proceedings of the 17th Annual ASIS&T SIG/CR Classification Research Workshop (Austin, USA, November 03‐08, 2006).  1 Department of Information Science, Heinrich‐Heine‐University Düsseldorf,  Lu, C., Park J., &Hu, X. 2010. User tags versus expert‐assigned subject terms: A comparison of LibraryThing tags and Library of Congress Subject Headings. Journal of Information Science, 36(6), 763‐779.  Lux, M., Granitzer, M., & Kern, R. 2007. Aspects of broad folksonomies. In Proceedings of the 18th International Conference on Database and Expert Systems Applications (Regensburg, Germany, September 03‐07, 2007). 283‐287.  Universitätsstraße 1, 40225 Düsseldorf (Germany) Noll, M. G., & Meinel, C. 2007. Authors vs. readers. A comparative study of document metadata and content in the WWW. In Proceedings of the 2007 ACM Symposium on Document Engineering (Winnipeg, Canada, August 28‐31,  2 Central Library, Forschungszentrum Jülich, 2007). 177‐186.  Peters, I. 2009. Folksonomies. Indexing and Retrieval in Web 2.0. De Gruyter Saur, München. Terliesner, J., & Peters, I. 2011. Der T‐Index als Stabilitätsindikator für dokument‐spezifische Tag‐Verteilungen. In Proceedings of the 12th International Symposium for Information Science (Hildesheim, Germany, March 09‐11, 52425 Jülich (Germany) 2011). 123‐133. * http://snowball.tartarus.org/download.php.