The document discusses criteria for selecting comments for the "NYT Picks" section of the New York Times website. It examines literature on positive criteria for inclusion such as thoughtfulness, brevity, relevance, and diversity. It poses research questions on whether NYT Picks comments reflect these criteria and whether algorithms could be developed to assess them and augment human moderation. While automation may scale moderation and improve the user experience, it also raises issues regarding over-generalization and the need for transparency.
1. Picking the NYT Picks:
Editorial Criteria and Automation
in the Curation of Online News Comments
Nicholas Diakopoulos
University of Maryland, College Park – College of Journalism
@ndiakopoulos | nickdiakopoulos.com | nad@umd.edu
4. “NYT Picks is the most popular comment queue. We
spend a lot of time tweaking that and getting that
right.”
What are criteria for selection?
How can we augment moderator capability to consider more comments?
5. Criteria from Literature
Negative / Exclusion
Personal attacks, profanity, abusive behavior
Positive / Inclusion
Internal Coherence
Thoughtfulness
Brevity / Length
Relevance
Fairness / Diversity
Novelty
Argument Quality
Criticality
Emotionality
Entertainment Value
Readability
Personal Experience
11. But automation also raises questions about
over-generalization across contexts, and
algorithmic transparency
12. Questions?
Contact
Nick Diakopoulos
University of Maryland, College of Journalism
Twitter: @ndiakopoulos
Email: nad@umd.edu
Web: http://www.nickdiakopoulos.com
More Info
N. Diakopoulos. The Editor’s Eye: Curation and Comment
Relevance on the New York Times. Proc. CSCW. March,
2015.
Editor's notes
On September 11, 2013, Vladimir Putin published an op-ed in the NYT. Among other things, he questioned American exceptionalism – and if there’s one thing you shouldn’t do in ’merica, it’s that. He was prodding the American public.
In response, comments flooded in – 6,367 of them, in fact. Of those, 4,447 were published along with the piece.
How could you possibly organize thousands of comments and find the interesting or insightful ones?
Like other commenting systems, users can vote up a comment by recommending it. Comments are sorted oldest first, or they can be filtered by their recommendation scores.
The published comments included 85 that were deemed NYT Picks, which garner a little badge and reflect the “most interesting and thoughtful” comments.
What makes this most impressive, though, is that each of those comments was read by a human moderator, a trained journalist at the NYT, before being published. That is, the NYT practices pre-moderation, in contrast to many other publications, which only look at comments after they’re published.
In fact, they’re read by a team led by Bassey Etim, the community manager at the NYT. Together with his team of 13 moderators, they read almost every comment before it’s posted to the site.
Part of that job is choosing the NYT Picks comments. “NYT Picks is the most popular comment queue. We spend a lot of time tweaking that and getting that right.”
As a baseline they’re looking for about 5 picks per 100 comments. Outside of blogs they do about 22 queues a day, but they’d like to open comments on more articles. So how could we help them scale up?
Talk about the potential benefits of selecting comments: signals norms and expectations for behavior, creating a beneficial feedback loop.
Positive criteria considered in the literature come from studies of letters to the editor, online comments for print publications, and on-air radio comments.
Readability: style, clarity, adherence to standard grammar, and the degree to which a comment is well-articulated.
Stress that operationalizing these is hard and there are many challenges for future work.
The focus of this work is initially on crowdsourcing ratings for 9 of these dimensions, excluding relevance, fairness, and novelty since they are much more difficult to measure using crowdsourcing; I also have a previous paper that looked at relevance explicitly.
The crowdsourcing approach collected human ratings of 8 of the 9 criteria here (because length is trivial to measure by counting words). 500 comments, 250 each of NYT Picks and non-Picks, were rated on a scale from 1 to 5 on Amazon Mechanical Turk, with 3 independent ratings of each comment. Restricted to workers with a reliable and substantial history, from the US or Canada. Collected 1,500 ratings from 89 different workers.
We measured Krippendorff’s alpha, a measure of interrater reliability, and got slight to moderate agreement among the 3 raters, except for entertainment value (so people couldn’t agree on what was funny).
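For interval-scaled ratings like these 1-to-5 scales, Krippendorff’s alpha can be computed directly from the per-comment rating lists. A minimal sketch (not the toolkit used in the study):

```python
from itertools import permutations

def krippendorff_alpha_interval(units):
    """Krippendorff's alpha for interval data.

    `units` is a list of per-comment rating lists, e.g. [[4, 5, 4], [2, 2, 3]].
    Units with fewer than two ratings are dropped, per the standard procedure.
    Returns 1 - D_o / D_e (observed over expected disagreement).
    """
    units = [u for u in units if len(u) >= 2]
    n = sum(len(u) for u in units)  # total number of pairable values
    # Observed disagreement: squared differences within each unit,
    # over ordered pairs, normalized by (m_u - 1).
    d_o = sum(
        sum((a - b) ** 2 for a, b in permutations(u, 2)) / (len(u) - 1)
        for u in units
    ) / n
    # Expected disagreement: squared differences over all pooled values.
    pooled = [v for u in units for v in u]
    d_e = sum((a - b) ** 2 for a, b in permutations(pooled, 2)) / (n * (n - 1))
    return 1.0 - d_o / d_e
```

Perfect agreement (identical ratings within every comment) yields alpha = 1.0; systematic disagreement drives it toward or below zero.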
Eventually would like to compute scores for all of these criteria automatically, but for now we do three of them.
Readability is the reading level according to the SMOG index, an index that measures the usage of more complex words. There was a high correlation between the SMOG index and the crowdsourced ratings of readability.
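A rough sketch of how a SMOG-style grade could be computed; the syllable counter below is a naive vowel-group heuristic, not the exact tokenization used in the study:

```python
import math
import re

def count_syllables(word):
    """Naive heuristic: count groups of consecutive vowels (min 1)."""
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))

def smog_grade(text):
    """SMOG index: 1.0430 * sqrt(polysyllables * 30 / sentences) + 3.1291.

    Polysyllabic words are those with three or more syllables.
    """
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    polysyllables = sum(1 for w in words if count_syllables(w) >= 3)
    return 1.0430 * math.sqrt(polysyllables * 30 / len(sentences)) + 3.1291
```

Higher grades indicate denser use of complex words per sentence, which is how the index proxies reading level.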
Personal experience is based on detecting the proportion of words from LIWC dictionaries that reflect 1st person personal pronouns as well as family and friends relationships. Comment tokens are stemmed to match the dictionary entries.
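A minimal sketch of such a dictionary-proportion score. The word list here is a tiny hypothetical stand-in for the actual LIWC categories, and `*` prefix wildcards stand in for the stemming step:

```python
import re

# Tiny stand-in for the LIWC categories used (1st-person pronouns,
# family, friends); the real dictionaries are much larger.
# Entries ending in "*" match any word with that prefix.
PERSONAL_WORDS = {"i", "me", "my", "mine", "we", "our",
                  "mother", "father", "son", "daughter",
                  "famil*", "friend*", "neighbor*"}

def matches(token, entries):
    """True if token equals an entry or matches a '*' prefix entry."""
    for entry in entries:
        if entry.endswith("*"):
            if token.startswith(entry[:-1]):
                return True
        elif token == entry:
            return True
    return False

def personal_experience_score(comment):
    """Proportion of tokens drawn from the personal-experience word lists."""
    tokens = re.findall(r"[a-z']+", comment.lower())
    if not tokens:
        return 0.0
    hits = sum(1 for t in tokens if matches(t, PERSONAL_WORDS))
    return hits / len(tokens)
```

For example, a comment like “My father and I moved here” scores 0.5, since three of its six tokens hit the lists.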
So I found a statistically significant difference for all criteria except entertainment value and emotionality, and emotionality was marginally significant at p=0.08.
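The notes don’t spell out the exact test used; one simple, assumption-light way to check a group difference like this is a two-sided permutation test on the difference in mean ratings, sketched here with made-up scores:

```python
import random

def permutation_test(picks, non_picks, n_perm=5000, seed=0):
    """Two-sided permutation test on the difference in mean ratings."""
    rng = random.Random(seed)
    observed = abs(sum(picks) / len(picks) - sum(non_picks) / len(non_picks))
    pooled = list(picks) + list(non_picks)
    k = len(picks)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)  # random relabeling of Picks vs non-Picks
        diff = abs(sum(pooled[:k]) / k - sum(pooled[k:]) / (len(pooled) - k))
        if diff >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)  # add-one smoothed p-value

# Hypothetical crowdsourced "thoughtfulness" ratings (1-5 scale):
picks = [4, 5, 4, 4, 5, 4, 5, 4, 4, 5]
non_picks = [2, 3, 2, 3, 2, 2, 3, 3, 2, 2]
```

With clearly separated groups like these, almost no random relabeling reproduces the observed gap, so the p-value comes out small.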
Several of these criteria also correlated fairly well, such as thoughtfulness with readability, and argument quality with thoughtfulness. Future work might scale up the data collection and look at dimensionality reduction techniques.
All statistically significant at p=0.05 or lower.
Editorial selections (NYT Picks) do reflect many of the editorial criteria articulated in the literature: a continuity of professional criteria into the online space (except for brevity).
Online spaces don’t have the same space constraints, and we found NYT editors preferred longer comments for Picks. This raises the question of how well that serves users from their perspective.
The scores we computed, in particular the personal experience score, could have some really nice applications for amplifying the value of comments for moderators as well as reporters. In some follow-up work we’ve shown this to comment moderators, and they’re excited about the possibilities.
Automation could also enable new end-user experiences, where users adapt their own view of the comments based on automatically computed scores along journalistically interesting lines.
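One way such a user-adaptable view could work, sketched with hypothetical comments, score fields, and weights: let each reader weight the computed criterion scores and re-sort the comment list accordingly.

```python
def rerank(comments, weights):
    """Sort comments by a user-weighted combination of criterion scores.

    `comments` is a list of dicts with a 'text' field plus per-criterion
    scores; `weights` maps criterion name -> user-chosen weight.
    (Field and criterion names here are illustrative, not from the study.)
    """
    def score(c):
        return sum(w * c.get(name, 0.0) for name, w in weights.items())
    return sorted(comments, key=score, reverse=True)

comments = [
    {"text": "A", "readability": 0.9, "personal_experience": 0.1},
    {"text": "B", "readability": 0.4, "personal_experience": 0.8},
]
# A reader who values personal stories over polish:
view = rerank(comments, {"readability": 0.2, "personal_experience": 1.0})
```

Here comment B rises to the top because the reader’s weights favor personal experience; a different reader could weight readability instead.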
Over-generalization … different communities or topics (e.g. sports) require different treatment, so algorithmic solutions can’t be one-size-fits-all. Is it always better to highlight a highly readable comment, and when does that come into tension with diversity or fairness of perspectives?
Do Picks affect community or individual behavior?
Mention CommentIQ project at UMD, funded by the Knight Foundation
We’re going to be hiring a fellow or fellows, so if you’re interested in joining the lab, please come speak to me. We work on everything from data visualization to algorithmic accountability and transparency, as well as data mining of things like online comments. If you want to combine data and computing with design, in the context of journalism, please come talk.