Five NLP Challenges in
Data-Driven Personas
Dr. Joni Salminen
October 20, 2021
Nanyang Technological University, Singapore
Meet the APG Team!
Professor Jim Jansen
The Leader (Principal Scientist)
• Inventor of APG
• Leads the project
• Customer relationships &
management
MSc. Soon-gyo Jung
The Genius (Software Engineer)
• Creator of APG
• Front-End / Back-End
• Implements like a genius, hence
the nickname
Dr. Joni Salminen
The Handyman (Scientist)
• Helps with user studies,
system development, etc.
• Strategic guy, likes to think about
the big picture
?
YOU?
Why personas?
• Summarize relevant user information so that decision
makers can do their jobs better (e.g., creating
products that actually serve people’s needs)
• Are an alternative (or complement) to numbers
• Provide a different way of doing user/customer analytics
(more approachable & memorable)
• Give faces to user data
…are not just about visualization, but empathetic
representations of users! [1]
[1] Nielsen, L. (2019). Personas - User Focused Design (2nd ed.).
Springer.
Why automate persona generation?
Personas are usually created with manual methods (i.e.,
interviews & ethnography) that are expensive and slow to
implement, and the resulting personas can quickly become
outdated. Because of these limitations, personas risk being
inaccurate representations of the true user base.
Better personas, better decisions, better results.
In contrast, APG provides personas that are fast to
create and updated automatically. This means the cost of
persona creation is dramatically reduced, making them
available for organizations with limited means (e.g.,
startups, small businesses). Depending on the underlying
dataset, APG can cover a wide range of behaviors and
demographics.
Manual methods
Automation
An, J., Kwak, H., Salminen, J., Jung, S., & Jansen, B. J. (2018). Imaginary People
Representing Real Numbers: Generating Personas from Online Social Media Data.
ACM Transactions on the Web (TWEB), 12(4), 27. https://doi.org/10.1145/3265986
v. 2.0 (2021)
Literally, giving faces to user data!
Personification = nameless, faceless
segments are turned into personas that
describe a behavioral and demographic
pattern in the data [1]
Enrichment = enriching the persona profiles with
additional information such as sentiment, loyalty, quotes,
most viewed content, and topics of interest [1]
[1] An, J., Kwak, H., Salminen, J., Jung, S., & Jansen, B. J. (2018).
Imaginary People Representing Real Numbers: Generating Personas
from Online Social Media Data. ACM Transactions on the Web
(TWEB), 12(4), 27. https://doi.org/10.1145/3265986
Requirements:
• Enough data (e.g., >100,000
viewers/visitors/users/customers)
• Enough content (e.g., >1000
products/pages/videos/posts)
• Large and heterogeneous
audience
…so, probably not good for most
SMEs, startups, micro-organizations
(traditional personas work the best
for such organizations!)
You choose the tool based
on the problem!
Our ”Client Persona”
Research Roadmap for
Automatic Persona Generation [1]
Information architecture:
How to determine the
relevant persona
information for a given user,
use case, and industry?
(e.g., e-health, e-commerce,
politics, gaming…)
Quotes:
How to find demographically
matching, non-toxic comments
that describe the persona’s
attitudes and are relevant for
end users?
Temporal analysis:
How to analyze change
of personas over time?
APG is about finding better ways to process and choose
useful user information from vast amounts of online data.
”Personas are about giving faces to data.”
Image: How to
automatically generate, tag,
and choose appropriate
persona profile pictures?
Evaluation: (1) How to ensure
personas are of high quality
(complete, clear, consistent and
credible)? (2) How to measure
value of personas for individuals
and organizations?
Attributes & Topics of
Interest: How to automatically
infer user attributes, such as
interests, needs, wants, goals,
political orientation, and brand
affinity from social media?
[1] Salminen, J., Jansen, B. J., An, J., Kwak, H.,
& Jung, S. (2019). Automatic Persona
Generation for Online Content Creators:
Conceptual Rationale and a Research Agenda.
In L. Nielsen (Ed.), Personas—User Focused
Design (2nd ed., pp. 135–160). Springer London.
https://doi.org/10.1007/978-1-4471-7427-1_8
Interactivity: How to design
interactive features that help
users cope with more
personas?
Current NLP techniques in APG
• Topic classification:
• Current: Zero-shot classification (à la HuggingFace RoBERTa) for small
organizations and supervised ML (XGBoost and TF-IDF) for large
clients
• Past: LDA (crap!)
• Sentiment analysis:
• Current: EmoLex (multiple languages, dictionary-based)
• Future: SenticNet?
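A minimal sketch of dictionary-based emotion scoring, the general approach behind lexicons such as EmoLex. The tiny lexicon, the function names, and the polarity rule are all assumptions made for illustration; they are not APG's code, and the real EmoLex covers thousands of words.

```python
import re
from collections import Counter

# Hypothetical toy lexicon for illustration only; a real lexicon such as
# EmoLex maps thousands of words to emotion and polarity labels.
TOY_LEXICON = {
    "love": {"joy", "positive"},
    "great": {"positive"},
    "hate": {"anger", "negative"},
    "afraid": {"fear", "negative"},
    "terrible": {"fear", "negative"},
}


def score_emotions(text, lexicon=TOY_LEXICON):
    """Count how many tokens in `text` carry each lexicon label."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter()
    for token in tokens:
        for label in lexicon.get(token, ()):
            counts[label] += 1
    return counts


def polarity(counts):
    """Reduce label counts to 'positive', 'negative', or 'neutral'."""
    diff = counts["positive"] - counts["negative"]
    if diff > 0:
        return "positive"
    if diff < 0:
        return "negative"
    return "neutral"
```

Because the method is a plain lookup, supporting another language mostly means swapping in a lexicon for that language, at the cost of ignoring negation and context.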
CHA1: Generate Persona Quotes
• Objective: Generate artificial quotes that reflect the persona’s
(a) attitudes and (b) demographics.
• NLP field: Conditional text generation
• Requirements:
• Demographically accurate
• Attitudinally accurate
• Topically accurate (enables searching)
The key word here is conditional: mere grammaticality is not enough;
the quotes need to capture the persona’s ”self”.
”Quotes reflect the
persona’s attitudes
about given topics and
about life in general.”
CHA2: Chat with Personas
• Objective: Make it possible for users to ask a persona
questions, and have the persona give answers that, again, reflect
who the persona is in terms of (a) attitudes, (b) demographics, and
(c) topics.
• NLP field: Dialogue systems
type to ask Ahmed a question…
You: Hi Ahmed! What
do you think about the
elections in Pakistan?
Ahmed: I don’t like it
[negative sentiment, click
to learn more]
CHA3: Frankenstein’s Personas
• Objective: solve Bødker’s [1] ”Frankenstein problem”:
inconsistency of persona information
• Example cases: man vs. woman, Indian vs. Pakistani, etc.
(cultural sensibilities (Häkkilä et al. [2]))
• NLP field: supervised ML (language modeling)
• How to match the quotes with the personas’ demographics and
actual attitudes? (And how to reflect as many aspects of the
persona’s attitudes as possible?)
[1] Bødker, S., Christiansen, E., Nyvang, T., & Zander, P.-O. (2012).
Personas, people and participation: Challenges from the trenches of
local government. Proceedings of the 12th Participatory Design
Conference: Research Papers-Volume 1, 91–100.
[2] Häkkilä, J., Wiberg, M., Eira, N. J., Seppänen, T., Juuso, I., Mäkikalli, M., &
Wolf, K. (2020). Design Sensibilities-Designing for Cultural Sensitivity.
Proceedings of the 11th Nordic Conference on Human-Computer Interaction:
Shaping Experiences, Shaping Society, 1–3.
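One simple way to picture the Frankenstein problem is as a metadata check: a candidate quote is rejected when its author's attributes contradict the persona. The sketch below is a rule-based stand-in, not the supervised approach named on the slide, and every class and field name in it is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Persona:
    # Hypothetical persona fields, for illustration only.
    gender: str
    country: str
    topics: frozenset


@dataclass
class Quote:
    # Hypothetical metadata attached to a candidate quote.
    text: str
    author_gender: str
    author_country: str
    topic: str


def inconsistencies(persona, quote):
    """List the persona attributes that the quote's metadata contradicts."""
    problems = []
    if quote.author_gender != persona.gender:
        problems.append("gender")
    if quote.author_country != persona.country:
        problems.append("country")
    if quote.topic not in persona.topics:
        problems.append("topic")
    return problems


def consistent_quotes(persona, quotes):
    """Keep only the quotes with no detected inconsistency."""
    return [q for q in quotes if not inconsistencies(persona, q)]
```

A check like this only works when quotes come with demographic labels, which is exactly the data that is scarce; inferring those labels from the text itself is where the NLP challenge lies.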
CHA4: Drifting Personas
• Objective: Identify topical changes in personas and notify
decision makers of these changes.
• NLP field: Concept drift / topic drift / model drift… (common
issues in ML [1])
• All refer to CHANGE in the underlying user behavior (basically,
the data: new categories appear, old ones change, distributions
change, etc.)
• How often should personas be changed? How should the change
be measured / detected? [2]
[1] Widmer, G., & Kubat, M. (1996). Learning in the presence of concept drift and hidden contexts. Machine
Learning, 23(1), 69–101.
[2] Jansen, B. J., Jung, S., & Salminen, J. (2019). Capturing the change in topical interests of personas over
time. Proceedings of the Association for Information Science and Technology, 56(1), 127–136.
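As one possible answer to "how should the change be measured", the sketch below compares a persona's topic distribution in two time periods using Jensen-Shannon divergence. The choice of this measure, the function names, and the 0.1 threshold are assumptions for illustration; they are not taken from the cited work.

```python
import math


def normalize(counts):
    """Turn raw topic counts into a probability distribution."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("topic counts must not be all zero")
    return {topic: n / total for topic, n in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2) between two distributions.

    Returns 0 for identical distributions and 1 for distributions
    that share no topics.
    """
    topics = set(p) | set(q)
    mix = {t: 0.5 * (p.get(t, 0.0) + q.get(t, 0.0)) for t in topics}

    def kl(a):
        # Kullback-Leibler divergence of `a` from the mixture.
        return sum(
            a[t] * math.log2(a[t] / mix[t])
            for t in topics
            if a.get(t, 0.0) > 0
        )

    return 0.5 * kl(p) + 0.5 * kl(q)


def has_drifted(old_counts, new_counts, threshold=0.1):
    """Flag a persona whose topical interests moved more than `threshold`."""
    return js_divergence(normalize(old_counts), normalize(new_counts)) > threshold
```

The measure is symmetric and handles topics that appear in only one period, which matters when new categories emerge. The threshold would have to be tuned against what decision makers consider a change worth a notification.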
CHA5: Personas from Text Only
• User segmentation / text analytics / pattern mining
• Either for a specific use case (e.g., toxic personas, fake news
personas, fandom personas…) or general representations of
humanity that can be queried at will (i.e., ”stacking” different
user models on top of each other to create truly multifaceted
human representations)
• Needs data, help from psychologists, etc. How to validate the
resulting personas?
Common challenges:
• Modeling people based on what they write.
• Lack of resources:
• Datasets (need demographically labeled data)
• Baselines
• Evaluation metrics (have to consider UX / HCI / user feedback; not only
technical, but socio-technical problems)
• Most importantly, not enough PEOPLE working on these issues
Data is available but what about
information?
• People’s attitudes, fears, doubts, hopes, needs, wants… can
these be inferred from unstructured (micro-)texts?
• Rosetta Stone for data-driven personas: user modeling /
attribute inference from smartly sampled tweets?
• Dictionaries (LIWC, AFINN, EmoLex) vs. deep learning?
…VITALLY important because persona users’ information needs
are unique: they need flexible tools to query
persona attitudes in real time
➔ static data-driven personas won’t do!
Thank you! Questions?
Dr. Joni Salminen
jsalminen@hbku.edu.qa
The APG family (Davao, 2019)
Get the book from Amazon!
(or your university library)

Más contenido relacionado

Similar a Five NLP Challenges in Data-Driven Personas

Automatic Persona Generation: Introduction & Current Challenges
Automatic Persona Generation: Introduction & Current ChallengesAutomatic Persona Generation: Introduction & Current Challenges
Automatic Persona Generation: Introduction & Current ChallengesJoni Salminen
 
Enriching social media personas with personality traits
Enriching social media personas with personality traitsEnriching social media personas with personality traits
Enriching social media personas with personality traitsJoni Salminen
 
Deep Customer Insights, Laurea, October 2015
Deep Customer Insights, Laurea, October 2015 Deep Customer Insights, Laurea, October 2015
Deep Customer Insights, Laurea, October 2015 Taneli Heinonen
 
Research Roadmap for Automatic Persona Generation (2018)
Research Roadmap for Automatic Persona Generation (2018)Research Roadmap for Automatic Persona Generation (2018)
Research Roadmap for Automatic Persona Generation (2018)Joni Salminen
 
Social Media Analytics: Concepts, Models, Methods, & Tools - Ravi Vatrapu
Social Media Analytics: Concepts, Models, Methods, & Tools - Ravi VatrapuSocial Media Analytics: Concepts, Models, Methods, & Tools - Ravi Vatrapu
Social Media Analytics: Concepts, Models, Methods, & Tools - Ravi VatrapuCBS Competitiveness Platform
 
Personas drivers of transformation or hollow profit?
Personas drivers of transformation or hollow profit? Personas drivers of transformation or hollow profit?
Personas drivers of transformation or hollow profit? Ann Longley
 
Personas and politics: The discursive construction of the "user" in Informat...
Personas and politics:  The discursive construction of the "user" in Informat...Personas and politics:  The discursive construction of the "user" in Informat...
Personas and politics: The discursive construction of the "user" in Informat...tankgrrl
 
20220103 jim spohrer hicss v9
20220103 jim spohrer hicss v920220103 jim spohrer hicss v9
20220103 jim spohrer hicss v9ISSIP
 
Big Data, Small Personas: Research Agenda for Automatic Persona Generation
Big Data, Small Personas: Research Agenda for Automatic Persona GenerationBig Data, Small Personas: Research Agenda for Automatic Persona Generation
Big Data, Small Personas: Research Agenda for Automatic Persona GenerationJoni Salminen
 
Social Media and Student Learning: Using Analytics to Visualise Twitter Commu...
Social Media and Student Learning: Using Analytics to Visualise Twitter Commu...Social Media and Student Learning: Using Analytics to Visualise Twitter Commu...
Social Media and Student Learning: Using Analytics to Visualise Twitter Commu...sharstoer
 
Is More Better?: Impact of Multiple Photos on Perception of Persona Profiles
Is More Better?: Impact of Multiple Photos on Perception of Persona ProfilesIs More Better?: Impact of Multiple Photos on Perception of Persona Profiles
Is More Better?: Impact of Multiple Photos on Perception of Persona ProfilesJoni Salminen
 
PatternLanguageOfData
PatternLanguageOfDataPatternLanguageOfData
PatternLanguageOfDatakimErwin
 
Design considerations for machine learning system
Design considerations for machine learning systemDesign considerations for machine learning system
Design considerations for machine learning systemAkemi Tazaki
 
Supporting Rationale Awareness in Large-Scale Online Open Participative Activ...
Supporting Rationale Awareness in Large-Scale Online Open Participative Activ...Supporting Rationale Awareness in Large-Scale Online Open Participative Activ...
Supporting Rationale Awareness in Large-Scale Online Open Participative Activ...Lu Xiao
 
Defining Personas, A User Experience Approach
Defining Personas, A User Experience ApproachDefining Personas, A User Experience Approach
Defining Personas, A User Experience ApproachLeon Kadoch Hardie
 
Convergence, Computation and Continuity: Challenges for PR in the 21st Century
Convergence, Computation and Continuity: Challenges for PR in the 21st CenturyConvergence, Computation and Continuity: Challenges for PR in the 21st Century
Convergence, Computation and Continuity: Challenges for PR in the 21st CenturySimon Collister & Associates
 
Social computing meet & greet
Social computing meet & greetSocial computing meet & greet
Social computing meet & greetAngela Brandt
 

Similar a Five NLP Challenges in Data-Driven Personas (20)

Automatic Persona Generation: Introduction & Current Challenges
Automatic Persona Generation: Introduction & Current ChallengesAutomatic Persona Generation: Introduction & Current Challenges
Automatic Persona Generation: Introduction & Current Challenges
 
Enriching social media personas with personality traits
Enriching social media personas with personality traitsEnriching social media personas with personality traits
Enriching social media personas with personality traits
 
Deep Customer Insights, Laurea, October 2015
Deep Customer Insights, Laurea, October 2015 Deep Customer Insights, Laurea, October 2015
Deep Customer Insights, Laurea, October 2015
 
Research Roadmap for Automatic Persona Generation (2018)
Research Roadmap for Automatic Persona Generation (2018)Research Roadmap for Automatic Persona Generation (2018)
Research Roadmap for Automatic Persona Generation (2018)
 
Social Media Analytics: Concepts, Models, Methods, & Tools - Ravi Vatrapu
Social Media Analytics: Concepts, Models, Methods, & Tools - Ravi VatrapuSocial Media Analytics: Concepts, Models, Methods, & Tools - Ravi Vatrapu
Social Media Analytics: Concepts, Models, Methods, & Tools - Ravi Vatrapu
 
Personas drivers of transformation or hollow profit?
Personas drivers of transformation or hollow profit? Personas drivers of transformation or hollow profit?
Personas drivers of transformation or hollow profit?
 
Personas and politics: The discursive construction of the "user" in Informat...
Personas and politics:  The discursive construction of the "user" in Informat...Personas and politics:  The discursive construction of the "user" in Informat...
Personas and politics: The discursive construction of the "user" in Informat...
 
20220103 jim spohrer hicss v9
20220103 jim spohrer hicss v920220103 jim spohrer hicss v9
20220103 jim spohrer hicss v9
 
Big Data, Small Personas: Research Agenda for Automatic Persona Generation
Big Data, Small Personas: Research Agenda for Automatic Persona GenerationBig Data, Small Personas: Research Agenda for Automatic Persona Generation
Big Data, Small Personas: Research Agenda for Automatic Persona Generation
 
Social Media and Student Learning: Using Analytics to Visualise Twitter Commu...
Social Media and Student Learning: Using Analytics to Visualise Twitter Commu...Social Media and Student Learning: Using Analytics to Visualise Twitter Commu...
Social Media and Student Learning: Using Analytics to Visualise Twitter Commu...
 
Is More Better?: Impact of Multiple Photos on Perception of Persona Profiles
Is More Better?: Impact of Multiple Photos on Perception of Persona ProfilesIs More Better?: Impact of Multiple Photos on Perception of Persona Profiles
Is More Better?: Impact of Multiple Photos on Perception of Persona Profiles
 
PatternLanguageOfData
PatternLanguageOfDataPatternLanguageOfData
PatternLanguageOfData
 
Design considerations for machine learning system
Design considerations for machine learning systemDesign considerations for machine learning system
Design considerations for machine learning system
 
Supporting Rationale Awareness in Large-Scale Online Open Participative Activ...
Supporting Rationale Awareness in Large-Scale Online Open Participative Activ...Supporting Rationale Awareness in Large-Scale Online Open Participative Activ...
Supporting Rationale Awareness in Large-Scale Online Open Participative Activ...
 
G1803024452
G1803024452G1803024452
G1803024452
 
Defining Personas, A User Experience Approach
Defining Personas, A User Experience ApproachDefining Personas, A User Experience Approach
Defining Personas, A User Experience Approach
 
Convergence, Computation and Continuity: Challenges for PR in the 21st Century
Convergence, Computation and Continuity: Challenges for PR in the 21st CenturyConvergence, Computation and Continuity: Challenges for PR in the 21st Century
Convergence, Computation and Continuity: Challenges for PR in the 21st Century
 
The wicked problem of data literacy - Corrall
The wicked problem of data literacy - CorrallThe wicked problem of data literacy - Corrall
The wicked problem of data literacy - Corrall
 
User centric design (ucd)
User centric design (ucd)User centric design (ucd)
User centric design (ucd)
 
Social computing meet & greet
Social computing meet & greetSocial computing meet & greet
Social computing meet & greet
 

Más de Joni Salminen

Problem of majority voting
Problem of majority votingProblem of majority voting
Problem of majority votingJoni Salminen
 
Persona Analytics: Progress Report and Road Ahead
Persona Analytics: Progress Report and Road AheadPersona Analytics: Progress Report and Road Ahead
Persona Analytics: Progress Report and Road AheadJoni Salminen
 
User Studies for APG: How to support system development with user feedback?
User Studies for APG: How to support system development with user feedback?User Studies for APG: How to support system development with user feedback?
User Studies for APG: How to support system development with user feedback?Joni Salminen
 
Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...
Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...
Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...Joni Salminen
 
To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...
To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...
To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...Joni Salminen
 
Determining Online Brand Reputation with Machine Learning from Social Media M...
Determining Online Brand Reputation with Machine Learning from Social Media M...Determining Online Brand Reputation with Machine Learning from Social Media M...
Determining Online Brand Reputation with Machine Learning from Social Media M...Joni Salminen
 
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...Joni Salminen
 
OSS-EBM: Open Source Software Entrepreneurial Business Modelling
OSS-EBM: Open Source Software Entrepreneurial Business ModellingOSS-EBM: Open Source Software Entrepreneurial Business Modelling
OSS-EBM: Open Source Software Entrepreneurial Business ModellingJoni Salminen
 
Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...
Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...
Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...Joni Salminen
 
Tips for Scale Development: Evaluating Automatic Personas
Tips for Scale Development: Evaluating Automatic PersonasTips for Scale Development: Evaluating Automatic Personas
Tips for Scale Development: Evaluating Automatic PersonasJoni Salminen
 
Why do startups avoid difficult problems?
Why do startups avoid difficult problems?Why do startups avoid difficult problems?
Why do startups avoid difficult problems?Joni Salminen
 
Social Espionage: Drawing Benefit from Competitors’ Social Media Presence
Social Espionage: Drawing Benefit from Competitors’ Social Media PresenceSocial Espionage: Drawing Benefit from Competitors’ Social Media Presence
Social Espionage: Drawing Benefit from Competitors’ Social Media PresenceJoni Salminen
 
Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)
Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)
Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)Joni Salminen
 
Social Media Marketing (Digital Marketing '15 @ Oulu University)
Social Media Marketing (Digital Marketing '15 @ Oulu University)Social Media Marketing (Digital Marketing '15 @ Oulu University)
Social Media Marketing (Digital Marketing '15 @ Oulu University)Joni Salminen
 
Search Engine Marketing (Digital Marketing '15 @ Oulu University)
Search Engine Marketing (Digital Marketing '15 @ Oulu University)Search Engine Marketing (Digital Marketing '15 @ Oulu University)
Search Engine Marketing (Digital Marketing '15 @ Oulu University)Joni Salminen
 
Introduction to Digital Marketing (Digital Marketing '15 @ Oulu University)
Introduction to Digital Marketing (Digital Marketing '15 @ Oulu University)Introduction to Digital Marketing (Digital Marketing '15 @ Oulu University)
Introduction to Digital Marketing (Digital Marketing '15 @ Oulu University)Joni Salminen
 
Web Analytics (Digital Marketing '15 @ Oulu University)
Web Analytics (Digital Marketing '15 @ Oulu University)Web Analytics (Digital Marketing '15 @ Oulu University)
Web Analytics (Digital Marketing '15 @ Oulu University)Joni Salminen
 
Generating Cultural Personas From Social Data - A Perspective of Middle Easte...
Generating Cultural Personas From Social Data - A Perspective of Middle Easte...Generating Cultural Personas From Social Data - A Perspective of Middle Easte...
Generating Cultural Personas From Social Data - A Perspective of Middle Easte...Joni Salminen
 
Who does what in marketing? Toward an understanding of marketer–machine inter...
Who does what in marketing? Toward an understanding of marketer–machine inter...Who does what in marketing? Toward an understanding of marketer–machine inter...
Who does what in marketing? Toward an understanding of marketer–machine inter...Joni Salminen
 
Social media marketing for researchers
Social media marketing for researchersSocial media marketing for researchers
Social media marketing for researchersJoni Salminen
 

Más de Joni Salminen (20)

Problem of majority voting
Problem of majority votingProblem of majority voting
Problem of majority voting
 
Persona Analytics: Progress Report and Road Ahead
Persona Analytics: Progress Report and Road AheadPersona Analytics: Progress Report and Road Ahead
Persona Analytics: Progress Report and Road Ahead
 
User Studies for APG: How to support system development with user feedback?
User Studies for APG: How to support system development with user feedback?User Studies for APG: How to support system development with user feedback?
User Studies for APG: How to support system development with user feedback?
 
Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...
Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...
Combining Behaviors and Demographics to Segment Online Audiences:Experiments ...
 
To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...
To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...
To Use Branded Keywords or Not? Rationale of Professional Search-engine Marke...
 
Determining Online Brand Reputation with Machine Learning from Social Media M...
Determining Online Brand Reputation with Machine Learning from Social Media M...Determining Online Brand Reputation with Machine Learning from Social Media M...
Determining Online Brand Reputation with Machine Learning from Social Media M...
 
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...
Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for...
 
OSS-EBM: Open Source Software Entrepreneurial Business Modelling
OSS-EBM: Open Source Software Entrepreneurial Business ModellingOSS-EBM: Open Source Software Entrepreneurial Business Modelling
OSS-EBM: Open Source Software Entrepreneurial Business Modelling
 
Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...
Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...
Gender effect on e-commerce sales of experience gifts: Preliminary empirical ...
 
Tips for Scale Development: Evaluating Automatic Personas
Tips for Scale Development: Evaluating Automatic PersonasTips for Scale Development: Evaluating Automatic Personas
Tips for Scale Development: Evaluating Automatic Personas
 
Why do startups avoid difficult problems?
Why do startups avoid difficult problems?Why do startups avoid difficult problems?
Why do startups avoid difficult problems?
 
Social Espionage: Drawing Benefit from Competitors’ Social Media Presence
Social Espionage: Drawing Benefit from Competitors’ Social Media PresenceSocial Espionage: Drawing Benefit from Competitors’ Social Media Presence
Social Espionage: Drawing Benefit from Competitors’ Social Media Presence
 
Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)
Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)
Strategic Digital Marketing (Digital Marketing '15 @ Oulu University)
 
Social Media Marketing (Digital Marketing '15 @ Oulu University)
Social Media Marketing (Digital Marketing '15 @ Oulu University)Social Media Marketing (Digital Marketing '15 @ Oulu University)
Social Media Marketing (Digital Marketing '15 @ Oulu University)
 
Search Engine Marketing (Digital Marketing '15 @ Oulu University)
Search Engine Marketing (Digital Marketing '15 @ Oulu University)Search Engine Marketing (Digital Marketing '15 @ Oulu University)
Search Engine Marketing (Digital Marketing '15 @ Oulu University)
 
Introduction to Digital Marketing (Digital Marketing '15 @ Oulu University)
Introduction to Digital Marketing (Digital Marketing '15 @ Oulu University)Introduction to Digital Marketing (Digital Marketing '15 @ Oulu University)
Introduction to Digital Marketing (Digital Marketing '15 @ Oulu University)
 
Web Analytics (Digital Marketing '15 @ Oulu University)
Web Analytics (Digital Marketing '15 @ Oulu University)Web Analytics (Digital Marketing '15 @ Oulu University)
Web Analytics (Digital Marketing '15 @ Oulu University)
 
Generating Cultural Personas From Social Data - A Perspective of Middle Easte...
Generating Cultural Personas From Social Data - A Perspective of Middle Easte...Generating Cultural Personas From Social Data - A Perspective of Middle Easte...
Generating Cultural Personas From Social Data - A Perspective of Middle Easte...
 
Who does what in marketing? Toward an understanding of marketer–machine inter...
Who does what in marketing? Toward an understanding of marketer–machine inter...Who does what in marketing? Toward an understanding of marketer–machine inter...
Who does what in marketing? Toward an understanding of marketer–machine inter...
 
Social media marketing for researchers
Social media marketing for researchersSocial media marketing for researchers
Social media marketing for researchers
 

Último

Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptxRajatChauhan518211
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 

Five NLP Challenges in Data-Driven Personas

  • 1. Five NLP Challenges in Data-Driven Personas Dr. Joni Salminen October 20, 2021 Nanyang Technological University, Singapore
  • 2. Meet the APG Team! Professor Jim Jansen The Leader (Principal Scientist) • Inventor of APG • Leads the project • Customer relationships & management MSc. Soon-gyo Jung The Genius (Software Engineer) • Creator of APG • Front-End / Back-End • Implements like a genius, hence the nickname Dr. Joni Salminen The Handyman (Scientist) • Helps with user studies, system development, etc. • Strategic guy, likes to think about the big picture ? YOU?
  • 3.
  • 4. Why personas? • Summarize relevant user information so that decision makers can do their jobs better (e.g., creating products that actually serve people’s needs) • Are an alternative (or complement) to numbers • Provide a different way of doing user/customer analytics (more approachable & memorable) • Give faces to user data …are not just about visualization, but empathetic representations of users! [1] [1] Nielsen, L. (2019). Personas—User Focused Design (2nd ed.). Springer.
  • 5. Why automate persona generation? Personas are usually created with manual methods (i.e., interviews & ethnography), methods that are expensive and slow to implement, and they can quickly become outdated. Because of the limitations, personas risk being inaccurate representations of the true user base. Better personas Better decisions Better results. In contrast, APG provides personas that are fast to create and updated automatically. This means the cost of persona creation is dramatically reduced, making them available for organizations with limited means (e.g., startups, small businesses). Depending on the underlying dataset, APG can cover a wide range of behaviors and demographics. Manual methods Automation An, J., Kwak, H., Salminen, J., Jung, S., & Jansen, B. J. (2018). Imaginary People Representing Real Numbers: Generating Personas from Online Social Media Data. ACM Transactions on the Web (TWEB), 12(4), 27. https://doi.org/10.1145/3265986
  • 7. Literally, giving faces to user data! Personification = nameless, faceless segments are turned into personas that describe a behavioral and demographic pattern in the data [1] Enrichment = enriching the persona profiles with additional information such as sentiment, loyalty, quotes, most viewed content, and topics of interest [1] [1] An, J., Kwak, H., Salminen, J., Jung, S., & Jansen, B. J. (2018). Imaginary People Representing Real Numbers: Generating Personas from Online Social Media Data. ACM Transactions on the Web (TWEB), 12(4), 27. https://doi.org/10.1145/3265986
  • 8.
  • 9. Requirements: • Enough data (e.g., >100,000 viewers/visitors/users/customers) • Enough content (e.g., >1000 products/pages/videos/posts) • Large and heterogeneous audience …so, probably not a good fit for most SMEs, startups, and micro-organizations (traditional personas work best for such organizations!) You choose the tool based on the problem! Our ”Client Persona”
  • 10. Research Roadmap for Automatic Persona Generation [1] Information architecture: How to determine the relevant persona information for a given user, use case, and industry? (e.g., e-health, e-commerce, politics, gaming…) Quotes: How to find demographically matching, non-toxic comments that describe the persona’s attitudes and are relevant for end users? Temporal analysis: How to analyze change of personas over time? APG is about finding better ways to process and choose useful user information from vast amounts of online data. ”Personas are about giving faces to data.” Image: How to automatically generate, tag, and choose appropriate persona profile pictures? Evaluation: (1) How to ensure personas are of high quality (complete, clear, consistent and credible)? (2) How to measure value of personas for individuals and organizations? Attributes & Topics of Interest: How to automatically infer user attributes, such as interests, needs, wants, goals, political orientation, and brand affinity from social media? [1] Salminen, J., Jansen, B. J., An, J., Kwak, H., & Jung, S. (2019). Automatic Persona Generation for Online Content Creators: Conceptual Rationale and a Research Agenda. In L. Nielsen (Ed.), Personas—User Focused Design (2nd ed., pp. 135–160). Springer London. https://doi.org/10.1007/978-1-4471-7427-1_8 Interactivity: How to design interactive features to make users cope with more personas?
  • 11. Current NLP techniques in APG • Topic classification: • Current: Zero-shot classification (à la HuggingFace RoBERTa) for small organizations and supervised ML (XGBoost and TF-IDF) for large clients • Past: LDA (crap!) • Sentiment analysis: • Current: EmoLex (multiple languages, dictionary-based) • Future: SenticNet?
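The dictionary-based sentiment analysis named on slide 11 can be illustrated with a minimal sketch. This is not APG's implementation: the lexicon below is a hypothetical five-word toy standing in for a real resource such as EmoLex, and the function names `score_text` and `polarity` are assumptions made for this example.

```python
import re
from collections import Counter

# Hypothetical toy lexicon for illustration only. A real dictionary such as
# EmoLex maps many thousands of words to emotion and polarity categories.
TOY_LEXICON = {
    "love": {"positive", "joy"},
    "great": {"positive"},
    "hate": {"negative", "anger"},
    "afraid": {"negative", "fear"},
    "terrible": {"negative"},
}


def score_text(text, lexicon=TOY_LEXICON):
    """Count how often each lexicon category occurs among the word tokens."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter()
    for token in tokens:
        for category in lexicon.get(token, ()):
            counts[category] += 1
    return counts


def polarity(text, lexicon=TOY_LEXICON):
    """Label text by comparing positive and negative word counts."""
    counts = score_text(text, lexicon)
    diff = counts["positive"] - counts["negative"]
    if diff > 0:
        return "positive"
    if diff < 0:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    print(polarity("I hate this terrible election"))
    print(score_text("I am afraid"))
```

A lookup like this is cheap and works across languages as long as a translated lexicon exists, but it ignores negation and context, which is one reason the deck later contrasts dictionaries with deep learning.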
  • 12. CHA1: Generate Persona Quotes • Objective: Generate artificial quotes that reflect the persona’s (a) attitudes and (b) demographics. • NLP field: Conditional text generation • Requirements: • Demographically accurate • Attitudinally accurate • Topically accurate (enables searching) The key here is conditional; mere grammaticality is not enough, as the quotes need to capture the persona’s ”self”. ”Quotes reflect the persona’s attitudes about given topics and about life in general.”
  • 13. CHA2: Chat with Personas • Objective: Make it possible for users to ask a persona questions, and the persona will give answers that, again, reflect who the persona is in terms of (a) attitudes, (b) demographics, and (c) topics. • NLP field: Dialogue systems type to ask Ahmed a question… You: Hi Ahmed! What do you think about the elections in Pakistan? Ahmed: I don’t like it [negative sentiment, click to learn more]
  • 14. CHA3: Frankenstein’s Personas • Objective: solve Bødker’s [1] ”Frankenstein problem”: inconsistency of persona information • Example cases: man vs. woman, Indian vs. Pakistani, etc. (cultural sensibilities (Häkkilä et al. [2])) • NLP field: supervised ML (language modeling) • How to match the quotes with the personas’ demographics and actual attitudes? (And maximize reflecting all aspects of the persona’s attitudes?) [1] Bødker, S., Christiansen, E., Nyvang, T., & Zander, P.-O. (2012). Personas, people and participation: Challenges from the trenches of local government. Proceedings of the 12th Participatory Design Conference: Research Papers-Volume 1, 91–100. [2] Häkkilä, J., Wiberg, M., Eira, N. J., Seppänen, T., Juuso, I., Mäkikalli, M., & Wolf, K. (2020). Design Sensibilities-Designing for Cultural Sensitivity. Proceedings of the 11th Nordic Conference on Human-Computer Interaction: Shaping Experiences, Shaping Society, 1–3.
  • 15. CHA4: Drifting Personas • Objective: Identify topical changes in personas and notify decision makers of these changes. • NLP field: Concept drift / topic drift / model drift… (common issues in ML [1]) • All refer to CHANGE in the underlying user behavior (basically, the data: new categories appear, old ones change, distributions change, etc.) • How often should personas be changed? How should the change be measured / detected? [2] [1] Widmer, G., & Kubat, M. (1996). Learning in the presence of concept drift and hidden contexts. Machine Learning, 23(1), 69–101. [2] Jansen, B. J., Jung, S., & Salminen, J. (2019). Capturing the change in topical interests of personas over time. Proceedings of the Association for Information Science and Technology, 56(1), 127–136.
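One possible way to measure the change asked about on slide 15 is to compare a persona's topic distribution across two time windows. The sketch below uses the Jensen-Shannon divergence for that comparison; this is an illustrative choice, not the method of the cited papers, and the function names and the 0.1 threshold are hypothetical values picked for the example.

```python
import math


def normalize(counts):
    """Turn raw topic counts into a probability distribution."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("topic counts must not sum to zero")
    return {topic: value / total for topic, value in counts.items()}


def js_divergence(p, q):
    """Jensen-Shannon divergence with base-2 logs, so the result is in [0, 1]."""
    topics = set(p) | set(q)
    # Mixture distribution: average of p and q; missing topics count as 0.
    m = {t: 0.5 * (p.get(t, 0.0) + q.get(t, 0.0)) for t in topics}

    def kl_to_mixture(a):
        # Kullback-Leibler divergence from a to m, skipping zero-mass topics.
        return sum(a[t] * math.log2(a[t] / m[t]) for t in a if a[t] > 0)

    return 0.5 * kl_to_mixture(p) + 0.5 * kl_to_mixture(q)


def has_drifted(old_counts, new_counts, threshold=0.1):
    """Flag drift when divergence exceeds a (hypothetical) threshold."""
    divergence = js_divergence(normalize(old_counts), normalize(new_counts))
    return divergence > threshold


if __name__ == "__main__":
    last_quarter = {"sports": 90, "news": 10}
    this_quarter = {"sports": 10, "news": 90}
    print(has_drifted(last_quarter, this_quarter))
```

Because the mixture handles topics that exist in only one window, new categories appearing and old ones vanishing both raise the score. How large a score should trigger a notification to decision makers remains an open question that the threshold here does not answer.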
  • 16. CHA5: Personas from Text Only • User segmentation / text analytics / pattern mining • Either for a specific use case (e.g., toxic personas, fake news personas, fandom personas…) or general representations of humanity that can be queried at will (i.e., ”stacking” different user models on top of each other to create truly multifaceted human representations) • Needs data, help from psychologists, etc. How to validate and so on?
  • 17. Common challenges: • Modeling people based on what they write. • Lack of resources: • Datasets (need demographically labeled data) • Baselines • Evaluation metrics (have to consider UX / HCI / user feedback; not only technical, but socio-technical problems) • Most importantly, not enough PEOPLE working on these issues
  • 18. Data is available but what about information? • People’s attitudes, fears, doubts, hopes, needs, wants… can these be inferred from unstructured (micro-)texts? • Rosetta Stone for data-driven personas: user modeling / attribute inference from smartly sampled tweets? • Dictionaries (LIWC, AFINN, EmoLex) vs. deep learning? …VITALLY important because persona users’ information needs are unique, so they need flexible tools to query persona attitudes in real time ➔ static data-driven personas won’t do!
  • 19. Thank you! Questions? Dr. Joni Salminen jsalminen@hbku.edu.qa The APG family (Davao, 2019) Get the book from Amazon! (or your university library)