SlideShare a Scribd company logo
1 of 7
Download to read offline
Twitter is noisy
but you can find some diamonds
Pew Internet report:


“75% of online news consumers say they get
  news forwarded through email or posts on
  social networking sites
and 52% say they share links to news with
  others via those means.”
Twitter Lists
• Filtering main friends timeline is a bad idea
• Twitter Lists: manually created set of users who
  often post on a certain topic
• For example:
   – @huffingtonpost/apple-news
   – @IndieFlix/film-people-to-follow
   – @alisohani/bigdata-analytics
• A Twitter user can be included into different lists.
• Me for example:
  http://twitter.com/mariagrineva/lists/memberships
What kind of noise?
• People tweet on other topics too, including
  personal stuff



• Global news widely spread, often really
  annoying: IPad launch, ash clouds, Christmas,
  Michael Jackson
Our Approach
• Identifying niche topic of Twitter list
  automatically, at real-time
• Improve the niche topic with respect to the
  Global Twitter Stream
  – If there is a burst related to Apple, IPad => check
    maybe all Twitter is talking about that
Filtering = Classification
• Traditional approaches to filter news use only
  textual features
• We use both textual and social features for
  classification
  – Twitter lists is a community of interconnected
    users => see who is the center and who is an
    outsider
What is done
• Method for identification list’s topic
  signature with respect to Global Twitter
  Stream
• Social features identification
• Evaluation framework

More Related Content

Viewers also liked

Analytics for the Real-Time Web
Analytics for the Real-Time WebAnalytics for the Real-Time Web
Analytics for the Real-Time Web
maria.grineva
 
Semantic Data Search and Analysis Using Web-based User-Generated Knowledge Bases
Semantic Data Search and Analysis Using Web-based User-Generated Knowledge BasesSemantic Data Search and Analysis Using Web-based User-Generated Knowledge Bases
Semantic Data Search and Analysis Using Web-based User-Generated Knowledge Bases
maria.grineva
 
XQuery Triggers in Native XML Database Sedna
XQuery Triggers in Native XML Database SednaXQuery Triggers in Native XML Database Sedna
XQuery Triggers in Native XML Database Sedna
maria.grineva
 
Architecture of Native XML Database Sedna
Architecture of Native XML Database SednaArchitecture of Native XML Database Sedna
Architecture of Native XML Database Sedna
maria.grineva
 
Tqr 2013 probes proxies
Tqr 2013 probes proxies Tqr 2013 probes proxies
Tqr 2013 probes proxies
An Jacobs
 
Analytics for the Real-Time Web
Analytics for the Real-Time WebAnalytics for the Real-Time Web
Analytics for the Real-Time Web
maria.grineva
 

Viewers also liked (12)

Analytics for the Real-Time Web
Analytics for the Real-Time WebAnalytics for the Real-Time Web
Analytics for the Real-Time Web
 
Semantic Data Search and Analysis Using Web-based User-Generated Knowledge Bases
Semantic Data Search and Analysis Using Web-based User-Generated Knowledge BasesSemantic Data Search and Analysis Using Web-based User-Generated Knowledge Bases
Semantic Data Search and Analysis Using Web-based User-Generated Knowledge Bases
 
XQuery Triggers in Native XML Database Sedna
XQuery Triggers in Native XML Database SednaXQuery Triggers in Native XML Database Sedna
XQuery Triggers in Native XML Database Sedna
 
Architecture of Native XML Database Sedna
Architecture of Native XML Database SednaArchitecture of Native XML Database Sedna
Architecture of Native XML Database Sedna
 
Tqr 2013 probes proxies
Tqr 2013 probes proxies Tqr 2013 probes proxies
Tqr 2013 probes proxies
 
Строителство Градът
Строителство ГрадътСтроителство Градът
Строителство Градът
 
Using Social Media to Build Your Business
Using Social Media to Build Your BusinessUsing Social Media to Build Your Business
Using Social Media to Build Your Business
 
Marketing Success in 7 Steps Guaranteed
Marketing Success in 7 Steps GuaranteedMarketing Success in 7 Steps Guaranteed
Marketing Success in 7 Steps Guaranteed
 
Development and Testing of a Conceptual Framework for Asking about Intoxication
Development and Testing of a Conceptual Framework for Asking about IntoxicationDevelopment and Testing of a Conceptual Framework for Asking about Intoxication
Development and Testing of a Conceptual Framework for Asking about Intoxication
 
The Social Research Group
The Social Research GroupThe Social Research Group
The Social Research Group
 
Defining the User Experience of Emotion & Content
Defining the User Experience of Emotion & ContentDefining the User Experience of Emotion & Content
Defining the User Experience of Emotion & Content
 
Analytics for the Real-Time Web
Analytics for the Real-Time WebAnalytics for the Real-Time Web
Analytics for the Real-Time Web
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Recently uploaded (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 

Filtering Twitter

  • 1. Twitter is noisy but you can find some diamonds
  • 2. Pew Internet report: “75% of online news consumers say they get news forwarded through email or posts on social networking sites and 52% say they share links to news with others via those means.”
  • 3. Twitter Lists • Filtering main friends timeline is a bad idea • Twitter Lists: manually created set of users who often post on a certain topic • For example: – @huffingtonpost/apple-news – @IndieFlix/film-people-to-follow – @alisohani/bigdata-analytics • A Twitter user can be included into different lists. • Me for example: http://twitter.com/mariagrineva/lists/memberships
  • 4. What kind of noise? • People tweet on other topics too, including personal stuff • Global news widely spread, often really annoying: IPad launch, ash clouds, Christmas, Michael Jackson
  • 5. Our Approach • Identifying niche topic of Twitter list automatically, at real-time • Improve the niche topic with respect to the Global Twitter Stream – If there is a burst related to Apple, IPad => check maybe all Twitter is talking about that
  • 6. Filtering = Classification • Traditional approaches to filter news use only textual features • We use both textual and social features for classification – Twitter lists is a community of interconnected users => see who is the center and who is an outsider
  • 7. What is done • Method for identification list’s topic signature with respect to Global Twitter Stream • Social features identification • Evaluation framework