SlideShare una empresa de Scribd logo
1 de 36
Descargar para leer sin conexión
Filtering Twitter Noise



The problem, and some solutions using
       TalkingPuffin and Scala.

           Dave Briccetti
           @dcbriccetti
What is Twitter noise?
How can I mute everything from Foursquare?
Nothing more from Foursquare.

If we wanted, we could just exclude Foursquare
         tweets from individual users.
What about noisy retweeters?
No more retweets from this user. This leaves her
   original, primary tweets, which we value.
How does TalkingPuffin store these filters?

       Let’s look at the filters dialog.
These are “compound filters.”
Contains a List of these:
Tags
Tags may be assigned to people, and tweets may
       be included or excluded by tags.

      Tags can be exported to Twitter lists.
Here we just want to look at Twitter employees.
Tags are managed by a TagUsers object.
CompoundFilters and tags are stored in
           InOutFilters.
InOutFilters are stored in a FilterSet, along with
        some general, boolean options.
Detecting retweets with regular expressions
External Noise Filters




 A shared repository of regular expressions or
other filters that can be used in a community to
            identify and filter out noise.
General Filters Pane
Delivered Via a Web Service on
      Google App Engine
Loaded into a List of regular
       expressions
And used to find noise tweets
Summary
●   There is a lot of noise on Twitter
●   With the right tools you can control it
●   TalkingPuffin is one such tool
●   Scala is a great language for creating such
    tools
More
●   Learn more about
    ●   TalkingPuffin at TalkingPuffin.org
    ●   Scala at scala-lang.org
●   Consider joining the TalkingPuffin open source
    development effort
●   Contact
    ●   Dave Briccetti, @dcbriccetti

Más contenido relacionado

Destacado

Berlitz Tip - Meetings in English
Berlitz Tip - Meetings in EnglishBerlitz Tip - Meetings in English
Berlitz Tip - Meetings in EnglishBerlitz Corporation
 
Berlitz Tips - Intercultural Communication
Berlitz Tips - Intercultural CommunicationBerlitz Tips - Intercultural Communication
Berlitz Tips - Intercultural CommunicationBerlitz Corporation
 
Nuevo presentación de microsoft office power point
Nuevo presentación de microsoft office power pointNuevo presentación de microsoft office power point
Nuevo presentación de microsoft office power pointMiren Pardo
 
Proyecto original el verano
Proyecto original el veranoProyecto original el verano
Proyecto original el veranoMiren Pardo
 
Nuevo presentación de microsoft office power point
Nuevo presentación de microsoft office power pointNuevo presentación de microsoft office power point
Nuevo presentación de microsoft office power pointMiren Pardo
 
Proyecto original los camiones
Proyecto original los camionesProyecto original los camiones
Proyecto original los camionesMiren Pardo
 
Proyecto los museos de Castellón
Proyecto los museos de CastellónProyecto los museos de Castellón
Proyecto los museos de CastellónMiren Pardo
 
Proyecto superhéroes
Proyecto superhéroesProyecto superhéroes
Proyecto superhéroesMiren Pardo
 
Proyecto museos original
Proyecto museos originalProyecto museos original
Proyecto museos originalMiren Pardo
 
Proyecto de los superhéroes original
Proyecto de los superhéroes originalProyecto de los superhéroes original
Proyecto de los superhéroes originalMiren Pardo
 
Proyecto mi cuerpo por fuera
Proyecto mi cuerpo por fueraProyecto mi cuerpo por fuera
Proyecto mi cuerpo por fueraMiren Pardo
 
Mi cuerpo por fuera original
Mi cuerpo por fuera originalMi cuerpo por fuera original
Mi cuerpo por fuera originalMiren Pardo
 
Proyecto van gogh
Proyecto van goghProyecto van gogh
Proyecto van goghMiren Pardo
 
Proyecto van gogh original
Proyecto van gogh originalProyecto van gogh original
Proyecto van gogh originalMiren Pardo
 

Destacado (15)

Berlitz Tip - Meetings in English
Berlitz Tip - Meetings in EnglishBerlitz Tip - Meetings in English
Berlitz Tip - Meetings in English
 
作業標準書について
作業標準書について作業標準書について
作業標準書について
 
Berlitz Tips - Intercultural Communication
Berlitz Tips - Intercultural CommunicationBerlitz Tips - Intercultural Communication
Berlitz Tips - Intercultural Communication
 
Nuevo presentación de microsoft office power point
Nuevo presentación de microsoft office power pointNuevo presentación de microsoft office power point
Nuevo presentación de microsoft office power point
 
Proyecto original el verano
Proyecto original el veranoProyecto original el verano
Proyecto original el verano
 
Nuevo presentación de microsoft office power point
Nuevo presentación de microsoft office power pointNuevo presentación de microsoft office power point
Nuevo presentación de microsoft office power point
 
Proyecto original los camiones
Proyecto original los camionesProyecto original los camiones
Proyecto original los camiones
 
Proyecto los museos de Castellón
Proyecto los museos de CastellónProyecto los museos de Castellón
Proyecto los museos de Castellón
 
Proyecto superhéroes
Proyecto superhéroesProyecto superhéroes
Proyecto superhéroes
 
Proyecto museos original
Proyecto museos originalProyecto museos original
Proyecto museos original
 
Proyecto de los superhéroes original
Proyecto de los superhéroes originalProyecto de los superhéroes original
Proyecto de los superhéroes original
 
Proyecto mi cuerpo por fuera
Proyecto mi cuerpo por fueraProyecto mi cuerpo por fuera
Proyecto mi cuerpo por fuera
 
Mi cuerpo por fuera original
Mi cuerpo por fuera originalMi cuerpo por fuera original
Mi cuerpo por fuera original
 
Proyecto van gogh
Proyecto van goghProyecto van gogh
Proyecto van gogh
 
Proyecto van gogh original
Proyecto van gogh originalProyecto van gogh original
Proyecto van gogh original
 

Similar a Filtering Twitter Noise

Tools to use tweets at an event
Tools to use tweets at an eventTools to use tweets at an event
Tools to use tweets at an eventIan Anderson Gray
 
Python report on twitter sentiment analysis
Python report on twitter sentiment analysisPython report on twitter sentiment analysis
Python report on twitter sentiment analysisAntaraBhattacharya12
 
Listening: Free Tools & Techniques for Nonprofit Brand Monitoring
Listening: Free Tools & Techniques for Nonprofit Brand MonitoringListening: Free Tools & Techniques for Nonprofit Brand Monitoring
Listening: Free Tools & Techniques for Nonprofit Brand MonitoringGregory Heller
 
2015 06-29--lita--blend it
2015 06-29--lita--blend it2015 06-29--lita--blend it
2015 06-29--lita--blend itPaul Signorelli
 
User centered practices with apps you already use
User centered practices with apps you already useUser centered practices with apps you already use
User centered practices with apps you already useNathaniel Padgett
 
Introduction to Google Summer of Code
Introduction to Google Summer of CodeIntroduction to Google Summer of Code
Introduction to Google Summer of CodeTushar Dadlani
 
Final Year PPT on Twitter App
Final Year PPT on Twitter AppFinal Year PPT on Twitter App
Final Year PPT on Twitter Appscorpionking257
 
Twitter for the Learning Professional
Twitter for the Learning ProfessionalTwitter for the Learning Professional
Twitter for the Learning ProfessionalDavid Kelly
 
OSMC 2021 | Contributing to open source with the example of icinga (1)
OSMC 2021 | Contributing to open source with the example of icinga (1)OSMC 2021 | Contributing to open source with the example of icinga (1)
OSMC 2021 | Contributing to open source with the example of icinga (1)NETWAYS
 
Twitter Presentation
Twitter PresentationTwitter Presentation
Twitter PresentationChris Hunter
 
Why Twitter’s New Product “Curator” Could Save Social Marketers Hours
Why Twitter’s New Product “Curator” Could Save Social Marketers HoursWhy Twitter’s New Product “Curator” Could Save Social Marketers Hours
Why Twitter’s New Product “Curator” Could Save Social Marketers HoursMohamed Mahdy
 
Open Source Craft at Twitter
Open Source Craft at TwitterOpen Source Craft at Twitter
Open Source Craft at TwitterChris Aniszczyk
 
6 Free Rock Star Twitter Tools
6 Free Rock Star Twitter Tools6 Free Rock Star Twitter Tools
6 Free Rock Star Twitter ToolsKristen Carney
 
Twitter Tools For The Masses ...From C&M
Twitter Tools For The Masses ...From C&MTwitter Tools For The Masses ...From C&M
Twitter Tools For The Masses ...From C&MBeyond
 
Community building lessons from Ansible
Community building lessons from AnsibleCommunity building lessons from Ansible
Community building lessons from AnsibleGreg DeKoenigsberg
 
Staying Ahead of the Curve: Essential Newsgathering and Monitoring Tools
Staying Ahead of the Curve: Essential Newsgathering and Monitoring ToolsStaying Ahead of the Curve: Essential Newsgathering and Monitoring Tools
Staying Ahead of the Curve: Essential Newsgathering and Monitoring ToolsChristine Guirguis
 
6 Updates in Collaboration @ Atlassian
6 Updates in Collaboration @ Atlassian6 Updates in Collaboration @ Atlassian
6 Updates in Collaboration @ AtlassianMatthew Cobby
 
Parallel Path's Definitive Guide to Tweet Chats
Parallel Path's Definitive Guide to Tweet ChatsParallel Path's Definitive Guide to Tweet Chats
Parallel Path's Definitive Guide to Tweet ChatsParallel Path
 

Similar a Filtering Twitter Noise (20)

Tools to use tweets at an event
Tools to use tweets at an eventTools to use tweets at an event
Tools to use tweets at an event
 
Python report on twitter sentiment analysis
Python report on twitter sentiment analysisPython report on twitter sentiment analysis
Python report on twitter sentiment analysis
 
Listening: Free Tools & Techniques for Nonprofit Brand Monitoring
Listening: Free Tools & Techniques for Nonprofit Brand MonitoringListening: Free Tools & Techniques for Nonprofit Brand Monitoring
Listening: Free Tools & Techniques for Nonprofit Brand Monitoring
 
Let's talk FOSS!
Let's talk FOSS!Let's talk FOSS!
Let's talk FOSS!
 
2015 06-29--lita--blend it
2015 06-29--lita--blend it2015 06-29--lita--blend it
2015 06-29--lita--blend it
 
User centered practices with apps you already use
User centered practices with apps you already useUser centered practices with apps you already use
User centered practices with apps you already use
 
Introduction to Google Summer of Code
Introduction to Google Summer of CodeIntroduction to Google Summer of Code
Introduction to Google Summer of Code
 
Final Year PPT on Twitter App
Final Year PPT on Twitter AppFinal Year PPT on Twitter App
Final Year PPT on Twitter App
 
Twitter for the Learning Professional
Twitter for the Learning ProfessionalTwitter for the Learning Professional
Twitter for the Learning Professional
 
OSMC 2021 | Contributing to open source with the example of icinga (1)
OSMC 2021 | Contributing to open source with the example of icinga (1)OSMC 2021 | Contributing to open source with the example of icinga (1)
OSMC 2021 | Contributing to open source with the example of icinga (1)
 
Twitter Presentation
Twitter PresentationTwitter Presentation
Twitter Presentation
 
Why Twitter’s New Product “Curator” Could Save Social Marketers Hours
Why Twitter’s New Product “Curator” Could Save Social Marketers HoursWhy Twitter’s New Product “Curator” Could Save Social Marketers Hours
Why Twitter’s New Product “Curator” Could Save Social Marketers Hours
 
Open Source Craft at Twitter
Open Source Craft at TwitterOpen Source Craft at Twitter
Open Source Craft at Twitter
 
6 Free Rock Star Twitter Tools
6 Free Rock Star Twitter Tools6 Free Rock Star Twitter Tools
6 Free Rock Star Twitter Tools
 
Advanced twitter for_nouns_
Advanced twitter for_nouns_Advanced twitter for_nouns_
Advanced twitter for_nouns_
 
Twitter Tools For The Masses ...From C&M
Twitter Tools For The Masses ...From C&MTwitter Tools For The Masses ...From C&M
Twitter Tools For The Masses ...From C&M
 
Community building lessons from Ansible
Community building lessons from AnsibleCommunity building lessons from Ansible
Community building lessons from Ansible
 
Staying Ahead of the Curve: Essential Newsgathering and Monitoring Tools
Staying Ahead of the Curve: Essential Newsgathering and Monitoring ToolsStaying Ahead of the Curve: Essential Newsgathering and Monitoring Tools
Staying Ahead of the Curve: Essential Newsgathering and Monitoring Tools
 
6 Updates in Collaboration @ Atlassian
6 Updates in Collaboration @ Atlassian6 Updates in Collaboration @ Atlassian
6 Updates in Collaboration @ Atlassian
 
Parallel Path's Definitive Guide to Tweet Chats
Parallel Path's Definitive Guide to Tweet ChatsParallel Path's Definitive Guide to Tweet Chats
Parallel Path's Definitive Guide to Tweet Chats
 

Último

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 

Último (20)

Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 

Filtering Twitter Noise