SlideShare una empresa de Scribd logo
1 de 20
DOS AND DON’TS OF DATAVIZ
ATALEOFPIES,DECEPTIONANDMINDTRICKS
IÑAKIPUIGDOLLERS SABIN
Data Scientist
DON’T RESCALE PROPORTIONS!
x1.75 times
bigger
Source: http://cadenaser.com/
15.1 + 70.7 + 15.2
= 101%
DO KEEP THE PROPORTIONSAS THEYARE
YET ANOTHER EXAMPLE …
?
Source: Twitter, @ppmadrid
NOW IN A PROPORTIONAL SCALE
PSOE
PARTIDO
POPULAR
NúmerodeParados
DON’T OMIT THE ORIGIN OF THE Y-AXIS
Where is
the
Axis??
94 is not 0
Source: http://blog.rtve.es/
http://mediamatters.org/
DO SHOW THE Y-AXIS FROM THE ORIGIN
MillionDollars
50.66% 49.07%
THIS ALSO HAPPENS IN SCIENTIFIC PAPERS
This is a big
difference, isn’t it?
According to the
paper,
this should be
1.82
The value of Y
(Rape Myth
Acceptance)
varies between
1 and 5
There are values
placed in the
wrong position
Source: Fox, Jesse; Bailenson, Jeremy N.; Tricase, Liz (2013). "The embodiment of
sexualized virtual selves: The Proteus effect and experiences of self-objectification via
avatars". Computers in Human Behavior 29 (3): 930–938
THE REALITY IS SOMETHING DIFFERENT
Face
It was not that
different in the
end…
Remember:
The value of Y
(Rape Myth
Acceptance)
varies between
1 and 5
DON’T USE INVENTED OR TAILOR-MADE SCALES
How can this be a line?
Source: http://mediamatters.org/
DO PLOT DATAAS IT IS
DON’T USE DIFFERENT SCALES FOR THE SAME
AXIS
Left Y-Axis
(representing the
non-smokers)
starts at 2
Right Y-Axis
(representing
the smokers)
starts at 3
Source: H. Wainer, Visual Revelations, Graphical Tales of Fate and
Deceptions from Napoleon Bonaparte to Ross Perot
Disclaimer! This Graph is
from a tobacco company
DO USE THE SAME SCALE TO MAKE DATA
COMPARABLE
DON’T SHOW MEANINGLESS NUMBERS
DON’T USE PIE CHARTS
193% ???
That’s a big pie!
Source: http://mediamatters.org/
DON’T USE 3D
Perspective makes
percentages look different
Source: http://imgarcade.com/1/misleading-circle-graphs/
SOME THINGS WE LEARNED AT SCHIBSTED
■Know your audience and adapt the visualization to them
■The title matters, it has to be attractive but not distracting
■Select the most suitable plot, there is no one-plot-fit-all
■Show only relevant information, crowded visualizations are
misleading
■Sometimes you can break the rules… 
DO CHOOSE A VISUALIZATION FITTING YOUR
AUDIENCE
Percentage of Sellers per segment
Slack channels sharing users
DON’T USE CROWDED PLOTS WITH MISLEADING
INFORMATION
■Too many elements
■The colours are
meaningless
■The axes are misleading
(not showing the origin)
DO SHOW ONLY WHAT IS IMPORTANT
■Axes starting at 0
■Only the necessary
elements
GOAL
Show the correlation of the
data points
… A DIFFERENTAPPROACH
■We don’t care about the
value  it’s OK to break the
axis rule!!
■The colours have a meaning
GOAL
Show the distribution and
density of the data points
WE ARE LOOKING FOR TALENT!
inaki.puigdollers@schibsted.com
Thanks, questions?
Data Scientist – Schibsted Product & Technology

Más contenido relacionado

Destacado (6)

201602 Technology Trends 2016 -spanish
201602 Technology Trends 2016  -spanish201602 Technology Trends 2016  -spanish
201602 Technology Trends 2016 -spanish
 
#2 DataBeersBCN - "Why counting people at public transport" by Caterina Font
 #2 DataBeersBCN - "Why counting people at public transport" by Caterina Font #2 DataBeersBCN - "Why counting people at public transport" by Caterina Font
#2 DataBeersBCN - "Why counting people at public transport" by Caterina Font
 
мочевая система
мочевая системамочевая система
мочевая система
 
Вовлеченность персонала
Вовлеченность персоналаВовлеченность персонала
Вовлеченность персонала
 
قابلية الاستعمال
قابلية الاستعمالقابلية الاستعمال
قابلية الاستعمال
 
How to win from Programming
How to win from ProgrammingHow to win from Programming
How to win from Programming
 

Similar a #5 DataBeersBCN -"Dos and Don'ts of Data Viz"

Halley Gray - Use Humor to Increase Your Conversion Rate by 28%
Halley Gray - Use Humor to Increase Your Conversion Rate by 28%Halley Gray - Use Humor to Increase Your Conversion Rate by 28%
Halley Gray - Use Humor to Increase Your Conversion Rate by 28%
Julia Grosman
 

Similar a #5 DataBeersBCN -"Dos and Don'ts of Data Viz" (20)

Вебинар «Интерактивная визуализация данных при помощи Infogram»
Вебинар «Интерактивная визуализация данных при помощи Infogram»Вебинар «Интерактивная визуализация данных при помощи Infogram»
Вебинар «Интерактивная визуализация данных при помощи Infogram»
 
Semiotic strategies: The things you are looking at have names
Semiotic strategies: The things you are looking at have namesSemiotic strategies: The things you are looking at have names
Semiotic strategies: The things you are looking at have names
 
TCS: Success Strategies Of The Fastest Growing Internet Retailers
TCS: Success Strategies Of The Fastest Growing Internet RetailersTCS: Success Strategies Of The Fastest Growing Internet Retailers
TCS: Success Strategies Of The Fastest Growing Internet Retailers
 
Santahelena Truthtelling - Por marcas mais humanas, autênticas e verdadeiras
Santahelena Truthtelling - Por marcas mais humanas, autênticas e verdadeirasSantahelena Truthtelling - Por marcas mais humanas, autênticas e verdadeiras
Santahelena Truthtelling - Por marcas mais humanas, autênticas e verdadeiras
 
Mind The Gap - ConnectNow
Mind The Gap - ConnectNowMind The Gap - ConnectNow
Mind The Gap - ConnectNow
 
How to Visualize Data Like a Pro
How to Visualize Data Like a ProHow to Visualize Data Like a Pro
How to Visualize Data Like a Pro
 
[DEVit 360] Opti-pessimism: Design for the best case, build for the worst
[DEVit 360] Opti-pessimism: Design for the best case, build for the worst[DEVit 360] Opti-pessimism: Design for the best case, build for the worst
[DEVit 360] Opti-pessimism: Design for the best case, build for the worst
 
Data storytelling
Data storytellingData storytelling
Data storytelling
 
5 Non-Obvious Trends For 2018 | Exclusive Book Preview
5 Non-Obvious Trends For 2018 | Exclusive Book Preview5 Non-Obvious Trends For 2018 | Exclusive Book Preview
5 Non-Obvious Trends For 2018 | Exclusive Book Preview
 
HIERARCHY_Global shop recap 15
HIERARCHY_Global shop recap 15HIERARCHY_Global shop recap 15
HIERARCHY_Global shop recap 15
 
Santahelena Truthtelling Hacktown 2019
Santahelena Truthtelling Hacktown 2019Santahelena Truthtelling Hacktown 2019
Santahelena Truthtelling Hacktown 2019
 
Data Design: Where Math and Art Collide
Data Design: Where Math and Art CollideData Design: Where Math and Art Collide
Data Design: Where Math and Art Collide
 
Data Driven Marketing
Data Driven MarketingData Driven Marketing
Data Driven Marketing
 
Halley Gray - Use Humor to Increase Your Conversion Rate by 28%
Halley Gray - Use Humor to Increase Your Conversion Rate by 28%Halley Gray - Use Humor to Increase Your Conversion Rate by 28%
Halley Gray - Use Humor to Increase Your Conversion Rate by 28%
 
3.0 nobody knows... intro planning strategique
3.0 nobody knows... intro planning strategique3.0 nobody knows... intro planning strategique
3.0 nobody knows... intro planning strategique
 
Palestra sobre o livro TRUTHTELLING
Palestra sobre o livro TRUTHTELLINGPalestra sobre o livro TRUTHTELLING
Palestra sobre o livro TRUTHTELLING
 
World Communications Forum Davos 2013
World Communications Forum Davos 2013World Communications Forum Davos 2013
World Communications Forum Davos 2013
 
Cobalt LLP Social Media Presentation 2012
Cobalt LLP Social Media Presentation 2012Cobalt LLP Social Media Presentation 2012
Cobalt LLP Social Media Presentation 2012
 
Social Media Optimization for Business 2013
Social Media Optimization for Business 2013Social Media Optimization for Business 2013
Social Media Optimization for Business 2013
 
TCS: Trend Based Marketing - Tomorrow's Campaigns Today
TCS: Trend Based Marketing - Tomorrow's Campaigns TodayTCS: Trend Based Marketing - Tomorrow's Campaigns Today
TCS: Trend Based Marketing - Tomorrow's Campaigns Today
 

Más de DataBeersBCN

Más de DataBeersBCN (20)

#6 DataBeersBCN -"Whales"
#6 DataBeersBCN -"Whales"#6 DataBeersBCN -"Whales"
#6 DataBeersBCN -"Whales"
 
#6 DataBeersBCN -"Data, Beer and Enterprise Architecture"
#6 DataBeersBCN -"Data, Beer and Enterprise Architecture"#6 DataBeersBCN -"Data, Beer and Enterprise Architecture"
#6 DataBeersBCN -"Data, Beer and Enterprise Architecture"
 
#6 DataBeersBCN -"GoodCityLife.org"
#6 DataBeersBCN -"GoodCityLife.org"#6 DataBeersBCN -"GoodCityLife.org"
#6 DataBeersBCN -"GoodCityLife.org"
 
#6 DataBeersBCN -"The (Big) Data behind the brain"
#6 DataBeersBCN -"The (Big) Data behind the brain"#6 DataBeersBCN -"The (Big) Data behind the brain"
#6 DataBeersBCN -"The (Big) Data behind the brain"
 
#5 DataBeersBCN -"How to do Data Journalism… and not die trying"
#5 DataBeersBCN -"How to do Data Journalism… and not die trying"#5 DataBeersBCN -"How to do Data Journalism… and not die trying"
#5 DataBeersBCN -"How to do Data Journalism… and not die trying"
 
#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"
#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"
#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"
 
#5 DataBeersBCN -"Location Based Business Oportunity Detector"
#5 DataBeersBCN -"Location Based Business Oportunity Detector"#5 DataBeersBCN -"Location Based Business Oportunity Detector"
#5 DataBeersBCN -"Location Based Business Oportunity Detector"
 
#4 DataBeersBCN - "Visualizing Geolocated Tweets" by Joana Simoes
#4 DataBeersBCN - "Visualizing Geolocated Tweets" by Joana Simoes#4 DataBeersBCN - "Visualizing Geolocated Tweets" by Joana Simoes
#4 DataBeersBCN - "Visualizing Geolocated Tweets" by Joana Simoes
 
#4 DataBeersBCN - "We know what you did last sonar" by Fernando Cucchietti
#4 DataBeersBCN - "We know what you did last sonar" by Fernando Cucchietti#4 DataBeersBCN - "We know what you did last sonar" by Fernando Cucchietti
#4 DataBeersBCN - "We know what you did last sonar" by Fernando Cucchietti
 
#3 DataBeersBCN - "The impact of data in reality" by Karina Gibert
#3 DataBeersBCN - "The impact of data in reality" by Karina Gibert#3 DataBeersBCN - "The impact of data in reality" by Karina Gibert
#3 DataBeersBCN - "The impact of data in reality" by Karina Gibert
 
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
#3 DataBeersBCN - "How to get into the news with Social networks analysis" by...
 
#3 DataBeersBCN - "When we start caring about data" by Dani Pearson & Pau Gar...
#3 DataBeersBCN - "When we start caring about data" by Dani Pearson & Pau Gar...#3 DataBeersBCN - "When we start caring about data" by Dani Pearson & Pau Gar...
#3 DataBeersBCN - "When we start caring about data" by Dani Pearson & Pau Gar...
 
#3 DataBeersBCN - "Big Fun Data" by Xavier Guardiola
#3 DataBeersBCN - "Big Fun Data" by Xavier Guardiola#3 DataBeersBCN - "Big Fun Data" by Xavier Guardiola
#3 DataBeersBCN - "Big Fun Data" by Xavier Guardiola
 
#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo Aragon
#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo Aragon#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo Aragon
#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo Aragon
 
#2 DataBeersBCN - "Using data to make great and succesful mobile games" by J...
 #2 DataBeersBCN - "Using data to make great and succesful mobile games" by J... #2 DataBeersBCN - "Using data to make great and succesful mobile games" by J...
#2 DataBeersBCN - "Using data to make great and succesful mobile games" by J...
 
#2 DataBeersBCN - "Govern Obert - Opengov.cat" by Concha Catalan
#2 DataBeersBCN - "Govern Obert  - Opengov.cat" by Concha Catalan#2 DataBeersBCN - "Govern Obert  - Opengov.cat" by Concha Catalan
#2 DataBeersBCN - "Govern Obert - Opengov.cat" by Concha Catalan
 
#1 DataBeersBCN - Xavier
#1 DataBeersBCN - Xavier#1 DataBeersBCN - Xavier
#1 DataBeersBCN - Xavier
 
#1 DataBeersBCN - David Solans
#1 DataBeersBCN - David Solans#1 DataBeersBCN - David Solans
#1 DataBeersBCN - David Solans
 
#1 DataBeersBCN - Dani Villatoro from BBVA DATA ANALYTICS
#1 DataBeersBCN - Dani Villatoro  from BBVA DATA ANALYTICS#1 DataBeersBCN - Dani Villatoro  from BBVA DATA ANALYTICS
#1 DataBeersBCN - Dani Villatoro from BBVA DATA ANALYTICS
 
#1 DataBeersBCN - Oscar Marin from Outliers.Collective
#1 DataBeersBCN - Oscar Marin from Outliers.Collective#1 DataBeersBCN - Oscar Marin from Outliers.Collective
#1 DataBeersBCN - Oscar Marin from Outliers.Collective
 

Último

Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
amitlee9823
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
AroojKhan71
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 

Último (20)

BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 

#5 DataBeersBCN -"Dos and Don'ts of Data Viz"

  • 1. DOS AND DON’TS OF DATAVIZ ATALEOFPIES,DECEPTIONANDMINDTRICKS IÑAKIPUIGDOLLERS SABIN Data Scientist
  • 2. DON’T RESCALE PROPORTIONS! x1.75 times bigger Source: http://cadenaser.com/ 15.1 + 70.7 + 15.2 = 101%
  • 3. DO KEEP THE PROPORTIONSAS THEYARE
  • 4. YET ANOTHER EXAMPLE … ? Source: Twitter, @ppmadrid
  • 5. NOW IN A PROPORTIONAL SCALE PSOE PARTIDO POPULAR NúmerodeParados
  • 6. DON’T OMIT THE ORIGIN OF THE Y-AXIS Where is the Axis?? 94 is not 0 Source: http://blog.rtve.es/ http://mediamatters.org/
  • 7. DO SHOW THE Y-AXIS FROM THE ORIGIN MillionDollars 50.66% 49.07%
  • 8. THIS ALSO HAPPENS IN SCIENTIFIC PAPERS This is a big difference, isn’t it? According to the paper, this should be 1.82 The value of Y (Rape Myth Acceptance) varies between 1 and 5 There are values placed in the wrong position Source: Fox, Jesse; Bailenson, Jeremy N.; Tricase, Liz (2013). "The embodiment of sexualized virtual selves: The Proteus effect and experiences of self-objectification via avatars". Computers in Human Behavior 29 (3): 930–938
  • 9. THE REALITY IS SOMETHING DIFFERENT Face It was not that different in the end… Remember: The value of Y (Rape Myth Acceptance) varies between 1 and 5
  • 10. DON’T USE INVENTED OR TAILOR-MADE SCALES How can this be a line? Source: http://mediamatters.org/
  • 11. DO PLOT DATAAS IT IS
  • 12. DON’T USE DIFFERENT SCALES FOR THE SAME AXIS Left Y-Axis (representing the non-smokers) starts at 2 Right Y-Axis (representing the smokers) starts at 3 Source: H. Wainer, Visual Revelations, Graphical Tales of Fate and Deceptions from Napoleon Bonaparte to Ross Perot Disclaimer! This Graph is from a tobacco company
  • 13. DO USE THE SAME SCALE TO MAKE DATA COMPARABLE
  • 14. DON’T SHOW MEANINGLESS NUMBERS DON’T USE PIE CHARTS 193% ??? That’s a big pie! Source: http://mediamatters.org/ DON’T USE 3D Perspective makes percentages look different Source: http://imgarcade.com/1/misleading-circle-graphs/
  • 15. SOME THINGS WE LEARNED AT SCHIBSTED ■Know your audience and adapt the visualization to them ■The title matters, it has to be attractive but not distracting ■Select the most suitable plot, there is no one-plot-fit-all ■Show only relevant information, crowded visualizations are misleading ■Sometimes you can break the rules… 
  • 16. DO CHOOSE A VISUALIZATION FITTING YOUR AUDIENCE Percentage of Sellers per segment Slack channels sharing users
  • 17. DON’T USE CROWDED PLOTS WITH MISLEADING INFORMATION ■Too many elements ■The colours are meaningless ■The axes are misleading (not showing the origin)
  • 18. DO SHOW ONLY WHAT IS IMPORTANT ■Axes starting at 0 ■Only the necessary elements GOAL Show the correlation of the data points
  • 19. … A DIFFERENTAPPROACH ■We don’t care about the value  it’s OK to break the axis rule!! ■The colours have a meaning GOAL Show the distribution and density of the data points
  • 20. WE ARE LOOKING FOR TALENT! inaki.puigdollers@schibsted.com Thanks, questions? Data Scientist – Schibsted Product & Technology

Notas del editor

  1. -A picture tells a thousand words -Goal: share examples of visualizations showing distorted information and how can this be addressed
  2. -Common practise to fool people’s mind is rescaling porportions -Even though you show the numbers, if the plot is not proportional  contradictory information -A picture tells a thousand words
  3. -Here you see how different the plot looks when the proportions are as they should -However this particular example can be just an error, just not intentional. But what about this one?
  4. -Spatial perception is a very important component of image processing in human’s brain -This is why mass media abuses this kind of blatant distortions to communicate somehow biased message
  5. -Again, if we do the exercise of re-plotting the data in a fairer way we see that reality is something different to what they try to show -So the blue line is flatter than the one they presented originally, take your own conclusions…
  6. -Another technique to show distorted data is omitting the Y-axis. -Messes up with spatial perception again -Comparing is very difficult
  7. -But if we re-plot it truth comes to surface again… -And that incredibly huge difference betwwen both candidates is gone -And the federal wellfare received in US hasn’t grown as much neither...
  8. -No surprise media uses this -We all knew that TV and newspapers provided biased information -Is more strange is to see this in science
  9. -Some scientific studies use distortion techniques as well to “enhance” their message -But if we see how it really looks like this is what we have: the difference between conditions is not that big -Is it science a matter of believe in the end?
  10. -Another great example from Fox news: created a linear growth of the job loss by QUARTER out of the blue
  11. -This is how it really looks, not only the values are not linear, but the periods are not quarters but random months across 3 different years!
  12. -Another good deceiving technique is to use double axis in the same plot -It can be good: enhanced readability, but if the axis are not the same you can create effects like the one from this tobacco company showing that smoking is not affecting with death rate, only the age matters
  13. -However if we re-plot it correctly we see a complete different story -No Surprise it comes from a tobacco company, right?
  14. -And then we have the pie charts. -Should I use them? I ‘ll try to avoid them -If you insist -remember simple rule: pie charts show parts of a whole so make them sum up to 100% and no more - avoid perspective games
  15. -Things we learned at Schisted, I’m going to talk about a couple of them -One of the most delicate points: choosing which visualization to use -Know audience beforehand
  16. -Not everybody understands reality the same way, while a DS may feel comfortable with a network plot, BP tend to prefer bar plots or waterfall plots -In addition, There is no one-plot-fit-all solution
  17. -Once you have decided which way to go you have to be careful with the number elements you add to the plot. By elements I mean : colours, size of the points, width of the bars, regression lines,… amogn others -Crowded plots are, more often than not, misleading and distracting audience's attention from what is important.
  18. -My suggestion: do not add irrelevant elements, every single element you have in the plot has to be meaningful by itself. -Here, for instance we have a clear goal, so we sticked to it and showed only elements that helped us to explain that message
  19. -If your goal is different, so is your plot -All in all, I would say that the golden rule in data visualization is two folded to communicate a message (this is your goal) based on some observed data (which you have to respect)
  20. -If your goal is different, so is your plot -All in all, I would say that the golden rule in data visualization is two folded to communicate a message (this is your goal) based on some observed data (which you have to respect)