© 2014 IBM Corporation
The risks of using spreadsheets for statistical analysis
What you’ll learn
 Applications of spreadsheets
 Limitations of spreadsheets
 How to overcome these issues with IBM SPSS Statistics
 Next steps
Why use spreadsheets?
 Familiarity and ease of use
 Simple to get an answer out of a spreadsheet
 Easy to share with others
 Low cost
Business uses for spreadsheets
 Financial and cost accounting
– Balance sheets
– Profit and loss accounts
– Cash books
 Data collection and analysis
– Analyze survey results
– Present results to others in tabular or visual format
 Mathematics
– Calculate trigonometric and logarithmic functions
– Standard deviations, averages, means, and more
Spreadsheets are ubiquitous in most organizations, especially for certain tasks.
Spreadsheets are prone to errors
 P. Brown, J. Gould. An experimental study of people creating spreadsheets, ACM Transactions on Office Information Systems 5
– 17 errors in one spreadsheet
– 94% of deployed spreadsheets contained errors
– Average: 5.2% of cells were in error
 Philip Howard, Managing Spreadsheet Fraud
– According to PricewaterhouseCoopers and KPMG, 90% of corporate spreadsheets have material errors
– Cost: $10-$100K per error per month
 Computational Statistics and Data Analysis
– Ran standardized tests for accuracy in Excel and concluded it is inadequate for substantive statistical analysis
– The flaw was clearly in Excel’s algorithms
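The slide does not say which algorithms were at fault. As a general illustration of how the choice of algorithm can affect statistical accuracy, the sketch below compares two ways of computing a sample variance in Python: a one-pass "sum of squares" formula, which is known to lose precision when values are large relative to their spread, and Welford's running update, which is numerically stable. The function names and the data are assumptions made up for this example; they are not taken from the cited study or from any product.

```python
def naive_variance(values):
    """One-pass textbook formula: (sum(x^2) - (sum(x))^2 / n) / (n - 1)."""
    n = 0
    total = 0.0
    total_sq = 0.0
    for x in values:
        n += 1
        total += x
        total_sq += x * x          # squares are huge, so low-order digits are lost
    return (total_sq - total * total / n) / (n - 1)


def welford_variance(values):
    """Welford's algorithm: update the mean and squared deviations incrementally."""
    n = 0
    mean = 0.0
    m2 = 0.0                       # running sum of squared deviations from the mean
    for x in values:
        n += 1
        delta = x - mean
        mean += delta / n
        m2 += delta * (x - mean)
    return m2 / (n - 1)


# Illustrative data: a small spread sitting on a large offset.
# The deviations from the mean are -6, -3, 3, 6, so the true sample variance is 30.
data = [1e9 + 4, 1e9 + 7, 1e9 + 13, 1e9 + 16]

print("naive  :", naive_variance(data))
print("welford:", welford_variance(data))
```

On data like this the one-pass formula subtracts two nearly equal, very large numbers, so its result can land far from 30, while the Welford update stays on target. Both functions are mathematically equivalent; only the order of floating-point operations differs.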
Spreadsheets can be frustrating
“… On average, people spend about 12 hours per month consolidating, modifying and correcting spreadsheets. That’s about a day and a half per month – or about 5 to 10 percent of their time.”
-- Ventana Research, Spreadsheets in Today’s Enterprise: Making Intelligent Use of a Core Technology, 2013
Causes of spreadsheet errors
 Mistakes in logic
 Incorrectly copied formulas
 Accidentally overwritten formulas
 Misuse of built-in functions
 Omitted factors
 Data input errors
 Sorting numbers
 Cell orientation
 Adding new data
There are many reasons your spreadsheet can fail you.
Limitations of spreadsheets for analysis
 Limited formulas
 Lack of metadata
 No data validation capabilities
 Absence of data preparation and data manipulation
 Few analytical techniques
 No automated reporting
 Inability to project the future
But what is the alternative…?
IBM® SPSS® Statistics for analysis and reporting
Advantages of IBM SPSS Statistics
 Prevent errors
 Follow the analytical process
 Access data from multiple data sources
 Process raw data
 Apply advanced analytical techniques
 Export results to multiple formats
 Automate time-consuming manual processes
Supports the entire analytical lifecycle
 Planning
 Data Collection
 Data Access
 Data Preparation & Cleansing
 Data Manipulation
 Data Analysis & Reporting
 Deployment & Automation
Improves data access
 Survey data
 Email data
 Corporate databases
 Web site data
 IBM Cognos data
 And more…
Streamlines data preparation
 Assign detailed labels to your variable names.
 Assign codes to categories (1 = Male, 2 = Female).
 Specify which values are missing.
 Select specific individuals for analysis.
 Create rules to validate data and catch data entry errors.
 Eliminate the need to label all data.
 Make sure your data is clear and properly organized for analysis.
 Find duplicate cases automatically.
IBM SPSS Statistics automates the time-consuming task of getting your data ready for analysis.
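To make the data preparation ideas on this slide concrete (category codes, declared missing values, validation rules, duplicate detection), here is a minimal sketch in plain Python. It is not SPSS syntax and does not represent how IBM SPSS Statistics implements these features. The record layout, the missing-value code -9, the age range, and every function name are hypothetical choices made for illustration.

```python
from collections import Counter

# Hypothetical metadata: category codes as on the slide, plus an assumed
# code (-9) that marks a missing answer.
VALUE_LABELS = {1: "Male", 2: "Female"}
MISSING = {-9}

# Hypothetical survey records.
records = [
    {"id": 101, "gender": 1, "age": 34},
    {"id": 102, "gender": 2, "age": -9},   # age declared missing, not an error
    {"id": 103, "gender": 3, "age": 51},   # 3 is not a defined gender code
    {"id": 101, "gender": 1, "age": 34},   # repeats ID 101
]


def validate(record):
    """Return a list of rule violations for one record (empty if clean)."""
    problems = []
    gender = record["gender"]
    if gender not in VALUE_LABELS and gender not in MISSING:
        problems.append("gender: undefined code")
    age = record["age"]
    if age not in MISSING and not 0 <= age <= 120:
        problems.append("age: out of range")
    return problems


def duplicate_ids(rows):
    """Return the IDs that appear more than once, in ascending order."""
    counts = Counter(row["id"] for row in rows)
    return sorted(i for i, c in counts.items() if c > 1)


# Map row index -> violations, keeping only rows that break a rule.
violations = {}
for index, record in enumerate(records):
    problems = validate(record)
    if problems:
        violations[index] = problems

dupes = duplicate_ids(records)

print(violations)
print(dupes)
```

Keeping the labels and missing-value codes in one place, separate from the data, is the point of the sketch: the rules can be rerun unchanged whenever new records arrive.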
Simplifies data manipulation
 Actions are driven by drag-and-drop dialog boxes.
 It’s easy to create new variables.
 Wizards simplify calculations with dates and times.
 Organize data into categories, such as age groups, for more insightful analysis.
IBM SPSS Statistics looks like a spreadsheet, but behaves very differently.
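Organizing a continuous value such as age into categories amounts to creating a new variable from cut points. The following Python sketch shows one way this could be done; the cut points, band labels, and function name are assumptions chosen for illustration, not defaults from IBM SPSS Statistics.

```python
from bisect import bisect_right

# Assumed cut points (lower bound of each band after the first) and labels.
CUTS = [18, 35, 50, 65]
LABELS = ["Under 18", "18-34", "35-49", "50-64", "65+"]


def age_group(age):
    """Return the label of the band that contains the given age."""
    # bisect_right counts how many cut points are <= age, which is the band index.
    return LABELS[bisect_right(CUTS, age)]


ages = [17, 18, 34, 35, 64, 65, 80]
groups = [age_group(a) for a in ages]

for age, group in zip(ages, groups):
    print(age, group)
```

Defining the bands once and deriving the category from them avoids the hand-typed group labels that tend to drift out of sync when new rows are added to a spreadsheet.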
Produces more accurate data analysis
 Go beyond summary statistics and row-and-column math.
 Design your own reports and graphs using drag-and-drop functions.
 Edit output after you create it.
 Uncover hidden patterns in your data that can provide new insights.
With IBM SPSS Statistics, data analysis is simple and intuitive.
Provides flexible deployment options
 Deploy SPSS Statistics output to smart devices.
 Export to Excel, Word, PowerPoint and PDF.
 Save your code for easy reuse.
 Create recurring reports faster and more efficiently.
Key takeaways
 Spreadsheets are widely used for data analysis.
 But more than 90% of spreadsheets contain at least one error.
 IBM SPSS Statistics can help you overcome common errors, increasing the accuracy of your analytical results.
Next steps:
Read the white paper: The Risks of Using Spreadsheets
Visit our website: www.ibm.com/spss
Talk to a representative: 800-543-2185

Don't be limited by error-prone spreadsheets
