SlideShare a Scribd company logo
1 of 13
Data Analysis
Prepare for Data Analysis
There are several steps involved for data preparation. They are:
Questionnaire checking: Questionnaire checking involves
eliminating unacceptable questionnaires. These questionnaires may
be incomplete, instructions not followed, little variance, missing
pages, past cutoff date or respondent not qualified.
Editing: Editing looks to correct illegible, incomplete, inconsistent
and ambiguous answers.
Coding: Coding typically assigns symbols or numeric codes to
answers that do not already have them so that statistical techniques
can be applied.
Prepare Data for Analysis
Transcribing: Transcribing data involves transferring data so as to
make it accessible to people or applications for further processing.
Cleaning: Cleaning reviews data for consistencies. Inconsistencies
may arise from faulty logic, out of range or extreme values.
Statistical adjustments: Statistical adjustments applies to data that
requires weighting and scale transformations.
Analysis strategy selection: Finally, selection of a data analysis
strategy is based on earlier work in designing the research project but
is finalized after consideration of the characteristics of the data that
has been gathered.
https://www.cvent.com/en/blog/events/7-steps-prepare-data-analysis
Graphical presentation: Bar Chart
A bar chart or bar graph is a chart or graph that presents categorical
data with rectangular bars with heights or lengths proportional to the
values that they represent. The bars can be plotted vertically or
horizontally. A vertical bar chart is sometimes called a column chart.
Graphical presentation: Pie Chart
A pie chart (or a circle chart) is a circular statistical graphic, which is
divided into slices to illustrate numerical proportion. In a pie chart,
the arc length of each slice (and consequently its central angle and
area), is proportional to the quantity it represents.
Frequency table
Frequency refers to the number of times an event or a value occurs.
A frequency table is a table that lists items and shows the number of
times the items occur.
Cross Tabulation: How It Works
Cross tabulation is a method to quantitatively analyze the relationship
between multiple variables. It is also known as contingency tables or
cross tabs, cross tabulation groups variables to understand the
correlation between different variables. It also shows how correlations
change from one variable grouping to another. It is usually used in
statistical analysis to find patterns, trends, and probabilities within raw
data.
Cross tabulation is usually performed on categorical data — data that
can be divided into mutually exclusive groups.
Cross tabulations are used to examine relationships within data that
may not be readily apparent. Cross tabulation is especially useful for
studying market research or survey responses. Cross tabulation of
categorical data can be done with through tools such as SPSS, SAS,
and Microsoft Excel.
Cross Tabulation
Consider the below sample data set in Excel. It displays details about
commercial transactions for four product categories. Let’s use this data set
to show cross tabulation in action.
This data can be converted to pivot table format by selecting the entire table
and inserting a pivot table in the Excel file. The table can correlate different
variables row-wise, column-wise, or value-wise in either table format or
chart format.
Cross Tabulation
Then the results appear in a pivot table:
It is now clear that the highest sales were done for P1 using Master Card.
Therefore, we can conclude that the MasterCard payment method and
product P1 category is the most profitable combination.
Similarly, we can use cross tabulation and find the relation between the
product category and the payment method type with regard to the number
of transactions.
https://humansofdata.atlan.com/2016/01/cross-tabulation-how-why/
Chi square test
Chi square test is a statistical hypothesis test that is valid to perform
when the test statistic is chi-squared distributed under the null hypothesis.
A chi-square goodness of fit test determines if sample data matches a
population. For more details on this type, see: Goodness of Fit Test.
A chi-square test for independence compares two variables in a
contingency table to see if they are related. In a more general sense, it
tests to see whether distributions of categorical variables differ from each
another.
Formula:
https://www.youtube.com/watch?v=f53nXHoMXx4
Chi square test
A, B, C, and D. A random sample of 650 residents of the city is taken and
their occupation is recorded as "white collar", "blue collar", or "no collar".
The null hypothesis is that each person's neighborhood of residence is
independent of the person's occupational classification. The data are
tabulated as:
By the assumption of independence under the hypothesis we should
"expect" the number of white-collar workers in neighborhood A to be
https://www.youtube.com/watch?v=f53nXHoMXx4
Data analysis
Thank you all

More Related Content

Similar to Data analysis.pptx

Masters in quality management
Masters in quality managementMasters in quality management
Masters in quality management
selinasimpson0501
 
Statistical quality management
Statistical quality managementStatistical quality management
Statistical quality management
selinasimpson2601
 
Functions of quality management
Functions of quality managementFunctions of quality management
Functions of quality management
selinasimpson2901
 
Concept of quality management
Concept of quality managementConcept of quality management
Concept of quality management
selinasimpson1001
 
Directorate of quality management
Directorate of quality managementDirectorate of quality management
Directorate of quality management
selinasimpson2401
 
Quality management project management
Quality management project managementQuality management project management
Quality management project management
selinasimpson1401
 
Continuous quality management
Continuous quality managementContinuous quality management
Continuous quality management
selinasimpson341
 
Software quality management system
Software quality management systemSoftware quality management system
Software quality management system
selinasimpson1801
 
Productivity and quality management
Productivity and quality managementProductivity and quality management
Productivity and quality management
selinasimpson1401
 
How to become iso 9001 certified
How to become iso 9001 certifiedHow to become iso 9001 certified
How to become iso 9001 certified
porikgefus
 
Iso 9001 help
Iso 9001 helpIso 9001 help
Iso 9001 help
daritajon
 
Sharepoint quality management system
Sharepoint quality management systemSharepoint quality management system
Sharepoint quality management system
selinasimpson2101
 
Supply chain quality management
Supply chain quality managementSupply chain quality management
Supply chain quality management
selinasimpson0901
 
Quality management system procedures
Quality management system proceduresQuality management system procedures
Quality management system procedures
selinasimpson2101
 
Pg diploma in quality management
Pg diploma in quality managementPg diploma in quality management
Pg diploma in quality management
selinasimpson371
 

Similar to Data analysis.pptx (20)

Masters in quality management
Masters in quality managementMasters in quality management
Masters in quality management
 
Exam Short Preparation on Data Analytics
Exam Short Preparation on Data AnalyticsExam Short Preparation on Data Analytics
Exam Short Preparation on Data Analytics
 
Statistical quality management
Statistical quality managementStatistical quality management
Statistical quality management
 
Functions of quality management
Functions of quality managementFunctions of quality management
Functions of quality management
 
Quality management examples
Quality management examplesQuality management examples
Quality management examples
 
Concept of quality management
Concept of quality managementConcept of quality management
Concept of quality management
 
Directorate of quality management
Directorate of quality managementDirectorate of quality management
Directorate of quality management
 
Quality management masters
Quality management mastersQuality management masters
Quality management masters
 
Quality management project management
Quality management project managementQuality management project management
Quality management project management
 
Continuous quality management
Continuous quality managementContinuous quality management
Continuous quality management
 
Software quality management system
Software quality management systemSoftware quality management system
Software quality management system
 
Productivity and quality management
Productivity and quality managementProductivity and quality management
Productivity and quality management
 
How to become iso 9001 certified
How to become iso 9001 certifiedHow to become iso 9001 certified
How to become iso 9001 certified
 
Ms quality management
Ms quality managementMs quality management
Ms quality management
 
محاضرة 9
محاضرة 9محاضرة 9
محاضرة 9
 
Iso 9001 help
Iso 9001 helpIso 9001 help
Iso 9001 help
 
Sharepoint quality management system
Sharepoint quality management systemSharepoint quality management system
Sharepoint quality management system
 
Supply chain quality management
Supply chain quality managementSupply chain quality management
Supply chain quality management
 
Quality management system procedures
Quality management system proceduresQuality management system procedures
Quality management system procedures
 
Pg diploma in quality management
Pg diploma in quality managementPg diploma in quality management
Pg diploma in quality management
 

Recently uploaded

1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
AldoGarca30
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
 
Verification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxVerification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptx
chumtiyababu
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
mphochane1998
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
Kamal Acharya
 

Recently uploaded (20)

1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
1_Introduction + EAM Vocabulary + how to navigate in EAM.pdf
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Engineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesEngineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planes
 
Online electricity billing project report..pdf
Online electricity billing project report..pdfOnline electricity billing project report..pdf
Online electricity billing project report..pdf
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
Online food ordering system project report.pdf
Online food ordering system project report.pdfOnline food ordering system project report.pdf
Online food ordering system project report.pdf
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLEGEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
GEAR TRAIN- BASIC CONCEPTS AND WORKING PRINCIPLE
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
Navigating Complexity: The Role of Trusted Partners and VIAS3D in Dassault Sy...
 
Verification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptxVerification of thevenin's theorem for BEEE Lab (1).pptx
Verification of thevenin's theorem for BEEE Lab (1).pptx
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments""Lesotho Leaps Forward: A Chronicle of Transformative Developments"
"Lesotho Leaps Forward: A Chronicle of Transformative Developments"
 
Hospital management system project report.pdf
Hospital management system project report.pdfHospital management system project report.pdf
Hospital management system project report.pdf
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
 

Data analysis.pptx

  • 2. Prepare for Data Analysis There are several steps involved for data preparation. They are: Questionnaire checking: Questionnaire checking involves eliminating unacceptable questionnaires. These questionnaires may be incomplete, instructions not followed, little variance, missing pages, past cutoff date or respondent not qualified. Editing: Editing looks to correct illegible, incomplete, inconsistent and ambiguous answers. Coding: Coding typically assigns symbols or numeric codes to answers that do not already have them so that statistical techniques can be applied.
  • 3. Prepare Data for Analysis Transcribing: Transcribing data involves transferring data so as to make it accessible to people or applications for further processing. Cleaning: Cleaning reviews data for consistencies. Inconsistencies may arise from faulty logic, out of range or extreme values. Statistical adjustments: Statistical adjustments applies to data that requires weighting and scale transformations. Analysis strategy selection: Finally, selection of a data analysis strategy is based on earlier work in designing the research project but is finalized after consideration of the characteristics of the data that has been gathered. https://www.cvent.com/en/blog/events/7-steps-prepare-data-analysis
  • 4. Graphical presentation: Bar Chart A bar chart or bar graph is a chart or graph that presents categorical data with rectangular bars with heights or lengths proportional to the values that they represent. The bars can be plotted vertically or horizontally. A vertical bar chart is sometimes called a column chart.
  • 5. Graphical presentation: Pie Chart A pie chart (or a circle chart) is a circular statistical graphic, which is divided into slices to illustrate numerical proportion. In a pie chart, the arc length of each slice (and consequently its central angle and area), is proportional to the quantity it represents.
  • 6. Frequency table Frequency refers to the number of times an event or a value occurs. A frequency table is a table that lists items and shows the number of times the items occur.
  • 7. Cross Tabulation: How It Works Cross tabulation is a method to quantitatively analyze the relationship between multiple variables. It is also known as contingency tables or cross tabs, cross tabulation groups variables to understand the correlation between different variables. It also shows how correlations change from one variable grouping to another. It is usually used in statistical analysis to find patterns, trends, and probabilities within raw data. Cross tabulation is usually performed on categorical data — data that can be divided into mutually exclusive groups. Cross tabulations are used to examine relationships within data that may not be readily apparent. Cross tabulation is especially useful for studying market research or survey responses. Cross tabulation of categorical data can be done with through tools such as SPSS, SAS, and Microsoft Excel.
  • 8. Cross Tabulation Consider the below sample data set in Excel. It displays details about commercial transactions for four product categories. Let’s use this data set to show cross tabulation in action. This data can be converted to pivot table format by selecting the entire table and inserting a pivot table in the Excel file. The table can correlate different variables row-wise, column-wise, or value-wise in either table format or chart format.
  • 9. Cross Tabulation Then the results appear in a pivot table: It is now clear that the highest sales were done for P1 using Master Card. Therefore, we can conclude that the MasterCard payment method and product P1 category is the most profitable combination. Similarly, we can use cross tabulation and find the relation between the product category and the payment method type with regard to the number of transactions. https://humansofdata.atlan.com/2016/01/cross-tabulation-how-why/
  • 10. Chi square test Chi square test is a statistical hypothesis test that is valid to perform when the test statistic is chi-squared distributed under the null hypothesis. A chi-square goodness of fit test determines if sample data matches a population. For more details on this type, see: Goodness of Fit Test. A chi-square test for independence compares two variables in a contingency table to see if they are related. In a more general sense, it tests to see whether distributions of categorical variables differ from each another. Formula: https://www.youtube.com/watch?v=f53nXHoMXx4
  • 11. Chi square test A, B, C, and D. A random sample of 650 residents of the city is taken and their occupation is recorded as "white collar", "blue collar", or "no collar". The null hypothesis is that each person's neighborhood of residence is independent of the person's occupational classification. The data are tabulated as: By the assumption of independence under the hypothesis we should "expect" the number of white-collar workers in neighborhood A to be https://www.youtube.com/watch?v=f53nXHoMXx4