Introduction To Statistics.ppt

M
Manish AgarwalAssistant Professor en ITS, Ghaziabad
Course Introduction
 Gestor:
prof. Ing. Zlata Sojková, CSc.
zlata.sojkova@uniag.sk
 Lecturer
Ing. Jozef Palkovič, PhD.
jozef.palkovic@uniag.sk
Conditions for successful course
completion
 Method of evaluation and completion:
- credit
- exam (with a sufficient number of points, there's an
opportunity to get the grade "free of charge")
 Continuous assessment:
- two tests during the semester (mid-term test and end-
term test)
 You should get at least 50% of total points to get credit
Final evaluation:
- exam (written and verbal part)
- presenting a project
 Precondition for successful course completion: Basic
knowledge of MS Excel
Recommended literature:
 Hanke E. J, Reitsch A. G: Understanding Business
Statistics
 Anderson, D.R. - Sweeney, D.J. - Williams, T.A.: Statistics
for Business and Economics. South-Western Pub., 2005,
320 p., ISBN 978-032-422-486-3
 Jaisingh, L.R.: Statistics for the Utterly Confused. McGraw
Hill, 2005, 352 p., ISBN 978-007-146-193-1
 Everitt, B. S.: The Cambridge (explanatory) dictionary of
statistics. Cambridge University Press, 2006, 442 p., ISBN
978-052-169-027-0
 Illowsky, B. - Dean, S. (2009, August 5). Collaborative
Statistics. Retrieved from the Connexions Web site:
http://cnx.org/content/col10522/1.36
Moodle course by Martina Majorova at:
http://moodle.uniag.sk/fem/course/view.php?id=211
„There are three kinds of lies: lies,
damned lies, and statistics.“ (B.Disraeli)
Introduction To Statistics
Why study statistics?
1. Data are everywhere
2. Statistical techniques are used to make many
decisions that affect our lives
3. No matter what your career, you will make
professional decisions that involve data. An
understanding of statistical methods will help
you make these decisions efectively
Applications of statistical concepts in
the business world
 Finance – correlation and regression, index
numbers, time series analysis
 Marketing – hypothesis testing, chi-square tests,
nonparametric statistics
 Personel – hypothesis testing, chi-square tests,
nonparametric tests
 Operating management – hypothesis testing,
estimation, analysis of variance, time series
analysis
Statistics
 The science of collectiong, organizing,
presenting, analyzing, and interpreting data to
assist in making more effective decisions
 Statistical analysis – used to manipulate
summarize, and investigate data, so that useful
decision-making information results.
Introduction To Statistics.ppt
Types of statistics
 Descriptive statistics – Methods of organizing,
summarizing, and presenting data in an
informative way
 Inferential statistics – The methods used to
determine something about a population on the
basis of a sample
 Population –The entire set of individuals or objects
of interest or the measurements obtained from all
individuals or objects of interest
 Sample – A portion, or part, of the population of
interest
Introduction To Statistics.ppt
Inferential Statistics
 Estimation
 e.g., Estimate the population
mean weight using the sample
mean weight
 Hypothesis testing
 e.g., Test the claim that the
population mean weight is 70 kg
Inference is the process of drawing conclusions or making decisions
about a population based on sample results
Sampling
a sample should have the same characteristics
as the population it is representing.
Sampling can be:
 with replacement: a member of the population
may be chosen more than once (picking the
candy from the bowl)
 without replacement: a member of the
population may be chosen only once (lottery
ticket)
Sampling methods
Sampling methods can be:
 random (each member of the population has an equal
chance of being selected)
 nonrandom
The actual process of sampling causes sampling
errors. For example, the sample may not be large
enough or representative of the population. Factors not
related to the sampling process cause nonsampling
errors. A defective counting device can cause a
nonsampling error.
Random sampling methods
 simple random sample (each sample of the same
size has an equal chance of being selected)
 stratified sample (divide the population into groups
called strata and then take a sample from each
stratum)
 cluster sample (divide the population into strata and
then randomly select some of the strata. All the
members from these strata are in the cluster sample.)
 systematic sample (randomly select a starting point
and take every n-th piece of data from a listing of the
population)
Descriptive Statistics
 Collect data
 e.g., Survey
 Present data
 e.g., Tables and graphs
 Summarize data
 e.g., Sample mean = i
X
n

Statistical data
 The collection of data that are relevant to the
problem being studied is commonly the most
difficult, expensive, and time-consuming part of
the entire research project.
 Statistical data are usually obtained by counting
or measuring items.
 Primary data are collected specifically for the
analysis desired
 Secondary data have already been compiled and
are available for statistical analysis
 A variable is an item of interest that can take on
many different numerical values.
 A constant has a fixed numerical value.
Data
Statistical data are usually obtained by counting or
measuring items. Most data can be put into the
following categories:
 Qualitative - data are measurements that each
fail into one of several categories. (hair color,
ethnic groups and other attributes of the
population)
 quantitative - data are observations that are
measured on a numerical scale (distance traveled
to college, number of children in a family, etc.)
Qualitative data
Qualitative data are generally described by words or
letters. They are not as widely used as quantitative data
because many numerical techniques do not apply to the
qualitative data. For example, it does not make sense to
find an average hair color or blood type.
Qualitative data can be separated into two subgroups:
 dichotomic (if it takes the form of a word with two
options (gender - male or female)
 polynomic (if it takes the form of a word with more
than two options (education - primary school,
secondary school and university).
Quantitative data
Quantitative data are always numbers and are the
result of counting or measuring attributes of a
population.
Quantitative data can be separated into two
subgroups:
 discrete (if it is the result of counting (the number of
students of a given ethnic group in a class, the
number of books on a shelf, ...)
 continuous (if it is the result of measuring (distance
traveled, weight of luggage, …)
Types of variables
Variables
Quantitative
Qualitative
Dichotomic Polynomic Discrete Continuous
Gender, marital
status
Brand of Pc, hair
color
Children in family,
Strokes on a golf
hole
Amount of income
tax paid, weight of
a student
Numerical scale of measurement:
 Nominal – consist of categories in each of which the number of
respective observations is recorded. The categories are in no logical
order and have no particular relationship. The categories are said to
be mutually exclusive since an individual, object, or measurement
can be included in only one of them.
 Ordinal – contain more information. Consists of distinct categories in
which order is implied. Values in one category are larger or smaller
than values in other categories (e.g. rating-excelent, good, fair, poor)
 Interval – is a set of numerical measurements in which the distance
between numbers is of a known, sonstant size.
 Ratio – consists of numerical measurements where the distance
between numbers is of a known, constant size, in addition, there is a
nonarbitrary zero point.
Introduction To Statistics.ppt
Data presentation
„ The question is“ said Alice, „whether you can
make words mean so many different things.“
„The question is,“ said Humpty Dumpty, „which
is to be master-that´s all.“ (Lewis Carroll)
Numerical presentation of qualitative
data
 pivot table (qualitative dichotomic statistical
attributes)
 contingency table (qualitative statistical
attributes from which at least one of them is
polynomic)
You should know how to convert absolute
values to relative ones (%).
Frequency distributions – numerical
presentation of quantitative data
 Frequency distribution – shows the frequency, or
number of occurences, in each of several
categories. Frequency distributions are used to
summarize large volumes of data values.
 When the raw data are measured on a
qunatitative scale, either interval or ration,
categories or classes must be designed for the
data values before a frequency distribution can
be formulated.
Steps for constructing a frequency
distribution
1. Determine the number of classes
2. Determine the size of each class
3. Determine the starting point for the first class
4. Tally the number of values that occur in each
class
5. Prepare a table of the distribution using actual
counts and/ or percentages (relative
frequencies)
m n

 
max min
h
m


Frequency table
 absolute frequency “ni” (Data TabData
AnalysisHistogram)
 relative frequency “fi”
Cumulative frequency distribution shows the
total number of occurrences that lie above or
below certain key values.
 cumulative frequency “Ni”
 cumulative relative frequency “Fi”
Charts and graphs
 Frequency distributions are good ways to present
the essential aspects of data collections in
concise and understable terms
 Pictures are always more effective in displaying
large data collections
Histogram
 Frequently used to graphically present interval
and ratio data
 Is often used for interval and ratio data
 The adjacent bars indicate that a numerical range
is being summarized by indicating the frequencies
in arbitrarily chosen classes
Introduction To Statistics.ppt
Frequency polygon
 Another common method for graphically
presenting interval and ratio data
 To construct a frequency polygon mark the
frequencies on the vertical axis and the values of
the variable being measured on the horizontal
axis, as with the histogram.
 If the purpose of presenting is comparation with
other distributions, the frequency polygon
provides a good summary of the data
Introduction To Statistics.ppt
Ogive
 A graph of a cumulative frequency distribution
 Ogive is used when one wants to determine how
many observations lie above or below a certain
value in a distribution.
 First cumulative frequency distribution is
constructed
 Cumulative frequencies are plotted at the upper
class limit of each category
 Ogive can also be constructed for a relative
frequency distribution.
Introduction To Statistics.ppt
Pie Chart
 The pie chart is an effective way of displaying the
percentage breakdown of data by category.
 Useful if the relative sizes of the data components
are to be emphasized
 Pie charts also provide an effective way of
presenting ratio- or interval-scaled data after they
have been organized into categories
Pie Chart
Bar chart
 Another common method for graphically
presenting nominal and ordinal scaled data
 One bar is used to represent the frequency for
each category
 The bars are usually positioned vertically with
their bases located on the horizontal axis of the
graph
 The bars are separated, and this is why such a
graph is frequently used for nominal and ordinal
data – the separation emphasize the plotting of
frequencies for distinct categories
Introduction To Statistics.ppt
Time Series Graph
The time series graph is a graph of
data that have been measured over
time.
The horizontal axis of this graph
represents time periods and the
vertical axis shows the numerical
values corresponding to these time
periods
Introduction To Statistics.ppt
Introduction To Statistics.ppt
Introduction To Statistics.ppt
1 de 43

Recomendados

Introduction to statistics por
Introduction to statisticsIntroduction to statistics
Introduction to statisticsShaamma(Simi_ch) Fiverr
30 vistas43 diapositivas
Stat-Lesson.pptx por
Stat-Lesson.pptxStat-Lesson.pptx
Stat-Lesson.pptxJennilynFeliciano2
9 vistas36 diapositivas
Introduction-To-Statistics-18032022-010747pm (1).ppt por
Introduction-To-Statistics-18032022-010747pm (1).pptIntroduction-To-Statistics-18032022-010747pm (1).ppt
Introduction-To-Statistics-18032022-010747pm (1).pptIsrar36
40 vistas19 diapositivas
Ca in patna por
Ca in patnaCa in patna
Ca in patnaEdhole.com
429 vistas44 diapositivas
Lesson1_Topic 1.pptx por
Lesson1_Topic 1.pptxLesson1_Topic 1.pptx
Lesson1_Topic 1.pptxDindoArambalaOjeda
23 vistas16 diapositivas
Statistics.ppt por
Statistics.pptStatistics.ppt
Statistics.ppt21EDM25Lilitha
17 vistas23 diapositivas

Más contenido relacionado

Similar a Introduction To Statistics.ppt

introduction to statistical theory por
introduction to statistical theoryintroduction to statistical theory
introduction to statistical theoryUnsa Shakir
1.8K vistas38 diapositivas
STATISTICS-AND-PROBABLITY-A-REVIEW-FOR-SHS.pdf por
STATISTICS-AND-PROBABLITY-A-REVIEW-FOR-SHS.pdfSTATISTICS-AND-PROBABLITY-A-REVIEW-FOR-SHS.pdf
STATISTICS-AND-PROBABLITY-A-REVIEW-FOR-SHS.pdfMariaCatherineErfeLa
2 vistas80 diapositivas
Lect 1_Biostat.pdf por
Lect 1_Biostat.pdfLect 1_Biostat.pdf
Lect 1_Biostat.pdfBirhanTesema
26 vistas74 diapositivas
Module 8-S M & T C I, Regular.pptx por
Module 8-S M & T C I, Regular.pptxModule 8-S M & T C I, Regular.pptx
Module 8-S M & T C I, Regular.pptxRajashekhar Shirvalkar
14 vistas220 diapositivas
Statistics-24-04-2021-20210618114031.ppt por
Statistics-24-04-2021-20210618114031.pptStatistics-24-04-2021-20210618114031.ppt
Statistics-24-04-2021-20210618114031.pptAkhtaruzzamanlimon1
2 vistas41 diapositivas
Statistics-24-04-2021-20210618114031.ppt por
Statistics-24-04-2021-20210618114031.pptStatistics-24-04-2021-20210618114031.ppt
Statistics-24-04-2021-20210618114031.pptJapnaamKaurAhluwalia
9 vistas41 diapositivas

Similar a Introduction To Statistics.ppt(20)

introduction to statistical theory por Unsa Shakir
introduction to statistical theoryintroduction to statistical theory
introduction to statistical theory
Unsa Shakir1.8K vistas
Chapter 4 MMW.pdf por RaRaRamirez
Chapter 4 MMW.pdfChapter 4 MMW.pdf
Chapter 4 MMW.pdf
RaRaRamirez1.4K vistas
QUANTITATIVE METHODS NOTES.pdf por BensonNduati1
QUANTITATIVE METHODS NOTES.pdfQUANTITATIVE METHODS NOTES.pdf
QUANTITATIVE METHODS NOTES.pdf
BensonNduati12.4K vistas
Engineering Statistics por Bahzad5
Engineering Statistics Engineering Statistics
Engineering Statistics
Bahzad5200 vistas
Introduction To Statistics por albertlaporte
Introduction To StatisticsIntroduction To Statistics
Introduction To Statistics
albertlaporte180.7K vistas
Machine learning pre requisite por Ram Singh
Machine learning pre requisiteMachine learning pre requisite
Machine learning pre requisite
Ram Singh75 vistas
Chapter 1 por Lem Lem
Chapter 1Chapter 1
Chapter 1
Lem Lem367 vistas
Statistics for Data Analytics por SSaudia
Statistics for Data AnalyticsStatistics for Data Analytics
Statistics for Data Analytics
SSaudia574 vistas
Introduction of biostatistics por khushbu
Introduction of biostatisticsIntroduction of biostatistics
Introduction of biostatistics
khushbu 4.9K vistas
DATA UNIT-3.pptx por bibha737
DATA UNIT-3.pptxDATA UNIT-3.pptx
DATA UNIT-3.pptx
bibha7373 vistas
Introduction to Statistics (Part -I) por YesAnalytics
Introduction to Statistics (Part -I)Introduction to Statistics (Part -I)
Introduction to Statistics (Part -I)
YesAnalytics6.2K vistas
Statistics final seminar por Tejas Jagtap
Statistics final seminarStatistics final seminar
Statistics final seminar
Tejas Jagtap1.6K vistas

Más de Manish Agarwal

What is Sequencing.pptx por
What is Sequencing.pptxWhat is Sequencing.pptx
What is Sequencing.pptxManish Agarwal
9 vistas4 diapositivas
Queueing Theory.pptx por
Queueing Theory.pptxQueueing Theory.pptx
Queueing Theory.pptxManish Agarwal
17 vistas9 diapositivas
What is Replacement Theory.pptx por
What is Replacement Theory.pptxWhat is Replacement Theory.pptx
What is Replacement Theory.pptxManish Agarwal
100 vistas8 diapositivas
Final Budegt.pptx por
Final Budegt.pptxFinal Budegt.pptx
Final Budegt.pptxManish Agarwal
8 vistas9 diapositivas
or intro.pptx por
or intro.pptxor intro.pptx
or intro.pptxManish Agarwal
4 vistas6 diapositivas
INTRODUCTION TO STATISTICS.pptx por
INTRODUCTION TO STATISTICS.pptxINTRODUCTION TO STATISTICS.pptx
INTRODUCTION TO STATISTICS.pptxManish Agarwal
14 vistas16 diapositivas

Más de Manish Agarwal(15)

Último

How Leaders See Data? (Level 1) por
How Leaders See Data? (Level 1)How Leaders See Data? (Level 1)
How Leaders See Data? (Level 1)Narendra Narendra
13 vistas76 diapositivas
Understanding Hallucinations in LLMs - 2023 09 29.pptx por
Understanding Hallucinations in LLMs - 2023 09 29.pptxUnderstanding Hallucinations in LLMs - 2023 09 29.pptx
Understanding Hallucinations in LLMs - 2023 09 29.pptxGreg Makowski
13 vistas18 diapositivas
ColonyOS por
ColonyOSColonyOS
ColonyOSJohanKristiansson6
9 vistas17 diapositivas
PROGRAMME.pdf por
PROGRAMME.pdfPROGRAMME.pdf
PROGRAMME.pdfHiNedHaJar
17 vistas13 diapositivas
UNEP FI CRS Climate Risk Results.pptx por
UNEP FI CRS Climate Risk Results.pptxUNEP FI CRS Climate Risk Results.pptx
UNEP FI CRS Climate Risk Results.pptxpekka28
11 vistas51 diapositivas
3196 The Case of The East River por
3196 The Case of The East River3196 The Case of The East River
3196 The Case of The East RiverErickANDRADE90
11 vistas4 diapositivas

Último(20)

Understanding Hallucinations in LLMs - 2023 09 29.pptx por Greg Makowski
Understanding Hallucinations in LLMs - 2023 09 29.pptxUnderstanding Hallucinations in LLMs - 2023 09 29.pptx
Understanding Hallucinations in LLMs - 2023 09 29.pptx
Greg Makowski13 vistas
UNEP FI CRS Climate Risk Results.pptx por pekka28
UNEP FI CRS Climate Risk Results.pptxUNEP FI CRS Climate Risk Results.pptx
UNEP FI CRS Climate Risk Results.pptx
pekka2811 vistas
3196 The Case of The East River por ErickANDRADE90
3196 The Case of The East River3196 The Case of The East River
3196 The Case of The East River
ErickANDRADE9011 vistas
Short Story Assignment by Kelly Nguyen por kellynguyen01
Short Story Assignment by Kelly NguyenShort Story Assignment by Kelly Nguyen
Short Story Assignment by Kelly Nguyen
kellynguyen0118 vistas
Cross-network in Google Analytics 4.pdf por GA4 Tutorials
Cross-network in Google Analytics 4.pdfCross-network in Google Analytics 4.pdf
Cross-network in Google Analytics 4.pdf
GA4 Tutorials6 vistas
Survey on Factuality in LLM's.pptx por NeethaSherra1
Survey on Factuality in LLM's.pptxSurvey on Factuality in LLM's.pptx
Survey on Factuality in LLM's.pptx
NeethaSherra15 vistas
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx por DataScienceConferenc1
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
[DSC Europe 23] Zsolt Feleki - Machine Translation should we trust it.pptx
Data structure and algorithm. por Abdul salam
Data structure and algorithm. Data structure and algorithm.
Data structure and algorithm.
Abdul salam 18 vistas
Supercharging your Data with Azure AI Search and Azure OpenAI por Peter Gallagher
Supercharging your Data with Azure AI Search and Azure OpenAISupercharging your Data with Azure AI Search and Azure OpenAI
Supercharging your Data with Azure AI Search and Azure OpenAI
Peter Gallagher37 vistas
RuleBookForTheFairDataEconomy.pptx por noraelstela1
RuleBookForTheFairDataEconomy.pptxRuleBookForTheFairDataEconomy.pptx
RuleBookForTheFairDataEconomy.pptx
noraelstela167 vistas
Building Real-Time Travel Alerts por Timothy Spann
Building Real-Time Travel AlertsBuilding Real-Time Travel Alerts
Building Real-Time Travel Alerts
Timothy Spann109 vistas
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdf por vikas12611618
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdfVikas 500 BIG DATA TECHNOLOGIES LAB.pdf
Vikas 500 BIG DATA TECHNOLOGIES LAB.pdf
vikas126116188 vistas
Organic Shopping in Google Analytics 4.pdf por GA4 Tutorials
Organic Shopping in Google Analytics 4.pdfOrganic Shopping in Google Analytics 4.pdf
Organic Shopping in Google Analytics 4.pdf
GA4 Tutorials10 vistas
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation por DataScienceConferenc1
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
[DSC Europe 23] Spela Poklukar & Tea Brasanac - Retrieval Augmented Generation
CRIJ4385_Death Penalty_F23.pptx por yvettemm100
CRIJ4385_Death Penalty_F23.pptxCRIJ4385_Death Penalty_F23.pptx
CRIJ4385_Death Penalty_F23.pptx
yvettemm1006 vistas

Introduction To Statistics.ppt

  • 2.  Gestor: prof. Ing. Zlata Sojková, CSc. zlata.sojkova@uniag.sk  Lecturer Ing. Jozef Palkovič, PhD. jozef.palkovic@uniag.sk
  • 3. Conditions for successful course completion  Method of evaluation and completion: - credit - exam (with a sufficient number of points, there's an opportunity to get the grade "free of charge")  Continuous assessment: - two tests during the semester (mid-term test and end- term test)  You should get at least 50% of total points to get credit Final evaluation: - exam (written and verbal part) - presenting a project  Precondition for successful course completion: Basic knowledge of MS Excel
  • 4. Recommended literature:  Hanke E. J, Reitsch A. G: Understanding Business Statistics  Anderson, D.R. - Sweeney, D.J. - Williams, T.A.: Statistics for Business and Economics. South-Western Pub., 2005, 320 p., ISBN 978-032-422-486-3  Jaisingh, L.R.: Statistics for the Utterly Confused. McGraw Hill, 2005, 352 p., ISBN 978-007-146-193-1  Everitt, B. S.: The Cambridge (explanatory) dictionary of statistics. Cambridge University Press, 2006, 442 p., ISBN 978-052-169-027-0  Illowsky, B. - Dean, S. (2009, August 5). Collaborative Statistics. Retrieved from the Connexions Web site: http://cnx.org/content/col10522/1.36 Moodle course by Martina Majorova at: http://moodle.uniag.sk/fem/course/view.php?id=211
  • 5. „There are three kinds of lies: lies, damned lies, and statistics.“ (B.Disraeli) Introduction To Statistics
  • 6. Why study statistics? 1. Data are everywhere 2. Statistical techniques are used to make many decisions that affect our lives 3. No matter what your career, you will make professional decisions that involve data. An understanding of statistical methods will help you make these decisions efectively
  • 7. Applications of statistical concepts in the business world  Finance – correlation and regression, index numbers, time series analysis  Marketing – hypothesis testing, chi-square tests, nonparametric statistics  Personel – hypothesis testing, chi-square tests, nonparametric tests  Operating management – hypothesis testing, estimation, analysis of variance, time series analysis
  • 8. Statistics  The science of collectiong, organizing, presenting, analyzing, and interpreting data to assist in making more effective decisions  Statistical analysis – used to manipulate summarize, and investigate data, so that useful decision-making information results.
  • 10. Types of statistics  Descriptive statistics – Methods of organizing, summarizing, and presenting data in an informative way  Inferential statistics – The methods used to determine something about a population on the basis of a sample  Population –The entire set of individuals or objects of interest or the measurements obtained from all individuals or objects of interest  Sample – A portion, or part, of the population of interest
  • 12. Inferential Statistics  Estimation  e.g., Estimate the population mean weight using the sample mean weight  Hypothesis testing  e.g., Test the claim that the population mean weight is 70 kg Inference is the process of drawing conclusions or making decisions about a population based on sample results
  • 13. Sampling a sample should have the same characteristics as the population it is representing. Sampling can be:  with replacement: a member of the population may be chosen more than once (picking the candy from the bowl)  without replacement: a member of the population may be chosen only once (lottery ticket)
  • 14. Sampling methods Sampling methods can be:  random (each member of the population has an equal chance of being selected)  nonrandom The actual process of sampling causes sampling errors. For example, the sample may not be large enough or representative of the population. Factors not related to the sampling process cause nonsampling errors. A defective counting device can cause a nonsampling error.
  • 15. Random sampling methods  simple random sample (each sample of the same size has an equal chance of being selected)  stratified sample (divide the population into groups called strata and then take a sample from each stratum)  cluster sample (divide the population into strata and then randomly select some of the strata. All the members from these strata are in the cluster sample.)  systematic sample (randomly select a starting point and take every n-th piece of data from a listing of the population)
  • 16. Descriptive Statistics  Collect data  e.g., Survey  Present data  e.g., Tables and graphs  Summarize data  e.g., Sample mean = i X n 
  • 17. Statistical data  The collection of data that are relevant to the problem being studied is commonly the most difficult, expensive, and time-consuming part of the entire research project.  Statistical data are usually obtained by counting or measuring items.  Primary data are collected specifically for the analysis desired  Secondary data have already been compiled and are available for statistical analysis  A variable is an item of interest that can take on many different numerical values.  A constant has a fixed numerical value.
  • 18. Data Statistical data are usually obtained by counting or measuring items. Most data can be put into the following categories:  Qualitative - data are measurements that each fail into one of several categories. (hair color, ethnic groups and other attributes of the population)  quantitative - data are observations that are measured on a numerical scale (distance traveled to college, number of children in a family, etc.)
  • 19. Qualitative data Qualitative data are generally described by words or letters. They are not as widely used as quantitative data because many numerical techniques do not apply to the qualitative data. For example, it does not make sense to find an average hair color or blood type. Qualitative data can be separated into two subgroups:  dichotomic (if it takes the form of a word with two options (gender - male or female)  polynomic (if it takes the form of a word with more than two options (education - primary school, secondary school and university).
  • 20. Quantitative data Quantitative data are always numbers and are the result of counting or measuring attributes of a population. Quantitative data can be separated into two subgroups:  discrete (if it is the result of counting (the number of students of a given ethnic group in a class, the number of books on a shelf, ...)  continuous (if it is the result of measuring (distance traveled, weight of luggage, …)
  • 21. Types of variables Variables Quantitative Qualitative Dichotomic Polynomic Discrete Continuous Gender, marital status Brand of Pc, hair color Children in family, Strokes on a golf hole Amount of income tax paid, weight of a student
  • 22. Numerical scale of measurement:  Nominal – consist of categories in each of which the number of respective observations is recorded. The categories are in no logical order and have no particular relationship. The categories are said to be mutually exclusive since an individual, object, or measurement can be included in only one of them.  Ordinal – contain more information. Consists of distinct categories in which order is implied. Values in one category are larger or smaller than values in other categories (e.g. rating-excelent, good, fair, poor)  Interval – is a set of numerical measurements in which the distance between numbers is of a known, sonstant size.  Ratio – consists of numerical measurements where the distance between numbers is of a known, constant size, in addition, there is a nonarbitrary zero point.
  • 24. Data presentation „ The question is“ said Alice, „whether you can make words mean so many different things.“ „The question is,“ said Humpty Dumpty, „which is to be master-that´s all.“ (Lewis Carroll)
  • 25. Numerical presentation of qualitative data  pivot table (qualitative dichotomic statistical attributes)  contingency table (qualitative statistical attributes from which at least one of them is polynomic) You should know how to convert absolute values to relative ones (%).
  • 26. Frequency distributions – numerical presentation of quantitative data  Frequency distribution – shows the frequency, or number of occurences, in each of several categories. Frequency distributions are used to summarize large volumes of data values.  When the raw data are measured on a qunatitative scale, either interval or ration, categories or classes must be designed for the data values before a frequency distribution can be formulated.
  • 27. Steps for constructing a frequency distribution 1. Determine the number of classes 2. Determine the size of each class 3. Determine the starting point for the first class 4. Tally the number of values that occur in each class 5. Prepare a table of the distribution using actual counts and/ or percentages (relative frequencies) m n    max min h m  
  • 28. Frequency table  absolute frequency “ni” (Data TabData AnalysisHistogram)  relative frequency “fi” Cumulative frequency distribution shows the total number of occurrences that lie above or below certain key values.  cumulative frequency “Ni”  cumulative relative frequency “Fi”
  • 29. Charts and graphs  Frequency distributions are good ways to present the essential aspects of data collections in concise and understable terms  Pictures are always more effective in displaying large data collections
  • 30. Histogram  Frequently used to graphically present interval and ratio data  Is often used for interval and ratio data  The adjacent bars indicate that a numerical range is being summarized by indicating the frequencies in arbitrarily chosen classes
  • 32. Frequency polygon  Another common method for graphically presenting interval and ratio data  To construct a frequency polygon mark the frequencies on the vertical axis and the values of the variable being measured on the horizontal axis, as with the histogram.  If the purpose of presenting is comparation with other distributions, the frequency polygon provides a good summary of the data
  • 34. Ogive  A graph of a cumulative frequency distribution  Ogive is used when one wants to determine how many observations lie above or below a certain value in a distribution.  First cumulative frequency distribution is constructed  Cumulative frequencies are plotted at the upper class limit of each category  Ogive can also be constructed for a relative frequency distribution.
  • 36. Pie Chart  The pie chart is an effective way of displaying the percentage breakdown of data by category.  Useful if the relative sizes of the data components are to be emphasized  Pie charts also provide an effective way of presenting ratio- or interval-scaled data after they have been organized into categories
  • 38. Bar chart  Another common method for graphically presenting nominal and ordinal scaled data  One bar is used to represent the frequency for each category  The bars are usually positioned vertically with their bases located on the horizontal axis of the graph  The bars are separated, and this is why such a graph is frequently used for nominal and ordinal data – the separation emphasize the plotting of frequencies for distinct categories
  • 40. Time Series Graph The time series graph is a graph of data that have been measured over time. The horizontal axis of this graph represents time periods and the vertical axis shows the numerical values corresponding to these time periods