A sample of the instructor's resources to support the textbook Statistics for Geography and Environmental Science. Further information at www.social-statistics.org
Statistics for Geography and Environmental Science:an introductory lecture course (sample)
1. Statistics for Geography and
Environmental Science:
an introductory lecture course
(sample)
By Richard Harris, with material
by Claire Jarvis
USA: http://amzn.to/rNBWd5
UK: http://amzn.to/tZ7fVu
3. Module 1
(Extracts from Chapter 1 of Statistics for Geography
and Environmental Science)
DATA, STATISTICS AND
GEOGRAPHY
4. Module overview
To convince you that studying
statistics is a good idea!
Our argument is that data collection
and analysis are central to the
functioning of contemporary society
so knowledge of quantitative
methods is a necessary skill to
contribute to social and scientific
debate.
5. About statistics
Statistics are a reflective practice: a
way of approaching research that
requires a clear and manageable
research question to be formulated, a
means to answer that question,
knowledge of the assumptions of
each test used, an understanding of
the consequences of violating those
assumptions, and awareness of the
researcher‘s own prejudices when
doing the research.
6. Some reasons to study statistics
Reasons for human geographers
– Data collection and analysis are central
to the functioning of society, to systems
of governance and science.
– Knowledge of statistics is an entry into
debate, informed critique and the
possibility of creating change.
7. Some reasons to study statistics
Reasons for GI scientists
– To address the uncertainties and
ambiguities of using data analytical.
– Because of the increased integration of
mapping capabilities, data visualizations
and (geo-) statistical analysis.
8. Some reasons to study statistics
Reasons for all students
– They provide a transferable skill set
using in other areas of research, study
and employment.
– There is a recognised shortage of
students with skills in quantitative
methods, especially within the social
sciences.
9. Types of statistic
Descriptive
– Used to provide a summary of a set of
measurements, e.g. the average.
Inferential
– Use the data at hand to convey information
about the population (‗the greater
something‘) from which the data are drawn.
Relational
– Consider whether greater or lesser values
in one set of data are related to greater or
lesser values in another.
10. Geographical data
These are records of what has
happened at some location on the
Earth‘s surface and where.
For many statistical tests the where
is largely ignored.
However, it is central to geostatistics
and to spatial statistics (as their
names suggest)
11. Some problems when analysing
geographical data
Standard statistical tests assume that
each ‗bit‘ of data (each observation)
has a value that is not influenced by
any other.
However, we may often expect there
to be geographical patterns in the
data.
– Spatial autocorrelation: geographical
patterns in the measurements
12. Some problems when analysing
geographical data
Determining what causes what in a
complex and dynamic natural or
social system is extremely tricky.
Two things may be associated (e.g.
greater income inequality and more
non-recycled waste) without the one
directly causing the other.
13. Some problems when analysing
geographical data
Data and structured forms of enquiry
can only tell us so much and may not
be appropriate to some types of
research for which a more
qualitative, participatory or less
representational approach may be
better.
14. Further reading
Chapter 1 of Statistics for
Geography and Environmental
Science by Richard Harris and Claire
Jarvis (Prentice Hall / Pearson, 2011)
Includes a review of the following
key concepts: types of statistics;
why error is unavoidable;
geographical data analysis; and
spatial autocorrelation and the first
law of geography.
15. Module 2
(Extracts from Chapter 2 of Statistics for Geography
and Environmental Science)
DESCRIPTIVE STATISTICS
16. Module overview
This module is about ―everyday statistics‖,
the sort that summarise data and describe
them in simple ways.
They include the number of home runs this
season, average male earnings, numbers
unemployed, outside temperature, average
cost of a barrel of oil, regional variations in
crime rates, pollution statistics, measures
of the economy and other ―facts and
figures‖
These are the sorts of descriptive
information that come about by observing
and measuring something, then by
summarising the data in clear and
straightforward ways.
17. Data and variables
Data
– A collection of observations:
measurements made of something.
A variable
– Another name for a collection of data.
Variable because it is unlikely that the
data are all the same.
Data types
– These include discrete, continuous,
and categorical data.
18. Simple ways of presenting data
Discrete data Continuous data
Frequency table Summary table
Bar chart (below) Histogram (below, with a rug plot)
20. Information to include
in a summary table
Measures of central tendency
(―averages‖)
– The mean and/or median
• The ―centre‖ of the data
Measures of spread and variation
– The range (minimum to maximum)
– The interquartile range (from ‗mid-
spread‘ of the data)
– The standard deviation,s
21. More about the standard deviation
Essentially a measure of average
variation around the mean.
It is also the square root of the
variance.
The variance is the sum of squares
divided by the degrees of freedom
22. Boxplots
Are useful for
showing the
median,
interquartile
range and range
of a set of data,
for indentifying
outliers and also
for comparing
variables.
23. Other ways of classifying numeric
data
Nominal, ordinal, interval and ratio
Counts and rates
Proportions and percentages
Parametric and non—parametric
Arithmetic and geometric
Primary and secondary
24. Further reading
Chapter 2 of Statistics for Geography
and Environmental Science by Richard
Harris and Claire Jarvis (Prentice Hall /
Pearson, 2011)
Includes a review of the following key
concepts: data and variables; discrete
and continuous data; the range;
histograms, rug plots, and stem and
leaf plots; measures of central
tendency; why averages can be
misleading; quantiles; the sum of
squares; degrees of freedom; the
standard deviation and the variance;
box plots; and five and six number
summaries