Se ha denunciado esta presentación.
Utilizamos tu perfil de LinkedIn y tus datos de actividad para personalizar los anuncios y mostrarte publicidad más relevante. Puedes cambiar tus preferencias de publicidad en cualquier momento.
Próxima SlideShare
Cargando en…5
×

# Data Analysis using SPSS: Part 1

363 visualizaciones

Publicado el

These slides will guide a practitioner to use SPSS for data analysis.

Publicado en: Datos y análisis
• Full Name
Comment goes here.

Are you sure you want to Yes No
Your message goes here
• Sé el primero en comentar

### Data Analysis using SPSS: Part 1

1. 1. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Data Analysis using SPSS Taddesse Kassahun Email: tadestat@gmail.com Cellphone: +251911835082 Department of Statistics Addis Ababa University Taddesse Kassahun Data Analysis using SPSS 1 / 135
2. 2. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Part One Taddesse Kassahun Data Analysis using SPSS 2 / 135
3. 3. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Contents 1 Basics of Research and Statistics Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation 2 Introduction to SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use 3 Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Taddesse Kassahun Data Analysis using SPSS 3 / 135
4. 4. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Research Problems A research process begins with a problem of interest to the researcher. A research problem is a statement that asks about the relationships between variables. Statistical methods can be used to indicate solutions for such research problems using: What kind and how much data need to be collected? How should data are organized and summarized? How can we analyze data and draw conclusions from it? How can we assess the strength of the conclusions and evaluate their uncertainty? Taddesse Kassahun Data Analysis using SPSS 4 / 135
5. 5. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Variables A variable is a characteristic of the participants or situation for a given study which takes diﬀerent values. A variable must be able to vary or have diﬀerent values. Sex is a variable: it has two values (female or male). Age is a variable: it has a large number of values. Level of statistics knowledge is a variable (none, average, suﬃcient). In quantitative research, variables are divided into two: independent variables and dependent variables. SPSS uses the term values to describe the several options or values of a variable. Taddesse Kassahun Data Analysis using SPSS 5 / 135
6. 6. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Variables ... For example, a study is performed to investigate the eﬀect of a treatment (one group is assigned to treatment group while the other group doesn’t receive the treatment) The independent variable (treatment type) has two values or levels (treatment and no treatment). Suppose now that we are interested in comparing two treatments and include a third no-treatment group as a control. The study still has one active independent variable (treatment type) but with three values or levels. Variable = Treatment type; Variable values: 1 = Treatment 1, 2 = Treatment 2, 3 = Control Taddesse Kassahun Data Analysis using SPSS 6 / 135
7. 7. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Variables... Dependent variable is assumed to measure or assess the eﬀect of an independent variable(s). Dependent variables are often test scores, factory outputs, ratings on questionnaires, readings from instruments, etc. SPSS uses other terms for the dependent variable: Dependent list, Test variable. Dependent or independent variable can be either qualitative or quantitative. Taddesse Kassahun Data Analysis using SPSS 7 / 135
8. 8. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Hypotheses and Questions Research hypotheses are predictive statements about the relationship between variables. Research questions do not entail speciﬁc predictions and are phrased in question format. Example. Question: Is there a diﬀerence in students’ scores on a standardized test if they took two tests in one day versus taking only one test on each of two days? Hypothesis: Students who take only one test per day will score better on standardized tests than students who take two tests in one day. Research questions can be diﬀerence, associational, and descriptive. Taddesse Kassahun Data Analysis using SPSS 8 / 135
9. 9. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Statistics The two broad areas of statistics to deal with research questions: descriptive and inferential statistics. Descriptive statistics is devoted to the summarization and description of data. Descriptive statistics presents data using graphs,tables, and numbers such as averages, measures of variation, and percentiles. Inferential statistics consist of methods for drawing generalizations and predictions about population based on information obtained from a sample of the population. The two primary methods for making inference are estimation and hypothesis testing. Taddesse Kassahun Data Analysis using SPSS 9 / 135
10. 10. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Inference Process Taddesse Kassahun Data Analysis using SPSS 10 / 135
11. 11. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Scales of Measurement Taddesse Kassahun Data Analysis using SPSS 11 / 135
12. 12. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Statistics for diﬀerent types of research question Taddesse Kassahun Data Analysis using SPSS 12 / 135
13. 13. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Group comparison questions Usually used for Randomized Experimental and Comparative Approaches. Values or categories of the independent variable are used to categorize the participants into groups (e.g., high and low). Example. Do persons with low and high anxiety diﬀer on their average grades? That is, will the average GPA of the high anxiety persons be signiﬁcantly diﬀerent from the average GPA for low anxiety persons? Taddesse Kassahun Data Analysis using SPSS 13 / 135
14. 14. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Associational Questions Used for the Associational Approach, in which there are two or more variables. The scores on the independent variable are associated with or related to the dependent variable scores. Is the type of relationship positive or negative? How strong is such relationship? Example. Will students’ degree of anxiety be associated with their overall GPA? That is, will knowing students’ level of anxiety tell us anything about their tendency to make higher versus lower grades? Taddesse Kassahun Data Analysis using SPSS 14 / 135
15. 15. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Descriptive questions Used for the Descriptive Approach. For this type of question, scores on a single variable are described in terms of their central tendency, variability, or percentages in each category/level. Example. What percentage of people in Addis Ababa can utilize smart phones? What is the average time a person wait for taxi in Arat Kilo to go to Piassa? Taddesse Kassahun Data Analysis using SPSS 15 / 135
16. 16. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Role of Statistics in Research Taddesse Kassahun Data Analysis using SPSS 16 / 135
17. 17. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Basic concepts Survey - a method of data collection with no special control on one or more of the factors aﬀecting variable of interest. Two types of surveys: census (complete enumeration) and sampling. The two situations where census is mandatory: If information is needed from each and every unit of the study population ; If a study is on rare events. Sampling Unit: an element or a set of elements considered for selection at some stage of sampling, e.g., animals, persons, households, etc. Taddesse Kassahun Data Analysis using SPSS 17 / 135
18. 18. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Basic concepts . . . Sampling Frame: listing of all the things or sampling units that makes up a given population. Sample Design: A set of rules or procedures that specify how a sample is to be selected. Sampling Error: the diﬀerence between the true population value and the statistic. It can be minimized by increasing the size of sample. Non-sampling Error: errors occurring due to data are incorrectly collected, recorded or analyzed. It may happen both in census survey and sample survey. Taddesse Kassahun Data Analysis using SPSS 18 / 135
19. 19. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Basic concepts . . . Planning a research design is the foremost task before we select the data collection instrument(s). Select, modify or develop a data collection instrument: questionnaires,structured interviews, observations,standardized inventories, etc. Pilot test and reﬁne instruments: It is desirable to try out our instrument and directions with, at the very least, a few colleagues or friends. We should check our raw data after we collect it even before it is entered into the computer. We then assign numbers to the values or levels of each categorical variable. Taddesse Kassahun Data Analysis using SPSS 19 / 135
20. 20. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Methods of Data Presentation Data in raw form are usually not easy to use for decision making. Data can be presented using Table (of counts, percentages). It can be one way, contingency or multi-way in general. Chart (bar – simple, clustered,stacked; pie) Graph (histogram, frequency polygons, line plot, stem and leaf plot) Statistical quantities such as mean, range, standard deviation, etc. The type of diagram/ graph to use depends on the variable being summarized. Taddesse Kassahun Data Analysis using SPSS 20 / 135
21. 21. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Describing Data Numerically Taddesse Kassahun Data Analysis using SPSS 21 / 135
22. 22. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Statistical Estimation Estimation is the process of determining a likely value for a variable in the population based on information collected from the sample. Estimation is the use of sample statistics to estimate population parameters. Interest usually lies in looking at estimates of many statistics — totals, averages, proportions etc. for diﬀerent variables. For example, a sample survey could be used to produce estimates for the proportion of smokers among all people aged 15 to 24 in the population. Taddesse Kassahun Data Analysis using SPSS 22 / 135
23. 23. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Point Estimate A single numerical value used to estimate the corresponding population parameter. Taddesse Kassahun Data Analysis using SPSS 23 / 135
24. 24. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Interval Estimate An interval estimate provides how much uncertainty is associated with a point estimate of a population parameter. An interval gives a range of values: by taking into consideration variation in sample statistics from sample to sample; based on observation from 1 sample; to provide information about closeness to unknown population parameters. It is stated in terms of level of conﬁdence. If Pr(a < θ < b) = 1 − α, then the interval from a to b is called a 100(1 − α)% conﬁdence interval of θ. Taddesse Kassahun Data Analysis using SPSS 24 / 135
25. 25. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Interval Estimate The general formula for interval estimates (conﬁdence intervals) is: PointEstimate ± (Reliability Factor)(Standard Error) Taddesse Kassahun Data Analysis using SPSS 25 / 135
26. 26. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Interval Estimate Conﬁdence intervals for µ and P: σ2 known: Z is used as a test statistic. σ2 unknown: t is used as a test statistic. Taddesse Kassahun Data Analysis using SPSS 26 / 135
27. 27. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Conﬁdence Interval for µ (σ2 Known) Assumption: Population is normally distributed. Conﬁdence interval estimate is given by: X − Zα/2 σ √ n < µ < X + Zα/2 σ √ n where Zα/2 is the normal distribution value for a probability of α/2 in each tail. Taddesse Kassahun Data Analysis using SPSS 27 / 135
28. 28. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Conﬁdence Interval for µ (σ2 Unknown) If the population standard deviation is unknown, we can substitute it by the sample standard deviation (s). This introduces extra uncertainty, since s is variable from sample to sample. So, we use the t distribution instead of the normal distribution. Assumption: Population is normally distributed. Conﬁdence interval estimate is given by: X − tn−1,α/2 s √ n < µ < X + tn−1,α/2 s √ n Taddesse Kassahun Data Analysis using SPSS 28 / 135
29. 29. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Conﬁdence Interval for Population Proportion Upper and lower conﬁdence limits for the population proportion are calculated with the formula ˆp − Zα/2 ˆp(1 − ˆp) n < p < ˆp + Zα/2 ˆp(1 − ˆp) n where Zα/2 is the standard normal value for the level of conﬁdence desired. ˆp is the sample proportion n is the sample size Taddesse Kassahun Data Analysis using SPSS 29 / 135
30. 30. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Interpretation of Conﬁdence Interval Suppose that a 95% conﬁdence interval for the true proportion of internet users in a city is: 0.16 < p < 0.33 An investigator is 95% conﬁdent that the true percentage of internet users in the city is between 16% and 33%. Although the interval from 0.16 to 0.33 may or may not contain the true proportion, 95% of intervals formed in this manner will contain the true proportion. Taddesse Kassahun Data Analysis using SPSS 30 / 135
31. 31. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Hypothesis Testing A statistical hypothesis is a statement concerning the probability distribution of population parameters that are inherent in a probability distribution. In a hypothesis testing problem, a null hypothesis (H0) & an alternative hypothesis (H1) are formulated. The null and alternative hypotheses should be mutually exclusive. H0 : Population proportion of people satisﬁed with the service = 0.7 H1 : Not H0 A hypothesis may be one sided, e.g., H1 : P > 0.7 or two sided,e.g., P = 0.7. Taddesse Kassahun Data Analysis using SPSS 31 / 135
32. 32. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Hypothesis Testing Two types of errors, namely Type I and Type II error may be committed in testing a hypothesis. Type I error is the error of rejecting the null hypothesis when it is actually TRUE. The probability of committing Type I error is denoted by α, also called level of signiﬁcance. A Type II error is the error of failure to reject the null hypothesis when it is FALSE. The probability of committing Type II error is denoted by β. 1 − β is called the power of a test. Taddesse Kassahun Data Analysis using SPSS 32 / 135
33. 33. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Outline for Hypothesis Testing 1 From the (word) problem, determine the appropriate null hypothesis & alternative hypothesis. 2 Identify the appropriate test statistic (a sample statistic upon which we base our decision) and calculate its value from the data. 3 Find the rejection region by looking up the critical value in the appropriate table. 4 Observe whether the computed value of the test statistic lies within the rejection region. If so, reject the null hypothesis; otherwise, do not reject the null hypothesis. 5 Interpret the results Taddesse Kassahun Data Analysis using SPSS 33 / 135
34. 34. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Hypothesis Testing Process Taddesse Kassahun Data Analysis using SPSS 34 / 135
35. 35. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation Level of Signiﬁcance and the Rejection Region Taddesse Kassahun Data Analysis using SPSS 35 / 135
36. 36. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation p-Value Approach to Testing p-value: Probability of obtaining a test statistic more extreme (≤ or ≥) than the observed sample value given H0 is true. Also called observed level of signiﬁcance. Smallest value of α for which H0 can be rejected. Convert sample result (e.g.,X) to test statistic (e.g., Z statistic). Obtain the p-value For an upper tail test: Taddesse Kassahun Data Analysis using SPSS 36 / 135
37. 37. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Research Problems Variables Research Hypotheses and Questions Collect the Data Methods of Data Presentation p-Value Approach to Testing For a lower tail test: For a two tail test: p−value = 2×min{Pr(Z > ¯X − µ0 σ/ √ n |H0), Pr(Z ≤ ¯X − µ0 σ/ √ n |H0)} Decision rule: Reject the null hypothesis if p–value < α. Taddesse Kassahun Data Analysis using SPSS 37 / 135
38. 38. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Introduction to SPSS Taddesse Kassahun Data Analysis using SPSS 38 / 135
39. 39. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use What is a statistical package? A computer program/set of programs that provides diﬀerent statistical procedures within a uniﬁed framework. Currently owned and developed by IBM corporation, SPSS stands for ”Statistical Product and Service Solutions”. SPSS is a very powerful and user friendly program. It uses both a graphical and a syntactical interface. SPSS can take data from almost any type of ﬁle and use them to generate tabulated reports, charts, and plots of distributions and trends, descriptive statistics, and conduct complex statistical analyses. Taddesse Kassahun Data Analysis using SPSS 39 / 135
40. 40. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use What is a statistical package? A computer program/set of programs that provides diﬀerent statistical procedures within a uniﬁed framework. Currently owned and developed by IBM corporation, SPSS stands for ”Statistical Product and Service Solutions”. SPSS is a very powerful and user friendly program. It uses both a graphical and a syntactical interface. SPSS can take data from almost any type of ﬁle and use them to generate tabulated reports, charts, and plots of distributions and trends, descriptive statistics, and conduct complex statistical analyses. Taddesse Kassahun Data Analysis using SPSS 39 / 135
41. 41. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use What is a statistical package? A computer program/set of programs that provides diﬀerent statistical procedures within a uniﬁed framework. Currently owned and developed by IBM corporation, SPSS stands for ”Statistical Product and Service Solutions”. SPSS is a very powerful and user friendly program. It uses both a graphical and a syntactical interface. SPSS can take data from almost any type of ﬁle and use them to generate tabulated reports, charts, and plots of distributions and trends, descriptive statistics, and conduct complex statistical analyses. Taddesse Kassahun Data Analysis using SPSS 39 / 135
42. 42. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use What is a statistical package? A computer program/set of programs that provides diﬀerent statistical procedures within a uniﬁed framework. Currently owned and developed by IBM corporation, SPSS stands for ”Statistical Product and Service Solutions”. SPSS is a very powerful and user friendly program. It uses both a graphical and a syntactical interface. SPSS can take data from almost any type of ﬁle and use them to generate tabulated reports, charts, and plots of distributions and trends, descriptive statistics, and conduct complex statistical analyses. Taddesse Kassahun Data Analysis using SPSS 39 / 135
43. 43. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use What is a statistical package? A computer program/set of programs that provides diﬀerent statistical procedures within a uniﬁed framework. Currently owned and developed by IBM corporation, SPSS stands for ”Statistical Product and Service Solutions”. SPSS is a very powerful and user friendly program. It uses both a graphical and a syntactical interface. SPSS can take data from almost any type of ﬁle and use them to generate tabulated reports, charts, and plots of distributions and trends, descriptive statistics, and conduct complex statistical analyses. Taddesse Kassahun Data Analysis using SPSS 39 / 135
44. 44. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Starting SPSS You may use any one of the following options to start SPSS. 1 Go to the Applications folder, and select SPSS from the list of programs (or Start ⇒ All Programs ⇒ IBM SPSS Statistics ⇒ IBM SPSS Statistics 20). 2 Double-click SPSS shortcut icon on the desktop. (if present). Taddesse Kassahun Data Analysis using SPSS 40 / 135
45. 45. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Starting SPSS If you select Run tutorial and click on OK, the tutorial window will be opened. Taddesse Kassahun Data Analysis using SPSS 41 / 135
46. 46. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Starting SPSS If you select the Type in data and click on OK, the data editor window will be opened. Taddesse Kassahun Data Analysis using SPSS 42 / 135
47. 47. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Starting SPSS If you select Run an existing query and click on OK, the following window will be opened. Taddesse Kassahun Data Analysis using SPSS 43 / 135
48. 48. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Staring SPSS If you select Create new query using database wizard and click on OK, the following window will be opened. Taddesse Kassahun Data Analysis using SPSS 44 / 135
49. 49. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Exit SPSS To exit the SPSS program: click on the close button at the top right hand side corner, or. click on the File menu and choose the option Exit. Taddesse Kassahun Data Analysis using SPSS 45 / 135
50. 50. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use SPSS Windows The most common windows in SPSS are: Data Editor Output Viewer Pivot Table Editor Chart Editor Syntax Editor The ﬁle extension for Data Editor window is *.sav The ﬁle extension for output viewer is *.spv. The ﬁle extension for syntax editor is *.sps. There are short cut toolbars which provide quick and easy access to diﬀerent facilities on each window. Taddesse Kassahun Data Analysis using SPSS 46 / 135
51. 51. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Data Editor Window The data editor window is a spreadsheet like window to: create a new data ﬁle and enter data; display the contents of the working data ﬁle; make changes to the existing data ﬁles and run statistical analysis. There are two views in this window, namely data view and variable view Taddesse Kassahun Data Analysis using SPSS 47 / 135
52. 52. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Data Editor Window . . . Unlike most spreadsheets, the Data Editor can only have one dataset open at a time. You can open multiple Data Editors at one time, each of which contains a separate dataset. The Data Editor contains several menu items that are useful for performing various operations on your data. Data Editor containing an example dataset. Taddesse Kassahun Data Analysis using SPSS 48 / 135
53. 53. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Output Viewer Window It displays the results of any statistical procedures you run and other texts. It has two panes: right-hand & left-hand. Right-hand pane contains statistical tables, charts, and text output. Left-hand pane comprises a tree structure, which provides an outline view of the contents. Taddesse Kassahun Data Analysis using SPSS 49 / 135
54. 54. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Pivot Table Editor Window This window will allow you to edit the contents of a table displayed by SPSS. Double clicking a table on Viewer window results the Pivot Table Editor window. Taddesse Kassahun Data Analysis using SPSS 50 / 135
55. 55. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Chart Editor Window This window will help you edit charts produced by SPSS. Double clicking a chart on the Viewer window gives this window. You can change the colors, select diﬀerent type fonts or sizes, rotate axes, change the chart type, etc. Taddesse Kassahun Data Analysis using SPSS 51 / 135
56. 56. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Syntax Editor Window Syntax is a computer code which produces a speciﬁc output. You may use this window if you wish to run SPSS commands instead of clicking on the pull-down menus. The Syntax Editor is recommended when there is a need to repeat a procedure. Use the normal SPSS menus to set up the basic commands and paste them on Syntax Editor Window. Taddesse Kassahun Data Analysis using SPSS 52 / 135
57. 57. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Help in SPSS Help is available in a number of diﬀerent ways such as: Help menu. Every window has a Help menu on the menu bar. The Topics menu item provides access to the help system, where you can use the Contents and Index tabs to ﬁnd topics. The Tutorial menu item provides access to the introductory tutorial. Taddesse Kassahun Data Analysis using SPSS 53 / 135
58. 58. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Help in SPSS Dialog box Help buttons. Most dialog boxes have a Help button that takes you directly to a Help topic for that dialog box. The Help topic provides general information and links to related topics. Statistics Coach. A wizard-like method for ﬁnding the right statistical or charting procedure for what you want to do. Case Studies provides hands-on examples of how to create various types of statistical analyses and interpret the results. Taddesse Kassahun Data Analysis using SPSS 54 / 135
59. 59. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use SPSS Menus Menus are list of procedures that the user will ﬁnd on top of a computer screen. The current version of SPSS has 12 main menus. File ⇒ Edit ⇒ View ⇒ Data ⇒ Transform ⇒ Analyze ⇒ Direct Marketing ⇒ Graphs ⇒ Utilities ⇒ Add-ons ⇒ Window ⇒ Help. File menu: to open, save, print and close ﬁles, provides access to recently used ﬁles, etc. Taddesse Kassahun Data Analysis using SPSS 55 / 135
60. 60. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use SPSS Menus Edit menu: to do things like: copy, paste, insert variables, edit user preferences, etc. View menu: to hide/ show: the toolbar, status bar, gridlines, etc. Data menu is useful to deﬁne variables and make changes to the data ﬁle you are using. Taddesse Kassahun Data Analysis using SPSS 56 / 135
61. 61. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use SPSS Menus Transform menu is used to make changes to selected variable(s) in the data ﬁle you are using. It includes recoding existing variables & computing new variables. Analyze menu is useful to perform statistical analyses such as producing Reports, Calculating Descriptive Statistics, as well as various statistical procedures such as Regression and Correlation. Direct Marketing menu provides tools designed to improve the results of direct marketing campaigns by identifying demographic, purchasing, and other characteristics that deﬁne various groups of consumers. Taddesse Kassahun Data Analysis using SPSS 57 / 135
62. 62. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use SPSS Menus Graphs menu lets one make various types of plots from a given dataset. Utilities menu gives information about variables and ﬁles. Add-ons menu tells about other programs of the SPSS family, such as Amos, Decision Time, Text Analysis, services delivered, etc. Window menu to manage SPSS window, split data, etc. Help menu provides various help facilities. Taddesse Kassahun Data Analysis using SPSS 58 / 135
63. 63. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Start and Exit SPSS SPSS Windows Help in SPSS The SPSS Menus and their Use Typical SPSS Data Flow Taddesse Kassahun Data Analysis using SPSS 59 / 135
64. 64. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Working with the Data Editor Data Editor window can be displayed in one of the two views: Data View Variable View. The Data View displays the contents of the data ﬁle in the form of a spreadsheet. Variable View deﬁnes all variables in data ﬁle. Switching from one view to the other can be done by clicking the appropriate tab (Data View or Variable View). (see the picture on next slide). Taddesse Kassahun Data Analysis using SPSS 60 / 135
65. 65. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Data Editor Window Taddesse Kassahun Data Analysis using SPSS 61 / 135
66. 66. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement The Data View To display the data & corresponding variables. Columns represent variables & rows represent cases (observations). Press Ctrl-Home to move to the ﬁrst cell. Press Ctrl-End to move to the last cell. Press Ctrl-Home again to move back to the ﬁrst cell. Each cell in the grid contains the value of one particular subject on one particular variable. Taddesse Kassahun Data Analysis using SPSS 62 / 135
67. 67. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement The Variable View Allows the types of variables to be speciﬁed & viewed. Row is a variable and column is an attribute associated with that variable There are 11 characteristics to be speciﬁed in the Variable View. Taddesse Kassahun Data Analysis using SPSS 63 / 135
68. 68. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement The Variable View 1 Name: the chosen variable name. It must begin with a letter and cannot end with a period. It is inadvisable to end variable names with an underscore. Blanks and special characters (!, ?,” -,& and *) cannot be used. Each variable name must beunique, no duplications. Reserved words:ALL, AND, BY, EQ, GE, GT, LE, LT, NE, NOT, OT, TO, WITH cannot be used as variable names. Taddesse Kassahun Data Analysis using SPSS 64 / 135
69. 69. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement The Variable View 2 Type speciﬁes type of data such as numeric, dates, currencies, etc. for each variable, If you click on the Type column, the variable type sub dialog box appears. Taddesse Kassahun Data Analysis using SPSS 65 / 135
70. 70. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement The Variable View Numeric: the data type is number, e.g., 20, 30.99, etc. Comma: Numeric values with a comma used as the grouping separator and a period used as decimal indicator. For example, 1,234.56. DOT: Numeric values with a period used as the grouping separator and a comma used as the decimal indicator. For example, 1.234,56. Scientiﬁc notation: A numeric variable whose values are displayed with an embedded E and a signed power-of-10 exponent., e.g., 145 as 1.45E+002 Date: Values are displayed in one of several calendar − date or clock − time formats. Taddesse Kassahun Data Analysis using SPSS 66 / 135
71. 71. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement The Variable View Date: A numeric variable whose values are displayed in one of several calendar − date or clock − time formats. The century range for two-digit year values (e.g., 1916 - 16 and 2016-16 )can be customized. Choose Options from the Edit menu and then click the Data tab. Taddesse Kassahun Data Analysis using SPSS 67 / 135
72. 72. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement The Variable View Dollar: A variable displayed with a leading dollar sign (\$) Custom currency: A numeric variable whose values are displayed in one of the custom currency formats that you deﬁned on the Currency tab of the Options dialog box. String: A variable whose values are not numeric and therefore are not used in calculations. Taddesse Kassahun Data Analysis using SPSS 68 / 135
73. 73. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement The Variable View 3 Width: Number of character places. The default is eight. 4 Decimals: number of decimals in case of Numeric type. 5 Label: descriptive name of a variable, e.g., a variable name can be OX and the corresponding label can be Number of oxen owned by a farmer. These descriptive labels are displayed in output. 6 Values: these are labels attached to category codes. E.g., {Female = 0; Male = 1} To assign a label: Taddesse Kassahun Data Analysis using SPSS 69 / 135
74. 74. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement The Variable View enter the value in the value text box, enter the label in the label text box, then click on Add Repeat these steps until you deﬁne labels for all values Finally, click on OK. Taddesse Kassahun Data Analysis using SPSS 70 / 135
75. 75. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement The Variable View 7 Missing: deﬁnes speciﬁed data values as user-missing For example, to distinguish between data that are missing because a respondent refused to answer and data missing because the question didn’t apply to that respondent. Data values that are speciﬁed as user-missing are ﬂagged for special treatment and are excluded from most calculations Steps to Deﬁne Missing Values Select the cell which is the intersection of missing value attribute and the variable you need Click the wizard button Taddesse Kassahun Data Analysis using SPSS 71 / 135
76. 76. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement The Variable View The Missing Values dialog box will be opened. you can set up to 3 discrete missing values Or range plus one discrete missing value. Finally click on ok Taddesse Kassahun Data Analysis using SPSS 72 / 135
77. 77. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Variable View 8 Columns: the width of the variable column in Data View. When the Width value is larger than the Columns value, only part of the data entry will be seen in the Data View. 9 Align: to make alignment of variable entries (Left, Right or Center). 10 Measure: assigns measurement scale of variables. The default depends on data type (Scale or Nominal) 11 Role: to assign the role a variable plays. Input - variable will be used as an input (e.g., predictor or independent variable). Taddesse Kassahun Data Analysis using SPSS 73 / 135
78. 78. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Variable View Target - The variable will be used as an output or target (e.g., dependent variable). Both - variable will be used as both input and output. None - variable has no role assignment. Partition - variable used to partition the data into separate samples for training, testing, & validation. Split - Included for round-trip compatibility with SPSS Modeler. Taddesse Kassahun Data Analysis using SPSS 74 / 135
79. 79. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Taddesse Kassahun Data Analysis using SPSS 75 / 135
80. 80. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Entering Data Data can be either entered directly or imported from a number of sources, such as spreadsheet or text ﬁle. Entering data directly: The data editor window looks like a spreadsheet where numbers and text can be entered directly. Open a new data sheet (Ctrl+n or File > New > Data) Try entering some number in the ﬁrst column. Type what you want in each cell, and hit the Enter key. Now try to put some text into the same column and press Enter. The text disappears since SPSS has identiﬁed the column as numeric. Taddesse Kassahun Data Analysis using SPSS 76 / 135
81. 81. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Entering Data Put some text in the next column. Try entering numbers in this column. You can, but you will not be able to do any calculations. Your new variables have been given the names VAR00001 and VAR00002. You can change the names by clicking on Variable View tab. Taddesse Kassahun Data Analysis using SPSS 77 / 135
82. 82. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Taddesse Kassahun Data Analysis using SPSS 78 / 135
83. 83. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Importing Data To import an existing SPSS data data ﬁle, use File > Open > Data. To import data from other formats: File > Open > Data > Click on the ’Files of type’ dialogbox and choose the format (Excel, DB, Text, etc). You can also simply copy and paste the data cells from Excel into SPSS but you will have to label the columns. Check whether text ﬁles may be delimited or not, excel ﬁles may have so many sheets, whether the ﬁrst row may or may not contain variable names. Taddesse Kassahun Data Analysis using SPSS 79 / 135
84. 84. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Importing Data Importing Data from Excel Files: File > Open > Data > Choose Excel as ﬁle type Select the ﬁle you want to import. Then click Open Taddesse Kassahun Data Analysis using SPSS 80 / 135
85. 85. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Importing Data If there are initial empty rows/columns in the spreadsheet, deﬁne the cells where the data are stored in the Range: ﬁeld e.g., A1:B8 (see ﬁg. below) Import Data from CVS File: CVS is a comma-separated values ﬁle. File >Open >Data > Choose All ﬁles as ﬁle type Select the ﬁle you want to import. Then click Open Taddesse Kassahun Data Analysis using SPSS 81 / 135
86. 86. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Importing Data Text Import Wizard shown below will be open. Taddesse Kassahun Data Analysis using SPSS 82 / 135
87. 87. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Importing Data Taddesse Kassahun Data Analysis using SPSS 83 / 135
88. 88. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Importing Data Taddesse Kassahun Data Analysis using SPSS 84 / 135
89. 89. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Importing Data After importing data, it is good practice to clean data (check any inconsistencies). Use Analyze > Descriptive Statistics > Frequencies for ech column. Check the outputs to see if you have variables with wrong and/ or missing values. Taddesse Kassahun Data Analysis using SPSS 85 / 135
90. 90. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Saving Data To save data as SPSS ﬁle: Click on File> Save As > Type in a suitable ﬁle name. The Save as type should be set at SPSS (*.sav) option. SPSS data can be saved as Excel ﬁle. File> Save As > choose Excel from Save as type Taddesse Kassahun Data Analysis using SPSS 86 / 135
91. 91. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Printing a Dataset Highlight the data that will be printed. To print all of the data, ignore this step and continue to step 2. Select Print from the File menu. The Print dialog box opens. Change the options where appropriate. Click OK. Taddesse Kassahun Data Analysis using SPSS 87 / 135
92. 92. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Deleting Entries To delete an entry for a cell, click on the cell and press Delete. Complete rows and columns can be deleted by clicking on the grey cell at the top or left hand side and pressing the Delete key on the keyboard. Taddesse Kassahun Data Analysis using SPSS 88 / 135
93. 93. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Deleting Variables Click on the top of the column which contains the variable to be deleted. Hit the Delete key on the keyboard or. Right click at the top of the column which contains the variable to be deleted and choose clear. Taddesse Kassahun Data Analysis using SPSS 89 / 135
94. 94. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Insert a Case (a row) Click at the side of the row below where you want the new row to appear. Use the insert cases icon or Right click at the side of the row below where you want the new row to appear and use Insert Cases. Or use Edit > Insert Cases Taddesse Kassahun Data Analysis using SPSS 90 / 135
95. 95. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Insert a Variable (a column) Click on the top of the column to the right of where you want the new column to appear. Use the insert variables icon or Right click at the top of the column to the right of where you want the new column to appear and use Insert Variable. Or use Edit > Insert Taddesse Kassahun Data Analysis using SPSS 91 / 135
96. 96. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Sorting Cases SPSS can sort data by, e.g., respondents’ date of birth. Go to Data > click on Sort cases. In the dialog box highlight the respondent’s bdate Click on the arrow to transfer it to the Sort by box. Click on OK Taddesse Kassahun Data Analysis using SPSS 92 / 135
97. 97. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Sorting Variables SPSS can sort variables based on the 11 attributes, i.e, Name, Type, etc. Go to Data > click on Sort Variables. In the dialog box choose the attribute. Click on OK Taddesse Kassahun Data Analysis using SPSS 93 / 135
98. 98. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Splitting File For analysis purpose, data may be split into separate groups based on the values of one or more grouping variables. Go to Data > Split File > Click on Compare Groups. Specify the grouping variable, e.g., gender > click on OK. Taddesse Kassahun Data Analysis using SPSS 94 / 135
99. 99. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Merging Files Two possibilities for merging: 1 Merge ﬁles containing the same variables but diﬀerent cases (Add Cases) 2 Merge ﬁles containing the same cases but diﬀerent variables (Add Variables) We use broadband 1 and broadband 2 datasets for illustration. Taddesse Kassahun Data Analysis using SPSS 95 / 135
100. 100. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Merging Files (Add Cases) Have one of the ﬁles open, and you can add cases to it from an external SPSS ﬁle. Go to Data > merge ﬁles > Add Cases.... Unpaired variables : variables to be excluded from the new merged data ﬁle. Taddesse Kassahun Data Analysis using SPSS 96 / 135
101. 101. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Merging Files (Add Cases) Variable from the working data ﬁle are identiﬁed with an asterisk (*). Variables from the external data ﬁle are identiﬁed with a plus sign (+). Variables in New Active Dataset: variables to be included in the new merged data ﬁle. By default, all the variable that match both name and data type are included on the list. If the same information is recorded under diﬀerent variable names in the two ﬁles, you can create a pair from the unpaired variable list. Select the two variables and click on Pair. Taddesse Kassahun Data Analysis using SPSS 97 / 135
102. 102. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Merging Files (Add Variables) Have one of the ﬁles open. Sort both data ﬁles using the key variable (existing in both). Data > Merge Files > Add Variables... Choose the other dataset from the dialog box and click on Continue. Click on Matched cases on key variable ... Transfer the key variableTaddesse Kassahun Data Analysis using SPSS 98 / 135
103. 103. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Weighing Cases Weight Cases gives cases diﬀerent weights for statistical analysis. The values of the weighing variable should indicate the number of observations represented by single cases in the data ﬁle. Cases with zero, negative, or missing values for the weighing variable are excluded from analysis Consider No.ofcustomers as weight for the following. Taddesse Kassahun Data Analysis using SPSS 99 / 135
104. 104. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Weight Cases We need to ﬁrst create the three variables. To weight cases:Data > Weight Cases > Weight cases by Select the weighting variable and send to Frequency Variable: box > Click on OK. Once a weight is applied, it remains in eﬀect until another weight variable is selected or weighting is turned oﬀ. Taddesse Kassahun Data Analysis using SPSS 100 / 135
105. 105. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Selecting Cases In some situations, we may want to look at part of a dataset only, for example, the data for females or males only. There are three options for selecting cases. If condition is satisﬁed Random sample of cases Based on time or case range Taddesse Kassahun Data Analysis using SPSS 101 / 135
106. 106. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Selecting Cases: If condition is satisﬁed Click Data > Select Cases > If condition is satisﬁed > click on the if... button > Click on the arrow button to move the variable. Click on the mathematical sign (<, >, +, ∗, etc.) from the keyboard displayed on the screen. Taddesse Kassahun Data Analysis using SPSS 102 / 135
107. 107. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Selecting Cases: Random sample of cases To select the actual sample: use Data> Select cases>random sample of cases Click on the Sample button. Fill out the dialog box appropriately. Taddesse Kassahun Data Analysis using SPSS 103 / 135
108. 108. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Selecting Cases: Based on time or case range Case ranges are based on row number as displayed in the Data Editor. Date and time ranges are available only for time series data with deﬁned date variables. To deﬁne date: Data > Deﬁne Dates > Choose the date format Use Data>Select cases based on time or case range > Range... Taddesse Kassahun Data Analysis using SPSS 104 / 135
109. 109. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Taddesse Kassahun Data Analysis using SPSS 105 / 135
110. 110. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Aggregating Data Aggregate Data combines groups of cases into single summary cases and creates a new aggregated data ﬁle. Cases are aggregated based on the value of one or more grouping variables. The new data ﬁle contains one case for each group. One may aggregate country data by region and create a new data ﬁle in which region is the unit of analysis. Taddesse Kassahun Data Analysis using SPSS 106 / 135
111. 111. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Aggregating Data Data > Aggregate... Select a break variable that deﬁnes how cases are grouped to create aggregated data. Select aggregate variables to include in the new data ﬁle. Select an aggregate function for each aggregate variable (MEAN is the default). Taddesse Kassahun Data Analysis using SPSS 107 / 135
112. 112. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Multiple Response Sets If a survey question can be answered multiple valid times, such as questions which note ”Check all that apply” Multiple variables are necessary to capture all the responses. Multiple responses are used in telecom industries, e.g. multiple lines, voice mail, paging, internet, call waiting, call forwarding, electronic billing, etc. Many customers use more than one service. Taddesse Kassahun Data Analysis using SPSS 108 / 135
113. 113. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Multiple Response Sets The variables of a multiple response set are coded as dichotomies or categories. Dichotomy: A separate variable is created for each of the valid responses (alternatives) to the question (Y/N, present/absent, etc.). Category : A separate variable is created for each response (from a number of possible responses) given by the survey taker. Example. Which of the following services do you subscribe? Upto 3 possible answers Voice mail, paging, internet, call waiting, call forwarding, electronic billing. Taddesse Kassahun Data Analysis using SPSS 109 / 135
114. 114. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Example . . . Dichotomy: SIX variables, i.e., voice, pager, Internet, callwait, forward, ebill with values Yes/No are created. Taddesse Kassahun Data Analysis using SPSS 110 / 135
115. 115. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Example . . . Category: THREE variables, i.e., Response1, Response2, Response3 with values voice, pager, Internet, callwait, forward, ebill are created. Taddesse Kassahun Data Analysis using SPSS 111 / 135
116. 116. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Multiple Response Sets To deﬁne a multiple response set (dichotomy): Data > Deﬁne Multiple Response Sets... Select two or more variables. Type 1[for YES] as the counted value of the dichotomies Enter a unique name for each multiple response set. Taddesse Kassahun Data Analysis using SPSS 112 / 135
117. 117. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Multiple Response Sets Click Add to add the multiple response set to the list of deﬁned sets. Use Analyze > Tables > Custom Tables The name of multiple response set should appear at the bottom of the Table dialogue box. Place it in the row and click on OK. Taddesse Kassahun Data Analysis using SPSS 113 / 135
118. 118. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Multiple Response Sets It is also possible to create multiple response sets using the Analyze Menu. Use Analyze > Tables > Multiple Response Sets ... Use Analyze > Multiple Response > Deﬁne Variable Sets ... Taddesse Kassahun Data Analysis using SPSS 114 / 135
119. 119. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Duplicate Cases One of the data entry errors is entering the same case more than once. This can be identiﬁed using: Data > Identify Duplicate Cases > Move the variable(s) to Deﬁne matching cases by: box You will have a new variable which identiﬁes a duplicate. Taddesse Kassahun Data Analysis using SPSS 115 / 135
120. 120. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Unusual Cases There is a possi- bility of having one or more extremely large or small value in a given dataset. This can be identiﬁed using: Data > Identify Unusal Cases > Move the variable(s) to the Analysis Variables: box You can also add a Case Identiﬁer Variable: Taddesse Kassahun Data Analysis using SPSS 116 / 135
121. 121. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Validation Rule A rule is used to determine whether a case is valid. There are two types of validation rules: Single-variable rules consist of a ﬁxed set of checks that apply to a single variable, such as checks for out-of-range values. Cross-variable rules are user-deﬁned rules that can be applied to a single variable or a combination of variables. Cross-variable rules are deﬁned by a logical expression that ﬂags invalid values. Taddesse Kassahun Data Analysis using SPSS 117 / 135
122. 122. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Validation Rule You can quickly obtain a set of ready-to-use validation rules by loading predeﬁned rules from an external data ﬁle included in the installation. To deﬁne validation rules Data > Validation > Deﬁne Rules... Select Single-Variable Rules> enter the minimum and maximum values > Click on OK.Taddesse Kassahun Data Analysis using SPSS 118 / 135
123. 123. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Validation Rule To validate the rule, Data > Validation > Validate Data... Move the variable to the Analysis Variables:box > Click on Single-Variable Rules tab > Check on the box Apply under the Rules: box > Click on OK. Taddesse Kassahun Data Analysis using SPSS 119 / 135
124. 124. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Validation Rule To deﬁne validation rules Data > Validation > Deﬁne Rules... Select Cross-Variable Rules> enter the rules > Click on New to add more.> Click on OK. To validate the rule, Data > Validation > Validate Data...> Cross-Variable Rules > Click on OK. Taddesse Kassahun Data Analysis using SPSS 120 / 135
125. 125. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Compute Variable It is possible to derive a new variable from existing variables, e.g. by summing the values for several variables together. It is also possible to change an existing variable, e.g. by multiplying it by another variable. Use Transform > Compute Variable In the Target Variable: box enter the name of the new variable to be created. As an example, in the data health funding.sav, let us create a variable which increases the funding by 10%. Taddesse Kassahun Data Analysis using SPSS 121 / 135
126. 126. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Compute Variable Taddesse Kassahun Data Analysis using SPSS 122 / 135
127. 127. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Counting Occurrence It is possible to count the occurrences of the same value(s) in a list of variables for each case. For example, You might have a satisfaction survey that contains a lot of questions with the same scale. You can use this procedure to calculate the total number of Very Satisﬁed responses a respondent gave across a number of questions Another example, a survey might contain a list of magazines with yes/no check boxes to indicate which magazines each respondent reads. The number of yes/no responses for each respondent can be counted to create a new variable containing total number of magazines read. Taddesse Kassahun Data Analysis using SPSS 123 / 135
128. 128. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Counting Occurrence Use: Transform > Count Values within Cases... Enter a target variable name. Select two or more variables of the same type. Click on Deﬁne Values and specify which value(s) should be counted. Other options include specifying a range of values or specifying missing values as the values to count. Taddesse Kassahun Data Analysis using SPSS 124 / 135
129. 129. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Counting As an example, let us count the number of Yes (Y’s) Magazine1 Magazine2 Magazine3 Y N Y Y N N Y Y N N Y N N Y N Taddesse Kassahun Data Analysis using SPSS 125 / 135
130. 130. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Recoding Two options available for recoding variables. Recoding into same variable. Creating a new variable. Recoding into ame variable Use: Transform > Recode into Same Variables, move the variable to be recoded into Variables box. Click on Old and New Values. Write the values and click on the add button. Taddesse Kassahun Data Analysis using SPSS 126 / 135
131. 131. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Recoding into Diﬀerent Variables Use Transform > Recode into Diﬀerent Variables Move the variable to be recoded to Variables box Write the name of the output variable and its label. Type old and new values. Click on the add button. Check the option Output variables are strings if you would like to have a string variable. Click on continue Taddesse Kassahun Data Analysis using SPSS 127 / 135
132. 132. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Automatic Recoding Automatic Recode converts string and numeric values into consecutive integers. String values are recoded in strict alphabetical order, with upper-case letters preceding lower-case. The resulting new variable retains the variable and value labels of the old variable from which it has been recoded. Transform > Automatic Taddesse Kassahun Data Analysis using SPSS 128 / 135
133. 133. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Visual Binning Taking two or more values and grouping them into the same category. Create a categorical variable from a scale variable. Combine several response categories into a single category. Use Transform > Visual binning. Select the variables to bin Click on continue and write the binned variable name.Taddesse Kassahun Data Analysis using SPSS 129 / 135
134. 134. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Optimal Binning Optimal binning is useful to optimally bin scale variables without specifying class number or width. The possible number of classes and width are determined by optimizing the scale variable with respect to a nominal variable. Use Transform > Optimal Binning Taddesse Kassahun Data Analysis using SPSS 130 / 135
135. 135. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Ranking Cases Cases can be ranked using Transform > Rank Cases... Move the variable into Variable(s) box Move a variable to the option By if more than one variable are going to be considered. Taddesse Kassahun Data Analysis using SPSS 131 / 135
136. 136. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Date and Time Wizard This wizard simpliﬁes a number of common tasks associated with date and time variables. Calculating the diﬀerence between two dates. Extracting part of a date or time variable Unit to be extracted can be: year, months, weeks, days, hours, minute, second, etc. Use Transform > Date and time wizard Taddesse Kassahun Data Analysis using SPSS 132 / 135
137. 137. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Date and Time Wizard Example. Calculate the number of days between Bdate and Cdate. Bdate CDate 25.10.1900 03.06.1915 11.05.1902 11.05.1917 09.06.1986 10.09.2005 11.07.1991 15.11.2012 Taddesse Kassahun Data Analysis using SPSS 133 / 135
138. 138. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Replace Missing Values Creates new variables from existing ones by replacing missing values with estimates computed with one of several methods. Use Transform > Replace Missing Values... Select the variable that contains missing value. Chose one method of estimation (e.g., mean of nearby points) and click on OK Taddesse Kassahun Data Analysis using SPSS 134 / 135
139. 139. Basics of Research and Statistics Introduction to SPSS Data Management using SPSS Entering Data into SPSS Importing and Saving Data in SPSS Editing Data Data Rearrangement Taddesse Kassahun Data Analysis using SPSS 135 / 135