Frequency distribution, cross-tabulation, and hypothesis testing
- Open file
- Variable ViewVariable INT_FAMILIARMissing9
- AnalyzeDescriptive StatisticsFrequenciesINT_FAMILIARcheck all boxes
- TransformRecodeInto Different variables(INT_FAMILIAR -> INTnew)Old and New ValuesRange 1 through 5 -->1 and Range 6 through 7-->2
- AnalyzeDescriptive StatisticsCrosstabsGENDERColumns(s) – because GENDER is independent; INTnewRows – because INTnew is dependent(s)Statistics (check Nominal only)
- Test with X2 (only on counts data, not %, good for nij>=5)
H0: There is no association between the two variables
- If the table is 2x2 or nij <=10 Continuity correction needed
- Phi (2x2)
- Contingency coefficient C (rxc). where r or c >2
- Cramer’s V (rxc), where r or c >2
- AnalyzeCompare MeansOne-Sample t-testINT_FAMILIARCI=95%, Test Value = 4.0
Note: There are two possible scenarios (in each of them, H0: μ = 4.0)
I) When the value of statistic t in the SPSS computer printout is greater than zero (t > 0) and your alternative hypothesis H1
a) H1: μ ≠ 4.0Then, p-value = sig. (the value "sig." is found in the the SPSS table)
b) H1: μ > 4.0Then, p-value = sig./2
c) H1: μ < 4.0Then, p-value = 1 - (sig./2)
II) When the value of statistic t in the SPSS computer printout is less than zero (t < 0)
andyour alternative hypothesis H1
a) H1: μ ≠ 4.0Then, p-value = sig. (the same as in Scenario I)
b) H1: μ > 4.0Then, p-value = 1 - (sig./2)
c) H1: μ < 4.0Then, p-value = sig./2.
- Two independent samples: AnalyzeCompare MeansIndependent-Samples t- Test
- Test variable: INT_USAGE
- Grouping variable: GENDERDefine Groups: Group 1: 1; Group 2: 2
- Note: F test rejects H0a: The two populations have equal variances, at the significance level of 0.05.
Why? From SPSS, Sig. = 0.000, which is less than alpha = 0.05 reject H0a (the two variances are statistically different at the significance level of 0.05 or less)
- Note: because F test rejects the aboveH0a, in order to test our main hypothesis H0: The two populations have equal means, at the significance level of 0.05, we use the case with “Equal variances not assumed”).
Again, in order to make a decision whether to reject H0 or not, we follow the procedure: From SPSS, Sig. 0.000, which is less than alpha = 0.05 reject H0. The means are statistically different at the significance level of 0.05 or less.
- Paired samples: AnalyzeCompare MeansPaired Samples t-test INT_ATTITUDE vs. TECHN_ATTITUDE
Sig. = 0.000, thus reject H0: The difference between the two means is zero at the confidence level of 0.000 (which is well below the typical 0.05)
- Kolmogorov-Smirnov (K-S): One sample test whether a distribution is normal.
- Exercise: test the distribution of INT_FAMILIAR
H0: The distribution of INT_FAMILIAR is normal
- AnalyzeNonparametric Tests1-Sample K-SINT_FAMILIARcheck the box NormalStatistics: DescriptivesOK
Since Sig. is 0.178, we cannot reject H0 at the significance level of 0.05. In order to reject H0, Sig. would have to be less than the significance level of 0.05.
Attention: If you cannot reject H0, never say: “Therefore, we accept H0”, although it sounds similar. Why? In this and all other statistical tests, we can control only for Type I Error = Reject H0 when, in fact, H0 is CORRECT. We do not control for Type II Error = Accept H0when, in fact, H0 is WRONG. Remember also that the Probability of Type I Error = The Significance Level alpha, e.g. 0.05, which means that when rejecting H0 100 times based on 100 samples from this population, 5 times we will makea mistake and reject a good H0.
1