Your midterm exam will consist of two parts, a practical SPSS application test which will be worth 40% of the test grade, and a multiple choice part which will be worth 60%. The practical SPSS test will consist of three statistical applications: a t-test, a chi-square analysis, and a univariate analysis of variance, worth ten, twelve, and eighteen points, respectively.
The multiple choice part of the test will consist of 40 questions drawn from this list. They will be worth 1.5 points each. The questions come from the class slides and the two required texts.
- Values used to make inferences about the characteristics of the population from which they were drawn, including the variation of the sample characteristics from corresponding population parameters are called
- Descriptive statistics
- Inferential statistics
- Goodness-of-fit measures
- Population parameters
- A list of all the registered Republican voters in Los Angeles Country is an example of a
- Population parameter
- Independent variable
- Macro-level construct
- Sampling frame
- Quasi-experimental studies differ from experimental studies in that in the quasi-experimental study the experimenter
- Has control over assignment of cases to conditions of the independent variable
- Has control over assignment of cases to conditions of the dependent variable
- Has control over when and where the dependent variable can be measured
- Has no control over either dependent variable or independent variable
- Which type of research relies on random assignment?
- Experimental
- Quasi-experimental
- Naturalistic
- Correlational
- Which of the following could describe “reliability” in the context of quantitative research?
- Test-retest correlation
- Internal consistency of items on a test
- Correlation with other measures of the same thing
- A and B above
- B and C above
- A and C above
- None of the above
- Which of the following is an example of cluster sampling?
- Randomly drawing a name out of a hat
- Drawing randomly from randomly drawn lists of neighborhoods or zip codes
- Sampling from a list which has proportional representation of ethnic groups or other demographic characteristics in proportion to their numbers in the population
- None of the above
- Identify the independent variable in the following hypothesis: the impact of gender on job category is thought to be mediated by educational attainment
- Gender
- Job category
- Educational attainment
- None of the above
- Identify the dependent variable in the following hypothesis: the impact of gender on job category is mediated by educational attainment
- Gender
- Job category
- Educational attainment
- None of the above
- Identify the control variable in the following hypothesis: the impact of gender on job category is mediated by educational attainment
- Gender
- Job category
- Educational attainment
- None of the above
- Which of the following is an example of a categorical variable?
- Type of communications medium
- Political attitudes
- Income
- None of the above
- Which of the following is an example of a continuous variable?
- Number of hours spent watching TV every day
- Gender
- Ethnicity
- All of the above
- Which of the following is an example of interval level measurement?
- IQ test
- Gender
- Weight
- Political Affiliation
- Which of the following is an example of ratio level measurement?
- Political affiliation
- Hours spent surfing the Net
- Preference among brands of cereal
- All of the above
- Which of the following is an example of nominal level measurement?
- Ethnicity
- Rankings in a beauty contest
- Temperature
- Income in dollars
- A summary which gives an account of how often answers in each category of responses to a question occur within a sample is called a
- Dendogram
- Normal distribution
- Kurtosis
- Frequency distribution
- In the picture below, how would the summary of the data represented in the picture be described?
- A homogeneous distribution
- A heterogenous distribution
- A normal distribution
- None of the above
- A distribution in which most of the cases occur at the low end of the scale is called
- Negatively skewed
- Positively skewed
- Leptokurtic
- Platykurtic
- A distribution in which most of the cases are peaked around the mean is called
- Negatively skewed
- Positively skewed
- Leptokurtic
- Platykurtic
- Which of the measures below does this describe: Measure which is most stable for random samples, which makes it suitable for making estimates about populations from samples. It has the property that the sum of the deviations of the raw scores from it equals zero.
- Standard deviation
- Mode
- Mean
- Median
- Which of the measures below does this describe: the response value for which there are an equal number of responses both below and above it (e.g., larger or smaller). Used with ordinal or numerical variables
- Standard deviation
- Mode
- Mean
- Median
- Which of the measures below does this describe: the most frequently selected (commonly occurring) response category. The only measure for nominal level variables but can be used with scaled data.
- Standard deviation
- Mode
- Mean
- Median
- What is the effect of an “outlier” on the mean?
- Makes it more different from the median
- Makes it more similar to the median
- Keeps the distribution from being used for inferential statistics
- None of the above
- What is the property of a sample described here: the degree or amount of variability in a set of responses to a quantitative measure such as a questionnaire item
- The significance
- The mean
- Dispersion
- Kurtosis
- The index of qualitative variation would be used for what kind of data?
- Nominal
- Ordinal
- Ratio
- Interval
- What is the following a definition of? the sum of the squared deviations from the sample mean, divided by N-1 where N is the number of cases.
- The mean
- The harmonic mean
- The dispersion
- The variance
- What is the square root of the property described in 25 called?
- The standard deviation
- The interquartile point
- The range
- The variance
- The mean, median and mode of responses all coincide in a
- Platykurtic distribution
- Normal distribution
- Skewed distribution
- None of the above
- What is the measure that has the following properties: it allows you to make comparisons between samples with respect to their variability (how much a respondent from the sample typically departs from the mean). Its size is generally about one-sixth the size of the value of the range
- The standard deviation
- The interquartile point
- The harmonic mean
- The variance
- What is this a definition of? deviation of a raw score from the mean in standard deviation units
- A standard score
- A z score
- Both A and B
- Neither A nor B
- In a normal distribution, what percentage of scores fall above the mean?
- 68%
- 34%
- 50%
- 95%
- About what percent of cases fall within 2 SDs of the mean?
- 68%
- 34%
- 50%
- 95%
- According to the Central Limit Theorem, the larger the sample size, the greater the probability that the obtained sample mean will ______the population mean
- Depart from
- Be the opposite of
- Be the standard deviation of
- Approximate
- In the normal table (“Area under the Normal Curve” ) you look up a Z score of 2.2 and to the right of that you find the “area between the mean and Z” is 48.61. Thus 48.61% of the cases in the normal distribution lie between the mean and Z=2.2. What proportion of cases lie below this?
- 34.39%
- 1.39%
- 98.61%
- 50. 61%
- What proportion of cases lie above this?
- 34.39%
- 1.39%
- 98.61%
- 50. 61%
- The standard deviation of the sampling distribution of sample means is called the
- Standard error of the mean
- Variability of the mean
- Alpha coefficient
- Variance
- What is the term for how much statistics can be expected to deviate from parameters when sampling randomly from the population
- Standard error of the mean
- Variability of the mean
- Alpha coefficient
- Variance
- What is the relationship between sample size and the property described in (36) above?
- It gets greater with increased sample size
- It gets greater as the square of sample size increases
- It gets smaller as the inverse of sample size increases
- It gets smaller with increased sample size
- Suppose we had the following data: 5,6,7,8, 9. and we calculated their mean as 35/5 = 7. What are the degrees of freedom in computing the mean?
- 5
- 4
- 3
- 1
- Suppose we wanted to construct a confidence interval around the mean of 2969.56 such that we can have 95% confidence that the population mean for the variable “vehicle weight” will fall within this range. To obtain this confidence interval, we would need to take the mean and add to it the quantity (1.96 times the standard error). What does the number 1.96 represent in this case?
- A constant which would apply to all such calculations
- The beta weight associated with the variable
- The risk area under the normal curve corresponding to 5%
- The difference between the mean weight and the next higher weight
- A ______variable is one on which each case is coded for either presence or absence of the attribute. For example, we could recode the ethnicity data into the ______variable “whiteness” or “Chinese-ness” so that every case would have either a 1 or a zero on the variable. All of the white (or Chinese) respondents would get a 1 and the others would get a zero on the variable. This kind of variable is called a
- Dummy variable
- Control variable
- Composite variable
- Mediator variable
41. Which of the following is an example of a null hypothesis?
a. There is no difference between males and females in attitudes toward voting
b. Males and females differ in their attitudes towards voting
c. Males tend to vote more often than females
d. The relationship between sex and voting is unknown
42. Statistical significance is
a. an indicator of the importance of the relationship between two variables
b. an indicator of the probability of a test statistic being the result of chance alone.
c. a sign that your sample was drawn randomly
d. a sign that you have eliminated random error
43. A student calculated a Chi square statistic with a significance level of .01 for a table relating gender to voting behavior for those 40-50 years old. She calculated a Chi square statistic with a significance level of .25 for the same two variables for those 20-30 years old. Which of the following best summarizes the findings?
a. She can be more sure that the obtained relationship between gender and voting behavior among those 40-50 years old was not just a chance result than the obtained relationship for those 20-30 years old.
b. She can be relatively sure that the relationship between gender and voting behavior is strong
c. The relationship between gender and voting behavior is stronger for those 40-50 years old than for those 20-30 years old
d. The level of association between gender and voting behavior is quite low for both groups.
44. We are conducting an empirical study that examines the relationship of communication frequency and relationship stability in couples married for at least 3 years. Which is the independent variable? Which is the dependent variable?
a. independent variable: relationship stability
dependent variable: communication frequency
b. independent variable: length of marriage
dependent variable: relationship stability
c. independent variable: communication frequency
dependent variable: length of marriage
d. can’t tell from the information given
- What kinds of question should we not explore through a contingency table?
- What are the differences in job classification attributable to gender?
- What is the impact of gender on socio-economic status?
- How does temperature affect annual rainfall?
- How does job classification affect preference for news source?
- In a chi-square analysis the principal interest is in comparing the obtained frequencies to the
- Column frequencies
- Row marginals
- Obtained marginals
- Expected frequencies
- Consider the contingency table below: What appears to be the relationship between educational attainment and employment category?
- Educational attainment has an impact on job category only for elementary school graduates
- Educational attainment is associated with employment category.
- Only clerical work is affected by educational attainment.
- None of the above.
- Lambda is a
- Measure of association for interval level variables
- Reliability coefficient
- Measure of proportional reduction of error for contingency tables
- Measure only used for ordinal level variables
- Lambda is sometimes flawed as a measure because it
- Ranges from -1 to +1
- Doesn’t test significance
- Comes out to zero quite a lot
- Is too hard to calculate
- An alternative to lambda which SPSS reports is
- Alpha
- Tau
- Chi-square
- Gamma
- One of the things we can do in a cross-tabulation analysis is to look for the effect of a control variable on the relationship between an independent and dependent variable. In the table below, which is the control variable?
- gender
- educational attainment
- employment category
- status as a manager
- Which of the following is not a step in statistical hypothesis testing?
- Specify the research hypothesis and corresponding null hypothesis
- Compute the value of a test statistic about the relationship between the two hypotheses
- Calculate the DF and look up the statistic in the appropriate distribution to see if it falls into the critical region
- If the result is not significant, move the critical region lower until you reach significance
- The null hypothesis with respect to the relationship between two variables is that
- The population and the sample means are different
- The variables are independent of one another
- There is no way to determine if the relationship is significant
- The two variables are related to one another
- Which of the following statements about chi-square is correct?
- Chi-square ranges between -1 and 1
- Chi-square can be interpreted as an index of the proportional reduction of error
- Chi-square is a measure of the statistical independence of two variables
- The larger the value of chi-square for a constant value of DF, the less the dependence of the two variables (the weaker their association)
- Which of the following is a way of controlling for the influence of extraneous variables in an experiment?
- The case-control method
- Randomization
- Large numbers of subjects
- Test-retest reliability
- The ability to show that the causal impact of an independent variable on a dependent variable is legitimate and not attributable to other extraneous and uncontrolled variables is called
- Construct validity
- Statistical conclusion validity
- External validity
- Internal validity
- A “manipulation check is” a way of ensuring adequate
- Construct validity
- Statistical conclusion validity
- External validity
- Internal validity
- Features of an experimental setting or questionnaire which induce people to behave in an artificial way are called
- Demand characteristics
- Debriefing
- Experimental attrition
- Normative role decay
- Which of the following is an example of method variance?
- Changing the gender of the experimenter when gender is not a variable in the study
- Using paper and pencil measures on one occasion and an interview on another
- Using different incentives for different subjects when incentive is not a variable
- All of the above
- None of the above
- Which of the following types of scale does this describe: an object of judgment is evaluated against a set of rating scales (usually five to seven steps) with bi-polar adjectives at either end, such as good-bad or friendly-unfriendly
- Likert scale
- Guttman scale
- Rausch scale
- Semantic differential scale
- Which of the following types of scales does this describe: people are asked to indicate if they strongly agree, agree, are neutral, disagree, or strongly disagree with declarative statements about a topic.
- Likert scale
- Rausch scale
- Semantic differential scale
- None of the above
- Which of the following might be a problem with the way that people fill out questionnaires?
- They prefer odd numbers to even ones
- The have a tendency to disagree with statements rather than agree with them
- They give their best responses toward the end of the questionnaire
- They may reject items with “always” and “never”
- Underlying the t statistic is a sampling distribution of
- Sample means
- Population means
- Differences of sample means
- Standard error of sample means
- The ttest is for comparing
- The difference of means for two independent groups
- The difference of means for dependent samples
- The difference of a sample mean and a population mean
- All of the above
- None of the above
- What is the DF of a t test comparing two independent groups where one group has a N of 20 and the other an N of 38?
- 18
- 58
- 56
- 760
- In conducting a t-test, the researcher can do a one-tailed or a two-tailed test. Under what circumstances would a one-tailed test be conducted?
- When the researcher hoped to have a bigger critical area for getting significance
- When the researcher had predicted the direction of the mean differences
- When the sample was not normal with respect to the underlying distribution
- None of the above
- If I do a two-tailed test of my hypotheses and set the confidence level to .05, what area under the normal curve does my obtained value of t have to fall in to obtain significance?
- The upper 5% of either end of the distribution
- The upper 2.5% of either end of the distribution
- The upper ten percent of either end of the distribution
- The upper 5% of whatever end I predict it’s going to fall in
- In certain cases, for example in “before and after” designs or when members of group A have been matched with members of group B on all salient characteristics except one, the variable of interest, an alternative formula for computing t is used. What is the main difference of this t from the t for independent samples?
- t is based on the departure of the difference scores from the mean difference score
- You use a different menu option in SPSS
- The scores in post-test groups are known to be higher
- All of the above
- We find out if the ______in two groups are equal before deciding on what sort of t-test we will perform
- Means
- Standard errors
- Medians
- Variances
70. An analysis of variance looks for the causal impact of a nominal level independent variable (factor) on
a. A nominal variable
b. An ordinal variable
c. An interval or ratio level variable