Your Midterm Exam Will Consist of Two Parts, a Practical SPSS Application Test Which Will

Your midterm exam will consist of two parts, a practical SPSS application test which will be worth 40% of the test grade, and a multiple choice part which will be worth 60%. The practical SPSS test will consist of three statistical applications: a t-test, a chi-square analysis, and a univariate analysis of variance, worth ten, twelve, and eighteen points, respectively.

The multiple choice part of the test will consist of 40 questions drawn from this list. They will be worth 1.5 points each. The questions come from the class slides and the two required texts.

Values used to make inferences about the characteristics of the population from which they were drawn, including the variation of the sample characteristics from corresponding population parameters are called
Descriptive statistics
Inferential statistics
Goodness-of-fit measures
Population parameters
A list of all the registered Republican voters in Los Angeles Country is an example of a
Population parameter
Independent variable
Macro-level construct
Sampling frame
Quasi-experimental studies differ from experimental studies in that in the quasi-experimental study the experimenter
Has control over assignment of cases to conditions of the independent variable
Has control over assignment of cases to conditions of the dependent variable
Has control over when and where the dependent variable can be measured
Has no control over either dependent variable or independent variable
Which type of research relies on random assignment?
Experimental
Quasi-experimental
Naturalistic
Correlational
Which of the following could describe “reliability” in the context of quantitative research?
Test-retest correlation
Internal consistency of items on a test
Correlation with other measures of the same thing
A and B above
B and C above
A and C above
None of the above
Which of the following is an example of cluster sampling?
Randomly drawing a name out of a hat
Drawing randomly from randomly drawn lists of neighborhoods or zip codes
Sampling from a list which has proportional representation of ethnic groups or other demographic characteristics in proportion to their numbers in the population
None of the above
Identify the independent variable in the following hypothesis: the impact of gender on job category is thought to be mediated by educational attainment
Gender
Job category
Educational attainment
None of the above
Identify the dependent variable in the following hypothesis: the impact of gender on job category is mediated by educational attainment
Gender
Job category
Educational attainment
None of the above
Identify the control variable in the following hypothesis: the impact of gender on job category is mediated by educational attainment
Gender
Job category
Educational attainment
None of the above
Which of the following is an example of a categorical variable?
Type of communications medium
Political attitudes
Income
None of the above
Which of the following is an example of a continuous variable?
Number of hours spent watching TV every day
Gender
Ethnicity
All of the above
Which of the following is an example of interval level measurement?
IQ test
Gender
Weight
Political Affiliation
Which of the following is an example of ratio level measurement?
Political affiliation
Hours spent surfing the Net
Preference among brands of cereal
All of the above
Which of the following is an example of nominal level measurement?
Ethnicity
Rankings in a beauty contest
Temperature
Income in dollars
A summary which gives an account of how often answers in each category of responses to a question occur within a sample is called a
Dendogram
Normal distribution
Kurtosis
Frequency distribution
In the picture below, how would the summary of the data represented in the picture be described?
A homogeneous distribution
A heterogenous distribution
A normal distribution
None of the above

A distribution in which most of the cases occur at the low end of the scale is called
Negatively skewed
Positively skewed
Leptokurtic
Platykurtic
A distribution in which most of the cases are peaked around the mean is called
Negatively skewed
Positively skewed
Leptokurtic
Platykurtic
Which of the measures below does this describe: Measure which is most stable for random samples, which makes it suitable for making estimates about populations from samples. It has the property that the sum of the deviations of the raw scores from it equals zero.
Standard deviation
Mode
Mean
Median
Which of the measures below does this describe: the response value for which there are an equal number of responses both below and above it (e.g., larger or smaller). Used with ordinal or numerical variables
Standard deviation
Mode
Mean
Median
Which of the measures below does this describe: the most frequently selected (commonly occurring) response category. The only measure for nominal level variables but can be used with scaled data.
Standard deviation
Mode
Mean
Median
What is the effect of an “outlier” on the mean?
Makes it more different from the median
Makes it more similar to the median
Keeps the distribution from being used for inferential statistics
None of the above
What is the property of a sample described here: the degree or amount of variability in a set of responses to a quantitative measure such as a questionnaire item
The significance
The mean
Dispersion
Kurtosis
The index of qualitative variation would be used for what kind of data?
Nominal
Ordinal
Ratio
Interval
What is the following a definition of? the sum of the squared deviations from the sample mean, divided by N-1 where N is the number of cases.
The mean
The harmonic mean
The dispersion
The variance
What is the square root of the property described in 25 called?
The standard deviation
The interquartile point
The range
The variance
The mean, median and mode of responses all coincide in a
Platykurtic distribution
Normal distribution
Skewed distribution
None of the above
What is the measure that has the following properties: it allows you to make comparisons between samples with respect to their variability (how much a respondent from the sample typically departs from the mean). Its size is generally about one-sixth the size of the value of the range
The standard deviation
The interquartile point
The harmonic mean
The variance
What is this a definition of? deviation of a raw score from the mean in standard deviation units
A standard score
A z score
Both A and B
Neither A nor B
In a normal distribution, what percentage of scores fall above the mean?
68%
34%
50%
95%
About what percent of cases fall within 2 SDs of the mean?
68%
34%
50%
95%
According to the Central Limit Theorem, the larger the sample size, the greater the probability that the obtained sample mean will ______the population mean
Depart from
Be the opposite of
Be the standard deviation of
Approximate
In the normal table (“Area under the Normal Curve” ) you look up a Z score of 2.2 and to the right of that you find the “area between the mean and Z” is 48.61. Thus 48.61% of the cases in the normal distribution lie between the mean and Z=2.2. What proportion of cases lie below this?
34.39%
1.39%
98.61%
50. 61%
What proportion of cases lie above this?
34.39%
1.39%
98.61%
50. 61%
The standard deviation of the sampling distribution of sample means is called the
Standard error of the mean
Variability of the mean
Alpha coefficient
Variance
What is the term for how much statistics can be expected to deviate from parameters when sampling randomly from the population
Standard error of the mean
Variability of the mean
Alpha coefficient
Variance
What is the relationship between sample size and the property described in (36) above?
It gets greater with increased sample size
It gets greater as the square of sample size increases
It gets smaller as the inverse of sample size increases
It gets smaller with increased sample size
Suppose we had the following data: 5,6,7,8, 9. and we calculated their mean as 35/5 = 7. What are the degrees of freedom in computing the mean?
5
4
3
1
Suppose we wanted to construct a confidence interval around the mean of 2969.56 such that we can have 95% confidence that the population mean for the variable “vehicle weight” will fall within this range. To obtain this confidence interval, we would need to take the mean and add to it the quantity (1.96 times the standard error). What does the number 1.96 represent in this case?
A constant which would apply to all such calculations
The beta weight associated with the variable
The risk area under the normal curve corresponding to 5%
The difference between the mean weight and the next higher weight
A ______variable is one on which each case is coded for either presence or absence of the attribute. For example, we could recode the ethnicity data into the ______variable “whiteness” or “Chinese-ness” so that every case would have either a 1 or a zero on the variable. All of the white (or Chinese) respondents would get a 1 and the others would get a zero on the variable. This kind of variable is called a
Dummy variable
Control variable
Composite variable
Mediator variable

41. Which of the following is an example of a null hypothesis?

a. There is no difference between males and females in attitudes toward voting

b. Males and females differ in their attitudes towards voting

c. Males tend to vote more often than females

d. The relationship between sex and voting is unknown

42. Statistical significance is

a. an indicator of the importance of the relationship between two variables

b. an indicator of the probability of a test statistic being the result of chance alone.

c. a sign that your sample was drawn randomly

d. a sign that you have eliminated random error

43. A student calculated a Chi square statistic with a significance level of .01 for a table relating gender to voting behavior for those 40-50 years old. She calculated a Chi square statistic with a significance level of .25 for the same two variables for those 20-30 years old. Which of the following best summarizes the findings?

a. She can be more sure that the obtained relationship between gender and voting behavior among those 40-50 years old was not just a chance result than the obtained relationship for those 20-30 years old.

b. She can be relatively sure that the relationship between gender and voting behavior is strong

c. The relationship between gender and voting behavior is stronger for those 40-50 years old than for those 20-30 years old

d. The level of association between gender and voting behavior is quite low for both groups.

44. We are conducting an empirical study that examines the relationship of communication frequency and relationship stability in couples married for at least 3 years. Which is the independent variable? Which is the dependent variable?

a. independent variable: relationship stability

dependent variable: communication frequency

b. independent variable: length of marriage

dependent variable: relationship stability

c. independent variable: communication frequency

dependent variable: length of marriage

d. can’t tell from the information given

What kinds of question should we not explore through a contingency table?
What are the differences in job classification attributable to gender?
What is the impact of gender on socio-economic status?
How does temperature affect annual rainfall?
How does job classification affect preference for news source?
In a chi-square analysis the principal interest is in comparing the obtained frequencies to the
Column frequencies
Row marginals
Obtained marginals
Expected frequencies
Consider the contingency table below: What appears to be the relationship between educational attainment and employment category?

Educational attainment has an impact on job category only for elementary school graduates
Educational attainment is associated with employment category.
Only clerical work is affected by educational attainment.
None of the above.

Lambda is a
Measure of association for interval level variables
Reliability coefficient
Measure of proportional reduction of error for contingency tables
Measure only used for ordinal level variables
Lambda is sometimes flawed as a measure because it
Ranges from -1 to +1
Doesn’t test significance
Comes out to zero quite a lot
Is too hard to calculate
An alternative to lambda which SPSS reports is
Alpha
Tau
Chi-square
Gamma
One of the things we can do in a cross-tabulation analysis is to look for the effect of a control variable on the relationship between an independent and dependent variable. In the table below, which is the control variable?

gender
educational attainment
employment category
status as a manager

Which of the following is not a step in statistical hypothesis testing?
Specify the research hypothesis and corresponding null hypothesis
Compute the value of a test statistic about the relationship between the two hypotheses
Calculate the DF and look up the statistic in the appropriate distribution to see if it falls into the critical region
If the result is not significant, move the critical region lower until you reach significance
The null hypothesis with respect to the relationship between two variables is that
The population and the sample means are different
The variables are independent of one another
There is no way to determine if the relationship is significant
The two variables are related to one another
Which of the following statements about chi-square is correct?
Chi-square ranges between -1 and 1
Chi-square can be interpreted as an index of the proportional reduction of error
Chi-square is a measure of the statistical independence of two variables
The larger the value of chi-square for a constant value of DF, the less the dependence of the two variables (the weaker their association)
Which of the following is a way of controlling for the influence of extraneous variables in an experiment?
The case-control method
Randomization
Large numbers of subjects
Test-retest reliability
The ability to show that the causal impact of an independent variable on a dependent variable is legitimate and not attributable to other extraneous and uncontrolled variables is called
Construct validity
Statistical conclusion validity
External validity
Internal validity
A “manipulation check is” a way of ensuring adequate
Construct validity
Statistical conclusion validity
External validity
Internal validity
Features of an experimental setting or questionnaire which induce people to behave in an artificial way are called
Demand characteristics
Debriefing
Experimental attrition
Normative role decay
Which of the following is an example of method variance?
Changing the gender of the experimenter when gender is not a variable in the study
Using paper and pencil measures on one occasion and an interview on another
Using different incentives for different subjects when incentive is not a variable
All of the above
None of the above
Which of the following types of scale does this describe: an object of judgment is evaluated against a set of rating scales (usually five to seven steps) with bi-polar adjectives at either end, such as good-bad or friendly-unfriendly
Likert scale
Guttman scale
Rausch scale
Semantic differential scale
Which of the following types of scales does this describe: people are asked to indicate if they strongly agree, agree, are neutral, disagree, or strongly disagree with declarative statements about a topic.
Likert scale
Rausch scale
Semantic differential scale
None of the above
Which of the following might be a problem with the way that people fill out questionnaires?
They prefer odd numbers to even ones
The have a tendency to disagree with statements rather than agree with them
They give their best responses toward the end of the questionnaire
They may reject items with “always” and “never”
Underlying the t statistic is a sampling distribution of
Sample means
Population means
Differences of sample means
Standard error of sample means
The ttest is for comparing
The difference of means for two independent groups
The difference of means for dependent samples
The difference of a sample mean and a population mean
All of the above
None of the above
What is the DF of a t test comparing two independent groups where one group has a N of 20 and the other an N of 38?
18
58
56
760
In conducting a t-test, the researcher can do a one-tailed or a two-tailed test. Under what circumstances would a one-tailed test be conducted?
When the researcher hoped to have a bigger critical area for getting significance
When the researcher had predicted the direction of the mean differences
When the sample was not normal with respect to the underlying distribution
None of the above
If I do a two-tailed test of my hypotheses and set the confidence level to .05, what area under the normal curve does my obtained value of t have to fall in to obtain significance?
The upper 5% of either end of the distribution
The upper 2.5% of either end of the distribution
The upper ten percent of either end of the distribution
The upper 5% of whatever end I predict it’s going to fall in
In certain cases, for example in “before and after” designs or when members of group A have been matched with members of group B on all salient characteristics except one, the variable of interest, an alternative formula for computing t is used. What is the main difference of this t from the t for independent samples?
t is based on the departure of the difference scores from the mean difference score
You use a different menu option in SPSS
The scores in post-test groups are known to be higher
All of the above
We find out if the ______in two groups are equal before deciding on what sort of t-test we will perform
Means
Standard errors
Medians
Variances

70. An analysis of variance looks for the causal impact of a nominal level independent variable (factor) on

a. A nominal variable

b. An ordinal variable

c. An interval or ratio level variable