Study Guide for Exam 1 ~ STAT 210

Chapter 1 ~ What is Statistics?

Some of the ideas in this section of the study guide come from the introductory Powerpoint presentations shown in class. You should review the content of these presentations in preparation for the exam. These Powerpoints are available in the Additional Links section of the course website.

Polls and Surveys

  • Why do we sample?
  • What do we mean by sampling/random/chance error?
  • What are nonsampling errors?
  • What is the consequence of nonsampling errors?
  • What are the different types of nonsampling errors?
  • Given the description of study be able to identify the following:

-target population

-study population (and whether it is the same as the target population based on the sampling scheme used)

-variables being measured (e.g. gender, smoking status, GPA, income, etc.)

-purpose of the study

-any conclusions reached/inferences made from the study etc.

-any potential problems with the described study (sources of bias etc.)

  • Discuss how you would conduct a simple survey.

Experiments

  • What is a completely randomized design?
  • What is blocking, and what is it used for?
  • When experimenting with humans, understand the following:

-control group

-role of randomization

-placebo

-double blind

-placebo effect

  • Discuss how you would conduct a simple experiment given a research goal.

Observational Studies

  • What is prospective study?
  • What is retrospective study?
  • What are the limitations of observational studies vs. experimental studies?
  • What is controlling for a factor?
  • Given a study description, be able to identify some potential factors that one might want to control for.

Use of Simulations to Make Decisions

e.g. Swain vs. Alabama, Medical Success Rate, Casteneda vs. Partida

Concept of Sampling Variation and Standard Errorwhen estimating
proportions (see Chapter 1 sections 1.3, 1.4 and Homework 1)

Chapter 2 ~ How to Describe and Summarize Data

  • Types of variables– Given a list of variables you should be able to classify them as being numerical (continuous or discrete) or categorical.
  • Graphical displays for continuous variables.
    For each be able to construct, read, and draw conclusions from them.
  • Dot plots – construct by hand
  • Stem-and-leaf plots– be able to read only.
  • Histograms (Be able to comment on the following from a histogram: typical value, variability/spread, and distributional shape)
  • Outlier boxplots
  • CDF Plots
  • Numerical summaries for continuous variables
  • Sample mean – be able to calculate by hand and interpret.
  • Sample median – be able to calculate by hand and interpret.
  • Five number summary – be able to interpret.
  • Range – be able to calculate by hand and interpret.
  • InterquartileRange (IQR)– be able to interpret and use it to compare spread across groups.
  • Standard deviation – be able to calculate by hand and interpret. Also be able to use it to compare spread across groups.
  • Empirical Rule for approximately normal data – be able to apply this rule.
  • Graphical and numerical summaries for discrete data
  • Sample mean and standard deviation - These would be given to you, you do not need to know how to compute them from a table.
  • Bar graphs and frequency distribution tables – Be able to read and draw conclusions from a frequency distribution table. Also make sure you understand frequency, relative frequency, and cumulative relative frequency. This relates to material presented in Chapter 4 ~ Discrete Probability Distributions.
  • Graphical and numerical summaries for categorical data
    Be able to read and draw conclusions from:
  • Bar graphs
  • Frequency and Relative Frequency Tables
  • Pie charts

Extra Graphical Displays from Homework #2

Examining the Relationship Between Two Numeric Variables

  • Scatter plots – be able to read and draw conclusions from them. Be able to discuss trend, scatter, association (both in terms of direction and strength), groups/clusters of points, gaps, and outliers. (see Problem #7 from HW #2)

Comparing Values of Numeric Variable Across Groups/Populations

  • Comparative Boxplots – be able to read and draw conclusions from vertical dot plots with box plots added as we have looked at in JMP. In particular be able to comment on typical value, spread/variation, distributional shape, and within group outliers. Also if given histograms that were plotted in the same scale be able to compare contrast the groups. (see Problem #7 from HW #2)

Chapter 3 – Probability

If you haven’t done so yet, read Chapter 3 Probability. The most important concepts from this chapter are those of independence, conditional probabilities, and Baye’s Rule. Also the use Tree Diagrams to map out probabilities associated with two stage experiments should be reviewed.

Be sure you are able to do the following:

  • Construct and use a tree diagram to find probabilities of interest.
    See example 3.3.2 pgs. 105 – 106, problem 3.52, and Baye’s Rule/screening test
    problems like those on your homework.
  • Apply Baye’s Rule – you had several problems where this was used on your third assignment. If you did not get them all worked out correctly be sure to read through my solutions on the course website.
  • Given a contingency table be able to find probabilities of events of interest. Also be able to compute the relative risk (RR) associated with a potential risk factor for an adverse outcome. Review the probability Powerpoint we went through in class and the additional problem from Assignment #3 that looked at the association between smoking and birthweight.

Chapter 4 – Discrete Random Variables

If you have not done so, read sections 4.1 – 4.6 in your text. Also be sure to look at the solutions to the homework problems assigned from this section.

Be sure you are able to do the following:

  • Given a discrete probability distribution (i.e. the possible values for the random variable X and their associated probabilities of occurrence, be able to find the following:
  • Probabilities associated with specific events.
  • Probability histogram
  • The cumulative distribution function, and graph it.
  • The expectation , the variance , and the standard deviation .
  • The expectation, variance, and standard deviation of linear functions of X, i.e. , , and .
  • For a simple experiment be able to find the discrete probability functionand then items in the list above.

Chapter 5 – Random Variables for Success/Failure

Experiments (Binomial Random Variable)

Read Sections 5.1 and 5.2 of your text and review the solutions to your Chapter 5 homework posted on the web.

Be sure you are able to do the following:

  • Use the binomial probability function to find probabilities associated with an arbitrary binomial random variable K. You need to be able to do this with your calculator and your brain for the exam.

k = 0,1,2,...,n

  • Find the expectation E(K), variance Var(K), and standard deviation SD(K) of an arbitrary binomial random variable.
  • Be able to read the output from the Binomial Table Generator from JMP.

Chapter 6 –Introduction to Hypothesis Testing

Read Chapter 6 – Sections 6.1 – 6.5.

Be sure you are able to do the following:

  • Be able to state the null and alternative hypothesis for a given situation.
  • Be able to state the Type I and Type II errors for a given situation.
  • For a given simple binomial testing situation be able to calculate , and the power.
  • Be able to perform a Sign Test (e.g. reading comprehension course, men’s heights)
  • Be able to perform an Exact Binomial Test for 2 by 2 contingency table.