Ch1.1& 1.2Basic Definitions for Statistics

Objective A : Basic Definition
A1. Definition

What is Statistics?

•Statistics is the science of collecting, organizing, summarizing, and analyzing data to draw conclusions.

Descriptive Statistics

Statistics

Inferential Statistics

•Descriptive statistics consist of collecting, organizing, summarizing, and presenting data.

•Inferential statistics consists of generalizing from samples to populations, performing estimations and hypothesis tests, and making predictions.

Population versus Sample

•A population consists of all individuals (person or object) that are being studied.

•A sample is a subset of the population.

Parameter versus Statistic

•A parameter is a numerical summary of a population.

•A statistic is a numerical summary of a sample.

Example 1:
Identify the population and sample in the study.
A farmer wanted to learn about the weight of his soybean crop. He randomly sampled 100 plants and
weighed the soybeans on each plant.

Example 2:
Determine whether the underlined value is a parameter or a statistic.
(a) Only 12 men have walked on the moon. The average age of these men at the time of their
moonwalkswas 39 years, 11 months, 15 days.

(b) In a national survey on substance abuse, 66.4% of respondents who were full-time college
studentsaged 18 to 22 reported using alcohol within the past month.

A2. Variables and Type of Data

Variable

•A variable is a characteristic that can assume different values called data.

Qualitative versus Quantitative variable

•Qualitative (categorical) variables are variables that can be placed into distinct categories.

•Quantitative (numerical)variables are numerical and arithmetic operations can be performed.

Discretevariable

Quantitativevariable

Continuousvariable

•A discrete variable can assume a countable number of values.

•A continuous variable can assume an infinite number of values between any two specific values. They often include fractions and decimals.

Example 1:

Classify each variable as qualitative or quantitative. If the variable is quantitative, further classify the data
as discrete or continuous.
(a) Number of students attending a university for Fall 2012

(b) Colors of football caps in a store .

(c) Social security number.

(d) Water temperature of a swimming pool.

Objective B : Level of measurement of a Variable

In addition to being classified as qualitative or quantitative, variables can be classified by how they are categorized, counted, or measured.

Four common types of measurement scales are used: nominal, ordinal, interval, and ratio.

•The nominal level of measurement classifies data into categories in which no order or ranking can be imposed on the data.
For example:

•The ordinal level of measurement classifies data into categories that can be ranked.
For example:

•The interval level of measurement has the properties of the ordinal level of measurement and the differences in the values of the variable have meaning. There is no true zero. A value of zero does
not mean the absence of the quantity.
For example:

•The ratio level of measurement has the properties of the interval level of measurement and the ratios of the values of the variable have meaning. There exists a true zero.
For example:

Objective C : Observational Study versus Designed Experiment

Definition

  • In anobservationalstudy, the researcher observes the behavior of the individuals without trying to influence the outcome of the study.
  • In adesignedexperiment, the researcher controls one of the variables and tries to determine how the manipulation influences other variables.
  • The independentvariable which is also called the explanatory variable in a designed experiment is the one that is being controlled by the researcher. The dependent variable which is also called the response variable is the resultant variable.
  • Confounding in a study occurs when the effects of two or more explanatory variables are not separated.
  • A lurking variable is an explanatory variable that was not considered in a study, but that effects the value of the response variable in the study.

Example 1:
Determine whether the study depicts anobservationalstudy or anexperiment.
(a) Rats with cancer are divided into two groups. One group receives 5 mg of a medication that
is thought to fight cancer, and the other receives 10 mg. After 2 years, the spread of the cancer
is measured.

(b) Conservation agents netted 320 large-trout in a lake and determined how many were carrying
parasites.

Example 2:

Identify the explanatoryvariable and the responsevariable for the following studies.

(a) Rats with cancer are divided into two groups. One group receives 5 mg of a medication that is
thoughtto fight cancer, and the other receives 10 mg. After 2 years, the spread of the cancer is
measured.

(b) A researcher wants to determine whether young couples who marry are more likely to gain
weightthan those who stay single.

Ch 1.3 to 1.5 Sampling

Objective A : Ch 1.3 Simple Random Sampling

Simple Random Sampling

The goal of sampling is to obtain as much information as possible about the population at the least cost.

•A randomsample is obtained by using chance methods or random numbers.

Example :

Steps for Obtaining a Simple Random Sample

1. Number all the individuals in the population of interest.

2. Use a random number table, graphing calculator, or statistical software to randomly generate
numbers where is the desired sample size.

Objective B : Ch 1.4 More Sampling Methods

•A systematicsample is obtained by selecting every kth individual from the population.

•A stratifiedsample is obtained by dividing the population into non-overlapping groups (called strata) according to some similar characteristic, then sampling from each stratum.

•A cluster is obtained by dividing the population into groups called the clusters such as geographic area or schools in a large district. Then all the individuals within a randomly selected clusters are selected.

•A conveniencesample is a sample in which the individuals are easily obtained and not based on randomness.

Example 1:
Identify the type of sampling used. (random, systematic, stratified, cluster, convenience)
(a) Every tenth customer entering a grocery store is asked to select her or his favor color.

(b) A farmer divides his orchard into 30 subsections, randomly selects 4, and sample all the trees
within the 4 subsections to approximate the yield of his orchard.

(c) A survey regarding download time on a certain website is administered on the Internet by a
market research firm to anyone who would like to take it.

(d) In an effort to identify is an advertising campaign has been effective, a marketing firm conducts a
nationwide poll by randomly selecting individuals from a list of known users of the product.

(e) A school official divides the student population into five classes: freshman, sophomore, junior,
senior,and graduate student. The official takes a simple random sample from each class and asks
the members’opinions regarding student services.

Objective C : Ch1.5 Bias in Sampling

•Samplingbias means that the technique used to obtain the individuals to be in the sample tends to favor one part of the population over another.

•Non-response bias exists when individuals selected to be in the sample do not wish to respond.

•Response bias exists when the answers on the survey do not reflect the true feelings of the respondent.

Example 1:
The survey has bias. Determine the type of bias. (Sampling, non-response, response)
(a) To determine the public’s opinion of the police department, the police chief obtains a cluster
sample of 15 census tracts within his jurisdiction and samples all households in the randomly
selected tracts. Uniformed police officers go door to door to conduct the survey.

(b) The village of Oak Lawn wishes to conduct a study regarding the income level of households
withinthe village. The village manager selects 10 homes in the southwest corner of the village
and sendsan interviewer to the homes to determine household income.