1. CRT
Which of the following is preferred when cutoffs are important?
b. criterion-referenced test

2. Mastery Learning
What problems are associated with mastery learning?
d. all of the above

3. Standards 2
Which of the following is not a basic approach for developing criterion-referenced standards?
a. behavioral

4. Standards 3
How should criterion-referenced standards be specified?
b. by defining a class or domain of tasks that should be performed by the individual

5. Mastery Learning 2
What is the order of the basic steps in mastery learning?
c. establishing behavioral objectives, pretesting, instruction, and posttesting

6. Mastery Learning 3
What name is given to specifically written goals with directions for how to attain them?
c. behavioral objectives

7. Criterion
What name is given to a test score indicating that a person actually met the criterion but the field test indicated he did not?
a. false positive

8. Mastery Learning 6
Which of the following contains compatible words?
b. mastery test, criterion, cutoff score

9. CRT 9
Before allowing students to participate in a gymnastics unit, a teacher administered a safety rules test with the requirement that everyone score above 80%. This is an example of what type of test?
b. criterion-referenced

10. Criterion 2
False positives result when subjects truly ________ the criterion, but the field test indicates they ________.
a. meet; did not

11. Reliability
What statistic is often calculated with criterion-referenced reliability and validity?
c. chi-square

12. Measurement
What level of measurement is most often associated with criterion-referenced measurement?
a. nominal

13. CRT12
What statistical test used with CRTs is actually the number of agreements divided by the total number of classifications made?
c. proportion of agreement

14. Statistical Analysis 2
What does kappa consider that proportion of agreements (P) does not?
a. chance
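
As a rough illustration of cards 13 and 14, the sketch below computes the proportion of agreement (P) and kappa from a 2x2 table of pass/fail classifications. The function name and the counts are hypothetical, chosen only to show how kappa removes the agreement expected by chance.

```python
def agreement_stats(both_pass, pass_fail, fail_pass, both_fail):
    """Return (proportion of agreement, kappa) for a 2x2 classification table."""
    n = both_pass + pass_fail + fail_pass + both_fail
    # P: number of agreements divided by all classifications made.
    p = (both_pass + both_fail) / n
    # Marginal totals for each testing occasion.
    pass_1 = both_pass + pass_fail
    pass_2 = both_pass + fail_pass
    fail_1 = fail_pass + both_fail
    fail_2 = pass_fail + both_fail
    # Agreement expected by chance alone, from the marginals.
    p_chance = (pass_1 * pass_2 + fail_1 * fail_2) / n ** 2
    # Kappa: agreement beyond chance, scaled by the maximum possible.
    kappa = (p - p_chance) / (1 - p_chance)
    return p, kappa


# Hypothetical counts, not taken from the cards.
p, kappa = agreement_stats(40, 10, 5, 45)
print(round(p, 2), round(kappa, 2))  # prints 0.85 0.7
```

With these made-up counts, 85% of the classifications agree, but half of that agreement would be expected by chance, so kappa comes out lower than P.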

15. Statistical Analysis 3
What test statistic would be examined by determining how many students met the cutoff score and how many did not meet the cutoff score for a test on day 1 and again on day 2?
d. stability reliability

16. Statistical Analysis 4
What test statistic would be evaluated by determining how many students met the cutoff score and how many did not meet the cutoff score on two different tests of cardiovascular fitness?
c. equivalence reliability

17. Standards 4
A high school volleyball coach determines that each player must be able to serve 8 out of 10 serves into the court in overhand fashion. This is an example of a
a. judgmental standard

18. Definitions
What is the process of interpreting information to make a judgment or interpretation of its meaning called?
b. evaluation

19. Assessment
Which of the following is not an example of an alternative assessment technique?
b. completing a 1 mi run/walk test

20. Assessment 3
Which of the following statements concerning alternative assessment is not true?
a. Alternative assessment lends itself to the use of norm-referenced standards.

21. Assessment 6
Which of the following test characteristics is considered unimportant when dealing with alternative assessment techniques?
d. none of the above

22. Assessment 8
What procedure would be recommended to ensure reliable scoring of alternative assessments?
c. developing well-defined, explicit performance criteria

23. Assessment15
What can you do to help counter bias in alternative assessments?
c. collect several types of data

24. Assessment16
What clarifies student expectations when using alternative assessment?
e. providing explicit directions to the student

25. Assessment17
What do you want to increase most as the consequences of alternative assessments increase?
d. the validity of the measurement

26. Assessment18
To whom are you referring when discussing meaningfulness in alternative assessments?
e. students

27. Portfolio 2
A portfolio consists of
b. examples of student work

28. Assessment21
Which of the following doesn't belong with the others?
c. norm-referenced assessment

29. Assessment22
Which of the following is not an alternative assessment measurement technique?
d. true/false tests

30. Assessment23
What is the most commonly used measurement technique in alternative assessment?
b. observation

31. Assessment24
What type of standards are most often associated with alternative assessment and why?
a. criterion-referenced, because of the subjectivity involved

32. Grading
What important testing principle would be violated most by using successful completion of the FITNESSGRAM testing program as the primary factor to determine the final grade for students completing a touch football unit?
c. validity

33. Grading 3
Ideally, which of the following should be most reflected in a student's grade?
d. the achievement of the student

34. Grading 4
Which of the following are problems in grading on improvement?
d. all of the above

35. Grading 6
What is the biggest disadvantage of using arbitrary standards in grading?
a. consistency

36. Grading 7
What best describes grading?
c. a summative evaluation

37. Grading 9
Which statement would most likely be made by a teacher using an absolute grading system?
b. All scores below 70% of the maximum will be considered failures.

38. Grading10
Which of the following is not one of the four basic steps in the process of grading?
d. revising course objectives based on results

39. Grading14
Subjectivity in determining grades negatively affects which of the following attributes most seriously?
c. reliability

40. Grading15
The lack of a clear definition of what a particular grade means negatively affects which of the following attributes most seriously?
d. validity

41. Domains
Which domain of human experience is most unique to individuals involved in teaching about and assessing physical activity?
d. psychomotor

42. Objectives
Which of the following is the least important consideration in determining objectives?
b. whether the necessary facilities are available to meet the objective

43. Grading16
What is the advantage of considering a grade to be a measurement rather than an evaluation?
a. It is less susceptible to a particular teacher's expectations.

44. Domains 2
Which of the following class units would most likely have more weight placed on objectives from the affective domain than the other units?
a. dance unit (co-ed)

45. Grading17
What should be your first step in the grading process?
e. determine instructional objectives

46. Distribution
Using natural breaks in a distribution rather than sticking with a predetermined percentage of each grade to be given recognizes what fallibility of measurement?
d. reliability

47. Essay
Which of the following is of first concern when you have an essay examination to grade?
c. objectivity

48. Validity 2
What type of validity is enhanced by having a fellow teacher review the items on the test?
a. content validity

49. Item Analysis
For a 50-item test given to 140 students, approximately how many of the answer sheets would be used in an item analysis?
d. 76
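
The keyed answer is consistent with a common convention, assumed here rather than stated on the card, of analyzing only the top and bottom 27% of the answer sheets. A minimal sketch under that assumption (the function name is hypothetical):

```python
def sheets_for_item_analysis(n_students, fraction=0.27):
    """Answer sheets used when the upper and lower groups are each `fraction` of the class."""
    # Size of one extreme group, rounded to whole students.
    group_size = round(n_students * fraction)
    # The upper and lower groups together.
    return 2 * group_size


print(sheets_for_item_analysis(140))  # prints 76
```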

50. Test Characteristic
What do the item characteristics of difficulty (P) and discrimination (r or D) affect?
c. both a and b

51. Item Analysis 2
One hundred students are in each of the upper and lower groups. Fifty students in the upper group and eighty students in the lower group got a question correct. What is the index of difficulty of this question?
d. .65

52. Item Analysis 3
One hundred students are in each of the upper and lower groups. Fifty students in the upper group and eighty students in the lower group got a question correct. What is the index of discrimination of this question?
b. -.30
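
The arithmetic behind cards 51 and 52 can be sketched as follows, assuming the usual upper/lower-group definitions: difficulty is the proportion of both groups answering correctly, and discrimination is the upper-group count minus the lower-group count, divided by the size of one group. The function name is hypothetical.

```python
def item_indices(upper_correct, lower_correct, group_size):
    """Return (difficulty, discrimination) for one item from upper/lower group counts."""
    # Difficulty: share of all students in both groups who answered correctly.
    difficulty = (upper_correct + lower_correct) / (2 * group_size)
    # Discrimination: upper minus lower, relative to one group's size.
    discrimination = (upper_correct - lower_correct) / group_size
    return difficulty, discrimination


difficulty, discrimination = item_indices(50, 80, 100)
print(difficulty, discrimination)  # prints 0.65 -0.3
```

Because more students in the lower group than in the upper group answered correctly, the discrimination index is negative under this definition.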

53. Test Characteristic 2
What characteristic of a written test would be most enhanced if every student were equally skilled in test taking?
d. validity

54. Test Characteristic 3
What major advantage do multiple-choice questions have over true/false items?
b. Guessing is less of a factor.

55. Test Characteristic 4
Which of the following is most important for an achievement test?
b. the discrimination index

56. Testing 2
Ideally, how difficult should the items be that comprise written achievement tests?
b. Most items should be of middle difficulty.

57. Testing 3
Poor item discrimination leads to
a. unreliability

58. Item Analysis 5
If six of eight pupils answered a question correctly, what is the index of difficulty of the question?
c. 75%

59. Test Characteristic 5
Which of the following factors will ensure that a test is objective?
a. a well-defined scoring method for the test

60. Test Characteristic 6
What characteristic does an item that is answered correctly by a larger number of good students than poor students display?
b. positive discrimination

61. Test Characteristic 8
If you could obtain only one piece of information about a test, what would you like to see?
e. validity coefficient

62. Testing 4
What major advantage do multiple-choice questions have over essay items?
d. Objectivity is greater with multiple-choice items.

63. Testing 5
Other things being equal, increasing the number of items on a test also increases
c. reliability

64. Testing 6
Which of the following is an important advantage of the multiple-choice test?
a. wide sampling

65. Test Characteristic 9
What characteristic is measured by the Kuder-Richardson formulas?
c. reliability

66. Item Analysis 7
One hundred fifty students have taken your written test. Approximately how many answer sheets should you use to conduct an item analysis by hand?
d. 80

67. Testing 8
What increases when additional items are added to a written test?
b. reliability

68. Essay 2
What is the major difference between objective and essay items?
a. Objective items are more reliable.

69. Multiple Choice 2
What is the best way to make a multiple-choice question less difficult?
c. use heterogeneous responses

70. Testing 9
What type of objective test item is most seriously affected by blind guessing?
b. true/false

71. Essay 3
Which of the following should not be done when grading essay exams?
c. sort the papers into three to five rough piles according to a quick inspection of the overall paper

72. Testing12
Which of the following considerations in constructing a written test should be fairly well established before the course begins?
a. the emphasis to be given various topics

73. Testing13
Generally, how many questions could be included on a test so that about 90% of a class of high school or college students would be expected to complete the test in 1 hr?
d. 60 multiple-choice or 120 true/false

74. Testing14
Using which of the following testing formats is most likely to lead to the use of good test construction techniques?
d. objective tests

75. Testing15
A teacher is scheduled to give an examination to four classes on December 1. The teacher begins coaching duties on December 1. If the four classes have been covering the same material in class and average 25 pupils each, what type of test should the teacher administer?
c. multiple-choice; more time to construct the test than to correct it

76. Test Characteristics
Which is most likely to increase as the variance increases on a written test?
c. reliability of the test

77. Test Characteristics 2
What can be said about a test item that discriminates negatively?
d. It is not valid.

78. Item Analysis 9
With what type of test is item analysis typically used?
d. depends on the nature of the test

79. Test Characteristics 3
If a table of specifications calls for 20% of the 50 questions on a test to measure recall (knowledge) and 30% of them to measure strategy, how many of the test items should deal with recall (knowledge) of strategy?
c. 3
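
One way to read the keyed answer, assuming each cell of the table of specifications is the product of its row and column percentages, is sketched below. The function name is hypothetical.

```python
def cell_items(total_items, objective_share, content_share):
    """Number of items in one cell of a table of specifications."""
    # Items devoted to the educational objective (for example, recall).
    objective_items = total_items * objective_share
    # Portion of those items that also cover the content area (for example, strategy).
    return round(objective_items * content_share)


print(cell_items(50, 0.20, 0.30))  # prints 3
```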

80. Untitled
If 20 of 40 in the upper group and 20 of 40 in the lower group choose the correct answer to a multiple-choice question with five possible responses, what would the index of discrimination be for the item?
c. .0

81. Item Analysis10
When is an item considered to have good discrimination power?
c. both of the above

82. Testing
What type of survey would yield very consistent but inaccurate results?
c. a survey with high bias and high precision

83. Testing16
An open scale survey question gives the respondent
c. the opportunity to write an answer

84. Testing17
A table of specifications for a written test helps to determine
a. what to test for

85. Taxonomy
Which of Bloom's categories of educational objectives is most like Ebel's category of factual information?
c. knowledge

86. Item Analysis11
To help reduce test anxiety, a question with which index of difficulty might appear as the first item on the test?
d. .85

87. Item Analysis12
Which of the following is not considered to be a semi-objective type question?
b. matching

88. Test Characteristics 4
Which of the following words would least likely appear in a true/false test item that was keyed as false?
c. generally

89. Matching
What is the least important information to include in the directions for a matching test item?
b. the point value of the question

90. Taxonomy 2
Use of novel situations in the construction of written test questions is associated most closely with which of Bloom's educational objectives?
c. application

91. Item Analysis13
What index of discrimination would be most likely for an intrinsically ambiguous question?
a. .50

92. Item Analysis14
If the difficulty index for a test item is .60 and there are 50 students in the high group and 50 students in the low group, what is the maximum discrimination index this item could achieve?
d. .80
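
The reasoning can be sketched as follows: a difficulty index fixes the total number of correct answers, and discrimination is largest when as many of those as possible fall in the upper group. This is an illustrative calculation with a hypothetical function name, assuming the upper-minus-lower definition of discrimination.

```python
def max_discrimination(difficulty, group_size):
    """Largest discrimination index possible for a given difficulty index."""
    # Total correct answers across both groups implied by the difficulty index.
    total_correct = round(difficulty * 2 * group_size)
    # Best case: fill the upper group first, then give the remainder to the lower group.
    upper_correct = min(group_size, total_correct)
    lower_correct = total_correct - upper_correct
    return (upper_correct - lower_correct) / group_size


print(max_discrimination(0.60, 50))  # prints 0.8
```

With 60 correct answers out of 100, at most 50 can come from the upper group, leaving 10 in the lower group, so the index cannot exceed (50 - 10) / 50.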

93. Reliability 3 2
What assumption reduces the usefulness of the K-R 21 formula for estimating the reliability of a written test?
b. All items are of equal difficulty.
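
For reference, a minimal sketch of the K-R 21 estimate in its standard form, which needs only the number of items, the test mean, and the test variance. It can skip item-level data precisely because it treats every item as equally difficult. The function name and input values are hypothetical.

```python
def kr21(n_items, mean, variance):
    """Kuder-Richardson formula 21 reliability estimate."""
    k = n_items
    # Uses only k, the mean, and the variance of total scores;
    # no per-item difficulties are involved.
    return (k / (k - 1)) * (1 - (mean * (k - mean)) / (k * variance))


# Hypothetical test: 50 items, mean score 35, variance 64.
print(round(kr21(50, 35.0, 64.0), 3))  # prints 0.853
```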

94. Questionnaire
With what domain of human experience are questionnaires most closely associated?
a. affective

95. Questionnaire 2
An open-ended item on a questionnaire is most like which type of written test item?
c. essay

96. Questionnaire 3
What technique is suggested to determine how well new questionnaire items function?
b. pilot studies

97. Testing18
What is the most important ancillary device in helping to obtain the highest response rate for a questionnaire?
d. a cover letter

98. Questionnaire 4
Which of the following is least likely to ensure that a questionnaire is valid?
a. keeping it as short as possible