Pages

Tuesday, January 11, 2011

Standardized Measurement and Assessment

1. Which of the following is not an assumption underlying testing and measurement?
a.  Various approaches to measuring aspects of the same thing can be useful
b.  Error is rarely present in the measurement process
c.  Present-day behavior predicts future behavior
d.  Testing and assessment benefit society

2. Systematic error is associated with:
a.  Reliability
b.  Validity

3. Which of the following is a type of criterion–related validity evidence?
a.  Concurrent evidence
b.  Predictive evidence
c.  Internal consistency
d.  Both a and b are correct answers

4. If a test measures a single construct then:
a.  The items should correlate with the total score
b.  The items should not correlate with the total score
c.  The test should not correlate with other measures of the same construct
d.  There must be a reliable alternative form.

5. Professor X develops a test of emotional intelligence. Which of the following represent convergent and discriminant evidence?
a.  The test correlates highly with another test of emotional intelligence and is
                 uncorrelated with self-efficacy
b.  The test correlates with highly with another test of emotional intelligence and is
            highly correlated with self-efficacy
c.  The test does not correlate with another test of emotional intelligence, but does
                 correlate with self-efficacy
d.  The test does not correlate with other tests of emotional intelligence nor with self-
            efficacy

6. An ordinal scale is used to rank order people, objects, or characteristics.
a. True
b. False

7. Which scale is the simplest form of measurement?
a. Nominal
b. Ordinal
c. Interval
d. Ratio

8.  ______ tests focus on information acquired through the informal learning that goes on in life.
a. Personality
b. Achievement
c. Aptitude
d. Intelligence

9. Let’s say that a test accurately indicates participants’ scores on a future criterion (e.g., the PSAT is used to indicate high-school GPA scores). This test would clearly have which of the following?
a. Face validity
b. Concurrent validity
c. Predictive validity
d. Content validity

10. If a baseball coach calculates batting averages, what scale would be used?
a. Interval scale
b. Ratio scale
c. Nominal scale
d. Ordinal scale

11. According to the text, most of the outcome/dependent variable characteristics and attributes measured in educational research probably exist at the ______________ level of measurement.
a. Nominal
b. Ordinal
c. Interval
d. Ratio

12. Which of the following is most clearly an example of a psychological trait?
a. Anxiety enduring for months or years
b. Anxiety over just seeing a spider
c. Shyness when meeting a stranger for the first time
d. Depression caused by the loss of a ball game

13. All of the following are examples of Intelligence Tests except _________:
a.  Wechsler Scales
b.  Stanford-Binet
c.  Minnesota Multiphasic Personality Inventory(MMPI)
d.  Slosson

14. Reliability is most simply known as which of the following?
a.  Consistency or stability
b.  Appropriateness of interpretations on the basis of test scores
c.  Ways in which people are the same
d.  A rank order of participants on some characteristic

15. An ordinal scale is:
a. The simplest form of measurement
b. A rank-order scale of measurement
c. A scale with equal intervals between adjacent numbers
d. A scale with an absolute zero point
e. A categorical scale

16. Which of the following is not a type of reliability?
a. Test-retest
b. Split-half
c. Content
d. Internal consistency

17. Which of the following statements accurately describes test-retest reliability?
a.  Measure of consistency of test scores over time
b.  Measure of consistency of scores obtained from two equivalent halves of the same test
c.  Measure of consistency with which a test measures a single construct or concept
d.  Measure of degree of agreement between two or more scorers, judges, or raters

18. Which of the following types of reliability refers to the consistency of test scores over time?
a. Equivalent forms reliability
b. Split-half reliability
c. Test-retest reliability
d. Inter-scorer reliability

19. Identify the following term that most closely refers to a judgement of the extent to which scores from a test can be used to infer, or predict, the examinees' performance in some activity:
a. Content reliability
b. Face validity
c. Criterion-related validity
d. Inference validity

20. Which of the following is the correct order of Stevens’ four levels of measurement?
a. Ordinal, nominal, ratio, interval
b. Nominal, ordinal, interval, ratio
c. Interval, nominal, ordinal, ratio
d. Ratio, interval, nominal, ordinal

21. Which is the process of gathering evidence supporting inferences based test scores?
a. Validation
b. Validity
c. Reliability
d. Prediction

22. When evaluating tests and assessments, “reliability” refers to asking ourselves which of the following questions?
a.  Does it measure what it is supposed to measure?
b.  Are there ways to avoid subjective judgments when measuring something?
c.  Does it give consistent results?
d.  Does it measure multiple constructs?

23. Validity of a test designed to measure a construct such as self-esteem is best described by which of the following?
a.  Scores from the test correlate highly with most intelligence tests
b.  Scores from the test correlate highly with most tests of different constructs
c.  Scores from the test are not correlated with anything
d.  Scores from the test have a relatively strong and positive correlation with other tests of the
same construct (i.e., with other measures of self-esteem) but much lower correlations
with tests of different constructs

24. Which type of reliability refers to the consistency of a group of individuals' scores on two  equivalent forms of a test designed to measure the same characteristic? 
a.  Split-half
b.  Test-retest
c.  Split-forms
d.  Equivalent forms

25. Achievement tests are designed to measure the degree of learning that has taken place after a person has been exposed to a specific learning experience.
a.   True
b.   False

26. _________ refers to how well the particular sample of behaviors used to measure a characteristic reflects the entire domain of behaviors that constitutes that characteristic.
a. Construct validity evidence
b. Criterion-related validity evidence
c. Content validity evidence
d. Face validity evidence

Answers:
1. b
2. b
3. d
4. a
5. a
6. a
7. a
8. c
9. c
10. b
11. b
12. a
13. c
14. a
15. b
16. c
17. a
18. c
19. c
20. b
21. a
22. c
23. d
24. d
25. a
26. c

No comments:

Post a Comment