Author |
Henning, Grant |
Issue date |
1996 |
dc.description |
This analysis of simulated performance ratings on a six-point scale by two independent raters is an attempt to account for nonsystematic error in performance ratings under restrictive assumptions of classical measurement theory. Results suggest that rater agreement or covariance is not always a dependable estimate of score reliability, and that the practice of seeking additional raters for the adjudication of discrepant ratings is not equally appropriate in every evaluation decision context or at every possible rating score step. And, although nonsystematic rating error is most common at the midpoints of the rating scale, most of this midpoint error disappears when rater scores are averaged, so that the need for adjudication is most critical at the rating scale extremes, whether or not raters are in agreement. |
Publisher |
Sage Publications |
Title |
Accounting for nonsystematic error in performance ratings |
Type |
Journal Article |
DOI |
10.1177/026553229601300104 |
Print ISSN |
0265-5322 |
Journal |
Language Testing |
Volume |
13 |
First page |
53 |
Last page |
61 |
Affiliation |
Henning, Grant, The Pennsylvania State University |
Issue |
1 |
Bibliographic reference |
Educational Testing Service 1987: Standards for quality and fairness. Princeton, NJ: ETS. |
Bibliographic reference |
Gulliksen, H. 1987: The theory of mental tests. Hillsdale, NJ: Lawrence Erlbaum Associates. |
Bibliographic reference |
Henning, G. 1987: A guide to language testing: development, evaluation, research. New York: Newbury House/Heinle & Heinle. |
Bibliographic reference |
Henning, G. 1992: Scalar analysis of the Test of Written English. TOEFL Research Report 38. Princeton, NJ: ETS. |
Bibliographic reference |
Henning, G. and Davidson, F. 1987: Scalar analysis of composition ratings. In Bailey, K.M., Dale, T.L. and Clifford, R.T., editors, Language testing research: selected papers from the 1986 colloquium, Monterey, CA: Defense Language Institute. |
Bibliographic reference |
Powers, D.E. and Stansfield, C.W. 1983: The Test of Spoken English as a measure of communicative ability in the health professions: validation and standard setting. TOEFL Research Report 13. Princeton, NJ: ETS. |
Bibliographic reference |
Wright, B.D. and Masters, G.N. 1982: Rating scale analysis: Rasch measurement. Chicago: MESA Press. |