A computer program for assessing interexaminer agreement when multiple ratings are made on a single subject
[Abstract] This report describes a computer program that applies a new statistical method for determining levels of agreement, or reliability, when multiple examiners evaluate a single subject. The statistics computed include the following: an overall level of agreement, expressed as a percentage, that takes into account all possible levels of partial agreement; the same statistical approach applied to derive a separate level of agreement of every examiner with every other examiner; and tests of the extent to which a given examiner's rating (say, a symptom score of three on a five-category ordinal rating scale) deviates from the group, or overall average, rating. These deviation scores are interpreted as standard Z statistics. Finally, both statistical and clinical criteria are provided for evaluating levels of interexaminer agreement. (C) 1997 Elsevier Science Ireland Ltd.
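The abstract does not give the formulas the program uses, so the following Python sketch is only an illustration of the three kinds of output it lists, not the authors' method. It assumes linear partial-credit weights (identical ratings score 1, ratings at opposite ends of the scale score 0) and a plain sample standard deviation for the deviation scores. All function names are hypothetical.

```python
from itertools import combinations
from statistics import mean, stdev


def pair_agreement(a, b, n_categories):
    # Assumed linear partial credit: 1.0 for identical ratings,
    # 0.0 for ratings at opposite ends of the ordinal scale.
    return 1.0 - abs(a - b) / (n_categories - 1)


def overall_agreement(ratings, n_categories):
    # Mean partial-credit agreement over all examiner pairs, as a percentage.
    scores = [pair_agreement(a, b, n_categories)
              for a, b in combinations(ratings, 2)]
    return 100.0 * mean(scores)


def pairwise_agreement(ratings, n_categories):
    # Percentage agreement of every examiner with every other examiner,
    # keyed by the pair of examiner indices.
    return {
        (i, j): 100.0 * pair_agreement(ratings[i], ratings[j], n_categories)
        for i, j in combinations(range(len(ratings)), 2)
    }


def deviation_scores(ratings):
    # Standardized deviation of each examiner's rating from the group mean
    # (assumed form; the published test may differ).
    mu = mean(ratings)
    sd = stdev(ratings)
    if sd == 0:
        return [0.0 for _ in ratings]
    return [(r - mu) / sd for r in ratings]


if __name__ == "__main__":
    # Five examiners rate one subject on a five-category ordinal scale.
    ratings = [3, 3, 4, 2, 3]
    print(overall_agreement(ratings, 5))
    print(pairwise_agreement(ratings, 5))
    print(deviation_scores(ratings))
```

Under these assumptions, examiners whose ratings differ by one category still receive partial credit, which is one simple way to take partial agreement into account on an ordinal scale.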
[Publication date] 1997-08-29 [Publishing institution]
[Validity level] [Subject classification]
[Keywords] statistics; psychometrics; reliability [Timeliness]