A generalizability study of a new standardized rating form used to evaluate students' clinical clerkship performances

Acad Med. 1998 Dec;73(12):1294-8. doi: 10.1097/00001888-199812000-00021.

Abstract

Purpose: To investigate the measurement characteristics of standardized clinical evaluation forms (CEFs) used to assign grades for clerkship performance.

Method: In 1996-97, the authors reviewed 5,168 CEFs completed for 175 students in eight clerkships. Limiting their analysis to the three clerkships that produced the most CEFs, the authors conducted a generalizability study to determine the five variance components for each clerkship. A decision study then calculated the generalizability coefficients and standard errors of measurement in each clerkship for varied numbers of raters and CEF items.

Results: The generalizability study found large variance components attributable to rater and rating context. The decision study found that, when three or more raters completed CEFs for a student, the generalizability coefficient and standard error of measurement reached levels acceptable for grading. Increasing the number of items on the CEF had no significant effect.

Conclusion: The reliability of assigning students clerkship grades based on single CEFs is unacceptably low. However, CEFs can accurately measure students' clerkship performances if completed by three or more raters.

Publication types

  • Comparative Study

MeSH terms

  • Clinical Clerkship / organization & administration
  • Clinical Clerkship / standards*
  • Clinical Competence / standards*
  • Educational Measurement / methods*
  • Humans
  • Iowa
  • Problem-Based Learning
  • Program Evaluation* / methods
  • Reproducibility of Results
  • Retrospective Studies
  • Teaching