Derivation of Two Critical Appraisal Scores for Trainees to Evaluate Online Educational Resources: A METRIQ Study
- Author(s): Chan, MD, MHPE, Teresa M.
- Thoma, MD, MA, Brent
- Krishnan, BHSc, Keeth
- Lin, MD, Michelle
- Carpenter, MD, MSc, Christopher R.
- Astin, MD, Matt
- Kulasegaram, PhD, Kulamakan
- et al.
Published Web Locationhttps://doi.org/10.5811/westjem.2016.6.30825
Introduction: Online education resources (OERs) like blogs and podcasts frequently augment or replace traditional medical education resources such as textbooks and lectures. Trainees’ ability to evaluate these resources is poor, and no quality assessment aids have been developed to assist them. This study derived a quality evaluation instrument for this purpose.
Methods: We used a three-phase methodology. In Phase 1, a previously derived list of 151 OER quality indicators was reduced to 13 items using data from published consensus-building studies (of medical educators, expert podcasters, and expert bloggers) and further evaluation by our team. In Phase 2, these 13 items were converted to seven-point Likert scales used by trainee raters (n=40) to evaluate 39 OERs. The reliability and usability of these 13 rating items was determined using responses from trainee raters, and top items were used to create two OER quality evaluation instruments. In Phase 3, these instruments were compared to an external certification process (the ALiEM AIR certification) and the gestalt evaluation of 39 blog posts by 20 faculty educators.
Results: Two quality-evaluation instruments were derived with fair inter-rater reliability: the METRIQ-8 Score (Inter class correlation coefficient [ICC]=0.30, p<0.001) and the METRIQ-5 Score (ICC=0.22, p<0.001). Both scores, when calculated using the derivation data, correlated with educator gestalt (Pearson’s r=0.35, p=0.03 and r=0.41, p<0.01, respectively) and were related to increased odds of receiving an ALiEM AIR certification (Odds Ratio=1.28, p=0.03; OR=1.5, p=0.004, respectively).
Conclusion: Two novel scoring instruments with adequate psychometric properties were derived to assist trainees in evaluating OER quality and correlated favourably with gestalt ratings of online educational resources by faculty educators. Further testing is needed to ensure these instruments are accurate when applied by trainees. [West J Emerg Med. 2016;17(5)574-584.]