Publication:
Influence of pairing in examiner leniency and stringency ('hawk-dove effect') in part II of the European Diploma of Anaesthesiology and Intensive Care: A cohort study.

cris.virtualsource.author-orcid95931ae7-b5ea-4129-9ca1-b2097da37724
datacite.rightsopen.access
dc.contributor.authorSciberras, Stephen
dc.contributor.authorKlimek, Markus
dc.contributor.authorAteleanu, Bazil
dc.contributor.authorScipioni, Hugues
dc.contributor.authorDi Loreto, Rodolphe
dc.contributor.authorBerger-Estilita, Joana
dc.date.accessioned2024-09-26T08:58:59Z
dc.date.available2024-09-26T08:58:59Z
dc.date.issued2024-12
dc.description.abstractBackground The European Diploma of Anaesthesiology and Intensive Care (EDAIC) Part II examination is a supranational examination for anaesthesiologists.Objectives We explore the impact of examiner pairing on leniency and stringency, commonly referred to as the 'hawk-dove effect'. We investigate the potential variations in grading approaches, resulting from different examiner pairs and their implications for candidate performance.Design Retrospective cohort, observational design.Setting EDAIC Part II examination data from 2021 to 2023.Participants Three hundred and twenty-five examiners across 122 EDAIC Part II examination sessions.Interventions We analysed the influence of examiner leniency and examiner pairing on candidate performance in the EDAIC Part II using many-facet Rasch modelling.Main Outcome Measures The study's main outcome measure was determining a leniency score among the examiner population. The study also aimed to assess how examiner pairing influenced candidate performance, as measured by their scores in the examination.Results During the study period, the number of examiners who participated in 2021, 2022 and 2023 were 253, 242 and 247, respectively. The median sessions attended were 7.0 (3 to 10). The examination data revealed a mean leniency score of 0 [95% confidence interval (CI) -0.046 to 0.046], with the standard deviation being one-third that of the candidates' ability scores. There were 1424 different pairs of examiners, with most pairs (97%) having only a one-point difference in marking. The mean leniency score for the pair of examiners was -0.053 (95% CI -0.069 to -0.037).Conclusion The variations in grading approaches associated with different pairings emphasise the potential for the 'hawk-dove effect' to influence candidate performance and outcomes. Understanding these variations can guide curriculum development, examiner training and coupling, ensuring a balanced and equitable assessment process.Trial Registration None.
dc.description.numberOfPages11
dc.description.sponsorshipInstitut für Medizinische Lehre, Assessment und Evaluation, Forschung / Evaluation
dc.identifier.doi10.48620/8367
dc.identifier.pmid39194037
dc.identifier.publisherDOI10.1097/EJA.0000000000002052
dc.identifier.urihttps://boris-portal.unibe.ch/handle/20.500.12422/44758
dc.language.isoen
dc.publisherLippincott, Williams & Wilkins
dc.relation.ispartofEuropean Journal of Anaesthesiology
dc.relation.issn0265-0215
dc.titleInfluence of pairing in examiner leniency and stringency ('hawk-dove effect') in part II of the European Diploma of Anaesthesiology and Intensive Care: A cohort study.
dc.typearticle
dspace.entity.typePublication
dspace.file.typetext
oaire.citation.endPage931
oaire.citation.issue12
oaire.citation.startPage921
oaire.citation.volume41
oairecerif.author.affiliationInstitut für Medizinische Lehre, Assessment und Evaluation, Forschung / Evaluation
unibe.contributor.roleauthor
unibe.contributor.roleauthor
unibe.contributor.roleauthor
unibe.contributor.roleauthor
unibe.contributor.roleauthor
unibe.contributor.roleauthor
unibe.description.ispublishedpub
unibe.refereetrue
unibe.subtype.articlejournal

Files

Original bundle
Now showing 1 - 1 of 1
Name:
influence_of_pairing_in_examiner_leniency_and.212.pdf
Size:
1.57 MB
Format:
Adobe Portable Document Format
File Type:
text
License:
https://creativecommons.org/licenses/by/4.0
Content:
published

Collections