Artificial Intelligence for Retinopathy of Prematurity: Validation of a Vascular Severity Scale against International Expert Diagnosis
File version
Author(s)
Chiang, Michael F
Chen, Jimmy S
Moshfeghi, Darius M
Nudleman, Eric
Ruambivoonsuk, Paisan
Cherwek, Hunter
Cheung, Carol Y
Singh, Praveer
Kalpathy-Cramer, Jayashree
Ostmo, Susan
Eydelman, Malvina
Chan, RV Paul
Capone, Antonio
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
License
Abstract
Purpose To validate a vascular severity score as an appropriate output for artificial intelligence (AI) Software as a Medical Device (SaMD) for retinopathy of prematurity (ROP) through comparison with ordinal disease severity labels for stage and plus disease assigned by the International Classification of Retinopathy of Prematurity, Third Edition (ICROP3), committee.
Design Validation study of an AI-based ROP vascular severity score.
Participants A total of 34 ROP experts from the ICROP3 committee.
Methods Two separate datasets of 30 fundus photographs each for stage (0–5) and plus disease (plus, preplus, neither) were labeled by members of the ICROP3 committee using an open-source platform. Averaging these results produced a continuous label for plus (1–9) and stage (1–3) for each image. Experts were also asked to compare each image to each other in terms of relative severity for plus disease. Each image was also labeled with a vascular severity score from the Imaging and Informatics in ROP deep learning system, which was compared with each grader’s diagnostic labels for correlation, as well as the ophthalmoscopic diagnosis of stage.
Main Outcome Measures Weighted kappa and Pearson correlation coefficients (CCs) were calculated between each pair of grader classification labels for stage and plus disease. The Elo algorithm was also used to convert pairwise comparisons for each expert into an ordered set of images from least to most severe.
Results The mean weighted kappa and CC for all interobserver pairs for plus disease image comparison were 0.67 and 0.88, respectively. The vascular severity score was found to be highly correlated with both the average plus disease classification (CC = 0.90, P < 0.001) and the ophthalmoscopic diagnosis of stage ( P < 0.001 by analysis of variance) among all experts.
Conclusions The ROP vascular severity score correlates well with the International Classification of Retinopathy of Prematurity committee member’s labels for plus disease and stage, which had significant intergrader variability. Generation of a consensus for a validated scoring system for ROP SaMD can facilitate global innovation and regulatory authorization of these technologies.
Journal Title
Ophthalmology
Conference Title
Book Title
Edition
Volume
129
Issue
7
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
Item Access Status
Note
Access the data
Related item(s)
Subject
Persistent link to this record
Citation
Campbell, JP; Chiang, MF; Chen, JS; Moshfeghi, DM; Nudleman, E; Ruambivoonsuk, P; Cherwek, H; Cheung, CY; Singh, P; Kalpathy-Cramer, J; Ostmo, S; Eydelman, M; Chan, RVP; Capone, A, Artificial Intelligence for Retinopathy of Prematurity: Validation of a Vascular Severity Scale against International Expert Diagnosis, Ophthalmology, 2022, 129 (7), pp. e69-e76