Crowdsourcing to Evaluate Fundus Photographs for the Presence of Glaucoma

Xueyang Wang; Lucy I. Mudie; Mani Baskaran; Ching Yu Cheng; Wallace L. Alward; David S. Friedman; Christopher J. Brady

doi:10.1097/IJG.0000000000000660

School of Medicine

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

Purpose: To assess the accuracy of crowdsourcing for grading optic nerve images for glaucoma using Amazon Mechanical Turk before and after training modules.

Materials and Methods: Images (n=60) from 2 large population studies were graded for glaucoma status and vertical cup-to-disc ratio (VCDR). In the baseline trial, users on Amazon Mechanical Turk (Turkers) graded fundus photos for glaucoma and VCDR after reviewing annotated example images. In 2 additional trials, Turkers viewed a 26-slide PowerPoint training or a 10-minute video training and passed a quiz before being permitted to grade the same 60 images. Each image was graded by 10 unique Turkers in all trials. The mode of Turker grades for each image was compared with an adjudicated expert grade to determine accuracy as well as the sensitivity and specificity of Turker grading.

Results: In the baseline study, 50% of the images were graded correctly for glaucoma status and the area under the receiver operating characteristic (AUROC) was 0.75 [95% confidence interval (CI), 0.64-0.87]. Post-PowerPoint training, 66.7% of the images were graded correctly with AUROC of 0.86 (95% CI, 0.78-0.95). Finally, Turker grading accuracy was 63.3% with AUROC of 0.89 (95% CI, 0.83-0.96) after video training. Overall, Turker VCDR grades for each image correlated with expert VCDR grades (Bland-Altman plot mean difference=-0.02).

Conclusions: Turkers graded 60 fundus images quickly and at low cost, with grading accuracy, sensitivity, and specificity, all improving with brief training. With effective education, crowdsourcing may be an efficient tool to aid in the identification of glaucomatous changes in retinal images.
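The abstract describes taking the mode of the 10 Turker grades for each image and comparing it with the adjudicated expert grade. The following is a minimal sketch of that kind of scoring, not the authors' analysis code. It assumes a binary grade (1 = glaucoma, 0 = no glaucoma) and a hypothetical tie-breaking rule (an even split counts as glaucoma); the abstract specifies neither. The function names and the four-image toy data are invented for illustration and are not study data.

```python
from collections import Counter


def modal_grade(grades, tie_break=1):
    """Return the most common grade among the crowd grades for one image.

    tie_break is an assumption: the abstract does not say how ties were resolved.
    """
    ranked = Counter(grades).most_common()
    if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
        return tie_break
    return ranked[0][0]


def score(crowd_grades, expert_grades):
    """Compare per-image modal crowd grades with expert grades.

    Assumes at least one expert-positive and one expert-negative image,
    otherwise sensitivity or specificity would divide by zero.
    """
    consensus = [modal_grade(g) for g in crowd_grades]
    pairs = list(zip(consensus, expert_grades))
    tp = sum(c == 1 and e == 1 for c, e in pairs)
    tn = sum(c == 0 and e == 0 for c, e in pairs)
    fp = sum(c == 1 and e == 0 for c, e in pairs)
    fn = sum(c == 0 and e == 1 for c, e in pairs)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Toy data (hypothetical): 4 images, 10 crowd grades each.
crowd = [
    [1] * 7 + [0] * 3,  # clear majority for glaucoma
    [1] * 4 + [0] * 6,  # majority against glaucoma
    [1] * 2 + [0] * 8,  # clear majority against glaucoma
    [1] * 5 + [0] * 5,  # tie, resolved by tie_break
]
expert = [1, 1, 0, 0]
result = score(crowd, expert)
print(result)
```

With 60 images, the reported accuracies of 50%, 66.7%, and 63.3% are consistent with 30, 40, and 38 correctly graded images, respectively.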

Original language	English (US)
Pages (from-to)	505-510
Number of pages	6
Journal	Journal of Glaucoma
Volume	26
Issue number	6
DOIs	https://doi.org/10.1097/IJG.0000000000000660
State	Published - 2017

Keywords

crowdsourcing
image analysis
teleglaucoma

ASJC Scopus subject areas

Ophthalmology

Cite this

@article{90de31769fcb41e1b9c191ae35c1fa54,

title = "Crowdsourcing to Evaluate Fundus Photographs for the Presence of Glaucoma",

abstract = "Purpose: To assess the accuracy of crowdsourcing for grading optic nerve images for glaucoma using Amazon Mechanical Turk before and after training modules. Materials and Methods: Images (n=60) from 2 large population studies were graded for glaucoma status and vertical cup-to-disc ratio (VCDR). In the baseline trial, users on Amazon Mechanical Turk (Turkers) graded fundus photos for glaucoma and VCDR after reviewing annotated example images. In 2 additional trials, Turkers viewed a 26-slide PowerPoint training or a 10-minute video training and passed a quiz before being permitted to grade the same 60 images. Each image was graded by 10 unique Turkers in all trials. The mode of Turker grades for each image was compared with an adjudicated expert grade to determine accuracy as well as the sensitivity and specificity of Turker grading. Results: In the baseline study, 50% of the images were graded correctly for glaucoma status and the area under the receiver operating characteristic (AUROC) was 0.75 [95% confidence interval (CI), 0.64-0.87]. Post-PowerPoint training, 66.7% of the images were graded correctly with AUROC of 0.86 (95% CI, 0.78-0.95). Finally, Turker grading accuracy was 63.3% with AUROC of 0.89 (95% CI, 0.83-0.96) after video training. Overall, Turker VCDR grades for each image correlated with expert VCDR grades (Bland-Altman plot mean difference=-0.02). Conclusions: Turkers graded 60 fundus images quickly and at low cost, with grading accuracy, sensitivity, and specificity, all improving with brief training. With effective education, crowdsourcing may be an efficient tool to aid in the identification of glaucomatous changes in retinal images.",

keywords = "crowdsourcing, image analysis, teleglaucoma",

author = "Xueyang Wang and Mudie, {Lucy I.} and Mani Baskaran and Cheng, {Ching Yu} and Alward, {Wallace L.} and Friedman, {David S.} and Brady, {Christopher J.}",

note = "Funding Information: This publication was made possible by the Johns Hopkins Institute for Clinical and Translational Research (ICTR) which is funded in part by Grant Number KL2TR001077 from the National Center for Advancing Translational Sciences (NCATS) a component of the National Institutes of Health (NIH), and NIH Roadmap for Medical Research. Its contents are solely the responsibility of the authors and do not necessarily represent the official view of the Johns Hopkins ICTR, NCATS, or NIH. Publisher Copyright: {\textcopyright} 2017 Wolters Kluwer Health, Inc. All rights reserved.",

year = "2017",

doi = "10.1097/IJG.0000000000000660",

language = "English (US)",

volume = "26",

pages = "505--510",

journal = "Journal of Glaucoma",

issn = "1057-0829",

publisher = "Lippincott Williams and Wilkins",

number = "6",

}

TY - JOUR

T1 - Crowdsourcing to Evaluate Fundus Photographs for the Presence of Glaucoma

AU - Wang, Xueyang

AU - Mudie, Lucy I.

AU - Baskaran, Mani

AU - Cheng, Ching Yu

AU - Alward, Wallace L.

AU - Friedman, David S.

AU - Brady, Christopher J.

N1 - Funding Information: This publication was made possible by the Johns Hopkins Institute for Clinical and Translational Research (ICTR) which is funded in part by Grant Number KL2TR001077 from the National Center for Advancing Translational Sciences (NCATS) a component of the National Institutes of Health (NIH), and NIH Roadmap for Medical Research. Its contents are solely the responsibility of the authors and do not necessarily represent the official view of the Johns Hopkins ICTR, NCATS, or NIH. Publisher Copyright: © 2017 Wolters Kluwer Health, Inc. All rights reserved.

PY - 2017

Y1 - 2017

AB - Purpose: To assess the accuracy of crowdsourcing for grading optic nerve images for glaucoma using Amazon Mechanical Turk before and after training modules. Materials and Methods: Images (n=60) from 2 large population studies were graded for glaucoma status and vertical cup-to-disc ratio (VCDR). In the baseline trial, users on Amazon Mechanical Turk (Turkers) graded fundus photos for glaucoma and VCDR after reviewing annotated example images. In 2 additional trials, Turkers viewed a 26-slide PowerPoint training or a 10-minute video training and passed a quiz before being permitted to grade the same 60 images. Each image was graded by 10 unique Turkers in all trials. The mode of Turker grades for each image was compared with an adjudicated expert grade to determine accuracy as well as the sensitivity and specificity of Turker grading. Results: In the baseline study, 50% of the images were graded correctly for glaucoma status and the area under the receiver operating characteristic (AUROC) was 0.75 [95% confidence interval (CI), 0.64-0.87]. Post-PowerPoint training, 66.7% of the images were graded correctly with AUROC of 0.86 (95% CI, 0.78-0.95). Finally, Turker grading accuracy was 63.3% with AUROC of 0.89 (95% CI, 0.83-0.96) after video training. Overall, Turker VCDR grades for each image correlated with expert VCDR grades (Bland-Altman plot mean difference=-0.02). Conclusions: Turkers graded 60 fundus images quickly and at low cost, with grading accuracy, sensitivity, and specificity, all improving with brief training. With effective education, crowdsourcing may be an efficient tool to aid in the identification of glaucomatous changes in retinal images.

KW - crowdsourcing

KW - image analysis

KW - teleglaucoma

UR - http://www.scopus.com/inward/record.url?scp=85015621493&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85015621493&partnerID=8YFLogxK

U2 - 10.1097/IJG.0000000000000660

DO - 10.1097/IJG.0000000000000660

M3 - Article

C2 - 28319525

AN - SCOPUS:85015621493

SN - 1057-0829

VL - 26

SP - 505

EP - 510

JO - Journal of Glaucoma

JF - Journal of Glaucoma

IS - 6

ER -
