Data-driven methods distort optimal cutoffs and accuracy estimates of depression screening tools: a simulation study using individual participant data

the Depression Screening Data (DEPRESSD) EPDS Group

doi:10.1016/j.jclinepi.2021.03.031

Data-driven methods distort optimal cutoffs and accuracy estimates of depression screening tools: a simulation study using individual participant data

the Depression Screening Data (DEPRESSD) EPDS Group

Research output: Contribution to journal › Article › peer-review

Abstract

Objective: To evaluate, across multiple sample sizes, the degree that data-driven methods result in (1) optimal cutoffs different from population optimal cutoff and (2) bias in accuracy estimates. Study design and setting: A total of 1,000 samples of sample size 100, 200, 500 and 1,000 each were randomly drawn to simulate studies of different sample sizes from a database (n = 13,255) synthesized to assess Edinburgh Postnatal Depression Scale (EPDS) screening accuracy. Optimal cutoffs were selected by maximizing Youden's J (sensitivity+specificity–1). Optimal cutoffs and accuracy estimates in simulated samples were compared to population values. Results: Optimal cutoffs in simulated samples ranged from ≥ 5 to ≥ 17 for n = 100, ≥ 6 to ≥ 16 for n = 200, ≥ 6 to ≥ 14 for n = 500, and ≥ 8 to ≥ 13 for n = 1,000. Percentage of simulated samples identifying the population optimal cutoff (≥ 11) was 30% for n = 100, 35% for n = 200, 53% for n = 500, and 71% for n = 1,000. Mean overestimation of sensitivity and underestimation of specificity were 6.5 percentage point (pp) and -1.3 pp for n = 100, 4.2 pp and -1.1 pp for n = 200, 1.8 pp and -1.0 pp for n = 500, and 1.4 pp and -1.0 pp for n = 1,000. Conclusions: Small accuracy studies may identify inaccurate optimal cutoff and overstate accuracy estimates with data-driven methods.

Original language	English (US)
Pages (from-to)	137-147
Number of pages	11
Journal	Journal of Clinical Epidemiology
Volume	137
DOIs	https://doi.org/10.1016/j.jclinepi.2021.03.031
State	Published - Sep 2021
Externally published	Yes

Keywords

Accuracy estimates
Bias
Cherry-picking
Data-driven methods
Depression
Optimal cutoff

ASJC Scopus subject areas

Epidemiology

Access to Document

10.1016/j.jclinepi.2021.03.031

Cite this

@article{9043e0dde18b4f1981e18f87f7d27c8f,

title = "Data-driven methods distort optimal cutoffs and accuracy estimates of depression screening tools: a simulation study using individual participant data",

abstract = "Objective: To evaluate, across multiple sample sizes, the degree that data-driven methods result in (1) optimal cutoffs different from population optimal cutoff and (2) bias in accuracy estimates. Study design and setting: A total of 1,000 samples of sample size 100, 200, 500 and 1,000 each were randomly drawn to simulate studies of different sample sizes from a database (n = 13,255) synthesized to assess Edinburgh Postnatal Depression Scale (EPDS) screening accuracy. Optimal cutoffs were selected by maximizing Youden's J (sensitivity+specificity–1). Optimal cutoffs and accuracy estimates in simulated samples were compared to population values. Results: Optimal cutoffs in simulated samples ranged from ≥ 5 to ≥ 17 for n = 100, ≥ 6 to ≥ 16 for n = 200, ≥ 6 to ≥ 14 for n = 500, and ≥ 8 to ≥ 13 for n = 1,000. Percentage of simulated samples identifying the population optimal cutoff (≥ 11) was 30% for n = 100, 35% for n = 200, 53% for n = 500, and 71% for n = 1,000. Mean overestimation of sensitivity and underestimation of specificity were 6.5 percentage point (pp) and -1.3 pp for n = 100, 4.2 pp and -1.1 pp for n = 200, 1.8 pp and -1.0 pp for n = 500, and 1.4 pp and -1.0 pp for n = 1,000. Conclusions: Small accuracy studies may identify inaccurate optimal cutoff and overstate accuracy estimates with data-driven methods.",

keywords = "Accuracy estimates, Bias, Cherry-picking, Data-driven methods, Depression, Optimal cutoff",

author = "{the Depression Screening Data (DEPRESSD) EPDS Group} and Bhandari, {Parash Mani} and Brooke Levis and Dipika Neupane and Patten, {Scott B.} and Ian Shrier and Thombs, {Brett D.} and Andrea Benedetti and Ying Sun and Chen He and Rice, {Danielle B.} and Ankur Krishnan and Yin Wu and Marleine Azar and Sanchez, {Tatiana A.} and Chiovitti, {Matthew J.} and Nazanin Saadat and Riehm, {Kira E.} and Mahrukh Imran and Zelalem Negeri and Boruff, {Jill T.} and Pim Cuijpers and Simon Gilbody and Ioannidis, {John P.A.} and Kloda, {Lorie A.} and Ziegelstein, {Roy C.} and Liane Comeau and Mitchell, {Nicholas D.} and Marcello Tonelli and Vigod, {Simone N.} and Franca Aceti and Rub{\'e}n Alvarado and Cosme Alvarado-Esquivel and Bakare, {Muideen O.} and Jacqueline Barnes and Bavle, {Amar D.} and Beck, {Cheryl Tatano} and Carola Bindt and Boyce, {Philip M.} and Adomas Bunevicius and {Castro e Couto}, Tiago and Chaudron, {Linda H.} and Humberto Correa and {de Figueiredo}, {Felipe Pinheiro} and Valsamma Eapen and Nicolas Favez and Ethel Felice and Michelle Fernandes and Barbara Figueiredo and Fisher, {Jane R.W.} and Tandon, {S. Darius}",

note = "Publisher Copyright: {\textcopyright} 2021 Elsevier Inc.",

year = "2021",

month = sep,

doi = "10.1016/j.jclinepi.2021.03.031",

language = "English (US)",

volume = "137",

pages = "137--147",

journal = "Journal of Clinical Epidemiology",

issn = "0895-4356",

publisher = "Elsevier USA",

}

TY - JOUR

T1 - Data-driven methods distort optimal cutoffs and accuracy estimates of depression screening tools

T2 - a simulation study using individual participant data

AU - the Depression Screening Data (DEPRESSD) EPDS Group

AU - Bhandari, Parash Mani

AU - Levis, Brooke

AU - Neupane, Dipika

AU - Patten, Scott B.

AU - Shrier, Ian

AU - Thombs, Brett D.

AU - Benedetti, Andrea

AU - Sun, Ying

AU - He, Chen

AU - Rice, Danielle B.

AU - Krishnan, Ankur

AU - Wu, Yin

AU - Azar, Marleine

AU - Sanchez, Tatiana A.

AU - Chiovitti, Matthew J.

AU - Saadat, Nazanin

AU - Riehm, Kira E.

AU - Imran, Mahrukh

AU - Negeri, Zelalem

AU - Boruff, Jill T.

AU - Cuijpers, Pim

AU - Gilbody, Simon

AU - Ioannidis, John P.A.

AU - Kloda, Lorie A.

AU - Ziegelstein, Roy C.

AU - Comeau, Liane

AU - Mitchell, Nicholas D.

AU - Tonelli, Marcello

AU - Vigod, Simone N.

AU - Aceti, Franca

AU - Alvarado, Rubén

AU - Alvarado-Esquivel, Cosme

AU - Bakare, Muideen O.

AU - Barnes, Jacqueline

AU - Bavle, Amar D.

AU - Beck, Cheryl Tatano

AU - Bindt, Carola

AU - Boyce, Philip M.

AU - Bunevicius, Adomas

AU - Castro e Couto, Tiago

AU - Chaudron, Linda H.

AU - Correa, Humberto

AU - de Figueiredo, Felipe Pinheiro

AU - Eapen, Valsamma

AU - Favez, Nicolas

AU - Felice, Ethel

AU - Fernandes, Michelle

AU - Figueiredo, Barbara

AU - Fisher, Jane R.W.

AU - Tandon, S. Darius

PY - 2021/9

Y1 - 2021/9

N2 - Objective: To evaluate, across multiple sample sizes, the degree that data-driven methods result in (1) optimal cutoffs different from population optimal cutoff and (2) bias in accuracy estimates. Study design and setting: A total of 1,000 samples of sample size 100, 200, 500 and 1,000 each were randomly drawn to simulate studies of different sample sizes from a database (n = 13,255) synthesized to assess Edinburgh Postnatal Depression Scale (EPDS) screening accuracy. Optimal cutoffs were selected by maximizing Youden's J (sensitivity+specificity–1). Optimal cutoffs and accuracy estimates in simulated samples were compared to population values. Results: Optimal cutoffs in simulated samples ranged from ≥ 5 to ≥ 17 for n = 100, ≥ 6 to ≥ 16 for n = 200, ≥ 6 to ≥ 14 for n = 500, and ≥ 8 to ≥ 13 for n = 1,000. Percentage of simulated samples identifying the population optimal cutoff (≥ 11) was 30% for n = 100, 35% for n = 200, 53% for n = 500, and 71% for n = 1,000. Mean overestimation of sensitivity and underestimation of specificity were 6.5 percentage point (pp) and -1.3 pp for n = 100, 4.2 pp and -1.1 pp for n = 200, 1.8 pp and -1.0 pp for n = 500, and 1.4 pp and -1.0 pp for n = 1,000. Conclusions: Small accuracy studies may identify inaccurate optimal cutoff and overstate accuracy estimates with data-driven methods.

AB - Objective: To evaluate, across multiple sample sizes, the degree that data-driven methods result in (1) optimal cutoffs different from population optimal cutoff and (2) bias in accuracy estimates. Study design and setting: A total of 1,000 samples of sample size 100, 200, 500 and 1,000 each were randomly drawn to simulate studies of different sample sizes from a database (n = 13,255) synthesized to assess Edinburgh Postnatal Depression Scale (EPDS) screening accuracy. Optimal cutoffs were selected by maximizing Youden's J (sensitivity+specificity–1). Optimal cutoffs and accuracy estimates in simulated samples were compared to population values. Results: Optimal cutoffs in simulated samples ranged from ≥ 5 to ≥ 17 for n = 100, ≥ 6 to ≥ 16 for n = 200, ≥ 6 to ≥ 14 for n = 500, and ≥ 8 to ≥ 13 for n = 1,000. Percentage of simulated samples identifying the population optimal cutoff (≥ 11) was 30% for n = 100, 35% for n = 200, 53% for n = 500, and 71% for n = 1,000. Mean overestimation of sensitivity and underestimation of specificity were 6.5 percentage point (pp) and -1.3 pp for n = 100, 4.2 pp and -1.1 pp for n = 200, 1.8 pp and -1.0 pp for n = 500, and 1.4 pp and -1.0 pp for n = 1,000. Conclusions: Small accuracy studies may identify inaccurate optimal cutoff and overstate accuracy estimates with data-driven methods.

KW - Accuracy estimates

KW - Bias

KW - Cherry-picking

KW - Data-driven methods

KW - Depression

KW - Optimal cutoff

UR - http://www.scopus.com/inward/record.url?scp=85105570449&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85105570449&partnerID=8YFLogxK

U2 - 10.1016/j.jclinepi.2021.03.031

DO - 10.1016/j.jclinepi.2021.03.031

M3 - Article

C2 - 33838273

AN - SCOPUS:85105570449

SN - 0895-4356

VL - 137

SP - 137

EP - 147

JO - Journal of Clinical Epidemiology

JF - Journal of Clinical Epidemiology

ER -

Data-driven methods distort optimal cutoffs and accuracy estimates of depression screening tools: a simulation study using individual participant data

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this