Application of generalizability theory confirmed lower reliability of the standard gamble than the feeling thermometer

Holger J. Schünemann; Geoff Norman; Milo A. Puhan; Elisabeth Ståhl; Lauren Griffith; Diane Heels-Ansdell; Victor M. Montori; Ingela Wiklund; Roger Goldstein; M. Jeffery Mador; Gordon H. Guyatt

doi:10.1016/j.jclinepi.2007.03.010

Research output: Contribution to journal › Article › peer-review

8 Scopus citations

Abstract

Objectives: Recent studies suggest that rating clinical marker states (CMS) does not improve the measurement properties of the standard gamble (SG) and only slightly improves those of the feeling thermometer (FT). The poor intrarater (test-retest) reliability of CMS may explain their meager performance. Further, lack of interrater reliability may compromise the use of CMS in interpreting health state ratings. The aim of this study was to assess the reliability of CMS ratings for the SG and the FT.

Study Design and Setting: Two similar studies in patients with chronic obstructive pulmonary disease (COPD, n = 91) and in patients with gastroesophageal reflux disease (GERD, n = 112) provided data for this analysis. Patients rated three different CMS (mild, moderate, and severe disease) twice several weeks apart. We used generalizability theory to calculate reliability coefficients.

Results: Test-retest reliability for CMS ratings was higher for the FT compared to the SG (COPD: 0.86 vs. 0.67; GERD: 0.86 vs. 0.67). Interrater reliability was much higher for the FT compared to the SG (COPD: 0.78 vs. 0.46; GERD: 0.71 vs. 0.26).

Conclusions: These results suggest that the markedly poorer reliability of CMS for the SG than the FT is driven largely by poor interrater reliability.

Original language	English (US)
Pages (from-to)	1256-1262
Number of pages	7
Journal	Journal of Clinical Epidemiology
Volume	60
Issue number	12
DOIs	https://doi.org/10.1016/j.jclinepi.2007.03.010
State	Published - Dec 2007
Externally published	Yes

Keywords

Preference-based instruments
Reliability
Standard gamble
Utilities
Visual analogue scale

ASJC Scopus subject areas

General Medicine
Public Health, Environmental and Occupational Health
Epidemiology

Cite this

Schünemann, H. J., Norman, G., Puhan, M. A., Ståhl, E., Griffith, L., Heels-Ansdell, D., Montori, V. M., Wiklund, I., Goldstein, R., Mador, M. J., & Guyatt, G. H. (2007). Application of generalizability theory confirmed lower reliability of the standard gamble than the feeling thermometer. Journal of Clinical Epidemiology, 60(12), 1256-1262. https://doi.org/10.1016/j.jclinepi.2007.03.010

Schünemann, HJ, Norman, G, Puhan, MA, Ståhl, E, Griffith, L, Heels-Ansdell, D, Montori, VM, Wiklund, I, Goldstein, R, Mador, MJ & Guyatt, GH 2007, 'Application of generalizability theory confirmed lower reliability of the standard gamble than the feeling thermometer', Journal of Clinical Epidemiology, vol. 60, no. 12, pp. 1256-1262. https://doi.org/10.1016/j.jclinepi.2007.03.010

@article{8ae4e8cd039948d1bb537970e62519c8,

title = "Application of generalizability theory confirmed lower reliability of the standard gamble than the feeling thermometer",

abstract = "Objectives: Recent studies suggest that rating clinical marker states (CMS) does not improve the measurement properties of the standard gamble (SG) and only slightly improves those of the feeling thermometer (FT). The poor intrarater (test-retest) reliability of CMS may explain their meager performance. Further, lack of interrater reliability may compromise the use of CMS in interpreting health state ratings. The aim of this study was to assess the reliability of CMS ratings for the SG and the FT. Study Design and Setting: Two similar studies in patients with chronic obstructive pulmonary disease (COPD, n = 91) and in patients with gastroesophageal reflux disease (GERD, n = 112) provided data for this analysis. Patients rated three different CMS (mild, moderate, and severe disease) twice several weeks apart. We used generalizability theory to calculate reliability coefficients. Results: Test-retest reliability for CMS ratings was higher for the FT compared to the SG (COPD: 0.86 vs. 0.67; GERD: 0.86 vs. 0.67). Interrater reliability was much higher for the FT compared to the SG (COPD: 0.78 vs. 0.46; GERD: 0.71 vs. 0.26). Conclusions: These results suggest that the markedly poorer reliability of CMS for the SG than the FT is driven largely by poor interrater reliability.",

keywords = "Preference-based instruments, Reliability, Standard gamble, Utilities, Visual analogue scale",

author = "Sch{\"u}nemann, {Holger J.} and Geoff Norman and Puhan, {Milo A.} and Elisabeth St{\aa}hl and Lauren Griffith and Diane Heels-Ansdell and Montori, {Victor M.} and Ingela Wiklund and Roger Goldstein and Mador, {M. Jeffery} and Guyatt, {Gordon H.}",

year = "2007",

month = dec,

doi = "10.1016/j.jclinepi.2007.03.010",

language = "English (US)",

volume = "60",

pages = "1256--1262",

journal = "Journal of Clinical Epidemiology",

issn = "0895-4356",

publisher = "Elsevier USA",

number = "12",

}

TY - JOUR

T1 - Application of generalizability theory confirmed lower reliability of the standard gamble than the feeling thermometer

AU - Schünemann, Holger J.

AU - Norman, Geoff

AU - Puhan, Milo A.

AU - Ståhl, Elisabeth

AU - Griffith, Lauren

AU - Heels-Ansdell, Diane

AU - Montori, Victor M.

AU - Wiklund, Ingela

AU - Goldstein, Roger

AU - Mador, M. Jeffery

AU - Guyatt, Gordon H.

PY - 2007/12

Y1 - 2007/12

N2 - Objectives: Recent studies suggest that rating clinical marker states (CMS) does not improve the measurement properties of the standard gamble (SG) and only slightly improves those of the feeling thermometer (FT). The poor intrarater (test-retest) reliability of CMS may explain their meager performance. Further, lack of interrater reliability may compromise the use of CMS in interpreting health state ratings. The aim of this study was to assess the reliability of CMS ratings for the SG and the FT. Study Design and Setting: Two similar studies in patients with chronic obstructive pulmonary disease (COPD, n = 91) and in patients with gastroesophageal reflux disease (GERD, n = 112) provided data for this analysis. Patients rated three different CMS (mild, moderate, and severe disease) twice several weeks apart. We used generalizability theory to calculate reliability coefficients. Results: Test-retest reliability for CMS ratings was higher for the FT compared to the SG (COPD: 0.86 vs. 0.67; GERD: 0.86 vs. 0.67). Interrater reliability was much higher for the FT compared to the SG (COPD: 0.78 vs. 0.46; GERD: 0.71 vs. 0.26). Conclusions: These results suggest that the markedly poorer reliability of CMS for the SG than the FT is driven largely by poor interrater reliability.

AB - Objectives: Recent studies suggest that rating clinical marker states (CMS) does not improve the measurement properties of the standard gamble (SG) and only slightly improves those of the feeling thermometer (FT). The poor intrarater (test-retest) reliability of CMS may explain their meager performance. Further, lack of interrater reliability may compromise the use of CMS in interpreting health state ratings. The aim of this study was to assess the reliability of CMS ratings for the SG and the FT. Study Design and Setting: Two similar studies in patients with chronic obstructive pulmonary disease (COPD, n = 91) and in patients with gastroesophageal reflux disease (GERD, n = 112) provided data for this analysis. Patients rated three different CMS (mild, moderate, and severe disease) twice several weeks apart. We used generalizability theory to calculate reliability coefficients. Results: Test-retest reliability for CMS ratings was higher for the FT compared to the SG (COPD: 0.86 vs. 0.67; GERD: 0.86 vs. 0.67). Interrater reliability was much higher for the FT compared to the SG (COPD: 0.78 vs. 0.46; GERD: 0.71 vs. 0.26). Conclusions: These results suggest that the markedly poorer reliability of CMS for the SG than the FT is driven largely by poor interrater reliability.

KW - Preference-based instruments

KW - Reliability

KW - Standard gamble

KW - Utilities

KW - Visual analogue scale

UR - http://www.scopus.com/inward/record.url?scp=36048988376&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=36048988376&partnerID=8YFLogxK

U2 - 10.1016/j.jclinepi.2007.03.010

DO - 10.1016/j.jclinepi.2007.03.010

M3 - Article

C2 - 17998080

AN - SCOPUS:36048988376

SN - 0895-4356

VL - 60

SP - 1256

EP - 1262

JO - Journal of Clinical Epidemiology

JF - Journal of Clinical Epidemiology

IS - 12

ER -
