Negative results of randomized clinical trials published in the surgical literature: Equivalency or error?

Justin B. Dimick; Marie Diener-West; Pamela A. Lipsett

doi:10.1001/archsurg.136.7.796

Negative results of randomized clinical trials published in the surgical literature: Equivalency or error?

Justin B. Dimick, Marie Diener-West, Pamela A. Lipsett

Research output: Contribution to journal › Article › peer-review

81 Scopus citations

Abstract

Hypothesis: We hypothesized that review of randomized controlled clinical trials (RCTs) with nonstatistically significant or "negative" results published in the surgical literature do not have appropriate statistical power to demonstrate equivalency between treatment arms. Data Sources and Study Selection: The MEDLINE database was searched to obtain reports of all RCTs with negative results published in 3 surgical journals from 1988 to 1998. Manual review of one year (1997) of publications for each journal was performed to validate our search strategy. Equivalency was evaluated using the Two One-Sided Tests Procedure and post hoc power calculations. Data Synthesis: Ninety reports of RCTs with negative results were identified in the surgical literature between 1988 and 1998. The manual review of 1997 showed a 100% retrieval rate for our search strategy. After applying the Two One-Sided Tests Procedure, 35 reports (39%) met the criteria for demonstrating equivalency. The other 55 reports (61%) contained at least a 10% absolute difference in the 90% confidence interval of Δ. Using the power calculation method, only 22 (24%) articles had a power greater than .80 to detect a 50% difference in therapeutic effect. Only 29% of the reports included a formal sample size calculation and these studies were more likely to demonstrate equivalency than those without a sample size estimate (P<.01). Conclusions: Many reports from negative RCTs published in the surgical literature lack sufficient statistical power to establish that clinically important differences are not present. Surgeons should perform appropriate sample size calculations when designing RCTs and recognize the utility of confidence intervals when reporting negative results.

Original language	English (US)
Pages (from-to)	796-800
Number of pages	5
Journal	Archives of surgery
Volume	136
Issue number	7
DOIs	https://doi.org/10.1001/archsurg.136.7.796
State	Published - 2001

ASJC Scopus subject areas

Surgery

Access to Document

10.1001/archsurg.136.7.796

Cite this

@article{bb6c1b17228a47cd85ab3e15675c8b90,

title = "Negative results of randomized clinical trials published in the surgical literature: Equivalency or error?",

abstract = "Hypothesis: We hypothesized that review of randomized controlled clinical trials (RCTs) with nonstatistically significant or {"}negative{"} results published in the surgical literature do not have appropriate statistical power to demonstrate equivalency between treatment arms. Data Sources and Study Selection: The MEDLINE database was searched to obtain reports of all RCTs with negative results published in 3 surgical journals from 1988 to 1998. Manual review of one year (1997) of publications for each journal was performed to validate our search strategy. Equivalency was evaluated using the Two One-Sided Tests Procedure and post hoc power calculations. Data Synthesis: Ninety reports of RCTs with negative results were identified in the surgical literature between 1988 and 1998. The manual review of 1997 showed a 100% retrieval rate for our search strategy. After applying the Two One-Sided Tests Procedure, 35 reports (39%) met the criteria for demonstrating equivalency. The other 55 reports (61%) contained at least a 10% absolute difference in the 90% confidence interval of Δ. Using the power calculation method, only 22 (24%) articles had a power greater than .80 to detect a 50% difference in therapeutic effect. Only 29% of the reports included a formal sample size calculation and these studies were more likely to demonstrate equivalency than those without a sample size estimate (P<.01). Conclusions: Many reports from negative RCTs published in the surgical literature lack sufficient statistical power to establish that clinically important differences are not present. Surgeons should perform appropriate sample size calculations when designing RCTs and recognize the utility of confidence intervals when reporting negative results.",

author = "Dimick, {Justin B.} and Marie Diener-West and Lipsett, {Pamela A.}",

year = "2001",

doi = "10.1001/archsurg.136.7.796",

language = "English (US)",

volume = "136",

pages = "796--800",

journal = "Archives of surgery",

issn = "0004-0010",

publisher = "American Medical Association",

number = "7",

}

TY - JOUR

T1 - Negative results of randomized clinical trials published in the surgical literature

T2 - Equivalency or error?

AU - Dimick, Justin B.

AU - Diener-West, Marie

AU - Lipsett, Pamela A.

PY - 2001

Y1 - 2001

N2 - Hypothesis: We hypothesized that review of randomized controlled clinical trials (RCTs) with nonstatistically significant or "negative" results published in the surgical literature do not have appropriate statistical power to demonstrate equivalency between treatment arms. Data Sources and Study Selection: The MEDLINE database was searched to obtain reports of all RCTs with negative results published in 3 surgical journals from 1988 to 1998. Manual review of one year (1997) of publications for each journal was performed to validate our search strategy. Equivalency was evaluated using the Two One-Sided Tests Procedure and post hoc power calculations. Data Synthesis: Ninety reports of RCTs with negative results were identified in the surgical literature between 1988 and 1998. The manual review of 1997 showed a 100% retrieval rate for our search strategy. After applying the Two One-Sided Tests Procedure, 35 reports (39%) met the criteria for demonstrating equivalency. The other 55 reports (61%) contained at least a 10% absolute difference in the 90% confidence interval of Δ. Using the power calculation method, only 22 (24%) articles had a power greater than .80 to detect a 50% difference in therapeutic effect. Only 29% of the reports included a formal sample size calculation and these studies were more likely to demonstrate equivalency than those without a sample size estimate (P<.01). Conclusions: Many reports from negative RCTs published in the surgical literature lack sufficient statistical power to establish that clinically important differences are not present. Surgeons should perform appropriate sample size calculations when designing RCTs and recognize the utility of confidence intervals when reporting negative results.

AB - Hypothesis: We hypothesized that review of randomized controlled clinical trials (RCTs) with nonstatistically significant or "negative" results published in the surgical literature do not have appropriate statistical power to demonstrate equivalency between treatment arms. Data Sources and Study Selection: The MEDLINE database was searched to obtain reports of all RCTs with negative results published in 3 surgical journals from 1988 to 1998. Manual review of one year (1997) of publications for each journal was performed to validate our search strategy. Equivalency was evaluated using the Two One-Sided Tests Procedure and post hoc power calculations. Data Synthesis: Ninety reports of RCTs with negative results were identified in the surgical literature between 1988 and 1998. The manual review of 1997 showed a 100% retrieval rate for our search strategy. After applying the Two One-Sided Tests Procedure, 35 reports (39%) met the criteria for demonstrating equivalency. The other 55 reports (61%) contained at least a 10% absolute difference in the 90% confidence interval of Δ. Using the power calculation method, only 22 (24%) articles had a power greater than .80 to detect a 50% difference in therapeutic effect. Only 29% of the reports included a formal sample size calculation and these studies were more likely to demonstrate equivalency than those without a sample size estimate (P<.01). Conclusions: Many reports from negative RCTs published in the surgical literature lack sufficient statistical power to establish that clinically important differences are not present. Surgeons should perform appropriate sample size calculations when designing RCTs and recognize the utility of confidence intervals when reporting negative results.

UR - http://www.scopus.com/inward/record.url?scp=0034939731&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034939731&partnerID=8YFLogxK

U2 - 10.1001/archsurg.136.7.796

DO - 10.1001/archsurg.136.7.796

M3 - Article

C2 - 11448393

AN - SCOPUS:0034939731

SN - 0004-0010

VL - 136

SP - 796

EP - 800

JO - Archives of surgery

JF - Archives of surgery

IS - 7

ER -

Negative results of randomized clinical trials published in the surgical literature: Equivalency or error?

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this