Logistic regression and Bayesian networks to study outcomes using large data sets

Sun Mi Lee, Patricia Abbott, Mary Johantgen

Research output: Contribution to journal › Article

Abstract

Background: In nursing research, interest in using large health care databases to predict nursing-sensitive outcomes is growing rapidly. Traditionally, one of the most frequently used methods is logistic regression (LR), which, although powerful and familiar, has several limitations when used to analyze large databases. As a result, innovative approaches are required.

Approach: To (a) introduce an innovative, alternative data analysis approach, the Bayesian network (BN); (b) discuss the constraints of LR and the complementary advantages of BNs in working with large and multidimensional health care data; and (c) provide a fundamental understanding of the use of BNs in the nursing and health care domain.

Results: Studies have shown that BNs have several advantages over LR in analyzing large and complex data: (a) statistical assumptions, such as linearity and additivity, are relaxed; (b) handling a larger number of predictors and identifying interactions among predictors is less complex; and (c) the discovery of structure, pattern, and knowledge in data (for example, unknown, complex, and nonlinear relationships) is facilitated.

Conclusion: Outcome studies, such as those undertaken by nurse researchers, may benefit from examining and using innovative approaches such as BNs for the analysis of very large and complex health care data sets.
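The point about additivity and interactions can be illustrated with a small sketch. It is not taken from the article: the toy data, the function names (`fit_cpt`, `fit_additive_lr`, `predict_lr`), and the two binary risk factors `a` and `b` are all hypothetical. The sketch assumes an outcome that occurs only when exactly one of the two factors is present. It compares a conditional probability table, the kind of local model a BN node with two parents would carry, against an LR model deliberately restricted to main effects. LR can of course capture such a pattern once an interaction term is specified by the analyst; the sketch only shows what happens when it is not.

```python
import math
from collections import Counter

# Hypothetical toy data: outcome y occurs only when exactly one of two
# binary risk factors (a, b) is present, i.e. a pure interaction.
rows = [(a, b, int(a != b)) for a in (0, 1) for b in (0, 1)] * 50

def fit_cpt(rows):
    """Estimate P(y=1 | a, b) by counting, as for a BN node with parents a and b."""
    total, events = Counter(), Counter()
    for a, b, y in rows:
        total[(a, b)] += 1
        events[(a, b)] += y
    return {key: events[key] / total[key] for key in total}

def fit_additive_lr(rows, steps=2000, rate=0.5):
    """Fit logit P(y=1) = w0 + w1*a + w2*b (no interaction term) by gradient descent."""
    w = [0.5, -0.3, 0.2]  # arbitrary starting point
    n = len(rows)
    for _ in range(steps):
        grad = [0.0, 0.0, 0.0]
        for a, b, y in rows:
            x = (1.0, a, b)
            p = 1.0 / (1.0 + math.exp(-sum(wi * xi for wi, xi in zip(w, x))))
            for j in range(3):
                grad[j] += (p - y) * x[j] / n
        w = [wi - rate * gi for wi, gi in zip(w, grad)]
    return w

def predict_lr(w, a, b):
    """Predicted P(y=1) from the fitted main-effects model."""
    return 1.0 / (1.0 + math.exp(-(w[0] + w[1] * a + w[2] * b)))

cpt = fit_cpt(rows)
w = fit_additive_lr(rows)
for a, b in sorted(cpt):
    print(a, b, cpt[(a, b)], round(predict_lr(w, a, b), 3))
```

Under these assumptions, the table recovers the pattern directly from the counts, while the main-effects model's predicted probabilities stay close to 0.5 for every combination of `a` and `b`. Learning which variables should be parents of the outcome (structure discovery) is a separate step that this sketch does not attempt.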

Original language: English (US)
Pages (from-to): 133-138
Number of pages: 6
Journal: Nursing Research
Volume: 54
Issue number: 2
State: Published - Jan 1 2005

Keywords

  • Bayesian network
  • Large databases
  • Logistic regression
  • Nursing research
  • Outcomes
  • Prediction

ASJC Scopus subject areas

  • Nursing(all)
