TY - JOUR
T1 - The Use of Logic Regression in Epidemiologic Studies to Investigate Multiple Binary Exposures
T2 - An Example of Occupation History and Amyotrophic Lateral Sclerosis
AU - Bellavia, Andrea
AU - Rotem, Ran S.
AU - Dickerson, Aisha S.
AU - Hansen, Johnni
AU - Gredal, Ole
AU - Weisskopf, Marc G.
N1 - Publisher Copyright:
© 2020 Walter de Gruyter GmbH, Berlin/Boston.
PY - 2020/1/1
Y1 - 2020/1/1
N2 - Investigating the joint exposure to several risk factors is becoming a key component of epidemiologic studies. Individuals are exposed to multiple factors, often simultaneously, and evaluating patterns of exposures and high-dimension interactions may allow for a better understanding of health risks at the individual level. When jointly evaluating high-dimensional exposures, common statistical methods should be integrated with machine learning techniques that may better account for complex settings. Among these, Logic regression was developed to investigate a large number of binary exposures as they relate to a given outcome. This method may be of interest in several public health settings, yet has never been presented to an epidemiologic audience. In this paper, we review and discuss Logic regression as a potential tool for epidemiological studies, using an example of occupation history (68 binary exposures of primary occupations) and amyotrophic lateral sclerosis in a population-based Danish cohort. Logic regression identifies predictors that are Boolean combinations of the original (binary) exposures, fully operating within the regression framework of interest (e. g. linear, logistic). Combinations of exposures are graphically presented as Logic trees, and techniques for selecting the best Logic model are available and of high importance. While highlighting several advantages of the method, we also discuss specific drawbacks and practical issues that should be considered when using Logic regression in population-based studies. With this paper, we encourage researchers to explore the use of machine learning techniques when evaluating large-dimensional epidemiologic data, as well as advocate the need of further methodological work in the area.
AB - Investigating the joint exposure to several risk factors is becoming a key component of epidemiologic studies. Individuals are exposed to multiple factors, often simultaneously, and evaluating patterns of exposures and high-dimension interactions may allow for a better understanding of health risks at the individual level. When jointly evaluating high-dimensional exposures, common statistical methods should be integrated with machine learning techniques that may better account for complex settings. Among these, Logic regression was developed to investigate a large number of binary exposures as they relate to a given outcome. This method may be of interest in several public health settings, yet has never been presented to an epidemiologic audience. In this paper, we review and discuss Logic regression as a potential tool for epidemiological studies, using an example of occupation history (68 binary exposures of primary occupations) and amyotrophic lateral sclerosis in a population-based Danish cohort. Logic regression identifies predictors that are Boolean combinations of the original (binary) exposures, fully operating within the regression framework of interest (e. g. linear, logistic). Combinations of exposures are graphically presented as Logic trees, and techniques for selecting the best Logic model are available and of high importance. While highlighting several advantages of the method, we also discuss specific drawbacks and practical issues that should be considered when using Logic regression in population-based studies. With this paper, we encourage researchers to explore the use of machine learning techniques when evaluating large-dimensional epidemiologic data, as well as advocate the need of further methodological work in the area.
KW - Logic regression
KW - amyotrophic lateral sclerosis
KW - big data
KW - machine learning
KW - occupational epidemiology
UR - http://www.scopus.com/inward/record.url?scp=85081716027&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85081716027&partnerID=8YFLogxK
U2 - 10.1515/em-2019-0032
DO - 10.1515/em-2019-0032
M3 - Article
C2 - 33224709
AN - SCOPUS:85081716027
SN - 2194-9263
VL - 9
JO - Epidemiologic Methods
JF - Epidemiologic Methods
IS - 1
M1 - 20190032
ER -