Solving genetic heterogeneity in extended families by identifying sub-types of complex diseases

Arafat Tayeb; Aurélie Labbe; Alexandre Bureau; Chantal Mérette

doi:10.1007/s00180-010-0224-2

Solving genetic heterogeneity in extended families by identifying sub-types of complex diseases

Arafat Tayeb, Aurélie Labbe, Alexandre Bureau, Chantal Mérette

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

The study of genetic properties of a disease requires the collection of information concerning the subjects in a set of pedigrees. The main focus of this study was the detection of susceptible genes. However, even with large pedigrees, the heterogeneity of phenotypes in complex diseases such as Schizophrenia, Bipolar and Autism, makes the detection of susceptible genes difficult to accomplish. This is mainly due to a genetic heterogeneity: many genes phenomena are involved in the disease. In order to reduce this heterogeneity, our idea consists in sub-typing the disease and in partitioning the population into more alike sub-groups. We developed a probabilistic model based on a Latent Class Analysis (LCA) that takes into account the familial dependence inside a pedigree, even for large pedigrees. It also takes into account individuals with missing and partially missing measurements. Estimation of model parameters is performed by an EM algorithm, and computations for the E step inside a pedigree are achieved using a pedigree peeling algorithm. When more than one model are fitted, we use model selection strategies such as cross-validation or/and BIC approaches to choose the suitable model among a set of candidates. Moreover, we present a simulation based on a genetic disease class model and we show that our model leads to better individual classification than the model that assumes independence among subjects. An application of our model to a Schizophrenia-Bipolar pedigree data set from Eastern Quebec is also performed.

Original language	English (US)
Pages (from-to)	539-560
Number of pages	22
Journal	Computational Statistics
Volume	26
Issue number	3
DOIs	https://doi.org/10.1007/s00180-010-0224-2
State	Published - Sep 2011
Externally published	Yes

Keywords

Familial dependence
Latent class model
Pedigree peeling
Triplet-transmission probability

ASJC Scopus subject areas

Statistics and Probability
Statistics, Probability and Uncertainty
Computational Mathematics

Access to Document

10.1007/s00180-010-0224-2

Cite this

@article{29de97318ab94dafb6e2943ed52f7a40,

title = "Solving genetic heterogeneity in extended families by identifying sub-types of complex diseases",

abstract = "The study of genetic properties of a disease requires the collection of information concerning the subjects in a set of pedigrees. The main focus of this study was the detection of susceptible genes. However, even with large pedigrees, the heterogeneity of phenotypes in complex diseases such as Schizophrenia, Bipolar and Autism, makes the detection of susceptible genes difficult to accomplish. This is mainly due to a genetic heterogeneity: many genes phenomena are involved in the disease. In order to reduce this heterogeneity, our idea consists in sub-typing the disease and in partitioning the population into more alike sub-groups. We developed a probabilistic model based on a Latent Class Analysis (LCA) that takes into account the familial dependence inside a pedigree, even for large pedigrees. It also takes into account individuals with missing and partially missing measurements. Estimation of model parameters is performed by an EM algorithm, and computations for the E step inside a pedigree are achieved using a pedigree peeling algorithm. When more than one model are fitted, we use model selection strategies such as cross-validation or/and BIC approaches to choose the suitable model among a set of candidates. Moreover, we present a simulation based on a genetic disease class model and we show that our model leads to better individual classification than the model that assumes independence among subjects. An application of our model to a Schizophrenia-Bipolar pedigree data set from Eastern Quebec is also performed.",

keywords = "Familial dependence, Latent class model, Pedigree peeling, Triplet-transmission probability",

author = "Arafat Tayeb and Aur{\'e}lie Labbe and Alexandre Bureau and Chantal M{\'e}rette",

year = "2011",

month = sep,

doi = "10.1007/s00180-010-0224-2",

language = "English (US)",

volume = "26",

pages = "539--560",

journal = "Computational Statistics",

issn = "0943-4062",

publisher = "Springer Verlag",

number = "3",

}

TY - JOUR

T1 - Solving genetic heterogeneity in extended families by identifying sub-types of complex diseases

AU - Tayeb, Arafat

AU - Labbe, Aurélie

AU - Bureau, Alexandre

AU - Mérette, Chantal

PY - 2011/9

Y1 - 2011/9

N2 - The study of genetic properties of a disease requires the collection of information concerning the subjects in a set of pedigrees. The main focus of this study was the detection of susceptible genes. However, even with large pedigrees, the heterogeneity of phenotypes in complex diseases such as Schizophrenia, Bipolar and Autism, makes the detection of susceptible genes difficult to accomplish. This is mainly due to a genetic heterogeneity: many genes phenomena are involved in the disease. In order to reduce this heterogeneity, our idea consists in sub-typing the disease and in partitioning the population into more alike sub-groups. We developed a probabilistic model based on a Latent Class Analysis (LCA) that takes into account the familial dependence inside a pedigree, even for large pedigrees. It also takes into account individuals with missing and partially missing measurements. Estimation of model parameters is performed by an EM algorithm, and computations for the E step inside a pedigree are achieved using a pedigree peeling algorithm. When more than one model are fitted, we use model selection strategies such as cross-validation or/and BIC approaches to choose the suitable model among a set of candidates. Moreover, we present a simulation based on a genetic disease class model and we show that our model leads to better individual classification than the model that assumes independence among subjects. An application of our model to a Schizophrenia-Bipolar pedigree data set from Eastern Quebec is also performed.

AB - The study of genetic properties of a disease requires the collection of information concerning the subjects in a set of pedigrees. The main focus of this study was the detection of susceptible genes. However, even with large pedigrees, the heterogeneity of phenotypes in complex diseases such as Schizophrenia, Bipolar and Autism, makes the detection of susceptible genes difficult to accomplish. This is mainly due to a genetic heterogeneity: many genes phenomena are involved in the disease. In order to reduce this heterogeneity, our idea consists in sub-typing the disease and in partitioning the population into more alike sub-groups. We developed a probabilistic model based on a Latent Class Analysis (LCA) that takes into account the familial dependence inside a pedigree, even for large pedigrees. It also takes into account individuals with missing and partially missing measurements. Estimation of model parameters is performed by an EM algorithm, and computations for the E step inside a pedigree are achieved using a pedigree peeling algorithm. When more than one model are fitted, we use model selection strategies such as cross-validation or/and BIC approaches to choose the suitable model among a set of candidates. Moreover, we present a simulation based on a genetic disease class model and we show that our model leads to better individual classification than the model that assumes independence among subjects. An application of our model to a Schizophrenia-Bipolar pedigree data set from Eastern Quebec is also performed.

KW - Familial dependence

KW - Latent class model

KW - Pedigree peeling

KW - Triplet-transmission probability

UR - http://www.scopus.com/inward/record.url?scp=79960604826&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79960604826&partnerID=8YFLogxK

U2 - 10.1007/s00180-010-0224-2

DO - 10.1007/s00180-010-0224-2

M3 - Article

AN - SCOPUS:79960604826

SN - 0943-4062

VL - 26

SP - 539

EP - 560

JO - Computational Statistics

JF - Computational Statistics

IS - 3

ER -

Solving genetic heterogeneity in extended families by identifying sub-types of complex diseases

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this