The impact of data quality on the identification of complex disease genes: Experience from the Family Blood Pressure Program

Yen Pei Christy Chang, James Dae Ok Kim, Karen Schwander, Dabeeru C. Rao, Mike B. Miller, Alan B. Weder, Richard S. Cooper, Nicholas J. Schork, Michael A. Province, Alanna C. Morrison, Sharon L R Kardia, Thomas Quertermous, Aravinda Chakravarti

Research output: Contribution to journalArticle

Abstract

The application of genome-wide linkage scans to uncover susceptibility loci for complex diseases offers great promise for the risk assessment, treatment, and understanding of these diseases. However, for most published studies, linkage signals are typically modest and vary considerably from one study to another. The multicenter Family Blood Pressure Program has analyzed genome-wide linkage scans of over 12 000 individuals. Based on this experience, we developed a protocol for large linkage studies that reduces two sources of data error: pedigree structure and marker genotyping errors. We then used the linkage signals, before and after data cleaning, to illustrate the impact of missing and erroneous data. A comprehensive error-checking protocol is an important part of complex disease linkage studies and enhances gene mapping. The lack of significant and reproducible linkage findings across studies is, in part, due to data quality.

Original languageEnglish (US)
Pages (from-to)469-477
Number of pages9
JournalEuropean Journal of Human Genetics
Volume14
Issue number4
DOIs
StatePublished - Apr 2006

Fingerprint

Blood Pressure
Genome
Genes
Chromosome Mapping
Information Storage and Retrieval
Pedigree
Research Design
Data Accuracy

Keywords

  • Data quality
  • Family relationship
  • Genome-wide linkage studies

ASJC Scopus subject areas

  • Genetics(clinical)

Cite this

Chang, Y. P. C., Kim, J. D. O., Schwander, K., Rao, D. C., Miller, M. B., Weder, A. B., ... Chakravarti, A. (2006). The impact of data quality on the identification of complex disease genes: Experience from the Family Blood Pressure Program. European Journal of Human Genetics, 14(4), 469-477. https://doi.org/10.1038/sj.ejhg.5201582

The impact of data quality on the identification of complex disease genes : Experience from the Family Blood Pressure Program. / Chang, Yen Pei Christy; Kim, James Dae Ok; Schwander, Karen; Rao, Dabeeru C.; Miller, Mike B.; Weder, Alan B.; Cooper, Richard S.; Schork, Nicholas J.; Province, Michael A.; Morrison, Alanna C.; Kardia, Sharon L R; Quertermous, Thomas; Chakravarti, Aravinda.

In: European Journal of Human Genetics, Vol. 14, No. 4, 04.2006, p. 469-477.

Research output: Contribution to journalArticle

Chang, YPC, Kim, JDO, Schwander, K, Rao, DC, Miller, MB, Weder, AB, Cooper, RS, Schork, NJ, Province, MA, Morrison, AC, Kardia, SLR, Quertermous, T & Chakravarti, A 2006, 'The impact of data quality on the identification of complex disease genes: Experience from the Family Blood Pressure Program', European Journal of Human Genetics, vol. 14, no. 4, pp. 469-477. https://doi.org/10.1038/sj.ejhg.5201582
Chang, Yen Pei Christy ; Kim, James Dae Ok ; Schwander, Karen ; Rao, Dabeeru C. ; Miller, Mike B. ; Weder, Alan B. ; Cooper, Richard S. ; Schork, Nicholas J. ; Province, Michael A. ; Morrison, Alanna C. ; Kardia, Sharon L R ; Quertermous, Thomas ; Chakravarti, Aravinda. / The impact of data quality on the identification of complex disease genes : Experience from the Family Blood Pressure Program. In: European Journal of Human Genetics. 2006 ; Vol. 14, No. 4. pp. 469-477.
@article{62f88d716f9345ff998a3550f928d504,
title = "The impact of data quality on the identification of complex disease genes: Experience from the Family Blood Pressure Program",
abstract = "The application of genome-wide linkage scans to uncover susceptibility loci for complex diseases offers great promise for the risk assessment, treatment, and understanding of these diseases. However, for most published studies, linkage signals are typically modest and vary considerably from one study to another. The multicenter Family Blood Pressure Program has analyzed genome-wide linkage scans of over 12 000 individuals. Based on this experience, we developed a protocol for large linkage studies that reduces two sources of data error: pedigree structure and marker genotyping errors. We then used the linkage signals, before and after data cleaning, to illustrate the impact of missing and erroneous data. A comprehensive error-checking protocol is an important part of complex disease linkage studies and enhances gene mapping. The lack of significant and reproducible linkage findings across studies is, in part, due to data quality.",
keywords = "Data quality, Family relationship, Genome-wide linkage studies",
author = "Chang, {Yen Pei Christy} and Kim, {James Dae Ok} and Karen Schwander and Rao, {Dabeeru C.} and Miller, {Mike B.} and Weder, {Alan B.} and Cooper, {Richard S.} and Schork, {Nicholas J.} and Province, {Michael A.} and Morrison, {Alanna C.} and Kardia, {Sharon L R} and Thomas Quertermous and Aravinda Chakravarti",
year = "2006",
month = "4",
doi = "10.1038/sj.ejhg.5201582",
language = "English (US)",
volume = "14",
pages = "469--477",
journal = "European Journal of Human Genetics",
issn = "1018-4813",
publisher = "Nature Publishing Group",
number = "4",

}

TY - JOUR

T1 - The impact of data quality on the identification of complex disease genes

T2 - Experience from the Family Blood Pressure Program

AU - Chang, Yen Pei Christy

AU - Kim, James Dae Ok

AU - Schwander, Karen

AU - Rao, Dabeeru C.

AU - Miller, Mike B.

AU - Weder, Alan B.

AU - Cooper, Richard S.

AU - Schork, Nicholas J.

AU - Province, Michael A.

AU - Morrison, Alanna C.

AU - Kardia, Sharon L R

AU - Quertermous, Thomas

AU - Chakravarti, Aravinda

PY - 2006/4

Y1 - 2006/4

N2 - The application of genome-wide linkage scans to uncover susceptibility loci for complex diseases offers great promise for the risk assessment, treatment, and understanding of these diseases. However, for most published studies, linkage signals are typically modest and vary considerably from one study to another. The multicenter Family Blood Pressure Program has analyzed genome-wide linkage scans of over 12 000 individuals. Based on this experience, we developed a protocol for large linkage studies that reduces two sources of data error: pedigree structure and marker genotyping errors. We then used the linkage signals, before and after data cleaning, to illustrate the impact of missing and erroneous data. A comprehensive error-checking protocol is an important part of complex disease linkage studies and enhances gene mapping. The lack of significant and reproducible linkage findings across studies is, in part, due to data quality.

AB - The application of genome-wide linkage scans to uncover susceptibility loci for complex diseases offers great promise for the risk assessment, treatment, and understanding of these diseases. However, for most published studies, linkage signals are typically modest and vary considerably from one study to another. The multicenter Family Blood Pressure Program has analyzed genome-wide linkage scans of over 12 000 individuals. Based on this experience, we developed a protocol for large linkage studies that reduces two sources of data error: pedigree structure and marker genotyping errors. We then used the linkage signals, before and after data cleaning, to illustrate the impact of missing and erroneous data. A comprehensive error-checking protocol is an important part of complex disease linkage studies and enhances gene mapping. The lack of significant and reproducible linkage findings across studies is, in part, due to data quality.

KW - Data quality

KW - Family relationship

KW - Genome-wide linkage studies

UR - http://www.scopus.com/inward/record.url?scp=33646182610&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33646182610&partnerID=8YFLogxK

U2 - 10.1038/sj.ejhg.5201582

DO - 10.1038/sj.ejhg.5201582

M3 - Article

C2 - 16493446

AN - SCOPUS:33646182610

VL - 14

SP - 469

EP - 477

JO - European Journal of Human Genetics

JF - European Journal of Human Genetics

SN - 1018-4813

IS - 4

ER -