Incorporating individual error rate into association test of unmatched case-control design

Ke Hao, Xiaobin Wang

Research output: Contribution to journalArticle

Abstract

Objectives: Genotyping error commonly occurs and could reduce the power and bias statistical inference in genetics studies. In addition to genotypes, some automated biotechnologies also provide quality measurement of each individual genotype. We studied the relationship between the quality measurement and genotyping error rate. Furthermore, we propose two association tests incorporating the genotyping quality information with the goal to improve statistical power and inference. Methods: 50 pairs of DNA sample duplicates were typed for 232 SNPs by BeadArray technology. We used scatter plot, smoothing function and generalized additive models to investigate the relationship between genotype quality score (g) and inconsistency rate (ĩ) among duplicates. We constructed two association tests: (1) weighted contingency table test (WCT) and (2) likelihood ratio test (LRT) to incorporate individual genotype error rate (εi), in unmatched case-control setting. Results: In the 50 duplicates, we found q and ĩ were in strong negative association, suggesting the genotypes with low quality score were more likely to be mistyped. The WCT improved the statistical power and partially corrects the bias in point estimation. The LRT offered moderate power gain, but was able to correct the bias in odds ratio estimation. The two new methods also performed favorably in some scenarios when εi, was mis-specified. Conclusions: With increasing number of genetic studies and application of automated genotyping technology, there is a growing need to adequately account for individual genotype error rate in statistical analysis. Our study represents an initial step to address this need and points out a promising direction for further research.

Original languageEnglish (US)
Pages (from-to)154-163
Number of pages10
JournalHuman Heredity
Volume58
Issue number3-4
DOIs
StatePublished - Mar 2004
Externally publishedYes

Fingerprint

Genotype
Technology
Biotechnology
Single Nucleotide Polymorphism
Odds Ratio
DNA
Research

Keywords

  • Automated genotyping biotechnology
  • Contingency table
  • Genotype quality score
  • Genotyping error rate
  • Likelihood ratio test
  • Unmatched case-control

ASJC Scopus subject areas

  • Genetics(clinical)

Cite this

Incorporating individual error rate into association test of unmatched case-control design. / Hao, Ke; Wang, Xiaobin.

In: Human Heredity, Vol. 58, No. 3-4, 03.2004, p. 154-163.

Research output: Contribution to journalArticle

@article{84226298ccc347e282cd8395df6569fa,
title = "Incorporating individual error rate into association test of unmatched case-control design",
abstract = "Objectives: Genotyping error commonly occurs and could reduce the power and bias statistical inference in genetics studies. In addition to genotypes, some automated biotechnologies also provide quality measurement of each individual genotype. We studied the relationship between the quality measurement and genotyping error rate. Furthermore, we propose two association tests incorporating the genotyping quality information with the goal to improve statistical power and inference. Methods: 50 pairs of DNA sample duplicates were typed for 232 SNPs by BeadArray technology. We used scatter plot, smoothing function and generalized additive models to investigate the relationship between genotype quality score (g) and inconsistency rate (ĩ) among duplicates. We constructed two association tests: (1) weighted contingency table test (WCT) and (2) likelihood ratio test (LRT) to incorporate individual genotype error rate (εi), in unmatched case-control setting. Results: In the 50 duplicates, we found q and ĩ were in strong negative association, suggesting the genotypes with low quality score were more likely to be mistyped. The WCT improved the statistical power and partially corrects the bias in point estimation. The LRT offered moderate power gain, but was able to correct the bias in odds ratio estimation. The two new methods also performed favorably in some scenarios when εi, was mis-specified. Conclusions: With increasing number of genetic studies and application of automated genotyping technology, there is a growing need to adequately account for individual genotype error rate in statistical analysis. Our study represents an initial step to address this need and points out a promising direction for further research.",
keywords = "Automated genotyping biotechnology, Contingency table, Genotype quality score, Genotyping error rate, Likelihood ratio test, Unmatched case-control",
author = "Ke Hao and Xiaobin Wang",
year = "2004",
month = "3",
doi = "10.1159/000083542",
language = "English (US)",
volume = "58",
pages = "154--163",
journal = "Human Heredity",
issn = "0001-5652",
publisher = "S. Karger AG",
number = "3-4",

}

TY - JOUR

T1 - Incorporating individual error rate into association test of unmatched case-control design

AU - Hao, Ke

AU - Wang, Xiaobin

PY - 2004/3

Y1 - 2004/3

N2 - Objectives: Genotyping error commonly occurs and could reduce the power and bias statistical inference in genetics studies. In addition to genotypes, some automated biotechnologies also provide quality measurement of each individual genotype. We studied the relationship between the quality measurement and genotyping error rate. Furthermore, we propose two association tests incorporating the genotyping quality information with the goal to improve statistical power and inference. Methods: 50 pairs of DNA sample duplicates were typed for 232 SNPs by BeadArray technology. We used scatter plot, smoothing function and generalized additive models to investigate the relationship between genotype quality score (g) and inconsistency rate (ĩ) among duplicates. We constructed two association tests: (1) weighted contingency table test (WCT) and (2) likelihood ratio test (LRT) to incorporate individual genotype error rate (εi), in unmatched case-control setting. Results: In the 50 duplicates, we found q and ĩ were in strong negative association, suggesting the genotypes with low quality score were more likely to be mistyped. The WCT improved the statistical power and partially corrects the bias in point estimation. The LRT offered moderate power gain, but was able to correct the bias in odds ratio estimation. The two new methods also performed favorably in some scenarios when εi, was mis-specified. Conclusions: With increasing number of genetic studies and application of automated genotyping technology, there is a growing need to adequately account for individual genotype error rate in statistical analysis. Our study represents an initial step to address this need and points out a promising direction for further research.

AB - Objectives: Genotyping error commonly occurs and could reduce the power and bias statistical inference in genetics studies. In addition to genotypes, some automated biotechnologies also provide quality measurement of each individual genotype. We studied the relationship between the quality measurement and genotyping error rate. Furthermore, we propose two association tests incorporating the genotyping quality information with the goal to improve statistical power and inference. Methods: 50 pairs of DNA sample duplicates were typed for 232 SNPs by BeadArray technology. We used scatter plot, smoothing function and generalized additive models to investigate the relationship between genotype quality score (g) and inconsistency rate (ĩ) among duplicates. We constructed two association tests: (1) weighted contingency table test (WCT) and (2) likelihood ratio test (LRT) to incorporate individual genotype error rate (εi), in unmatched case-control setting. Results: In the 50 duplicates, we found q and ĩ were in strong negative association, suggesting the genotypes with low quality score were more likely to be mistyped. The WCT improved the statistical power and partially corrects the bias in point estimation. The LRT offered moderate power gain, but was able to correct the bias in odds ratio estimation. The two new methods also performed favorably in some scenarios when εi, was mis-specified. Conclusions: With increasing number of genetic studies and application of automated genotyping technology, there is a growing need to adequately account for individual genotype error rate in statistical analysis. Our study represents an initial step to address this need and points out a promising direction for further research.

KW - Automated genotyping biotechnology

KW - Contingency table

KW - Genotype quality score

KW - Genotyping error rate

KW - Likelihood ratio test

KW - Unmatched case-control

UR - http://www.scopus.com/inward/record.url?scp=19944401078&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=19944401078&partnerID=8YFLogxK

U2 - 10.1159/000083542

DO - 10.1159/000083542

M3 - Article

C2 - 15812172

AN - SCOPUS:19944401078

VL - 58

SP - 154

EP - 163

JO - Human Heredity

JF - Human Heredity

SN - 0001-5652

IS - 3-4

ER -