Identification of genotype errors

Yin Y. Shugart; Ying Wang

doi:10.1007/978-1-61779-555-8_2

Identification of genotype errors

Yin Y. Shugart, Ying Wang

Research output: Chapter in Book/Report/Conference proceeding › Chapter

1 Scopus citations

Abstract

It has been documented that there exist some errors in most large genotype datasets and that an error rate of 1-2% is adequate to lead to the distortion of map distance as well as a false conclusion of linkage (Abecasis et al. Eur J Hum Genet 9(2):130-134, 2001), therefore one needs to ensure that the data are as clean as possible. On the other hand, the process of data cleaning is tedious and demands efforts and experience. O'Connell and Weeks implemented four error-checking algorithms in computer software called PedCheck. In this chapter, the four algorithms implemented in PedCheck are discussed with a focus on the genotype-elimination method. Furthermore, an example for four levels of error checking permitted by PedCheck is provided with the required input files. In addition, alternative algorithms implemented in other statistical computing programs are also briefly discussed.

Original language	English (US)
Title of host publication	Statistical Human Genetics
Subtitle of host publication	Methods and Protocols
Editors	Robert Elston, Shuying Sun, Jaya Satagopan
Pages	11-24
Number of pages	14
DOIs	https://doi.org/10.1007/978-1-61779-555-8_2
State	Published - 2012
Externally published	Yes

Publication series

Name	Methods in Molecular Biology
Volume	850
ISSN (Print)	1064-3745

Keywords

Automatic genotype elimination
Computational efficiency
Critical-genotype method
Genotype
Genotype error
Genotype-elimination method
LOD score
Nuclear-pedigree method
Odds-ratio method
Parametric linkage analysis

ASJC Scopus subject areas

Molecular Biology
Genetics

Access to Document

10.1007/978-1-61779-555-8_2

Cite this

@inbook{fb43b7c844b649699b7cbefd51cfc15e,

title = "Identification of genotype errors",

abstract = "It has been documented that there exist some errors in most large genotype datasets and that an error rate of 1-2% is adequate to lead to the distortion of map distance as well as a false conclusion of linkage (Abecasis et al. Eur J Hum Genet 9(2):130-134, 2001), therefore one needs to ensure that the data are as clean as possible. On the other hand, the process of data cleaning is tedious and demands efforts and experience. O'Connell and Weeks implemented four error-checking algorithms in computer software called PedCheck. In this chapter, the four algorithms implemented in PedCheck are discussed with a focus on the genotype-elimination method. Furthermore, an example for four levels of error checking permitted by PedCheck is provided with the required input files. In addition, alternative algorithms implemented in other statistical computing programs are also briefly discussed.",

keywords = "Automatic genotype elimination, Computational efficiency, Critical-genotype method, Genotype, Genotype error, Genotype-elimination method, LOD score, Nuclear-pedigree method, Odds-ratio method, Parametric linkage analysis",

author = "Shugart, {Yin Y.} and Ying Wang",

year = "2012",

doi = "10.1007/978-1-61779-555-8_2",

language = "English (US)",

isbn = "9781617795541",

series = "Methods in Molecular Biology",

pages = "11--24",

editor = "Robert Elston and Shuying Sun and Jaya Satagopan",

booktitle = "Statistical Human Genetics",

}

TY - CHAP

T1 - Identification of genotype errors

AU - Shugart, Yin Y.

AU - Wang, Ying

PY - 2012

Y1 - 2012

N2 - It has been documented that there exist some errors in most large genotype datasets and that an error rate of 1-2% is adequate to lead to the distortion of map distance as well as a false conclusion of linkage (Abecasis et al. Eur J Hum Genet 9(2):130-134, 2001), therefore one needs to ensure that the data are as clean as possible. On the other hand, the process of data cleaning is tedious and demands efforts and experience. O'Connell and Weeks implemented four error-checking algorithms in computer software called PedCheck. In this chapter, the four algorithms implemented in PedCheck are discussed with a focus on the genotype-elimination method. Furthermore, an example for four levels of error checking permitted by PedCheck is provided with the required input files. In addition, alternative algorithms implemented in other statistical computing programs are also briefly discussed.

AB - It has been documented that there exist some errors in most large genotype datasets and that an error rate of 1-2% is adequate to lead to the distortion of map distance as well as a false conclusion of linkage (Abecasis et al. Eur J Hum Genet 9(2):130-134, 2001), therefore one needs to ensure that the data are as clean as possible. On the other hand, the process of data cleaning is tedious and demands efforts and experience. O'Connell and Weeks implemented four error-checking algorithms in computer software called PedCheck. In this chapter, the four algorithms implemented in PedCheck are discussed with a focus on the genotype-elimination method. Furthermore, an example for four levels of error checking permitted by PedCheck is provided with the required input files. In addition, alternative algorithms implemented in other statistical computing programs are also briefly discussed.

KW - Automatic genotype elimination

KW - Computational efficiency

KW - Critical-genotype method

KW - Genotype

KW - Genotype error

KW - Genotype-elimination method

KW - LOD score

KW - Nuclear-pedigree method

KW - Odds-ratio method

KW - Parametric linkage analysis

UR - http://www.scopus.com/inward/record.url?scp=84863282405&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84863282405&partnerID=8YFLogxK

U2 - 10.1007/978-1-61779-555-8_2

DO - 10.1007/978-1-61779-555-8_2

M3 - Chapter

C2 - 22307691

AN - SCOPUS:84863282405

SN - 9781617795541

T3 - Methods in Molecular Biology

SP - 11

EP - 24

BT - Statistical Human Genetics

A2 - Elston, Robert

A2 - Sun, Shuying

A2 - Satagopan, Jaya

ER -

Identification of genotype errors

Abstract

Publication series

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this