Accurate identification of significant aberrations in contaminated cancer genome

Xuchu Hou, Guoqiang Yu, Xiguo Yuan, Bai Zhang, Ie Ming Shih, Zhen Zhang, Robert Clarke, Subha Madhavan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Somatic Copy Number Alterations (CNAs) are quite common in human cancers. Identifying CNAs and Significant Copy number Aberrations (SCAs) in cancer genomes is a critical task in searching for cancer-associated genes. The advanced genomic technologies, such as SNP array technology, facilitate copy number study at a genome-wide scale with high resolution. However, in reality, due to normal tissue contamination, the observed intensity signals are actually the mixture of copy number signals contributed from both tumor cells and normal cells. This genetic heterogeneity could significantly affect the subsequent copy number analysis and SCAs detection. In order to accurately identify significant aberrations in contaminated cancer genome, we devise an approach including two major steps. We first use a statistical method, Bayesian Analysis of Copy number Mixtures (BACOM) to estimate the normal tissue contamination fraction and recover the "true" copy number profile. Then, based on the recovered profiles, we detect SCAs using Genome-wide Identification of Significant Aberrations in Cancer Genome (SAIC). We comprehensively evaluate the performance of the proposed algorithm on a large number of simulation data. The results show that the algorithm has higher detection power than peer methods including the most popular GISTIC. We then apply the method to the real copy number data of Glioblastoma Multiforme and successfully identified majority of SCAs reported by GISTIC, and some novel SCAs that contain some cancer-associated genes.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE International Workshop on Genomic Signal Processing and Statistics
Pages74-77
Number of pages4
DOIs
StatePublished - 2012
Event2012 IEEE International Workshop on Genomic Signal Processing and Statistics, GENSIPS 2012 - Washington, DC, United States
Duration: Dec 2 2012Dec 4 2012

Other

Other2012 IEEE International Workshop on Genomic Signal Processing and Statistics, GENSIPS 2012
CountryUnited States
CityWashington, DC
Period12/2/1212/4/12

Fingerprint

Aberrations
Genes
Genome
Neoplasm Genes
Neoplasms
Technology
Bayes Theorem
Genetic Heterogeneity
Contamination
Glioblastoma
Tissue
Single Nucleotide Polymorphism
Tumors
Statistical methods
Cells

Keywords

  • copy number alterations
  • normal tissue contamination
  • significant copy number aberrations

ASJC Scopus subject areas

  • Biochemistry, Genetics and Molecular Biology (miscellaneous)
  • Computational Theory and Mathematics
  • Signal Processing
  • Biomedical Engineering

Cite this

Hou, X., Yu, G., Yuan, X., Zhang, B., Shih, I. M., Zhang, Z., ... Madhavan, S. (2012). Accurate identification of significant aberrations in contaminated cancer genome. In Proceedings - IEEE International Workshop on Genomic Signal Processing and Statistics (pp. 74-77). [6507730] https://doi.org/10.1109/GENSIPS.2012.6507730

Accurate identification of significant aberrations in contaminated cancer genome. / Hou, Xuchu; Yu, Guoqiang; Yuan, Xiguo; Zhang, Bai; Shih, Ie Ming; Zhang, Zhen; Clarke, Robert; Madhavan, Subha.

Proceedings - IEEE International Workshop on Genomic Signal Processing and Statistics. 2012. p. 74-77 6507730.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Hou, X, Yu, G, Yuan, X, Zhang, B, Shih, IM, Zhang, Z, Clarke, R & Madhavan, S 2012, Accurate identification of significant aberrations in contaminated cancer genome. in Proceedings - IEEE International Workshop on Genomic Signal Processing and Statistics., 6507730, pp. 74-77, 2012 IEEE International Workshop on Genomic Signal Processing and Statistics, GENSIPS 2012, Washington, DC, United States, 12/2/12. https://doi.org/10.1109/GENSIPS.2012.6507730
Hou X, Yu G, Yuan X, Zhang B, Shih IM, Zhang Z et al. Accurate identification of significant aberrations in contaminated cancer genome. In Proceedings - IEEE International Workshop on Genomic Signal Processing and Statistics. 2012. p. 74-77. 6507730 https://doi.org/10.1109/GENSIPS.2012.6507730
Hou, Xuchu ; Yu, Guoqiang ; Yuan, Xiguo ; Zhang, Bai ; Shih, Ie Ming ; Zhang, Zhen ; Clarke, Robert ; Madhavan, Subha. / Accurate identification of significant aberrations in contaminated cancer genome. Proceedings - IEEE International Workshop on Genomic Signal Processing and Statistics. 2012. pp. 74-77
@inproceedings{03043f249f374dbe86038d31d10a9041,
title = "Accurate identification of significant aberrations in contaminated cancer genome",
abstract = "Somatic Copy Number Alterations (CNAs) are quite common in human cancers. Identifying CNAs and Significant Copy number Aberrations (SCAs) in cancer genomes is a critical task in searching for cancer-associated genes. The advanced genomic technologies, such as SNP array technology, facilitate copy number study at a genome-wide scale with high resolution. However, in reality, due to normal tissue contamination, the observed intensity signals are actually the mixture of copy number signals contributed from both tumor cells and normal cells. This genetic heterogeneity could significantly affect the subsequent copy number analysis and SCAs detection. In order to accurately identify significant aberrations in contaminated cancer genome, we devise an approach including two major steps. We first use a statistical method, Bayesian Analysis of Copy number Mixtures (BACOM) to estimate the normal tissue contamination fraction and recover the {"}true{"} copy number profile. Then, based on the recovered profiles, we detect SCAs using Genome-wide Identification of Significant Aberrations in Cancer Genome (SAIC). We comprehensively evaluate the performance of the proposed algorithm on a large number of simulation data. The results show that the algorithm has higher detection power than peer methods including the most popular GISTIC. We then apply the method to the real copy number data of Glioblastoma Multiforme and successfully identified majority of SCAs reported by GISTIC, and some novel SCAs that contain some cancer-associated genes.",
keywords = "copy number alterations, normal tissue contamination, significant copy number aberrations",
author = "Xuchu Hou and Guoqiang Yu and Xiguo Yuan and Bai Zhang and Shih, {Ie Ming} and Zhen Zhang and Robert Clarke and Subha Madhavan",
year = "2012",
doi = "10.1109/GENSIPS.2012.6507730",
language = "English (US)",
isbn = "9781467352369",
pages = "74--77",
booktitle = "Proceedings - IEEE International Workshop on Genomic Signal Processing and Statistics",

}

TY - GEN

T1 - Accurate identification of significant aberrations in contaminated cancer genome

AU - Hou, Xuchu

AU - Yu, Guoqiang

AU - Yuan, Xiguo

AU - Zhang, Bai

AU - Shih, Ie Ming

AU - Zhang, Zhen

AU - Clarke, Robert

AU - Madhavan, Subha

PY - 2012

Y1 - 2012

N2 - Somatic Copy Number Alterations (CNAs) are quite common in human cancers. Identifying CNAs and Significant Copy number Aberrations (SCAs) in cancer genomes is a critical task in searching for cancer-associated genes. The advanced genomic technologies, such as SNP array technology, facilitate copy number study at a genome-wide scale with high resolution. However, in reality, due to normal tissue contamination, the observed intensity signals are actually the mixture of copy number signals contributed from both tumor cells and normal cells. This genetic heterogeneity could significantly affect the subsequent copy number analysis and SCAs detection. In order to accurately identify significant aberrations in contaminated cancer genome, we devise an approach including two major steps. We first use a statistical method, Bayesian Analysis of Copy number Mixtures (BACOM) to estimate the normal tissue contamination fraction and recover the "true" copy number profile. Then, based on the recovered profiles, we detect SCAs using Genome-wide Identification of Significant Aberrations in Cancer Genome (SAIC). We comprehensively evaluate the performance of the proposed algorithm on a large number of simulation data. The results show that the algorithm has higher detection power than peer methods including the most popular GISTIC. We then apply the method to the real copy number data of Glioblastoma Multiforme and successfully identified majority of SCAs reported by GISTIC, and some novel SCAs that contain some cancer-associated genes.

AB - Somatic Copy Number Alterations (CNAs) are quite common in human cancers. Identifying CNAs and Significant Copy number Aberrations (SCAs) in cancer genomes is a critical task in searching for cancer-associated genes. The advanced genomic technologies, such as SNP array technology, facilitate copy number study at a genome-wide scale with high resolution. However, in reality, due to normal tissue contamination, the observed intensity signals are actually the mixture of copy number signals contributed from both tumor cells and normal cells. This genetic heterogeneity could significantly affect the subsequent copy number analysis and SCAs detection. In order to accurately identify significant aberrations in contaminated cancer genome, we devise an approach including two major steps. We first use a statistical method, Bayesian Analysis of Copy number Mixtures (BACOM) to estimate the normal tissue contamination fraction and recover the "true" copy number profile. Then, based on the recovered profiles, we detect SCAs using Genome-wide Identification of Significant Aberrations in Cancer Genome (SAIC). We comprehensively evaluate the performance of the proposed algorithm on a large number of simulation data. The results show that the algorithm has higher detection power than peer methods including the most popular GISTIC. We then apply the method to the real copy number data of Glioblastoma Multiforme and successfully identified majority of SCAs reported by GISTIC, and some novel SCAs that contain some cancer-associated genes.

KW - copy number alterations

KW - normal tissue contamination

KW - significant copy number aberrations

UR - http://www.scopus.com/inward/record.url?scp=84877824971&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84877824971&partnerID=8YFLogxK

U2 - 10.1109/GENSIPS.2012.6507730

DO - 10.1109/GENSIPS.2012.6507730

M3 - Conference contribution

SN - 9781467352369

SP - 74

EP - 77

BT - Proceedings - IEEE International Workshop on Genomic Signal Processing and Statistics

ER -