Integrated analysis of gene expression and copy number data on gene shaving using independent component analysis

Jinhua Sheng, Hong Wen Deng, Vince Calhoun, Yu Ping Wang

Research output: Contribution to journalArticlepeer-review

16 Scopus citations


DNA microarray gene expression and microarray-based comparative genomic hybridization (aCGH) have been widely used for biomedical discovery. Because of the large number of genes and the complex nature of biological networks, various analysis methods have been proposed. One such method is "gene shaving, a procedure which identifies subsets of the genes with coherent expression patterns and large variation across samples. Since combining genomic information from multiple sources can improve classification and prediction of diseases, in this paper we proposed a new method, "ICA gene shaving (ICA, independent component analysis), for jointly analyzing gene expression and copy number data. First we used ICA to analyze joint measurements, gene expression and copy number, of a biological system and project the data onto statistically independent biological processes. Next, we used these results to identify patterns of variation in the data and then applied an iterative shaving method. We investigated the properties of our proposed method by analyzing both simulated and real data. We demonstrated that the robustness of our method to noise using simulated data. Using breast cancer data, we showed that our method is superior to the Generalized Singular Value Decomposition (GSVD) gene shaving method for identifying genes associated with breast cancer.

Original languageEnglish (US)
Article number5753880
Pages (from-to)1568-1579
Number of pages12
JournalIEEE/ACM Transactions on Computational Biology and Bioinformatics
Issue number6
StatePublished - 2011
Externally publishedYes


  • Clustering technique
  • Independent component analysis (ICA)
  • comparative genomic hybridization (CGH)
  • copy number variation (CNV)
  • gene expression
  • gene shaving
  • generalized singular value decomposition (GSVD)

ASJC Scopus subject areas

  • Biotechnology
  • Genetics
  • Applied Mathematics


Dive into the research topics of 'Integrated analysis of gene expression and copy number data on gene shaving using independent component analysis'. Together they form a unique fingerprint.

Cite this