Adaptive sparse multiple canonical correlation analysis with application to imaging (epi)genomics study of schizophrenia

Wenxing Hu, Dongdong Lin, Shaolong Cao, Jing Yu Liu, Jiayu Chen, Vince Daniel Calhoun, Yuping Wang

Research output: Contribution to journalArticle

Abstract

Finding correlations across multiple data sets in imaging and (epi)genomics is a common challenge. Sparse multiple canonical correlation analysis (SMCCA) is a multivariate model widely used to extract contributing features from each data while maximizing the cross-modality correlation. The model is achieved by using the combination of pairwise covariances between any two data sets. However, the scales of different pairwise covariances could be quite different and the direct combination of pairwise covariances in SMCCA is unfair. The problem of “unfair combination of pairwise covariances” restricts the power of SMCCA for feature selection. In this paper, we propose a novel formulation of SMCCA, called adaptive SMCCA, to overcome the problem by introducing adaptive weights when combining pairwise covariances. Both simulation and real data analysis show the outperformance of adaptive SMCCA in terms of feature selection over conventional SMCCA and SMCCA with fixed weights. Large-scale numerical experiments show that adaptive SMCCA converges as fast as conventional SMCCA. When applying it to imaging (epi)genetics study of schizophrenia subjects, we can detect significant (epi)genetic variants and brain regions, which are consistent with other existing reports. In addition, several significant brain-development related pathways, e.g., neural tube development, are detected by our model, demonstrating imaging epigenetic association may be overlooked by conventional SMCCA. All these results demonstrate that adaptive SMCCA are well-suited for detecting three-way or multi-way correlations and thus can find widespread applications in multiple omics and imaging data integration.

Original languageEnglish (US)
JournalIEEE Transactions on Biomedical Engineering
DOIs
StateAccepted/In press - Nov 7 2017
Externally publishedYes

Keywords

  • Adaptation models
  • Biomarkers
  • canonical correlation analysis
  • Correlation
  • data fusion
  • Diseases
  • Feature extraction
  • genomic data analysis
  • Imaging
  • imaging genomics
  • Linear programming
  • multi-omics data integration

ASJC Scopus subject areas

  • Biomedical Engineering

Fingerprint Dive into the research topics of 'Adaptive sparse multiple canonical correlation analysis with application to imaging (epi)genomics study of schizophrenia'. Together they form a unique fingerprint.

  • Cite this