TY - GEN
T1 - A multimodality ICA study - Integrating genomic single nucleotide polymorphisms with functional neuroimaging data
AU - Liu, Jingyu
AU - Bixler, Joel N.
AU - Calhoun, Vince D.
PY - 2008/12/1
Y1 - 2008/12/1
N2 - Genomic experiments encapsulate such rich information that specific analysis tools are needed to take full advantage of the available data. Recently, more attention has been devoted to developing tools to identify the association between genomic variations and phenotypic traits. This paper focuses on understanding an approach called parallel independent component analysis (ICA), while applied to massive microarray data. Parallel ICA is a multivariate approach which can process genotypic and phenotypic data in parallel while emphasizing the relationship between data types. We simulated single nucleotide polymorphism (SNP) array data and functional magnetic resonance imaging data, generating hidden connections between the two modalities. The performance of parallel ICA was compared with a correlation test for a single hypothesis. Our evaluation shows that parallel ICA, in general, is able to extract more accurately the components and connections than the correlation test, in particular for weak linkages. Results also indicate that the ratio of sample size to SNP size should be at least 0.02.However, when the data have a low odds ratio or cases vs. controls ratio, the correlation test provides results reliably, though with lower accuracy.
AB - Genomic experiments encapsulate such rich information that specific analysis tools are needed to take full advantage of the available data. Recently, more attention has been devoted to developing tools to identify the association between genomic variations and phenotypic traits. This paper focuses on understanding an approach called parallel independent component analysis (ICA), while applied to massive microarray data. Parallel ICA is a multivariate approach which can process genotypic and phenotypic data in parallel while emphasizing the relationship between data types. We simulated single nucleotide polymorphism (SNP) array data and functional magnetic resonance imaging data, generating hidden connections between the two modalities. The performance of parallel ICA was compared with a correlation test for a single hypothesis. Our evaluation shows that parallel ICA, in general, is able to extract more accurately the components and connections than the correlation test, in particular for weak linkages. Results also indicate that the ratio of sample size to SNP size should be at least 0.02.However, when the data have a low odds ratio or cases vs. controls ratio, the correlation test provides results reliably, though with lower accuracy.
UR - http://www.scopus.com/inward/record.url?scp=58049185185&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=58049185185&partnerID=8YFLogxK
U2 - 10.1109/BIBMW.2008.4686229
DO - 10.1109/BIBMW.2008.4686229
M3 - Conference contribution
AN - SCOPUS:58049185185
SN - 9781424428908
T3 - Proceedings - 2008 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW
SP - 151
EP - 157
BT - Proceedings - 2008 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW
T2 - 2008 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW
Y2 - 3 November 2008 through 5 November 2008
ER -