Gene ontology semi-supervised possibilistic clustering of gene expression data

Ioannis A. Maraziotis, George Dimitrakopoulos, Anastasios Bezerianos

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Clustering is one of the most important data analysis methods with applications of significant importance in many scientific fields. In computational biology, clustering of gene expression data from microarrays assists biologists to investigate uncharacterized genes by identifying biologically relevant groups of genes. Semi-supervised clustering algorithms have proven to bring substantial improvements in the results of standard clustering methods especially on datasets of increased complexity. In this paper we propose a semi-supervised possibilistic clustering algorithm (SSPCA) utilizing supervision via pair-wise constraints indicating whether a pair of patterns should belong to the same cluster or not. Furthermore we show how external sources of biological information like gene ontology data can provide constraints to guide the clustering process of SSPCA. Our results show that the proposed algorithm outperformed other well established standard and semi-supervised methodologies.

Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Pages262-269
Number of pages8
Volume7297 LNCS
DOIs
StatePublished - 2012
Externally publishedYes
Event7th Hellenic Conference on Artificial Intelligence, SETN 2012 - Lamia, Greece
Duration: May 28 2012May 31 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7297 LNCS
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other7th Hellenic Conference on Artificial Intelligence, SETN 2012
CountryGreece
CityLamia
Period5/28/125/31/12

Fingerprint

Gene Ontology
Gene Expression Data
Gene expression
Clustering algorithms
Clustering Algorithm
Ontology
Genes
Clustering
Semi-supervised Clustering
Gene
Computational Biology
Microarrays
Clustering Methods
Microarray
Data analysis
Methodology
Standards

Keywords

  • constraints
  • gene expression
  • gene ontology
  • possibilistic clustering
  • semi-supervision

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Maraziotis, I. A., Dimitrakopoulos, G., & Bezerianos, A. (2012). Gene ontology semi-supervised possibilistic clustering of gene expression data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7297 LNCS, pp. 262-269). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7297 LNCS). https://doi.org/10.1007/978-3-642-30448-4_33

Gene ontology semi-supervised possibilistic clustering of gene expression data. / Maraziotis, Ioannis A.; Dimitrakopoulos, George; Bezerianos, Anastasios.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 7297 LNCS 2012. p. 262-269 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 7297 LNCS).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Maraziotis, IA, Dimitrakopoulos, G & Bezerianos, A 2012, Gene ontology semi-supervised possibilistic clustering of gene expression data. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 7297 LNCS, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 7297 LNCS, pp. 262-269, 7th Hellenic Conference on Artificial Intelligence, SETN 2012, Lamia, Greece, 5/28/12. https://doi.org/10.1007/978-3-642-30448-4_33
Maraziotis IA, Dimitrakopoulos G, Bezerianos A. Gene ontology semi-supervised possibilistic clustering of gene expression data. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 7297 LNCS. 2012. p. 262-269. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-642-30448-4_33
Maraziotis, Ioannis A. ; Dimitrakopoulos, George ; Bezerianos, Anastasios. / Gene ontology semi-supervised possibilistic clustering of gene expression data. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 7297 LNCS 2012. pp. 262-269 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{33164517b7a549f6bff5b71d9ac8a3af,
title = "Gene ontology semi-supervised possibilistic clustering of gene expression data",
abstract = "Clustering is one of the most important data analysis methods with applications of significant importance in many scientific fields. In computational biology, clustering of gene expression data from microarrays assists biologists to investigate uncharacterized genes by identifying biologically relevant groups of genes. Semi-supervised clustering algorithms have proven to bring substantial improvements in the results of standard clustering methods especially on datasets of increased complexity. In this paper we propose a semi-supervised possibilistic clustering algorithm (SSPCA) utilizing supervision via pair-wise constraints indicating whether a pair of patterns should belong to the same cluster or not. Furthermore we show how external sources of biological information like gene ontology data can provide constraints to guide the clustering process of SSPCA. Our results show that the proposed algorithm outperformed other well established standard and semi-supervised methodologies.",
keywords = "constraints, gene expression, gene ontology, possibilistic clustering, semi-supervision",
author = "Maraziotis, {Ioannis A.} and George Dimitrakopoulos and Anastasios Bezerianos",
year = "2012",
doi = "10.1007/978-3-642-30448-4_33",
language = "English (US)",
isbn = "9783642304477",
volume = "7297 LNCS",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "262--269",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Gene ontology semi-supervised possibilistic clustering of gene expression data

AU - Maraziotis, Ioannis A.

AU - Dimitrakopoulos, George

AU - Bezerianos, Anastasios

PY - 2012

Y1 - 2012

N2 - Clustering is one of the most important data analysis methods with applications of significant importance in many scientific fields. In computational biology, clustering of gene expression data from microarrays assists biologists to investigate uncharacterized genes by identifying biologically relevant groups of genes. Semi-supervised clustering algorithms have proven to bring substantial improvements in the results of standard clustering methods especially on datasets of increased complexity. In this paper we propose a semi-supervised possibilistic clustering algorithm (SSPCA) utilizing supervision via pair-wise constraints indicating whether a pair of patterns should belong to the same cluster or not. Furthermore we show how external sources of biological information like gene ontology data can provide constraints to guide the clustering process of SSPCA. Our results show that the proposed algorithm outperformed other well established standard and semi-supervised methodologies.

AB - Clustering is one of the most important data analysis methods with applications of significant importance in many scientific fields. In computational biology, clustering of gene expression data from microarrays assists biologists to investigate uncharacterized genes by identifying biologically relevant groups of genes. Semi-supervised clustering algorithms have proven to bring substantial improvements in the results of standard clustering methods especially on datasets of increased complexity. In this paper we propose a semi-supervised possibilistic clustering algorithm (SSPCA) utilizing supervision via pair-wise constraints indicating whether a pair of patterns should belong to the same cluster or not. Furthermore we show how external sources of biological information like gene ontology data can provide constraints to guide the clustering process of SSPCA. Our results show that the proposed algorithm outperformed other well established standard and semi-supervised methodologies.

KW - constraints

KW - gene expression

KW - gene ontology

KW - possibilistic clustering

KW - semi-supervision

UR - http://www.scopus.com/inward/record.url?scp=84861662326&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84861662326&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-30448-4_33

DO - 10.1007/978-3-642-30448-4_33

M3 - Conference contribution

AN - SCOPUS:84861662326

SN - 9783642304477

VL - 7297 LNCS

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 262

EP - 269

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

ER -