Bioinformatic analysis of neural stem cell differentiation

Loyal Goff, Jonathan Davila, Rebecka Jörnsten, Sunduz Keles, Ronald P. Hart

Research output: Contribution to journalArticle

Abstract

Regulated mRNAs during differentiation of rat neural stem cells were analyzed using the ABI1700 microarray platform. This microarray, while technically advanced, suffers from the difficulty of integrating hybridization results into public databases for systems-level analysis. This is particularly true for the rat array, since many of the probes were designed for transcripts based on predicted human and mouse homologs. Using several strategies, we increased the public annotation of the 27,531 probes from 43% to over 65%. To increase the dynamic range of annotation, probes were mapped to numerous public keys from several data sources. Consensus annotation from multiple sources was determined for well-scoring alignments, and a confidence-based ranking system established for probes with less agreement across multiple data sources. Previous attempts at genomic interpretation using the Celera annotation model resulted in poor overlap with expected genomic sequences. Since the public keys are more precisely mapped to the genome, we could now analyze the relationships between predicted transcription-factor binding sites and expression clusters. Results collected from a differentiation time course of two neural stem cell clones were clustered using a model-based algorithm. Transcription-factor binding sites were predicted from upstream regions of mapped transcripts using position weight matrices from either JASPAR or TRANSFAC, and the resulting scores were used to discriminate between observed expression clusters. A classification and regression tree analysis was conducted using cluster numbers as gene identifiers and TFBS scores as predictors, pruning back to obtain a tree with the lowest gene class prediction error rate. Results identify several transcription factors, the presence or absence of which are sufficient to describe clusters of mRNAs changing over time-those that are static, as well as clusters describing cell line differences. Public annotation of the AB1700 rat genome array will be valuable for integrating results into future systems-level analyses.

Original languageEnglish (US)
Pages (from-to)205-212
Number of pages8
JournalJournal of Biomolecular Techniques
Volume18
Issue number4
StatePublished - Sep 2007
Externally publishedYes

Fingerprint

Neural Stem Cells
Computational Biology
Cell Differentiation
Transcription Factors
Information Storage and Retrieval
Position-Specific Scoring Matrices
Binding Sites
Genome
Messenger RNA
Genes
Clone Cells
Regression Analysis
Databases
Cell Line

Keywords

  • Bioinformatics
  • Microarray
  • Neural differentiation
  • Probe annotation
  • Transcription factors

ASJC Scopus subject areas

  • Molecular Biology
  • Medicine(all)

Cite this

Goff, L., Davila, J., Jörnsten, R., Keles, S., & Hart, R. P. (2007). Bioinformatic analysis of neural stem cell differentiation. Journal of Biomolecular Techniques, 18(4), 205-212.

Bioinformatic analysis of neural stem cell differentiation. / Goff, Loyal; Davila, Jonathan; Jörnsten, Rebecka; Keles, Sunduz; Hart, Ronald P.

In: Journal of Biomolecular Techniques, Vol. 18, No. 4, 09.2007, p. 205-212.

Research output: Contribution to journalArticle

Goff, L, Davila, J, Jörnsten, R, Keles, S & Hart, RP 2007, 'Bioinformatic analysis of neural stem cell differentiation', Journal of Biomolecular Techniques, vol. 18, no. 4, pp. 205-212.
Goff L, Davila J, Jörnsten R, Keles S, Hart RP. Bioinformatic analysis of neural stem cell differentiation. Journal of Biomolecular Techniques. 2007 Sep;18(4):205-212.
Goff, Loyal ; Davila, Jonathan ; Jörnsten, Rebecka ; Keles, Sunduz ; Hart, Ronald P. / Bioinformatic analysis of neural stem cell differentiation. In: Journal of Biomolecular Techniques. 2007 ; Vol. 18, No. 4. pp. 205-212.
@article{c6584de619a140f7a20b982cf746c888,
title = "Bioinformatic analysis of neural stem cell differentiation",
abstract = "Regulated mRNAs during differentiation of rat neural stem cells were analyzed using the ABI1700 microarray platform. This microarray, while technically advanced, suffers from the difficulty of integrating hybridization results into public databases for systems-level analysis. This is particularly true for the rat array, since many of the probes were designed for transcripts based on predicted human and mouse homologs. Using several strategies, we increased the public annotation of the 27,531 probes from 43{\%} to over 65{\%}. To increase the dynamic range of annotation, probes were mapped to numerous public keys from several data sources. Consensus annotation from multiple sources was determined for well-scoring alignments, and a confidence-based ranking system established for probes with less agreement across multiple data sources. Previous attempts at genomic interpretation using the Celera annotation model resulted in poor overlap with expected genomic sequences. Since the public keys are more precisely mapped to the genome, we could now analyze the relationships between predicted transcription-factor binding sites and expression clusters. Results collected from a differentiation time course of two neural stem cell clones were clustered using a model-based algorithm. Transcription-factor binding sites were predicted from upstream regions of mapped transcripts using position weight matrices from either JASPAR or TRANSFAC, and the resulting scores were used to discriminate between observed expression clusters. A classification and regression tree analysis was conducted using cluster numbers as gene identifiers and TFBS scores as predictors, pruning back to obtain a tree with the lowest gene class prediction error rate. Results identify several transcription factors, the presence or absence of which are sufficient to describe clusters of mRNAs changing over time-those that are static, as well as clusters describing cell line differences. Public annotation of the AB1700 rat genome array will be valuable for integrating results into future systems-level analyses.",
keywords = "Bioinformatics, Microarray, Neural differentiation, Probe annotation, Transcription factors",
author = "Loyal Goff and Jonathan Davila and Rebecka J{\"o}rnsten and Sunduz Keles and Hart, {Ronald P.}",
year = "2007",
month = "9",
language = "English (US)",
volume = "18",
pages = "205--212",
journal = "Journal of Biomolecular Techniques",
issn = "1524-0215",
publisher = "Association of Biomolecular Resource Facilities",
number = "4",

}

TY - JOUR

T1 - Bioinformatic analysis of neural stem cell differentiation

AU - Goff, Loyal

AU - Davila, Jonathan

AU - Jörnsten, Rebecka

AU - Keles, Sunduz

AU - Hart, Ronald P.

PY - 2007/9

Y1 - 2007/9

N2 - Regulated mRNAs during differentiation of rat neural stem cells were analyzed using the ABI1700 microarray platform. This microarray, while technically advanced, suffers from the difficulty of integrating hybridization results into public databases for systems-level analysis. This is particularly true for the rat array, since many of the probes were designed for transcripts based on predicted human and mouse homologs. Using several strategies, we increased the public annotation of the 27,531 probes from 43% to over 65%. To increase the dynamic range of annotation, probes were mapped to numerous public keys from several data sources. Consensus annotation from multiple sources was determined for well-scoring alignments, and a confidence-based ranking system established for probes with less agreement across multiple data sources. Previous attempts at genomic interpretation using the Celera annotation model resulted in poor overlap with expected genomic sequences. Since the public keys are more precisely mapped to the genome, we could now analyze the relationships between predicted transcription-factor binding sites and expression clusters. Results collected from a differentiation time course of two neural stem cell clones were clustered using a model-based algorithm. Transcription-factor binding sites were predicted from upstream regions of mapped transcripts using position weight matrices from either JASPAR or TRANSFAC, and the resulting scores were used to discriminate between observed expression clusters. A classification and regression tree analysis was conducted using cluster numbers as gene identifiers and TFBS scores as predictors, pruning back to obtain a tree with the lowest gene class prediction error rate. Results identify several transcription factors, the presence or absence of which are sufficient to describe clusters of mRNAs changing over time-those that are static, as well as clusters describing cell line differences. Public annotation of the AB1700 rat genome array will be valuable for integrating results into future systems-level analyses.

AB - Regulated mRNAs during differentiation of rat neural stem cells were analyzed using the ABI1700 microarray platform. This microarray, while technically advanced, suffers from the difficulty of integrating hybridization results into public databases for systems-level analysis. This is particularly true for the rat array, since many of the probes were designed for transcripts based on predicted human and mouse homologs. Using several strategies, we increased the public annotation of the 27,531 probes from 43% to over 65%. To increase the dynamic range of annotation, probes were mapped to numerous public keys from several data sources. Consensus annotation from multiple sources was determined for well-scoring alignments, and a confidence-based ranking system established for probes with less agreement across multiple data sources. Previous attempts at genomic interpretation using the Celera annotation model resulted in poor overlap with expected genomic sequences. Since the public keys are more precisely mapped to the genome, we could now analyze the relationships between predicted transcription-factor binding sites and expression clusters. Results collected from a differentiation time course of two neural stem cell clones were clustered using a model-based algorithm. Transcription-factor binding sites were predicted from upstream regions of mapped transcripts using position weight matrices from either JASPAR or TRANSFAC, and the resulting scores were used to discriminate between observed expression clusters. A classification and regression tree analysis was conducted using cluster numbers as gene identifiers and TFBS scores as predictors, pruning back to obtain a tree with the lowest gene class prediction error rate. Results identify several transcription factors, the presence or absence of which are sufficient to describe clusters of mRNAs changing over time-those that are static, as well as clusters describing cell line differences. Public annotation of the AB1700 rat genome array will be valuable for integrating results into future systems-level analyses.

KW - Bioinformatics

KW - Microarray

KW - Neural differentiation

KW - Probe annotation

KW - Transcription factors

UR - http://www.scopus.com/inward/record.url?scp=36048993696&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=36048993696&partnerID=8YFLogxK

M3 - Article

C2 - 17916793

AN - SCOPUS:36048993696

VL - 18

SP - 205

EP - 212

JO - Journal of Biomolecular Techniques

JF - Journal of Biomolecular Techniques

SN - 1524-0215

IS - 4

ER -