li-unsupervisedconditional-2008.pdf (180.65 kB)
An unsupervised conditional random fields approach for clustering gene expression time series
journal contribution
posted on 2008-01-01, 00:00 authored by Chang-Tsun LiChang-Tsun Li, Y Yuan, R WilsonMotivation: There is a growing interest in extracting statistical patterns from gene expression time-series data, in which a key challenge is the development of stable and accurate probabilistic models. Currently popular models, however, would be computationally prohibitive unless some independence assumptions are made to describe large-scale data. We propose an unsupervised conditional random fields (CRF) model to overcome this problem by progressively infusing information into the labelling process through a small variable voting pool. Results: An unsupervised CRF model is proposed for efficient analysis of gene expression time series and is successfully applied to gene class discovery and class prediction. The proposed model treats each time series as a random field and assigns an optimal cluster label to each time series, so as to partition the time series into clusters without a priori knowledge about the number of clusters and the initial centroids. Another advantage of the proposed method is the relaxation of independence assumptions. © The Author 2008. Published by Oxford University Press. All rights reserved.
History
Journal
BioinformaticsVolume
24Issue
21Pagination
2467 - 2473Publisher
Oxford University PressLocation
Oxford, Eng.Publisher DOI
ISSN
1367-4803eISSN
1460-2059Language
engPublication classification
C1.1 Refereed article in a scholarly journalUsage metrics
Categories
No categories selectedKeywords
Science & TechnologyLife Sciences & BiomedicineTechnologyPhysical SciencesBiochemical Research MethodsBiotechnology & Applied MicrobiologyComputer Science, Interdisciplinary ApplicationsMathematical & Computational BiologyStatistics & ProbabilityBiochemistry & Molecular BiologyComputer ScienceMathematicsMIXTURE MODEL
Licence
Exports
RefWorks
BibTeX
Ref. manager
Endnote
DataCite
NLM
DC