Cluster Analysis of Gene Expression Data Based on Self-Splitting and Merging Competitive Learning

Author(s)
Wu, SH
Liew, AWC
Yan, H
Yang, MS
Date
2004
Size

922363 bytes

File type(s)

application/pdf

Abstract

Cluster analysis of gene expression data from a cDNA microarray is useful for identifying biologically relevant groups of genes. However, finding the natural clusters in the data and estimating the correct number of clusters are still two largely unsolved problems. In this paper, we propose a new clustering framework that addresses both of these problems. By using the one-prototype-take-one-cluster (OPTOC) competitive learning paradigm, the proposed algorithm can find natural clusters in the input data, and the clustering solution is not sensitive to initialization. To estimate the number of distinct clusters in the data, we propose a cluster splitting and merging strategy. We have applied the new algorithm to simulated gene expression data for which the correct distribution of genes over clusters is known a priori. The results show that the proposed algorithm can find natural clusters and give the correct number of clusters. The algorithm has also been tested on real gene expression changes during the yeast cell cycle, for which the fundamental patterns of gene expression and the assignment of genes to clusters are well understood from numerous previous studies. Comparative studies with several clustering algorithms illustrate the effectiveness of our method.

Journal Title

IEEE Transactions on Information Technology in Biomedicine

Volume

8

Issue

1

Rights Statement

© 2004 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Subject

Information and computing sciences

Engineering

Biomedical and clinical sciences
