Clustering of Small-Sample Single-Cell RNA-Seq Data via Feature Clustering and Selection
File version
Accepted Manuscript (AM)
Author(s)
Sharma, A
Patil, A
Shigemizu, D
Tsunoda, T
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
Size
File type(s)
Location
Cuvu, Fiji
License
Abstract
We present FeatClust, a software tool for clustering small sample size single-cell RNA-Seq datasets. The FeatClust approach is based on feature selection. It divides features into several groups by performing agglomerative hierarchical clustering and then iteratively clustering the samples and removing features belonging to groups with the least variance across samples. The optimal number of feature groups is selected based on silhouette analysis on the clustered data, i.e., selecting the clustering with the highest average silhouette coefficient. FeatClust also allows one to visually choose the number of clusters if it is not known, by generating silhouette plot for a chosen number of groupings of the dataset. We cluster five small sample single-cell RNA-seq datasets and use the adjusted rand index metric to compare the results with other clustering packages. The results are promising and show the effectiveness of FeatClust on small sample size datasets.
Journal Title
Conference Title
Lecture Notes in Computer Science
Book Title
Edition
Volume
11672
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement
© Springer Nature Switzerland AG 2019. This is the author-manuscript version of this paper. Reproduced in accordance with the copyright policy of the publisher.The original publication is available at www.springerlink.com
Item Access Status
Note
Access the data
Related item(s)
Subject
Artificial intelligence
Persistent link to this record
Citation
Vans, E; Sharma, A; Patil, A; Shigemizu, D; Tsunoda, T, Clustering of Small-Sample Single-Cell RNA-Seq Data via Feature Clustering and Selection, PRICAI 2019: Trends in Artificial Intelligence , 2019, 11672, pp. 445-456