Clustering methods for single-cell RNA-sequencing expression data: Performance evaluation with varying sample sizes and cell compositions

Küçük Resim Yok

Tarih

2019

Yazarlar

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

De Gruyter

Erişim Hakkı

info:eu-repo/semantics/openAccess

Özet

A number of specialized clustering methods have been developed so far for the accurate analysis of single-cell RNA-sequencing (scRNA-seq) expression data, and several reports have been published documenting the performance measures of these clustering methods under different conditions. However, to date, there are no available studies regarding the systematic evaluation of the performance measures of the clustering methods taking into consideration the sample size and cell composition of a given scRNA-seq dataset. Herein, a comprehensive performance evaluation study of 11 selected scRNA-seq clustering methods was performed using synthetic datasets with known sample sizes and number of subpopulations, as well as varying levels of transcriptome complexity. The results indicate that the overall performance of the clustering methods under study are highly dependent on the sample size and complexity of the scRNA-seq dataset. In most of the cases, better clustering performances were obtained as the number of cells in a given expression dataset was increased. The findings of this study also highlight the importance of sample size for the successful detection of rare cell subpopulations with an appropriate clustering tool. © 2019 Walter de Gruyter GmbH, Berlin/Boston.

Açıklama

Anahtar Kelimeler

clustering, performance evaluation, RNA sequencing, single cell

Kaynak

Statistical Applications in Genetics and Molecular Biology

WoS Q Değeri

Scopus Q Değeri

Q3

Cilt

Sayı

Künye