five

Supporting data for "Triku: a feature selection method based on nearest neighbors for single-cell data"

收藏
DataCite Commons2025-05-26 更新2025-04-15 收录
下载链接:
http://gigadb.org/dataset/100989
下载链接
链接失效反馈
官方服务:
资源简介:
Feature selection is a relevant step in the analysis of single-cell RNA sequencing datasets. Most of the current feature selection methods are based on general univariate descriptors of the data such as the dispersion or the percentage of zeros. Despite the use of correction methods, the generality of these feature selection methods biases the genes selected towards highly-expressed genes, instead of the genes defining the cell populations of the dataset. <br>Triku is a feature selection method that favors genes defining the main cell populations. It does so by selecting genes expressed by groups of cells that are close in the <i>k</i> nearest neighbor graph. The expression of these genes is higher than the expected expression if the <i>k</i> cells were chosen at random. Triku efficiently recovers cell populations present in artificial and biological benchmarking datasets, based on ARI, NMI, supervised classification, and silhouette coefficient measurements. Additionally, gene sets selected by triku are more likely to be related to relevant Gene Ontology terms and contain fewer ribosomal and mitochondrial genes. <br> Triku is developed in Python 3 and is available at https://github.com/alexmascension/triku.
提供机构:
GigaScience Database
创建时间:
2022-01-25
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作