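Feature selection
Feature (linguistics)
Computer science
Python (programming language)
Artificial intelligence
Feature model
Visualization
Machine learning
Scalability
Dimensionality reduction
Source code
Data mining
Pattern recognition (psychology)
Software
Operating system
Database
Philosophy
Programming language
Linguistics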
Authors
Pengfei Liang,Hao Wang,Yuchao Liang,Jianzhong Zhou,Haicheng Li,Yongchun Zuo
Source
Journal: Current Bioinformatics
[Bentham Science]
Date: 2022-08-01
Volume (issue): 17 (7): 578-585
Citations: 6
Identifier
DOI:10.2174/1574893617666220608123804
Abstract
Background: Inferring feature importance is both a promise and a challenge in bioinformatics and computational biology. While multiple computational methods exist to identify the decisive factors of single-cell subpopulations, there is a need for a comprehensive toolkit that presents an intuitive and customizable view of feature importance.
Objective: We developed Feature-scML, a scalable and user-friendly toolkit that allows users to visualize and reveal decisive factors in single-cell omics analysis.
Method: Feature-scML provides three main functions: (i) seven feature selection algorithms comprehensively score and rank every feature; (ii) four machine learning approaches and an incremental feature selection (IFS) strategy jointly determine the number of selected features; (iii) the toolkit supports visualization of feature importance, model performance evaluation, and model interpretation. The source code is available at https://github.com/liameihao/Feature-scML.
Results: We systematically compared the performance of the seven feature selection algorithms in Feature-scML on two single-cell transcriptome datasets. The comparison demonstrates the effectiveness and power of Feature-scML.
Conclusion: Feature-scML is effective for analyzing single-cell RNA omics datasets, automating the machine learning process and customizing the visual analysis of the results.
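The rank-then-increment idea described under Method can be illustrated with a minimal, standard-library-only sketch. It is not Feature-scML's code or API: the Fisher-like score stands in for any one of the seven ranking algorithms, the nearest-centroid classifier stands in for any one of the four learners, and every function name and the synthetic data are hypothetical. Features are ranked once, a model is evaluated on the top 1, 2, ..., n features, and the smallest subset with the best accuracy is kept.

```python
import random
from statistics import mean, pvariance


def rank_features(X, y):
    """Rank feature indices by a Fisher-like score (between/within class variance).

    Assumption: a stand-in for whichever ranking algorithm is actually used.
    """
    scores = []
    for j in range(len(X[0])):
        col = [row[j] for row in X]
        overall = mean(col)
        between = within = 0.0
        for c in set(y):
            vals = [v for v, label in zip(col, y) if label == c]
            between += len(vals) * (mean(vals) - overall) ** 2
            within += len(vals) * pvariance(vals)
        scores.append(between / within if within > 0 else 0.0)
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)


def loo_accuracy(X, y, features):
    """Leave-one-out accuracy of a nearest-centroid classifier on `features`."""
    correct = 0
    for i in range(len(X)):
        centroids = {}
        for c in set(y):
            rows = [X[k] for k in range(len(X)) if k != i and y[k] == c]
            centroids[c] = [mean(r[j] for r in rows) for j in features]
        point = [X[i][j] for j in features]
        pred = min(
            centroids,
            key=lambda c: sum((p - q) ** 2 for p, q in zip(point, centroids[c])),
        )
        correct += pred == y[i]
    return correct / len(X)


def incremental_feature_selection(X, y):
    """Evaluate the top-k ranked features for k = 1..n and pick the best k.

    Note: ranking on the full data before cross-validation leaks information;
    a careful analysis would rank inside each fold. Kept simple here.
    """
    ranking = rank_features(X, y)
    curve = [loo_accuracy(X, y, ranking[:k]) for k in range(1, len(ranking) + 1)]
    # max() returns the first maximum, i.e. the smallest k with the best accuracy.
    best_k = max(range(len(curve)), key=lambda i: curve[i]) + 1
    return ranking, curve, best_k


# Synthetic data (hypothetical): 2 classes, 6 features, only features 0 and 1 informative.
rng = random.Random(0)
X, y = [], []
for label in (0, 1):
    for _ in range(20):
        row = [rng.gauss(3.0 * label, 1.0) for _ in range(2)]
        row += [rng.gauss(0.0, 1.0) for _ in range(4)]
        X.append(row)
        y.append(label)

ranking, curve, best_k = incremental_feature_selection(X, y)
print("ranking:", ranking)
print("accuracy per k:", [round(a, 3) for a in curve])
print("selected number of features:", best_k)
```

Plotting `curve` against k gives the usual IFS curve; the selected subset is `ranking[:best_k]`.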