计算机科学
RNA序列
注释
领域(数学分析)
域适应
相似性(几何)
人工智能
适应(眼睛)
计算生物学
数据挖掘
模式识别(心理学)
转录组
生物
基因表达
基因
数学
遗传学
图像(数学)
分类器(UML)
数学分析
神经科学
作者
Yan Zhao,Junliang Shang,Baojuan Qin,Limin Zhang,Xin He,Daohui Ge,Qianqian Ren,Jin‐Xing Liu
出处
期刊:IEEE Journal of Biomedical and Health Informatics
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:: 1-9
标识
DOI:10.1109/jbhi.2024.3468310
摘要
Cell type annotation refers to the process of categorizing and labeling cells to identify their specific cell types, which is crucial for understanding cell functions and biological processes. Although many methods have been developed for automated cell type annotation, they often encounter challenges such as batch effects due to variations in data distribution across platforms and species, thereby compromising their performance. To address batch effects, in this study, a pre-trained domain adaptation model based on structural similarity, named pscAdapt, is proposed for cell type annotation. Specifically, a pre-trained strategy is employed to initialize model parameters to learn the data distribution of source domain. This strategy is also combined with an adversarial learning strategy to train the domain adaptation network for achieving domain level alignment and reducing domain discrepancy. Furthermore, to better distinguish different types of cells, a structural similarity loss is designed, aiming to shorten distances between cells of the same type and increase distances between cells of different types in feature space, thus achieving cell level alignment and enhancing the discriminability of cell types. Comprehensive experiments were conducted on simulated datasets, cross-platforms datasets and cross-species datasets to validate the effectiveness of pscAdapt, results of which demonstrate that pscAdapt outperforms several popular cell type annotation methods. The source code of pscAdapt is available online at https://github.com/CDMBlab/pscAdapt.
科研通智能强力驱动
Strongly Powered by AbleSci AI