Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data

来源 :基因组蛋白质组与生物信息学报(英文版) | 被引量 : 0次 | 上传用户:cyuaxl
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Annotating cell types is a critical step in single-cell RNA sequencing (scRNA-seq) data analysis.Some supervised or semi-supervised classification methods have recently emerged to enable automated cell type identification.However,comprehensive evaluations of these methods are lacking.Moreover,it is not clear whether some classification methods originally designed for ana-lyzing other bulk omics data are adaptable to scRNA-seq analysis.In this study,we evaluated ten cell type annotation methods publicly available as R packages.Eight of them are popular methods developed specifically for single-cell research,including Seurat,scmap,SingleR,CHETAH,Sin-gleCellNet,scID,Garnett,and SCINA.The other two methods were repurposed from deconvolut-ing DNA methylation data,i.e.,linear constrained projection (CP) and robust partial correlations(RPC).We conducted systematic comparisons on a wide variety of public scRNA-seq datasets as well as simulation data.We assessed the accuracy through intra-dataset and inter-dataset predic-tions;the robustness over practical challenges such as gene filtering,high similarity among cell types,and increased cell type classes;as well as the detection of rare and unknown cell types.Over-all,methods such as Seurat,SingleR,CP,RPC,and SingleCellNet performed well,with Seurat being the best at annotating major cell types.Additionally,Seurat,SingleR,CP,and RPC were more robust against downsampling.However,Seurat did have a major drawback at predicting rare cell populations,and it was suboptimal at differentiating cell types highly similar to each other,compared to SingleR and RPC.All the code and data are available from https://github.com/qian-huiSenn/scRNA_ cell_ deconv_benchmark.
其他文献
TGF-β信号通路是一个由众多细胞因子组成的大家族,调节着包括细胞生长、上皮-间质转化、迁移、分化以及凋亡等在内的许多细胞功能,在组织与器官的正常生长、胚胎发育等过程中起着关键作用.一旦TGF-β信号传导过程发生异常,随之而来的就是一系列发育缺陷和疾病的产生,如肿瘤的发生,组织器官的纤维化,结缔组织以及骨骼疾病等.通过对TGF-β信号通路在不同疾病发生中的作用进行研究,目前已经开发了许多靶向此通路的治疗策略,如单克隆抗体、受体激酶抑制剂和反义寡聚核苷酸等.本综述总结了 TGF-β信号通路在不同疾病发生中的
冠状病毒(Coronavirus)是具有包膜的正单链RNA病毒,基因组大小介于26 000与32 000 nt之间,编码刺突蛋白(S)、包膜蛋白(E)、膜蛋白(M)和核壳蛋白(N)等四种结构蛋白、复制酶(ORF1a/b)与若干辅助蛋白,部分病毒还具有血细胞凝集素酯酶(HE),这些蛋白除维持病毒结构,还有促进感染与抵抗宿主免疫反应等功能,其中刺突蛋白可与宿主细胞表面的受体结合,使病毒包膜和宿主细胞的膜融合以感染细胞.冠状病毒的感染会影响细胞的许多信号转导途径,引发免疫反应,是一类可感染哺乳动物与鸟类的病毒.
楝酰胺类化合物存在于楝属植物中,因其独特的化学结构而具有杀虫、抗炎及抗癌的活性.目前研究发现,楝酰胺类化合物对多种癌症如肺癌、肾癌、胰腺癌、恶性外周神经鞘瘤等都具有独特的细胞凋亡作用,而对正常细胞无毒害作用.楝酰胺类化合物抗肿瘤的机制主要有:抑制癌细胞翻译起始、调控细胞周期、诱导肿瘤细胞凋亡、抑制细胞增殖、降低药物细胞毒性等.因此,楝酰胺及其衍生物作为潜在抗癌药物也有极大的应用前景,成为近年来的研究热点.本综述总结了楝属植物中的次生代谢产物-楝酰胺类化合物的发现过程、结构特征和抗癌活性,重点阐述了其在癌症
The recent advancement of single-cell RNA sequencing (scRNA-seq) technologies facilitates the study of cell lineages in developmental processes and cancer.In this study,we developed a computational method,called redPATH,to reconstruct the pseudo developme
Single-cell mass cytometry (SCMC) combines features of traditional flow cytometry (i.e.,fluorescence-activated cell sorting) with mass spectrometry,making it possible to measure several parameters at the single-cell level for a complex analysis of biologi
Single-cell RNA sequencing (scRNA-seq) is generally used for profiling transcriptome of individual cells.The droplet-based 10X Genomics Chromium (10X) approach and the plate-based Smart-seq2 full-length method are two frequently used scRNA-seq platforms,y
Successful pregnancy in placental mammals substantially depends on the establishment of maternal immune tolerance to the semi-allogenic fetus.Disorders in this process are tightly asso-ciated with adverse pregnancy outcomes including recurrent miscarriage
The rapid advancement of single-cell technologies has shed new light on the complex mechanisms of cellular heterogeneity.However,compared to bulk RNA sequencing (RNA-seq),single-cell RNA-seq (scRNA-seq) suffers from higher noise and lower coverage,which b
One of the major challenges in single-cell data analysis is the determination of cellular developmental trajectories using single-cell data.Although substantial studies have been conducted in recent years,more effective methods are still strongly needed t
Accurate identification of cell types from single-cell RNA sequencing (scRNA-seq) data plays a critical role in a variety of scRNA-seq analysis studies.This task corresponds to solving an unsupervised clustering problem,in which the similarity measurement