Item request has been placed!
×
Item request cannot be made.
×

Processing Request
A comprehensive benchmarking study on computational tools for cross-omics label transfer from single-cell RNA to ATAC data.
Item request has been placed!
×
Item request cannot be made.
×

Processing Request
- Author(s): Wang Y;Wang Y; Zhao H; Zhao H; Zhao H; Zhao H
- Source:
G3 (Bethesda, Md.) [G3 (Bethesda)] 2026 Apr 01; Vol. 16 (4).
- Publication Type:
Journal Article; Research Support, N.I.H., Extramural
- Language:
English
- Additional Information
- Source:
Publisher: Oxford University Press Country of Publication: England NLM ID: 101566598 Publication Model: Print Cited Medium: Internet ISSN: 2160-1836 (Electronic) Linking ISSN: 21601836 NLM ISO Abbreviation: G3 (Bethesda) Subsets: MEDLINE
- Publication Information:
Publication: 2021- : [Oxford] : Oxford University Press
Original Publication: Bethesda, MD : Genetics Society of America, 2011-
- Subject Terms:
- Abstract:
With continuous progress of single-cell chromatin accessibility profiling techniques, scATAC-seq has become more commonly used in investigating regulatory genomic regions and their involvement in developmental, evolutionary, and disease-related processes. At the same time, accurate cell type annotation plays a crucial role in comprehending the cellular makeup of complex tissues and uncovering novel cell types. Unfortunately, the majority of existing methods primarily focus on label transfer within scRNA-seq datasets and only a limited number of approaches have been specifically developed for transferring labels from scRNA-seq to scATAC-seq data. Moreover, many methods have been published for the joint embedding of data from the two modalities, which can be used for label transfer by adding a classifier trained on the latent space. Given these available methods, this study presents a comprehensive benchmarking study evaluating 27 computational tools for scATAC-seq label annotations through tasks involving single-cell RNA and ATAC data from various human and mouse tissues. We found that when high-quality paired data were available to transfer labels across unpaired data, Bridge and GLUE were the best performers; otherwise, bindSC and GLUE achieved the highest prediction accuracy overall. All these methods were able to use peak-level information instead of purely relying on the gene activities from scATAC-seq. Furthermore, we found that data imbalance, cross-omics dissimilarity on common cell types, data binarization, and the introduction of semi-supervised strategy usually had negative impacts on model performance. In terms of scalability, we found that the most time and memory efficient methods were Bridge and deep learning-based algorithms like GLUE. Based on the results of this study, we provide several suggestions for future methodology development.
(© The Author(s) 2026. Published by Oxford University Press on behalf of The Genetics Society of America.)
- Grant Information:
U24 HG012108 United States GF NIH HHS; U01 HG013840 United States GF NIH HHS
- Contributed Indexing:
Keywords: benchmarking study; cell type annotation; cross-omics label transfer; scATAC-seq; scRNA-seq
- Accession Number:
63231-63-0 (RNA)
- Publication Date:
Date Created: 20260208 Date Completed: 20260401 Latest Revision: 20260426
- Publication Date:
20260427
- Accession Number:
PMC13042286
- Accession Number:
10.1093/g3journal/jkag026
- Accession Number:
41655240
No Comments.