Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

A comprehensive benchmarking study on computational tools for cross-omics label transfer from single-cell RNA to ATAC data.

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • Author(s): Wang Y;Wang Y; Zhao H; Zhao H; Zhao H; Zhao H
  • Source:
    G3 (Bethesda, Md.) [G3 (Bethesda)] 2026 Apr 01; Vol. 16 (4).
  • Publication Type:
    Journal Article; Research Support, N.I.H., Extramural
  • Language:
    English
  • Additional Information
    • Source:
      Publisher: Oxford University Press Country of Publication: England NLM ID: 101566598 Publication Model: Print Cited Medium: Internet ISSN: 2160-1836 (Electronic) Linking ISSN: 21601836 NLM ISO Abbreviation: G3 (Bethesda) Subsets: MEDLINE
    • Publication Information:
      Publication: 2021- : [Oxford] : Oxford University Press
      Original Publication: Bethesda, MD : Genetics Society of America, 2011-
    • Subject Terms:
    • Abstract:
      With continuous progress of single-cell chromatin accessibility profiling techniques, scATAC-seq has become more commonly used in investigating regulatory genomic regions and their involvement in developmental, evolutionary, and disease-related processes. At the same time, accurate cell type annotation plays a crucial role in comprehending the cellular makeup of complex tissues and uncovering novel cell types. Unfortunately, the majority of existing methods primarily focus on label transfer within scRNA-seq datasets and only a limited number of approaches have been specifically developed for transferring labels from scRNA-seq to scATAC-seq data. Moreover, many methods have been published for the joint embedding of data from the two modalities, which can be used for label transfer by adding a classifier trained on the latent space. Given these available methods, this study presents a comprehensive benchmarking study evaluating 27 computational tools for scATAC-seq label annotations through tasks involving single-cell RNA and ATAC data from various human and mouse tissues. We found that when high-quality paired data were available to transfer labels across unpaired data, Bridge and GLUE were the best performers; otherwise, bindSC and GLUE achieved the highest prediction accuracy overall. All these methods were able to use peak-level information instead of purely relying on the gene activities from scATAC-seq. Furthermore, we found that data imbalance, cross-omics dissimilarity on common cell types, data binarization, and the introduction of semi-supervised strategy usually had negative impacts on model performance. In terms of scalability, we found that the most time and memory efficient methods were Bridge and deep learning-based algorithms like GLUE. Based on the results of this study, we provide several suggestions for future methodology development.
      (© The Author(s) 2026. Published by Oxford University Press on behalf of The Genetics Society of America.)
    • Grant Information:
      U24 HG012108 United States GF NIH HHS; U01 HG013840 United States GF NIH HHS
    • Contributed Indexing:
      Keywords: benchmarking study; cell type annotation; cross-omics label transfer; scATAC-seq; scRNA-seq
    • Accession Number:
      63231-63-0 (RNA)
    • Publication Date:
      Date Created: 20260208 Date Completed: 20260401 Latest Revision: 20260426
    • Publication Date:
      20260427
    • Accession Number:
      PMC13042286
    • Accession Number:
      10.1093/g3journal/jkag026
    • Accession Number:
      41655240