Content area

Abstract

Flow and mass cytometry experiments are essential for profiling immune cells at single cell resolution. Better understanding of human immunology increasingly involves analyzing studies at the scale of hundreds or thousands of samples, with data analysis a significant bottleneck. This trend increases the demand for automated analysis methods. In particular, a common preprocessing step in cytometry data analysis is distinguishing single cells from doublets (or multiplets), events in which two (or more) cells pass simultaneously through the detector. Typically, doublets are identified on two-dimensional density plots, using their high measured values for DNA intercalators (mass cytometry) or scattering channels (flow cytometry). Despite its popularity, this bivariate gating method is sometimes imprecise: for example, we show that bivariate gating of mass cytometry data can mistake single eosinophils for doublets, due to their high DNA content. Taking inspiration from methods already used in single cell transcriptomics, but not in the cytometry community, we propose an alternative approach. Our method, called Cleanet, first simulates doublet events, then identifies true events with protein expression similar to the simulated doublets. This simple method is completely automated and detects both homotypic and heterotypic doublets. We validate it in datasets acquired with mass and flow cytometry; moreover, we verify with imaging flow cytometry that events predicted to be doublets truly consist of multiple cells. Cleanet can also classify doublets based on their component cell types, which potentially enables the study of cell-cell interactions, mining extra information out of doublet events that would otherwise be discarded. As a proof of concept, we demonstrate that Cleanet can detect a treatment-specific increase in interactions between two cell lines. By automating doublet detection and classification, we aim to streamline the data analysis in large cytometry studies and provide a more accurate picture of both immune cell populations and cell-cell interactions.

Competing Interest Statement

The authors have declared no competing interest.

Details

1009240
Business indexing term
Title
Cleanet: robust doublet detection in cytometry data based on protein expression patterns
Publication title
bioRxiv; Cold Spring Harbor
Publication year
2025
Publication date
Jan 14, 2025
Section
New Results
Publisher
Cold Spring Harbor Laboratory Press
Source
BioRxiv
Place of publication
Cold Spring Harbor
Country of publication
United States
University/institution
Cold Spring Harbor Laboratory Press
Publication subject
ISSN
2692-8205
Source type
Working Paper
Language of publication
English
Document type
Working Paper
ProQuest document ID
3155458438
Document URL
https://www.proquest.com/working-papers/cleanet-robust-doublet-detection-cytometry-data/docview/3155458438/se-2?accountid=208611
Copyright
© 2025. This article is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (“the License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-01-15
Database
ProQuest One Academic