Back to Search Start Over

A versatile information retrieval framework for evaluating profile strength and similarity.

Authors :
Kalinin AA
Arevalo J
Vulliard L
Serrano E
Tsang H
Bornholdt M
Rajwa B
Carpenter AE
Way GP
Singh S
Source :
BioRxiv : the preprint server for biology [bioRxiv] 2024 Apr 02. Date of Electronic Publication: 2024 Apr 02.
Publication Year :
2024

Abstract

In profiling assays, thousands of biological properties are measured in a single test, yielding biological discoveries by capturing the state of a cell population, often at the single-cell level. However, for profiling datasets, it has been challenging to evaluate the phenotypic activity of a sample and the phenotypic consistency among samples, due to profiles' high dimensionality, heterogeneous nature, and non-linear properties. Existing methods leave researchers uncertain where to draw boundaries between meaningful biological response and technical noise. Here, we developed a statistical framework that uses the well-established mean average precision (mAP) as a single, data-driven metric to bridge this gap. We validated the mAP framework against established metrics through simulations and real-world data applications, revealing its ability to capture subtle and meaningful biological differences in cell state. Specifically, we used mAP to assess both phenotypic activity for a given perturbation (or a sample) as well as consistency within groups of perturbations (or samples) across diverse high-dimensional datasets. We evaluated the framework on different profile types (image, protein, and mRNA profiles), perturbation types (CRISPR gene editing, gene overexpression, and small molecules), and profile resolutions (single-cell and bulk). Our open-source software allows this framework to be applied to identify interesting biological phenomena and promising therapeutics from large-scale profiling data.<br />Competing Interests: Competing interests The Authors declare the following competing interests: S.S. and A.E.C. serve as scientific advisors for companies that use image-based profiling and Cell Painting (A.E.C: Recursion, SyzOnc, Quiver Bioscience, S.S.: Waypoint Bio, Dewpoint Therapeutics, Deepcell) and receive honoraria for occasional scientific visits to pharmaceutical and biotechnology companies. All other authors declare no competing interests.

Details

Language :
English
Database :
MEDLINE
Journal :
BioRxiv : the preprint server for biology
Accession number :
38617315
Full Text :
https://doi.org/10.1101/2024.04.01.587631