final / scDFM /README.md
tintin1027's picture
Upload folder from local machine
ae10e20 verified

scDFN logo

scDFM: Distributional Flow Matching for Robust Single-Cell Perturbation Prediction (ICLR 2026)

arXiv Codebase License: MIT YouTube Slides

Official repo for the paper scDFM, ICLR 2026.
Chenglei Yuβˆ—1,2, Chuanrui Wangβˆ—1, Bangyan Liaoβˆ—1,2 & Tailin Wu†1.

1School of Engineering, Westlake University; 2Zhejaing University;

*Equal contribution, †Corresponding authors


Overview

We propose a novel distributional flow matching framework (scDFM) for robust single-cell perturbation prediction, which models the full distribution of perturbed cellular expression profiles conditioned on control states, thereby overcoming limitations of existing methods that rely on cell-level correspondences and fail to capture population-level transcriptional shifts.

Framework of paper:

Install dependencies

conda env create -f environment.yml

⏬ Dataset download

Put dataset into data file:

Alternative Data Access

We also provide the datasets via Google Drive. This folder contains:

  • The Norman dataset and its corresponding data splits.
  • The ComboSciPlex dataset.

Example directory layout after download (relative to repo root):

scDFM/
β”œβ”€ data/
β”‚  β”œβ”€ norman.h5ad
β”‚  └─ combosciplex.h5ad
β”œβ”€ src/
β”‚  └─ ...
└─ run.sh

πŸ“₯ Training

An example on additive task.

bash run.sh

🫑 Citation

If you find our work and/or our code useful, please cite us via:

@inproceedings{yu2026scdfm,
  title={sc{DFM}: Distributional Flow Matching Model for Robust Single-Cell Perturbation Prediction},
  author={Chenglei Yu and Chuanrui Wang and Bangyan Liao and Tailin Wu},
  booktitle={The Fourteenth International Conference on Learning Representations},
  year={2026},
  url={https://openreview.net/forum?id=QSGanMEcUV}
}

πŸ“š Related Resources