From Snapshots to Trajectories: Benchmarking Senescence-Conditioned Cell Morphology Generation

A benchmark-oriented project page for senescence-conditioned morphology generation with real single-cell nuclei, continuous senescence scores, and generated temporal trajectories.

Xiang Zhang, Boxuan Zhang, Haotian Zhuang, Ting Long, Hang Hua, Zhicheng Ji, Heng Fan, Ruixiang Tang, Dongfang Liu

AT A GLANCE

The first benchmark for senescence-conditioned cell morphology generation

SenoFlow combines 87,391 real single-cell DAPI nuclear crops from three public Xenium tissue panels with continuous senescence scores and computationally generated temporal sequences, creating a benchmark for studying how cell morphology changes along senescence progression.

0Real single-cell images
0Xenium tissue panels
0Cell types
0temporal sequences

Dataset

Dataset Demo

Real Cell Crops

Real crop: Lung epithelial
TissueLung
Cell TypeEpithelial
Score0.18
Real crop: Lung T cells
TissueLung
Cell TypeT cells
Score0.47
Real crop: Prostate myeloid
TissueProstate
Cell TypeMyeloid
Score0.74
Real crop: Skin myeloid
TissueSkin
Cell TypeMyeloid
Score0.91

Generated Temporal Sequences

Low senescence score High senescence score
sequence_id: gen_lung_epi_0001 | target scores: 0.00 to 1.00 (10 bins)
sequence_id: gen_pro_fib_0002 | target scores: 0.00 to 1.00 (10 bins)

Method

Method Summary

We adopt flow matching as the backbone for senescence-conditioned generation. We propose SD-Prototype OT to stabilize transport under heterogeneous cell types with continuous scores and Continuous-Score Velocity Correction to preserve biologically consistent trajectory progression.

method figure

Benchmark Results

Evaluation Protocol and Benchmark Comparison

Evaluation Protocols

Automatic Metrics

  • FID: measures realism of generated images via feature distributions.
  • KID: complements FID with more reliable estimation under small sample sizes.
  • ρ (sig): mean signed Spearman correlation on significant metric across cell types.
  • Effect: fraction of real-data morphological shift reproduced by the model.
  • Viol: mean monotonicity violation ratio.

Expert Study

Specialists in senescence and cell biology review generated trajectories in a blinded setting. Scores are reported on a 1-5 scale and averaged across experts and evaluated samples.

  • Realism: whether generated images resemble real cell nuclei.
  • Fidelity: whether progression reflects biologically plausible ageing dynamics.
  • preservation: whether original cell identity is maintained.

Benchmark Results

We evaluate 13 baselines and our method under the same per-cell-type protocol, grouped into non-FM generators and Flow Matching variants.

Method FID KID (×103) ρ (sig) Effect (%) Viol (%) Realism Fidelity preservation
NON-FM GENERATORS
cVAE71.7044.290.0203.0----
CCDM43.2021.370.16237.0----
FLOW MATCHING VARIANTS
iMF14.163.720.10840.00.2---
MeanFlow17.956.050.11580.039.3---
Contrastive FM26.2410.830.30755.738.5---
Noise-start CFM36.8117.240.57575.633.4---
Rectified Flow61.0234.680.05740.048.5---
Untyped OT-CFM66.9440.150.20264.647.1---
I-CFM (random)85.5556.810.30741.339.2---
LogitNorm OT-CFM92.9662.430.07048.748.0---
Ours11.112.140.57060.223.2---

Download

Core Resources

Dataset

Real nuclei crops + generated temporal sequences with per-cell-type protocol setup.

🤗 Hugging Face

Code

Training, inference, and evaluation scripts for our method.

GitHub Repo

Pretrained Models

Released checkpoints for reproducible benchmark comparison and downstream testing.

Model Weights

Citation

BibTeX

@article{zhang2026xensen,
  title   = {XenSen-Bench: A Multi-Tissue Benchmark for Cellular
             Senescence from Spatial Transcriptomics},
  author  = {Xiang Zhang and Boxuan Zhang and Ting Long and
             Haotian Zhuang and Jason Ji and Ruixiang Tang and
             Heng Fan and Dongfang Liu},
  journal = {NeurIPS Datasets and Benchmarks Track},
  year    = {2026}
}