Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CellScientist: Dual-Space Hierarchical Orchestration for Closed-Loop Refinement of Virtual Cell Models

About

Virtual Cell Modeling (VCM) requires models that not only predict perturbation responses, but also support targeted revision when predictions fail. Current LLM-assisted modeling workflows face a refinement-routing problem: prediction discrepancies are observed through executable implementations, but the relevant revision may involve the modeling assumption, representation design, implementation, or task constraint. Without structured feedback propagation across these levels, iterative refinement may repair code while failing to revise the assumption responsible for the discrepancy. We propose CellScientist, a dual-space hierarchical framework that couples a high-level hypothesis space with a low-level executable implementation space. CellScientist represents modeling decisions as structured states, realizes them as admissible programs under task and interface constraints, and routes execution discrepancies back to targeted hypothesis or implementation updates. This enables a closed Hypothesis -> Implementation -> Hypothesis loop where failures become structured signals for model refinement rather than debugging events. Across morphology and transcriptomic benchmarks, with additional single-cell perturbation evaluations, the final executable models selected by CellScientist improve over reference baselines under fixed split and evaluation protocols, while the workflow produces auditable refinement traces.

Mengran Li, Bo Li, Jiaying Wang, Wenbin Xing, Yixuan Dong, Chengyang Zhang, Hongliang Zhang, Yuzhong Peng, Jinlin Wu, Bob Zhang, Bingo Wing-Kuen Ling, Fuji Yang, Zhen Lei, Jiebo Luo, Zelin Zang• 2026

Related benchmarks

TaskDatasetResultRank
Cell Painting morphology predictionBBBC036 SMILES-based
MSE3.3115
7
Cell Painting morphology predictionBBBC036 Plate-based
MSE2.3662
7
Cell Painting morphology predictionBBBC047 SMILES-based split
MSE2.7612
7
Cell Painting morphology predictionBBBC047 Plate-based
MSE2.5704
7
Cell Painting morphology predictionCPG0016 (SMILES-based split)
MSE1.1113
7
Cell Painting morphology predictionCPG0016 Plate-based
MSE0.9937
7
Transcriptomic perturbation predictionLINCS L1000 Seven Cell Lines 2020
RMSE0.5088
6
Single-cell perturbation predictionNorman Gene Knockout Perturbation – scRNA-seq
PCC0.9871
2
Single-cell perturbation predictionSchiebinger Cytokine Perturbation – scRNA-seq
PCC0.8818
2
Single-cell perturbation predictionPapalexi RNA Gene Knockout Perturbation – scCITE-seq RNA
PCC0.8193
2
Showing 10 of 12 rows

Other info

Follow for update