CellFateBench is a scientific software and benchmark-engineering project for evaluating reasoning over single-cell genomics workflows.
The project was designed around a practical question:
How can single-cell analysis outputs be turned into reproducible benchmark tasks with public prompts, hidden answer keys, oracle outputs, scoring, calibration, Docker validation, and CI?
What CellFateBench does
Single-cell genomics workflows often produce outputs such as: