CellFateBench is a scientific software and benchmark-engineering project for evaluating reasoning over single-cell genomics workflows.

The project was designed around a practical question:

How can single-cell analysis outputs be turned into reproducible benchmark tasks with public prompts, hidden answer keys, oracle outputs, scoring, calibration, Docker validation, and CI?

What CellFateBench does

Single-cell genomics workflows often produce outputs such as: