Abstract: Cognition's FrontierCode benchmark reframes how we evaluate AI coding...

Cognition Labs launches FrontierCode, a benchmark testing AI coding agents on real-world maintainability. The top model scores just 13% on its hardest