White Paper

FormulaOne: Measuring the Depth of Algorithmic Reasoning Beyond Competitive Programming

FormulaOne is a benchmark of 120 MSO-based (monadic second-order logic) dynamic programming problems on tree-like graphs, of which frontier models solve fewer than 1%.
