AlphaProof Created 2025-04-24 Updated 2025-10-14
deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/
AI achieves silver-medal standard solving International Mathematical Olympiad problems
Art of Problem Solving (school) Created 2024-07-12 Updated 2025-07-16
Strongly focused on the International Mathematical Olympiad; notably, they maintain solutions to all problems at: artofproblemsolving.com/wiki/index.php/IMO_Problems_and_Solutions
FrontierMath Created 2025-02-11 Updated 2025-10-14
Paper: arxiv.org/abs/2411.04872
arstechnica.com/ai/2024/11/new-secret-math-benchmark-stumps-ai-models-and-phds-alike/ mentions what the official website fails to state clearly:
The design of FrontierMath differs from many existing AI benchmarks because the problem set remains private and unpublished to prevent data contamination
So yeah, fuck off.
The expected answer for every problem is a single, possibly ridiculously large, integer, which is kind of a cool approach. Similar to Project Euler in that respect.
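Since each answer is a single integer, grading can in principle be done by exact comparison, with no proof checking. The following is a minimal Python sketch of that idea only. The helpers `parse_answer` and `is_correct` are hypothetical names made up for illustration, and this is not FrontierMath's actual grading code, which is not described here.

```python
def parse_answer(text: str) -> int:
    """Parse a submitted answer as an exact integer (hypothetical helper)."""
    # Tolerate surrounding whitespace and common digit separators.
    cleaned = text.strip().replace(",", "").replace("_", "")
    # Python integers are arbitrary precision, so huge answers stay exact.
    return int(cleaned)


def is_correct(submitted: str, expected: int) -> bool:
    """Return True only on an exact integer match.

    Anything that does not parse as an integer counts as wrong.
    """
    try:
        return parse_answer(submitted) == expected
    except ValueError:
        return False
```

For example, `is_correct("1,000", 1000)` returns `True`, while `is_correct("3.14", 3)` returns `False` because the submission is not an integer. Exact matching is what makes a large integer answer hard to guess but trivial to verify.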
The most interesting aspect of this benchmark is the difficulty. Mathematical olympiad coach Evan Chen comments: [ref]
Problems in [the International Mathematical Olympiad] typically require creative insight while avoiding complex implementation and specialized knowledge [but for FrontierMath] they keep the first requirement, but outright invert the second and third requirement
International Mathematical Olympiad Created 2024-07-12 Updated 2025-10-14