Project Euler as an AI benchmark Created 2025-03-24 Updated 2025-10-14
The beauty of Project Euler is that it would serve both as a AI code generation benchmark and as an AI Math benchmark!
Verina 2025-12-13
AI code generation benchmark in which part of the benchmark includes producing a formal Lean proof of the implementation. Sweet.