Project Euler as an AI benchmark Created 2025-03-24 Updated 2026-01-30
The beauty of Project Euler is that it would serve both as a AI code generation benchmark and as an AI Math benchmark!
Verina 2025-12-13
AI code generation benchmark in which part of the benchmark includes producing a formal Lean proof of the implementation. Sweet.