Verina by Ciro Santilli 37 2025-12-13
AI code generation benchmark in which part of the benchmark includes producing a formal Lean proof of the implementation. Sweet.

New to topics? Read the docs here!