Verina by Ciro Santilli 40 2025-12-13
AI code generation benchmark in which part of the benchmark includes producing a formal Lean proof of the implementation. Sweet.

New to topics? Read the docs here!