MathArena (source code)

= MathArena
{c}

https://matharena.ai/

This project tests various models against various competitions.

How they "ensure" that models are not contaminated:
>  By evaluating models as soon as new problems are released, we effectively eliminate the risk of contamination

Most of their problems come from <high school knowledge olympiads> and they are therefore completely irrelevant for 2025 <LLMs>.