MathArena
= MathArena
{c}
https://matharena.ai/
This project benchmarks <LLMs> on problems from recently held math competitions.
How they "ensure" that models are not contaminated:
> By evaluating models as soon as new problems are released, we effectively eliminate the risk of contamination
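The idea behind that claim can be sketched as a date filter: a problem can only count as uncontaminated for a given model if it was published after that model was released. The snippet below is a minimal illustration under that assumption and is not MathArena's actual code. The `Problem` class, the `uncontaminated` function, and all names and dates are hypothetical.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Problem:
    # Hypothetical record: a problem name and the day it became public.
    name: str
    released: date


def uncontaminated(problems, model_released):
    """Keep only problems published strictly after the model was released."""
    return [p for p in problems if p.released > model_released]


# Made-up example data, for illustration only.
problems = [
    Problem("old-olympiad-problem", date(2024, 2, 1)),
    Problem("new-olympiad-problem", date(2025, 2, 6)),
]
model_released = date(2025, 1, 20)

fresh = uncontaminated(problems, model_released)
print([p.name for p in fresh])
```

Note that the filter only rules out leakage of the exact problems. It says nothing about near-duplicates of older problems that may already be in the training data.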
Most of their problems come from <high school knowledge olympiads> and are therefore completely irrelevant for 2025 <LLMs>.