OurBigBook About$ Donate
 Sign in Sign up

MathArena

Ciro Santilli (@cirosantilli, 37) ... Computer Machine learning Artificial intelligence AI by capability Automated theorem proving Math AI benchmark
2025-10-14  0 By others on same topic  0 Discussions Create my own version
matharena.ai/
This project tests various models against various competitions.
How they "ensure" that models are not contaminated:
By evaluating models as soon as new problems are released, we effectively eliminate the risk of contamination
Most of their problems come from high school knowledge olympiads and they are therefore completely irrelevant for 2025 LLMs.

 Ancestors (10)

  1. Math AI benchmark
  2. Automated theorem proving
  3. AI by capability
  4. Artificial intelligence
  5. Machine learning
  6. Computer
  7. Information technology
  8. Area of technology
  9. Technology
  10.  Home

 Incoming links (1)

  • Project Euler as an AI benchmark

 View article source

 Discussion (0)

New discussion

There are no discussions about this article yet.

 Articles by others on the same topic (0)

There are currently no matching articles.
  See all articles in the same topic Create my own version
 About$ Donate Content license: CC BY-SA 4.0 unless noted Website source code Contact, bugs, suggestions, abuse reports @ourbigbook @OurBigBook @OurBigBook