Questions available to anyone under Hugging Face login / .zip with password, but you have to promise not to post them online. Lol. Either do the thing or don't.
Numerical solution:
367554579311Earliest known public leak: github.com/lucky-bai/projecteuler-solutions/issues/93
Programs:
Numerical solution:
1033654680825334184Earliest known public leak: github.com/lucky-bai/projecteuler-solutions/issues/87
Programs:
Numerical solution:
3575508Earliest known public leak:
Programs:
Not too exciting because of the high school knowledge olympiad level, but respectable.
- Every problem has one final integer answer:Also unlike Project Euler and like IMO, all only limited computations are required, i.e. you are not expected to do full blown program generation to reach a final answer. Which makes this further less exciting.
This section is about formalization efforts of specific fields of mathematics.
Numerical solution:
33626723890930Earliest known public leak:
Programs:
Numerical solution:
44754029Earliest known public leak: x.com/cirosantilli/status/1990363555309490585
Programs:
This one doesn't seem to exciting to be honest, but it might be useful. Sample question:and it expects the correct answer down to the cents:It should be noted that Project Euler has such "precision matters" problems.
53892.27
Even more than in other areas of benchmarking, in maths where you only have a right or wrong answer, and it is costly to come up with good sample problems, some benchmarks have adopted private test data sets.
The situation is kind of sad, in that ideally we should have open data sets and only test them on models that were trained on data exclusively published before the problem publish date.
However this is not practical for the following reasons:
There are unlisted articles, also show them or only show them.