Cool deeptech ones:
Boring ones:
International ones with a British presence:
ARC-AGI visualization by Ciro Santilli 37 Created 2025-10-14 Updated 2025-10-18
www.kaggle.com/code/allegich/arc-agi-2025-visualization-all-1000-120-tasks contains plots of all questions and answers. It is truly very convenient.
LeanAgent by Ciro Santilli 37 2025-10-14
They do have a database system which is interesting.
We introduce Putnam-AXIOM, a benchmark of 522 university-level competition problems drawn from the prestigious William Lowell Putnam Mathematical Competition, and Putnam-AXIOM Variation, an unseen companion set of 100 functional variants generated by programmatically perturbing variables and constants.
MathArena by Ciro Santilli 37 2025-10-14
This project tests various models against various competitions.
How they "ensure" that models are not contaminated:
By evaluating models as soon as new problems are released, we effectively eliminate the risk of contamination
Most of their problems come from high school knowledge olympiads and they are therefore completely irrelevant for 2025 LLMs.
LIGO by Ciro Santilli 37 2025-10-14
Video 1.
LIGO documentary by Advanced LIGO Documentary Project
. Source.

There are unlisted articles, also show them or only show them.