OurBigBook About$ Donate
 Sign in+ Sign up
by Ciro Santilli (@cirosantilli, 37)

Humanity's Last Exam (2025)

 ... Generative AI by modality AI text generation Text-to-text model Large language model LLM benchmark List of LLM benchmarks
 0 By others on same topic  0 Discussions  Updated 2025-05-23  +Created 2025-05-21  See my version
Tags: AI Math benchmark
Contains highly specialized questions in various academic fields, including mathematics. The problems are answered either with a number, or multiple choice, or free text.
  • arxiv.org/abs/2501.1424
  • huggingface.co/datasets/cais/hle
  • agi.safe.ai/

 Ancestors (15)

  1. List of LLM benchmarks
  2. LLM benchmark
  3. Large language model
  4. Text-to-text model
  5. AI text generation
  6. Generative AI by modality
  7. Generative AI
  8. AI by capability
  9. Artificial intelligence
  10. Machine learning
  11. Computer
  12. Information technology
  13. Area of technology
  14. Technology
  15.  Home

 View article source

 Discussion (0)

+ New discussion

There are no discussions about this article yet.

 Articles by others on the same topic (0)

There are currently no matching articles.
  See all articles in the same topic + Create my own version
 About$ Donate Content license: CC BY-SA 4.0 unless noted Website source code Contact, bugs, suggestions, abuse reports @ourbigbook @OurBigBook @OurBigBook