OurBigBook About$ Donate
 Sign in+ Sign up

Ciro Santilli @cirosantilli 37

 Message
User's profile image

 Incoming links: Python standard library

BigCodeBench  Updated 2025-05-13  +Created 2025-03-20
 View more
  • github.com/bigcode-project/bigcodebench
  • bigcode-bench.github.io/
  • arxiv.org/abs/2406.15877
Their most interesting subset, the -hard one, appears to be present at: huggingface.co/datasets/bigcode/bigcodebench-hard in Parquet format. OMG why.
The tests make free usage of the Python standard library and other major external libraries, e.g. huggingface.co/datasets/bigcode/bigcodebench-hard/viewer/default/v0.1.0_hf?views%5B%5D=v010_hf&row=0 uses FTPlib. Kind of cool.
They even test graph plotting? huggingface.co/datasets/bigcode/bigcodebench-hard/viewer/default/v0.1.0_hf?views%5B%5D=v010_hf&row=11 How does it evaluate?
 Read the full article
Total articles: 1
 About$ Donate Content license: CC BY-SA 4.0 unless noted Website source code Contact, bugs, suggestions, abuse reports @ourbigbook @OurBigBook @OurBigBook