OurBigBook About$ Donate
 Sign in Sign up

 BigCodeBench

ID: bigcodebench

BigCodeBench by Ciro Santilli 37 Created 2025-03-20 Updated 2025-07-16
  • github.com/bigcode-project/bigcodebench
  • bigcode-bench.github.io/
  • arxiv.org/abs/2406.15877
Their most interesting subset, the -hard one, appears to be available at: huggingface.co/datasets/bigcode/bigcodebench-hard in Parquet format. OMG why.
The tests make free use of the Python standard library and of major external libraries, e.g. huggingface.co/datasets/bigcode/bigcodebench-hard/viewer/default/v0.1.0_hf?views%5B%5D=v010_hf&row=0 uses ftplib. Kind of cool.
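To give a feel for how a test can exercise ftplib code without a real server, here is a minimal sketch using unittest.mock from the standard library. It is not copied from the dataset: the function list_remote_files, the host name and the file names are all hypothetical, and whether the linked row mocks ftplib in this way is an assumption.

```python
import ftplib
import unittest
from unittest import mock


def list_remote_files(host, user, password, directory):
    """Hypothetical function under test: list the file names in an FTP directory."""
    ftp = ftplib.FTP(host)
    try:
        ftp.login(user, password)
        ftp.cwd(directory)
        return ftp.nlst()
    finally:
        ftp.quit()


class TestListRemoteFiles(unittest.TestCase):
    # Replace ftplib.FTP with a mock so that no network connection is made.
    @mock.patch("ftplib.FTP")
    def test_lists_files_without_network(self, mock_ftp_cls):
        fake_ftp = mock_ftp_cls.return_value
        fake_ftp.nlst.return_value = ["a.txt", "b.txt"]

        result = list_remote_files("ftp.example.com", "user", "pw", "/data")

        self.assertEqual(result, ["a.txt", "b.txt"])
        # Check that the function drove the FTP client as expected.
        mock_ftp_cls.assert_called_once_with("ftp.example.com")
        fake_ftp.login.assert_called_once_with("user", "pw")
        fake_ftp.cwd.assert_called_once_with("/data")
        fake_ftp.quit.assert_called_once_with()
```

The test passes or fails based only on the return value and on the calls recorded by the mock, so it can run in a sandbox with no FTP server.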
They even test graph plotting? huggingface.co/datasets/bigcode/bigcodebench-hard/viewer/default/v0.1.0_hf?views%5B%5D=v010_hf&row=11 How do they evaluate that?
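One plausible way to evaluate a plot, not verified against the linked row, is to have the solution return the matplotlib Axes object and let the test inspect its properties instead of comparing images. A minimal sketch under that assumption, where plot_squares and its labels are hypothetical:

```python
import unittest

import matplotlib

matplotlib.use("Agg")  # headless backend, no display needed
import matplotlib.pyplot as plt


def plot_squares(n):
    """Hypothetical task solution: plot i**2 for i in range(n) and return the Axes."""
    fig, ax = plt.subplots()
    xs = list(range(n))
    ax.plot(xs, [x * x for x in xs])
    ax.set_title("Squares")
    ax.set_xlabel("x")
    ax.set_ylabel("x squared")
    return ax


class TestPlotSquares(unittest.TestCase):
    def test_axes_contents(self):
        ax = plot_squares(4)
        # Inspect the metadata and plotted data rather than rendered pixels.
        self.assertEqual(ax.get_title(), "Squares")
        self.assertEqual(ax.get_xlabel(), "x")
        line = ax.get_lines()[0]
        self.assertEqual(list(line.get_ydata()), [0, 1, 4, 9])
        plt.close(ax.figure)
```

Checking titles, labels and data arrays keeps the test deterministic across matplotlib versions and platforms, which pixel comparison would not.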

Content license: CC BY-SA 4.0 unless noted