OurBigBook About$ Donate
 Sign in Sign up

Cosmopedia

Ciro Santilli (@cirosantilli, 37) ... Website Website genre Collaborative writing platform Wiki Type of wiki LLM generated wiki
Updated 2025-07-16  0 By others on same topic  0 Discussions Create my own version
  • github.com/huggingface/cosmopedia
  • huggingface.co/datasets/HuggingFaceTB/cosmopedia
Cosmopedia is a dataset of synthetic textbooks, blog posts, stories, posts and WikiHow articles generated by Mixtral-8x7B-Instruct-v0.1. The dataset contains over 30 million files and 25 billion tokens, which made it the largest open synthetic dataset at the time of its release.
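To put those two figures in proportion, here is a rough back-of-the-envelope sketch of the average document length. It assumes the quoted lower bounds (30 million files, 25 billion tokens) as if they were exact counts, so the result is only an order-of-magnitude estimate; the variable names are illustrative.

```python
# Rough estimate only: the article quotes "over" 30 million files and
# 25 billion tokens, so both inputs are lower bounds, not exact counts.
num_files = 30_000_000
num_tokens = 25_000_000_000

# Average tokens per generated file under that assumption.
avg_tokens_per_file = num_tokens / num_files

print(f"about {avg_tokens_per_file:.0f} tokens per file on average")
```

Under these assumptions a typical file is on the order of several hundred tokens, i.e. closer to a short article or lesson section than to a full textbook.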
