Previously called "bitcoin-strings-with-txids" since text was the initial focus, but Ciro Santilli decided to go for the more general name once images became more and more important to the project.
Set of scripts b Ciro Santilli, primarily created while researching Cool data embedded in the Bitcoin blockchain.
Bitcoin varint by Ciro Santilli 40 Updated 2025-07-16
Implementations:
GitHub is for newbs.
  • 50002f38a40aeca96f7d03ceac1c62fc233b44207af99df8f1daddf03f6ef61c via cryptograffiti.info contains a Python script that starts with:
    #!/usr/bin/env python3
    #
    # This file is placed in the public domain.
    #
    # CryptoGraffiti tool
    #
    # Requires python-bitcoinlib-v0.2.1
    #
    # https://github.com/petertodd/python-bitcoinlib
    #
    # pip install python-bitcoinlib
  • 209c9106c7261582f5d0907819c6e10dea670c273133047d911be41f8a42d86f via cryptograffiti.info contains a Base64 encoded Python script starting in:
    #!/usr/bin/env python
    # brainwallet "base58"
    # v2015-05-18, fixed Tor DNS problem
    import binascii
    import hashlib
    Some related ones:
    • 25658f625c8f3964593b9e3c632040cb69aea9cf24403af33ab173d7cba7c42f
    • 7d188bd499137b5a0d68271ef8a4f3c4dc2f2b38bd03dfc913cb2b0be15b1e0d
Cosmopedia by Ciro Santilli 40 Updated 2025-07-16
Cosmopedia is a dataset of synthetic textbooks, blogposts, stories, posts and WikiHow articles generated by Mixtral-8x7B-Instruct-v0.1.The dataset contains over 30 million files and 25 billion tokens, making it the largest open synthetic dataset to date.

There are unlisted articles, also show them or only show them.