Previously called "bitcoin-strings-with-txids" since text was the initial focus, but Ciro Santilli decided to go for the more general name once images became more and more important to the project.
Set of scripts b Ciro Santilli, primarily created while researching Cool data embedded in the Bitcoin blockchain.
Bitcoin varint by Ciro Santilli 37 Updated 2025-07-16
Implementations:
GitHub is for newbs.
  • 50002f38a40aeca96f7d03ceac1c62fc233b44207af99df8f1daddf03f6ef61c via cryptograffiti.info contains a Python script that starts with:
    #!/usr/bin/env python3
    #
    # This file is placed in the public domain.
    #
    # CryptoGraffiti tool
    #
    # Requires python-bitcoinlib-v0.2.1
    #
    # https://github.com/petertodd/python-bitcoinlib
    #
    # pip install python-bitcoinlib
  • 209c9106c7261582f5d0907819c6e10dea670c273133047d911be41f8a42d86f via cryptograffiti.info contains a Base64 encoded Python script starting in:
    #!/usr/bin/env python
    # brainwallet "base58"
    # v2015-05-18, fixed Tor DNS problem
    import binascii
    import hashlib
    Some related ones:
    • 25658f625c8f3964593b9e3c632040cb69aea9cf24403af33ab173d7cba7c42f
    • 7d188bd499137b5a0d68271ef8a4f3c4dc2f2b38bd03dfc913cb2b0be15b1e0d
Cosmopedia by Ciro Santilli 37 Updated 2025-07-16
Cosmopedia is a dataset of synthetic textbooks, blogposts, stories, posts and WikiHow articles generated by Mixtral-8x7B-Instruct-v0.1.The dataset contains over 30 million files and 25 billion tokens, making it the largest open synthetic dataset to date.
eLearning provider by Ciro Santilli 37 Updated 2025-07-16
Video 1.
Infomercial: For-Profit Online University by Adult Swim
.
Suppose we specify:
  • a .dat file
  • the offset in bytes within that file
The question then is, which transaction is encoded at that position of the file?
This would allow us to index inscriptions in the .dat files directly with fast C tools, and then retrive the transaction ID to get cleaner data and metadata.
It should be possible if we managed to take the information from bitcoindev.network/understanding-the-data/ and dump into an indexed SQLite database.
I tried to start things off with LevelDBDumper:
LevelDBDumper -d ~/snap/bitcoin-core/common/.bitcoin/indexes/txindex -f btc.csv -q -o . -t csv
but that consumed all 64 GB of RAM on P51... github.com/mdawsonuk/LevelDBDumper/issues/15
But OK, nevermind that repo, it can be done easily with the LevelDB API of any language: bitcoin.stackexchange.com/questions/121888/what-is-the-data-format-layout-for-txindex-leveldb-values. Just the data seems wrong and we don't know why.
pciutils by Ciro Santilli 37 Updated 2025-07-16
Sotware project that provides lspci.

Unlisted articles are being shown, click here to show only listed articles.