The remedy to cowardice, inattention, censorship and amorality.
Due to Ciro Santilli's campaign for freedom of speech in China, Ciro Santilli maintains information on this mostly at:
Dan Dascalescu's "Web page archiving" comparison table: web.archive.org/web/20130922192354/http://wiki.dandascalescu.com/reviews/online_services/web_page_archiving
- drive.google.com/file/d/1JTPVd09NPaGH-KzGv2jU3XXcFiJAoUjw/view some crazy dude investigating, let's see how long until it goes down. Posted at: Points to: "Alex Conferno" is also brought up: twitter.com/conferno
- www.reddit.com/r/DataHoarder/comments/12trawt/has_anyone_ever_actually_spoken_to_denis_petrov/
- gyrovague.com/2023/08/05/archive-today-on-the-trail-of-the-mysterious-guerrilla-archivist-of-the-internet/. Trended on Hacker News: news.ycombinator.com/item?id=37009598
- gigazine.net/gsc_news/en/20240326-archive-today/
Other mentions of "Denis Petrov":
Previously called "Lending Library" it seems: help.archive.org/hc/en-us/articles/360016554912-Borrowing-From-The-Lending-Library
You can borrow online books from them for a few hours/days: help.archive.org/hc/en-us/articles/360016554912-Borrowing-From-The-Lending-Library This is the most amazing thing ever made!!! You can even link to specific pages, e.g. archive.org/details/supermenstory00murr/page/80/mode/2up
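The page-link pattern above can be sketched as a tiny helper. This is only an illustration inferred from the single example URL in the text: the function name `book_page_url` is hypothetical, and the assumption that other items follow the same `/page/<n>/mode/<mode>` layout is not confirmed here.

```python
def book_page_url(identifier, page, mode="2up"):
    """Build a deep link to a page of an Internet Archive book.

    Assumption: the URL layout is generalized from the one example
    archive.org/details/supermenstory00murr/page/80/mode/2up
    """
    return f"https://archive.org/details/{identifier}/page/{page}/mode/{mode}"


if __name__ == "__main__":
    print(book_page_url("supermenstory00murr", 80))
```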
They seem to have a separate URL with the same content as well for some reason: openlibrary.org/, classic messy Internet Archive style.
Bastards are suing them: www.theverge.com/2020/6/1/21277036/internet-archive-publishers-lawsuit-open-library-ebook-lending The plaintiffs are Hachette, Penguin Random House, Wiley, and HarperCollins.
It is quite hard to decide if an upload is from the official legal lending library, or just some illegal upload, e.g.:
- archive.org/details/TheGoogleStory likely illegal
- archive.org/details/isbn_9780385342728 likely legal
so the URLs are basically the same style. Some legality indicators:
- Access-restricted-item: true
- present in the collection: archive.org/details/internetarchivebooks?tab=about
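The two indicators could be checked mechanically along these lines. This is a rough sketch, not a verified recipe: it assumes the item metadata has already been fetched as JSON (e.g. from archive.org/metadata/<identifier>) and that the indicator fields live under a `metadata` key as `access-restricted-item` and `collection`. The function name and the sample dictionaries are hypothetical and do not describe any real item.

```python
LENDING_COLLECTION = "internetarchivebooks"


def lending_indicators(item):
    """Return the two legality indicators for a parsed item-metadata dict.

    Assumption: `item` looks like {"metadata": {...}}, where "collection"
    may be either a single string or a list of strings.
    """
    md = item.get("metadata", {})
    restricted = str(md.get("access-restricted-item", "")).lower() == "true"
    collections = md.get("collection", [])
    if isinstance(collections, str):
        collections = [collections]
    return {
        "access_restricted": restricted,
        "in_lending_collection": LENDING_COLLECTION in collections,
    }


if __name__ == "__main__":
    # Hypothetical sample data, not taken from a real item.
    sample = {"metadata": {"access-restricted-item": "true",
                           "collection": ["internetarchivebooks", "inlibrary"]}}
    print(lending_indicators(sample))
```

Neither indicator proves anything on its own; they only make the manual check faster.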
Feature added in 2019 apparently: www.reddit.com/r/DataHoarder/comments/dj6ot5/you_can_now_save_a_screenshot_of_your_saved_pages/
github.com/ourbigbook/template/archive/refs/heads/master.zip
But TODO: how to access the screenshot afterwards?
archive.org/details/toomanyrequests_20191110 says 15 archives / minute, but apparently also 15 retrievals per minute on Wikipedia, after which 5 min blacklist. After that, you start getting some 429s, and after that, server refuses to connect at all.
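To stay under a limit like the 15 requests per minute mentioned above, a client-side sliding-window limiter is one option. The sketch below is only an illustration: the class name is made up, the defaults simply mirror the numbers quoted in the text, and the server's real policy may differ.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most `max_calls` calls in any window of `period` seconds."""

    def __init__(self, max_calls=15, period=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self.clock = clock    # injectable so the limiter can be tested
        self.sleep = sleep
        self.calls = deque()  # timestamps of recent calls, oldest first

    def wait(self):
        """Block until one more call is allowed, then record it."""
        while True:
            now = self.clock()
            # Forget calls that have left the window.
            while self.calls and now - self.calls[0] >= self.period:
                self.calls.popleft()
            if len(self.calls) < self.max_calls:
                self.calls.append(now)
                return
            # Window is full: sleep until the oldest call expires.
            self.sleep(self.period - (now - self.calls[0]))
```

Usage would be to call `limiter.wait()` right before each save or fetch request, so the 16th request within a minute is delayed instead of triggering the blacklist.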
CDX: no limits apparently, they might just throttle you? Made 10k requests in a Bash loop and it was going fine. But note that if you get blacklisted by the create/fetch request blacklist, the server fails to connect here as well.
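For reference, a CDX query is just a GET request with a few query parameters, so building the URL is straightforward. The sketch below only constructs the URL and sends nothing. The endpoint and the `url`, `matchType`, `output` and `limit` parameters are taken from the Wayback CDX server documentation rather than from these notes, and the helper name is hypothetical.

```python
from urllib.parse import urlencode

CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def cdx_query_url(url, match_type="exact", limit=None, output="json"):
    """Build a Wayback CDX query URL (no request is sent).

    match_type is one of the documented values:
    "exact", "prefix", "host" or "domain".
    """
    params = {"url": url, "matchType": match_type, "output": output}
    if limit is not None:
        params["limit"] = limit
    return CDX_ENDPOINT + "?" + urlencode(params)


if __name__ == "__main__":
    print(cdx_query_url("example.com", limit=5))
```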
- archive.org/post/1055220/how-to-query-for-all-the-websites-that-end-in-combr
- archive.org/details/WebArchiveDomainFiles only a random set of per-ccTLD lists, made upon request of (presumably paying) partners. As of 2023 only contains the Netherlands: archive.org/details/Dotnl-2016-present-domains-in-wayback-domainyear-of-last-capture