It can't be HTML crawl because presumably there wouldn't have been links to those websites? Presumably this is why Common Crawl doesn't seem to have any hits.
So they must have had some kind of DNS A record database?
Or would IPv4 sweep have worked, without the
Host
header with the CIA's setup?The same question also applies to the 2013 DNS Census. It has less hits, but still has many.
Whatever they did, we are so so glad that they did!
In this section we document the outcomes of more detailed inspection of both the communication mechanisms (JavaScript, JAR, swf) and HTML that might help to better fingerprint the websites.
Major projects can be seen at: Section "The most important projects done by Ciro Santilli".
These are some smaller projects that Ciro Santilli carried out. They are all either for fun, or misguided use of his time done by an younger self:
- small naughty stuff is listed at: Section "Ciro Santilli's naughty projects"
- Because Ciro cares about education, around 2014 he looked into markup languages and version control for books, before he noticed that this approach was useless and that ranking algorithms are all that matter:
- He implemented some large features and several smaller improvements.GitLab sent Ciro a free swag bottle later after they got funding on to thank him for his contributions: Figure 1. "Ciro Santilli in a dune lake in Jericoacoara, Brazil, with his GitLab bottle". He had to pay for the beach trip though.
Figure 1. Ciro Santilli in a dune lake in Jericoacoara, Brazil, with his GitLab bottle. - Markdown Style Guide
- karlcow/markdown-testsuite improvements: Ciro has implemented the test runner a few months before CommonMark left stealth mode and killed it instantaneously.At least MacFarlane was able to reuse part of the HTML normalizer he wrote, and he extracted the multi-engine comparison to: CommonMark Implementation Compare.Playing with this project has led Ciro to find and report many Markdown bugs/bad behavior on other software, e.g. GitHub and MultiMarkdown-4.
- isaacs/github public unofficial GitHub issue tracker: he has commented there so often that he was made a collaborator
- Node Express Sequelize Next.js realworld example app
- VCDVCD: value change dump command-line pretty printer!!! The type of thing that a billion dollar EDA tool vendor will never implement ;-)
0 time 1 counter_tb.clock 2 counter_tb.enable 3 counter_tb.out[1:0] 4 counter_tb.reset 5 counter_tb.top.out[1:0] 0 1 2 3 4 5 =========== 0 1 0 x 0 x 1 0 0 x 1 x 2 1 0 0 1 0 3 0 0 0 0 0 4 1 0 0 0 0 5 0 1 0 0 0
- Vim: sometimes Ciro want crazy and wasted his time with Vimscript:
- Vim Markdown: the owner
plasticboy
was really nice and made Ciro a collaborator for his contributions, notably a live ToC outline and the header mappings - Vundle Plugin Tester, which he used to start the testing system of Vim Markdown
- Vim Markdown: the owner
- Breakthrough Message: aliens!!! Creative/media project, powered by some Python scripts.
- making Google Maps reviews of places he's visited to help other people. Ciro's photos reached 1 million views in 2019: www.google.com/maps/contrib/106598607405640635523/photos (archive)
tx e3e37ed5c1de2631c147bd39429e42ff634e95b7d72423bc32d6c6b9d8eef8ee (2014-07-01):
For my first official Journal entry I've decided to archive some old poetry. Here are a few of the computational poems I've created using cyphers.
Shiemaa&Vincent.jpg
Message:TODO identify Shiemaa and Vincent.
"Even if we tried to do it on purpose, never would have we succeeded." My beloved Vincent.
tx 36d0d77acd760f0aa549b6b314f0c1e9690baa6bcc2d0f07ea9f3167f4a5ec99, block 318836 (2014-09-03)Cost: ~0.002 BTC ~ $0.77 at the time.
Several other interesting uploads were also made around block 318836 (September 2014):
RedRaven.jpg
bitfossil.org/e17b83234402d85f3a18207eec11bc5c4397f88aa880aae4fb7d15802806a971/index.htmEarth3Archive.jpg
bitfossil.org/ae8d3b46b934bedc363e11abe8c8607171994470957c286274f699a0b3a9bbd7/index.htmSkyEarth5Archive.jpg
bitfossil.org/ae8d3b46b934bedc363e11abe8c8607171994470957c286274f699a0b3a9bbd7/index.htm
BikeLady.jpg
Bike Lady by Allen Lee Vandever. tx 2c4b9497af8c0c0eb9383357b40c3de33dba0b4f481099a32719f2b9036da8e7, block 319927 (2014-09-10)
This seems to be a novel work uploaded by its creator artist Allen Vandever according to EMBII.[ref].
Arecibo_message.svg
Arecibo message on tx c6d2e535cd2ba4659e954a61198c66fd98c60f6475cf8ff92a404f3fe3a16c4b, block 337874 (2015-01-07)
An "artificially" colored visualization of the Arecibo message ripped from Wikipedia: en.wikipedia.org/wiki/File:Arecibo_message.svg (with attribution).
The cool thing about this image is that it highlights the striking parallels between the encoding of the Arecibo message with crypto graffiti, because in both cases people were creating undocumented new ways of communicating with strangers on a new medium in those early blockchain days.
The associated message contains the Arecibo message as ASCII 0's and 1's. When properly cut at the newlines, they draw the message as ASCII art, as the original Arecibo encoding intends, here's a version with the 0's replaced by spaces to make it more readabale:
1 1 1 1
1 1 1 1 1
1 1 1 1 1 11 1
1 1 1 1 1 1 1 1 1 1
11
11 1
11 1
1 1 1
11111
11 111 11 11
1 11 1
11 1 11 11 11 1
11111 11111 11111 11111
1 1
1 1
11111 11111
11 11 111 11
1 1 1
11 1 11 111 11 1
11111 11111 11111 11111
1 11 1
11
1 11 1
11111 11 11111
11
1 1 1
1 11 1
11 11 1
11 1 11
11 11
11 1 11
11 11 1
1 1 1
1 11 1
1 11 1
1 1 1
1 1 1
1 11
11 11
1 111 1 11
1 1
1 11111
1 1 111 1 1 11 11
1 111 1 111111
1 111 111 11 111
1 1 111 11
1 1 1 111111
1 1 1 11
1 11 11
111 1
111 1 1 1 1 1 1 1 1
111 1 1 1 1
1 1
11111
111111111
111 111
11 11
11 1 1 11
11 11 11 11
1 1 1 1 1 1
1 1 1 1 1 1
1 1 1 1
1 1 1
1 1
1 1 1
1111 11111 1 1111
He sleeps in a temple.jpg
PIA17563.jpg
. Source. Associated message:
NASA: A purple nebula, in honor of #Prince, who passed away today. Image: Crab #Nebula, as Seen by Herschel and #Hubble Image credit: ESA/Herschel/PACS/MESS Key Programme Supernova Remnant Team; #NASA, ESA and Allison Loll/Jeff Hester (Arizona State University) #PIA17563
Dr_Craig_Wright.jpg
. Source. Associated message:
The image is present e.g. at: www.kitguru.net/channel/jon-martindale/australian-man-claims-he-is-satoshi-nakamoto-bitcoin-creator/ It was inscribed about two months after Craig publicly claimed that he is Satoshi.
This is a relatively unusual AtomSea & EMBII upload as it does not have the common toplevel transaction, everything, text + image fits into a single transaction. This is perhaps why the image is relatively low resolution to have a smaller size.
YellowRobot.jpg
Yellow Robot on tx 67b2facfd8160d4fa11b02829b6387d07537b57a7a24f19b029b2a5ae7b81830, block 450516 (2017-01-29)
Photography by EMBII, original art by TODO.
The associated message reads:This is one of Ciro's favorite AtomSea & EMBII uploads. This is the cutest thing ever, and perfectly encapsules the "medium as an artform" approach to blockchain art. More Chiharu stalking at: ILoveYouMore.jpg.
Chiharu and I found this little yellow robot while exploring Chicago. It will be covered by tar or eventually removed but this tribute will remain. N 41.880778 E -87.629210
At twitter.com/EMBII4U/status/1615389973343268871 EMBII announced that he would be giving off shares of that image on a Bitcoin-based NFT sale system he's making called Sup!?, and in December 2023 gave some shares to Ciro Santilli. Amen.
Other possibly novel EMBII street photography:
- bitfossil.com/f2efd446475ad58a3ea808cc0f05a63c55cece9fced70d84799a1ffce5d307e4/index.htm "I will not Stop until the Finite becomes the Infinite."
Audio:
alien.wav
block 318638 bitfossil.org/a3a24d6ea01ce481a50346818b8977220687f3ba385838fe8894ce61c9718bbc/OneGiantLeapForMankind.mp3
at tx 4f5b25fa8021c67235423930580e69121aa0d2c2bb779f75139bf442f8dc7297 EMBII-indexed at 743f3286b00fc96c13db4b16d5aead8a1e059fee9ce775b1761be9be5bdc2501 and then indexed at: 0427ec598df38b7d7dc75721316c0bbdec54de4871e11aff8ea64f3717c07efbThe toplevel index does appear on Bitfossil: bitfossil.org/0427ec598df38b7d7dc75721316c0bbdec54de4871e11aff8ea64f3717c07efb/index.htm but the audio is not there as it was for Spock below, maybe a bug on upload/Bitfossil?Spock_Live_Long_And_Prosper.mp3
block 345858 bitfossil.org/1bc87dbff1ff5831287f62ac7cf95579794e4386688479bab66174963f9a4a0c/index.htm. Audio of Mr. Spock saying the Vulcan salute.OuterSpace.mp3
block 409471 bitfossil.org/c14c1bd862bab6269052bf0a2cda7a35940d7a2d9c3415d4fb8fb8dcb9394fae/ "Outer Space by embii 4MB Large file storage test Apertus 0.3.5-beta" OMG, I don't want to calculate how much it cost to upload this, it will make me sad.At twitter.com/EMBII4U/status/1655969645927563266 EMBII mentions that this inscription, made by him, is the largest inscription he knows of.TODO song composer/performer?- bitfossil.com/c2b170ff450f4529dfbd784e0cf5cdddaca494e67a243dd846c0a9450a5558af/ (2021-03-13) contains
Seikilos.mid
, a MIDI file
Interesting text:
- block 273522 bitfossil.org/70fd289901bae0409f27237506c330588d917716944c6359a8711b0ad6b4ce76/index.htm pi to 1000+ decimal digits:
- bitfossil.org/8522787e7e49f3f3b6a9f9e86bc30336d26a3acbaecc93809d2e8b4bb1c4d611/ "Antarctic Ice Cores Revised 800KYr CO2 Data" evidence for global warming
- bitfossil.org/ffa6893a70bcde9b940df9823e0f597f0b6cff964c78473c77db838655e1aeb5/ en.wikipedia.org/wiki/Laudato_si', global warming related
HTML pages:
- block 335290 bitfossil.org/0166db6053f1969c28de8b1f9a8fa4ec890cc4bdfee7602757993b306bb7f295/ JavaScript animated timer clock counting down until the start of the next year
- block 340379 bitfossil.org/062990d54045a9c316110fb713009d1313b2f64c4b216d66891c7284d6c1ca0e/ links to bitfossil.org/062990d54045a9c316110fb713009d1313b2f64c4b216d66891c7284d6c1ca0e/bong-ball.html and has a working JavaScript Pong
- block 328445
tom-signature.jpg
bitfossil.org/daa050bf8ac22752e40412c9265b4533f68ab8e6ed26d2db1eeee6710e7d9e4b/index.htm Unrendered HTML of:Likely an obituary for: Thomas L. Magliozzi. Images show fine though.- www.cartalk.com/content/tom-and-rays-bios-photos-2
- www.cartalk.com/content/rant-and-rave-36 "The New Theory of Learning" which agrees perfectly with backward design
- block 401648 bitfossil.com/31c5e5336512568e4a1deb4bbf0e57c3565c32094c0e1a118c48e7929ab49e35/bong-ball.html another one! This one is full-screen, and does not have JavaScript
alert
s :-) - block 401657 bitfossil.org/03cb74f270d498302d4dd9cbe82c090d801c8840ab6cb26b71d862489b981db8/ has a JavaScript Pac-Man
This is a standard way to embed images in HTML pages with the
img
tag.This is a good thing. It basically contains an entire website, with HTML and assets inside a single ZIP, and a little bit of metadata.
It is incomprehensible why browsers don't just implement it as they already have all the web part, and also ZIP stuff:
The situation is so sad. Ubuntu 21.04 doesn't come with a reader installed by default:
As of the 2020's, a slumbering giant.
But the pre-Internet impact of IBM was insane! Including notably:
- some of the most important business computers of the pre-personal computer era
- SQL
- IBM Generalized Markup Language, which is a predecessor to XML and HTML
Companies have been really slow to support SVG features in their browsers, and that is very saddening: medium.com/@michaelmangial1/introduction-to-scalable-vector-graphics-6450c03e8d2e
You can't drop SVG support for
canvas
until there's a way to run untrusted JavaScript on the browser!SVG does have some compatibility annoyances, notably SVG fonts. But we should as a society work to standardize and implement a fix those, the benefits of SVG are just too great!
Examples:
- svg/svg.svg a minimal somewhat sane SVG:
- if the
width
andheight
properties were not given, you get the default 300x150, which seems to be set in the SVG standard:
- if the
- how to add na SVG image to a HTML file:
- svg/svg.html: external image. The included file is svg/svg.svg.
- svg/inline.html: inline.
- svg/billion-laughs.svg
- svg/html.svg
- svg/triangle.svg
- svg/viewBox.svg: this attribute allows you to control the default SVG
svg width=
andheight=
while keeping the coordinates of the drawing untouched. If theviewBox
aspect ratio differs from the width/height ratio, you likely want to play withpreserveAspectRatio
, otherwise you would get white spaces by default on the generated image - CSS with SVG:
- svg/style.svg: inline CSS
- svg/style-external.svg: external CSS with:
<?xml-stylesheet type="text/css" href="svg.css" ?>
, see also: stackoverflow.com/questions/18434094/how-to-style-svg-with-external-css- svg/subdir/style-external.html: is the relative CSS relative to the HTML or to the SVG? Answer: to the SVG... OMG. So how to make it work reliably?
- svg/current-color.html and svg/current-color.svg: illustrates
fill="currentColor"
. Only works for inline SVG however... See also: stackoverflow.com/questions/13000682/how-do-i-have-an-svg-image-inherit-colors-from-the-html-document/13002311
- JavaScript with SVG:
- svg/defs.html hows how
defs
works- svg/defs-external.html tries to include external
defs
from svg/defs.svg, but that fails like everything else related to external SVGs
- svg/defs-external.html tries to include external
Old cheat on separate repo: web.
Now moving to either:
- separate files under: web-cheat/ for the boring stuff
- subsections under this section for the more exciting stuff!
Examples under:
This mini-project walks the category hierarchy Wikipedia dumps and dumps them in various simple formats, HTML being the most interesting!
Mathematics dump of Wikipedia CatTree
. Source. Do you know what is worse than XML? Pseudo XML: stackoverflow.com/questions/5558502/is-html5-valid-xml/39560454#39560454