Ciro Santilli @cirosantilli 37

 Incoming links: PHP

Bitcoin Forum Updated 2025-07-16

Good article about its history: en.bitcoin.it/wiki/BitcoinTalk

Founded by Satoshi Nakamoto, making it the earliest and one of the most important Bitcoin communities. TODO official in any way? Who founded it?

Instance of Simple Machines Forum, an open source, PHP-based forum system.

Some notable appearances:

in 2010, it is where Laszlo's pizzas offer was announced
it was used e.g. on the Mt. Gox investigation: youtu.be/tJ-TsrK6SuY?t=2018
Jimmy Zhong's investigation: youtu.be/pxvd1YOMGxU?t=1004

 Read the full article

CIA 2010 covert communication websites / HTML analysis Created 2025-05-07 Updated 2025-07-16

 View more

The HTML from the index page of Wayback Machine were:

dumped at: github.com/cirosantilli/media/tree/master/cia-2010-covert-communication-websites/html
downloaded with: github.com/cirosantilli/media/tree/master/cia-2010-covert-communication-websites/download-html.sh. Note that there were many supurious errors notably:
OpenSSL SSL_connect: SSL_ERROR_SYSCALL in connection to web.archive.org:443
we just ran it multiple times until all errors were gone.

The best way to analyse the HTML is to grap our dumps from: github.com/cirosantilli/cia-2010-websites-dump.

Some possibly interesting searches include:

list all HTML comments, maybe something spicy was left over:
```
git grep '<!--'
```

search for weird file extensions:

git ls-files | grep -Ev '\.(jpg|gif|html|txt|png|css|php|js|jar|cgi|htm|swf|ico|JPG|class|zip|sf)'

have a look at the largest folers:
```
ncdu
```

Some of the HTML files contain conditional comments e.g. web.archive.org/web/20091023041107/http://aquaswimming.com/ contains:

<!--[if IE 6]> <link href="swimstyleie6.css" rel="stylesheet" type="text/css"> <![endif]-->

Varios of the non-English websites seem to have comments translating the content e.g.:

./noticiasmusica.net/20101230165001/index.html:<h2>Alguns dos Melhores Sites Nacionais</h2><!--some of the best national sites (in music)-->

This feels like it could be the translation helping the technical webdev team know what is what.

Many of the RSS frame pages use:

<base target="_blank" />

which is a weird HTML tag that would lead all links to open on new tabs, e.g. web.archive.org/web/20110202124411/http://thecricketfan.com/home.html.

Various websites have pages with .php extension. It feels likely that all websites were written in PHP.

Some sites use a feeds.php for the feeds, e.g. http://www.absolutebearing.net//absolutebearing_feeds/feeds.php?src=http%3A%2F%2Ffeeds2.feedburner.com%2FOceanyachtsinfo&desc=1

Some URLs existed both in HTML and .php extension, or were converted at some point:

allworldstatistics.com/20110207151941/comprehensivesources.html
allworldstatistics.com/20130818155225/comprehensivesources.php

A few of the PHP urls have weird IDs in them like omktf, juqwt and qlaqft:

./middle-east-newstoday.com/20100829004127/omktf/uirl.php?ok=461128
./newsandsportscentral.com/20100327130237/juqwt/eubcek.php?pe=747155
./pondernews.net/20100826031745/lldwg/qlaqft.php?fc=281298

we wonder what they mean.

A few separate websites have an archive with the same pid parameter:

fightwithoutrules.com/20131220205811/?pid=2POQ7BC1G/index.html
half-court.net/20131223165013/?pid=2POQ7BC1G/index.html
health-men-today.com/20131223002237/?pid=2POQ7BC1G/index.html
intlnewsdaily.com/20131221121441/?pid=2POQ7BC1G/index.html
intoworldnews.com/20131217193621/?pid=2POQ7BC1G/index.html

It is unclear what it means. All of them contain something like:

<html>
<head>
<meta name="robots" content="noarchive" />
<meta name="googlebot" content="nosnippet" />
</head>
<body>
<div align=center>
<h3>Error. Page cannot be displayed. Please contact your service provider for more details.  (11)</h3>
</div>
</body>
</html>

so looks like an archival artifact only.

The following two websites have a feeds.php system for their RSS:

./mydailynewsreport.com/20110211111053/myrss/feeds.php?src=http:/www.refahemelli.com/pashto/news/rss.php&chan=y&desc=1&targ=y&utf=y
./magneticfieldnews.com/20110208063545/magneticfeeds/feeds.php?src=http:/www.bbc.co.uk/pashto/index.xml&chan=y&desc=1&targ=y&utf=y

Some of the HTML uses attributes without quotes, which is legal, but very unusual nowadays:

soldiersofsouthasia.com/20110207203705/home.htm: <a href=http://www.rss-to-javascript.com

We can try to search for any link leaks by listing all domains linked to with:

git grep --no-color -I -h --no-line -o 'https?://[^/">?]+[/">?]' | sed -r 's/.$//' | sort | uniq -c | sort -nk1

The first thing that shows up is that there are some IPs linked to directly! But they seem to be the direct IPs of legitimate websites, we are not sure why IPs were used rather than domain names:

69.167.160.171 at web.archive.org/web/20110208053653/http://sa-michigan.com/ to web.archive.org/web/20100304122019/http://69.167.160.171/ marked with image "fantasyplayers.com", a legit website called Fantasy Players Network
69.94.11.53 at web.archive.org/web/20101229193800/http://newsresolution.net/ titled "International Tribunal for Rwanda" to web.archive.org/web/20101229193800/http://69.94.11.53/default.htm
74.125.77.132 mynepalnews.com Webalizer
194.165.154.66/index.php web.archive.org/web/20110129161937/http://icwb-news.com/ MiddleEast links to 194.165.154.66/index.php but that is an actual page: web.archive.org/web/20110529142501/http://194.165.154.66/index.php
200.55.6.87 at web.archive.org/web/20110128170204/http://noticiasdelmundolatino.com/ after clicking "Maps" tab entitled "Mapas en la red" to web.archive.org/web/20100329150648/http://200.55.6.87/es/index.htm
213.97.154.118 at web.archive.org/web/20120429042725/http://montanismoaventura.com/ entitled "Mallorca Verde" to web.archive.org/web/20120430191214/http://213.97.154.118/mallorcaverde/ The target is a bit weird and almost empty.
216.218.196.146 at entitled "AskTheDr.com" to web.archive.org/web/20070303080403/http://216.218.196.146/askthedr/index.htm

We can also get the full line for each with sorted by least common domains with the slow:

git grep --no-color -I -h --no-line -o 'https?://[^/">?]+[/">?]' | sed -r 's/.$//' | sort | uniq -c | sort -nk1 | awk '{if ($1 < 10) print $2}' | xargs -I{} git --no-pager grep -h --no-line -o '{}.*<' | tee tmp.log

We can search for all IP-like strings with:

git grep '[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}\b'

 Read the full article

OurBigBook.com / Alternatives Updated 2025-07-16

 View more

These are websites that offer somewhat overlapping services, many of which served inspirations, and why we think something different is needed to achieve our goals.

Notably, OurBigBook is the result of Ciro Santilli's experiences with:

Wikipedia
GitHub
Stack Exchange (or as non techies might point out, Urban Dictionary, or Quora before it was such an incomprehensible shitshow)

OurBigBook could be seen as a cross between those three websites.

Quick mentions:

handwiki.org/wiki/HandWiki:About: technically the same as Wikipedia, but with more aligned moderation policies
ecotext.co/ similar goals. Their website seems quite broken now though as of 2021, can't see text properly. Crunchbase entry: www.crunchbase.com/organization/ecotext says they are from Durham, New Hampshire, United States. Cannot see how to publish, curated material only? Twitter: twitter.com/ecotextinc?lang=en One of the founders: twitter.com/BigNel_21 | www.linkedin.com/in/ecotextnelsonthomas/. Their LinkedIn: www.linkedin.com/company/ecotext/people/
fiveable.me/ bad: separates students and teachers, as a student I don't see where to create my content. Good: focus on teaching university level stuff to people outside of university via Advanced Placement. Bad: Lots of video content. Bad: Can't see the issue tracker attached to each page.
LessWrong: their website system does have some similar feature sets to what we want. Reputation, Q&A sections, links between articles most likely, sort by upvote everywhere.
crowdpub.org collaborative writing website, somehow goes to paragraph level, TODO how they reconcile different authors? Closed beta as of writing, so hard to be sure. From quick presentation on beta website, appears to attempt to share revenue to authors proportionally to the size of their contribution. Some blockchain-based reputation. Meh.
TODO migrate all from: github.com/booktree/booktree/blob/master/alternatives.md
studynotes.ie/. Admin approval on everything. No ToC. Fixed tag list for university entry exams topics.
mindstone.com: there appears to be no sharing focus? File upload basesd? Not sure.
EverybodyWiki
looking for open source Confluence-alternatives is an interesting way to go:
- lists:
  - opensource.com/article/20/9/open-source-alternatives-confluence
  - www.nuclino.com/alternatives/confluence-alternative#ynaw
- BookStack:
  - fixed 3-level page hierarchy
  - writen in PHP
  - Markdown support: www.bookstackapp.com/docs/user/markdown-editor/
  - no source-level import-export apparently: www.bookstackapp.com/docs/admin/backup-restore/, youtu.be/WUvtzJfCAKE?t=904
  - WYSIWYG: www.bookstackapp.com/docs/user/wysiwyg-editor/ via TinyMCE
  - page content repeating: www.bookstackapp.com/docs/user/reusing-page-content/ (will be useful for course modelling)
github.com/shuding/nextra converts Markdown links to Next.js links. We should look into how it works.
zettelkasten.de/the-archive/ "The Archive" from zettelkasten.de/. Closed source. By German software engineer Christian Tietze twitter.com/ctietze?lang=en
LLM generated wiki e.g.:
- Kinnu
docs.tigyog.app/cli beautiful website, but doesn't achieve much. Has a Markdown upload mechanism. Ah, those newbs who think the average user will care about markup upload to DB... Oh, wait...
www.stuvia.com/en-gb/school/uk/oxford-university/physics. PDF uploads. In theory you have to own copyright: www.stuvia.com/en-gb/copyright/guidelines but it feels unlikely that most material was uploaded by the copyright owners. If those people are up, then why can't we? Maybe... Registred in the UK. People: some Dutch dudes:
- www.linkedin.com/in/jaapvannes
- www.linkedin.com/in/hugo-kuijzer-bb81a625
Project Xanadu: crazy overlaps, though that project is vaporware apparently?
Administrators of Project Xanadu have declared it superior to the World Wide Web, with the mission statement: "Today's popular software simulates paper. The World Wide Web (another imitation of paper) trivialises our original hypertext model with one-way ever-breaking links and no management of version or contents.

Static website-only alternatives:

quarto.org/
- Links have forced file scope:^[ref]
  [about](about.qmd) [about](about.qmd#section)
- WYSIWYG via RStudio
vitepress.dev. vitepress.dev/guide/markdown unmanaged internal links. Sample website: wiki.nikiv.dev/.

Conceptual:

The Final Encyclopedia: science fiction concept, but the name was reused by Paul Allen in a research project
second brain
collective intelligence

TODO:

pressbooks.directory/

 Read the full article