Source: cirosantilli/cia-2010-covert-communication-websites/html-title-element

= HTML title element

The discovery of one of the <possible HTML information leaks> in the HTML `<title>` of https://web.archive.org/web/20110207150839/http://webofcheer.com/[webofcheer.com], which is cryptically set to:
``
pg1c
``
motivated us to download all the HTML and grep through it.

We started grepping with:
``
grep -ai '<title>' */index.html
``
and, to get just the titles for visual inspection:
``
grep -ahi '<title>' */index.html | sed -r 's/^\s*<title>//;s/<\/title>.*//'
``
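
To spot repeated or placeholder titles more quickly than by eye, the same pipeline can be extended to count how often each title occurs. This is only a sketch: the function name `title_counts` is hypothetical, and it assumes GNU grep and sed and that each `<title>` starts its own line, as in the command above.

```shell
# Hypothetical helper: count how often each extracted title occurs,
# most frequent first. Takes the index pages to inspect as arguments.
title_counts() {
  grep -ahi '<title>' "$@" \
    | sed -r 's/^\s*<title>//;s/<\/title>.*//' \
    | sort \
    | uniq -c \
    | sort -rn
}
```

It would be called as `title_counts */index.html` from the directory that holds one subdirectory per domain.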

Some mildly interesting facts include:
* opensourcenewstoday.com is titled just "Title":
  ``
  opensourcenewstoday.com/index.html:<title>Title</title>
  ``
* a few sites are titled "Untitled Document" e.g.:
  ``
  media-coverage-now.com/index.html:<title>Untitled Document</title>
  newsandsportscentral.com/index.html:  <title>Untitled Document</title>
  newsincirculation.com/index.html:<title>Untitled Document</title>
  newsworldsite.com/index.html:<title>Untitled Document</title>
  primetimemovies.net/index.html:<title>Untitled Document</title>
  unganadormundial.com/index.html:<title>Untitled Document</title>
  ``
  This may have been the default title in Adobe Dreamweaver.
* some others have an empty title:
  ``
  aeronet-news.com/index.html:<title></title>
  al-rashidrealestate.com/index.html:             <title></title>
  arabicnewsunfiltered.com/index.html:<title></title>
  dailynewsandsports.com/index.html:<title></title>
  electronictechreviews.com/index.html:<title></title>
  indirectfreekick.com/index.html:<title></title>
  iran-newslink-today.com/index.html:<title></title>
  iraniangoals.com/index.html:<title></title>
  kickitnews.com/index.html:<title></title>
  mediocampodefutbol.com/index.html:<title></title>
  middle-east-newstoday.com/index.html:      <title></title>
  mygadgettech.com/index.html:<title></title>
  sayaara-auto.com/index.html:<title></title>
  techwatchtoday.com/index.html:<title></title>
  the-open-book-online.com/index.html:<title></title>
  thenewsofpakistan.com/index.html:<title></title>
  theworld-news.net/index.html:<title></title>
  todaysengineering.com/index.html:<title></title>
  todaysnewsreports.net/index.html:<title></title>
  worldnewsandent.com/index.html:<title></title>
  ``
* some others are titled just "index" or a variant of it:
  ``
  all-sport-headlines.com/index.html:<title>index</title>
  europeannewsflash.com/index.html:<title>Index</title>
  fgnl.net/index.html:<title>Index Page</title>
  iraniangoalkicks.com/index.html:<title>index</title>
  just-the-news.com/index.html:<title>index</title>
  mide-news.com/index.html:<title>index</title>
  mytravelopian.com/index.html:<title>Index</title>
  noticiasdelmundolatino.com/index.html:<title>index</title>
  pakcricketgrd.com/index.html:  <title>index</title>
  pangawana.com/index.html:<title>index</title>
  sportsnewsfinder.com/index.html:<title>index</title>
  thenewseditor.com/index.html:<title>index</title>
  turkishnewslinks.com/index.html:<title>index2</title>
  wahidfutbol.com/index.html:<title>index</title>
  webscooper.com/index.html:<title>index</title>
  webworldsports.com/index.html:<title>index</title>
  ``
* a few don't have `<title>` at all:
  ``
  b2bworldglobal.com/index.html
  bailandstump.com/index.html
  businessexchangetoday.com/index.html
  commercialspacedesign.com/index.html
  court-masters.com/index.html
  flyingtimeline.com/index.html
  marketflows.net/index.html
  nouvellesetdesrapports.com/index.html
  senderosdemontana.com/index.html
  sixty2media.com/index.htm
  ``
It is impossible to tell whether these were oversights or intentional attempts to simulate common web development quirks. But they are cute in any case.
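
A list of pages without any `<title>` can be produced with grep's `-L` option, which prints the names of files that contain no matching line. This is a minimal sketch and not necessarily the command that was originally used; the function name `untitled_pages` is hypothetical.

```shell
# Hypothetical helper: print the index pages that contain no <title> tag.
# -a treats binary-looking files as text, -i ignores case,
# -L lists only the files that have no matching line.
untitled_pages() {
  grep -aiL '<title>' "$@"
}
```

It would be called as `untitled_pages */index.html`.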