The discovery of a possible information leak in the HTML <title> of webofcheer.com, which is cryptically set to:
pg1c
motivated us to download the HTML of every site and grep through it.
We started grepping with:
grep -ai '<title>' */index.html
and to just get the titles alone for visual inspection:
grep -ahi '<title>' */index.html | sed -r 's/^\s*<title>//;s/<\/title>.*//'
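To spot the most common titles at a glance, the same pipeline can be extended into a frequency tally. The following is a minimal sketch, assuming GNU grep and sed and the one-directory-per-site layout used above; the function name title_counts is hypothetical and not part of the original investigation.

```shell
# Hypothetical helper: tally <title> values across the given HTML files.
# Assumes GNU grep and GNU sed (for -r and \s), as in the commands above.
title_counts() {
  grep -ahi '<title>' "$@" \
    | sed -r 's/^\s*<title>//;s/<\/title>.*//' \
    | sort \
    | uniq -c \
    | sort -rn   # most frequent titles first
}
```

For example, `title_counts */index.html | head` would list the most repeated titles first. Like the original sed expression, this only strips a <title> tag that starts its line.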
Some mildly interesting facts include:
  • opensourcenewstoday.com is titled just "Title":
    opensourcenewstoday.com/index.html:<title>Title</title>
  • a few sites are titled "Untitled Document", e.g.:
    media-coverage-now.com/index.html:<title>Untitled Document</title>
    newsandsportscentral.com/index.html:  <title>Untitled Document</title>
    newsincirculation.com/index.html:<title>Untitled Document</title>
    newsworldsite.com/index.html:<title>Untitled Document</title>
    primetimemovies.net/index.html:<title>Untitled Document</title>
    unganadormundial.com/index.html:<title>Untitled Document</title>
    This may have been the default title in Adobe Dreamweaver.
  • some others have an empty title:
    aeronet-news.com/index.html:<title></title>
    al-rashidrealestate.com/index.html:             <title></title>
    arabicnewsunfiltered.com/index.html:<title></title>
    dailynewsandsports.com/index.html:<title></title>
    electronictechreviews.com/index.html:<title></title>
    indirectfreekick.com/index.html:<title></title>
    iran-newslink-today.com/index.html:<title></title>
    iraniangoals.com/index.html:<title></title>
    kickitnews.com/index.html:<title></title>
    mediocampodefutbol.com/index.html:<title></title>
    middle-east-newstoday.com/index.html:      <title></title>
    mygadgettech.com/index.html:<title></title>
    sayaara-auto.com/index.html:<title></title>
    techwatchtoday.com/index.html:<title></title>
    the-open-book-online.com/index.html:<title></title>
    thenewsofpakistan.com/index.html:<title></title>
    theworld-news.net/index.html:<title></title>
    todaysengineering.com/index.html:<title></title>
    todaysnewsreports.net/index.html:<title></title>
    worldnewsandent.com/index.html:<title></title>
  • some others are titled just "index" or a variant of it:
    all-sport-headlines.com/index.html:<title>index</title>
    europeannewsflash.com/index.html:<title>Index</title>
    fgnl.net/index.html:<title>Index Page</title>
    iraniangoalkicks.com/index.html:<title>index</title>
    just-the-news.com/index.html:<title>index</title>
    mide-news.com/index.html:<title>index</title>
    mytravelopian.com/index.html:<title>Index</title>
    noticiasdelmundolatino.com/index.html:<title>index</title>
    pakcricketgrd.com/index.html:  <title>index</title>
    pangawana.com/index.html:<title>index</title>
    sportsnewsfinder.com/index.html:<title>index</title>
    thenewseditor.com/index.html:<title>index</title>
    turkishnewslinks.com/index.html:<title>index2</title>
    wahidfutbol.com/index.html:<title>index</title>
    webscooper.com/index.html:<title>index</title>
    webworldsports.com/index.html:<title>index</title>
  • a few don't have a <title> at all:
    b2bworldglobal.com/index.html
    bailandstump.com/index.html
    businessexchangetoday.com/index.html
    commercialspacedesign.com/index.html
    court-masters.com/index.html
    flyingtimeline.com/index.html
    marketflows.net/index.html
    nouvellesetdesrapports.com/index.html
    senderosdemontana.com/index.html
    sixty2media.com/index.htm
It is impossible to tell whether these were oversights or were intentional, to simulate common web development quirks. But they are cute in any case.
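The empty-title and missing-title lists above can also be produced directly, rather than read off the full grep output. This is a rough sketch, assuming GNU grep; the function names missing_title and empty_title are hypothetical, and this is not necessarily how the lists above were obtained.

```shell
# Hypothetical helpers, assuming GNU grep.

# List files that contain no <title> tag at all
# (-L prints the names of files with no matching line).
missing_title() {
  grep -aLi '<title' "$@"
}

# List files whose <title> is present but empty or whitespace-only
# (-l prints the names of files with at least one matching line).
empty_title() {
  grep -ali '<title>[[:space:]]*</title>' "$@"
}
```

Both functions accept any list of files, so they can be run as `missing_title */index.html` or with a wider glob if some sites use a different front-page file name.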
