You’ve Been Archived By The Internet’s “Wayback Machine” – Maybe

imageWe hear it said repeatedly, so it’s generally taken at face value – The Internet is forever; Once it’s out there, it’s out there forever; Everything posted online is there forever, even after it’s been deleted.

Forever, of course, is – until the end of time. I can’t really get my head around “until the end of time” – so, I’ll stick with “the foreseeable future”. That’s a concept I can work with.

Despite the fact that “it’s out there forever” is commonly believed – I’ve yet to see verifiable evidence that it’s true in all instances. In the short term – OK, I’ll buy into this. So should those who like to air opinion, perspectives and their dirty laundry (intentional, or not), on social sites like Facebook, Twitter, Tumblr, and so on.

But long term – as in “forever” – balderdash! I say this, having had the experience of writing and posting a series of articles to a blog in which I documented my experience in dealing with a “crazy neighbor”.

Having resolved the issue to my satisfaction (shining a light on aberrant behavior was worth the effort), I took the site down. Within 12 months, no references to the site, the neighbor (who was repeatedly named), or the issues brought to light, were indexed anywhere on the Internet. So much for “forever”, or even “the foreseeable future”, for that matter.

It’s fair to say though, that in the example I’ve used here, the situation was within my control. Just as deleting my Facebook page back in 2007 was within my control. Again, no references to this deleted page are available on the Internet. However, that page is still being stored on a Facebook server and is available to me – should I chose to access it.

On the other hand – references that are outside my control (or yours), are another matter. Let’s say, for example, that I choose to shut down this blog. As opposed to deleting the “crazy neighbor” blog, mentioned previously, which disappeared without a trace – Tech Thoughts would not disappear – it would leave traces – substantial traces at that. The Internet Archive’s “Wayback Machine” has made sure of that.

Quoting the site: “The Internet Archive Wayback Machine puts the history of the World Wide Web at your fingertips. Browse through over 150 billion web pages archived from 1996 to a few months ago.

To start surfing the Wayback, type in the web address of a site or page where you would like to start, and press enter. Then select from the archived dates available. The resulting pages point to other archived pages at as close a date as possible.”

Here’s an example –

The Wayback Machine has indexed this site (Tech Thoughts), 163 times …..

image

going back to November 10, 2007 – as shown below.

image

A capture dated December 3, 2008. A pretty gaudy theme but……..

imageimage

not quite as “in your face” as I progressed through my colorful phase……

image

before getting down to serious business.

image

The Internet Archive – of which the Wayback machine is just one component –  is full of surprises, and is definitely worth a visit. Some of the available resources include:

Moving Images – 635,268 movies.

Live Music Archive – 100,665 concerts.

Audio – 1,210,381 recordings.

Texts – 3,331,892 texts.

10 Comments

Filed under FaceBook, Interconnectivity, Recommended Web Sites

10 responses to “You’ve Been Archived By The Internet’s “Wayback Machine” – Maybe

  1. That’s very interesting, Bill. I looked up my old self-hosted blog and there it was in all its glory. lol I thought it was gone forever.

    • Hi Paul,

      Been meaning to write this up for a couple of years – finally got to it last week.

      Kind of makes the point – ya gotta be careful what you say in a blog. 🙂

      Best,

      Bill

  2. Mal

    Hey Bill,
    I wonder how much of our “internet history” ends up in some government database somewhere, where even an internet search can’t find it? Would not surprise me one bit if such a thing existed.
    Just a thought.
    Good article.
    Cheers

    • Hey Mal,

      It’s beyond dispute – everything published to the Internet is content scraped by governments. So, your thinking is on the money.

      Unknown to most people, is the existence of the so called Invisible Web (Deepnet/Deep Web/Undernet/Hidden Web), as well as the Dark Internet.

      From WikiMost of the Web’s information is buried far down on dynamically generated sites, and standard search engines do not find it. Traditional search engines cannot “see” or retrieve content in the deep Web—those pages do not exist until they are created dynamically as the result of a specific search. The deep Web is several orders of magnitude larger than the surface Web.

      Not all of this, of course, is related to the accumulation and storage of data by/for governments. The reality is, only a very small portion fits into this category – but, it’s there, none the less.

      It’s unfortunate that most users believe the Internet revolves around YouTube and other such services. If they only knew. 🙂

      Excellent comment – the content of which needs to be discussed more often.

      Best,

      Bill

  3. Thank you for the article Sir Bill. Actually we used wayback machine to trace some of the problem in the website on its early days. We used this tool to improve some of its contents or to be a reference to its prevoius owner of the domain for our clients. A great tool!

  4. delenn13

    Money and knowledge is power. But blackmail is even better.

    • Hey Delenn13,

      Absolutely! I live in a downtown area where there are lots of University students – including one young woman who lives in a house beside me. Generally, she’s pretty good with her occasional parties – not too loud. Pretty respectful of others.

      But, last night, things got a little out of hand so, I walked next door with a vid cam and started shooting. Didn’t even get 30 seconds in when the noise dropped below the level of a cemetery. So yeah, “blackmail” has its place. 🙂

      Best,

      Bill