Thursday, January 22, 2015

The Vanishing Internet (And Why It Should Be Archived)

I’m currently writing an MA thesis, and as I start piling up the chapters and the footnotes, I find that perhaps a quarter of my citations either include URLs to web-hosted versions of sources that are also available in print, or point to sources that exist only on the web. In “The Cobweb”, an article in the Annals of Technology series in this week’s New Yorker, Jill Lepore notes that the problem with sources hosted on the web is that they don’t stick around as long as we think. “The web”, she writes, “dwells in a never-ending present.  It is - elementally - ethereal, ephemeral, unstable, and unreliable”.

Lepore talks about something called “link rot”.  Here’s an example from my own research.  Yesterday I was checking a secondary source where the author cited numerous references in the form of URLs hosted on Canada’s Department of National Defence servers.  However, when I checked one of those references, the URL no longer existed.  It had been taken down or overwritten.  The secondary source I was using said at the top of its list of footnotes that “All web citations were active at the time of the writing of this article” or something to that effect, but in the two years since the author had written that article, at least one footnote no longer worked.  Probably more than one no longer worked, if Lepore’s data is right.

 

 

Lepore cites a 2014 Harvard Law School study which found that “more than 70% of the URLs within the Harvard Law Review and other journals, and 50% of the URLs within United States Supreme Court opinions, do not link to the originally cited information”. Another study, of 3.5 million scholarly articles published in science, technology and medicine journals between 1997 and 2012, suggests that one in five links “suffers from reference rot”.

Libraries have always been subject to attrition, loss, rot and even physical destruction, but somehow they’ve survived and knowledge has been transmitted over time.   To make that transmission easier, scholars invented the footnote.  But now the footnote itself is in danger of being made unstable as more and more knowledge moves to the shifting sands of the internet.

Here’s an example of how the internet vanishes and why its loss matters. After the downing of the Malaysian airliner over Ukraine last summer, we knew that Russian-backed separatists were likely behind it because they boasted of it over social media. Those media traces were soon scrubbed, but they were captured by self-proclaimed internet archivists. As Lepore puts it, “One day last summer, a missile was launched into the sky and a plane crashed in a field.  ‘We just downed a plane,’ a soldier told the world.  People fell to earth, their last passage.  Somewhere, someone hit ‘Save Page Now’.”

But, because most pages aren’t saved, knowledge is vanishing, paradoxically, as the capacity of the internet expands.  

These thoughts will remain here until I either delete this blog or some VP at Google decides that Blogger is no longer part of its business strategy, and Blogger, like Geocities before it, is bulldozed.  Just a small example of what Lepore is talking about.

7 comments:

Anibal Invictus said...

Mike, this is a total surprise to me!
I always thought that whatever went into the web, stayed forever
Good to know

Jason said...

A very interesting article Mike, and very thought provoking.
I read a blog post a few days ago http://commissarmoody.blogspot.com/ about tintype photos of the Afghanistan War. A comment on the original article said in effect that tintypes (as per the Civil War photos) could be the great survivor as digital photos are progressively lost through various means.
I have lots of photos on digital format waiting for the day I turn them into photobooks - I think I should do it sooner rather than later!
Best wishes,
Jason

Jason said...

Another link to the article I mentioned above if it's helpful:

http://cameras.reviewed.com/features/us-soldier-ed-drew-shoots-modern-portraits-with-civil-war-tech?utm_source=taboola&utm_medium=USAT%20Recirc

Conrad Kinch said...

Truer than you know. The Wayback Machine has some things, but it can be very frustrating for research purposes.

tradgardmastare said...

A thought provoking post.

Thomas Nissvik said...

The internet never forgets, but it occasionally gets rather confused. ;-)
In short, what you need will be there, you just won't be able to find it. And what you don't want to remember will be there, and someone else will find it for you! Several Swedish politicians found that out the hard way during our latest election...

Olive Tree said...

Mike Peterson eats Ketchup

Mad Padre

Opinions expressed within are in no way the responsibility of anyone's employers or facilitating agencies and should by rights be taken as nothing more than one person's notional musings, attempted witticisms, and prayerful posturings.
