Whoa, I think I just coined a phrase in my uber clever title. Antirchive. Anyway, despite this looking like a post about my cool new word, this is instead about the incredible amount of thought I’ve recently been giving to what appears to be the total lack of a schema, a rubric if you will, nay a modus operandi for archiving web content. Really we’re lacking a convention for how we should be documenting that we updated our site Friday, September 5, 1997.
Granted, some folks such as the Internet Archive Wayback Machine have recognized this hole and taken snapshots of the vast Interspace at certain points and copied that in varying levels of completeness. But, really, I’m sure a huge portion of the folks putting information out there are not keeping copies. Now, the recent trend of blogs, and the idea of posting material and tying it specifically to the time at which it was posted will create a bit of an archive. But from 1997 until about a year ago I would venture to say most folks, with very few exceptions, were opening a page, editing it, and putting it right back up.
Since most of my sites have been portfolioesque by nature, I have a pretty good archive of most of my work. However, now that I work for a college I find that this topic becomes quite interesting. Day to day I am guilty of replacing content, making edits, moving things and otherwise not keeping copies. Most of this stuff is out of sheer necessity, little date tweaks and such would simply create a huge archive of very small changes.
Where this gets really interesting in a college setting, however, is in sites and pages related to academic material. We are in the business of admitting students, and then imparting knowledge upon them (beyond that, God help them). To me this means that anything that comes across my eDesk (yeah, yeah, leave me alone) relating to this business is of utmost importance.
Right now I’m struggling, however, with not only how to archive all this content, but whether it really needs to be at all. I’ve been talking to lots of folks and have honestly been somewhat shocked at how many don’t think it is important. It may not be, but I certainly don’t want to be the guy that makes the decision to delete all this old stuff. It seems to me that 5 years down the road, someone will be interested in what was going on in Human Rights and International Law this semester.
This could be a really long post about how we might go about actually doing this, but I don’t think that is the interesting point. I think what is interesting is that the Internets have changed, grown and evolved in amazing leaps and bounds but it seems to me that no one is concerned with what that all means for the actual content. If we are all caught up with keeping everything up-to-the-minute, what happens to the stuff from a few minutes ago?
Could blogs be our savior? Perhaps after some evolving of their own (seriously, I don’t want to read about your modular furniture Ted), the format of tying the content to the significance of when it was posted will create that convention, and the archive itself.