Industries that talk proudly of the “content” they offer — raise your hand, journalism organizations — have a special need to preserve what they’ve created in a consistent and easy-to-find way. Content, in this context, includes the links that people have been using to find it.
You would imagine that the news industry would understand this. If so, you would be overestimating the industry’s collective common sense.
A current case in point is what the New York Times Co. (a company in which I, perhaps stupidly at this point, still hold a bit of stock) just did to my friend Thomas Crampton and a host of other journalists whose work has appeared in the International Herald Tribune (owned by the Times) in recent years. In an open letter to the NYT chief executive, “Reporter to NY Times Publisher: You Erased My Career“, he observes:
The IHT website earned an ever-increasing pagerank due to all of the blogs and sites linking to stories there. (Based on the number of Internet pages linking back to a site, pagerank starts at 1 and rises to 10. A page with a Google rank of 5 will show up higher than a page with a Google rank of 3 and the IHT.com grew to match nytimes.com at a Google rank of 9. You can check pagerank of any site here.)
So, what did the NY Times do to merge these sites?
They killed the IHT and erased the archives.
No doubt, the Times will eventually make his and other IHT journalists’ articles available again via search through the mother ship’s own archives. But not preserving the URLs is truly foolish move.
Thomas isn’t alone among media folks in this content carnage, as he reported in a follow-up posting. Changing the URL structure of websites is a too-common event. Even if, as is the case most of the time, the originals are still around, disappearing the links is tantamount to hiding the original material.
In my case, when I worked for Knight Ridder, what happened was considerably worse. My former employer deleted my entire archive of blog postings — not just once but twice.
The first time, Knight Ridder moved all of its Web properties to a centralized system. This was part of a move decreed by the bosses who’d been sold on the notion that homogenizing the company’s content — and, more importantly, centralizing the display advertising engine — would be a brilliant business move. I question that, by itself, but I can assure you it was a stupid journalistic move to wipe out years worth of what I’d been creating for Knight Ridder.
The second time came after I left the company. I was offered a CD of my blog archives before the site was turned off, but I don’t recall ever receiving it, and by the time I realized I didn’t have anything it was too late.
I was enraged, both times. And entirely powerless to do anything about it.
I’ve learned my lesson. Anything I write — for myself or for someone else — is backed up on my machines under my control. I’m creating a cloud backup as well. I realize that there are circumstances under which I could lose even those copies, but I can’t make my stuff 100 percent safe.
This applies in spades to other kinds of things we store online. As far as I know, practically every service you use reserves the right to delete your account. Some of them will give you an opportunity to download what you’ve posted, but you should not even count on that when push comes to shove, especially in when economic pressures are as high as they have become today.
The point is that I no longer rely entirely on the good graces of other people, including employers, to preserve what I’ve created, much less keep it available for you to see. I try to rely on myself.