Preserving old blogs/sites and the importance of GitHub

Generating static copies of sites and serving them up from GitHub may preserve them for many more years

Many of us remember the cultural ignorance of Yahoo deleting GeoCities and Twitter deleting Posterous off the web. Those responsible are no better than library arsonists.

Whether you think all of that published creativity was rubbish or amazing, there is something very wrong with it all just disappearing. The Internet Archive does an amazing job but it’s simply impossible for it to keep up.

So I’ve taken it as a bit of a challenge to try and keep as much old content as possible online if I’ve been involved with it.

clocks

Photo by Jon Tyson on Unsplash

I’ve realised that GitHub is a critical tool for doing this in an inexpensive and portable way. Over the past few years I’ve taken several old blogs/sites, saved them as static HTML via various methods (WordPress export, wget, Harp, Hugo, etc), loaded them up into a GitHub repo and made them accessible on the web via GitHub Pages. Even better, that now includes a free https cert.

The only expense in keeping these sites online, and accessible at their original address, is the domain fee. I’ve switched over to Porkbun for that. A great inexpensive service. Of course you can just use a GitHub address if you don’t want to spend anything. But I do think the domain is an important aspect of keeping continuity.

It’ll be interesting to see how many more years I’ll be able to keep these sites online. Here’s hoping GitHub and Microsoft continue to make this type of setup free. But the fantastic thing about static HTML is that you can serve it up from anywhere, even from a Raspberry Pi in your living room. So no matter what they do, I’ll always be able to easily move the sites to new homes. Until I’m dead. And then? Well that’s a whole different conversation about P2P.

Just to give you a sense of the timescales I’m talking about, here’s some of those static HTML sites:

  • hushvine.com - First went live on WordPress in October 2011. Last updated December 2011. Dublin Web Summit 7 2011!!
  • conoroneill.com - First went live via Microsoft Frontpage in June 2000. Then Blogger. Then WordPress. Then Hexo. Now powered by Hugo. Still being updated, albeit not as regularly as I’d like.
  • This site - First went live on Posterous in June 2009. Still being updated, albeit not as regularly as I’d like. Also powered by Hugo.
  • web2ireland.org - First went live on WordPress in September 2005. Last updated September 2012.
  • manicmammy.com - First went live on WordPress in June 2007. Last updated December 2011.

I’d strongly encourage you to do the same type of archival process. Particularly if you have old WordPress sites that you haven’t updated in years and are probably riddled with security holes. The other fantastic thing about static HTML is that there is nothing to hack.

DWS

Conor O'Neill

Tech guy who likes running slowly

Bandon, Cork, Ireland https://conoroneill.net