My thoughts on the best strategies to preserve human knowledge and creation in perpetuity
1) don't put all your eggs in one basket
archives need to be paper and digital, public and private sector, centralised and decentralised, fully legal / by the book and rogue, in vaults and cabinets and servers and torrents
create as many redundancies as you can: make copies, and copies of the copies, and copies of the copies of the copies, ad nauseam; anyone anywhere who can make copies, should
spread the physical hubs (paper stacks or servers) geographically, in as many places as possible; you never know what kind of natural disaster or man-made horror will take out a whole building, city, region, or continent tomorrow
2) entropy is a bitch, think longterm
pick methods that are more likely to last
schedule regular copying: you gotta transfer the stuff to a new medium before the old one falls apart, so have some idea when it's expected to fall apart
3) keep converting to new formats
no format becomes obsolete instantly, there's always a transition period; use transition periods to furiously convert everything
4) indexing and searching is as important as the content itself
5) eyes on the prize: the end goal is public access
if a random nobody, with no status and no money, can't access it easily, freely, and anonymously, the job is only half-done; you've built the back-end and neglected the front-end; get someone to complete it ASAP, because now it's just sitting pretty and isn't doing anything; or isn't doing enough, in any case
bonus: use. fucking. torrents.
It is truly bonkers that the bittorrent protocol is not being used for archiving. It's an ideal method for digital archiving and it should be standard procedure. If a university has stuff on a hard disk, it can put it on its server, and if it can put it on a server, it can torrent it and seed it 24/7. If the same archive is useful for another university on the other side of the planet, that one can download it and then stay in the swarm, also seeding it. If a library or city council anywhere on earth finds the archive of interest, it can do the same. The more the merrier, every download is a potential redundancy and every seeder is an actual redundancy.
If you got space to store it, you got space to share it. And of course, any private individual can at any time join the swarm. So we get excellent preservation (with multiple redundancies, spread far and wide geographically) AND public access, global and free, which is what preservation is FOR in the first place! It ain't for the heck of it, it ain't only for the eyes of the elite, it's for everyone, that's the purpose, that's the end goal. If that's not your end goal, you're doing it wrong.