Delorean
Roads?!? Where we're going, we don't need roads.
Project Delorean: a time machine to preserve the College's digital heritage.
Brief
The College archive kept physical artifacts for heritage purposes, but little thought had been given to digital heritage.
The system was built to take snapshots of the website automatically at regular intervals. The snapshots are then made available through an interface for reference.
c. 200 gigabytes of snapshots
Infrastructure
Built on an AWS EC2 instance with large cold storage attached. The College domains are crawled using wget, and custom PHP scripts parse the data.
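As a rough illustration of the crawl step, the sketch below builds a wget invocation that mirrors one domain into a timestamped snapshot directory. It is written in Python for brevity rather than in the project's PHP, and the directory layout, the one-second wait, the `https://` scheme, and the names `snapshot_dir` and `build_wget_command` are all assumptions made for the example, not details of the actual system.

```python
from datetime import datetime, timezone
from pathlib import Path


def snapshot_dir(root: Path, domain: str, when: datetime) -> Path:
    """Return a per-domain, per-run directory, e.g. root/<domain>/20240102T030405Z.

    The layout is hypothetical; `when` should be a timezone-aware datetime.
    """
    stamp = when.astimezone(timezone.utc).strftime("%Y%m%dT%H%M%SZ")
    return root / domain / stamp


def build_wget_command(domain: str, target: Path) -> list[str]:
    """Build (but do not run) a wget command that mirrors `domain` into `target`."""
    return [
        "wget",
        "--mirror",            # recursive download with timestamping
        "--page-requisites",   # also fetch the CSS, images and scripts pages need
        "--convert-links",     # rewrite links so the copy browses offline
        "--adjust-extension",  # save HTML/CSS with matching file extensions
        "--no-parent",         # never climb above the start URL
        "--wait=1",            # assumed politeness delay between requests
        f"--directory-prefix={target.as_posix()}",
        f"https://{domain}/",
    ]
```

The returned list could be handed to `subprocess.run(cmd, check=True)` from a scheduled job, with one call per domain, so that each run lands in its own dated folder.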
A simple front end displays the archive in frames, letting users go back in time and look at any snapshot.
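One way a front end like this might resolve a requested date to a snapshot is to pick the newest snapshot taken at or before that date. The sketch below shows that lookup in Python; the function name `latest_snapshot_at` and the idea that snapshots are identified by their capture time are assumptions for illustration only.

```python
from bisect import bisect_right
from datetime import datetime
from typing import Optional


def latest_snapshot_at(
    snapshots: list[datetime], requested: datetime
) -> Optional[datetime]:
    """Return the newest snapshot taken at or before `requested`, or None."""
    ordered = sorted(snapshots)
    # bisect_right gives the count of snapshots with a time <= requested.
    index = bisect_right(ordered, requested)
    return ordered[index - 1] if index else None
```

Returning `None` when the requested date precedes the first snapshot leaves it to the interface to decide whether to show the earliest capture or a "nothing archived yet" message.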
An additional integration with updown.io lets users see availability information for the various sites around the world. Key stakeholders are also notified if a site becomes unavailable.