I am seeking a tool to space-efficiently archive a blog that is changing every day or even two or three times a day. I don't mean that individual blog posts change - not regularly anyway - I just mean that new blog entries are added and older entries are shifted down the front page. One problem I see is that it will be inefficient to archive the same blog entry multiple times. Revisions to the same entry should be archived, ideally, but the original need not be since the revision is likely due to an improvement or correction.
It is a blogspot.com blog with text and static images. A linux solution is preferred.