Last updated: Jul 18, 2005Ark is a Firefox extension designed to automatically archive certain pages visited with the browser, according to a whitelist of filters.
Archive everything you read online in the New York Times, or everything you read online, period. Keep snapshots of the same pages, like a changelog. Just grab interesting, individual pages.
Whitelist filter patterns can full regular expressions or simple strings with "*" wildcards.
There's more information in the included help files.
This is completely, 100% unsupported. I wrote it because I wanted to use it, and I'm using it right now, but there's a fair chance it's not what you're looking for. You might be better off with Slogger or ScrapBook, which both perform similar functions.
Changelog
v0.12
- Fixed stupid bug where it had the "don't save" and "overwrite" options mixed up. Oops.
- Check for and remove the # and anything past it from the URL.
- Adds .txt extension to plain text files, rather than saving them with no extension at all.
v0.11
- Fixed stupid bug where it wasn't possible to save pages as anything other than "Web page, HTML only".
- Added "Ark: Save Image" context menu item.
- Added (sort of) file extension guessing, so it's possible to save images and whatnot.
- Added choices for when the file is already in the archive -- prompt, overwrite, cancel.
- Added "always prompt" options in case the new file is smaller or larger than the old one, and the necessary code to check filesizes.
- General code organization.
v0.1
Development Wishlist
Planned/possible features:
- Per-pattern template settings (important!).
- Per-pattern options (overwrite, save-as type).
- Autosave page elements -- e.g., archive webcomics by saving every file like "20040412.jpg" in pages. Pipe dream.
- RDF datastore of saved pages and associated details.
- Log of saved/visited pages. (Generated from RDF store?)
Download