For keeping local copies, https://addons.mozilla.org/en-US/firefox/addon/mozilla-archi... is a good add-on that improves Firefox's save functionality. I use it very frequently: ctrl-s, Enter, and I get an .mht of the page I'm reading.

For remote copies, https://archive.org/ can be instructed to keep a copy of a page, e.g.

https://web.archive.org/web/*/http://news.ycombinator.com/ to check for copies

https://web.archive.org/save/https://news.ycombinator.com/ to record a copy
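The two URL patterns above can also be built from a script. This is a minimal sketch rather than anything the Internet Archive provides: the helper name `waybackUrls` is made up for illustration, and it simply appends the page URL to the `/web/*/` and `/save/` prefixes shown above.

```javascript
// Hypothetical helper (name invented for this example): builds the
// "check for copies" and "record a copy" URLs for a given page URL.
function waybackUrls(pageUrl) {
  const base = 'https://web.archive.org';
  return {
    check: base + '/web/*/' + pageUrl, // lists existing captures
    save: base + '/save/' + pageUrl,   // asks for a new capture
  };
}

const urls = waybackUrls('https://news.ycombinator.com/');
console.log(urls.check);
console.log(urls.save);
```

The page URL is appended as-is, without percent-encoding, which matches the hand-written examples above.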

These bookmarklets can be used to go from your current page to /web/ or /save/:

  javascript:(function(){window.open('https://web.archive.org/web/*/'+(''+window.location));})();

  javascript:(function(){window.open('https://web.archive.org/save/'+(''+window.location));})();

Unfortunately, this won't work on sites that block the Internet Archive via robots.txt, and existing archives may be made inaccessible by a future robots.txt block.
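Since a copy may not exist or may stop being served, a script can ask whether a snapshot is currently reported before relying on it. The sketch below assumes the Wayback Machine availability endpoint at `https://archive.org/wayback/available` and its `archived_snapshots.closest` response field; the helper names and the sample response object are made up for illustration, and no network request is made here.

```javascript
// Builds the query URL for the availability endpoint (assumed, see above).
function availabilityQuery(pageUrl) {
  return 'https://archive.org/wayback/available?url=' + encodeURIComponent(pageUrl);
}

// Given an already-parsed JSON response, returns the closest snapshot URL,
// or null when no snapshot is reported as available.
function closestSnapshot(response) {
  const snapshots = response && response.archived_snapshots;
  const closest = snapshots && snapshots.closest;
  return closest && closest.available ? closest.url : null;
}

// Illustrative response objects, written by hand for this example.
const sampleHit = {
  archived_snapshots: {
    closest: { available: true, url: 'http://web.archive.org/web/2013/http://example.com/' },
  },
};
const sampleMiss = { archived_snapshots: {} };

console.log(availabilityQuery('https://news.ycombinator.com/'));
console.log(closestSnapshot(sampleHit));
console.log(closestSnapshot(sampleMiss));
```

In practice the response would come from fetching the query URL and parsing the JSON body; that step is left out so the sketch stays self-contained.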

http://archive.is/ is another useful site; it executes the page's JavaScript and captures the resulting DOM.




Thank you for the information. Like other people, I'm looking for tools to save information I find useful. I have two problems with this type of archival tool besides the robots.txt issue:

1. A static snapshot only gives us an idea of what the page looked like, because it is neither functional nor updating. That is better than nothing, but we may have no clue what we want to find other than a few keywords. If you need to search the full content of your archive, everybody needs to install a mini search engine.

2. If it becomes as popular as Google, so many people will use it that it will create a lot of duplicated data on everybody's PC, which is helpful but not an efficient solution.

Would you like to have a self-organized, personal, private web on top of the public web with a few clicks? http://bit.ly/1c2DzN6



