Hacker News new | past | comments | ask | show | jobs | submit login

How has your experience with archivebox after running it for a while? After trying to set it up multiple times I gave it another try a few days ago and it always feels like it's doing too much and is therefore very sluggish and buggy.

I was looking for alternatives but couldn't really find something great with a decent UI and full-text search.




It isn't horrible. I have it running in a docker-compose stack and after initial setup I haven't really thought about it other than checking the Raindrop script I have is still populating it.

I don't really use it interactively, it's more to have a "backup" of websites I find useful after finding some I used to reference for years disappeared and were never added to archive.org or occasionally sending the Readability/PDF versions to my Kindle.

I also setup YaCY[1] at one point with the idea of having my own local personal search engine for the archived sites, but I ended up never using it.

1. https://github.com/yacy


Similar story, getting ArchiveBox setup and running was a breeze, but everything after that was kinda rough. For one, ArchiveBox doesn't have a proper API, so I had to rig one up with Puppeteer. And then there's YaCY. On paper, it seemed like the dream tool for indexing and making a searchable bookmark collection. But in reality, it was a whole lot of work followed by a whole lot of disappointment.


I’ve tried Archivebox (using docker compose) several times, and every single time it just stops.

I import around 3k bookmarks, it starts archiving them. Immediately some archival methods fail (usually screenshot and pdf), and after archiving a few hundred bookmarks it never continues to archive the rest. I‘ve let it sit and do its thing for several days, it never manages to get through all of them (or even a sizable minority).

Different machines, different filesystems, different networks. No idea what‘s wrong.


I’m glad to read that as it confirms my experiences too. Seeing that it is also not that actively maintained I even started writing a similar thing myself as I really only need a small subset of the functionality.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: