
hey, the captcha system here is an f'n nightmare. anyway, I wrote the software! happy to answer questions, and I will try to do so below.

since I am hell banned I can only edit this comment to reply.

as for video, I discuss research here: https://www.youtube.com/watch?v=OqW8erWi1Wo

but if you mean screencast of the software I haven't had time.

---

@coleifer: there was a sqlite branch, but it doesn't scale well to the many-million record sets I have been working with. the design of the software allows drop-in db replacement, it just lacks the code. I can't decide whether to go back to sqlite or to just make a web front-end.
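
For readers wondering what "drop-in db replacement" could look like in practice, here is a minimal sketch. It is not the software's actual code: the names `PageStore`, `SQLiteStore`, `add_page`, and `count_pages`, and the table layout, are all hypothetical. The idea is only that the rest of the program talks to a small interface, so a sqlite backend (or any other) can be swapped in behind it.

```python
import sqlite3
from abc import ABC, abstractmethod


class PageStore(ABC):
    """Hypothetical storage interface; callers only ever use these methods."""

    @abstractmethod
    def add_page(self, url: str, title: str) -> None:
        ...

    @abstractmethod
    def count_pages(self) -> int:
        ...


class SQLiteStore(PageStore):
    """One possible backend, built on the standard-library sqlite3 module."""

    def __init__(self, path: str = ":memory:") -> None:
        self.conn = sqlite3.connect(path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS page (url TEXT PRIMARY KEY, title TEXT)"
        )

    def add_page(self, url: str, title: str) -> None:
        # The connection context manager commits on success, rolls back on error.
        with self.conn:
            self.conn.execute(
                "INSERT OR REPLACE INTO page (url, title) VALUES (?, ?)",
                (url, title),
            )

    def count_pages(self) -> int:
        return self.conn.execute("SELECT COUNT(*) FROM page").fetchone()[0]
```

A different backend would subclass `PageStore` the same way, and the calling code would not need to change.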

---

@captn3m0: I have an academic paper in revision that is an analysis of the alexa 1M list, and I also have other projects in development.

---

@snorrah: this was my first python project, so I went with the newest version. it's made a lot of things very difficult, especially porting to a web version.

---

@linuxlizard: a proxy is a problem when you want to do a lot of concurrent tests. I usually load about 64 pages in tandem to get good speeds on large sets.
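
As a rough illustration of loading about 64 pages at once, here is a small sketch using a thread pool from the standard library. The comment above does not say how the concurrency is actually implemented, so the thread-pool approach is an assumption, and `load_pages` and its `fetch` parameter are hypothetical names. `fetch` stands in for whatever function loads and analyzes a single page.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def load_pages(urls, fetch, max_workers=64):
    """Run fetch(url) for every url, with up to max_workers calls in flight.

    Returns a dict mapping each url to its result, or to the exception
    raised for that url, so one bad page does not stop the batch.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Map each future back to the url it was submitted for.
        futures = {pool.submit(fetch, url): url for url in urls}
        for future in as_completed(futures):
            url = futures[future]
            try:
                results[url] = future.result()
            except Exception as exc:
                results[url] = exc
    return results
```

Passing the page loader in as an argument keeps the sketch independent of any particular browser or HTTP library.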

---

@radmuzon: webdxray runs large batch jobs, so you can get lunch and come back with all of the pages analyzed. I know it does work on windows, and I apologize for not being able to provide directions... see comments above.

---

@TeMPOraL: thanks!

---

@pearjuice: yeah, I wish there was an easier way to get python3 to talk to mysql, that's the biggest PITA.



