Hacker News new | past | comments | ask | show | jobs | submit login

Wouldn’t exporting pages to images and using pixel diff accurately identify differences in PDF’s?



I guess it depends on the use case. Imagine adding an extra sentence in the second PDF, and this causes the paragraph to have 6 instead of 5 lines, and the next paragraph begins a line further down, and the last paragraph of that page ends up in the next page, etc...


Thanks. That helped understand it better.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: