
> I try a bunch of different OCR programs, but can't find any that can transcribe the document with 100% accuracy. They often confuse certain letters or numbers (like 0 and C, 9 and 4, 0 and D). Sometimes they omit characters, sometimes they introduce new ones. I try different font sizes and different fonts, but it doesn't matter.

I feel like this could be trivially solved by plugging an LLM into the OCR output, with the sole task of correcting spelling errors like that. That's pretty much one of the tasks LLMs should excel at.




It's hexadecimal. There is no spelling, so there's no way for an LLM to know if something is supposed to be a `D` or a `0` any more than traditional OCR software can.


yes i noticed that way too late, my bad


Denoising algorithms are always lossy. An LLM (or, y'know, Markov chain) could do this job by exploiting statistical regularities in the English language, but a hex dump isn't quite the English language, so it'd be completely useless. Even if this text were English, though, the LLM would make opinionated edits (e.g. twiddling the punctuation): you'd be unlikely to get a faithful reproduction out the other end.
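A toy sketch of that "exploit statistical regularities" idea, to make the point concrete: score candidate readings by letter-bigram frequency and keep the likeliest. The tiny frequency table is hand-picked for illustration; a real denoiser would estimate it from a corpus.

    # Toy statistical denoiser: pick the candidate reading whose letter
    # bigrams are most plausible. Frequencies below are made up for the demo.
    BIGRAM_FREQ = {("W", "O"): 2e-3, ("O", "R"): 3e-3, ("R", "D"): 2e-3}

    def score(word: str) -> float:
        p = 1.0
        for a, b in zip(word, word[1:]):
            # Unseen bigrams (like "W0") get a tiny smoothing probability.
            p *= BIGRAM_FREQ.get((a, b), 1e-9)
        return p

    # English bigram statistics are skewed, so "WORD" beats "W0RD".
    print(max(["WORD", "W0RD"], key=score))  # -> WORD

In a uniform hex dump every bigram is about equally likely, so this signal vanishes and the denoiser has nothing to work with, which is exactly the parent's point.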


Of course: use search-and-replace to change 0 to "zero", etc., before printing. The OCR will (should) work better on whole words than on lone glyphs.
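A minimal sketch of that substitution, assuming you control both the printing and the transcription side (the word list is my choice, not from the thread):

    # Spell each nibble as an unambiguous word before printing, invert after OCR.
    WORDS = ["zero", "one", "two", "three", "four", "five", "six", "seven",
             "eight", "nine", "alpha", "bravo", "charlie", "delta", "echo",
             "foxtrot"]
    ENC = dict(zip("0123456789abcdef", WORDS))
    DEC = {w: c for c, w in ENC.items()}

    def encode(dump: str) -> str:
        # Non-hex characters (spaces, newlines) are simply dropped here.
        return " ".join(ENC[ch] for ch in dump.lower() if ch in ENC)

    def decode(text: str) -> str:
        # Invert after transcription; a garbled word fails loudly (KeyError).
        return "".join(DEC[w] for w in text.lower().split())

    assert decode(encode("DEADBEEF")) == "deadbeef"

The cost is obvious: the printed dump gets roughly five times longer, which is the overhead the reply below is getting at.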


You might as well just use an error-correction code: same result, less overhead.
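Not a textbook ECC, but here is a cheap sketch in the same spirit: append a CRC32 per line, and on mismatch try each single-character substitution from the article's known confusion pairs until the checksum agrees. The confusion table is taken from the quoted article; everything else is my own illustration.

    import zlib

    # Confusion pairs from the quoted article: 0/C, 9/4, 0/D.
    CONFUSABLE = {"0": "CD", "C": "0", "D": "0", "9": "4", "4": "9"}

    def add_crc(line: str) -> str:
        # Append a CRC32 so a mis-read line is at least detectable.
        return f"{line} {zlib.crc32(line.encode()):08x}"

    def repair(line_with_crc: str) -> str:
        line, crc = line_with_crc.rsplit(" ", 1)
        want = int(crc, 16)
        if zlib.crc32(line.encode()) == want:
            return line
        # Try each single-character OCR confusion until the CRC agrees.
        for i, ch in enumerate(line):
            for alt in CONFUSABLE.get(ch, ""):
                cand = line[:i] + alt + line[i + 1:]
                if zlib.crc32(cand.encode()) == want:
                    return cand
        raise ValueError("uncorrectable line")

    assert repair(add_crc("DEADBEEF").replace("D", "0", 1)) == "DEADBEEF"

Caveat: the checksum itself also goes through OCR, so in real use it would need the same protection; a proper error-correction code handles that uniformly.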


> hex dump

ah, missed that, was just skimming through


Still would not solve the problem of copying data without changing it.



