Show HN: Handwriter.ttf – Handwriting Synthesis with Harfbuzz WASM

PhilipRoman · 2024-08-21T15:29:29.000000Z

I bet this is what people felt when JavaScript was first demoed on the web.

In the year 2077, when each font will run it's own virtual machine on WASM containing a "minimal" Ubuntu image, some enlightened blogger will suggest server side rendered fonts as a performance improvement.

petercooper · 2024-08-21T17:53:16.000000Z

JS felt less impressive to me when I first saw it, simply because it couldn't do much. However, I remember the showcase Microsoft put together to show off what CSS could do in IE3 and that was very cool at the time.

pandeiro · 2024-08-21T19:03:19.000000Z

I remember that, too, and switched to IE for the next X years

pjmlp · 2024-08-21T16:05:26.000000Z

Nice demo.

Without trying to steal the thread, what I would care is actually the opposite direction.

Neither in Swift Playgrounds, nor in any other programming development environment apps for both mobile OSes, have I found a good development experience using pen instead of keyboard.

Given how many of us "program" in paper notebooks, it is quite incredible that besides a couple of research projects done by PhD students, no one cares to actually make it more widespread in a usable way.

Unearned5161 · 2024-08-21T20:13:51.000000Z

I could see something like Apples new calculator drawing app taking this by storm. Writing out your code and getting syntax support and all the things in your ide but with your handwriting?

In the same way that the calculator writes in the answers in your handwriting, it could write in snippets of code in your handwriting.

If it's going to be made though, it has to be e-ink. Go big or go home!

pjmlp · 2024-08-22T06:39:23.000000Z

Yeah, something like that.

fenollp · 2024-08-22T09:45:29.000000Z

Yes!

I'm on this route myself, trying various things out at https://github.com/fenollp/reMarkable-tools

Handwriting (in and out) support is very important IMO. Also being able to draw DAGs.

I'd like an e-ink device with high frame rate and HW powerful enough to run some models locally or with good enough connectivity and sensors that e.g. Computer Vision tasks can be offloaded to the users' smartphone.

Feel free to expose your ideas on there :) I welcome Open Source discussion!

tombh · 2024-08-21T15:31:30.000000Z

I've read the README and watched the video, but I'm still not sure what this is doing? I know it can "synthesize [a] font at runtime", so does that mean it's creating a random handwritten font as you type? But it's not based on the user's handwriting?

block_dagger · 2024-08-22T08:01:04.000000Z

Correct

gwern · 2024-08-21T18:00:34.000000Z

This looks like it'd be quite useful for faking documents more convincingly. Existing handwriting fonts always have tell-tale regularities and there's so few that forensics analysis exposes them easily.

jcelerier · 2024-08-21T14:07:04.000000Z

I wonder what makes SIMD an improvement here - in the end it all boils down to TTF bytecode and I don't think this comes with SIMD instructions, right?

hsfzxjy · 2024-08-21T14:13:56.000000Z

wasm has SIMD extension https://github.com/WebAssembly/simd/blob/master/proposals/si... . You can use them even in browsers (e.g. Chrome >= 91)

jsheard · 2024-08-21T14:21:28.000000Z

OP is asking how SIMD is beneficial to this specific application. If there's a small neural network involved then evaluating that is probably a good place to use SIMD.

hsfzxjy · 2024-08-21T14:38:02.000000Z

Yes. The program actually includes an ONNX runtime, which uses SIMD to accelerate NN inference.

jcelerier · 2024-08-21T14:43:02.000000Z

but... the program ends up entirely compiled and executed as TTF bytecode in the end right, since it's entirely contained in the TTF font ? And TTF bytecode is only the following instructions : https://developer.apple.com/fonts/TrueType-Reference-Manual/... and I don't see anything related to SIMD in there

erk__ · 2024-08-21T14:45:25.000000Z

No that is not what is happening, HarfBuzz have a experimental Wasm shaper, so the font embeds some wasm code that tells Harfbuzz what to output.

https://github.com/harfbuzz/harfbuzz/blob/main/docs/wasm-sha...

jcelerier · 2024-08-21T16:55:10.000000Z

oh ok, so it can't work in any system that doesn't use harfbuzz.. that's much less interesting than what I originally thought

jahewson · 2024-08-22T00:08:20.000000Z

It’s not TTF bytecode, but WASM bytecode. There’s an experimental version of HarfBuzz that can run this.

fulafel · 2024-08-22T06:29:50.000000Z

I wondered why there's just a video and no demo. But does it require the new wasm features in the browser ttf support?

hsfzxjy · 2024-08-22T06:46:59.000000Z

You can run the demo locally following the Usage part. Currently the WASM shaper feature is not enabled in any browser, so the demo won't work in web pages.

fulafel · 2024-08-23T03:51:07.000000Z

Ah, the wasm association calibrated my brain to seeing a browser window in the demo whereas it's really gedit.

BigParm · 2024-08-21T23:35:22.000000Z

Wtf is happening here what are the inputs and outputs? Hard to tell what this program does for me

hsfzxjy · 2024-08-22T02:39:41.000000Z

The handwriting in fact consists of multiple "tiny black box" glyphs, similar to pixels in traditional rendering. The program takes the text (e.g., "Hello world") as input, and works out the (x, y) locations of these block boxes.

amelius · 2024-08-22T10:56:16.000000Z

I prefer fonts that can be zoomed.

kragen · 2024-08-21T13:48:02.000000Z

this is amazing! i'm guessing you can probably get it to antialias without much more work

as the demo video shows, it's probably not something you want to have in between you and the ability to scroll a web page or close a tab. but i guess using harfbuzz now means we're buying into a turing-complete virtual machine running an arbitrary program in order to display a glyph. how seriously crippled are the harfbuzzless rendering paths? i'm assuming opting out of harfbuzz means opting out of arabic, devanagari and other indic scripts, etc.? is there a less out-of-control alternative that doesn't leave two billion people out in the cold?

hsfzxjy · 2024-08-21T14:26:31.000000Z

This is just a project for fun, like llama.ttf.

WASM shaper is still experimental and not yet shipped in and products, not before more limitation carried out as I think. So I'm not too worried about it.

kragen · 2024-08-21T15:18:46.000000Z

it's super cool!

vintermann · 2024-08-21T21:20:04.000000Z

So you trained a THAT small RNN to make this good handwriting? I'm actually even more impressed at that than at the crazy pipeline turning it into a font in realtime.

hsfzxjy · 2024-08-22T02:35:13.000000Z

I didn't train the model myself. A pretrained model is adopted from another repo [0].

[0]: https://github.com/X-rayLaser/pytorch-handwriting-synthesis-...

calebj0seph · 2024-08-21T15:10:45.000000Z

Damn, this is impressive!

We built a WebGL text renderer with full CJK support using Harfbuzz for our production whiteboard web app. I thought that was complicated until now.

MarceColl · 2024-08-22T04:58:42.000000Z

Is that open sourced? I want to render shakuhachi music notation in the browser trying to replicate the caligraphic style and it's not trivial.

adzm · 2024-08-22T12:08:51.000000Z

I've been curious about seeing if harfbuzz etc could help create a styled Elianscript font but not really sure where to start.

eigenvalue · 2024-08-21T16:31:59.000000Z

Fun hack. I bet Alex Graves never in a million years anticipated that his PhD thesis work would be encapsulated in a novelty font.

a1o · 2024-08-21T14:15:51.000000Z

I don't get it, why there's no link to a GitHub pages website to test the thing?

hsfzxjy · 2024-08-21T14:34:21.000000Z

You can't test it in a browser, since no browser at present is linked against libharfbuzz with WASM shaper. Instead, one can test it with a modified local program such as gedit in the demo.

As convenience, I built a Docker image that packs both the ttf file and the modified gedit together. You can try it out via `make run` as stated in the instructions.

okcdz · 2024-08-21T15:22:03.000000Z

If you just want to run locally. Why not using the native libharfbuzz directly? What's the purpose of WASM here?

0x457 · 2024-08-21T17:42:56.000000Z

Font includes WASM code that harfbuzz executes for shaping: https://github.com/harfbuzz/harfbuzz/blob/main/docs/wasm-sha...

It's an experimental feature, so it's not available unless explicitly enabled during compilation.

bawolff · 2024-08-21T15:36:52.000000Z

WASM is to libharfbuzz (when experimental compile time option is enabled) what javascript is to HTML.

So this is essentially native (albeit experimental) libharfbuzz. WASM is used because its how font files are scripted (when using this experimental version of lubharfbuzz)

Its important to keep in mind that wasm is a general technology and is not just used by web browsers.

a1o · 2024-08-21T16:07:19.000000Z

Harfbuzz can be built easily to Wasm using Emscripten.

Lockal · 2024-08-22T02:21:04.000000Z

Interesting artifact of time it would be. Harfbuzz uses https://github.com/bytecodealliance/wasm-micro-runtime to execute wasm, so when compiled it would be wasm runtime running under another wasm runtime.

einpoklum · 2024-08-21T14:32:05.000000Z

OP's repository has a Makefile which assumes a docker daemon is available.

hsfzxjy · 2024-08-21T14:40:40.000000Z

The project must be tested in an application linked against libharfbuzz with WASM shaper enabled. Since it's not easy to build a library like this, I make a Docker image which contains both the ttf file and a modified gedit, so that anyone can test the project with a single command.

andrewmcwatters · 2024-08-22T03:20:27.000000Z

It visually fails to preserve “handedness.”

piyushtechsavy · 2024-08-22T08:36:08.000000Z

Nice demo, for sure a fun app.

a2128 · 2024-08-22T01:42:49.000000Z

Link to llama.ttf, another fun font that abuses HarfBuzz for anyone interested (it runs a language model inside a font): https://fuglede.github.io/llama.ttf/

The llama.ttf video does a pretty good job explaining what the heck is going on

cyberax · 2024-08-21T14:47:04.000000Z

Year 2045: fonts become self-aware and go on a strikethrough.

noman-land · 2024-08-21T15:00:44.000000Z

I'm mad at you for forcing me to laugh at this.

koolala · 2024-08-21T16:05:22.000000Z

:) I sware that smilely just winked at me

shove · 2024-08-21T15:52:30.000000Z

Best comment

dancemethis · 2024-08-21T14:48:02.000000Z

This is so cursed, there is so much that begs the question - "why?"...

I love it.

I can't wait for the next beautiful nightmare. Maybe someone should mix font rendering with PDF rendering. Of course, with a LLM doing something in the middle.

hsfzxjy · 2024-08-21T14:50:43.000000Z

The answer is "no why", just for fun :) Though I learned a lot about how to stuff an NN model in a WASM binary and some tricks to optimize performance.

dancemethis · 2024-08-22T06:15:54.000000Z

I usually follow with "why not?", but forgot this time. Thank you for this. Again, I love it.

thenegation · 2024-08-22T05:39:40.000000Z

Best reason!