A couple questions:
- any thoughts on wake word engines, so there's something that can listen without consuming resources all the time? The landscape for open solutions doesn't seem good
- any plan to allow using external services for STT/TTS, for people who don't have a 4090 handy (at the cost of privacy and a dependence on SaaS providers)?
FWIW, wake words are a stopgap; if we want a Star Trek-level voice interface, where the computer responds only when you actually meant to call it, rather than whenever the wake word happens to come up as a normal word in conversation, the computer needs to be listening constantly.
A good analogy here is to think of the computer (assistant) as another person in the room, busy with their own stuff but paying attention to the conversations happening around them, in case someone suddenly requests their assistance.
This, of course, could be handled by a more lightweight LLM running locally, listening for moments when the computer/assistant is explicitly mentioned or addressed, as opposed to relying on a context-free wake word.
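Something along these lines, as a minimal sketch: it assumes a local speech-to-text loop is already producing a rolling transcript, and uses llama-cpp-python to run a small local model as the "were we addressed?" classifier. The model path, prompt wording, and function names are illustrative, not from any existing project.

```python
# Sketch: decide whether the assistant was actually addressed, using a small
# local model instead of a context-free wake word.
# Assumes llama-cpp-python and some small local GGUF instruct model.
from llama_cpp import Llama

llm = Llama(model_path="models/small-instruct.gguf", verbose=False)  # placeholder path

SYSTEM_PROMPT = (
    "You monitor a transcript of household conversation. "
    "Answer YES only if the speakers are directly addressing the home "
    "assistant and expect a response; answer NO otherwise."
)

def assistant_addressed(transcript_window: str) -> bool:
    """Classify the last few seconds of transcribed speech."""
    result = llm.create_chat_completion(
        messages=[
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": transcript_window},
        ],
        max_tokens=3,
        temperature=0.0,
    )
    answer = result["choices"][0]["message"]["content"].strip().upper()
    return answer.startswith("YES")

# Mentioning "computer" in passing should not trigger it; a direct request should.
print(assistant_addressed("I read a book about early computers yesterday."))
print(assistant_addressed("Computer, dim the living room lights."))
```

Whether a model small enough to run continuously on cheap hardware can make this call reliably is exactly the open question; the sketch only shows where such a classifier would sit in the pipeline.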
Home Assistant is much nearer to this than other solutions.
You have a wake word, but it can also speak to you based on automations: you come home and it could tell you that you're out of milk and that, with a holiday coming up, you should probably go shopping.
If the AI is local, it doesn't need to be on an internet-connected device. At that point, malware and bugs in that stack don't add extra privacy risk*; malware and bugs in all your other devices with microphones, etc. remain a risk, though, even if the LLM is absolutely perfect by whatever standard that means for you.
* unless you put the AI on a robot body, but that's then your own new and exciting problem.