Hacker News new | past | comments | ask | show | jobs | submit | shtack's comments login

Cool, I built a prototype of something very similar (face+voice cloning, no video analysis) using openly available models/APIs: https://bslsk0.appspot.com/

The video latency is definitely the biggest hurdle. With dedicated a100s I can get it down <2s, but it's pricy.


This looks awesome. Didn’t seem to hear me, but the video looks great. Can you share what models you are using? You say these are all open models.


The model doing the heavy lifting is https://github.com/Rudrabha/Wav2Lip

Mic permissions on mobile are tricky, which might have been your issue? Note in this prototype you also need to hold the blue button down to speak.


Interesting. I didn’t think you could get anything close to realtime with Wav2Lip.


With a dedicated GPU and some cleverness it can be relatively quick. I split the response on punctuation and generate smaller clips in a pipeline. I haven't taken the model apart to try streaming the frames coming out of ffmpeg yet, but that would probably help a lot.


This was my undergraduate "thesis" project. We built a GDB plugin that would generate a puredata audio structure to help with debugging. Eg. it would play snoring during a sleep() call, each breakpoint could be set to a different frequency, etc. It was actually pretty interesting for multi-threaded code, though pausing to play the audio could change the runtime profile.


I have a lot of examples but a funny one that comes to mind is: in the early 2000s when IM clients were all the rage, I wrote a VB6 application to go through my MSN Messenger logs and rank my friends by how much I talk to them. Kind of like a MySpace top 10 prior to MySpace.

I spent a decent amount of time tweaking the UI, improving performance, adding filters, providing different file output formats, etc. Never shared it with anyone.


Also, at least the last time I checked, the Google Maps API was exorbitantly expensive. Niantic would likely get a discount, but avoiding this exploit doesn’t override all of the other benefits of OSM.


Similarly the Canadian province of Ontario leased it's only toll highway for 99 years to an international group, for an amount it currently earns in tolls every ~2 years. Fun.


You're missing the bigger picture though. By doing so they managed to look like they had balanced the budget in an election year. That's what really matters here.


Earns, or receives in revenue?

‘Earns’ implies net profit after all expenses (including operations and maintenance), which I find hard to believe in this specific case.


Fair. At the current rate, the revenue takes two years to cover the original cost. The net profit takes 6. The margins are quite good due to consistent price increases.


In case the devs are reading, the documentation link from the Github description is broken (https://bulma.io/documentation/overview/start/), but https://bulma.io/documentation works


I tried so hard earlier this year to move back to FF from Chrome. I actually found the desktop experience great but the mobile app on iOS was so clunky, and I rely enough on sync, that I had to move back after a few months.


I think the misleading aspect in this case is that when looking at the 800k view, it seems that CO2 levels have doubled (or more) compared to previous peaks, whereas it's actually a ~30% increase. It's still a very significant increase, and should be shown accurately.


The inverse of a 33% reduction is not a 33% increase. It is a 50% increase. In other words, humanity has added half of the preindustrial CO2 level on top of the preindustrial CO2 level.

What you probably meant is that 33% of the CO2 in the atmosphere is "man-made".


I run/ran a website where you could shop on AliExpress using crypto and advertised it as the only legitimate crypto website around. I had universally positive reviews but made low 5 figures in profit over the course of ~4y (not terrible as passive income considering the operating costs were nearly zero) before the latest crash dropped usage to nothing and I let bit rot take over.

Unfortunately the vast majority of crypto adopters are just riding the bubble for investment and have no interest in any other use cases, which are admittedly limited.


Nice, thanks!


there’s a relevant xkcd, as usual: https://xkcd.com/2682/


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: