makaimc's comments

I worked at Twilio from 2014-2023 and had an incredible experience during those 9 years, including all of my interactions with Jeff.

One of my favorite moments was running into Jeff in the hallway at our Beale Street office just before he had to do a quarterly earnings call after we went public. I said "good luck on the earnings call" and Jeff said thanks then asked what happened to a link to a specific Stack Overflow question/answer from one of our docs pages that got removed. He had just been building an application and felt like that answer was particularly helpful for context in a certain programming language, and was surprised it was removed.

Jeff's always been a software developer at heart even though he had a ton of non-dev responsibilities. There are definitely advantages and disadvantages to that mindset! But he created a tremendous company along the way, and I wouldn't trade working there for anywhere else during those years.





I'd recommend just trying the Colab in my comment above to test how quickly you can do what you want with LeMUR versus building your own. Piping 100 hours of audio into an LLM can be a lot of work compared to an API call, but it'll depend on what you are building.


Hey HN, Matt from AssemblyAI here. If you want to test out LeMUR, one of the fastest ways is with our Google Colab: https://colab.research.google.com/drive/1xX-YeAgW5aFQfoquJPX...

I'm happy to answer questions about the API as well.


Do you use Whisper for the transcript (which version? base?) and GPT-3.5-turbo for the language model? Do you provide a self-hosted solution for the companies that don't want their meetings going "on the cloud"? I do not mean to be dismissive of all your work, I know too well the devil is in the details, but what are the key advantages of using your solution over having a Python dev (or GPT-4) write a similar tool using Langchain + whisper + llama2 for example? Again, please do not take this as a cheap shot, I might not be the target audience but if I were to use such a tool I would like everything to run locally because of privacy/corporate spying concerns. Thanks!

EDIT: Also it is unclear if you support other languages than English. Whisper does, so in theory you should. There are companies out there where English is not the work language.


They have their own ASR model, Conformer-2[0], and support 9 languages (they count it as 12)[1].

It looks like their synchronous transcription is much slower than Whisper, but if you need it fast, you need their realtime ASR (or Amazon's or Google's).

[0] Conformer-2 is trained on 1.1M hours of English https://www.assemblyai.com/blog/conformer-2/

[1] https://www.assemblyai.com/docs/Concepts/supported_languages


You can use Deepgram, which has its own model but also offers an option to use Whisper hosted by them.


Love using Google Colab as your onboarding doc.





