Hacker News new | past | comments | ask | show | jobs | submit login

I am Unbabel's CTO. Thanks for your comment. Our goal is exactly to use that feedback to improve our MT systems. We are currently outsourcing our MT, while training our own models (using Moses). Besides generating parallel text, the types of data we will be collecting (e.g. chain of editions performed by each editor on a task), will allow new and interesting algorithms to update the translation models.



I remember reading your NLP papers back in my academic days. Great work. One typo: in the https://www.unbabel.com/pricing/ page "from Portuguese to Portuguese can take some time".


And Italian to Portuguese should be not available.


Why should it not be available?


There is currently a gray dot, which for the other languages is used to indicate that the source and destination language is the same. For Italian to Portuguese this is not the case. It should have one of the other colored icons.


Ah, thanks, sorry, bug on our part, being corrected right now, thank you for pointing it out.


Do you offer support for time-coded transcriptions (i.e. video subtitles) as well?

We would love a good provider to enrich video with translations (for when we find a good one that offers machine transcription.)


Hello,

Not yet, but could you send us an example?


We haven't found a good provider yet to do this properly for our use case, but SpeakerText, Koemei and VoiceBase are examples of companies that offer these functionalities.

Unfortunately SpeakerText doesn't offer non-post-processed prices, Koemei integrated it into their own product and VoiceBase didn't offer post-processing on request, which we would need for integration into our product.

Which format will become mainstream probably depends on HTML5 adoption, which is detailed here http://www.3playmedia.com/how-it-works/how-to-guides/html5-v... Currently WebVTT seems to be in the lead.

Those formats don't accommodate for timestamps per spoken word though, which would be possible with machine transcription and which I would pay a premium for.


Awesome. I'm learning about Moses at the moment (MT course at Edinburgh), see you worked with Ben Taskar also, RIP. Exciting idea here!


Maybe this is intentional, but from your homepage, I have no idea what languages you handle.


Good point, we are in the process of updating our homepage. Things have been moving so fast it is hard to keep up. In any case we currently offer translations in English, Portuguese, Spanish, Italian, French and Turkish. More languages to come soon.




Consider applying for YC's W25 batch! Applications are open till Nov 12.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: