BERT didn’t go anywhere, and I have seen fine-tuned BERT backbones everywhere. They are useful for generating embeddings to be used downstream, and small enough to run on consumer (pre-Ampere) hardware. One of the trends I have seen is scaling BERT down rather than up: since BERT already gave good performance, we want to be able to do the same thing faster and cheaper. That gave rise to RoBERTa, ALBERT, and DistilBERT.
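For anyone who hasn’t done it, pulling embeddings out of a BERT-family backbone is only a few lines with the transformers library. Here’s a rough sketch; the model name and mean pooling are my own illustrative choices, not a description of anyone’s production setup:

```python
# Minimal sketch: sentence embeddings from a DistilBERT backbone.
# Model choice and mean pooling are assumptions for illustration.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")
model = AutoModel.from_pretrained("distilbert-base-uncased")
model.eval()

sentences = [
    "BERT backbones are still useful for embeddings.",
    "Decoder-only models dominate text generation.",
]

inputs = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Mean-pool token embeddings, ignoring padding positions.
mask = inputs["attention_mask"].unsqueeze(-1).float()
embeddings = (outputs.last_hidden_state * mask).sum(dim=1) / mask.sum(dim=1)
print(embeddings.shape)  # (2, 768) for distilbert-base-uncased
```

You’d then feed those vectors into whatever downstream task you have (classification, clustering, retrieval), which is where the small fine-tuned backbones really shine on modest hardware.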
T5 I have worked with less, but I would be curious about its head-to-head performance against decoder-only models these days. My guess is the downsides from before (context window limitations) are less of a factor than they used to be.
I tried some large-scale translation tasks with T5 and the results were iffy at best. I’m going to try the same task with the newest Mistral small models and compare; my guess is Mistral will be better.
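For reference, the original T5 checkpoints are prompted for translation with a task prefix rather than chat-style instructions, which may partly explain the mixed results. A quick sketch of the standard usage (model size and language pair are just examples on my end):

```python
# Hedged sketch of T5 translation via its task prefix with Hugging Face
# transformers. "t5-base" and English->German are illustrative choices;
# the original T5 only covers a handful of language pairs out of the box.
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-base")
model = T5ForConditionalGeneration.from_pretrained("t5-base")

text = "translate English to German: The results were mixed at best."
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```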
Translation with Mistral 7B has been eye-opening. It’s a bit sad that it doesn’t cover all languages, but for the languages it does support it’s been awesome. It’s exciting to think where everything will be in a few years.