What are you talking about? It's MIT licensed.

MitPitt · on May 15, 2023

There is no training code, and the devs don't plan on ever releasing it.

woodson · on May 15, 2023

It’s mostly there in https://github.com/lucidrains/audiolm-pytorch#hierarchical-t.... They just used FAIR's EnCodec (https://github.com/facebookresearch/encodec) instead of SoundStream.

dragonwriter · on May 16, 2023

The voices aren’t the model. The model itself requires conventional training, for which code is not provided, but voices are, or at least can be, built by what could be described as “accumulated in-context learning”. Every time you run text with a voice (which can be null) through the inference process, the result is an audio waveform and an updated history prompt.
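
To make the inference loop concrete, here is a rough sketch in Python. Everything in it is made up for illustration: `run_inference` and `HistoryPrompt` are hypothetical stand-ins, not the project's actual API, and the "audio" is fake. It only shows the shape of the idea: no weights change, and the voice is just the history prompt carried from one call to the next.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class HistoryPrompt:
    """Hypothetical stand-in for a voice: tokens accumulated from earlier runs."""
    tokens: Tuple[int, ...] = ()


def run_inference(
    text: str, voice: Optional[HistoryPrompt] = None
) -> Tuple[List[float], HistoryPrompt]:
    """Toy placeholder for the real model call (not an actual library API).

    Returns a fake waveform plus an updated history prompt.
    No model weights exist here, so nothing is trained or modified.
    """
    prior = voice.tokens if voice is not None else ()  # a null voice starts empty
    new_tokens = tuple(ord(ch) % 256 for ch in text)   # fake "generated" tokens
    waveform = [t / 255.0 for t in new_tokens]         # fake audio samples
    return waveform, HistoryPrompt(prior + new_tokens)


# Start from a null voice, then feed the returned history prompt back in.
audio1, voice = run_inference("Hello.", voice=None)
audio2, voice = run_inference("Same speaker.", voice=voice)
```

Saving the final `voice` and passing it into later calls is what "building a voice" amounts to in this picture; whether the real history prompt grows, is truncated, or is replaced on each run is not something this sketch claims to know.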

huggingmouth · on May 15, 2023

It's only a matter of time.