Today, right now, you can have a language model write lyrics to "AI killed the Internet star," a TTS model sing it in the style of The Buggles, and a single-channel blind source separation model replace the voice track. Tomorrow, you will just ask your voice assistant, running an action transformer, to do it for you.