Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Very nice being able to read an actual paper on a powerful text-to-video model.


Yeah. It is great. So apparently separating spatial / temporal attention works if you are careful and train with large enough dataset too!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: