daVinci-MagiHuman Conjures Expressive Talking Videos from Text
daVinci-MagiHuman is an open-source audio-video generation model that creates synchronized video and audio content from text prompts. The model uses a single-stream Transformer architecture to process text, video, and audio…