Paper | Publisher | Reader | Date | Repo | Notes |
---|---|---|---|---|---|
CogVideoX | arXiv | Mingzhe Huang | Mar.4.2025 | YourRepoName | |
T2I Survey | arXiv | Ruoxuan Li | Mar.25.2025 | YourRepoName | |
T2V Survey | arXiv | Zhizhong Kong | Mar.25.2025 | YourRepoName | |
... | ... | ... | ... | ... | ... |
Zhao Ding, Mingzhe Huang, Zhizhong Kong, Ruoxuan Li
- Controllable Generation with Text-to-Image Diffusion Models: A Survey
- Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation
- MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence
- Video-Infinity: Distributed Long Video Generation
- Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
- Adding Conditional Control to Text-to-Image Diffusion Models
- ControlVideo: Conditional Control for One-shot Text-driven Video Editing and Beyond
- Visual Generation without Guidance