LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation
Paper
•
2412.15188
•
Published
•
1
None defined yet.
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Scaling Zero-Shot Reference-to-Video Generation