Post
453
๐ Introducing VideoCoF: Unified Video Editing with a Temporal Reasoner (Chain-of-Frames)!
Weโre excited to introduce VideoCoF, a unified framework for instruction-based video editing that enables temporal reasoning and ~4ร video length extrapolation, trained with only 50k video pairs. ๐ฅ
๐ What makes VideoCoF different?
๐ง Chain-of-Frames reasoning , mimic human thinking process like Seeing โ Reasoning โ Editing to apply edits accurately over time without external masks, ensuring physically plausible results.
๐ Strong length generalization โ trained on 33-frame clips, yet supports multi-shot editing and long-video extrapolation (~4ร).
๐ฏ Unified fine-grained editing โ Object Removal, Addition, Swap, and Local Style Transfer, with instance-level & part-level, spatial-aware control.
โก Fast inference update
๐ H100: ~20s / video with 4-step inference, making high-quality video editing far more practical for real-world use.
๐ Links
๐ Paper: https://arxiv.org/abs/2512.07469
๐ป Code: https://github.com/knightyxp/VideoCoF
๐ค Demo: XiangpengYang/VideoCoF
๐งฉ Models: XiangpengYang/VideoCoF
๐ Project Page: https://videocof.github.io/
#VideoEditing #DiffusionModels #GenerativeAI #ComputerVision #AI
Weโre excited to introduce VideoCoF, a unified framework for instruction-based video editing that enables temporal reasoning and ~4ร video length extrapolation, trained with only 50k video pairs. ๐ฅ
๐ What makes VideoCoF different?
๐ง Chain-of-Frames reasoning , mimic human thinking process like Seeing โ Reasoning โ Editing to apply edits accurately over time without external masks, ensuring physically plausible results.
๐ Strong length generalization โ trained on 33-frame clips, yet supports multi-shot editing and long-video extrapolation (~4ร).
๐ฏ Unified fine-grained editing โ Object Removal, Addition, Swap, and Local Style Transfer, with instance-level & part-level, spatial-aware control.
โก Fast inference update
๐ H100: ~20s / video with 4-step inference, making high-quality video editing far more practical for real-world use.
๐ Links
๐ Paper: https://arxiv.org/abs/2512.07469
๐ป Code: https://github.com/knightyxp/VideoCoF
๐ค Demo: XiangpengYang/VideoCoF
๐งฉ Models: XiangpengYang/VideoCoF
๐ Project Page: https://videocof.github.io/
#VideoEditing #DiffusionModels #GenerativeAI #ComputerVision #AI