Solaris: Building a Multiplayer Video World Model in Minecraft Paper • 2602.22208 • Published 16 days ago • 28
MIND: Benchmarking Memory Consistency and Action Control in World Models Paper • 2602.08025 • Published Feb 8 • 12
When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models Paper • 2602.10179 • Published about 1 month ago • 6
Olaf-World: Orienting Latent Actions for Video World Modeling Paper • 2602.10104 • Published about 1 month ago • 27
Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory Paper • 2602.02393 • Published Feb 2 • 16
WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation Paper • 2511.11434 • Published Nov 14, 2025 • 46
VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation Paper • 2511.02778 • Published Nov 4, 2025 • 102