jina-vlm Collection Jina-VLM: Small Multilingual Vision Language Model • 2 items • Updated 9 days ago • 5
OmniPSD: Layered PSD Generation with Diffusion Transformer Paper • 2512.09247 • Published 4 days ago • 40
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published 4 days ago • 119
Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch Paper • 2512.02395 • Published 12 days ago • 46
Skywork-R1V4 Collection Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch • 4 items • Updated 4 days ago • 5
PaCoRe Collection Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning • 3 items • Updated 4 days ago • 6
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published 5 days ago • 71
view changelog Changelog Team & Enterprise Articles Now Featured on the Hugging Face Blog 5 days ago • 50
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published 9 days ago • 166
view article Article Introducing swift-huggingface: The Complete Swift Client for Hugging Face 9 days ago • 29
XVLA Collection X-VLA is a soft-prompted Transformer for cross-embodiment robot learning • 6 items • Updated 9 days ago • 9
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling Paper • 2512.03000 • Published 11 days ago • 35