LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory Paper • 2603.03269 • Published 11 days ago • 52
VGGT-Det: Mining VGGT Internal Priors for Sensor-Geometry-Free Multi-View Indoor 3D Object Detection Paper • 2603.00912 • Published 14 days ago • 36
From Scale to Speed: Adaptive Test-Time Scaling for Image Editing Paper • 2603.00141 • Published 18 days ago • 134
Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels Paper • 2603.02573 • Published 12 days ago • 11
RoMa v2: Harder Better Faster Denser Feature Matching Paper • 2511.15706 • Published Nov 19, 2025 • 9
Adam Improves Muon: Adaptive Moment Estimation with Orthogonalized Momentum Paper • 2602.17080 • Published 24 days ago • 3
UniWeTok: An Unified Binary Tokenizer with Codebook Size 2^{128} for Unified Multimodal Large Language Model Paper • 2602.14178 • Published 27 days ago • 13
FLAC: Maximum Entropy RL via Kinetic Energy Regularized Bridge Matching Paper • 2602.12829 • Published 29 days ago • 4
Conversational Image Segmentation: Grounding Abstract Concepts with Scalable Supervision Paper • 2602.13195 • Published 29 days ago • 4
Stroke of Surprise: Progressive Semantic Illusions in Vector Sketching Paper • 2602.12280 • Published 30 days ago • 32
MOSS-Audio-Tokenizer: Scaling Audio Tokenizers for Future Audio Foundation Models Paper • 2602.10934 • Published Feb 11 • 49
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale Paper • 2602.05711 • Published Feb 5 • 11