Embodied Referring Expression Comprehension in Human-Robot Interaction Paper • 2512.06558 • Published Dec 6, 2025 • 3
oMeBench: Towards Robust Benchmarking of LLMs in Organic Mechanism Elucidation and Reasoning Paper • 2510.07731 • Published Oct 9, 2025 • 5
TICL: Text-Embedding KNN For Speech In-Context Learning Unlocks Speech Recognition Abilities of Large Multimodal Models Paper • 2509.13395 • Published Sep 16, 2025 • 1
Open-NeRF: Towards Open Vocabulary NeRF Decomposition Paper • 2310.16383 • Published Oct 25, 2023 • 2
Learning Implicit Representation for Reconstructing Articulated Objects Paper • 2401.08809 • Published Jan 16, 2024
MagicPose4D: Crafting Articulated Models with Appearance and Motion Control Paper • 2405.14017 • Published May 22, 2024 • 3
S3O: A Dual-Phase Approach for Reconstructing Dynamic Shape and Skeleton of Articulated Objects from Single Monocular Video Paper • 2405.12607 • Published May 21, 2024
RGB-Only Supervised Camera Parameter Optimization in Dynamic Scenes Paper • 2509.15123 • Published Sep 18, 2025 • 5
HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation Paper • 2506.21546 • Published Jun 26, 2025 • 2
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published Jul 2, 2025 • 69
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time Paper • 2505.24863 • Published May 30, 2025 • 97
Leveraging Hyperbolic Embeddings for Coarse-to-Fine Robot Design Paper • 2311.00462 • Published Nov 1, 2023 • 1
Offline Meta Reinforcement Learning with In-Distribution Online Adaptation Paper • 2305.19529 • Published May 31, 2023
Symmetry-Aware Robot Design with Structured Subgroups Paper • 2306.00036 • Published May 31, 2023 • 2
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Paper • 2502.09560 • Published Feb 13, 2025 • 35
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models Paper • 2411.00836 • Published Oct 29, 2024 • 15
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation Paper • 2409.18964 • Published Sep 27, 2024 • 27