Beyond Language Modeling: An Exploration of Multimodal Pretraining
Paper
• 2603.03276 • Published
TactAlign: Human-to-Robot Policy Transfer via Tactile Alignment
CodeV: Code with Images for Faithful Visual Reasoning via Tool-Aware Policy Optimization