IVRA: Improving Visual-Token Relations for Robot Action Policy with Training-Free Hint-Based Guidance Paper โข 2601.16207 โข Published 7 days ago โข 7
Future Optical Flow Prediction Improves Robot Control & Video Generation Paper โข 2601.10781 โข Published 14 days ago โข 19
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy Paper โข 2406.20095 โข Published Jun 28, 2024 โข 18
Perceptual Grouping in Contrastive Vision-Language Models Paper โข 2210.09996 โข Published Oct 18, 2022