VLM/Robotics
updated
Chat-UniVi: Unified Visual Representation Empowers Large Language Models
with Image and Video Understanding
Paper
•
2311.08046
•
Published
•
2
Robotics
•
2B
•
Updated
•
131
•
341
Image-Text-to-Text
•
1B
•
Updated
•
159
•
26
nvidia/PhysicalAI-Robotics-GR00T-X-Embodiment-Sim
Updated
•
841k
•
180
Robotics
•
4B
•
Updated
•
540
•
304
Robotics
•
Updated
•
16
•
13
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State
Space Duality
Paper
•
2411.15241
•
Published
•
7
MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
Paper
•
2411.15941
•
Published
•
2
Image Classification
•
Updated
•
48
Vision Mamba: Efficient Visual Representation Learning with
Bidirectional State Space Model
Paper
•
2401.09417
•
Published
•
62
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Paper
•
2407.20179
•
Published
•
47