Dhruv's picture

1 4 2

Dhruv

prieuredesion

AI & ML interests

None yet

Organizations

None yet

upvoted an article 7 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

+7

Jun 3, 2025

•

305

upvoted a collection about 1 year ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated 16 days ago • 85

upvoted 2 papers over 2 years ago

3D-LLM: Injecting the 3D World into Large Language Models

Paper • 2307.12981 • Published Jul 24, 2023 • 38

ViNT: A Foundation Model for Visual Navigation

Paper • 2306.14846 • Published Jun 26, 2023 • 7