---
license: cc-by-nc-sa-4.0
tags:
- robotics
- vision-language-action-model
- vision-language-model
---
# Model Card for InternVLA-M1_object

InternVLA-M1 is an open-source, end-to-end vision–language–action (VLA) framework for building and researching generalist robot policies.

- 🌐 Homepage: [InternVLA-M1 Project Page](https://internrobotics.github.io/internvla-m1.github.io/)
- 💻 Codebase: [InternVLA-M1 GitHub Repo](https://github.com/InternRobotics/InternVLA-M1)

## Training Details
```
action_chunk: 8
batch_size: 128
training_steps: 30k
```
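
To give a rough sense of scale for these settings, the sketch below multiplies them out. It is only back-of-the-envelope arithmetic under stated assumptions, not a description of the actual training pipeline: it assumes `batch_size` is the global batch per optimizer step (no gradient accumulation), that `30k` means 30,000 steps, and that `action_chunk` is the number of actions supervised per sample. The function name `training_scale` is hypothetical.

```python
# Values copied from the Training Details block; "30k" is read as 30,000.
ACTION_CHUNK = 8
BATCH_SIZE = 128
TRAINING_STEPS = 30_000


def training_scale(action_chunk, batch_size, training_steps):
    """Return (samples seen, action targets) for a training run.

    Assumes batch_size is the global batch per optimizer step and that
    each sample supervises action_chunk actions (an assumption, not
    something the model card states).
    """
    samples_seen = batch_size * training_steps
    action_targets = samples_seen * action_chunk
    return samples_seen, action_targets


samples_seen, action_targets = training_scale(
    ACTION_CHUNK, BATCH_SIZE, TRAINING_STEPS
)
print(f"samples seen:   {samples_seen:,}")    # 3,840,000
print(f"action targets: {action_targets:,}")  # 30,720,000
```

Note that "samples seen" counts every sample drawn during training, including repeats across epochs, so it is not the number of unique demonstrations in the dataset.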

## Citation
```
@misc{internvla2024,
  title = {InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy},
  author = {InternVLA-M1 Contributors},
  year = {2025},
  howpublished = {arXiv},
}
```