VLM2Vec

community

https://github.com/TIGER-AI-Lab/VLM2Vec

AI & ML interests

Multimodal Embeddings and Retrieval.

Recent Activity

ziyjiang updated a Space 22 days ago

VLM2Vec/README

ziyjiang updated a dataset about 2 months ago

VLM2Vec/Video_Caption_HN

ziyjiang published a dataset about 2 months ago

VLM2Vec/Video_Caption_HN

View all activity

Organization Card

Community About org cards

VLM2Vec & MMEB: Benchmarking multimodal embeddings and adapting state-of-the-art multimodal large language models into embedding models.

Website - https://tiger-ai-lab.github.io/VLM2Vec/
Github https://github.com/TIGER-AI-Lab/VLM2Vec

List of Our Papers

Main VLM2Vec / MMEB Series

VLM2Vec / MMEB – Image embedding benchmarking and models. (ICLR2025)
VLM2Vec-V2 / MMEB-V2 – Extension of our previous work to video and visual document tasks. (TMLR2026)

Other Related Papers from Our Team

GAE-Retriever – Benchmark and model for trajectory modeling in GUI environments. (Computer-use Agents@ICML 2025)
B3 – A novel batch mining strategy for contrastive learning. (Neurips2025)

models 1

VLM2Vec/VLM2Vec-V2.0

Image-to-Text • Updated Jul 13, 2025 • 5.67k • 26

datasets 42

VLM2Vec/Video_Caption_HN

Viewer • Updated Dec 20, 2025 • 302k • 2

VLM2Vec/MMLongBench-page-fixed

Viewer • Updated Nov 4, 2025 • 8.91k • 2.81k

VLM2Vec/ViDoSeek-page-fixed

Viewer • Updated Nov 4, 2025 • 8.78k • 2.85k

VLM2Vec/MMEB-V2

Updated Sep 24, 2025 • 163

VLM2Vec/B3-7b

Viewer • Updated Aug 29, 2025 • 1.03M • 186 • 1

VLM2Vec/B3-2b

Viewer • Updated Aug 29, 2025 • 1.03M • 59

VLM2Vec/MVBench

Viewer • Updated Aug 15, 2025 • 4k • 601

VLM2Vec/MomentSeeker_1k8

Viewer • Updated Aug 14, 2025 • 1.8k • 78 • 1

VLM2Vec/ActivityNetQA

Viewer • Updated Aug 8, 2025 • 1k • 429

VLM2Vec/VATEX

Viewer • Updated Aug 3, 2025 • 4.48k • 1.59k • 1

View 42 datasets