VLM2Vec & MMEB: Benchmarking multimodal embeddings and adapting state-of-the-art multimodal large language models into embedding models.
List of Our Papers
Main VLM2Vec / MMEB Series
Other Related Papers from Our Team
- GAE-Retriever – Benchmark and model for trajectory modeling in GUI environments. (Computer-use Agents@ICML 2025)
- B3 – A novel batch mining strategy for contrastive learning. (Neurips2025)
datasets
42
Viewer
•
Updated
•
302k
•
2
VLM2Vec/MMLongBench-page-fixed
Viewer
•
Updated
•
8.91k
•
2.81k
VLM2Vec/ViDoSeek-page-fixed
Viewer
•
Updated
•
8.78k
•
2.85k
Updated
•
163
Viewer
•
Updated
•
1.03M
•
186
•
1
Viewer
•
Updated
•
1.03M
•
59
Viewer
•
Updated
•
4k
•
601
Viewer
•
Updated
•
1.8k
•
78
•
1
Viewer
•
Updated
•
1k
•
429
Viewer
•
Updated
•
4.48k
•
1.59k
•
1