PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language • Paper • 2505.10055 • Published May 15, 2025
Qwen2.5-VL • Collection • Vision-language model series based on Qwen2.5 • 11 items • Updated 9 days ago
Qwen2-VL • Collection • Vision-language model series based on Qwen2 • 16 items • Updated 9 days ago