Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding Paper • 2501.07783 • Published Jan 14, 2025 • 8
OpenGVLab/PIIP-LLaVA_ConvNeXt-B_CLIP-L_640-224_7B Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 12
OpenGVLab/PIIP-LLaVA_ConvNeXt-B_CLIP-L_1024-336_7B Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 9
OpenGVLab/PIIP-LLaVA_ConvNeXt-L_CLIP-L_1024-336_7B Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 7
OpenGVLab/PIIP-LLaVA-Plus_ConvNeXt-L_CLIP-L_1024-336_7B Image-Text-to-Text • 7B • Updated Apr 20, 2025 • 10
OpenGVLab/PIIP-LLaVA_ConvNeXt-L_CLIP-L_1024-336_13B Image-Text-to-Text • 14B • Updated Apr 20, 2025 • 7
OpenGVLab/PIIP-LLaVA_ConvNeXt-B_CLIP-L_1024-336_13B Image-Text-to-Text • 14B • Updated Apr 20, 2025 • 7