lmms-lab/LLaVA-Video-178K
Viewer • Updated • 1.63M • 29k • 194
VideoRoPE: What Makes for Good Video Rotary Position Embedding?
Trained model: Qwen2VL Vision Tower + Qwen2 Language Model
RoPE type: VideoRoPE
To use this model, simply set which_type='videorope' and scale_factor=2.0.
For more details, please refer to the code implementation.