arxiv:2512.22905
Hao Fei
scofield7419
AI & ML interests
Multimodal Learning, Large Language Model, Vision and Language, Natural Language Processing, Structural Modeling
Recent Activity
liked
a dataset
about 1 hour ago
UniVA-Agent/UniVA-Bench
authored
a paper
24 days ago
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
upvoted
a
paper
25 days ago
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation