arxiv:2601.16208
Jihan Yang PRO
jihanyang
AI & ML interests
Computer Vision, Multimodality, Embodied AI
Recent Activity
upvoted a paper 12 days ago
Beyond Language Modeling: An Exploration of Multimodal Pretraining
upvoted a paper 17 days ago
Solaris: Building a Multiplayer Video World Model in Minecraft
liked a dataset 26 days ago
nyu-visionx/scale-rae-data