Max Ku

vinesmsuic

https://kuwingfung.github.io/

AI & ML interests

Computer Vision, World Models

Recent Activity

upvoted a paper 1 day ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

upvoted a paper 2 days ago

SWE-Next: Scalable Real-World Software Engineering Tasks for Agents

authored a paper 10 days ago

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Organizations

upvoted a paper 1 day ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

Paper • 2604.05117 • Published 4 days ago • 27

upvoted a paper 2 days ago

SWE-Next: Scalable Real-World Software Engineering Tasks for Agents

Paper • 2603.20691 • Published 20 days ago • 10

upvoted a paper 10 days ago

ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks

Paper • 2603.27862 • Published 11 days ago • 30

upvoted a paper 16 days ago

OpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory Synthesis

Paper • 2603.20278 • Published 23 days ago • 94

upvoted a paper about 2 months ago

VisPhyWorld: Probing Physical Reasoning via Code-Driven Video Reconstruction

Paper • 2602.13294 • Published Feb 9 • 13

upvoted a paper 2 months ago

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Paper • 2602.06028 • Published Feb 5 • 36

upvoted a paper 4 months ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published Dec 1, 2025 • 74

upvoted a paper 6 months ago

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

Paper • 2510.10666 • Published Oct 12, 2025 • 28

upvoted 3 papers about 1 year ago

upvoted a collection about 1 year ago

TheoremExplain

Collection

2 items • Updated Feb 27, 2025 • 4

upvoted 7 papers about 1 year ago

Position: Interactive Generative Video as Next-Generation Game Engine

Paper • 2503.17359 • Published Mar 21, 2025 • 61

Cube: A Roblox View of 3D Intelligence

Paper • 2503.15475 • Published Mar 19, 2025 • 31

Long-Video Audio Synthesis with Multi-Agent Collaboration

Paper • 2503.10719 • Published Mar 13, 2025 • 9

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14, 2025 • 148

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75

Mobius: Text to Seamless Looping Video Generation via Latent Shift

Paper • 2502.20307 • Published Feb 27, 2025 • 18

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Paper • 2502.19400 • Published Feb 26, 2025 • 47

upvoted a paper over 1 year ago

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Paper • 2412.00927 • Published Dec 1, 2024 • 29
