Shengju Qian's picture

2 10 9

Shengju Qian

thesouthfrog

·

thesouthfrog

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models

authored a paper about 2 months ago

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation

authored a paper about 2 months ago

MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation

View all activity

Organizations

upvoted a paper about 2 months ago

V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models

Paper • 2511.16668 • Published Nov 20, 2025 • 54

upvoted a paper 2 months ago

Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations

Paper • 2510.23607 • Published Oct 27, 2025 • 177

upvoted 2 papers 3 months ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 176

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26, 2025 • 184

upvoted 2 papers 6 months ago

EmbRACE-3K: Embodied Reasoning and Action in Complex Environments

Paper • 2507.10548 • Published Jul 14, 2025 • 36

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 159

upvoted a paper 8 months ago

DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception

Paper • 2505.04410 • Published May 7, 2025 • 44

upvoted a paper about 1 year ago

VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping

Paper • 2412.11279 • Published Dec 15, 2024 • 13

upvoted a paper over 1 year ago

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19, 2024 • 52

upvoted a paper over 2 years ago

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 89