Open to Collab

Muhammad Umair

umair894

AI & ML interests

Multimodal Reidentification | Feature Upscaling | Object Tracking | PhD UESTC

Recent Activity

liked a Space 1 day ago

C4G-HKUST/AnyTalker

updated a Space 2 days ago

umair894/ai-alchemist-portfolio

published a Space 2 days ago

umair894/ai-alchemist-portfolio

Organizations

upvoted a paper 3 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published 3 days ago • 116

upvoted a paper 7 days ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 8 days ago • 166

upvoted a paper 17 days ago

MedSAM3: Delving into Segment Anything with Medical Concepts

Paper • 2511.19046 • Published 18 days ago • 48

upvoted a paper 18 days ago

Insights from the ICLR Peer Review and Rebuttal Process

Paper • 2511.15462 • Published 23 days ago • 6

upvoted a paper 19 days ago

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published 22 days ago • 109

upvoted a paper 22 days ago

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published 22 days ago • 106

upvoted a paper 25 days ago

Depth Anything 3: Recovering the Visual Space from Any Views

Paper • 2511.10647 • Published 29 days ago • 93

upvoted 2 papers about 1 month ago

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published Oct 16 • 106

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5 • 121

upvoted 11 papers about 2 months ago

DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents

Paper • 2510.19336 • Published Oct 22 • 16

Chronos-2: From Univariate to Universal Forecasting

Paper • 2510.15821 • Published Oct 17 • 19

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Paper • 2510.15742 • Published Oct 17 • 50

ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints

Paper • 2510.14847 • Published Oct 16 • 55

BitNet Distillation

Paper • 2510.13998 • Published Oct 15 • 54

WithAnyone: Towards Controllable and ID Consistent Image Generation

Paper • 2510.14975 • Published Oct 16 • 84

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13 • 165

InfiniHuman: Infinite 3D Human Creation with Precise Control

Paper • 2510.11650 • Published Oct 13 • 5
