ChenYuWei

Yvonnnne

AI & ML interests

None yet

Organizations

None yet

liked a dataset 8 days ago

bigai/TongSIM-Asset

Updated 7 days ago • 17.7k • 270

upvoted a paper 14 days ago

Region-Constraint In-Context Generation for Instructional Video Editing

Paper • 2512.17650 • Published 17 days ago • 50

liked a model 23 days ago

FutureMa/Qwen3-8B-Drama-Thinking

Text Generation • 308k • Updated 13 days ago • 2.21k • 89

upvoted 5 papers about 1 month ago

AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning

Paper • 2511.19304 • Published Nov 24, 2025 • 90

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published Nov 20, 2025 • 92

V-Thinker: Interactive Thinking with Images

Paper • 2511.04460 • Published Nov 6, 2025 • 97

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 132

The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment

Paper • 2511.20614 • Published Nov 25, 2025 • 37

liked 2 models about 1 month ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated Nov 8, 2025 • 329k • 1.59k

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 3.35M • 3.04k

upvoted a paper 7 months ago

VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos

Paper • 2505.23693 • Published May 29, 2025 • 53

upvoted 6 papers 10 months ago

RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15, 2024 • 72

Uni-SMART: Universal Science Multimodal Analysis and Research Transformer

Paper • 2403.10301 • Published Mar 15, 2024 • 54

Veagle: Advancements in Multimodal Representation Learning

Paper • 2403.08773 • Published Jan 18, 2024 • 10

VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding

Paper • 2403.09530 • Published Mar 14, 2024 • 10

3D-VLA: A 3D Vision-Language-Action Generative World Model

Paper • 2403.09631 • Published Mar 14, 2024 • 11

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences

Paper • 2403.09347 • Published Mar 14, 2024 • 22

liked a dataset 10 months ago

axxkaya/UVT-Terminological-based-Vision-Tasks

Viewer • Updated May 26, 2025 • 1.32M • 1.49k • 43

liked a model 10 months ago

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11, 2025 • 124k • 2.88k

upvoted a paper 11 months ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published Feb 13, 2025 • 191