Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30, 2025 • 119
StateX: Enhancing RNN Recall via Post-training State Expansion Paper • 2509.22630 • Published Sep 26, 2025 • 3
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity Paper • 2507.08771 • Published Jul 11, 2025 • 9
Cost-Optimal Grouped-Query Attention for Long-Context LLMs Paper • 2503.09579 • Published Mar 12, 2025 • 5
MARS: Unleashing the Power of Variance Reduction for Training Large Models Paper • 2411.10438 • Published Nov 15, 2024 • 13
Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling Paper • 2410.07145 • Published Oct 9, 2024 • 2
A failed experiment: Infini-Attention, and why we should keep trying? Article • Published Aug 14, 2024 • 74
Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models Paper • 2406.15718 • Published Jun 22, 2024 • 14