arxiv:2412.01800
hangyu guo
Rosiness
AI & ML interests
Natural Language Processing
Recent Activity
upvoted a paper 5 days ago
mHC: Manifold-Constrained Hyper-Connections
upvoted a paper 7 days ago
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
upvoted a paper 13 days ago
Scaling Laws for Code: Every Programming Language Matters