sebastianrcnt's Collections
interesting

updated Mar 2, 2025

  • Slamming: Training a Speech Language Model on One GPU in a Day

    Paper • 2502.15814 • Published Feb 19, 2025 • 69

  • Small Models Struggle to Learn from Strong Reasoners

    Paper • 2502.12143 • Published Feb 17, 2025 • 39

  • HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading

    Paper • 2502.12574 • Published Feb 18, 2025 • 13

  • Large Language Diffusion Models

    Paper • 2502.09992 • Published Feb 14, 2025 • 126

  • Distillation Scaling Laws

    Paper • 2502.08606 • Published Feb 12, 2025 • 47

  • Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

    Paper • 2502.05171 • Published Feb 7, 2025 • 152