Community Blogs and Articles

Community Articles

Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step

FINAL-Bench • 7 days ago • 18

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

EMO: Pretraining mixture of experts for emergent modularity

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

Code a simple RAG from scratch

Small Language Models (SLM): A Comprehensive Overview

Vividh-ASR: Diagnosing and Fixing Studio-Bias in Whisper for Indic Languages

Talking to a 4-Year-Old: A Multilingual Benchmark for Children's AI Companions

How to Comply with SOC 2 and ISO 27001 with Hugging Face: A Practical Guide to AI Model Supply Chain Governance

Introduction to State Space Models (SSM)

Mastering Tensor Dimensions in Transformers

Norm-Preserving Biprojected Abliteration

LLM Architectures Explained: What Powers Today’s Top Models

QVAC MedPsy: State-of-the-Art Medical and Healthcare Language Models for Edge Devices

Introducing the agentic robotics appstore for 10,000 Reachy Minis

Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law

An Introduction to Graph Machine Learning

January 3, 2023

community, stable-diffusion, guide

AI for Game Development: Creating a Farming Game in 5 Days, Part 1

January 2, 2023

guide, intel, hardware

Accelerating PyTorch Transformers Models with Intel Sapphire Rapids (Part 1)

January 2, 2023

partnerships, habana

Faster Training and Inference: Habana Gaudi®2 vs. Nvidia A100 80GB

December 14, 2022

elixir, transformers, stable-diffusion

From GPT2 to Stable Diffusion: Hugging Face Arrives in the Elixir Community

December 9, 2022

rlhf, rl, guide

The Unsung Hero Behind ChatGPT: RLHF Explained in Detail

December 9, 2022

research, time-series

Probabilistic Time Series Forecasting with 🤗 Transformers

December 1, 2022

guide, expert-acceleration-program

Accelerating the Development of Document AI

November 21, 2022

guide, inference

Inference Solutions Offered by Hugging Face

November 21, 2022

nlp, text generation, research

Generating Human-Level Text with Contrastive Search in Transformers 🤗

November 8, 2022

diffusers, stable-diffusion, dreambooth

Training Stable Diffusion with Dreambooth Using Diffusers

November 7, 2022

guide, audio

Fine-Tuning Whisper for Multilingual Speech Recognition with 🤗 Transformers

November 3, 2022

guide, research, open-source-collab

From PyTorch DDP to Accelerate to Trainer: Mastering Distributed Training with Ease

October 21, 2022

open-source-collab, community, research

Optimization Story: BLOOM Inference

October 12, 2022

Community Articles

NEW: Articles from Team or Enterprise organizations will get promoted to the main section.

Software Forgets: Agent Traces Are the Memory

LeRobot Humanoid: An Open, Low-Cost, 3D-Printed Humanoid for Robot Learning

KV Caching Explained: Optimizing Transformer Inference Efficiency

Uncensor any LLM with abliteration
