
Hiring 💼

Quentin Gallouédec PRO

qgallouedec

huggingface


AI & ML interests

None yet

Recent Activity

updated a dataset 2 days ago

hf-doc-build/doc-build-dev

updated a Space 2 days ago

qgallouedec/diff-view

updated a Space 2 days ago

qgallouedec/trackio-0.20



upvoted a paper 4 days ago

Fine-Tuning Language Models from Human Preferences

Paper • 1909.08593 • Published Sep 18, 2019 • 4

upvoted a paper 16 days ago

Fewer Truncations Improve Language Modeling

Paper • 2404.10830 • Published Apr 16, 2024 • 5

upvoted an article 18 days ago

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Article • 20 days ago • 79

upvoted an article 19 days ago

Introducing Storage Buckets on the Hugging Face Hub

Article • 20 days ago • 186

upvoted an article 26 days ago

Bringing Autonomous Driving RL to OpenEnv and TRL

Article • Feb 26 • 21

upvoted a collection 27 days ago

Qwen3.5

21 items • Updated 20 days ago • 1.34k

upvoted 2 articles about 1 month ago

Did GPT 5.2 make a breakthrough discovery in theoretical physics?

Article • Feb 19 • 62

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Article • Feb 20 • 490

upvoted 2 papers about 1 month ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published Feb 17 • 119

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Paper • 2602.03773 • Published Feb 3 • 12

upvoted an article about 1 month ago

Scaling OpenEnv: From Free Usage to Thousands of Concurrent Environments

Article • Jan 20 • 12

upvoted 2 articles about 2 months ago

Transformers.js v4 Preview: Now Available on NPM!

Article • Feb 9 • 77

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

Article • Feb 11, 2025 • 109

upvoted 3 papers about 2 months ago

Rethinking the Trust Region in LLM Reinforcement Learning

Paper • 2602.04879 • Published Feb 4 • 37

Defeating the Training-Inference Mismatch via FP16

Paper • 2510.26788 • Published Oct 30, 2025 • 31

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Paper • 2602.05261 • Published Feb 5 • 52

upvoted an article about 2 months ago

Preference Tuning LLMs with Direct Preference Optimization Methods

Article • Jan 18, 2024 • 80

upvoted a collection about 2 months ago

AlphaGenome

Collection of AlphaGenome models. • 5 items • Updated 18 days ago • 35

upvoted an article 2 months ago

Transformers v5: Simple model definitions powering the AI ecosystem

Article • Dec 1, 2025 • 307

upvoted a paper 2 months ago

Your Group-Relative Advantage Is Biased

Paper • 2601.08521 • Published Jan 13 • 158