
Nguyen Hoang Phi

nghgphi

AI & ML interests

None yet

Recent Activity

updated a model 2 days ago
nghgphi/dpo-rollout-mistralai-ultrafeedback
published a model 2 days ago
nghgphi/dpo-rollout-mistralai-ultrafeedback
updated a collection 3 months ago
nice-to-read

Organizations

None yet

Collections 1

nice-to-read
  • Don't Waste Mistakes: Leveraging Negative RL-Groups via Confidence Reweighting

    Paper • 2510.08696 • Published Oct 9, 2025 • 14
  • Mitigating Overthinking through Reasoning Shaping

    Paper • 2510.09535 • Published Oct 10, 2025 • 4

models 2

nghgphi/dpo-rollout-mistralai-ultrafeedback

Updated 2 days ago

nghgphi/qwen_alpha_iter0-ckpt_merged

2B • Updated Jun 6, 2025 • 2

datasets 3

nghgphi/DomainNet

Viewer • Updated Sep 20, 2025 • 587k • 39

nghgphi/evaluate_spin

Viewer • Updated May 27, 2025 • 180k • 16

nghgphi/ultrachat_200k

Updated Apr 2, 2025 • 4