M Saad Salman's picture

M Saad Salman

MSS444

·

MSS444

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Understanding by Reconstruction: Reversing the Software Development Process for LLM Pretraining

upvoted a paper 3 days ago

Examining Reasoning LLMs-as-Judges in Non-Verifiable LLM Post-Training

upvoted a paper 4 days ago

ReflexiCoder: Teaching Large Language Models to Self-Reflect on Generated Code and Self-Correct It via Reinforcement Learning

View all activity

Organizations

None yet

commented a paper 6 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190 •

commented 3 papers 8 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 319 •

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263 •

New activity in huggingchat/chat-ui over 1 year ago

[MODELS] Discussion

#372 opened about 2 years ago by

[MODELS] Discussion

#372 opened about 2 years ago by

[MODELS] Discussion

#372 opened about 2 years ago by