Sumuk Shashidhar
PRO
sumuks
35 followers · 54 following
https://sumuk.org
sumukx
sumukshashidhar
sumuks
AI & ML interests
Evaluations, Reasoning, Long Term Planning
Recent Activity
updated a dataset about 5 hours ago: sumuks/Litbench-Verified-with-rewards-hard-only
published a dataset about 5 hours ago: sumuks/Litbench-Verified-with-rewards-hard-only
updated a dataset about 7 hours ago: sumuks/Litbench-Verified-with-rewards
Organizations
Articles
1
Article
4
Getting Started with YourBench
Papers
5
arxiv:2505.01592
arxiv:2504.20090
arxiv:2504.01833
arxiv:2410.03731
models
0
None public yet
datasets
31
sumuks/Litbench-Verified-with-rewards-hard-only • Updated about 5 hours ago • 7.9k
sumuks/Litbench-Verified-with-rewards • Updated about 6 hours ago • 20k
sumuks/helpsteer3 • Updated 3 days ago • 37.9k • 193
sumuks/openai-coval-dpo • Updated 7 days ago • 5.58k • 128
sumuks/preference-atlas-rewards • Updated 24 days ago • 5.03k • 33
sumuks/preference-atlas • Updated 24 days ago • 329k • 106 • 1
sumuks/reward-bench-2 • Updated 24 days ago • 1.87k • 48
sumuks/helpsteer3-easy • Updated Feb 17 • 7.93k • 30
sumuks/helpsteer-pairwise-grading • Updated Feb 12 • 51.8k • 20
sumuks/rupo-eval-logs-helpsteer3-1 • Updated Feb 10 • 1.43k • 47