stephen-flood's Collections
Benchmarks
Viewer • Updated • 2.09k • 18 • 4
Viewer • Updated • 5.82M • 11k • 43
Viewer • Updated • 231k • 302k • 609
Benchmark • Updated • 17.6k • 417k • 1.1k
Viewer • Updated • 19.6k • 20
lighteval/legal_summarization • Viewer • Updated • 26.9k • 99 • 25
Viewer • Updated • 1.6k • 132 • 1
lighteval/synthetic_reasoning • Viewer • Updated • 33k • 133 • 7
lighteval/synthetic_reasoning_natural • Viewer • Updated • 22k • 68 • 15
Viewer • Updated • 90.3k • 208 • 3
lighteval/GPT3_unscramble • Viewer • Updated • 50k • 12 • 1
lighteval/aimo_progress_prize_1 • Viewer • Updated • 10 • 7
Viewer • Updated • 1.7k • 35
Viewer • Updated • 72.5k • 2.36k • 140
Viewer • Updated • 860k • 9.96k • 524
Text Classification • 73B • Updated • 32.3k • 81
Jofthomas/hermes-function-calling-thinking-V1 • Viewer • Updated • 3.57k • 642 • 72
NousResearch/hermes-function-calling-v1 • Viewer • Updated • 11.6k • 1.74k • 368
Viewer • Updated • 15.7k • 59 • 5
Viewer • Updated • 621M • 35.7k • 84
open-web-math/open-web-math • Viewer • Updated • 6.32M • 7.29k • 324