stephen-flood
's Collections
Benchmarks
updated
Viewer
•
Updated
•
2.09k
•
2
•
4
Viewer
•
Updated
•
5.82M
•
5.31k
•
43
Viewer
•
Updated
•
231k
•
288k
•
628
Benchmark
•
Updated
•
17.6k
•
475k
•
1.13k
Viewer
•
Updated
•
19.6k
•
19
lighteval/legal_summarization
Viewer
•
Updated
•
26.9k
•
111
•
25
Viewer
•
Updated
•
1.6k
•
113
•
1
lighteval/synthetic_reasoning
Viewer
•
Updated
•
33k
•
73
•
7
lighteval/synthetic_reasoning_natural
Viewer
•
Updated
•
22k
•
53
•
15
Viewer
•
Updated
•
90.3k
•
96
•
3
lighteval/GPT3_unscramble
Viewer
•
Updated
•
50k
•
19
•
1
lighteval/aimo_progress_prize_1
Viewer
•
Updated
•
10
•
24
Viewer
•
Updated
•
1.7k
•
14
Viewer
•
Updated
•
72.5k
•
2.69k
•
141
Viewer
•
Updated
•
860k
•
10.7k
•
531
Text Classification
•
73B
•
Updated
•
32.3k
•
81
Jofthomas/hermes-function-calling-thinking-V1
Viewer
•
Updated
•
3.57k
•
361
•
73
NousResearch/hermes-function-calling-v1
Viewer
•
Updated
•
11.6k
•
1.76k
•
373
Viewer
•
Updated
•
15.7k
•
23
•
5
Viewer
•
Updated
•
621M
•
20.4k
•
86
open-web-math/open-web-math
Viewer
•
Updated
•
6.32M
•
7.41k
•
326