hazyresearch/ncm-tokenized-datasets
hazyresearch/OT_8K_seed_all_responses
• 396k • 42
hazyresearch/Arena-Hard-Auto-raw-minimal-v0.1
• 750 • 11
hazyresearch/naturalreasoning_balanced_gpt5_09.11
• 1.32k • 7
hazyresearch/wildchat_balanced_gpt5_09.11
• 1.68k • 4
hazyresearch/m07d28_niah_synthesize_llama-3.2-3b_n65536_k1-1
• 65.5k • 10
hazyresearch/m07d28_niah_synthesize_llama-3.2-3b_n65536_k1-0
• 65.5k • 12
hazyresearch/m07d28_mtob_synthesize_qwen3-4b_n65536-1
• 65.5k • 14
hazyresearch/m07d28_mtob_synthesize_qwen3-4b_n65536-0
• 65.5k • 25
hazyresearch/m07d28_mtob_synthesize_llama-3.2-3b_n65536-1
• 65.5k • 14
hazyresearch/m07d28_mtob_synthesize_llama-3.2-3b_n65536-0
• 65.5k • 25
hazyresearch/m07d11_longhealth_synthesize_qwen3-4b_p10_n65536-1
• 65.5k • 47
hazyresearch/m07d11_longhealth_synthesize_qwen3-4b_p10_n65536-0
• 65.5k • 73
hazyresearch/m07d11_longhealth_synthesize_llama-3.2-3b_p10_n65536-2
• 65.5k • 20
hazyresearch/m07d11_longhealth_synthesize_llama-3.2-3b_p10_n65536-1
• 65.5k • 18
hazyresearch/m07d11_longhealth_synthesize_llama-3.2-3b_p10_n65536-0
• 65.5k • 45
hazyresearch/arxiv_synthesize_eval_gpt-5-mini-2025-08-07_n32-0
• 32 • 21
hazyresearch/arxiv_synthesize_qwen-qwen3-4b_n8192-0
• 8.19k • 23
hazyresearch/MATH500_with_Llama_3.1_8B_Instruct_v1
• 500 • 11
hazyresearch/GPQA_with_Llama_3.1_8B_Instruct_v1
• 646 • 140
hazyresearch/MMLU_with_Llama_3.1_8B_Instruct_v1
• 719 • 10
hazyresearch/MMLU-Pro_with_Llama_3.1_8B_Instruct_v1
• 500 • 10
hazyresearch/MMLU-Pro_with_Llama_3.1_70B_Instruct_v1
• 500 • 38
hazyresearch/MMLU_with_Llama_3.1_70B_Instruct_v1
• 719 • 7
hazyresearch/GPQA_with_Llama_3.1_70B_Instruct_v1
• 646 • 256
hazyresearch/MATH500_with_Llama_3.1_70B_Instruct_v1
• 500 • 100
hazyresearch/MATH-500_with_Llama_3.1_8B_Instruct_v1
• 500 • 9