common-dataset
updated
HuggingFaceH4/ultrachat_200k
Viewer
• Updated
• 515k • 36k
• 661
Text Generation
• 7B • Updated
• 3.74k
• 315
shareAI/ShareGPT-Chinese-English-90k
Preview
• Updated
• 592
• 278
Viewer
• Updated
• 207M • 20.2k
• 487
lmsys/chatbot_arena_conversations
Viewer
• Updated
• 33k • 1.98k
• 445
Viewer
• Updated
• 968M • 14.8k
• 892
WizardLMTeam/WizardLM_evol_instruct_70k
Viewer
• Updated
• 70k • 1.51k
• 197
LargeWorldModel/LWM-Text-Chat-1M
Text Generation
• Updated
• 287
• 174
Updated
• 1.03k
• 122
microsoft/orca-math-word-problems-200k
Viewer
• Updated
• 200k • 5.59k
• 476
Preview
• Updated
• 231
• 27
Viewer
• Updated
• 52.5B • 164k
• 2.69k
Yukang/LongAlpaca-16k-length
Viewer
• Updated
• 6.28k • 30
• 25
Viewer
• Updated
• 51.8k • 22.2k
• 793
Viewer
• Updated
• 343M • 830
• 10
NousResearch/json-mode-eval
Viewer
• Updated
• 100 • 817
• 41
NousResearch/func-calling-eval-singleturn
Viewer
• Updated
• 112 • 10
• 7
NousResearch/func-calling-eval-glaive
Viewer
• Updated
• 100 • 20
• 8
legacy-datasets/wikipedia
Updated
• 61k
• 610
Viewer
• Updated
• 10.4B • 543k
• 529
open-web-math/open-web-math
Viewer
• Updated
• 6.32M • 12.4k
• 329
codeparrot/github-code-clean
Viewer
• Updated
• 11M • 17.4k
• 134
HuggingFaceFW/fineweb-edu-score-2
Viewer
• Updated
• 13.9B • 9.71k
• 84
HuggingFaceFW/fineweb-edu
Viewer
• Updated
• 3.5B • 226k
• 979
Viewer
• Updated
• 52k • 65k
• 925
Viewer
• Updated
• 772k • 57
• 26
YeungNLP/WizardLM_evol_instruct_V2_143k
Viewer
• Updated
• 143k • 17
• 11
Viewer
• Updated
• 2.94M • 17.3k
• 1.5k
WizardLMTeam/WizardLM_evol_instruct_V2_196k
Viewer
• Updated
• 143k • 2.58k
• 246
timdettmers/openassistant-guanaco
Viewer
• Updated
• 10.4k • 6.59k
• 440
garage-bAInd/Open-Platypus
Viewer
• Updated
• 24.9k • 7.66k
• 415
Viewer
• Updated
• 3.71M • 937k
• 640
Updated
• 222
• 224
Salesforce/xlam-function-calling-60k
Viewer
• Updated
• 60k • 6.54k
• 577
HuggingFaceTB/smollm-corpus
Viewer
• Updated
• 237M • 34.3k
• 441
glaiveai/glaive-function-calling-v2
Viewer
• Updated
• 113k • 8.06k
• 490
mlfoundations/dclm-baseline-1.0-parquet
Viewer
• Updated
• 2.73B • 8.1k
• 33
mlfoundations/dclm-baseline-1.0
Preview
• Updated
• 129k
• 256
ruslanmv/ai-medical-chatbot
Viewer
• Updated
• 257k • 1.3k
• 245
Viewer
• Updated
• 100k • 7.79k
• 265
Viewer
• Updated
• 69.9k • 120k
• 384
xzuyn/manythings-translations-alpaca
Viewer
• Updated
• 6.33M • 31
• 8
Viewer
• Updated
• 21.9M • 1.41k
• 699
Viewer
• Updated
• 1.75M • 128
• 104
mlabonne/open-perfectblend
Viewer
• Updated
• 1.42M • 859
• 66
mlabonne/orca-agentinstruct-1M-v1-cleaned
Viewer
• Updated
• 1.05M • 91
• 66
allenai/tulu-3-sft-mixture
Viewer
• Updated
• 939k • 17k
• 228
NovaSky-AI/Sky-T1_data_17k
Viewer
• Updated
• 16.4k • 269
• 187
Viewer
• Updated
• 552M • 109
• 2
Viewer
• Updated
• 78.1M • 370
• 5
Viewer
• Updated
• 1.13M • 755
• 10
Viewer
• Updated
• 16.2M • 352
• 1
Viewer
• Updated
• 172k • 66
• 2
Viewer
• Updated
• 62.3k • 73
• 2
Viewer
• Updated
• 72.1k • 51
• 1
lianghsun/tw-instruct-500k
Viewer
• Updated
• 500k • 75
• 24