Datasets
shayekh/perplexity__aya_dataset__train • 11
• 540k • 26 • 1
argilla/magpie-ultra-v0.1 • 50k • 362 • 221
Magpie-Align/Magpie-Qwen2-Pro-1M-v0.1 • 1M • 124 • 14
HuggingFaceTB/smollm-corpus • 237M • 31k • 440
• 100k • 6.73k • 264
BanglaLLM/bangla-alpaca-orca • 172k • 33 • 4
AhmadMustafa/Urdu-Instruct-News-Article-Generation • 112k • 46 • 4
AhmadMustafa/Urdu-Instruct-News-Headline-Generation • 112k • 14
AhmadMustafa/Urdu-Instruct-News-Category-Classification • 112k • 20
• 10k • 266 • 54
akbargherbal/six_millions_instruction_dataset_for_arabic_llm_ft • 6.37M • 44 • 1
CohereLabs/aya_collection_language_split • 514M • 3.27k • 114
• 63k • 151 • 35
• 21.9M • 1.32k • 699
convaiinnovations/Nadi_Indic466k_Instruct • 466k • 17 • 2
ai4bharat/indic-instruct-data-v0.1 • 404k • 228 • 25
• 9.97k • 24 • 2
MarkrAI/KoCommercial-Dataset • 175k • 615 • 165