rankings
MTEB Leaderboard 🥇 • 7.06k • Embedding Leaderboard
Open LLM Leaderboard 🏆 • 13.9k • Track, rank and evaluate open LLMs and chatbots
Reward Bench Leaderboard 📐 • 421 • Explore RewardBench model rankings and scores
datasets
argilla/distilabel-intel-orca-dpo-pairs • Updated Aug 7, 2025 • 12.9k • 3.08k • 181
m-a-p/Code-Feedback • Updated Feb 26, 2024 • 66.4k • 1.23k • 229
argilla/ultrafeedback-binarized-preferences-cleaned • Updated Dec 11, 2023 • 60.9k • 3.03k • 160
kyujinpy/orca_math_dpo • Updated Apr 12, 2024 • 15.3k • 64 • 19