QED-Nano: Teaching a Tiny Model to Prove Hard Theorems • Running • Featured • 35
Who needs 1T parameters? Olympiad proofs with a 4B model
HerrHruby/answerbench_offline_acemath_rl_4b_inst_hard_with_dishsoap_16k_self_refine_step_70 Viewer • Updated 25 days ago • 3.2k • 15
HerrHruby/answerbench_offline_acemath_rl_4b_hard_with_dishsoap_16k_self_verify_step_80 Viewer • Updated 25 days ago • 1.6k • 17
HerrHruby/aime_offline_acemath_rl_4b_inst_hard_with_dishsoap_16k_self_refine_step_70 Viewer • Updated 25 days ago • 240 • 17
HerrHruby/aime_offline_acemath_rl_4b_hard_with_dishsoap_16k_self_verify_step_80 Viewer • Updated 25 days ago • 240 • 16
HerrHruby/offline_acemath_rl_4b_inst_hard_with_dishsoap_16k_self_refine_step_70 4B • Updated 26 days ago • 16
HerrHruby/offline_acemath_rl_4b_hard_with_dishsoap_16k_self_verify_step_80 4B • Updated 26 days ago • 20