ilgee hong

ilgee

·

ilgeehong

AI & ML interests

None yet

Recent Activity

updated a model 7 days ago

ilgee/GRPO-HS3-Qwen3-4B-Instruct-2507

published a model 7 days ago

ilgee/GRPO-HS3-Qwen3-4B-Instruct-2507

upvoted a paper about 1 month ago

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

Organizations

None yet

Papers 1

arxiv:2305.18379

Models 41

ilgee/GRPO-HS3-Qwen3-4B-Instruct-2507

4B • Updated 7 days ago • 18

ilgee/Binary-Think-RM-3B

3B • Updated Nov 2, 2025 • 5 • 1

ilgee/Multiclass-Think-RM-8B

8B • Updated Nov 2, 2025 • 4

ilgee/Binary-Think-RM-8B

8B • Updated Nov 2, 2025 • 2

ilgee/gemma-3-text-4b-it

4B • Updated Aug 4, 2025

ilgee/gemma-3-4b-it-planner-math-epoch1-lr5e-6

Image-Text-to-Text • 4B • Updated Jun 27, 2025

ilgee/gemma-3-4b-it-planner-math-epoch5-lr5e-6

Image-Text-to-Text • 4B • Updated Jun 26, 2025 • 1

ilgee/gemma-2-9b-it-planner-math-epoch1-lr5e-6

Text Generation • 9B • Updated Jun 26, 2025 • 1

ilgee/gemma-3-4b-it-planner-math-epoch5-lr1e-5

Image-Text-to-Text • 4B • Updated Jun 26, 2025 • 1

ilgee/gemma-3-4b-it-planner-math-epoch3-lr1e-5

Image-Text-to-Text • 4B • Updated Jun 26, 2025

Datasets 50

ilgee/deepseek-grm-hs2-binary

Viewer • Updated Aug 10, 2025 • 5.98k • 6

ilgee/nothink-hs2-naive-reasoning-multiclass

Viewer • Updated May 8, 2025 • 3.94k • 5

ilgee/generated-nothink-hs2-naive-reasoning-multiclass

Viewer • Updated May 8, 2025 • 3.94k • 9

ilgee/hs2-naive-reasoning-multiclass-max-no-sys

Viewer • Updated Apr 29, 2025 • 3.94k • 25

ilgee/hs2-naive-reasoning-multiclass-min-no-sys

Viewer • Updated Apr 29, 2025 • 3.94k • 12

ilgee/hs2-naive-reasoning-binary-min-no-sys

Viewer • Updated Apr 29, 2025 • 6.01k • 6

ilgee/hs2-naive-reasoning-binary-max-no-sys

Viewer • Updated Apr 29, 2025 • 6.01k • 15

ilgee/hs2-Llama-3.1-8B-Instruct-naive-reasoning-binary-max

Viewer • Updated Apr 28, 2025 • 5.47k • 7

ilgee/generated-nothink-hs2-naive-reasoning-binary

Viewer • Updated Apr 26, 2025 • 6.01k • 6

ilgee/nothink-hs2-naive-reasoning-binary

Viewer • Updated Apr 22, 2025 • 6.01k • 7
