Jingming Zhuo

JingmingZ

AI & ML interests

Large Language Models

Recent Activity

updated a dataset about 1 hour ago

rl-rag/drtulu_v2_nemotron_science_v1_mcq_0205

Updated about 1 hour ago

published a dataset about 1 hour ago

rl-rag/drtulu_v2_nemotron_science_v1_mcq_0205

Updated about 1 hour ago

updated a dataset about 6 hours ago

rl-rag/drtulu_v2_arena_expert

Updated about 3 hours ago

published a dataset about 6 hours ago

rl-rag/drtulu_v2_arena_expert

Updated about 3 hours ago

upvoted a paper 2 days ago

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Paper • 2602.02361 • Published 6 days ago • 58

updated a dataset 3 days ago

rl-rag/drtulu_v2_personalfinance

Viewer • Updated 3 days ago • 18.8k • 16

published a dataset 3 days ago

rl-rag/drtulu_v2_personalfinance

Viewer • Updated 3 days ago • 18.8k • 16

updated a dataset 3 days ago

rl-rag/drtulu_v2_nemotron_web_search_mcqa

Viewer • Updated 3 days ago • 1.95k • 15

published a dataset 4 days ago

rl-rag/drtulu_v2_nemotron_web_search_mcqa

Viewer • Updated 3 days ago • 1.95k • 15

authored a paper 3 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 61

upvoted a paper 3 months ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 61

upvoted a collection 3 months ago

DR Tulu

Collection

Models and data associated with DR Tulu, http://allenai-web/papers/drtulu • 5 items • Updated Nov 25, 2025 • 33

liked a dataset 3 months ago

rl-research/dr-tulu-sft-data

Viewer • Updated Nov 25, 2025 • 13.1k • 250 • 26

upvoted a paper 4 months ago

Alignment through Meta-Weighted Online Sampling: Bridging the Gap between Data Generation and Preference Optimization

Paper • 2509.23371 • Published Sep 27, 2025 • 6

updated a dataset 4 months ago

rl-rag/hle_rlvr_no_prompt

Viewer • Updated Sep 28, 2025 • 500 • 5

published a dataset 4 months ago

rl-rag/hle_rlvr_no_prompt

Viewer • Updated Sep 28, 2025 • 500 • 5

upvoted a paper 5 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8, 2025 • 80

updated a dataset 5 months ago

rl-rag/verified_miro_trajectories

Viewer • Updated Aug 31, 2025 • 9.88k • 5

published a dataset 5 months ago

rl-rag/verified_miro_trajectories

Viewer • Updated Aug 31, 2025 • 9.88k • 5

updated a dataset 5 months ago

rl-rag/bc_synthetic_v_2

Viewer • Updated Aug 30, 2025 • 3.99k • 3
