XIA TIAN
Summer66
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
Rethinking the Trust Region in LLM Reinforcement Learning
upvoted
a
paper
9 months ago
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
upvoted
a
paper
10 months ago
Understanding R1-Zero-Like Training: A Critical Perspective
Organizations
None yet