floyed shen
floyed
AI & ML interests
None yet
Recent Activity
commented on a paper 4 days ago
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training
upvoted a paper 8 days ago
dLLM: Simple Diffusion Language Modeling
upvoted a paper 10 days ago
Endless Terminals: Scaling RL Environments for Terminal Agents