Liangyu Wang
ly4096
ยท
AI & ML interests
Efficient reinforcement learning (RL) for LLMs reasoning
Distributed training and inference of LLMs
Efficient algorithm and infrastructure design for LLMs
Recent Activity
submitted
a paper
1 day ago
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers
upvoted
a
paper
3 days ago
FlashDP: Private Training Large Language Models with Efficient DP-SGD