Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
22
33
32
Rui Yang
PRO
Ray2333
Follow
exoplanet's profile picture
Diluner's profile picture
varuy322's profile picture
15 followers
·
9 following
https://yangrui2015.github.io
YangRui2015
AI & ML interests
Deep Reinforcement Learning
Recent Activity
upvoted
a
paper
2 days ago
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
View all activity
Organizations
Ray2333
's datasets
1
Sort: Recently updated
Ray2333/RiC_harmless_helpful
Viewer
•
Updated
Jul 12, 2024
•
291k
•
12