Sumuk Shashidhar
PRO
sumuks
35 followers · 55 following
https://sumuk.org
sumukx
sumukshashidhar
sumuks
AI & ML interests
Evaluations, Reasoning, Long Term Planning
Recent Activity
updated a dataset about 11 hours ago: sumuks/helpsteer3
updated a dataset 5 days ago: sumuks/openai-coval-dpo
published a dataset 5 days ago: sumuks/openai-coval-dpo
sumuks's datasets (29)
Sort: Recently updated
sumuks/helpsteer3 • Viewer • Updated about 11 hours ago • 37.9k • 251
sumuks/openai-coval-dpo • Viewer • Updated 5 days ago • 5.58k • 76
sumuks/preference-atlas-rewards • Viewer • Updated 22 days ago • 5.03k • 33
sumuks/preference-atlas • Viewer • Updated 22 days ago • 329k • 103 • 1
sumuks/reward-bench-2 • Viewer • Updated 22 days ago • 1.87k • 48
sumuks/helpsteer3-easy • Viewer • Updated 29 days ago • 7.93k • 51
sumuks/helpsteer-pairwise-grading • Viewer • Updated Feb 12 • 51.8k • 22
sumuks/rupo-eval-logs-helpsteer3-1 • Viewer • Updated Feb 10 • 1.43k • 48
sumuks/helpsteer3-rupo • Viewer • Updated Feb 10 • 38.2k • 56
sumuks/persuasiveness_detection • Viewer • Updated Feb 6 • 3.94k • 10
sumuks/rupo-eval-humanlike-dpo-dataset-lbhr-2 • Preview • Updated Feb 6 • 20
sumuks/rupo-eval-humanlike-dpo-dataset-lrhb-2 • Preview • Updated Feb 6 • 19
sumuks/rupo-eval-humanlike-dpo-dataset-lrhb-1 • Viewer • Updated Feb 5 • 1k • 13
sumuks/rupo-eval-humanlike-dpo-dataset-lbhr-1 • Viewer • Updated Feb 5 • 1k • 13
sumuks/rupo-eval-humanlike-dpo-dataset-lrhb • Viewer • Updated Feb 5 • 3 • 36
sumuks/rupo-eval-humanlike-dpo-dataset-lbhr • Viewer • Updated Feb 5 • 142 • 212
sumuks/rupo-eval-logs-litbench-1 • Preview • Updated Feb 5 • 162
sumuks/rupo-eval-logs-lmarena-1 • Viewer • Updated Feb 5 • 775 • 58
sumuks/rupo-eval-logs-lmarena • Viewer • Updated Feb 5 • 1.03k • 44
sumuks/rupo-eval-logs-litbench • Viewer • Updated Feb 5 • 75 • 15
sumuks/rupo-eval-logs • Viewer • Updated Feb 5 • 100 • 11
sumuks/rupo-eval-logs-test • Viewer • Updated Feb 5 • 1 • 10
sumuks/combined-synthetic-task • Viewer • Updated Feb 5 • 10.6k • 9
sumuks/persuasion • Viewer • Updated Feb 1 • 14.1k • 8
sumuks/humanlike-dpo-dataset • Viewer • Updated Jan 29 • 10.9k • 13
sumuks/tempora-summaries • Viewer • Updated Jan 24 • 5.29k • 16
sumuks/lmarena • Viewer • Updated Jan 15 • 81.5k • 31
sumuks/arena-expert • Viewer • Updated Dec 20, 2025 • 4.45k • 15
sumuks/LitBench-Verified • Viewer • Updated Dec 13, 2025 • 20k • 25