ChatBench Datasets and Simulators (same prompt + fine-tuning set-up) from the ChatBench paper.
AI & ML interests
None defined yet.
Recent Activity
Papers
DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems
Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions
spaces
26
pinned
Running
14
MageBench Leaderboard
🥇
This is a leaderboard for magebench
Running
on
Zero
2
TRELLIS.2
🏢
High-fidelity 3D Generation from images
Running
on
Zero
5
VITRA
🦾
Generate 3D hand motion predictions from images
Running
47
Phi 4 Mini
🌍
Demos for Phi-4-mini-instruct model
Running
37
ThoughtsOrganizer
🔥
Transform your spoken thoughts into organized insights
Running
on
Zero
Featured
4.78k
TRELLIS
🏢
Scalable and Versatile 3D Generation from images
models
430
microsoft/TRELLIS.2-4B
Image-to-3D
•
Updated
•
1
microsoft/MediPhi-MedCode
Text Generation
•
4B
•
Updated
•
112
•
3
microsoft/MediPhi-Guidelines
Text Generation
•
4B
•
Updated
•
91
•
4
microsoft/MediPhi-MedWiki
Text Generation
•
4B
•
Updated
•
111
•
2
microsoft/MediPhi-Clinical
Text Generation
•
4B
•
Updated
•
8.26k
•
12
microsoft/MediPhi-PubMed
Text Generation
•
4B
•
Updated
•
678
•
8
microsoft/MediPhi
Text Generation
•
4B
•
Updated
•
1.45k
•
12
microsoft/MediPhi-Instruct
Text Generation
•
4B
•
Updated
•
2.74k
•
55
microsoft/MAI-DS-R1-FP8
Text Generation
•
671B
•
Updated
•
1.02k
•
24
microsoft/MAI-DS-R1
Text Generation
•
671B
•
Updated
•
273
•
291
datasets
79
microsoft/SYNUR
Preview
•
Updated
•
70
•
5
microsoft/mediflow
Viewer
•
Updated
•
1.84M
•
340
•
50
microsoft/SIMORD
Updated
•
105
•
5
microsoft/WebTailBench
Preview
•
Updated
•
297
•
14
microsoft/SWE-Sharp-Bench
Viewer
•
Updated
•
150
•
282
microsoft/sigmacollab
Updated
•
72
•
1
microsoft/PatientSafetyBench
Viewer
•
Updated
•
466
•
130
•
2
microsoft/claimify-dataset
Viewer
•
Updated
•
6.49k
•
60
•
5
microsoft/LiveDRBench
Viewer
•
Updated
•
110
•
141
•
6
microsoft/CoSAlign-Train
Viewer
•
Updated
•
125k
•
105
•
2