Model Card for LinalgZero-GSPO
Information about this model and the code used to train it are available on GitHub.
This model is a fine-tuned version of atomwalk12/LinalgZero-SFT, trained on the atomwalk12/linalgzero-grpo dataset with the GSPO (Group Sequence Policy Optimization) algorithm. Training was done with ART.
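For readers unfamiliar with GSPO, the following is a minimal sketch of its core idea as generally described: a length-normalized, sequence-level importance ratio combined with group-normalized advantages and clipping. It is not the training code for this model and not ART's implementation. The function names, the `clip_eps` value, and the toy inputs are all illustrative assumptions.

```python
import math
from statistics import mean, pstdev


def group_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of responses to the same prompt."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


def sequence_ratio(new_logps, old_logps):
    """Length-normalized sequence-level importance ratio.

    Equals exp(mean over tokens of (new_logp - old_logp)), i.e. the
    sequence likelihood ratio raised to the power 1 / sequence_length.
    """
    assert len(new_logps) == len(old_logps) and new_logps
    delta = sum(n - o for n, o in zip(new_logps, old_logps)) / len(new_logps)
    return math.exp(delta)


def gspo_objective(new_logps_group, old_logps_group, rewards, clip_eps=0.2):
    """Clipped surrogate objective averaged over one group (to be maximized).

    clip_eps is a hypothetical value chosen for readability only.
    """
    advantages = group_advantages(rewards)
    terms = []
    for new_lp, old_lp, adv in zip(new_logps_group, old_logps_group, advantages):
        s = sequence_ratio(new_lp, old_lp)
        s_clipped = min(max(s, 1.0 - clip_eps), 1.0 + clip_eps)
        # Clipping is applied once per sequence rather than once per token.
        terms.append(min(s * adv, s_clipped * adv))
    return mean(terms)


if __name__ == "__main__":
    # Toy group of two responses with made-up per-token log-probabilities.
    old = [[-2.0, -2.0], [-1.5, -1.5]]
    new = [[-1.0, -1.0], [-1.5, -1.5]]
    print(gspo_objective(new, old, rewards=[1.0, 0.0]))
```

In an actual training loop the log-probabilities would come from the current and the sampling policy, and the objective would be computed on tensors so that gradients flow through the new policy's log-probabilities.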
Model tree for atomwalk12/LinAlgZero-GRPO
Base model
atomwalk12/LinalgZero-SFT