UWNSL/Qwen2.5-7B-deepscaler_4k_step_32
UWNSL/Qwen2.5-7B-deepscaler_4k_step_64
8B • Updated
• 4
UWNSL/Qwen2.5-7B-deepscaler_4k_step_96
UWNSL/Qwen2.5-7B-deepscaler_4k_step_128
UWNSL/Qwen2.5-7B-deepscaler_4k_step_160
8B • Updated
• 1
UWNSL/Qwen2.5-7B-deepscaler_4k_step_192
8B • Updated
• 3
UWNSL/Qwen2.5-7B-deepscaler_4k_step_224
8B • Updated
• 1
UWNSL/Qwen2.5-7B-deepscaler_4k_step_256
UWNSL/DeepSeek-R1-Distill-Llama-8B-SafeChain
Text Generation
• 8B • Updated
• 6
UWNSL/DeepSeek-R1-Distill-Qwen-7B-SafeChain
Text Generation
• 8B • Updated
• 6
UWNSL/Llama3.1-3B-Instruct_Mix-Long
Text Generation
• 3B • Updated
• 95
UWNSL/Qwen2.5-3B-Instruct_Mix-Long
Text Generation
• 3B • Updated
• 52
UWNSL/Llama3.1-3B-Instruct_Mix-Large
3B • Updated
• 37
UWNSL/Qwen2.5-3B-Instruct_Mix-Large
3B • Updated
• 53
UWNSL/Llama3.3_70B_Instruct_Long_CoT_lora
UWNSL/Llama3.3_70B_Instruct_Short_CoT_lora
• 1
• 1
UWNSL/Qwen2.5-32B-Instruct_Short_CoT_lora
UWNSL/Qwen2.5-32B-Instruct_Long_CoT_lora
UWNSL/Qwen2.5-0.5B-Instruct_Long_CoT
Text Generation
• 0.5B • Updated
• 2
UWNSL/Qwen2.5-0.5B-Instruct_Short_CoT
Text Generation
• 0.5B • Updated
• 1
UWNSL/Qwen2.5-1.5B-Instruct_Long_CoT
Text Generation
• 2B • Updated
• 37
UWNSL/Qwen2.5-1.5B-Instruct_Short_CoT
Text Generation
• 2B • Updated
• 34
UWNSL/Llama-3.2-1B-Instruct_Long_CoT
UWNSL/Llama-3.2-1B-Instruct_Short_CoT
UWNSL/Llama-3.2-3B-Instruct_Long_CoT
Text Generation
• 3B • Updated
• 40
UWNSL/Llama-3.2-3B-Instruct_Short_CoT
Text Generation
• 3B • Updated
• 47
UWNSL/Llama-3.1-8B-Instruct_Long_CoT
Text Generation
• 8B • Updated
UWNSL/Llama-3.1-8B-Instruct_Short_CoT
Text Generation
• 8B • Updated
• 3
UWNSL/Qwen2.5-14B-Instruct_Short_CoT_lora
UWNSL/Qwen2.5-14B-Instruct_Long_CoT_lora