Diffutron: A Masked Diffusion Language Model for Turkish Language Paper • 2603.20466 • Published 11 days ago • 3
DiffutronLM Collection A masked diffusion language model for Turkish language • 5 items • Updated 7 days ago
Diffutron: A Masked Diffusion Language Model for Turkish Language Paper • 2603.20466 • Published 11 days ago • 3
Diffutron: A Masked Diffusion Language Model for Turkish Language Paper • 2603.20466 • Published 11 days ago • 3
DiffutronLM Collection A masked diffusion language model for Turkish language • 5 items • Updated 7 days ago
Superpositional Gradient Descent: Harnessing Quantum Principles for Model Training Paper • 2511.01918 • Published Nov 1, 2025 • 12
view post Post 2721 I've just distilled Llama-3.2-3B-Instruct with deepseek-ai/DeepSeek-R1 on ServiceNow-AI/R1-Distill-SFT dataset. 🐋🦙Here is the model: suayptalha/DeepSeek-R1-Distill-Llama-3B See translation 👍 3 3 + Reply
view post Post 3875 My last Falcon3-7B merge model, suayptalha/Falcon3-Jessi-v0.4-7B-Slerp, is currently ranked #1 on the open-llm-leaderboard/open_llm_leaderboard among all models with up to 14B parameters.My Qwen2.5-7B merge model, suayptalha/HomerCreativeAnvita-Mix-Qw7B, is also ranked #7, placing two of my models in the top 10! See translation 3 replies · 👍 3 3 + Reply