Collection of models and dataset related to MixtureVitae, open and fully reproducible pretraining dataset built from permissive sources
LAION eV
non-profit
AI & ML interests
open multi-modal foundation models and datasets for their creation; scaling laws, model evaluation; fully local, sovereign model deployment, personalized assistants and open local agentic systems
Recent Activity
View all activity
Organization Card
models and datasets related to openthoughts 4 experiments
-
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen3-1.7B_32k
2B • Updated • 7 -
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen2.5-1.5B_32k
Text Generation • 2B • Updated • 6 -
laion/openthoughts-3-QwQ-32b-annotated-16k_qwen2.5-1.5B_16k
Text Generation • 2B • Updated • 7 -
laion/openthoughts-4-code-qwen3-32b-annotated-7k_qwen3-1.7B_10k
Text Generation • 2B • Updated • 7
Collection of models and dataset related to MixtureVitae, open and fully reproducible pretraining dataset built from permissive sources
models and datasets related to openthoughts 4 experiments
-
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen3-1.7B_32k
2B • Updated • 7 -
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen2.5-1.5B_32k
Text Generation • 2B • Updated • 6 -
laion/openthoughts-3-QwQ-32b-annotated-16k_qwen2.5-1.5B_16k
Text Generation • 2B • Updated • 7 -
laion/openthoughts-4-code-qwen3-32b-annotated-7k_qwen3-1.7B_10k
Text Generation • 2B • Updated • 7
models 493
laion/100k_epochs3__Qwen3-8B
Text Generation • 308k • Updated • 228
laion/rl__24GPU_shaped__stackexchange-overflow-sandboxes-skywork-response__exp_tas_optimal_comb__40-0
Updated
laion/rl_r2egym-full_terminus-structured
Text Generation • 8B • Updated • 382
laion/rl_mixed-struct-step37_terminus-structured
Text Generation • 8B • Updated • 372
laion/rl_r2egym-nl2bash-swesmith-pymethods2test_terminus-structured
Text Generation • 8B • Updated • 377
laion/rl__24GPU_shaped__exp_rpt_pymethods2test-large__GLM-4_7-swesmith-san-30
8B • Updated • 15
laion/rl__24GPU_shaped__exp_rpt_pymethods2test-large__exp_tas_optimal_comb-85
8B • Updated • 20
laion/100k_epochs4__Qwen3-8B
Text Generation • 308k • Updated • 231
laion/100k_baseline__Qwen3-8B
Text Generation • 308k • Updated • 249
laion/rl__24GPU_shaped__exp_rpt_pymethods2test-large__exp_tas_optimal_comb
Updated
datasets 191
laion/eurospeech-enhanced-dacvae
Viewer • Updated • 2.84M • 758
laion/gemini-2.5-pro-tts-voice-profiles
Updated • 31 • 1
laion/clustered-reference-voices
Viewer • Updated • 3k • 8
laion/reference-voices-enhanced
Viewer • Updated • 2.05k • 16
laion/ai-voices-deduplicated
Viewer • Updated • 2k • 11
laion/emolia-3k-speaker-clusters
Viewer • Updated • 63k • 61
laion/OpenThoughts-4-Unique-Combined
Viewer • Updated • 254k • 16 • 1
laion/OpenThoughts4-GLM4.7-20k-partial
Viewer • Updated • 152k • 4
laion/majestrino-1.00-16xk5-sae-features
Viewer • Updated • 12.5M • 58
laion/majestrino-data
Viewer • Updated • 8.22M • 7.54k