QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published 12 days ago • 100
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30 • 119
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion Paper • 2503.04222 • Published Mar 6 • 15
BlockPruner: Fine-grained Pruning for Large Language Models Paper • 2406.10594 • Published Jun 15, 2024 • 1
FuseLLM Collection ICLR'2024: Knowledge Fusion of Large Language Models • 2 items • Updated Aug 16, 2024 • 3
Article FuseChat-3.0: Preference Optimization for Implicit Model Fusion Dec 18, 2024 • 5
FuseChat 3.0 Collection Preference Optimization for Implicit Model Fusion • 14 items • Updated Mar 7 • 14
Weighted-Reward Preference Optimization for Implicit Model Fusion Paper • 2412.03187 • Published Dec 4, 2024 • 12