WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models Paper • 2510.22276 • Published Oct 25, 2025
Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens Paper • 2509.14882 • Published Sep 18, 2025
EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements Paper • 2506.08762 • Published Jun 10, 2025
llm-jp-modernbert: A ModernBERT Model Trained on a Large-Scale Japanese Corpus with Long Context Length Paper • 2504.15544 • Published Apr 22, 2025