SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens Paper • 2508.05305 • Published Aug 7 • 46
plina2polina/Llama-3.1-Nemotron-Nano-8B-v1-Nemotron-Post-Training-Tokenized Viewer • Updated Apr 26 • 781k • 3
plina2polina/Llama-3.1-Nemotron-Nano-8B-v1-Nemotron-Post-Training-Tokenized Viewer • Updated Apr 26 • 781k • 3
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation Paper • 2503.13358 • Published Mar 17 • 95
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Paper • 2503.03601 • Published Mar 5 • 232
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published Feb 20 • 174
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published Feb 20 • 174