Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs Paper • 2507.07186 • Published Jul 9, 2025 • 2
DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation Paper • 2503.01622 • Published Mar 3, 2025
Trust Me, I'm Wrong: High-Certainty Hallucinations in LLMs Paper • 2502.12964 • Published Feb 18, 2025 • 3
Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models Paper • 2501.06751 • Published Jan 12, 2025 • 32
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations Paper • 2410.02707 • Published Oct 3, 2024 • 47
Editing Implicit Assumptions in Text-to-Image Diffusion Models Paper • 2303.08084 • Published Mar 14, 2023 • 2
Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias Paper • 2308.00225 • Published Aug 1, 2023
SinFusion: Training Diffusion Models on a Single Image or Video Paper • 2211.11743 • Published Nov 21, 2022
Generating Benchmarks for Factuality Evaluation of Language Models Paper • 2307.06908 • Published Jul 13, 2023 • 8