What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models Paper • 2601.06165 • Published 9 days ago • 15
What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models Paper • 2601.06165 • Published 9 days ago • 15
What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models Paper • 2601.06165 • Published 9 days ago • 15
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published Oct 28, 2025 • 18
Distribution-Level Feature Distancing for Machine Unlearning: Towards a Better Trade-off Between Model Utility and Forgetting Paper • 2409.14747 • Published Sep 23, 2024
COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs Paper • 2601.01836 • Published 11 days ago • 7
COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs Paper • 2601.01836 • Published 11 days ago • 7
AIM-Intelligence/COMPASS_Qwen2.5-7B-Instruct_LoRA Text Generation • 8B • Updated 10 days ago • 19 • 2
AIM-Intelligence/COMPASS-Policy-Alignment-Testbed-Dataset Viewer • Updated 10 days ago • 5.92k • 161 • 10
AIM-Intelligence/COMPASS_Qwen2.5-7B-Instruct_LoRA Text Generation • 8B • Updated 10 days ago • 19 • 2
COMPASS Collection A Framework for Evaluating Organization-Specific Policy Alignment in LLMs • 5 items • Updated 10 days ago • 4