FINAL-Bench/Metacognitive
Viewer
•
Updated
•
100
•
17
World's First Functional Metacognition Benchmark. "Not how much AI knows — but whether it knows what it doesn't know, and can fix it."