R1-V
Towards the Aha Moment of Vision-Language Models

MMInstruction/Clevr_CoGenT_TrainA_R1 • Updated Feb 13 • 37.8k • 225 • 48
MMInstruction/SuperClevr_Val • Updated Feb 18 • 5k • 140 • 1
MMInstruction/Clevr_CoGenT_ValA • Updated Feb 3 • 5k • 298 • 1
MMInstruction/Clevr_CoGenT_ValB • Updated Feb 3 • 5k • 37 • 2