VLR-CVC/DocVQA-2026
Viewer
• Updated
• 73 • 6.82k • 59
Multimodal AI, Document Understanding, Reading Systems.
ComicsPAP: understanding comic strips by picking the correct panel
One missing piece in Vision and Language: A Survey on Comics Understanding