Vision, Language and Reading

non-profit

Multimodal AI, Document Understanding, Reading Systems.

mserrao updated a dataset about 8 hours ago

Llabres updated a dataset 3 days ago

Tomiock updated a dataset 19 days ago

ComicsPAP: understanding comic strips by picking the correct panel

One missing piece in Vision and Language: A Survey on Comics Understanding

VLR-CVC 's datasets 2