Publications

(2022). Egocentric Video-Language Pretraining. In NeurIPS.

(2021). MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions. In CVPR.

(2021). VLG-Net: Video-Language Graph Matching Network for Video Grounding. In ICCVW.

(2019). Temporal Localization of Moments in Video Collections with Natural Language. In arXiv.
