Mattia Soldan

Mattia Soldan

PhD Student - Electrical and Computer Engineering

King Abdullah University of Science and Technology (KAUST)


Mattia Soldan is a Ph.D. student at King Abdullah University of Science and Technology (KAUST). Under the supervision of Bernard Ghanem, Mattia is part of the Image and Video Understanding Lab (IVUL). Mattia received his MSc degree in Telecommunication Engineering and his BSc degree in Information Engineering from the University of Padova. His research interests include Computer Vision and Natural Language Processing. Mattia aims at leveraging Deep Learning techniques to solve relevant multidisciplinary problems as Natural Language Video Grounding. See the list of publications for a glimpse at his work.

  • Artificial Intelligence
  • Computer Vision
  • Natural Language Processing
  • Information Retrieval
  • PhD in Electrical and Computer Engineering

    King Abdullah University of Science and Technology, Thuwal (Saudi Arabia)

  • MSc in Telecommunication Engineering, 2017

    University of Padova, Padova (Italy)

  • BSc in Information Engineering, 2015

    University of Padova, Padova (Italy)



[2022-09-14] EgoVLP accepted at NeurIPS. (preprint, code).
[2022-08-17] Completed another Ph.D. milestone by succesfully defending my Ph.D. proposal and earning the title of Ph.D. Candidate.
[2022-07-29] I gave a talk about my research at the Machine Learning and Computer Vision Group lead by Professor Dima Damen at the University of Bristol.
[2022-07-11] Started my internship at Samsung AI - Cambridge (website).
[2022-06-30] EgoVLP code release (GitHub).
[2022-06-21] EgoVLP won 1st place in Multi-Instance Retrieval @ EPIC-Kitchens Challenge 2022, hosted by CVPR 2022.
[2022-06-20] EgoVLP won 1st place in OSCC, 2nd place in NLQ & 3rd place in PNR @ Ego4D Challenge 2022, hosted by CVPR 2022.
[2022-06-19] Attended my first (in-person) CVPR paper where I presented MAD.
[2022-06-03] Published a new preprint: “EgoVLP: Egocentric Video-Language Pretraining” (preprint).
[2022-03-15] I gave a talk about my research at the Rising Stars in AI Symposium @ KAUST. See my talk here.
[2022-03-02] “MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions” accepted in CVPR22 (preprint).
[2022-02-23] Updated version of “Finding Moments in Video Collections Using Natural Language” is on ArXiv (preprint).




Expand [2021-12-01] MAD is on ArXiv.
[2021-10-17] Awarded the Best Paper Award for VLG-Net work in ICCV 2021 at the CVEU workshopt.
[2021-08-17] VLG-Net acccepted to the ICCV 2021 Workshop on AI for Creative Video Editing and Understanding
[2021-05-20] I received the Outstanding Reviewer Award from CVPR.
[2021-01-04] Collaboration paper accepted in the proceeding of the *American Institute of Aeronautics and Astronautics AIAA2021.


Expand [2020-11-19] VLG-Net is on ArXiv.
[2020-10-22] My team won the first place at the Entertainment track of the Neom AI Challenge in Riyad. [Project page]
[2020-05-20] Succesfully completed my PhD qualifying exams.


Expand [2019-08-04] Seq2Seq RNN is on Arxiv.
[2019-08-04] Started Ph.D. at KAUST.
[2019-07-30] STAL is on ArXiv.
[2019-07-21] Attendend DeepLearn, International Summer School on Deep Learning in Warsaw (Poland).
[2019-04-04] Concluded my Research Internship.


Expand [2018-08-26] Started my research internship at KAUST.
[2018-07-31] Concluded my job at Telebit.
[2018-02-04] Started job at Telebit as Telecommunication Engineer.
[2018-01-31] Accepted as Research Intern with the VSRP program at KAUST in the IVUL group.


Expand [2017-12-02] I received my Master degree in Telecommunication Engineering from the University of Padova (Italy).
[2017-04-16] Partecipated in a Robotic Hackathon at Technical University of Munich (Germany).


Expand [2015-02-23] I received my Bachelor degree in Information Engineering from the University of Padova (Italy).

Professional Experience

Research Intern
Jul 2022 – Present Cambridge, United Kingdom
  • Vision and Language research for Users Future Interactions.
Research Intern
Aug 2018 – Apr 2019 Thuwal, Saudi Arabia
  • Develop of novel state-of-the-art Deep Learning architectures to address challenging Computer Vision problems.
Telecommunication Engineer
Feb 2018 – Jul 2018 Treviso (Italy)
  • Public tender proposals redaction: economic evaluation of prospect projects by analisys project’s technical aspects.
  • Supported operational sectors of mobile and fixed networks.