Mattia Soldan

PhD Candidate - Electrical and Computer Engineering

King Abdullah University of Science and Technology (KAUST)

Biography

Mattia Soldan is a Ph.D. candidate at King Abdullah University of Science and Technology (KAUST). Under the supervision of Bernard Ghanem, Mattia is part of the Image and Video Understanding Lab (IVUL). Mattia received his MSc degree in Telecommunication Engineering and his BSc degree in Information Engineering from the University of Padova. His research interests include Computer Vision and Natural Language Processing. Mattia aims to leverage Deep Learning techniques to solve relevant multidisciplinary problems such as Natural Language Video Grounding. See the list of publications for a glimpse of his work.

Interests

Artificial Intelligence
Computer Vision
Natural Language Processing
Information Retrieval

Education

PhD in Electrical and Computer Engineering
King Abdullah University of Science and Technology, Thuwal (Saudi Arabia)
MSc in Telecommunication Engineering, 2017
University of Padova, Padova (Italy)
BSc in Information Engineering, 2015
University of Padova, Padova (Italy)

Featured Publications

Compressed-Language Models for Understanding Compressed File Formats: a JPEG Exploration

This study investigates whether Compressed-Language Models (CLMs), i.e. language models operating on raw byte streams from Compressed …

Juan C. Pérez, Alejandro Pardo, Mattia Soldan, Hani Itani, Juan Leon-Alcazar, Bernard Ghanem

Towards Automated Movie Trailer Generation

Movie trailers are an essential tool for promoting films and attracting audiences. However, the process of creating trailers can be …

Dawit Mureja Argaw, Mattia Soldan, Alejandro Pardo, Chen Zhao, Fabian Caba Heilbron, Joon Son Chung, Bernard Ghanem

Boundary-denoising for video activity localization

Video activity localization aims at understanding the semantic content in long untrimmed videos and retrieving actions of interest. The …

Mengmeng Xu, Mattia Soldan, Jialin Gao, Shuming Liu, Juan-Manuel Pérez-Rúa, Bernard Ghanem

Localizing Moments in Long Video via Multimodal Guidance

The recent introduction of the large-scale, long-form MAD and Ego4D datasets has enabled researchers to investigate the performance of …

Wayner Barrios, Mattia Soldan, Alberto Mario Ceballos-Arroyo, Fabian Caba Heilbron, Bernard Ghanem

Egocentric Video-Language Pretraining

Video-Language Pretraining (VLP), aiming to learn transferable representation to advance a wide range of video-text downstream tasks, …

Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions

The recent and increasing interest in video-language research has driven the development of large-scale datasets that enable …

Mattia Soldan, Alejandro Pardo, Juan Leon-Alcazar, Fabian Caba Heilbron, Chen Zhao, Silvio Giancola, Bernard Ghanem

VLG-Net: Video-Language Graph Matching Network for Video Grounding

Grounding language queries in videos aims at identifying the time interval (or moment) semantically relevant to a language query. The …

Mattia Soldan, Mengmeng Xu, Sisi Qu, Jesper Tegner, Bernard Ghanem

Finding Moments in Video Collections Using Natural Language

In this paper, we introduce the task of retrieving relevant video moments from a large corpus of untrimmed, unsegmented videos given a …

Victor Escorcia, Mattia Soldan, Josef Sivic, Bernard Ghanem, Bryan Russell

Seq2Seq RNN based Gait Anomaly Detection from Smartphone Acquired Multimodal Motion Data

Smartphones and wearable devices are fast growing technologies that, in conjunction with advances in wireless sensor hardware, are …

Riccardo Bonetto, Mattia Soldan, Alberto Lanaro, Simone Milani, Michele Rossi

Projects

OpenTAD: An Open-Source Toolbox for Temporal Action Detection

OpenTAD is an open-source temporal action detection (TAD) toolbox, built on PyTorch, that aims to foster reproducible open research.

AI-Sports: Taking E-Sports To The Next Level

1st place at the NEOM AI Challenge

Fountain Codes Based Distributed Storage Algorithms for Wireless Sensor Networks

Investigation of encoding strategies for Wireless Sensor Network data to prolong information persistence in the context of battery-powered devices.

Large eddy simulation with flamelet progress variable approach combined with artificial neural network acceleration

Cross-department collaboration applying Deep Learning to flame simulations.

Lorenzo Angelilli, Pietro Paolo Ciottoli, Riccardo Malpica Galassi, Francisco E. Hernandez Perez, Mattia Soldan, Zhen Lu, Mauro Valorani, Hong G. Im

Professional Experience

Research Intern

Adobe Research

May 2023 – Sep 2023 San Francisco, United States

Development of efficient vision and language models for video processing.

Research Intern

Samsung AI

Jul 2022 – Dec 2022 Cambridge, United Kingdom

Development of cutting-edge deep learning models for users' future interactions.

Research Intern

King Abdullah University of Science and Technology

Aug 2018 – Apr 2019 Thuwal, Saudi Arabia

Development of novel state-of-the-art Deep Learning architectures to address challenging Computer Vision problems.

Telecommunication Engineer

Telebit srl

Feb 2018 – Jul 2018 Treviso (Italy)

Drafting of public tender proposals: economic evaluation of prospective projects by analysing their technical aspects.
Supported operational sectors of mobile and fixed networks.

Contact

mattia.soldan@kaust.edu.sa
+966 55 065 9396 / +39 333 58 06 646
4700 KAUST, Al Khawarizmi Building (Bldg 1 - Floor 2 - Seaside), Office #2106-WS08, Thuwal, Saudi Arabia