OpenTAD: An Open-Source Toolbox for Temporal Action Detection

  • Support SoTA TAD methods with modular design. We decompose the TAD pipeline into different components, and implement them in a modular way. This design makes it easy to implement new methods and reproduce existing methods.
  • Support multiple TAD datasets. We support 9 TAD datasets, including ActivityNet-1.3, THUMOS-14, HACS, Ego4D-MQ, EPIC-Kitchens-100, FineAction, Multi-THUMOS, Charades, and EPIC-Sounds Detection datasets.
  • Support feature-based training and end-to-end training. The feature-based training can easily be extended to end-to-end training with raw video input, and the video backbone can be easily replaced.
  • Release various pre-extracted features. We release the feature extraction code, as well as many pre-extracted features on each dataset.
Mattia Soldan
Mattia Soldan
PhD Candidate - Electrical and Computer Engineering

My research interests are settled at the intersection between Computer Vision and Natural Language Processing.