Abstract
This paper presents a novel deep architecture for weakly-supervised temporal action localization that not only generates segment-level action responses but also propagates segment-level responses to the neighborhood in a form of graph Laplacian regularization. Specifically, our approach consists of two sub-modules; a class activation module to estimate the action score map over time through the action classifiers, and a graph regularization module to refine the estimated action score map by solving a quadratic programming problem with the predicted segment-level semantic affinities. Since these two modules are integrated with fully differentiable layers, the proposed networks can be jointly trained in an end-to-end manner. Experimental results on Thumos14 and ActivityNet1.2 demonstrate that the proposed method provides outstanding performances in weakly-supervised temporal action localization.
| Original language | English |
|---|---|
| Title of host publication | 2019 IEEE International Conference on Image Processing, ICIP 2019 - Proceedings |
| Publisher | IEEE Computer Society |
| Pages | 3701-3705 |
| Number of pages | 5 |
| ISBN (Electronic) | 9781538662496 |
| DOIs | |
| State | Published - Sep 2019 |
| Event | 26th IEEE International Conference on Image Processing, ICIP 2019 - Taipei, Taiwan, Province of China Duration: 22 Sep 2019 → 25 Sep 2019 |
Publication series
| Name | Proceedings - International Conference on Image Processing, ICIP |
|---|---|
| Volume | 2019-September |
| ISSN (Print) | 1522-4880 |
Conference
| Conference | 26th IEEE International Conference on Image Processing, ICIP 2019 |
|---|---|
| Country/Territory | Taiwan, Province of China |
| City | Taipei |
| Period | 22/09/19 → 25/09/19 |
Bibliographical note
Publisher Copyright:© 2019 IEEE.
Keywords
- graph Laplacian regularization
- semantic affinity
- weakly-supervised temporal action localization