Anchor-Free Action Proposal Network with Uncertainty Estimation
dc.contributor | Aalto-yliopisto | fi |
dc.contributor | Aalto University | en |
dc.contributor.author | Pehlivan, Selen | en_US |
dc.contributor.author | Laaksonen, Jorma | en_US |
dc.contributor.department | Department of Computer Science | en |
dc.contributor.groupauthor | Lecturer Laaksonen Jorma group | en |
dc.contributor.groupauthor | Computer Science Lecturers | en |
dc.contributor.groupauthor | Computer Science - Visual Computing (VisualComputing) - Research area | en |
dc.contributor.groupauthor | Computer Science - Human-Computer Interaction and Design (HCID) - Research area | en |
dc.contributor.groupauthor | Computer Science - Artificial Intelligence and Machine Learning (AIML) - Research area | en |
dc.date.accessioned | 2024-01-04T08:47:35Z | |
dc.date.available | 2024-01-04T08:47:35Z | |
dc.date.issued | 2023 | en_US |
dc.description | Funding Information: This work has been funded by the Academy of Finland project numbers 329268 and 345791. The computational resources have been provided by the Aalto University’s Aalto Science-IT project and the CSC–IT Center for Science. Publisher Copyright: © 2023 IEEE. | |
dc.description.abstract | Proposal generation is a fundamental yet challenging task for two-stage temporal action detection pipelines. The task aims at predicting starting and ending boundaries of segments in realistic video sequences and action recognition methods cannot be directly applied to such videos due to their untrimmed nature. Most state-of-the-art models rely on temporal convolutional neural networks with pre-defined anchor segments. By eliminating anchors, we propose a lighter end-to-end trainable Anchor-Free Multiscale Transformer-based Generator (AMTG) model using local clues via video snippets. To improve effectiveness for temporal evaluation, we apply multiscale Transformer encoders to sequences with a bi-directional mask extension that simultaneously predicts boundary distances with uncertainties and various snippet-based local scores. Later, our model integrates local predictions to generate proposal candidates using the proposed scoring function. Experiments on the THUMOS14 and ActivityNet-1.3 benchmarks demonstrate the effectiveness of AMTG for the temporal proposal generation task. | en |
dc.description.version | Peer reviewed | en |
dc.format.extent | 6 | |
dc.format.mimetype | application/pdf | en_US |
dc.identifier.citation | Pehlivan, S & Laaksonen, J 2023, Anchor-Free Action Proposal Network with Uncertainty Estimation. in Proceedings - 2023 IEEE International Conference on Multimedia and Expo, ICME 2023. Proceedings - IEEE International Conference on Multimedia and Expo, vol. 2023-July, IEEE, pp. 1853-1858, IEEE International Conference on Multimedia and Expo, Brisbane, Australia, 10/07/2023. https://doi.org/10.1109/ICME55011.2023.00318 | en |
dc.identifier.doi | 10.1109/ICME55011.2023.00318 | en_US |
dc.identifier.isbn | 978-1-6654-6891-6 | |
dc.identifier.issn | 1945-7871 | |
dc.identifier.issn | 1945-788X | |
dc.identifier.other | PURE UUID: 2fe8a99f-28e0-4013-9f92-d55e38491ff8 | en_US |
dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/2fe8a99f-28e0-4013-9f92-d55e38491ff8 | en_US |
dc.identifier.other | PURE LINK: http://www.scopus.com/inward/record.url?scp=85171171618&partnerID=8YFLogxK | |
dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/130890725/Anchor-Free_Action_Proposal_Network_with_Uncertainty_Estimation.pdf | en_US |
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/125381 | |
dc.identifier.urn | URN:NBN:fi:aalto-202401041070 | |
dc.language.iso | en | en |
dc.relation.ispartof | IEEE International Conference on Multimedia and Expo | en |
dc.relation.ispartofseries | Proceedings - 2023 IEEE International Conference on Multimedia and Expo, ICME 2023 | en |
dc.relation.ispartofseries | pp. 1853-1858 | en |
dc.relation.ispartofseries | Proceedings - IEEE International Conference on Multimedia and Expo ; Volume 2023-July | en |
dc.rights | openAccess | en |
dc.subject.keyword | anchor-free | en_US |
dc.subject.keyword | multiscale transformer network | en_US |
dc.subject.keyword | temporal action proposals | en_US |
dc.subject.keyword | two-stage detectors | en_US |
dc.title | Anchor-Free Action Proposal Network with Uncertainty Estimation | en |
dc.type | A4 Artikkeli konferenssijulkaisussa | fi |
dc.type.version | acceptedVersion |