Cascaded Split-and-Aggregate Learning with Feature Recombination for Pedestrian Attribute Recognition
dc.contributor | Aalto-yliopisto | fi |
dc.contributor | Aalto University | en |
dc.contributor.author | Yang, Yang | en_US |
dc.contributor.author | Tan, Zichang | en_US |
dc.contributor.author | Tiwari, Prayag | en_US |
dc.contributor.author | Pandey, Hari Mohan | en_US |
dc.contributor.author | Wan, Jun | en_US |
dc.contributor.author | Lei, Zhen | en_US |
dc.contributor.author | Guo, Guodong | en_US |
dc.contributor.author | Li, Stan Z. | en_US |
dc.contributor.department | Department of Computer Science | en |
dc.date.accessioned | 2021-08-04T06:41:38Z | |
dc.date.available | 2021-08-04T06:41:38Z | |
dc.date.embargo | info:eu-repo/date/embargoEnd/2022-07-18 | en_US |
dc.date.issued | 2021-10 | en_US |
dc.description | | openaire: EC/H2020/732894/EU//INTERVENE | |
dc.description.abstract | Multi-label pedestrian attribute recognition in surveillance is inherently a challenging task due to poor imaging quality, large pose variations, and so on. In this paper, we improve its performance from the following two aspects: 1) We propose a cascaded Split-and-Aggregate Learning (SAL) to capture both the individuality and commonality for all attributes, with one at feature map level and the other at the feature vector level. For the former, we split the features of each attribute by using a designed attribute-specific attention module (ASAM). For the later, the split features for each attribute are learned by using constrained losses. In both modules, the split features are aggregated by using several convolutional or fully connected layers. 2) We propose a Feature Recombination (FR) that conducts a random shuffle based on the split features over a batch of samples to synthesize more training samples, which spans the potential samples' variability. To the end, we formulate a unified framework, named CAScaded Split-and-Aggregate Learning with Feature Recombination (CAS-SAL-FR), to learn the above modules jointly and concurrently. Experiments on five popular benchmarks, including RAP, PA-100K, PETA, Market-1501 and Duke attribute datasets, show the proposed CAS-SAL-FR achieves new state-of-the-art performance. | en |
dc.description.version | Peer reviewed | en |
dc.format.extent | 13 | |
dc.format.mimetype | application/pdf | en_US |
dc.identifier.citation | Yang, Y, Tan, Z, Tiwari, P, Pandey, H M, Wan, J, Lei, Z, Guo, G & Li, S Z 2021, ' Cascaded Split-and-Aggregate Learning with Feature Recombination for Pedestrian Attribute Recognition ', International Journal of Computer Vision, vol. 129, no. 10, pp. 2731-2744 . https://doi.org/10.1007/s11263-021-01499-z | en |
dc.identifier.doi | 10.1007/s11263-021-01499-z | en_US |
dc.identifier.issn | 0920-5691 | |
dc.identifier.other | PURE UUID: 6a1920f4-6a99-4899-8191-87649bc9a703 | en_US |
dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/6a1920f4-6a99-4899-8191-87649bc9a703 | en_US |
dc.identifier.other | PURE LINK: http://www.scopus.com/inward/record.url?scp=85110645826&partnerID=8YFLogxK | en_US |
dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/65142198/VISI_D_20_00405R2.pdf | en_US |
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/108891 | |
dc.identifier.urn | URN:NBN:fi:aalto-202108048135 | |
dc.language.iso | en | en |
dc.publisher | Springer Netherlands | |
dc.relation | info:eu-repo/grantAgreement/EC/H2020/732894/EU//INTERVENE | en_US |
dc.relation.ispartofseries | INTERNATIONAL JOURNAL OF COMPUTER VISION | en |
dc.rights | openAccess | en |
dc.title | Cascaded Split-and-Aggregate Learning with Feature Recombination for Pedestrian Attribute Recognition | en |
dc.type | A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä | fi |