Gandalf: Learning Label-label Correlations in Extreme Multi-label Classification via Label Features

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorKharbanda, Siddhant
dc.contributor.authorGupta, Devaansh
dc.contributor.authorSchultheis, Erik
dc.contributor.authorBanerjee, Atmadeep
dc.contributor.authorHsieh, Cho Jui
dc.contributor.authorBabbar, Rohit
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.organizationUniversity of California, Los Angeles
dc.contributor.organizationDepartment of Computer Science
dc.contributor.organizationAalto University
dc.date.accessioned2024-10-04T09:00:35Z
dc.date.available2024-10-04T09:00:35Z
dc.date.issued2024-08-25
dc.descriptionPublisher Copyright: © 2024 Copyright held by the owner/author(s).
dc.description.abstractExtreme Multi-label Text Classification (XMC) involves learning a classifier that can assign an input with a subset of most relevant labels from millions of label choices. Recent works in this domain have increasingly focused on a symmetric problem setting where both input instances and label features are short-text in nature. Short-text XMC with label features has found numerous applications in areas such as query-to-ad-phrase matching in search ads, title-based product recommendation, prediction of related searches. In this paper, we propose Gandalf, a novel approach which makes use of a label co-occurrence graph to leverage label features as additional data points to supplement the training distribution. By exploiting the characteristics of the short-text XMC problem, it leverages the label features to construct valid training instances, and uses the label graph for generating the corresponding soft-label targets, hence effectively capturing the label-label correlations. Surprisingly, models trained on these new training instances, although being less than half of the original dataset, can outperform models trained on the original dataset, particularly on the PSP@k metric for tail labels. With this insight, we aim to train existing XMC algorithms on both, the original and new training instances, leading to an average 5% relative improvements for 6 state-of-the-art algorithms across 4 benchmark datasets consisting of up to 1.3M labels. Gandalf can be applied in a plug-and-play manner to various methods and thus forwards the state-of-the-art in the domain, without incurring any additional computational overheads. Code has been open-sourced at www.github.com/xmc-aalto/InceptionXML.en
dc.description.versionPeer revieweden
dc.format.extent12
dc.format.mimetypeapplication/pdf
dc.identifier.citationKharbanda, S, Gupta, D, Schultheis, E, Banerjee, A, Hsieh, C J & Babbar, R 2024, Gandalf: Learning Label-label Correlations in Extreme Multi-label Classification via Label Features . in KDD 2024 - Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining . ACM, pp. 1360-1371, ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Barcelona, Spain, 25/08/2024 . https://doi.org/10.1145/3637528.3672063en
dc.identifier.doi10.1145/3637528.3672063
dc.identifier.isbn9798400704901
dc.identifier.otherPURE UUID: 304a2b2c-618f-4f0f-9799-4e3cdd194fe7
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/304a2b2c-618f-4f0f-9799-4e3cdd194fe7
dc.identifier.otherPURE LINK: http://www.scopus.com/inward/record.url?scp=85203699854&partnerID=8YFLogxK
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/160395651/SCI_Kharbanda_etal_KDD_2024.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/131091
dc.identifier.urnURN:NBN:fi:aalto-202410046627
dc.language.isoenen
dc.relation.ispartofKDD 2024 - Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
dc.relation.ispartofpp. 1360-1371
dc.relation.ispartofACM SIGKDD International Conference on Knowledge Discovery and Data Miningen
dc.rightsopenAccessen
dc.subject.keywordco-occurrence matrix
dc.subject.keywordcorrelation graph
dc.subject.keyworddata augmentation
dc.subject.keywordextreme classifiers
dc.subject.keywordlabel-label correlations
dc.subject.keywordmulti-label classification
dc.titleGandalf: Learning Label-label Correlations in Extreme Multi-label Classification via Label Featuresen
dc.typeA4 Artikkeli konferenssijulkaisussafi
dc.type.versionpublishedVersion

Files