DC Field | Value | Language |
---|---|---|
dc.contributor.author | Han, Sangwoo | ko |
dc.contributor.author | Choi, Eunseong | ko |
dc.contributor.author | Lim, Chan | ko |
dc.contributor.author | Shim, Hyunjung | ko |
dc.contributor.author | Lee, Jongwuk | ko |
dc.date.accessioned | 2023-09-13T02:01:23Z | - |
dc.date.available | 2023-09-13T02:01:23Z | - |
dc.date.created | 2023-09-13 | - |
dc.date.issued | 2022-10 | - |
dc.identifier.citation | 31st ACM International Conference on Information and Knowledge Management, CIKM 2022, pp.3998 - 4002 | - |
dc.identifier.uri | http://hdl.handle.net/10203/312535 | - |
dc.description.abstract | Extreme multi-label classification (XMC) aims at finding multiple relevant labels for a given sample from a huge label set at the industrial scale. The XMC problem inherently poses two challenges: scalability and label sparsity - the number of labels is too large, and labels follow the long-tail distribution. To resolve these problems, we propose a novel Mixup-based augmentation method for long-tail labels, called TailMix. Building upon the partition-based model, TailMix utilizes the context vectors generated from the label attention layer. It first selectively chooses two context vectors using the inverse propensity score of labels and the label proximity graph representing the co-occurrence of labels. Using two context vectors, it augments new samples with the long-tail label to improve the accuracy of long-tail labels. Despite its simplicity, experimental results show that TailMix consistently outperforms other augmentation methods on three benchmark datasets, especially for long-tail labels in terms of two metrics, PSP@k and PSN@k. | - |
dc.language | English | - |
dc.publisher | Association for Computing Machinery | - |
dc.title | Long-tail Mixup for Extreme Multi-label Classification | - |
dc.type | Conference | - |
dc.identifier.scopusid | 2-s2.0-85140835435 | - |
dc.type.rims | CONF | - |
dc.citation.beginningpage | 3998 | - |
dc.citation.endingpage | 4002 | - |
dc.citation.publicationname | 31st ACM International Conference on Information and Knowledge Management, CIKM 2022 | - |
dc.identifier.conferencecountry | US | - |
dc.identifier.conferencelocation | Atlanta | - |
dc.identifier.doi | 10.1145/3511808.3557632 | - |
dc.contributor.localauthor | Shim, Hyunjung | - |
dc.contributor.nonIdAuthor | Han, Sangwoo | - |
dc.contributor.nonIdAuthor | Choi, Eunseong | - |
dc.contributor.nonIdAuthor | Lim, Chan | - |
dc.contributor.nonIdAuthor | Lee, Jongwuk | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.