Doubly Stochastic Neighbor Embedding on Spheres

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorLu, Yaoen_US
dc.contributor.authorCorander, Jukkaen_US
dc.contributor.authorYang, Zhirongen_US
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.groupauthorCentre of Excellence in Computational Inference, COINen
dc.date.accessioned2020-01-02T13:52:08Z
dc.date.available2020-01-02T13:52:08Z
dc.date.issued2019-12-01en_US
dc.description.abstractStochastic Neighbor Embedding (SNE) methods minimize the divergence between the similarity matrix of a high-dimensional data set and its counterpart from a low-dimensional embedding, leading to widely applied tools for data visualization. Despite their popularity, the current SNE methods experience a crowding problem when the data include highly imbalanced similarities. This implies that the data points with higher total similarity tend to get crowded around the display center. To solve this problem, we introduce a fast normalization method and normalize the similarity matrix to be doubly stochastic such that all the data points have equal total similarities. Furthermore, we show empirically and theoretically that the doubly stochasticity constraint often leads to embeddings which are approximately spherical. This suggests replacing a flat space with spheres as the embedding space. The spherical embedding eliminates the discrepancy between the center and the periphery in visualization, which efficiently resolves the crowding problem. We compared the proposed method (DOSNES) with the state-of-the-art SNE method on three real-world datasets and the results clearly indicate that our method is more favorable in terms of visualization quality. DOSNES is freely available at http://yaolubrain.github.io/dosnes/.en
dc.description.versionPeer revieweden
dc.format.extent7
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationLu, Y, Corander, J & Yang, Z 2019, 'Doubly Stochastic Neighbor Embedding on Spheres', Pattern Recognition Letters, vol. 128, pp. 100-106. https://doi.org/10.1016/j.patrec.2019.08.026en
dc.identifier.doi10.1016/j.patrec.2019.08.026en_US
dc.identifier.issn0167-8655
dc.identifier.issn1872-7344
dc.identifier.otherPURE UUID: 08320917-43d8-440d-addf-7c32d0ff543fen_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/08320917-43d8-440d-addf-7c32d0ff543fen_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/39548535/1_s2.0_S0167865518305099_main.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/41905
dc.identifier.urnURN:NBN:fi:aalto-202001021016
dc.language.isoenen
dc.publisherElsevier
dc.relation.ispartofseriesPattern Recognition Lettersen
dc.relation.ispartofseriesVolume 128, pp. 100-106en
dc.rightsopenAccessen
dc.subject.keywordData visualizationen_US
dc.subject.keywordInformation divergenceen_US
dc.subject.keywordNonlinear dimensionality reductionen_US
dc.titleDoubly Stochastic Neighbor Embedding on Spheresen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files