Stochastic cluster embedding

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorYang, Zhirongen_US
dc.contributor.authorChen, Yuweien_US
dc.contributor.authorSedov, Denisen_US
dc.contributor.authorKaski, Samuelen_US
dc.contributor.authorCorander, Jukkaen_US
dc.contributor.departmentDepartment of Computer Scienceen
dc.contributor.groupauthorProbabilistic Machine Learningen
dc.contributor.groupauthorProfessorship Kaski Samuelen
dc.contributor.groupauthorComputer Science Professorsen
dc.contributor.groupauthorComputer Science - Artificial Intelligence and Machine Learning (AIML) - Research areaen
dc.contributor.groupauthorFinnish Center for Artificial Intelligence, FCAIen
dc.contributor.groupauthorHelsinki Institute for Information Technology (HIIT)en
dc.date.accessioned2023-01-18T09:26:47Z
dc.date.available2023-01-18T09:26:47Z
dc.date.issued2023-02en_US
dc.descriptionFunding Information: This work was supported by The Research Council of Norway, Grant Number 287284, ERC Grant Number 742158, the Academy of Finland (Flagship programme: Finnish Center for Artificial Intelligence FCAI), and UKRI Turing AI World-Leading Researcher Fellowship, EP/W002973/1. We acknowledge for using the IDUN computing cluster (Själander et al. ) provided at Norwegian University of Science and Technology.
dc.description.abstractNeighbor embedding (NE) aims to preserve pairwise similarities between data items and has been shown to yield an effective principle for data visualization. However, even the best existing NE methods such as stochastic neighbor embedding (SNE) may leave large-scale patterns hidden, for example clusters, despite strong signals being present in the data. To address this, we propose a new cluster visualization method based on the Neighbor Embedding principle. We first present a family of Neighbor Embedding methods that generalizes SNE by using non-normalized Kullback–Leibler divergence with a scale parameter. In this family, much better cluster visualizations often appear with a parameter value different from the one corresponding to SNE. We also develop an efficient software that employs asynchronous stochastic block coordinate descent to optimize the new family of objective functions. Our experimental results demonstrate that the method consistently and substantially improves the visualization of data clusters compared with the state-of-the-art NE approaches. The code of our method is publicly available at https://github.com/rozyangno/sce.en
dc.description.versionPeer revieweden
dc.format.extent14
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationYang, Z, Chen, Y, Sedov, D, Kaski, S & Corander, J 2023, 'Stochastic cluster embedding', STATISTICS AND COMPUTING, vol. 33, no. 1, 12, pp. 1-14. https://doi.org/10.1007/s11222-022-10186-zen
dc.identifier.doi10.1007/s11222-022-10186-zen_US
dc.identifier.issn0960-3174
dc.identifier.otherPURE UUID: b0f19441-ebb6-4cc4-b468-280227110223en_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/b0f19441-ebb6-4cc4-b468-280227110223en_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/97684964/Stochastic_cluster_embedding.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/118940
dc.identifier.urnURN:NBN:fi:aalto-202301181296
dc.language.isoenen
dc.publisherSpringer
dc.relation.fundinginfoThis work was supported by The Research Council of Norway, Grant Number 287284, ERC Grant Number 742158, the Academy of Finland (Flagship programme: Finnish Center for Artificial Intelligence FCAI), and UKRI Turing AI World-Leading Researcher Fellowship, EP/W002973/1. We acknowledge for using the IDUN computing cluster (Själander et al. ) provided at Norwegian University of Science and Technology. This work was supported by The Research Council of Norway, Grant Number 287284, ERC Grant Number 742158, the Academy of Finland (Flagship programme: Finnish Center for Artificial Intelligence FCAI), and UKRI Turing AI World-Leading Researcher Fellowship, EP/W002973/1. We acknowledge for using the IDUN computing cluster (Själander et al. 2019) provided at Norwegian University of Science and Technology.
dc.relation.ispartofseriesSTATISTICS AND COMPUTINGen
dc.relation.ispartofseriesVolume 33, issue 1, pp. 1-14en
dc.rightsopenAccessen
dc.subject.keywordClusteringen_US
dc.subject.keywordInformation divergenceen_US
dc.subject.keywordNeighbor embeddingen_US
dc.subject.keywordNonlinear dimensionality reductionen_US
dc.subject.keywordStochastic optimizationen_US
dc.subject.keywordVisualizationen_US
dc.titleStochastic cluster embeddingen
dc.typeA1 Alkuperäisartikkeli tieteellisessä aikakauslehdessäfi
dc.type.versionpublishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Stochastic_cluster_embedding.pdf
Size:
3.97 MB
Format:
Adobe Portable Document Format