Stochastic cluster embedding
| dc.contributor | Aalto-yliopisto | fi |
| dc.contributor | Aalto University | en |
| dc.contributor.author | Yang, Zhirong | en_US |
| dc.contributor.author | Chen, Yuwei | en_US |
| dc.contributor.author | Sedov, Denis | en_US |
| dc.contributor.author | Kaski, Samuel | en_US |
| dc.contributor.author | Corander, Jukka | en_US |
| dc.contributor.department | Department of Computer Science | en |
| dc.contributor.groupauthor | Probabilistic Machine Learning | en |
| dc.contributor.groupauthor | Professorship Kaski Samuel | en |
| dc.contributor.groupauthor | Computer Science Professors | en |
| dc.contributor.groupauthor | Computer Science - Artificial Intelligence and Machine Learning (AIML) - Research area | en |
| dc.contributor.groupauthor | Finnish Center for Artificial Intelligence, FCAI | en |
| dc.contributor.groupauthor | Helsinki Institute for Information Technology (HIIT) | en |
| dc.date.accessioned | 2023-01-18T09:26:47Z | |
| dc.date.available | 2023-01-18T09:26:47Z | |
| dc.date.issued | 2023-02 | en_US |
| dc.description | Funding Information: This work was supported by The Research Council of Norway, Grant Number 287284, ERC Grant Number 742158, the Academy of Finland (Flagship programme: Finnish Center for Artificial Intelligence FCAI), and UKRI Turing AI World-Leading Researcher Fellowship, EP/W002973/1. We acknowledge for using the IDUN computing cluster (Själander et al. ) provided at Norwegian University of Science and Technology. | |
| dc.description.abstract | Neighbor embedding (NE) aims to preserve pairwise similarities between data items and has been shown to yield an effective principle for data visualization. However, even the best existing NE methods such as stochastic neighbor embedding (SNE) may leave large-scale patterns hidden, for example clusters, despite strong signals being present in the data. To address this, we propose a new cluster visualization method based on the Neighbor Embedding principle. We first present a family of Neighbor Embedding methods that generalizes SNE by using non-normalized Kullback–Leibler divergence with a scale parameter. In this family, much better cluster visualizations often appear with a parameter value different from the one corresponding to SNE. We also develop an efficient software that employs asynchronous stochastic block coordinate descent to optimize the new family of objective functions. Our experimental results demonstrate that the method consistently and substantially improves the visualization of data clusters compared with the state-of-the-art NE approaches. The code of our method is publicly available at https://github.com/rozyangno/sce. | en |
| dc.description.version | Peer reviewed | en |
| dc.format.extent | 14 | |
| dc.format.mimetype | application/pdf | en_US |
| dc.identifier.citation | Yang, Z, Chen, Y, Sedov, D, Kaski, S & Corander, J 2023, 'Stochastic cluster embedding', STATISTICS AND COMPUTING, vol. 33, no. 1, 12, pp. 1-14. https://doi.org/10.1007/s11222-022-10186-z | en |
| dc.identifier.doi | 10.1007/s11222-022-10186-z | en_US |
| dc.identifier.issn | 0960-3174 | |
| dc.identifier.other | PURE UUID: b0f19441-ebb6-4cc4-b468-280227110223 | en_US |
| dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/b0f19441-ebb6-4cc4-b468-280227110223 | en_US |
| dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/97684964/Stochastic_cluster_embedding.pdf | |
| dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/118940 | |
| dc.identifier.urn | URN:NBN:fi:aalto-202301181296 | |
| dc.language.iso | en | en |
| dc.publisher | Springer | |
| dc.relation.fundinginfo | This work was supported by The Research Council of Norway, Grant Number 287284, ERC Grant Number 742158, the Academy of Finland (Flagship programme: Finnish Center for Artificial Intelligence FCAI), and UKRI Turing AI World-Leading Researcher Fellowship, EP/W002973/1. We acknowledge for using the IDUN computing cluster (Själander et al. ) provided at Norwegian University of Science and Technology. This work was supported by The Research Council of Norway, Grant Number 287284, ERC Grant Number 742158, the Academy of Finland (Flagship programme: Finnish Center for Artificial Intelligence FCAI), and UKRI Turing AI World-Leading Researcher Fellowship, EP/W002973/1. We acknowledge for using the IDUN computing cluster (Själander et al. 2019) provided at Norwegian University of Science and Technology. | |
| dc.relation.ispartofseries | STATISTICS AND COMPUTING | en |
| dc.relation.ispartofseries | Volume 33, issue 1, pp. 1-14 | en |
| dc.rights | openAccess | en |
| dc.subject.keyword | Clustering | en_US |
| dc.subject.keyword | Information divergence | en_US |
| dc.subject.keyword | Neighbor embedding | en_US |
| dc.subject.keyword | Nonlinear dimensionality reduction | en_US |
| dc.subject.keyword | Stochastic optimization | en_US |
| dc.subject.keyword | Visualization | en_US |
| dc.title | Stochastic cluster embedding | en |
| dc.type | A1 Alkuperäisartikkeli tieteellisessä aikakauslehdessä | fi |
| dc.type.version | publishedVersion |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Stochastic_cluster_embedding.pdf
- Size:
- 3.97 MB
- Format:
- Adobe Portable Document Format