Perplexity-free t-SNE and twice Student tt-SNE

De Bodt, Cyril;Mulders, Dounia;Verleysen, Michel;Lee, John
(2018) The European Symposium on Artificial Neural Networks — Location: Bruges (25.April.2018)

Files

2018_esann185.pdf
  • Open Access
  • Adobe PDF
  • 2.94 MB

Details

Authors
Abstract
In dimensionality reduction and data visualisation, t-SNE has become a popular method. In this paper, we propose two variants to the Gaussian similarities used to characterise the neighbourhoods around each high-dimensional datum in t-SNE. A first alternative is to use t distributions like already used in the low-dimensional embedding space; a variable degree of freedom accounts for the intrinsic dimensionality of data. The second variant relies on compounds of Gaussian neighbourhoods with growing widths, thereby suppressing the need for the user to adjust a single size or perplexity. In both cases, heavy-tailed distributions thus characterise the neighbourhood relationships in the data space. Experiments show that both variants are competitive with t-SNE, at no extra cost.
Affiliations

Citations

De Bodt, C., Mulders, D., Verleysen, M., & Lee, J. (2018). Perplexity-free t-SNE and twice Student tt-SNE. ESANN 2018 proceedings, 123-128. https://hdl.handle.net/2078.5/219999