Semi-supervised t-SNE with multi-scale neighborhood preservation

Serna-Serna, Walter; De Bodt, Cyril; Andres M. Alvarez-Meza; Lee, John; Alvaro A. Orozco-Gutierrez; et al.
Neurocomputing, Vol. 550, n° 1, p. 126496 (2023)

Files

preprint.pdf
  • Open Access
  • Adobe PDF
  • 11.06 MB

Details

Authors
  • Serna-Serna, Walter (Universidad Tecnológica de Pereira)
    Author
  • De Bodt, Cyril (UCLouvain)
    Author
  • Andres M. Alvarez-Meza (Universidad Nacional de Colombia)
    Author
  • Lee, John (UCLouvain)
    Author
  • Verleysen, M.
    Author
  • Alvaro A. Orozco-Gutierrez (Universidad Tecnológica de Pereira)
    Author
Abstract
Unsupervised dimensionality reduction (DR) aims to preserve the structure of the input data in a low-dimensional (LD) space based on neighborhood information. In contrast, supervised DR aims to improve learning performance, i.e., in classification and regression, in an LD representation. Unfortunately, obtaining complete labels for a data set is difficult in real-world applications. Here, we introduce a novel DR framework that couples the available class labels with input feature similarities to extend the well-known t-distributed Stochastic Neighbor Embedding (t-SNE) to semi-supervised scenarios. Our proposal, termed Semi-Supervised t-SNE (SS.t-SNE), sets the widths of the Gaussian neighborhoods so as to reveal the salient local and global data structures in an LD space. Our approach is presented as a generalization of the unsupervised and supervised versions of t-SNE. SS.t-SNE outperforms other semi-supervised DR methods in data visualization and classification tasks in LD embeddings.

Citations

Serna-Serna, W., De Bodt, C., Alvarez-Meza, A. M., Lee, J., Verleysen, M., & Orozco-Gutierrez, A. A. (2023). Semi-supervised t-SNE with multi-scale neighborhood preservation. Neurocomputing, 550(1), 126496. https://doi.org/10.1016/j.neucom.2023.126496