Unsupervised relevance analysis for feature extraction and selection. A distance-based approach for feature relevance

Peluffo Ordoñez, Diego Hernan;Lee, John;Verleysen, Michel;Rodriguez, José L.;Castellanos-Dominguez, German
(2015) 3rd International Conference on Pattern Recognition Applications and Methods (ICPRAM 2014) — Location: Angers (France) (6.March.2014)

Files

Unsupervisedrelevanceanalysisforfeatureextractionandselection.pdf
  • Restricted Access
  • Adobe PDF
  • 80.45 KB

Details

Authors
  • Peluffo Ordoñez, Diego HernanUCLouvain
    Author
  • Lee, Johnorcid-logoUCLouvain
    Author
  • Author
  • Rodriguez, José L.Universidad Autonoma de Manizales, Colombia
    Author
  • Castellanos-Dominguez, GermanUniversidad Nacional de Colombia
    Author
Abstract
The aim of this paper is to propose a new generalized formulation for feature extraction based on distances from a feature relevance point of view. This is done within an unsupervised framework. To do so, it is first outlined the formal concept of feature relevance. Then, a novel feature extraction approach is introduced. Such an approach employs the M-norm as a distance measure. It is demonstrated that under some conditions, this method can readily explain literature methods. As another contribution of this paper, we propose an elegant feature ranking approach for feature selection followed from the spectral analysis of the data variability. Also, we provide a weighted PCA scheme revealing the relationship between feature extraction and feature selection. To assess the behavior of the studied methods within a pattern recognition system, a clustering stage is carried out. Normalized mutual information is used to quantify the quality of resultant clusters. Proposed methods reach comparable results with respect to literature methods.
Affiliations

Citations

Peluffo Ordoñez, D. H., Lee, J., Verleysen, M., Rodriguez, J. L., & Castellanos-Dominguez, G. (2015). Unsupervised relevance analysis for feature extraction and selection. A distance-based approach for feature relevance. 3rd International Conference on Pattern Recognition Applications and Methods (ICPRAM 2014), Angers (France). https://hdl.handle.net/2078.5/253913