Mutual information for feature selection with missing data

Doquire, Gauthier;Verleysen, Michel
(2011) 19th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning — Location: Bruges (Belgium) (27.April.2011)

Files

esann11gd.pdf
  • Open Access
  • Adobe PDF
  • 699.07 KB

Details

Authors
Abstract
Feature selection is an important task for many machine learning applications; moreover missing data are encoutered very often in practice. This paper proposes to adapt a nearest neighbors based mutual information estimator to handle missing data and to use it to achieve feature selection. Results on artificial and real world datasets show that the method is able to select important features without the need for any imputation algorithm. Moreover, experiments also indicate that selecting the features before imputing the data generally increases the precision of the prediction models.
Affiliations

Citations

Doquire, G., & Verleysen, M. (2011). Mutual information for feature selection with missing data. Proceedings of the 19th European Symposium on Artificial Neural networks, Computational Intelligence and Machine learning (ESANN 2011), 263-268. https://hdl.handle.net/2078.5/253911