(2011) 19th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. Location: Bruges (Belgium), 27 April 2011
Feature selection is an important task in many machine learning applications, and missing data are encountered very often in practice. This paper proposes adapting a nearest-neighbors-based mutual information estimator to handle missing data and using it to perform feature selection. Results on artificial and real-world datasets show that the method is able to select important features without the need for any imputation algorithm. Experiments also indicate that selecting the features before imputing the data generally increases the precision of the prediction models.
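To make the idea in the abstract concrete, the following is a minimal sketch of ranking features by a k-nearest-neighbors mutual information estimate without imputing missing values. It is not the authors' estimator: the abstract does not say how the estimator is adapted, so this sketch assumes the simplest possible strategy, namely scoring each feature on only the samples where that feature is observed, and it uses a Kraskov-style estimator for one-dimensional variables. The function names (`digamma_int`, `ksg_mi`, `mi_available_cases`, `demo_scores`), the choice `k=3`, and the synthetic data are all illustrative assumptions.

```python
import math
import random

EULER_GAMMA = 0.5772156649015329


def digamma_int(n):
    """Digamma at a positive integer: psi(n) = -gamma + sum_{j=1}^{n-1} 1/j."""
    return -EULER_GAMMA + sum(1.0 / j for j in range(1, n))


def ksg_mi(xs, ys, k=3):
    """Kraskov-style k-NN estimate of I(X;Y) in nats for 1-D samples."""
    n = len(xs)
    total = 0.0
    for i in range(n):
        # Max-norm distances from point i to every other point in the joint space.
        dists = sorted(
            max(abs(xs[i] - xs[j]), abs(ys[i] - ys[j]))
            for j in range(n) if j != i
        )
        eps = dists[k - 1]  # distance to the k-th nearest neighbor
        # Count points strictly closer than eps in each marginal space.
        nx = sum(1 for j in range(n) if j != i and abs(xs[i] - xs[j]) < eps)
        ny = sum(1 for j in range(n) if j != i and abs(ys[i] - ys[j]) < eps)
        total += digamma_int(nx + 1) + digamma_int(ny + 1)
    return digamma_int(k) + digamma_int(n) - total / n


def mi_available_cases(feature, target, k=3):
    """Assumed strategy: estimate MI on the samples where the feature is observed.

    Missing values are encoded as NaN; no imputation is performed.
    """
    pairs = [(x, y) for x, y in zip(feature, target) if not math.isnan(x)]
    xs, ys = zip(*pairs)
    return ksg_mi(xs, ys, k)


def demo_scores(seed=0, n=300, missing_rate=0.2):
    """Score one relevant and one irrelevant synthetic feature with missing values."""
    rng = random.Random(seed)
    relevant = [rng.gauss(0.0, 1.0) for _ in range(n)]
    irrelevant = [rng.gauss(0.0, 1.0) for _ in range(n)]
    target = [x + 0.1 * rng.gauss(0.0, 1.0) for x in relevant]
    # Remove entries completely at random from both features.
    for feature in (relevant, irrelevant):
        for i in range(n):
            if rng.random() < missing_rate:
                feature[i] = float("nan")
    return mi_available_cases(relevant, target), mi_available_cases(irrelevant, target)


if __name__ == "__main__":
    score_relevant, score_irrelevant = demo_scores()
    print("relevant feature:", score_relevant)
    print("irrelevant feature:", score_irrelevant)
```

On this synthetic data the relevant feature should receive a clearly higher score than the irrelevant one, so a ranking by score would select it first. Scoring subsets of several features at once would need a distance that tolerates missing coordinates, which this sketch does not attempt.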
Doquire, G., & Verleysen, M. (2011). Mutual information for feature selection with missing data. Proceedings of the 19th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN 2011), 263-268. https://hdl.handle.net/2078.5/253911