This paper proposes using mutual information for feature selection in multi-label classification, a problem that has received surprisingly little attention. A pruned problem transformation method is first applied to convert the multi-label problem into a single-label one. A greedy feature selection procedure based on multidimensional mutual information is then carried out. Results on three data sets demonstrate the value of the approach, which sharply reduces the dimension of the problem and improves classifier performance.
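The two steps described above can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: features are discrete, mutual information is estimated with a simple plug-in (count-based) estimator rather than whatever estimator the paper uses, samples with rare label sets are simply discarded, and the names `pruned_transformation`, `mutual_information`, `greedy_forward_selection` and the `min_count` threshold are hypothetical.

```python
from collections import Counter
from math import log


def pruned_transformation(X, Y, min_count=2):
    """Turn each label set into one class; drop samples whose label set is rare.

    Assumption: rare label sets are discarded outright (a simplification).
    """
    labelsets = [frozenset(y) for y in Y]
    counts = Counter(labelsets)
    kept = [(x, ls) for x, ls in zip(X, labelsets) if counts[ls] >= min_count]
    return [x for x, _ in kept], [ls for _, ls in kept]


def mutual_information(X, classes, features):
    """Plug-in estimate (in nats) of I(X_features; class) for discrete data."""
    n = len(X)
    joint, p_x, p_c = Counter(), Counter(), Counter()
    for x, c in zip(X, classes):
        key = tuple(x[f] for f in features)  # joint value of the feature subset
        joint[(key, c)] += 1
        p_x[key] += 1
        p_c[c] += 1
    return sum(
        (n_xc / n) * log(n_xc * n / (p_x[k] * p_c[c]))
        for (k, c), n_xc in joint.items()
    )


def greedy_forward_selection(X, classes, n_select):
    """At each step, add the feature that maximizes the joint MI with the class."""
    selected, remaining = [], list(range(len(X[0])))
    while remaining and len(selected) < n_select:
        best = max(
            remaining,
            key=lambda f: mutual_information(X, classes, selected + [f]),
        )
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data (hypothetical): label "a" depends on feature 0, label "b" on
# feature 2, and feature 1 is noise. The last sample has a label set that
# occurs only once and is therefore pruned.
X = [[0, 0, 0], [0, 1, 0], [1, 0, 0], [1, 1, 0],
     [0, 0, 1], [0, 1, 1], [1, 0, 1], [1, 1, 1], [0, 1, 0]]
Y = [[], [], ["a"], ["a"], ["b"], ["b"], ["a", "b"], ["a", "b"], ["c"]]

X_kept, classes = pruned_transformation(X, Y, min_count=2)
selected = greedy_forward_selection(X_kept, classes, n_select=2)
print(selected)
```

On this toy data the selection should retain features 0 and 2 and ignore the noise feature. Evaluating the mutual information on the whole candidate subset, rather than on each feature in isolation, is what lets the greedy search account for features that are only informative jointly; the cost is that count-based estimates become unreliable as the subset grows.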
Doquire, G., & Verleysen, M. (2011). Feature Selection for Multi-label Classification Problems. In Joan Cabestany (ed.), Advances in Computational Intelligence (pp. 9-16). Springer. https://doi.org/10.1007/978-3-642-21501-8_2