Molecular chirality is of fundamental interest in the pharmaceutical field where the identification and separation of enantiomers is of vital importance, since, in a biological medium, enantiomers should be regarded as different chemical compounds because they may show considerable differences in their interactions with other chiral molecules. The mutual information (MI) criterion has been recently used to select relevant variables in spectrometric data [1, 2]. The MI measures the information content of X (explanatory variables) with respect to y (response variable), without making any assumption on the type of model that will be used. This work will show some preliminary results of the application of MI to QSSR (Quantitative Structure-Selectivity Relationship) data. Therefore, from an initial set of molecular descriptors (≈ 1300), the MI criteria will select a much smaller set of variables (maximum 12), that will be then used to model the enantioselectivity of several molecules. Due to the fact that MI selects a rather small number of variables, the interpretability of the resulting model becomes easier.
Affiliations
VUBDepartment of Analytical Chemistry and Pharmaceutical Technology
Caetano, S., Krier, C., Verleysen, M., & Vander Heyden, Y. (2006). Mutual Information for the selection of variables to model enantioselectivity. Proceedings of the 4th International Chemometrics Research Symposium (ICRM 2006). 4th International Chemometrics Research Symposium (ICRM 2006), Veldhoven (the Netherlands). https://hdl.handle.net/2078.5/254067