To improve the precision of machine-learning predictions, we investigate various techniques that combine multiple quality sources for the same property. In particular, focusing on the electronic band gap, we aim at having the lowest error by taking advantage of all available experimental measurements and density-functional theory calculations. We show that learning about the difference between high- and low-quality values, considered a correction, significantly improves the results compared to learning on the sole high-quality experimental data. As a preliminary step, we also introduce an extension of the MODNet model, which consists of using a genetic algorithm for hyperparameter optimization. Thanks to this, MODNet is shown to achieve excellent performance on the Matbench test suite.
De Breuck, P.-P., Heymans, G., & Rignanese, G.-M. (2022). Accurate experimental band gap predictions with multifidelity correction learning. Journal of Materials Informatics, 2(3), 10. https://doi.org/10.20517/jmi.2022.13 (Original work published 2022)