Investigating latent representations and generalization in deep neural networks for tabular data

Couplet, Edouard;Lambert, Pierre;Verleysen, Michel;Lee, John;De Bodt, Cyril
(2024) Neurocomputing — Vol. 597C (2024)

Files

NEUROCOM2024_revised.pdf
  • Open Access
  • Adobe PDF
  • 791.54 KB
  • https://creativecommons.org/licenses/by-nc-nd/4.0/

Details

Authors
Abstract
Recent deep neural network architectures that are tailored to tabular data operate at the feature level and process multiple latent representations simultaneously, typically one per feature. We investigate the impact of varying the dimension and number of such latent representations on model performance and generalization. Our results identify distinct model behaviors during both training and testing phases. To ease analysis of these behaviors, we propose a novel tool for characterizing data complexity and use it to highlight intricate relationships between data complexity, model complexity and model performance. We hypothesize a phenomenon of implicit self-regularization which intensifies with model capacity and sample-to-dimension ratio. While this self-regularization can mitigate over-fitting, it may also lead to reduced performance on training data. Our findings expand the understanding of neural networks applied to tabular data and provide insights that can help practitioners and/or automated methods in designing neural networks architectures that better match the complexity of specific tabular data sets.
Affiliations

Citations

Couplet, E., Lambert, P., Verleysen, M., Lee, J., & De Bodt, C. (2024). Investigating latent representations and generalization in deep neural networks for tabular data. Neurocomputing, 597C. https://doi.org/10.1016/j.neucom.2024.127967 (Original work published 2024)