Investigating latent representations and generalization in deep neural networks for tabular data

Couplet, Edouard; Lambert, Pierre; Verleysen, Michel; Lee, John; De Bodt, Cyril

doi:10.1016/j.neucom.2024.127967

Neurocomputing, Vol. 597C (2024)

Details

Authors

Couplet, Edouard (UCLouvain)
Lambert, Pierre (UCLouvain)
Verleysen, Michel (UCLouvain)
Lee, John (UCLouvain)
De Bodt, Cyril (UCLouvain)

Abstract

Recent deep neural network architectures that are tailored to tabular data operate at the feature level and process multiple latent representations simultaneously, typically one per feature. We investigate the impact of varying the dimension and number of such latent representations on model performance and generalization. Our results identify distinct model behaviors during both training and testing phases. To ease the analysis of these behaviors, we propose a novel tool for characterizing data complexity and use it to highlight intricate relationships between data complexity, model complexity and model performance. We hypothesize a phenomenon of implicit self-regularization which intensifies with model capacity and sample-to-dimension ratio. While this self-regularization can mitigate over-fitting, it may also lead to reduced performance on training data. Our findings expand the understanding of neural networks applied to tabular data and provide insights that can help practitioners or automated methods in designing neural network architectures that better match the complexity of specific tabular data sets.

Affiliations

UCLouvain, SST/ICTM/ELEN - Pôle en ingénierie électrique

Citations

Couplet, E., Lambert, P., Verleysen, M., Lee, J., & De Bodt, C. (2024). Investigating latent representations and generalization in deep neural networks for tabular data. Neurocomputing, 597C. https://doi.org/10.1016/j.neucom.2024.127967