1. TabKANet: Tabular Data Modeling with Kolmogorov-Arnold Network and Transformer
- Author
-
Gao, Weihao, Gong, Zheng, Deng, Zhuo, Rong, Fuju, Chen, Chucheng, and Ma, Lan
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
Tabular data is the most common type of data in real-life scenarios. In this study, we propose the TabKANet model for tabular data modeling, which targets the bottlenecks in learning from numerical content. We constructed a Kolmogorov-Arnold Network (KAN) based Numerical Embedding Module and unified numerical and categorical features encoding within a Transformer architecture. TabKANet has demonstrated stable and significantly superior performance compared to Neural Networks (NNs) across multiple public datasets in binary classification, multi-class classification, and regression tasks. Its performance is comparable to or surpasses that of Gradient Boosted Decision Tree models (GBDTs). Our code is publicly available on GitHub: https://github.com/AI-thpremed/TabKANet., Comment: 13 pages,5 figures
- Published
- 2024