TabDPT: Scaling Tabular Foundation Models

Ma, Junwei; Thomas, Valentin; Hosseinzadeh, Rasa; Kamkari, Hamidreza; Labach, Alex; Cresswell, Jesse C.; Golestan, Keyvan; Yu, Guangwei; Volkovs, Maksims; Caterini, Anthony L.

arXiv:2410.18164 (cs)

[Submitted on 23 Oct 2024]

Abstract: The challenges faced by neural networks on tabular data are well-documented and have hampered the progress of tabular foundation models. Techniques leveraging in-context learning (ICL) have shown promise here, allowing for dynamic adaptation to unseen data. ICL can provide predictions for entirely new datasets without further training or hyperparameter tuning, therefore providing very fast inference when encountering a novel task. However, scaling ICL for tabular data remains an issue: approaches based on large language models cannot efficiently process numeric tables, and tabular-specific techniques have not been able to effectively harness the power of real data to improve performance and generalization. We are able to overcome these challenges by training tabular-specific ICL-based architectures on real data with self-supervised learning and retrieval, combining the best of both worlds. Our resulting model -- the Tabular Discriminative Pre-trained Transformer (TabDPT) -- achieves state-of-the-art performance on the CC18 (classification) and CTR23 (regression) benchmarks with no task-specific fine-tuning, demonstrating the adaptability and speed of ICL once the model is pre-trained. TabDPT also demonstrates strong scaling as both model size and amount of available data increase, pointing towards future improvements simply through the curation of larger tabular pre-training datasets and training larger models.
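
To make the retrieval idea in the abstract concrete, here is a minimal sketch of how a context could be selected for one query row before it is handed to an ICL model. This is not the TabDPT implementation: the function names (`retrieve_context`, `majority_vote`), the choice of Euclidean distance on standardized features, and the toy data are all assumptions for illustration, and the majority vote is only a stand-in for the pre-trained transformer that would actually consume the retrieved context.

```python
import numpy as np


def retrieve_context(X_train, y_train, x_query, k=3):
    """Return the k training rows (and their labels) closest to x_query.

    Assumption: closeness is Euclidean distance after standardizing each
    feature with training-set statistics. The paper may use something else.
    """
    mean = X_train.mean(axis=0)
    std = X_train.std(axis=0)
    std[std == 0] = 1.0  # avoid division by zero for constant columns

    Z_train = (X_train - mean) / std
    z_query = (x_query - mean) / std

    dists = np.linalg.norm(Z_train - z_query, axis=1)
    idx = np.argsort(dists, kind="stable")[:k]
    return X_train[idx], y_train[idx]


def majority_vote(labels):
    """Hypothetical stand-in for the ICL model: most frequent context label."""
    values, counts = np.unique(labels, return_counts=True)
    return values[np.argmax(counts)]


# Toy data: two well-separated clusters with labels 0 and 1.
X_train = np.array(
    [[0.0, 0.0], [0.1, 0.2], [0.2, 0.1], [5.0, 5.0], [5.1, 4.9], [4.9, 5.2]]
)
y_train = np.array([0, 0, 0, 1, 1, 1])
x_query = np.array([4.8, 5.0])

X_ctx, y_ctx = retrieve_context(X_train, y_train, x_query, k=3)
prediction = majority_vote(y_ctx)
```

In a real ICL setup, `X_ctx` and `y_ctx` would be passed together with `x_query` to the pre-trained model in a single forward pass, so no weights are updated for the new dataset.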

Comments:	Minimal TabDPT interface to provide predictions on new datasets available at the following link: this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2410.18164 [cs.LG]
	(or arXiv:2410.18164v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.18164

Submission history

From: Anthony Caterini
[v1] Wed, 23 Oct 2024 18:00:00 UTC (388 KB)
