English(EN)Introducing TabFM: A zero-shot foundation model for tabular data
Google推出TabFM用于表格数据;新研究探讨模型局限性
作者PulseAugur 编辑部·[8 个来源]·
Google Research推出了TabFM,一个用于表格数据的零样本基础模型,该模型与BigQuery ML集成,以简化分类和回归任务。与需要大量手动调优的传统方法不同,TabFM使用上下文学习,在单次传递中从未见过的数据表中生成预测。同时,几篇研究论文探讨了表格基础模型的能力和局限性,其中一篇论文强调了关于规则约束数据的推理存在形式化障碍,另一篇论文则引入了一个基准来评估模型在标准独立同分布(IID)数据集之外的通用性。
AI
arXiv:2606.29091v1 Announce Type: cross Abstract: Tabular foundation models cannot reason about data produced by running systems without access to the rules that govern them. We make this statement falsifiable. The \emph{Operational Turing Test} (OTT) constructs pairs of legal an…
arXiv:2606.30258v1 Announce Type: cross Abstract: Tabular foundation models have advanced deep learning for tabular data by delivering strong default performance across many small and medium tasks. Yet in niche domains, where data is scarce, high-dimensional, and shifted from the…
arXiv cs.AI
TIER_1English(EN)·Lennart Purucker, Andrej Tschalzev, Nick Erickson, Gioia Blayer, David Holzm\"uller, Alan Arazi, Alexander Pfefferle, Mustafa Tajjar, Ga\"el Varoquaux, Frank Hutter·
arXiv:2606.30410v1 Announce Type: cross Abstract: Foundation models for predictive machine learning on tabular data have recently gained significant traction in academia and industry. Research communities across disciplines are increasingly evaluating tabular foundation models on…
arXiv cs.LG
TIER_1English(EN)·Zeynep T\"urkmen, K\"ur\c{s}at Kaya, Alexander Pfefferle, Frank Hutter·
arXiv:2606.29241v1 Announce Type: new Abstract: Data-generating priors are a central component of tabular foundation models because they define the task distribution used during pretraining. However, priors are rarely evaluated as independent components, making it difficult to un…
Foundation models for predictive machine learning on tabular data have recently gained significant traction in academia and industry. Research communities across disciplines are increasingly evaluating tabular foundation models on diverse datasets and tasks. However, these task- …
Tabular foundation models have advanced deep learning for tabular data by delivering strong default performance across many small and medium tasks. Yet in niche domains, where data is scarce, high-dimensional, and shifted from the pretraining distribution, they may still fail to …
Tabular foundation models show varying performance across different data conditions, with traditional methods still outperforming newer approaches on complex, large-scale datasets.