TabPFN-2.5: Next Generation Tabular Foundation Model Scales to 20× More Data Cells
By
onasta
Lightly browned and well buttered. A solid pick from the rack.
Summary
TabPFN-2.5 is introduced as the next generation tabular foundation model that scales to 20× more data cells than its predecessor TabPFNv2. The model substantially outperforms tuned tree-based models on industry standard benchmarks with up to 50,000 data points and 2,000 features, and matches the accuracy of AutoGluon 1.4, a complex four-hour tuned ensemble. The TabPFN series has significantly impacted tabular AI with dozens of methods building on it and hundreds of applications across different use cases.
Key quotes
· 3 pulledThe first tabular foundation model, TabPFN, and its successor TabPFNv2 have impacted tabular AI substantially, with dozens of methods building on it and hundreds of applications across different use cases.
This report introduces TabPFN-2.5, the next generation of our tabular foundation model, scaling to 20× data cells compared to TabPFNv2.
On industry standard benchmarks with up to 50,000 data points and 2,000 features, TabPFN-2.5 substantially outperforms tuned tree-based models and matches the accuracy of AutoGluon 1.4, a complex four-hour tuned ensemble.
You might also wanna read
Decoding AI's Internal Language: How Sparse Autoencoders Help Interpret Neural Activations
This article discusses how AI models like Claude process language through numerical activations, similar to neural activity in the human bra
Nested Learning: A New Machine Learning Paradigm for Continual Learning Inspired by Human Neuroplasticity
The article introduces "Nested Learning," a new machine learning paradigm for continual learning that addresses the challenge of models acqu
research.google·5mo agoBinary Retrieval-Augmented Reward Method Reduces Language Model Hallucinations Without Performance Loss
Researchers propose a novel binary retrieval-augmented reward (RAR) method using online reinforcement learning to reduce hallucinations in l
Research Shows LLMs Develop Cognitive Degradation from Social Media Training Data
This research paper introduces the concept of 'LLM Brain Rot' - a phenomenon where large language models (LLMs) experience cognitive degrada
Lumina-DiMOO: Open-Source Multimodal AI Model Using Discrete Diffusion for Cross-Modal Generation
Lumina-DiMOO is an open-source foundational model that uses discrete diffusion modeling for multimodal generation and understanding across v
Efficient Training of Diffusion Models with Token Routing (TREAD)
The article discusses TREAD, a novel method for improving the training efficiency and generative performance of diffusion models, which are
