Advances in Automated Machine Learning and Data-Driven Discovery

The field of machine learning and data-driven discovery is rapidly evolving, with a focus on automation, interpretability, and efficiency. Recent developments have seen the introduction of novel methods for automated feature engineering, hyperparameter optimization, and time-series forecasting. These advancements have the potential to significantly improve the performance and usability of machine learning models, enabling practitioners to focus on higher-level tasks. Notably, the integration of evolutionary algorithms, generative models, and large language models has emerged as a potent approach for algorithmic discovery and automated feature engineering.

Some noteworthy papers in this area include: The paper on Tabularis Formatus presents a neuro-symbolic approach to generating conditional formatting suggestions for tables, outperforming current systems by 15.6-26.5%. The paper on Data-Driven Discovery of Interpreable Kalman Filter Variants demonstrates the potential of combining evolutionary algorithms and generative models for interpretable, data-driven synthesis of simple computational modules. The paper on ELATE introduces an evolutionary language model for automated time-series engineering, improving forecasting accuracy by an average of 8.4% across various domains.

Sources

Tabularis Formatus: Predictive Formatting for Tables

Data-Driven Discovery of Interpretable Kalman Filter Variants through Large Language Models and Genetic Programming

Machine Learning-Based Manufacturing Cost Prediction from 2D Engineering Drawings via Geometric Features

Dynamic Design of Machine Learning Pipelines via Metalearning

How Usable is Automated Feature Engineering for Tabular Data?

ELATE: Evolutionary Language model for Automated Time-series Engineering

AFABench: A Generic Framework for Benchmarking Active Feature Acquisition

Successive Halving with Learning Curve Prediction via Latent Kronecker Gaussian Processes

Built with on top of