hibayesian

Awesome-AutoML-Papers

A curated list of automated machine learning papers, articles, tutorials, slides and projects
Under Apache License 2.0
By hibayesian

automl neural-architecture-search hyperparameter-optimization automated-feature-engineering

Awesome-AutoML-Papers

Awesome-AutoML-Papers is a curated list of automated machine learning papers, articles, tutorials, slides and projects. Star this repository, and then you can keep abreast of the latest developments of this booming research field. Thanks to all the people who made contributions to this project. Join us and you are welcome to be a contributor.

What is AutoML?

Automated Machine Learning (AutoML) provides methods and processes to make Machine Learning available for non-Machine Learning experts, to improve efficiency of Machine Learning and to accelerate research on Machine Learning.


Machine Learning (ML) has achieved considerable successes in recent years and an ever-growing number of disciplines rely on it. However, this success crucially relies on human machine learning experts to perform the following tasks:
+ Preprocess the data,
+ Select appropriate features,
+ Select an appropriate model family,
+ Optimize model hyperparameters,
+ Postprocess machine learning models,
+ Critically analyze the results obtained.


As the complexity of these tasks is often beyond non-ML-experts, the rapid growth of machine learning applications has created a demand for off-the-shelf machine learning methods that can be used easily and without expert knowledge. We call the resulting research area that targets progressive automation of machine learning AutoML. As a new sub-area in machine learning, AutoML has got more attention not only in machine learning but also in computer vision, natural language processing and graph computing.


There are no formal definition of AutoML. From the descriptions of most papers,the basic procedure of AutoML can be shown as the following.

AutoML approaches are already mature enough to rival and sometimes even outperform human machine learning experts. Put simply, AutoML can lead to improved performance while saving substantial amounts of time and money, as machine learning experts are both hard to find and expensive. As a result, commercial interest in AutoML has grown dramatically in recent years, and several major tech companies and start-up companies are now developing their own AutoML systems. An overview comparison of some of them can be summarized to the following table.


| Company | AutoFE | HPO | NAS |
| :-----------: | :--------: | :--------: | :--------: |
| 4paradigm | √ | √ | × |
| Alibaba | × | √ | × |
| Baidu | × | × | √ |
| Determined AI | × | √ | √ |
| Google | √ | √ | √ |
| DataCanvas | √ | √ | √ |
| H2O.ai | √ | √ | × |
| Microsoft | × | √ | √ |
| MLJAR | √ | √ | √ |
| RapidMiner | √ | √ | × |
| Tencent | × | √ | × |


Awesome-AutoML-Papers includes very up-to-date overviews of the bread-and-butter techniques we need in AutoML:
+ Automated Data Clean (Auto Clean)
+ Automated Feature Engineering (Auto FE)
+ Hyperparameter Optimization (HPO)
+ Meta-Learning
+ Neural Architecture Search (NAS)

Table of Contents


Papers
Surveys

Automated Feature Engineering

Architecture Search

Frameworks

Hyperparameter Optimization

Miscellaneous

Tutorials
Bayesian Optimization

Meta Learning

Blog

| Type | Blog Title | Link |
| :--------: | :--------: | :--------: |
| HPO | Bayesian Optimization for Hyperparameter Tuning | Link |
| Meta-Learning | Learning to learn | Link |
| Meta-Learning | Why Meta-learning is Crucial for Further Advances of Artificial Intelligence? | Link |


Books

| Year of Publication | Type | Book Title | Authors | Publisher | Link |
| :--------: | :--------: | :--------: | :--------: | :--------: | :--------: |
| 2009 | Meta-Learning | Metalearning - Applications to Data Mining | Brazdil, P., Giraud Carrier, C., Soares, C., Vilalta, R. | Springer | Download |
| 2019 | HPO, Meta-Learning, NAS | AutoML: Methods, Systems, Challenges | Frank Hutter, Lars Kotthoff, Joaquin Vanschoren | | Download |
| 2021 | Learning | Automated Machine Learning in Action | Qinquan Song, Haifeng Jin, Xia Hu | Manning Publications | Download |


Projects

| Project | Type | Language | License | Link |
| :--------: | :--------: | :--------: | :--------: | :--------: |
| AdaNet | NAS | Python | Apache-2.0 | Github |
| Advisor | HPO | Python | Apache-2.0 | Github |
| AMLA | HPO, NAS | Python | Apache-2.0 | Github |
| ATM | HPO | Python | MIT | Github |
| Auger | HPO | Python | Commercial | Homepage |
| auptimizer | HPO, NAS | Python (support R script) | GPL-3.0 | Github |
| Auto-Keras | NAS | Python | License | Github |
| AutoML Vision | NAS | Python | Commercial | Homepage |
| AutoML Video Intelligence | NAS | Python | Commercial | Homepage |
| AutoML Natural Language | NAS | Python | Commercial | Homepage |
| AutoML Translation | NAS | Python | Commercial | Homepage |
| AutoML Tables | AutoFE, HPO | Python | Commercial | Homepage |
| HyperGBM | HPO| Python | Python | Github |
| HyperKeras | NAS | Python | Python | Github |
| Hypernets | HPO, NAS | Python | Python | Github |
| auto-sklearn | HPO | Python | License | Github |
| auto_ml | HPO | Python | MIT | Github |
| BayesianOptimization | HPO | Python | MIT | Github |
| BayesOpt | HPO | C++ | AGPL-3.0 | Github |
| comet | HPO | Python | Commercial | Homepage |
| DataRobot | HPO | Python | Commercial | Homepage |
| DEvol | NAS | Python | MIT | Github |
| DeepArchitect | NAS | Python | MIT | Github |
| Determined | HPO, NAS | Python | Apache-2.0 | Github |
| Driverless AI | AutoFE | Python | Commercial | Homepage |
| FAR-HO | HPO | Python | MIT | Github |
| H2O AutoML | HPO | Python, R, Java, Scala | Apache-2.0 | Github |
| HpBandSter | HPO | Python | BSD-3-Clause | Github |
| HyperBand | HPO | Python | License | Github |
| Hyperopt | HPO | Python | License | Github |
| Hyperopt-sklearn | HPO | Python | License | Github |
| Hyperparameter Hunter | HPO | Python | MIT | Github |
| Katib | HPO | Python | Apache-2.0 | Github |
| MateLabs | HPO | Python | Commercial | Github |
| Milano | HPO | Python | Apache-2.0 | Github |
| MLJAR | AutoFE, HPO, NAS | Python | MIT | Github |
| mlr3automl | HPO | R | LGPL-3.0 | GitHub |
| nasbot | NAS | Python | MIT | Github |
| neptune | HPO | Python | Commercial | Homepage |
| NNI | HPO, NAS | Python | MIT | Github |
| Oboe | HPO | Python | BSD-3-Clause | Github |
| Optunity | HPO | Python | License | Github |
| R2.ai | HPO | | Commercial | Homepage |
| RBFOpt | HPO | Python | License | Github |
| RoBO | HPO | Python | BSD-3-Clause | Github |
| Scikit-Optimize | HPO | Python | License | Github |
| SigOpt | HPO | Python | Commercial | Homepage |
| SMAC3 | HPO | Python | License | Github |
| TPOT | AutoFE, HPO | Python | LGPL-3.0 | Github |
| TransmogrifAI | HPO | Scala | BSD-3-Clause | Github |
| Tune | HPO | Python | Apache-2.0 | Github |
| Xcessiv | HPO | Python | Apache-2.0 | Github |
| SmartML | HPO | R | GPL-3.0 | Github |
| MLBox | AutoFE, HPO | Python | BSD-3 License | Github |
| AutoAI Watson | AutoFE, HPO | | Commercial | Homepage |


Slides

| Type | Slide Title | Authors | Link |
| :--------: | :--------: | :--------: | :--------: |
| AutoFE | Automated Feature Engineering for Predictive Modeling | Udyan Khurana, etc al. | Download |
| HPO | A Tutorial on Bayesian Optimization for Machine Learning | Ryan P. Adams | Download |
| HPO | Bayesian Optimisation | Gilles Louppe | Download |


Acknowledgement

Special thanks to everyone who contributed to this project.


| Name | Bio |
| :--------: | :--------: |
| Alexander Robles | PhD Student @UNICAMP-Brazil |
| derekflint | |
| endymecy | Senior Researcher @Tencent |
| Eric | |
| Erin LeDell | Chief Machine Learning Scientist @H2O.ai |
| fwcore | |
| Gaurav Mittal | |
| Hernan Ceferino Vazquez | PhD, Data Science Expert @MercadoLibre |
| Kaustubh Damania | |
| Lilian Besson | PhD Student @CentraleSupélec |
| 罗磊 | |
| Marc | |
| Mohamed Maher | |
| Neil Conway | CTO @Determined AI |
| Richard Liaw | PhD Student @UC Berkeley|
| Randy Olson | Lead Data Scientist @LifeEGX |
| Slava Kurilyak | Founder, CEO @Produvia |
| Saket Maheshwary | AI Researcher |
| shaido987 | |
| sophia-wright-blue | |
| tengben0905 | |
| xuehui | @Microsoft |
| Yihui He | Grad Student @CMU |


Contact & Feedback

If you have any suggestions (missing papers, new papers, key researchers or typos), feel free to pull a request. Also you can mail to:
+ Mark Lin ([email protected]).


Licenses

Awesome-AutoML-Papers is available under Apache Licenses 2.0.