Code for the article "A Practical Evaluation of AutoML Tools for Binary, Multiclass, and Multilabel Classification".

marcelovca90/auto-ml-evaluation

A Practical Evaluation of AutoML Tools for Binary, Multiclass, and Multilabel Classification

Authors: Marcelo Aragão, Augusto Afonso, Rafaela Ferraz, Rairon Ferreira, Sávio Leite, Felipe A. P. de Figueiredo, and Samuel B. Mafra.

Abstract:

Selecting the most suitable Automated Machine Learning (AutoML) tool is
pivotal for achieving optimal performance in diverse classification tasks,
including binary, multiclass, and multilabel scenarios. The wide range of
frameworks with distinct features and capabilities complicates this decision,
necessitating systematic evaluation. This study rigorously evaluates sixteen
AutoML tools using twenty-one datasets through feature-based comparisons and
time-constrained experiments, with weighted $F_1$ score and training time as
primary metrics. Both native and label powerset representations were analyzed
for multilabel classification to provide a comprehensive understanding of
framework performance. The results demonstrate critical trade-offs between
accuracy and speed: AutoGluon and AutoKeras performed strongly in binary and
multiclass tasks, while AutoSklearn achieved superior accuracy in multilabel
classification and AutoKeras excelled in training speed. This work emphasizes
the importance of aligning tool selection with problem characteristics by
addressing the interplay between task-specific requirements and computational
constraints. The study’s open-source code and reproducible experimental
protocols ensure its value as a resource for researchers and practitioners.
This comprehensive analysis advances the understanding of AutoML capabilities
and offers actionable insights to guide tool selection, fostering informed
decision-making and future research in the field.
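
For context, here is a minimal, illustrative sketch (not taken from the repository) of two concepts the abstract highlights: the label powerset transformation, which maps each unique combination of labels to a single class so a multilabel problem can be handled as multiclass, and the weighted F1 score computed with scikit-learn. All names, data, and the classifier choice below are assumptions for illustration only.

    # Illustrative sketch (not from this repository): label powerset + weighted F1.
    import numpy as np
    from sklearn.ensemble import RandomForestClassifier
    from sklearn.metrics import f1_score
    from sklearn.model_selection import train_test_split

    rng = np.random.default_rng(42)
    X = rng.normal(size=(200, 10))                      # 200 samples, 10 features
    Y = (rng.random(size=(200, 3)) > 0.5).astype(int)   # 3 binary labels per sample

    # Label powerset: assign one class id per unique label combination,
    # turning the multilabel target into a single multiclass target.
    combos, y_powerset = np.unique(Y, axis=0, return_inverse=True)

    X_train, X_test, y_train, y_test = train_test_split(
        X, y_powerset, test_size=0.3, random_state=42)

    clf = RandomForestClassifier(random_state=42).fit(X_train, y_train)
    y_pred = clf.predict(X_test)

    # Weighted F1: per-class F1 averaged, weighted by each class's support.
    print(f1_score(y_test, y_pred, average="weighted"))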

Setup and Execution:

The tests require a Linux installation (bare-metal or virtualized).

git clone https://github.com/marcelovca90/auto-ml-evaluation.git
cd auto-ml-evaluation
conda create -n auto-ml-evaluation python=3.8
conda activate auto-ml-evaluation
chmod +x run.sh
./run.sh

Note: if you want to use Label Powerset, make sure to set LABEL_POWERSET = True in common.py.
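
For example, the relevant line in common.py would look like the excerpt below (only the flag named in the note is shown; the rest of the file is omitted):

    # common.py (excerpt): enable the label powerset representation
    LABEL_POWERSET = True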