Efficient Machine Learning Pipeline Automation Using Tpot And Pycaret
DOI:
https://doi.org/10.63278/mme.vi.1616Keywords:
Auto ML, Data, Preprocessing, Machine Learning, Hyperparameters, Feature selection, Report generation, Data Visualization.Abstract
The paper falls under the domain of Automated Machine Learning (AutoML) and Data Preprocessing. The existing system employs the Tree-based Pipeline Optimization Tool (TPOT), a Python-based AutoML framework that automates tasks such as algorithm selection, hyperparameter tuning, and data preprocessing, primarily for regression tasks. However, its functionality is limited to regression-based optimization. To overcome this limitation, the proposed work integrates PyCaret, a more versatile AutoML library that supports both classification and regression. PyCaret enhances the machine learning pipeline with robust preprocessing features, including feature engineering, error handling, and class imbalance management. It enables users to train multiple models and automatically selects the best-performing one, streamlining the entire workflow and making machine learning more accessible. The system achieves an impressive accuracy of 97.8%. Future work may include extending support to unsupervised and deep learning tasks, integrating cloud-based scalability, and incorporating real-time data processing for broader applicability.
Downloads
How to Cite
Issue
Section
License
Copyright (c) 2025 N. Arikaran, G. Dharanya, M. Kanchana, B. Poojitha, A. Bhuvanesh, S. Kamalesh, Arya Ejoumalai

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their published articles online (e.g., in institutional repositories or on their website, social networks like ResearchGate or Academia), as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).

Except where otherwise noted, the content on this site is licensed under a Creative Commons Attribution 4.0 International License.



According to the