Low-code autoML-augmented data pipeline - a review and experiments
Self archived versionpublished version
MetadataShow full item record
CitationGain, Ulla. Hotti, Virpi. (2021). Low-code autoML-augmented data pipeline - a review and experiments. Journal of physics : Conference series, 1828, 012015. 10.1088/1742-6596/1828/1/012015.
There is a lack of knowledge concerning the low-code autoML (automated machine learning) frameworks that can be used to enrich data for several purposes concerning either data engineering or software engineering. In this paper, 34 autoML frameworks have been reviewed based on the latest commits and augmentation properties of their GitHub content. The PyCaret framework was the result of the review due to requirements concerning adaptability by Google Colaboratory (Colab) and the BI (business intelligence) tool. Finally, the low-code autoML-augmented data pipeline from raw data to dashboards and low-code apps has been drawn based on the experiments concerned classifications of the "Census Income" dataset. The constructed pipeline preferred the same data to be a ground for different reports, dashboards, and applications. However, the constructed low-code autoML-augmented data pipeline contains changeable building blocks such as libraries and visualisations.