Low-code autoML-augmented data pipeline - a review and experiments
Tiedosto(t)
Rinnakkaistallenteen versio
published versionPäivämäärä
2021Tekijä(t)
Yksilöllinen tunniste
10.1088/1742-6596/1828/1/012015Metadata
Näytä kaikki kuvailutiedotLisätietoa
Rinnakkaistallenne
Viittaus
Gain, Ulla. Hotti, Virpi. (2021). Low-code autoML-augmented data pipeline - a review and experiments. Journal of physics : Conference series, 1828, 012015. 10.1088/1742-6596/1828/1/012015.Oikeudet
Tiivistelmä
There is a lack of knowledge concerning the low-code autoML (automated machine learning) frameworks that can be used to enrich data for several purposes concerning either data engineering or software engineering. In this paper, 34 autoML frameworks have been reviewed based on the latest commits and augmentation properties of their GitHub content. The PyCaret framework was the result of the review due to requirements concerning adaptability by Google Colaboratory (Colab) and the BI (business intelligence) tool. Finally, the low-code autoML-augmented data pipeline from raw data to dashboards and low-code apps has been drawn based on the experiments concerned classifications of the "Census Income" dataset. The constructed pipeline preferred the same data to be a ground for different reports, dashboards, and applications. However, the constructed low-code autoML-augmented data pipeline contains changeable building blocks such as libraries and visualisations.