A Comparative Study of Sales Prediction Using Machine Learning Models: Integration of PySpark and Power BI
Conference paper
Sharif, S., Theeng Tamang, M., Nepal, N. and Elmedany, W. 2024. A Comparative Study of Sales Prediction Using Machine Learning Models: Integration of PySpark and Power BI. 2024 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies.
Authors | Sharif, S., Theeng Tamang, M., Nepal, N. and Elmedany, W. |
---|---|
Type | Conference paper |
Abstract | Retail management requires accurate sales forecasting for strategic planning, inventory control, and revenue maximisation. In this research, we are predicting sales using PySpark-built ML models and the Big Mart dataset. We evaluate and demonstrate the prediction skills of numerous machine learning algorithms, concentrating on XGBoost. Big Mart offers item weight, visibility, type, and outlet data. We use these properties as prediction model features. PySpark, a strong distributed computing platform, manages massive datasets for analysis and model training. In massive trials, we test decision trees (DT), XGBoost, linear regressions (LR), and random forests (RF). Training and testing enhance model accuracy. RMSE and R-squared measure our model quality. Our metrics evaluate the model’s data fit and prediction accuracy. XGBoost performed best with an RMSE of 1081 and an R-squared of 0.59. The XGBoost algorithm accurately predicts Big Mart sales. The model performs well because of its ensemble learning and understanding of intricate dataset links. We also used Power BI to present analytical insights, helping decision-makers design sales-estimated business plans. This study employed several machine learning algorithms, XGBoost gave the best performance. This research provides insights into how organisations might use these technologies to improve resource allocation and inventory management decisionmaking. |
Year | 2024 |
Conference | 2024 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies |
Accepted author manuscript | License File Access Level Repository staff only |
Publication process dates | |
Completed | Nov 2024 |
Accepted | 02 Nov 2024 |
Deposited | 20 Dec 2024 |
Copyright holder | © 2024 The Authors |
https://repository.uel.ac.uk/item/8yvy3
5
total views0
total downloads5
views this month0
downloads this month