A Comparative Study of Sales Prediction Using Machine Learning Models: Integration of PySpark and Power BI
Conference paper
Sharif, S., Theeng Tamang, M., Nepal, N. and Elmedany, W. 2024. A Comparative Study of Sales Prediction Using Machine Learning Models: Integration of PySpark and Power BI. 2024 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies. IEEE. https://doi.org/10.1109/3ict64318.2024.10824620
Authors | Sharif, S., Theeng Tamang, M., Nepal, N. and Elmedany, W. |
---|---|
Type | Conference paper |
Abstract | Retail management requires accurate sales forecasting for strategic planning, inventory control, and revenue maximisation. In this research, we are predicting sales using PySpark-built ML models and the Big Mart dataset. We evaluate and demonstrate the prediction skills of numerous machine learning algorithms, concentrating on XGBoost. Big Mart offers item weight, visibility, type, and outlet data. We use these properties as prediction model features. PySpark, a strong distributed computing platform, manages massive datasets for analysis and model training. In massive trials, we test decision trees (DT), XGBoost, linear regressions (LR), and random forests (RF). Training and testing enhance model accuracy. RMSE and R-squared measure our model quality. Our metrics evaluate the model’s data fit and prediction accuracy. XGBoost performed best with an RMSE of 1081 and an R-squared of 0.59. The XGBoost algorithm accurately predicts Big Mart sales. The model performs well because of its ensemble learning and understanding of intricate dataset links. We also used Power BI to present analytical insights, helping decision-makers design sales-estimated business plans. This study employed several machine learning algorithms, XGBoost gave the best performance. This research provides insights into how organisations might use these technologies to improve resource allocation and inventory management decisionmaking. |
Year | 2024 |
Conference | 2024 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies |
Publisher | IEEE |
Accepted author manuscript | License File Access Level Anyone |
Publication dates | |
Online | 13 Jan 2025 |
Publication process dates | |
Completed | Nov 2024 |
Accepted | 02 Nov 2024 |
Deposited | 20 Dec 2024 |
Journal | 2024 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT) |
Journal citation | pp. 137-142 |
ISSN | 2770-7466 |
2770-7458 | |
Book title | 2024 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT) |
ISBN | 979-8-3315-3313-7 |
979-8-3315-3314-4 | |
Digital Object Identifier (DOI) | https://doi.org/10.1109/3ict64318.2024.10824620 |
Web address (URL) of conference proceedings | https://ieeexplore.ieee.org/xpl/conhome/10823647/proceeding |
Copyright holder | © 2024 IEEE |
Additional information | Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
https://repository.uel.ac.uk/item/8yvy3
Restricted files
Accepted author manuscript
144
total views0
total downloads27
views this month0
downloads this month