PSM: Proactive Spill Mitigation in PySpark
Conference paper
Karami, A. 2025. PSM: Proactive Spill Mitigation in PySpark. 12th IEEE International Conference on Data Science and Advanced Analytics. Birmingham, UK 09 - 12 Oct 2025 IEEE.
Authors | Karami, A. |
---|---|
Type | Conference paper |
Abstract | Apache Spark's performance is critically dependent on efficient in-memory computation; however, insufficient executor memory frequently results in costly disk spills, severely degrading performance. Traditional spill mitigation techniques, including static configuration tuning, reactive Adaptive Query Execution (AQE), and internal spill handling mechanisms, exhibit limitations in addressing the dynamic and fine-grained nature of memory pressure in complex and large-scale workloads. This paper introduces Proactive Spill Mitigation (PSM), a novel methodology designed to anticipate and mitigate memory spills in PySpark applications before they occur. PSM leverages real-time task and executor metrics, accessible via the Spark Listener API, as input to a machine learning model trained to predict the likelihood and potential severity of impending spills. Upon detecting a high spill risk, a control mechanism triggers preemptive actions, such as checkpointing intermediate results or dynamically adjusting partition counts, to alleviate memory pressure. Experimental evaluation using the TPC-DS and HiBench Sort benchmarks demonstrates that PSM significantly reduces both execution time and the volume of data spilled to disk across various cluster sizes, outperforming default Spark configurations, statically tuned setups, and configurations with AQE enabled. This work establishes the efficacy of predictive control for enhancing Spark's memory management and improving application performance in dynamic environments. |
Year | 2025 |
Conference | 12th IEEE International Conference on Data Science and Advanced Analytics |
Publisher | IEEE |
Accepted author manuscript | License File Access Level Anyone |
Publication process dates | |
Accepted | 05 Aug 2025 |
Deposited | 21 Aug 2025 |
Journal citation | p. In press |
ISSN | 2766-4112 |
2472-1573 | |
Web address (URL) of conference proceedings | https://ieeexplore.ieee.org/xpl/conhome/1806384/all-proceedings |
Copyright holder | © 2025 IEEE |
Copyright information | Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
https://repository.uel.ac.uk/item/900xv
Restricted files
Accepted author manuscript
78
total views1
total downloads78
views this month1
downloads this month