A reinforcement learning recommender system using bi-clustering and Markov Decision Process
Article
Iftikhar, A., Ghazanfar, M. A., Ayub, M., Alahmari, S. A., Qazi, N. and Wall, J. 2024. A reinforcement learning recommender system using bi-clustering and Markov Decision Process. Expert Systems with Applications. 237 (Art.), p. 121541. https://doi.org/10.1016/j.eswa.2023.121541
Authors | Iftikhar, A., Ghazanfar, M. A., Ayub, M., Alahmari, S. A., Qazi, N. and Wall, J. |
---|---|
Abstract | Collaborative filtering (CF) recommender systems are static in nature and does not adapt well with changing user preferences. User preferences may change after interaction with a system or after buying a product. Conventional CF clustering algorithms only identifies the distribution of patterns and hidden correlations globally. However, the impossibility of discovering local patterns by these algorithms, headed to the popularization of bi-clustering algorithms. Bi-clustering algorithms can analyze all dataset dimensions simultaneously and consequently, discover local patterns that deliver a better understanding of the underlying hidden correlations. In this paper, we modelled the recommendation problem as a sequential decision-making problem using Markov Decision Processes (MDP). To perform state representation for MDP, we first converted user-item votings matrix to a binary matrix. Then we performed bi-clustering on this binary matrix to determine a subset of similar rows and columns. A bi-cluster merging algorithm is designed to merge similar and overlapping bi-clusters. These bi-clusters are then mapped to a squared grid (SG). RL is applied on this SG to determine best policy to give recommendation to users. Start state is determined using Improved Triangle Similarity (ITR similarity measure. Reward function is computed as grid state overlapping in terms of users and items in current and prospective next state. A thorough comparative analysis was conducted, encompassing a diverse array of methodologies, including RL-based, pure Collaborative Filtering (CF), and clustering methods. The results demonstrate that our proposed method outperforms its competitors in terms of precision, recall, and optimal policy learning. |
Journal | Expert Systems with Applications |
Journal citation | 237 (Art.), p. 121541 |
ISSN | 0957-4174 |
Year | 2024 |
Publisher | Elsevier |
Publisher's version | License File Access Level Anyone |
Digital Object Identifier (DOI) | https://doi.org/10.1016/j.eswa.2023.121541 |
Publication dates | |
Mar 2024 | |
Online | 15 Sep 2023 |
Publication process dates | |
Accepted | 08 Sep 2023 |
Deposited | 20 Nov 2023 |
Copyright holder | © 2023, The Authors |
https://repository.uel.ac.uk/item/8wyq4
Download files
Publisher's version
1-s2.0-S0957417423020432-main.pdf | ||
License: CC BY-NC-ND 4.0 | ||
File access level: Anyone |
208
total views87
total downloads20
views this month7
downloads this month