An interactive human centered data science approach towards crime pattern analysis

Article


Qazi, N. and William Wong, B. L. 2019. An interactive human centered data science approach towards crime pattern analysis. Information Processing & Management. 56 (6), p. Art. 102066. https://doi.org/10.1016/j.ipm.2019.102066
AuthorsQazi, N. and William Wong, B. L.
Abstract

The traditional machine learning systems lack a pathway for a human to integrate their domain knowledge into the underlying machine learning algorithms. The utilization of such systems, for domains where decisions can have serious consequences (e.g. medical decision-making and crime analysis), requires the incorporation of human experts' domain knowledge. The challenge, however, is how to effectively incorporate domain expert knowledge with machine learning algorithms to develop effective models for better decision making.

In crime analysis, the key challenge is to identify plausible linkages in unstructured crime reports for the hypothesis formulation. Crime analysts painstakingly perform time-consuming searches of many different structured and unstructured databases to collate these associations without any proper visualization. To tackle these challenges and aiming towards facilitating the crime analysis, in this paper, we examine unstructured crime reports through text mining to extract plausible associations. Specifically, we present associative questioning based searching model to elicit multi-level associations among crime entities. We coupled this model with partition clustering to develop an interactive, human-assisted knowledge discovery and data mining scheme.

The proposed human-centered knowledge discovery and data mining scheme for crime text mining is able to extract plausible associations between crimes, identifying crime pattern, grouping similar crimes, eliciting co-offender network and suspect list based on spatial-temporal and behavioral similarity. These similarities are quantified through calculating Cosine, Jacquard, and Euclidean distances. Additionally, each suspect is also ranked by a similarity score in the plausible suspect list. These associations are then visualized through creating a two-dimensional re-configurable crime cluster space along with a bipartite knowledge graph.

This proposed scheme also inspects the grand challenge of integrating effective human interaction with the machine learning algorithms through a visualization feedback loop. It allows the analyst to feed his/her domain knowledge including choosing of similarity functions for identifying associations, dynamic feature selection for interactive clustering of crimes and assigning weights to each component of the crime pattern to rank suspects for an unsolved crime.
We demonstrate the proposed scheme through a case study using the Anonymized burglary dataset. The scheme is found to facilitate human reasoning and analytic discourse for intelligence analysis.

JournalInformation Processing & Management
Journal citation56 (6), p. Art. 102066
ISSN1873-5371
Year2019
PublisherElsevier
Accepted author manuscript
License
File Access Level
Anyone
Digital Object Identifier (DOI)https://doi.org/10.1016/j.ipm.2019.102066
Publication dates
Online11 Jul 2019
Publication process dates
Deposited08 Sep 2025
Copyright holder© 2019 The Authors
Permalink -

https://repository.uel.ac.uk/item/8q046

Download files


Accepted author manuscript
journalpaperaam.pdf
License: CC BY-NC-ND 4.0
File access level: Anyone

  • 294
    total views
  • 1
    total downloads
  • 294
    views this month
  • 1
    downloads this month

Export as

Related outputs

Vision Transformer Based Image Captioning for the Visually Impaired
Qazi, N., Dewaji, I. and Khan, N. 2025. Vision Transformer Based Image Captioning for the Visually Impaired. 14th International Conference on Human Interaction and Emerging Technologies: Artificial Intelligence & Future Applications, IHIET-FS 2025, June 10-12, 2025, University of East London, London, United Kingdom.. AHFE International. https://doi.org/10.54941/ahfe1005964
Unveiling the Power of Hybrid Balancing Techniques and Ensemble Stacked and Blended Classifiers for Enhanced Churn Prediction
Gaikwad, K., Berardinelli, N. and Qazi, N. 2024. Unveiling the Power of Hybrid Balancing Techniques and Ensemble Stacked and Blended Classifiers for Enhanced Churn Prediction. 16th Asian Conference on Intelligent Information and Database Systems. UAE 15 Apr 2024 - 18 Jun 2025 Springer. https://doi.org/10.1007/978-981-97-5937-8_20
Enhancing Authenticity Verification with Transfer Learning and Ensemble Techniques in Facial Feature-Based Deepfake Detection
Qazi, N. and Ahmed, I. 2024. Enhancing Authenticity Verification with Transfer Learning and Ensemble Techniques in Facial Feature-Based Deepfake Detection. 14th International Conference on Pattern Recognition Systems (ICPRS). London 15 - 18 Jul 2024 IEEE. https://doi.org/10.1109/ICPRS62101.2024.10677831
A reinforcement learning recommender system using bi-clustering and Markov Decision Process
Iftikhar, A., Ghazanfar, M. A., Ayub, M., Alahmari, S. A., Qazi, N. and Wall, J. 2024. A reinforcement learning recommender system using bi-clustering and Markov Decision Process. Expert Systems with Applications. 237 (Art.), p. 121541. https://doi.org/10.1016/j.eswa.2023.121541
Shifting the Weight: Applications of AI in Olympic Weightlifting
Bolarinwa, D., Qazi, N. and Ghazanfar, M. 2023. Shifting the Weight: Applications of AI in Olympic Weightlifting. PRDC 2023: 28th IEEE Pacific Rim International Symposium on Dependable Computing. Singapore 24 - 27 Oct 2023 IEEE. https://doi.org/10.1109/PRDC59308.2023.00051
Global impact of COVID-19 on surgeons and team members (GlobalCOST): a cross-sectional study
Jaffry, Z., Raj, S., Sallam, A., Lyman, S., Negida, A., Yiu, C. F. A., Sobti, A., Bua, N., Field, R. E., Abdalla, H., Hammad, R., Qazi, N., Singh, B., Brennan, P. A., Hussein, A., Narvani, A., Jones, A., Imam, M. A. and The OrthoGlobe Collaborative 2022. Global impact of COVID-19 on surgeons and team members (GlobalCOST): a cross-sectional study. BMJ Open. 12 (8), p. e059873. https://doi.org/10.1136/bmjopen-2021-059873
Contextual Visualization of Crime Matching Through Interactive Clustering and Bayesian Theory
Qazi, N. and William Wong, B. L. 2019. Contextual Visualization of Crime Matching Through Interactive Clustering and Bayesian Theory. in: Akhgar, B., Bayerl, P. S. and Leventakis, G. (ed.) Social Media Strategy in Policing: From Cultural Intelligence to Community Policing Springer. pp. 197–215
Associative search through Formal Concept Analysis in Criminal Intelligence Analysis
Qazi, N., William Wong, B. L., Kodagoda, N. and Adderley, R. 2017. Associative search through Formal Concept Analysis in Criminal Intelligence Analysis. IEEE International Conference on Systems, Man, and Cybernetics (SMC 2016) . IEEE. https://doi.org/10.1109/SMC.2016.7844519
Behavioural Tempo-spatial Knowledge Graph for Crime matching through Associate Questioning and Graph Theory
Qazi, N. and William Wong, B. L. 2017. Behavioural Tempo-spatial Knowledge Graph for Crime matching through Associate Questioning and Graph Theory. 2017 European Intelligence and Security Informatics Conference (EISIC). IEEE. https://doi.org/10.1109/EISIC.2017.29
Semantic-Based Image Retrieval Through Combined Classifiers of Deep Neural Network and Wavelet Decomposition of Image Signal
Qazi, N. and William Wong, B. L. 2016. Semantic-Based Image Retrieval Through Combined Classifiers of Deep Neural Network and Wavelet Decomposition of Image Signal. Proceedings of The 9th EUROSIM Congress on Modelling and Simulation, EUROSIM 2016. FinLand Scandinavian Simulation Society and Linköping University Electronic Press. https://doi.org/10.3384/ecp17142473