Predicting Occupational Accident Risk from Textual Data: A Systematic Review of Machine Learning Application
1Department of Industrial and Systems Engineering, Institut Teknologi Sepuluh Nopember, Indonesia
2Department of Safety Engineering, Politeknik Perkapalan Negeri Surabaya, Indonesia
*Author to whom correspondence should be addressed:
E-mail: 7010231004@student.its.ac.id (AFR)
E-mail: 7010231004@student.its.ac.id (AFR)
Received: June 05, 2025 | Revised: December 23, 2025 | Accepted: March 10, 2026 | Published: June 2026
Abstract
Occupational accident risk remains a persistent concern, especially in high-risk industries. However, the optimal use of textual data in safety analysis is still limited. This study aims to systematically review the application of machine learning (ML) and natural language processing (NLP) in predicting occupational accident risk using textual data. A systematic literature review was conducted using the PRISMA framework and PICOS criteria, with Scopus as the primary database. From 1238 initial articles, 21 were selected for in-depth analysis. The results indicate that algorithms such as Random Forest, Support Vector Machine, and Neural Networks are frequently used, achieving up to 94% accuracy when integrated with NLP techniques. This review highlights the growing potential of ML and NLP in developing proactive and data-driven occupational safety strategies. Further research is recommended to address limitations in data quality, model generalization and validation across various sectors
Keywords
machine learning; natural language processing; occupational accident; predicting; risk management
Available Repositories
Share Article
Article Metrics
--
Views
--
Downloads
--
Citations
Export Citation
Full Text









