Eine Plattform für die Wissenschaft: Bauingenieurwesen, Architektur und Urbanistik
Automatically Categorizing Construction Accident Narratives Using the Deep-Learning Model with a Class-Imbalance Treatment Technique
Learning from prior incidents is crucial for improving safety, particularly in the construction industry where fatalities and injuries are frequent. High-precision classification of construction accident narratives is a laborious, time-consuming process that requires substantial domain expertise. However, automatic text classification had fallen short of expectations due to a lack of high-quality data sets, inadequate semantic interpretation, and primitive model architecture. To address these issues, this study developed a state-of-the-art text classification (TC) model to extract construction knowledge and classify construction accident narratives into predefined categories. The architecture of the TC deep-learning model was built based on the pretrained instruction-based omnifarious representations (INSTRUCTOR). A class-imbalance treatment (CIT) technique incorporating focal loss and weighted random sampling was embedded to make the model concentrate on hard samples and minority classes. The retrained and fine-tuned INSTRUCTOR-CIT model achieved an F1 score of 82.22% for the benchmark data set containing 1,000 accident narratives from the Occupational Health and Safety Administration (OSHA). Impressively, on a larger benchmark data set of 4,770 OSHA accident narratives labeled by another official system, the model achieved an F1 score of 94.84%, highlighting its generality. Furthermore, the experimental results demonstrated that our model was superior to existing methods with less preprocessing and higher accuracy. Finally, the contribution to construction project management was discussed to enhance unstructured data management in the construction industry. The findings of this study contribute to effective management practices and assist construction professionals focus on value-added tasks such as decision making and corrective action planning.
Automatically Categorizing Construction Accident Narratives Using the Deep-Learning Model with a Class-Imbalance Treatment Technique
Learning from prior incidents is crucial for improving safety, particularly in the construction industry where fatalities and injuries are frequent. High-precision classification of construction accident narratives is a laborious, time-consuming process that requires substantial domain expertise. However, automatic text classification had fallen short of expectations due to a lack of high-quality data sets, inadequate semantic interpretation, and primitive model architecture. To address these issues, this study developed a state-of-the-art text classification (TC) model to extract construction knowledge and classify construction accident narratives into predefined categories. The architecture of the TC deep-learning model was built based on the pretrained instruction-based omnifarious representations (INSTRUCTOR). A class-imbalance treatment (CIT) technique incorporating focal loss and weighted random sampling was embedded to make the model concentrate on hard samples and minority classes. The retrained and fine-tuned INSTRUCTOR-CIT model achieved an F1 score of 82.22% for the benchmark data set containing 1,000 accident narratives from the Occupational Health and Safety Administration (OSHA). Impressively, on a larger benchmark data set of 4,770 OSHA accident narratives labeled by another official system, the model achieved an F1 score of 94.84%, highlighting its generality. Furthermore, the experimental results demonstrated that our model was superior to existing methods with less preprocessing and higher accuracy. Finally, the contribution to construction project management was discussed to enhance unstructured data management in the construction industry. The findings of this study contribute to effective management practices and assist construction professionals focus on value-added tasks such as decision making and corrective action planning.
Automatically Categorizing Construction Accident Narratives Using the Deep-Learning Model with a Class-Imbalance Treatment Technique
J. Constr. Eng. Manage.
Shuang, Qing (Autor:in) / Liu, Xishan (Autor:in) / Wang, Zhaojing (Autor:in) / Xu, Xinxin (Autor:in)
01.09.2024
Aufsatz (Zeitschrift)
Elektronische Ressource
Englisch
An Ensemble Approach for Classification of Accident Narratives
British Library Conference Proceedings | 2017
|