Eine Plattform für die Wissenschaft: Bauingenieurwesen, Architektur und Urbanistik
Automated Classification of Construction Claim Documents Using Text Mining
Claims are inevitable on construction projects and proper claim management is crucial to avoid their escalation into disputes. Claim preparation relies heavily on documentation and involves the classification of large amounts of information. Nevertheless, this process is typically performed by human experts (contract administrators) and results in substantial time, effort, cost, and human error. In fact, researchers have concluded that the conventional documentation system is inefficient and must be enhanced. In view of the important role of text mining in the construction field, there is a possibility of using text mining in claim preparation for potential improvements. To the authors’ best knowledge, no previous research has developed an automated tool to classify correspondences on construction projects for utilization in claims. Accordingly, this paper aims to improve information organization in claims management by developing a classification tool, using text mining that automatically classifies correspondences on a project as relevant or irrelevant to a claim. Four different classification algorithms, KNN, Naïve Bayes, SVM, and random forest, are trained and tested using various parameters on a training dataset of 213 documents from a construction project to determine the optimized setting for each classifier. Then, the optimized classifiers are applied to a testing dataset of 205 documents from a second construction project to analyze their performance and applicability across projects with different characteristics. The results reveal that the random forest classifier has a high recall (93%) for claim-relevant documents and average accuracy (65%) while SVM presented acceptable recall (80%) with higher accuracy (79%).
Automated Classification of Construction Claim Documents Using Text Mining
Claims are inevitable on construction projects and proper claim management is crucial to avoid their escalation into disputes. Claim preparation relies heavily on documentation and involves the classification of large amounts of information. Nevertheless, this process is typically performed by human experts (contract administrators) and results in substantial time, effort, cost, and human error. In fact, researchers have concluded that the conventional documentation system is inefficient and must be enhanced. In view of the important role of text mining in the construction field, there is a possibility of using text mining in claim preparation for potential improvements. To the authors’ best knowledge, no previous research has developed an automated tool to classify correspondences on construction projects for utilization in claims. Accordingly, this paper aims to improve information organization in claims management by developing a classification tool, using text mining that automatically classifies correspondences on a project as relevant or irrelevant to a claim. Four different classification algorithms, KNN, Naïve Bayes, SVM, and random forest, are trained and tested using various parameters on a training dataset of 213 documents from a construction project to determine the optimized setting for each classifier. Then, the optimized classifiers are applied to a testing dataset of 205 documents from a second construction project to analyze their performance and applicability across projects with different characteristics. The results reveal that the random forest classifier has a high recall (93%) for claim-relevant documents and average accuracy (65%) while SVM presented acceptable recall (80%) with higher accuracy (79%).
Automated Classification of Construction Claim Documents Using Text Mining
Lecture Notes in Civil Engineering
Desjardins, Serge (Herausgeber:in) / Poitras, Gérard J. (Herausgeber:in) / Nik-Bakht, Mazdak (Herausgeber:in) / Malaeb, Zeina (Autor:in) / Momenifar, Samaneh (Autor:in) / Rehman, Tooba (Autor:in) / Biglari, Ava (Autor:in) / Mohammed, Yasser (Autor:in) / Karim, Mohammad Rezaul (Autor:in)
Canadian Society of Civil Engineering Annual Conference ; 2023 ; Moncton, NB, Canada
Proceedings of the Canadian Society for Civil Engineering Annual Conference 2023, Volume 5 ; Kapitel: 23 ; 313-325
18.12.2024
13 pages
Aufsatz/Kapitel (Buch)
Elektronische Ressource
Englisch
Automated Classification of Construction Project Documents
Online Contents | 2002
|Automated Classification of Construction Project Documents
British Library Online Contents | 2002
|Ontology-Based Multilabel Text Classification of Construction Regulatory Documents
British Library Online Contents | 2016
|