A platform for research: civil engineering, architecture and urbanism
Unsupervised Machine Learning for Augmented Data Analytics of Building Codes
Existing automated code checking methods/tools are unable to automatically analyze and represent all types of requirements (e.g., requirements that are too complex or that require human judgement). Recent efforts in the area of augmented data analytics have proposed the use of templates to facilitate the analysis of text. However, most of these efforts have constructed such templates manually, which is labor-intensive. More importantly, it is difficult for manually-developed templates to capture the linguistic variations in building codes. More research is, thus, needed to automate the generation of templates to support the tagging and extraction of information from building codes. To address this need, this paper proposes an unsupervised machine-learning based method to extract sentence templates that describe syntactic and semantic features and patterns from building codes. The proposed method is composed of four main steps: (1) data preprocessing; (2) identifying the different groups of sentence fragments using clustering; (3) identifying the fixed parts and the slots in the templates based on the syntactic and semantic patterns of the sentence fragment groups; and (4) evaluating the extracted templates. The proposed method was implemented and tested on a corpus of text from the International Building Code. An accuracy of 0.76 was achieved.
Unsupervised Machine Learning for Augmented Data Analytics of Building Codes
Existing automated code checking methods/tools are unable to automatically analyze and represent all types of requirements (e.g., requirements that are too complex or that require human judgement). Recent efforts in the area of augmented data analytics have proposed the use of templates to facilitate the analysis of text. However, most of these efforts have constructed such templates manually, which is labor-intensive. More importantly, it is difficult for manually-developed templates to capture the linguistic variations in building codes. More research is, thus, needed to automate the generation of templates to support the tagging and extraction of information from building codes. To address this need, this paper proposes an unsupervised machine-learning based method to extract sentence templates that describe syntactic and semantic features and patterns from building codes. The proposed method is composed of four main steps: (1) data preprocessing; (2) identifying the different groups of sentence fragments using clustering; (3) identifying the fixed parts and the slots in the templates based on the syntactic and semantic patterns of the sentence fragment groups; and (4) evaluating the extracted templates. The proposed method was implemented and tested on a corpus of text from the International Building Code. An accuracy of 0.76 was achieved.
Unsupervised Machine Learning for Augmented Data Analytics of Building Codes
Zhang, Ruichuan (author) / El-Gohary, Nora (author)
ASCE International Conference on Computing in Civil Engineering 2019 ; 2019 ; Atlanta, Georgia
2019-06-13
Conference paper
Electronic Resource
English
Unsupervised Machine Learning for Augmented Data Analytics of Building Codes
British Library Conference Proceedings | 2019
|Analysis and Optimisation of Building Efficiencies through Data Analytics and Machine Learning
BASE | 2022
|Machine learning in concrete strength simulations: Multi-nation data analytics
Online Contents | 2014
|Building Codes: Housing Codes:
NTIS | 1973
Wiley | 2009
|