A platform for research: civil engineering, architecture and urbanism
Data-driven scene parsing method for recognizing construction site objects in the whole image
Abstract Although efforts have been made for automated monitoring of construction sites, comprehensive understanding of a whole image remains to be a difficult task. Conventional vision-based monitoring methods have shortcomings in obtaining semantic information regarding an entire image because these methods are not scalable to the number of recognizable objects and training data. Most methods use a parametric model to recognize objects, involving cumbersome parameter tuning. This study presents the data-driven scene parsing method to recognize various objects in a construction site image. For identifying object information of a query image, the monitoring system retrieves the most relevant images to a query image using nearest neighbors and scale invariant feature transform flow matching and transfers relevant image labels to a query image. This study demonstrated the reasonable system performance in construction site images, recording 81.48% of average pixel-wise recognition rate with a small amount of similar images. The scene parsing method would enrich the raw information of a construction site image, thereby facilitating information use for various management applications.
Highlights This is the first attempt to apply a scene parsing method for construction sites. The method recognizes a varying number of construction objects classes in an image. The nonparametric nature of the method requires less tuning of a monitoring system. A high recognition rate was achieved with a small amount of training image data.
Data-driven scene parsing method for recognizing construction site objects in the whole image
Abstract Although efforts have been made for automated monitoring of construction sites, comprehensive understanding of a whole image remains to be a difficult task. Conventional vision-based monitoring methods have shortcomings in obtaining semantic information regarding an entire image because these methods are not scalable to the number of recognizable objects and training data. Most methods use a parametric model to recognize objects, involving cumbersome parameter tuning. This study presents the data-driven scene parsing method to recognize various objects in a construction site image. For identifying object information of a query image, the monitoring system retrieves the most relevant images to a query image using nearest neighbors and scale invariant feature transform flow matching and transfers relevant image labels to a query image. This study demonstrated the reasonable system performance in construction site images, recording 81.48% of average pixel-wise recognition rate with a small amount of similar images. The scene parsing method would enrich the raw information of a construction site image, thereby facilitating information use for various management applications.
Highlights This is the first attempt to apply a scene parsing method for construction sites. The method recognizes a varying number of construction objects classes in an image. The nonparametric nature of the method requires less tuning of a monitoring system. A high recognition rate was achieved with a small amount of training image data.
Data-driven scene parsing method for recognizing construction site objects in the whole image
Kim, Hongjo (author) / Kim, Kinam (author) / Kim, Hyoungkwan (author)
Automation in Construction ; 71 ; 271-282
2016-08-15
12 pages
Article (Journal)
Electronic Resource
English
Data-driven scene parsing method for recognizing construction site objects in the whole image
Online Contents | 2016
|Data-driven scene parsing method for recognizing construction site objects in the whole image
British Library Online Contents | 2016
|Enhancing action recognition of construction workers using data-driven scene parsing
DOAJ | 2018
|