A review of the progress in machine vision-based crack detection and identification technology for asphalt pavements

Figures (8) Tables (2)

Figure 1.
Classification of asphalt pavement crack detection techniques.
Figure 2.
Overview of digital image processing techniques.
Figure 3.
Image acquisition using a CCD camera.
Figure 4.
Different devices for image acquisition.
Figure 5.
Flowchart of image denoising using filtering.
Figure 6.
Development of edge detection technology.
Figure 7.
Evolution history chart of deep learning models.
Figure 8.
Development process flowchart of object detection technology based on deep learning.

Dataset name	Description	Application in literature
Massachusetts dataset	Contains 1,711 road images and 151 building images, used for road and building extraction	Wei et al.^[71] optimized CNN models to extract road categories from aerial images. Alshehhi et al.^[72] implemented a patch-based CNN model to extract roads and building parts from remote sensing images.
TerraSAR-X dataset	Used for road extraction in SAR images, with 20% for testing and 80% for training	Henry et al.^[73] used DeepLabV3+ and Deep Residual U-Net to extract road parts from SAR images.
DeepGlobe dataset	Contains 622 test images, 622 validation images, and 4,971 training images	Xie et al.^[74] applied a new global perception framework based on higher-order spatial information (HsgNet) for road extraction.
Google Earth dataset	Contains 567 test images and 2,213 training images	Cheng et al.^[75] proposed the CasNet deep learning model to detect road categories and extract road centerlines. Shi et al.^[76] implemented GAN models using data augmentation procedures to generate high-resolution segmentation maps.
DigitalGlobe dataset	Collected by DigitalGlobe satellite, contains 6,226 images	Zhou et al.^[77] introduced the D-LinkNet model for road semantic segmentation in remote sensing images. Doshi^[78] used a ResNet-based ensemble model to extract roads from satellite images.

Table 1.

Common online public datasets.

Key technology	Main contribution	Performance index	Ref.
CNN model optimized based on road structure	Combined fusion and deconvolution layers to obtain structured output, proposed a road structure-based loss function	F1 Score: 66.2%; recall: 72.9%; precision: 60.6%; accuracy: 92.4%	Wei et al.^[71]
A novel architecture based on FCN, called U-shaped FCN	Proposed U-shaped FCN model, data augmentation to improve training efficiency	F1 Score: 89.6%; recall: 86.8%; precision: 92.5%; accuracy: 95.2%	Kestur et al.^[99]
U-Net fully convolutional network	First use of U-Net for detecting concrete cracks, U-Net is superior to DCNN in robustness, effectiveness, and accuracy	F1 Score: 90%; recall: 91%; precision 90%	Zhenqing Liu^[86]
Technology based on improved deep encoder-decoder neural networks	networks Enhanced model, using ELU function and LM method to improve overall output accuracy	F1 Score: 85.7%; recall: 86.1%; precision: 85.4%	Panboonyuen et al.^[100]
CNN-based automatic bridge crack detection model	Using an end-to-end model, parallel use of three Atrous convolutions to reduce computational complexity	F1 Score: 87.7%; precision: 78.1%; accuracy: 98.4%	Xu & Xu^[101]

Table 2.

The development of technology in crack detection.