A review of the progress in machine vision-based crack detection and identification technology for asphalt pavements

Songling Huang; Hao Chen; Lingbo Yan; Xiaoling Zou; Bin Li; Yanqiu Bi; Songling Huang; Hao Chen; Lingbo Yan; Xiaoling Zou; Bin Li; Yanqiu Bi

doi:10.48130/dts-0025-0006

2025 Volume 4

Article Contents

Next Previous

ARTICLE Open Access

A review of the progress in machine vision-based crack detection and identification technology for asphalt pavements

1.
National & Local Joint Engineering Research Center of Transportation Civil Engineering Materials, Chongqing Jiaotong University, Chongqing 400074, China
2.
School of Civil Engineering, Chongqing Jiaotong University, Chongqing 400074, China

More Information

Corresponding author: biyanqiu@chd.edu.cn

Received: 15 October 2024
Revised: 08 January 2025
Accepted: 05 February 2025
Published online: 31 March 2025
Digital Transportation and Safety 2025, 4(1): 65−79 | Cite this article

Abstract

With the aging of transportation infrastructure and the increasing frequency of use, the detection and identification of cracks in asphalt pavements are crucial for ensuring road safety and maintenance efficiency. Traditional manual inspection methods are not only inefficient and limited in accuracy but also susceptible to subjective factors and environmental conditions. In contrast, machine vision-based crack detection technology enhances the efficiency and reliability of detection through automated image acquisition and analysis processes. This article reviews the latest advancements in machine vision-based crack detection technology for asphalt pavements, with a particular focus on the applications of digital image processing and deep learning. Although image processing-based methods perform well in detecting cracks against simple backgrounds, they exhibit poor robustness under complex lighting and background conditions. On the other hand, deep learning-based methods, while effectively handling complex image data, rely on large amounts of annotated data and significant computational resources. Through critical analysis, the article evaluates the strengths and weaknesses of existing technologies and looks forward to future research directions that integrate multiple sensing data and automated data annotation tools, aiming to further advance and innovate road maintenance technology.
- Asphalt pavement crack detection,
- Machine vision,
- Digital image processing,
- Deep learning,
- Road safety maintenance
Rights and permissions
Copyright: © 2025 by the author(s). Published by Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.

References

[1]	Yao Y, Tung STE, Glišić B. 2014. Crack detection and characterization techniques—an overview. Structural Control and Health Monitoring 21:1387−413 doi: 10.1002/stc.1655 CrossRef Google Scholar
[2]	Liu J, Gu J, Luo S. 2022. Research on road crack detection based on machine vision. 2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), October 3−5, 2022, Beijing, China. USA: IEEE. pp. 543−47. doi: 10.1109/IAEAC54830.2022.9929645
[3]	Koch C, Georgieva K, Kasireddy V, Akinci B, Fieguth P. 2015. A review on computer vision based defect detection and condition assessment of concrete and asphalt civil infrastructure. Advanced Engineering Informatics 29:196−210 doi: 10.1016/j.aei.2015.01.008 CrossRef Google Scholar
[4]	Labudzki R, Legutko S, Raos P. 2014. The essence and applications of machine vision. Tehnicki Vjesnik [Technical Gazette] 21:903−9 Google Scholar
[5]	Abdel-Qader I, Abudayyeh O, Kelly ME. 2003. Analysis of edge-detection techniques for crack identification in bridges. Journal of Computing in Civil Engineering 17:255−63 doi: 10.1061/(asce)0887-3801(2003)17:4(255) CrossRef Google Scholar
[6]	Kamaliardakani M, Sun L, Ardakani MK. 2016. Sealed-crack detection algorithm using heuristic thresholding approach. Journal of Computing in Civil Engineering 30:04014110 doi: 10.1061/(asce)cp.1943-5487.0000447 CrossRef Google Scholar
[7]	Li Q, Zou Q, Zhang D, Mao Q. 2011. FoSA: F* Seed-growing Approach for crack-line detection from pavement images. Image and Vision Computing 29:861−72 doi: 10.1016/j.imavis.2011.10.003 CrossRef Google Scholar
[8]	Guo WY, Wang XF, Xia XZ. 2014. Two-dimensional Otsu' s thresholding segmentation method based on grid box filter. Optik 125:5234−40 doi: 10.1016/j.ijleo.2014.05.003 CrossRef Google Scholar
[9]	Kanopoulos N, Vasanthavada N, Baker RL. 1988. Design of an image edge detection filter using the Sobel operator. IEEE Journal of Solid-State Circuits 23:358−67 doi: 10.1109/4.996 CrossRef Google Scholar
[10]	Wang D, Zhou S. 2008. Color image recognition method based on the prewitt operator. 2008 International Conference on Computer Science and Software Engineering, December 12−14, 2008, Wuhan, China. USA: IEEE. pp. 170−73. doi: 10.1109/CSSE.2008.567
[11]	Li ES, Zhu SL, Zhu BS, Zhao Y, Xia CG, et al. 2009. An adaptive edge-detection method based on the canny operator. 2009 International Conference on Environmental Science and Information Application Technology, July 4−5, 2009, Wuhan, China. USA: IEEE. pp. 465−69. doi: 10.1109/ESIAT.2009.49
[12]	Cao W, Liu Q, He Z. 2020. Review of pavement defect detection methods. IEEE Access 8:14531−44 doi: 10.1109/ACCESS.2020.2966881 CrossRef Google Scholar
[13]	Sinha SK, Fieguth PW. 2006. Morphological segmentation and classification of underground pipe images. Machine Vision and Applications 17:21−31 doi: 10.1007/s00138-005-0012-0 CrossRef Google Scholar
[14]	Fujita Y, Hamamoto Y. 2011. A robust automatic crack detection method from noisy concrete surfaces. Machine Vision and Applications 22:245−54 doi: 10.1007/s00138-009-0244-5 CrossRef Google Scholar
[15]	Azouz Z, Honarvar Shakibaei Asli B, Khan M. 2023. Evolution of crack analysis in structures using image processing technique: a review. Electronics 12:3862 doi: 10.3390/electronics12183862 CrossRef Google Scholar
[16]	Zhu J, Zhong J, Ma T, Huang X, Zhang W, et al. 2022. Pavement distress detection using convolutional neural networks with images captured via UAV. Automation in Construction 133:103991 doi: 10.1016/j.autcon.2021.103991 CrossRef Google Scholar
[17]	Ayenu-Prah A, Attoh-Okine N. 2008. Evaluating pavement cracks with bidimensional empirical mode decomposition. EURASIP Journal on Advances in Signal Processing 2008:861701 doi: 10.1155/2008/861701 CrossRef Google Scholar
[18]	Vivekananthan V, Vignesh R, Vasanthaseelan S, Joel E, Kumar KS. 2023. Concrete bridge crack detection by image processing technique by using the improved OTSU method. Materials Today: Proceedings 74:1002−7 doi: 10.1016/j.matpr.2022.11.356 CrossRef Google Scholar
[19]	Li Y, Yang N. 2023. An improved crack identification method for asphalt concrete pavement. Applied Sciences 13:8696 doi: 10.3390/app13158696 CrossRef Google Scholar
[20]	Liu F, Liu J, Wang L. 2022. Asphalt pavement crack detection based on convolutional neural network and infrared thermography. IEEE Transactions on Intelligent Transportation Systems 23:22145−55 doi: 10.1109/TITS.2022.3142393 CrossRef Google Scholar
[21]	Eskandari Torbaghan M, Li W, Metje N, Burrow M, Chapman DN, et al. 2020. Automated detection of cracks in roads using ground penetrating radar. Journal of Applied Geophysics 179:104118 doi: 10.1016/j.jappgeo.2020.104118 CrossRef Google Scholar
[22]	Chapeleau X, Blanc J, Hornych P, Gautier JL, Carroget J. 2014. Use of distributed fiber optic sensors to detect damage in a pavement. In Asphalt Pavements, ed. Kim YR. 1^st Edition. London: CRC Press. doi: 10.1201/b17219-60
[23]	Xu W, Tang ZM, Xu D, Wu GX. 2015. Integrating multi-features fusion and gestalt principles for pavement crack detection. Journal of Computer-Aided Design & Computer Graphics 27(1):147−56 Google Scholar
[24]	Zhang Y, Zhou H. 2012. Automatic pavement cracks detection and classification using radon transform. Journal of Information and Computational Science 9:5241−7 Google Scholar
[25]	Zhang A, Li QJ, Wang KCP, Qiu S. 2013. Matched filtering algorithm for pavement cracking detection. Transportation Research Record: Journal of the Transportation Research Board 2367:30−42 doi: 10.3141/2367-04 CrossRef Google Scholar
[26]	Sun X, Huang J, Liu W, Xu M. 2012. Pavement crack characteristic detection based on sparse representation. EURASIP Journal on Advances in Signal Processing 2012:191 doi: 10.1186/1687-6180-2012-191 CrossRef Google Scholar
[27]	Stentoumis C, Protopapadakis E, Doulamis A, Doulamis N. 2016. A holistic approach for inspection of civil infrastructures based on computer vision techniques. The International Archives of the Photogrammetry, Remote Sensing and Spatial lnformation Sciences, 2016 XXlll ISPRS Congress,12−19 July 2016, Prague, Czech Republic. Volume XL1-B5. pp. 131−38. doi: 10.5194/isprsarchives-xli-b5-131-2016
[28]	Ni T, Zhou R, Gu C, Yang Y. 2020. Measurement of concrete crack feature with Android smartphone APP based on digital image processing techniques. Measurement 150:107093 doi: 10.1016/j.measurement.2019.107093 CrossRef Google Scholar
[29]	Hoang ND, Nguyen QL. 2018. Metaheuristic optimized edge detection for recognition of concrete wall cracks: a comparative study on the performances of Roberts, prewitt, canny, and sobel algorithms. Advances in Civil Engineering 2018:7163580 doi: 10.1155/2018/7163580 CrossRef Google Scholar
[30]	Hu C, He L, Tao J, Wang M, Zhang D. 2022. Asphalt pavement crack detection based on fusion of neighborhood and gradient salient features. Journal of Computer-Aided Design & Computer Graphics 34:245−53 doi: 10.3724/sp.j.1089.2022.18891 CrossRef Google Scholar
[31]	Talab AMA, Huang Z, Xi F, Liu H. 2016. Detection crack in image using Otsu method and multiple filtering in image processing techniques. Optik 127:1030−33 doi: 10.1016/j.ijleo.2015.09.147 CrossRef Google Scholar
[32]	Ouyang A, Luo C, Zhou C. 2011. Surface distresses detection of pavement based on digital image processing. In Computer and Computing Technologies in Agriculture IV. CCTA 2010. IFIP Advances in Information and Communication Technology, 2011. vol 347. Berlin, Heidelberg: Springer Berlin Heidelberg. pp. 368−75. doi: 10.1007/978-3-642-18369-0_42
[33]	Kumare JS, Gupta P, Singh UP, Singh RK. 2019. An efficient contrast enhancement technique based on firefly optimization. In Soft Computing: Theories and Applications. Advances in Intelligent Systems and Computing, eds. Ray K, Sharma T, Rawat S, Saini R, Bandyopadhyay A. vol 742. Singapore: Springer. pp. 181−92. doi: 10.1007/978-981-13-0589-4_17
[34]	Li G, Xu Y, Li J. 2013. Fuzzy contrast enhancement algorithm for road surface image based on adaptively changing index via grey entropy. Information Technology Journal 12:5309−14 doi: 10.3923/itj.2013.5309.5314 CrossRef Google Scholar
[35]	Yao M, Zhao Z, Yao X, Xu B. 2015. Fusing complementary images for pavement cracking measurements. Measurement Science and Technology 26:025005 doi: 10.1088/0957-0233/26/2/025005 CrossRef Google Scholar
[36]	Ashraf A, Sophian A, Shafie AA, Gunawan TS, Ismail NN. 2023. Machine learning-based pavement crack detection, classification, and characterization: a review. Bulletin of Electrical Engineering and Informatics 12:3601−19 doi: 10.11591/eei.v12i6.5345 CrossRef Google Scholar
[37]	Boubenna H, Lee D. 2018. Image-based emotion recognition using evolutionary algorithms. Biologically Inspired Cognitive Architectures 24:70−76 doi: 10.1016/j.bica.2018.04.008 CrossRef Google Scholar
[38]	Zhou D, Shen X, Dong W. 2012. Image zooming using directional cubic convolution interpolation. IET Image Processing 6:627−34 doi: 10.1049/iet-ipr.2011.0534 CrossRef Google Scholar
[39]	Sobel I. 1970. Camera Models and Machine Perception. USA: Stanford University.
[40]	Marr DC, Hildreth EC. 1980. Theory of edge detection. Proceedings of the Royal Society of London. Series B. Biological Sciences 207:187−217 doi: 10.1098/rspb.1980.0020 CrossRef Google Scholar
[41]	Canny J. 1986. A computational approach to edge detection. IEEE Transactions on Pattern Analysis and Machine Intelligence PAMI- 8:679−98 doi: 10.1109/TPAMI.1986.4767851 CrossRef Google Scholar
[42]	Al-amri SS, Kalyankar NV, Khamitkar SD. 2010. Image segmentation by using edge detection. International Journal on Computer Science and Engineering 2:804 Google Scholar
[43]	Ding L, Goshtasby A. 2001. On the canny edge detector. Pattern Recognition 34:721−25 doi: 10.1016/S0031-3203(00)00023-6 CrossRef Google Scholar
[44]	Xu Q, Varadarajan S, Chakrabarti C, Karam LJ. 2014. A distributed Canny edge detector: algorithm and FPGA implementation. IEEE Transactions on Image Processing 23:2944−60 doi: 10.1109/tip.2014.2311656 CrossRef Google Scholar
[45]	Jing J, Liu S, Liu C, Gao T, Zhang W, et al. 2021. A novel decision mechanism for image edge detection. In Intelligent Computing Theories and Application. ICIC 2021. Lecture Notes in Computer Science, eds. Huang DS, Jo KH, Li J, Gribova V, Bevilacqua V. Cham: Springer. pp. 274−87. doi: 10.1007/978-3-030-84522-3_22
[46]	Zhang W, Zhao Y, Breckon TP, Chen L. 2017. Noise robust image edge detection based upon the automatic anisotropic Gaussian kernels. Pattern Recognition 63:193−205 doi: 10.1016/j.patcog.2016.10.008 CrossRef Google Scholar
[47]	Ma C, Wang W, Zhao C, Di F, Zhu Z. 2009. Pavement cracks detection based on FDWT. 2009 International Conference on Computational Intelligence and Software Engineering, December 11−13, 2009, Wuhan, China. USA: IEEE. pp. 1−4. doi: DOI: 10.1109/CISE.2009.5362561
[48]	Ragnoli A, De Blasiis MR, Di Benedetto A. 2018. Pavement distress detection methods: a review. Infrastructures 3:58 doi: 10.3390/infrastructures3040058 CrossRef Google Scholar
[49]	Min J. 2018. Measurement method of screw thread geometric error based on machine vision. Measurement and Control 51:304−10 doi: 10.1177/0020294018786751 CrossRef Google Scholar
[50]	Wang Y, Zhang JY, Liu JX, Zhang Y, Chen ZP, et al. 2019. Research on crack detection algorithm of the concrete bridge based on image processing. Procedia Computer Science 154:610−16 doi: 10.1016/j.procs.2019.06.096 CrossRef Google Scholar
[51]	Zhao H, Qin G, Wang X. 2010. Improvement of canny algorithm based on pavement edge detection. 2010 3 ^rd International Congress on Image and Signal Processing. October 16−18, 2010, Yantai, China. USA: IEEE. pp. 964−67. doi: 10.1109/CISP.2010.5646923
[52]	Huang M, Liu Y, Yang Y. 2022. Edge detection of ore and rock on the surface of explosion pile based on improved Canny operator. Alexandria Engineering Journal 61:10769−77 doi: 10.1016/j.aej.2022.04.019 CrossRef Google Scholar
[53]	Sridevi M, Mala C. 2012. A survey on monochrome image segmentation methods. Procedia Technology 6:548−55 doi: 10.1016/j.protcy.2012.10.066 CrossRef Google Scholar
[54]	Salari E, Bao G. 2011. Pavement distress detection and severity analysis. Image Processing: Machine Vision Applications IV, San Francisco Airport, 2011, California, USA. SPIE. doi: 10.1117/12.876724
[55]	Huang Y, Tsai YJ. 2011. Dynamic programming and connected component analysis for an enhanced pavement distress segmentation algorithm. Transportation Research Record 2225:89−98 doi: 10.3141/2225-10 CrossRef Google Scholar
[56]	Shao C, Chen Y, Xu F, Wang S. 2019. A kind of pavement crack detection method based on digital image processing. 2019 IEEE 4 ^th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), December 20−22, 2019, Chengdu, China. USA: IEEE. pp. 397−401. doi: 10.1109/IAEAC47372.2019.8997810
[57]	Basavaprasad B, Ravi M. 2014. A comparative study on classification of image segmentation methods with a focus on graph based techniques. International Journal of Research in Engineering and Technology 3:310−15 doi: 10.15623/ijret.2014.0315060 CrossRef Google Scholar
[58]	Zhao F, Chao Y, Liu X, Li L. 2022. A novel crack segmentation method based on morphological-processing network. 2022 15 ^th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), November 5−7, 2022, Beijing, China. USA: IEEE. pp. 1−6. doi: DOI: 10.1109/CISP-BMEI56279.2022.9980022
[59]	Varadharajan S, Jose S, Sharma K, Wander L, Mertz C. 2014. Vision for road inspection. IEEE Winter Conference on Applications of Computer Vision, March 24−26, 2014, Steamboat Springs, CO, USA. USA: IEEE. pp. 115−22. doi: 10.1109/WACV.2014.6836111
[60]	Nguyen A, Gharehbaghi V, Le NT, Sterling L, Chaudhry UI, et al. 2023. ASR crack identification in bridges using deep learning and texture analysis. Structures 50:494−507 doi: 10.1016/j.istruc.2023.02.042 CrossRef Google Scholar
[61]	Park SE, Eem SH, Jeon H. 2020. Concrete crack detection and quantification using deep learning and structured light. Construction and Building Materials 252:119096 doi: 10.1016/j.conbuildmat.2020.119096 CrossRef Google Scholar
[62]	Tran TS, Nguyen SD, Lee HJ, Tran VP. 2023. Advanced crack detection and segmentation on bridge decks using deep learning. Construction and Building Materials 400:132839 doi: 10.1016/j.conbuildmat.2023.132839 CrossRef Google Scholar
[63]	Nguyen SD, Tran TS, Tran VP, Lee HJ, Piran MJ, et al. 2023. Deep learning-based crack detection: a survey. International Journal of Pavement Research and Technology 16:943−67 doi: 10.1007/s42947-022-00172-z CrossRef Google Scholar
[64]	Soukup D, Huber-Mörk R. 2014. Convolutional neural networks for steel surface defect detection from photometric stereo images. In Advances in Visual Computing. ISVC 2014. Lecture Notes in Computer Science, eds. Bebis G, et al. 2014. Cham: Springer. pp. 668−77. doi: 10.1007/978-3-319-14249-4_64
[65]	Cha YJ, Choi W, Büyüköztürk O. 2017. Deep learning-based crack damage detection using convolutional neural networks. Computer-Aided Civil and Infrastructure Engineering 32:361−78 doi: 10.1111/mice.12263 CrossRef Google Scholar
[66]	Maeda H, Sekimoto Y, Seto T, Kashiyama T, Omata H. 2018. Road damage detection using deep neural networks with images captured through a smartphone. arXiv Preprint doi: 10.48550/arXiv.1801.09454 CrossRef Google Scholar
[67]	Yusof NM, Ibrahim A, Noor MM, Tahir NM, Yusof NM, et al. 2019. Deep convolution neural network for crack detection on asphalt pavement. Journal of Physics: Conference Series 1349:012020 doi: 10.1088/1742-6596/1349/1/012020 CrossRef Google Scholar
[68]	Shatnawi N. 2018. Automatic pavement cracks detection using image processing techniques and neural network. International Journal of Advanced Computer Science and Applications 9(9):399−402 doi: 10.14569/ijacsa.2018.090950 CrossRef Google Scholar
[69]	Yusof NAM, Osman MK, Noor MHM, Ibrahim A, Tahir NM, et al. 2018. Crack detection and classification in asphalt pavement images using deep convolution neural network. 2018 8th IEEE International Conference on Control System, Computing and Engineering (ICCSCE), November 23−25, 2018, Penang, Malaysia. USA: IEEE. pp. 227−32. doi: 10.1109/ICCSCE.2018.8685007
[70]	Tsai Y, Jiang C, Wang Z. 2012. Pavement crack detection using high-resolution 3D line laser imaging technology. In 7 ^th RILEM International Conference on Cracking in Pavements, eds. Scarpas A, Kringos N, Al-Qadi IAL. Dordrecht, Netherlands: Springer. pp. 169−78. doi: 10.1007/978-94-007-4566-7_17
[71]	Wei Y, Wang Z, Xu M. 2017. Road structure refined CNN for road extraction in aerial image. IEEE Geoscience and Remote Sensing Letters 14:709−13 doi: 10.1109/LGRS.2017.2672734 CrossRef Google Scholar
[72]	Alshehhi R, Marpu PR, Woon WL, Dalla Mura M. 2017. Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks. ISPRS Journal of Photogrammetry and Remote Sensing 130:139−49 doi: 10.1016/j.isprsjprs.2017.05.002 CrossRef Google Scholar
[73]	Henry C, Azimi SM, Merkle N. 2018. Road segmentation in SAR satellite images with deep fully convolutional neural networks. IEEE Geoscience and Remote Sensing Letters 15:1867−71 doi: 10.1109/LGRS.2018.2864342 CrossRef Google Scholar
[74]	Xie Y, Miao F, Zhou K, Peng J. 2019. HsgNet: a road extraction network based on global perception of high-order spatial information. ISPRS International Journal of Geo-Information 8:571 doi: 10.3390/ijgi8120571 CrossRef Google Scholar
[75]	Cheng G, Wang Y, Xu S, Wang H, Xiang S, et al. 2017. Automatic road detection and centerline extraction via cascaded end-to-end convolutional neural network. IEEE Transactions on Geoscience and Remote Sensing 55:3322−37 doi: 10.1109/TGRS.2017.2669341 CrossRef Google Scholar
[76]	Shi Q, Liu X, Li X. 2017. Road detection from remote sensing images by generative adversarial networks. IEEE Access 6:25486−94 doi: 10.1109/ACCESS.2017.2773142 CrossRef Google Scholar
[77]	Zhou L, Zhang C, Wu M. 2018. D-LinkNet: LinkNet with pretrained encoder and dilated convolution for high resolution satellite imagery road extraction. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), June 18−22, 2018, Salt Lake City, UT, USA. USA: IEEE. pp. 192−1924. doi: 10.1109/CVPRW.2018.00034
[78]	Doshi J. 2018. Residual inception skip network for binary segmentation. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), June 18−22, 2018, Salt Lake City, UT, USA. USA: IEEE. pp. 206−2063. doi: 10.1109/CVPRW.2018.00037
[79]	Naik SK, Murthy CA. 2003. Hue-preserving color image enhancement without gamut problem. IEEE Transactions on Image Processing 12:1591−98 doi: 10.1109/TIP.2003.819231 CrossRef Google Scholar
[80]	Pitas I, Kiniklis P. 1996. Multichannel techniques in color image enhancement and modeling. IEEE Transactions on Image Processing 5:168−71 doi: 10.1109/83.481684 CrossRef Google Scholar
[81]	Buzuloiu V, Ciuc M, Rangayyan R, Vertan C. 2001. Adaptive-neighborhood histogram equalization of color images. Journal of Electronic Imaging 10:445 doi: 10.1117/1.1353200 CrossRef Google Scholar
[82]	Trahanias PE, Venetsanopoulos AN. 2002. Color image enhancement through 3-D histogram equalization. Proceedings 11 ^th IAPR International Conference on Pattern Recognition. Vol. III. Conference C: Image, Speech and Signal Analysis, August 30 − September 1, 1992, The Hague, Netherlands. USA: IEEE. pp. 545−48. doi: 10.1109/ICPR.1992.202045
[83]	Yang CC, Rodrı́guez JJ. 1997. Efficient luminance and saturation processing techniques for color images. Journal of Visual Communication and Image Representation 8:263−77 doi: 10.1006/jvci.1997.0342 CrossRef Google Scholar
[84]	Rodr'Iguez JJ, Yang CC. 1999. Saturation clipping in the LHS and YIQ color spaces. Proceedings of SPIE - The International Society for Optical Engineering 2658
[85]	Dung CV, Anh LD. 2019. Autonomous concrete crack detection using deep fully convolutional neural network. Automation in Construction 99:52−58 doi: 10.1016/j.autcon.2018.11.028 CrossRef Google Scholar
[86]	Liu Z, Cao Y, Wang Y, Wang W. 2019. Computer vision-based concrete crack detection using U-Net fully convolutional networks. Automation in Construction 104:129−39 doi: 10.1016/j.autcon.2019.04.005 CrossRef Google Scholar
[87]	Wang X, Hu Z. 2017. Grid-based pavement crack analysis using deep learning. 2017 4th International Conference on Transportation Information and Safety (ICTIS), August 8−10, 2017, Banff, AB, Canada. US: IEEE. pp. 917−24. doi: 10.1109/ICTIS.2017.8047878
[88]	Ahmed TU, Shahadat Hossain M, Alam MJ, Andersson K. 2019. An integrated CNN-RNN framework to assess road crack. 2019 22 ⁿd International Conference on Computer and Information Technology (ICCIT), December 18−20, 2019, Dhaka, Bangladesh. USA: IEEE. pp. 1−6. doi: 10.1109/ICCIT48885.2019.9038607
[89]	Xu G, Xu G. 2023. Using a CNN to solve the problem of asphalt pavement crack detection. Proceedings of the 2023 15 ^th International Conference on Machine Learning and Computing. February 17−20, 2023, Zhuhai, China. New York, USA: ACM. pp. 290−97. doi: 10.1145/3587716.3587764
[90]	Yang N, Li Y, Ma R. 2022. An efficient method for detecting asphalt pavement cracks and sealed cracks based on a deep data-driven model. Applied Sciences 12:10089 doi: 10.3390/app121910089 CrossRef Google Scholar
[91]	Han Z, Chen H, Liu Y, Li Y, Du Y, et al. 2021. Vision-based crack detection of asphalt pavement using deep convolutional neural network. Iranian Journal of Science and Technology, Transactions of Civil Engineering 45:2047−55 doi: 10.1007/s40996-021-00668-x CrossRef Google Scholar
[92]	Jiang J, Li P, Wang J, Chen H, Zhang T. 2024. Asphalt pavement crack detection based on infrared thermography and deep learning. International Journal of Pavement Engineering 25:2295906 doi: 10.1080/10298436.2023.2295906 CrossRef Google Scholar
[93]	Yang Z, Ni C, Li L, Luo W, Qin Y. 2022. Three-stage pavement crack localization and segmentation algorithm based on digital image processing and deep learning techniques. Sensors 22:8459 doi: 10.3390/s22218459 CrossRef Google Scholar
[94]	He K, Zhang X, Ren S, Sun J. 2016. Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 27−30, 2016, Las Vegas, NV, USA. USA: IEEE. pp. 770−78. doi: 10.1109/CVPR.2016.90
[95]	Howard AG, Zhu M, Chen B, Kalenichenko D, Wang W, et al. 2017. MobileNets: efficient convolutional neural networks for mobile vision applications. arXiv Preprint doi: 10.48550/arXiv.1704.04861 CrossRef Google Scholar
[96]	Zhang X, Zhou X, Lin M, Sun J. 2018. ShuffleNet: an extremely efficient convolutional neural network for mobile devices. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, June 18−23, 2018, Salt Lake City, UT, USA. USA: IEEE. pp. 6848−56. doi: 10.1109/CVPR.2018.00716
[97]	Shin HC, Roth HR, Gao M, Lu L, Xu Z, et al. 2016. Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Transactions on Medical Imaging 35:1285−98 doi: 10.1109/TMI.2016.2528162 CrossRef Google Scholar
[98]	Shorten C, Khoshgoftaar TM. 2019. A survey on image data augmentation for deep learning. Journal of Big Data 6:60 doi: 10.1186/s40537-019-0197-0 CrossRef Google Scholar
[99]	Kestur R, Farooq S, Abdal R, Mehraj E, Narasipura O, et al. 2018. UFCN: a fully convolutional neural network for road extraction in RGB imagery acquired by remote sensing from an unmanned aerial vehicle. Journal of Applied Remote Sensing 12(1):016020 doi: 10.1117/1.jrs.12.016020 CrossRef Google Scholar
[100]	Panboonyuen T, Vateekul P, Jitkajornwanich K, Lawawirojwong S. 2017. An enhanced deep convolutional encoder-decoder network for road segmentation on aerial imagery. In Recent Advances in Information and Communication Technology 2017. IC2IT 2017. Advances in Intelligent Systems and Computing, eds. Meesad P, Sodsee S, Unger H. Cham: Springe. pp. 191−201. doi: 10.1007/978-3-319-60663-7_18
[101]	Xu H, Su X, Wang Y, Cai H, Cui K, et al. 2019. Automatic bridge crack detection using a convolutional neural network. Applied Sciences 9:2867 doi: 10.3390/app9142867 CrossRef Google Scholar
[102]	Yu Z, Shen Y, Sun Z, Chen J, Gang W. 2022. Cracklab: a high-precision and efficient concrete crack segmentation and quantification network. Developments in the Built Environment 12:100088 doi: 10.1016/j.dibe.2022.100088 CrossRef Google Scholar
[103]	Fuentes R, Pauly L, Peel H, Luo S, Hogg D. 2017. Deeper networks for pavement crack detection. Proceedings of the 34 ^th International Symposium on Automation and Robotics in Construction (ISARC), Taipei, Taiwan. The International Association for Automation and Robotics in Construction. pp. 479−85. doi: 10.22260/ISARC2017/0066
[104]	Deng L, Hinton G, Kingsbury B. 2013. New types of deep neural network learning for speech recognition and related applications: an overview. 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, May 26−31, 2013, Vancouver, BC, Canada. USA: IEEE. pp. 8599−603. doi: 10.1109/ICASSP.2013.6639344
[105]	Li J, He Z, Li D, Zheng A. 2022. Research on water seepage detection technology of tunnel asphalt pavement based on deep learning and digital image processing. Scientific Reports 12:11519 doi: 10.1038/s41598-022-15828-w CrossRef Google Scholar
[106]	Lei H, Cheng J, Xu Q. 2019. Cement pavement surface crack detection based on image processing. Mechanical Engineering Science 1(1):46−51 doi: 10.33142/me.v1i1.661 CrossRef Google Scholar
[107]	Nishikawa T, Yoshida J, Sugiyama T, Fujino Y. 2012. Concrete crack detection by multiple sequential image filtering. Computer-Aided Civil and Infrastructure Engineering 27:29−47 doi: 10.1111/j.1467-8667.2011.00716.x CrossRef Google Scholar
[108]	Kim IH, Jeon H, Baek SC, Hong WH, Jung HJ. 2018. Application of crack identification techniques for an aging concrete bridge inspection using an unmanned aerial vehicle. Sensors 18:1881 doi: 10.3390/s18061881 CrossRef Google Scholar
[109]	Zhang K, Cheng HD, Zhang B. 2018. Unified approach to pavement crack and sealed crack detection using preclassification based on transfer learning. Journal of Computing in Civil Engineering 32:04018001 doi: 10.1061/(asce)cp.1943-5487.0000736 CrossRef Google Scholar
[110]	Shim S, Kim J, Cho GC, Lee SW. 2020. Multiscale and adversarial learning-based semi-supervised semantic segmentation approach for crack detection in concrete structures. IEEE Access 8:170939−50 doi: 10.1109/ACCESS.2020.3022786 CrossRef Google Scholar
[111]	Li G, Wan J, He S, Liu Q, Ma B. 2020. Semi-supervised semantic segmentation using adversarial learning for pavement crack detection. IEEE Access 8:51446−59 doi: 10.1109/ACCESS.2020.2980086 CrossRef Google Scholar

About this article

Cite this article

Huang S, Chen H, Yan L, Zou X, Li B, et al. 2025. A review of the progress in machine vision-based crack detection and identification technology for asphalt pavements. Digital Transportation and Safety 4(1): 65−79 doi: 10.48130/dts-0025-0006

Huang S, Chen H, Yan L, Zou X, Li B, et al. 2025. A review of the progress in machine vision-based crack detection and identification technology for asphalt pavements. Digital Transportation and Safety 4(1): 65−79 doi: 10.48130/dts-0025-0006

Figures(8) / Tables(2)

Download PDF

Article Metrics

Article views(5700) PDF downloads(1697)

Other Articles By Authors

on this site
on Google Scholar

HTML

Asphalt pavement crack detection technology classification

Technical assessment and comparison

Performance comparison

Asphalt pavement crack detection involves two technologies: digital image processing and machine learning. Digital image processing technology, which began in the 1960s, detects cracks through steps such as image acquisition, preprocessing, segmentation, feature extraction, and crack identification. Its preprocessing can eliminate interfering factors, segmentation methods are diverse, and feature extraction often uses edge detection operators, which can improve detection efficiency and accuracy but are affected by the complexity of road images. Machine learning technology collects data using a variety of devices, including smartphones, cameras, laser cameras, etc. It goes through data preprocessing, crack classification, object detection, segmentation, and model evaluation. Crack classification often uses CNN, object detection employs object detection algorithms, and evaluation has clear indicators. Both technologies have their characteristics; digital image processing technology has a longer history, while machine learning technology demonstrates the advantages of deep learning but face different challenges in practical applications.

Technologies based on digital image processing mainly rely on steps such as image preprocessing, feature extraction, and crack identification. The advantage of this method is its low computational cost, no need for a large number of sample training, and it can quickly classify cracks automatically. For example, morphological filters^[13] and improved Canny algorithms^[51,52] can effectively remove noise and preserve crack edges. In addition, constructing directional templates^[49] and image smoothing preprocessing algorithms can better highlight the directionality of crack linear features. Machine learning-based technologies identify cracks by training models, can handle more complex road conditions, and their recognition accuracy will gradually improve with the increase of data volume^[65]. However, the computational cost of crack recognition will also increase accordingly. For example, Convolutional Neural Networks (CNN) can automatically learn crack features from images, achieving high-precision crack identification^[87]. Machine learning methods usually have an advantage in accuracy because they can improve recognition rates by learning from a large number of sample data.

In response to the high computational costs associated with machine learning-based crack detection, researchers have adopted various strategies to optimize and reduce these costs. By reducing the number of network layers or the number of neurons per layer^[94], the number of model parameters can be significantly decreased, thereby reducing the computational load. For instance, networks such as MobileNets^[95] or ShuffleNets^[96] are specifically designed to minimize computational requirements while maintaining high accuracy. Pre-trained models on large datasets can also be fine-tuned for crack detection tasks, avoiding the need to train models from scratch and greatly reducing computational demands^[97]. Additionally, data diversity can be increased through rotation, scaling, and cropping, reducing reliance on large amounts of real data^[98]. Optimizers such as Adam and RMSprop^[90] can accelerate model convergence and reduce the number of training iterations. Through these methods, it is possible to effectively reduce computational costs while maintaining the accuracy of crack detection, making machine learning-based crack detection methods more practical and efficient.

In summary, machine learning-based methods may be more effective in dealing with complex and variable road conditions but require sufficient training data and computational resources. Digital image processing methods have the advantage of computational efficiency and are suitable for rapid deployment and real-time detection. In practical applications, the appropriate technology can be selected according to specific needs and conditions. As shown in Table 2, the technical developments in crack detection are presented.

Table 2. The development of technology in crack detection.

Key technology	Main contribution	Performance index	Ref.
CNN model optimized based on road structure	Combined fusion and deconvolution layers to obtain structured output, proposed a road structure-based loss function	F1 Score: 66.2%; recall: 72.9%; precision: 60.6%; accuracy: 92.4%	Wei et al.^[71]
A novel architecture based on FCN, called U-shaped FCN	Proposed U-shaped FCN model, data augmentation to improve training efficiency	F1 Score: 89.6%; recall: 86.8%; precision: 92.5%; accuracy: 95.2%	Kestur et al.^[99]
U-Net fully convolutional network	First use of U-Net for detecting concrete cracks, U-Net is superior to DCNN in robustness, effectiveness, and accuracy	F1 Score: 90%; recall: 91%; precision 90%	Zhenqing Liu^[86]
Technology based on improved deep encoder-decoder neural networks	networks Enhanced model, using ELU function and LM method to improve overall output accuracy	F1 Score: 85.7%; recall: 86.1%; precision: 85.4%	Panboonyuen et al.^[100]
CNN-based automatic bridge crack detection model	Using an end-to-end model, parallel use of three Atrous convolutions to reduce computational complexity	F1 Score: 87.7%; precision: 78.1%; accuracy: 98.4%	Xu & Xu^[101]

To comprehensively evaluate and compare the performance of different models and training strategies, and to ensure the robustness and credibility of the research results, researchers typically employ a suite of assessment metrics that include standard deviation. This approach enhances the credibility of the research findings. In the study by Yu et al.^[102], a deep learning model named Cracklab was proposed based on the Deeplabv3+ framework. During the training process, to enhance the model's recognition of complex background images, the authors conducted an in-depth experimental comparison of three loss functions—cross-entropy loss (CEL), focal loss (FL), and Dice loss (DL). By introducing standard deviation as a supplementary metric alongside multiple experiments of different training strategies, and in conjunction with the experimental outcomes, the authors ultimately selected focal loss (FL) as the loss function for the model, due to its superior performance in terms of stability and convergence. This choice helps to improve the model's performance in crack detection tasks, especially when dealing with complex backgrounds.

Application case analysis

Practical application

In the field of crack identification, technologies based on digital image processing and deep learning have achieved significant success in practical applications. Pauly et al.^[103] designed a deeper neural network architecture specifically for road crack detection. The dataset used included 500 RGB road images^[104] each with a resolution of 3264 x 2448 pixels. Each image was divided into 99 x 99 pixel image patches, which were labeled as cracked or non-cracked by multiple annotators. Subset 1 contained 20,000 cracked and non-cracked image patches as the training set and 200,000 image patches as the test set. Subset 2's training set included 40,000 image patches, and the test set included 60,000 image patches, with the test set images coming from a different location than the training set to introduce environmental variations. Deep Convolutional Neural Networks (CNNs) were used, employing categorical cross-entropy loss function and Stochastic Gradient Descent (SGD) algorithm, with 80 and 40 epochs of training for experiments 1 and 2, respectively. The experimental results indicated that increasing the network depth could improve the accuracy and recall rate of crack detection. However, when the training and testing datasets were from different locations, the network's performance declined, indicating that location variance is a significant hurdle for implementing a universal automatic crack detection system.

Liu et al.^[86] were the first to adopt U-Net for detecting concrete cracks. The study utilized 84 images with a resolution of 512 × 512, 57 of which were used for model training and 27 for testing. These images encompassed various conditions, such as illumination, background interference, and crack width, with manual labeling of crack locations. The study compared the performance of U-Net and Cha's CNN under different conditions, including images under ideal conditions, images with significant background interference, images with thin cracks, and images under low-light conditions. The Focal loss function was used to address class imbalance issues. The Adam algorithm was employed with an initial learning rate of η = 0.0001, an exponential decay rate for the first moment estimate of β1 = 0.9, and for the second moment estimate of β2 = 0.999. Each training randomly selected two images as a mini-batch. A 3-fold cross-validation was used, with each fold utilizing 38 images for training and 19 for validation. After 80 epochs of training, the precision, recall, and F1 scores on the validation set stabilized at approximately 0.90, 0.91, and 0.90, respectively. U-Net outperformed Cha's CNN on the test set, especially under conditions of background interference, thin cracks, and low light, demonstrating better robustness and accuracy. Despite this, there is still much room for improvement in engineering applications, including algorithm improvements to accommodate more input sizes, hyperparameter tuning, and maintaining a larger dataset to train more robust models.

Li et al.^[105] focused on the water leakage detection technology for tunnel asphalt pavements, proposing a computation method for water leakage area images based on deep learning and digital image processing. Their research results showed that the Efficient Net model achieved recognition accuracies of 99.85% and 97.53% on the training and validation sets, respectively, which is a 2.76% improvement over traditional methods. This finding not only validates the effectiveness of deep learning in water leakage detection but also provides new technical support for road maintenance. Overall, these studies demonstrate the significant advantages of deep learning in pavement crack detection, especially in enhancing detection accuracy and handling complex images. However, despite their excellent performance in laboratory environments, these methods still face some challenges in practical applications.

Limitations and challenges
Crack detection is quite challenging due to its irregular shapes and lack of fixed sizes, making it difficult to effectively identify through preset methods. This article reviews two major areas of crack detection and identification: technology based on digital image processing and technology based on machine learning. Although both types of methods perform well in specific contexts, they still face significant challenges in accurately detecting cracks. Crack classification is mentioned less in the text, yet it is crucial for revealing the nature, cause, and severity of cracks. More research is urgently needed to develop crack classification methods so that systems can recognize crack types and optimize maintenance strategies.

Digital image processing technology has given satisfactory performance on custom datasets constructed by researchers. However, these methods depend on lighting conditions, image resolution, and the level of noise present in the images^[35,106]. Moreover, asphalt pavements have different textures when subjected to external disturbances, and even if they are made of the same material, they may not have the same texture. Therefore, when new images with different textures, brightness, resolution, or noise levels are inputted, they may not yield such good results. Additionally, the accuracy of transverse crack detection is not enough compared to longitudinal measurements. This difference in directional measurement could be an issue when establishing a relationship between the width and longitude of cracks. Thus, the practical applicability of using image processing-based methods remains unclear^[107,108].

On the other hand, machine learning methods also present some limitations to researchers. An increase in processing time has been observed in many methods. Many methods require manual setting of model parameters, which limits the full automation of crack detection methods. To avoid model overfitting, it is necessary to train models with large datasets. These methods require extensive labeling of data images. In practical scenarios, the selection of labels is also limited, so obtaining labels can be a daunting task^[90]. Different algorithms may be needed to accurately detect cracks due to varying surface conditions. Moreover, crack detection is performed offline, hence the real-time detection performance is poor. Therefore, there is a need to improve the performance of algorithms and detection accuracy. In addition, deep learning methods can be applied to unsupervised tasks, using small datasets that do not require extensive data labeling, thereby reducing time and costs^[59].

The presence of noise, shadows, blemishes, and other disturbances in images is a common problem that researchers face when using image processing and machine learning methods^[23,47,86]. Therefore, more research is needed to develop methods that can remove noise and other irregularities in images^[109].

Scope of technical application
Crack detection technology does indeed exhibit significant differences in performance under various environmental conditions, primarily influenced by environmental factors, crack characteristics, and the type of technology employed. In environments with good lighting conditions (outdoors, well-lit indoors), high contrast and clear images aid traditional image processing techniques in accurately identifying crack edges and textures. In environments with poor lighting conditions (tunnels, nighttime, shady areas), deep learning models that integrate image enhancement and segmentation algorithms^[86,93] Can be trained to adapt to different lighting conditions, enabling the detection of cracks that would otherwise be difficult to discern. In complex background environments (rough surfaces, intricate textures), advanced image processing algorithms are typically employed^[58,89], such as techniques based on local binary patterns or filters, which can highlight crack features and effectively detect cracks even on surfaces with complex or uneven textures.

Additionally, different types of cracks pose different requirements for detection technology, mainly because each type of crack has unique morphology, causes, and severity levels, necessitating targeted detection methods and techniques for accurate identification and assessment. Transverse cracks are relatively easier to detect, while longitudinal cracks pose a greater challenge. Wang & Hu^[87] used CNN for detecting cracks and non-cracks. They classified three types of cracks, achieving high precision for transverse cracks, medium precision for longitudinal cracks at 0.97, and fatigue cracks at 0.90. Crack detection technology needs to be optimized for different types of cracks^[88]. Surface cracks, which are typically shallow and distributed in a linear or reticular pattern, require higher image resolution and precision in image processing algorithms. Deep cracks, which may originate on the surface and extend inward, necessitate more penetrating detection techniques, such as ultrasonic or electromagnetic detection. The selection and application of crack detection technology should consider the type, characteristics, environmental conditions, and detection objectives of the cracks. By employing appropriate detection methods and equipment, cracks can be effectively identified and assessed, ensuring the integrity and safety of structures. As technology advances, future crack detection techniques are likely to become more intelligent, automated, and capable of adapting to a wider and more complex range of detection needs.

Future development directions

Technological innovation

Future crack detection technology needs to be integrated with road maintenance strategies, where detailed crack data obtained through crack detection technology will be directly used in the formulation of road maintenance strategies. This data includes parameters such as the location, length, width, and depth of cracks, providing a scientific basis for maintenance decisions. For instance, deep learning-based crack detection systems can automatically measure these parameters, providing data support for maintenance teams to choose the most appropriate repair materials and methods. The integration of crack detection technology with road maintenance strategies helps optimize the allocation of maintenance resources. By analyzing the distribution and severity of cracks, it is possible to determine which areas need priority maintenance, allocate maintenance personnel, equipment, and materials rationally, and improve maintenance efficiency.

The future development trend of crack detection technology will focus on the in-depth application of multimodal data fusion, which integrates different types and sources of data to enhance the accuracy, efficiency, and reliability of crack detection. For instance, by combining image and thermal data, we can leverage the complementary advantages of visible light images and infrared thermographic data to more accurately identify cracks^[92]. In difficult situations where surfaces are covered with dirt or paint, thermographic technology can reveal temperature differences caused by cracks, while visible light images provide rich detailed information. Jiang et al.^[92] adopted a method combining infrared thermography technology with deep learning, proposing a model named GSkYOLOv5. This method processes infrared images, effectively reducing the impact of environmental interferences such as shadows and reflections. The experimental results showed an improvement of 4.7% in detection accuracy and 1.3% in recall rate. Moreover, the integration of drone and ground detection technology, by combining cameras mounted on drones and ground detection equipment^[16,17,68], allows for crack detection from different angles and distances. Drones provide a broad perspective and access to hard-to-reach areas, while ground detection offers close-up detailed inspection, and their combination achieves comprehensive and meticulous monitoring of cracks.

In response to the rising cost of data acquisition, semi-supervised learning, and unsupervised learning methods will become new hotspots in research. In the field of crack detection, models can first be trained using a small number of professionally labeled crack images and then incorporate unlabeled image data. In this way, the model can enhance the accuracy of crack identification by learning common features of images. Semi-supervised learning enhances model performance by combining a small amount of labeled data with a large amount of unlabeled data for model training, leveraging the inherent structure of unlabeled data^[110,111]. Unsupervised learning, on the other hand, does not rely on labeled data and directly mines patterns and structures from unlabeled data, widely applied to tasks such as clustering and association rule learning^[37,63]. In crack detection, it can be used to recognize patterns of cracks in images, achieve automatic clustering of crack images, or reveal potential associations between cracks and other image features.

This approach of integrating multiple data and technologies not only enhances the capabilities of crack detection but also brings significant benefits to fields such as materials science and pavement engineering. It promotes technological innovation and knowledge integration, providing solid technical support for a sustainable future.

The potential for interdisciplinary collaboration
Through close collaboration with materials science, we can delve into how the microstructure, mechanical properties, and chemical composition of materials affect the initiation and propagation of cracks. This understanding allows us to tailor detection algorithms based on the characteristics of the materials, significantly enhancing the accuracy and efficiency of crack detection. Furthermore, studying the mechanisms of crack formation during material aging and damage not only helps us assess the overall health of materials through crack detection but also enables the development of predictive models to forecast the trajectory of crack growth and the remaining service life of the materials.

In the field of pavement monitoring and maintenance, applying crack detection technology to real pavement monitoring allows for the real-time assessment of pavement conditions and guides necessary maintenance and repair work, thereby improving the efficiency and effectiveness of pavement maintenance and effectively extending the service life of the pavement. Additionally, studying the behavior of cracks in pavements under different environmental conditions, such as temperature changes, humidity, and traffic loads, enables us to develop crack detection technologies that adapt to various environmental conditions. Utilizing this data feedback, we can also optimize pavement structural design, enhance the overall performance of the pavement, and reduce the occurrence of cracks.

This interdisciplinary collaboration not only enables crack detection technology to better adapt to diverse application scenarios, improving detection accuracy and efficiency, but also allows the fields of materials science and pavement engineering to significantly benefit from advancements in crack detection technology, achieving smarter, more economical, and more environmentally friendly solutions. Moreover, this collaboration promotes technological innovation and knowledge integration in related fields, providing strong support for sustainable development in the future.

Dataset name	Description	Application in literature
Massachusetts dataset	Contains 1,711 road images and 151 building images, used for road and building extraction	Wei et al.^[71] optimized CNN models to extract road categories from aerial images. Alshehhi et al.^[72] implemented a patch-based CNN model to extract roads and building parts from remote sensing images.
TerraSAR-X dataset	Used for road extraction in SAR images, with 20% for testing and 80% for training	Henry et al.^[73] used DeepLabV3+ and Deep Residual U-Net to extract road parts from SAR images.
DeepGlobe dataset	Contains 622 test images, 622 validation images, and 4,971 training images	Xie et al.^[74] applied a new global perception framework based on higher-order spatial information (HsgNet) for road extraction.
Google Earth dataset	Contains 567 test images and 2,213 training images	Cheng et al.^[75] proposed the CasNet deep learning model to detect road categories and extract road centerlines. Shi et al.^[76] implemented GAN models using data augmentation procedures to generate high-resolution segmentation maps.
DigitalGlobe dataset	Collected by DigitalGlobe satellite, contains 6,226 images	Zhou et al.^[77] introduced the D-LinkNet model for road semantic segmentation in remote sensing images. Doshi^[78] used a ResNet-based ensemble model to extract roads from satellite images.

{{lists.name}}

A review of the progress in machine vision-based crack detection and identification technology for asphalt pavements

Abstract