Technical Papers
Feb 21, 2023

Detection of Asphalt Pavement Cracks Based on Vision Transformer Improved YOLO V5

Publication: Journal of Transportation Engineering, Part B: Pavements
Volume 149, Issue 2

Abstract

Automatic and rapid detection of pavement cracks is one of the important tasks for the highway department. This study proposed an improved model of you only look once (YOLO) V5 integrated with the vision transformer (ViT) that can calculate the attention weights of image regions and form a new feature map with weights. The ViT module was added to the neck of YOLO V5 to improve the speed and accuracy of the model. 1944 asphalt pavement images were collected for testing. The test results showed that the proposed model obtained high accuracy and speed for longitudinal, transverse, and fatigue cracks and was capable of real-time detection. The ViT-improved YOLO V5m obtained 0.872 in mAP(0.5), and the detection time for a single image is 11.9 ms. The study also investigated the pavement crack detection performance of the models in a rainfall environment. All models did not have satisfying detection capabilities in rainfall conditions, but the developed YOLO V5 had better performance.

Get full access to this article

View all available purchase options and get full access to this article.

Data Availability Statements

All data that support the findings of this study are available from the corresponding author upon reasonable request.

Acknowledgments

This work was sponsored by the projects found by the National Natural Science Foundation of China (NSFC) under Grant Nos. 51978163 and 52208439 and the Jiangsu Nature Science Foundation under Grant BK20200468.

References

Aziz, L., M. S. B. Salam, U. U. Sheikh, and S. Ayub. 2020. “Exploring deep learning-based architecture, strategies, applications and current trends in generic object detection: A comprehensive review.” IEEE Access 8 (Sep): 170461–170495. https://doi.org/10.1109/ACCESS.2020.3021508.
Behzadian, A., et al. 2022. “The 1st data science for pavements challenge.” Preprint, submitted June 10, 2022. https://arxiv.org/abs/2206.04874.
Bello, I., B. Zoph, A. Vaswani, J. Shlens, and Q. V. Le. 2019. “Attention augmented convolutional networks.” In Proc., IEEE/CVF Int. Conf. on Computer Vision, 3286–3295. Los Alamitos, CA: IEEE Computer Society.
Bochkovskiy, A., C.-Y. Wang, and H.-Y. Liao. 2020. “Yolov4: Optimal speed and accuracy of object detection.” Preprint, submitted April 23, 2020. https://arxiv.org/abs/2004.10934.
Dosovitskiy, A., et al. 2020. “An image is worth 16×16 words: Transformers for image recognition at scale.” Preprint, submitted October 22, 2020. https://arxiv.org/abs/2010.11929.
Du, Y. C., N. Pan, Z. H. Xu, F. W. Deng, Y. Shen, and H. Kang. 2021. “Pavement distress detection and classification based on YOLO network.” Int. J. Pavement Eng. 22 (13): 1659–1672. https://doi.org/10.1080/10298436.2020.1714047.
Feng, X. R., L. Y. Xiao, W. Li, L. L. Pei, Z. Y. Sun, Z. D. Ma, H. Shen, and H. Y. Ju. 2020. “Pavement crack detection and segmentation method based on improved deep learning fusion model.” Math. Probl. Eng. 2020 (Dec): 1–22. https://doi.org/10.1155/2020/8515213.
Girshick, R. 2015. “Fast R-CNN.” In Proc., IEEE Int. Conf. on Computer Vision, 1440–1448. Los Alamitos, CA: IEEE Computer Society.
Girshick, R., J. Donahue, T. Darrell, and J. Malik. 2014. “Rich feature hierarchies for accurate object detection and semantic segmentation.” In Proc., 27th IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 580–587. Los Alamitos, CA: IEEE Computer Society.
He, K. M., X. Y. Zhang, S. Q. Ren, and J. Sun. 2014. “Spatial pyramid pooling in deep convolutional networks for visual recognition.” In Proc., 13th European Conf. on Computer Vision (ECCV), 346–361. Cham, Switzerland: Springer.
Hu, G. X., B. L. Hu, Z. Yang, L. Huang, and P. Li. 2021. “Pavement crack detection method based on deep learning models.” Wireless Commun. Mobile Comput. 2021 (May): 1–3. https://doi.org/10.1155/2021/5573590.
Huyan, J., W. Li, S. Tighe, J. Z. Zhai, Z. C. Xu, and Y. Chen. 2019. “Detection of sealed and unsealed cracks with complex backgrounds using I (A) deep convolutional neural network.” Autom. Constr. 107 (Nov): 102946. https://doi.org/10.1016/j.autcon.2019.102946.
Ibragimov, E., H. J. Lee, J. J. Lee, and N. Kim. 2022. “Automated pavement distress detection using region based convolutional neural networks.” Int. J. Pavement Eng. 23 (6): 1981–1992. https://doi.org/10.1080/10298436.2020.1833204.
Jeong, D. J. 2020. “Road damage detection using YOLO with smartphone images.” In Proc., 8th IEEE Int. Conf. on Big Data (Big Data), 5559–5562. Los Alamitos, CA: IEEE Computer Society.
Jeong, J. H., J. S. Lim, Y. C. Suh, and J. H. Nam. 2014. “Development of performance criteria for Korean pavement warranty specification.” J. Perform. Constr. Facil. 28 (2): 402–411. https://doi.org/10.1061/(ASCE)CF.1943-5509.0000405.
Kalfarisi, R., Z. Y. Wu, and K. Soh. 2020. “Crack detection and segmentation using deep learning with 3D reality mesh model for quantitative assessment and integrated visualization.” J. Comput. Civ. Eng. 34 (3): 04020010. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000890.
Kang, L. W., C. W. Lin, and Y. H. Fu. 2012. “Automatic single-image-based rain streaks removal via image decomposition.” IEEE Trans. Image Process. 21 (4): 1742–1755. https://doi.org/10.1109/TIP.2011.2179057.
Li, S. W., X. Y. Gu, X. R. Xu, D. W. Xu, T. J. Zhang, Z. Liu, and Q. Dong. 2021. “Detection of concealed cracks from ground penetrating radar images based on deep learning algorithm.” Constr. Build. Mater. 273 (Mar): 121949. https://doi.org/10.1016/j.conbuildmat.2020.121949.
Liu, C. Q., J. Li, J. Gao, Z. Q. Gao, and Z. J. Chen. 2021. “Combination of pixel-wise and region-based deep learning for pavement inspection and segmentation.” Int. J. Pavement Eng. 23 (9): 3011–3023. https://doi.org/10.1080/10298436.2021.1877704.
Liu, S., L. Qi, H. F. Qin, J. P. Shi, and J. Y. Jia. 2018. “Path aggregation network for instance segmentation.” In Proc., 31st IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 8759–8768. Los Alamitos, CA: IEEE Computer Society.
Liu, W., D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, and A. C. Berg. 2016. “Ssd: Single shot multibox detector.” In Proc., European Conf. on Computer Vision, 21–37. Cham, Switzerland: Springer.
Ma, D., H. Y. Fang, B. H. Xue, F. M. Wang, M. A. Msekh, and C. L. Chan. 2020. “Intelligent detection model based on a fully convolutional neural network for pavement cracks.” CMES-Comput. Model. Eng. Sci. 123 (3): 1267–1291. https://doi.org/10.32604/cmes.2020.09122.
Mandal, V., L. Uong, and Y. Adu-Gyamfi. 2018. “Automated road crack detection using deep convolutional neural networks.” In Proc., IEEE Int. Conf. on Big Data (Big Data), 5212–5215. Los Alamitos, CA: IEEE Computer Society.
Nie, M., and K. Wang. 2018. “Pavement distress detection based on transfer learning.” In Proc., 5th Int. Conf. on Systems and Informatics (ICSAI), 435–439. Piscataway, NJ: IEEE.
Ren, S. Q., K. M. He, R. Girshick, and J. Sun. 2015. “Faster R-CNN: Towards real-time object detection with region proposal networks.” In Proc., 29th Annual Conf. on Neural Information Processing Systems (NIPS). La Jolla, CA: Neural Information Processing Systems.
Ronneberger, O., P. Fischer, and T. Brox. 2015. “U-net: Convolutional networks for biomedical image segmentation.” In Proc., Int. Conf. on Medical Image Computing and Computer-Assisted Intervention, 234–241. Cham, Switzerland: Springer.
Song, L., and X. C. Wang. 2021. “Faster region convolutional neural network for automated pavement distress detection.” Road Mater. Pavement Des. 22 (1): 23–41. https://doi.org/10.1080/14680629.2019.1614969.
Tran, T. S., V. P. Tran, H. J. Lee, J. M. Flores, and V. P. Le. 2020. “A two-step sequential automated crack detection and severity classification process for asphalt pavements.” Int. J. Pavement Eng. 23 (6): 2019–2033. https://doi.org/10.1155/2021/5573590.
Ultralytics. 2022. “YOLOV5.” Accessed February 13, 2022. https://github.com/ultralytics/yolov5.
Vaswani, A., N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin. 2017. “Attention is all you need.” In Proc., Advances in Neural Information Processing Systems, 5998–6008. La Jolla, CA: Neural Information Processing Systems.
Wang, C.-Y., H.-Y. M. Liao, Y.-H. Wu, P.-Y. Chen, J.-W. Hsieh, and I.-H. Yeh. 2020. “CSPNet: A new backbone that can enhance learning capability of CNN.” In Proc., IEEE/CVF Conf. on Computer Vision and Pattern Recognition Workshops, 390–391. Los Alamitos, CA: IEEE Computer Society.
Wang, T., K. Gopalakrishnan, O. Smadi, and A. K. Somani. 2018a. “Automated shape-based pavement crack detection approach.” Transport 33 (3): 598–608. https://doi.org/10.3846/transport.2018.1559.
Wang, W. Z., B. Wu, S. X. Yang, and Z. X. Wang. 2018b. “Road damage detection and classification with faster R-CNN.” In Proc., IEEE Int. Conf. on Big Data (Big Data), 5220–5223. Los Alamitos, CA: IEEE Computer Society.
Yang, X., J. C. Guan, L. Ding, Z. P. You, V. C. S. Lee, M. R. M. Hasan, and X. Y. Cheng. 2021. “Research and applications of artificial neural network in pavement engineering: A state-of-the-art review.” J. Traffic Transp. Eng. 8 (6): 1000–1021. https://doi.org/10.1016/j.jtte.2021.03.005.
Zhang, H., and V. M. Patel. 2018. “Density-aware single image de-raining using a multi-stream dense network.” In Proc., 31st IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR), 695–704. Los Alamitos, CA: IEEE Computer Society.
Zhao, H., J. Jia, and V. Koltun. 2020. “Exploring self-attention for image recognition.” In Proc., IEEE/CVF Conf. on Computer Vision and Pattern Recognition, 10076–10085. Los Alamitos, CA: IEEE Computer Society.

Information & Authors

Information

Published In

Go to Journal of Transportation Engineering, Part B: Pavements
Journal of Transportation Engineering, Part B: Pavements
Volume 149Issue 2June 2023

History

Received: Feb 23, 2022
Accepted: Dec 26, 2022
Published online: Feb 21, 2023
Published in print: Jun 1, 2023
Discussion open until: Jul 21, 2023

Permissions

Request permissions for this article.

Authors

Affiliations

Sike Wang
Master’s Candidate, School of Transportation, Southeast Univ., Nanjing 211189, China.
Xueqin Chen, Ph.D.
Associate Professor, Dept. of Civil Engineering, Nanjing Univ. of Science and Technology, Nanjing, Jiangsu 210094, China.
Professor, School of Transportation, Southeast Univ., Southeast University Rd. 2#, Jiangning District, Nanjing, Jiangsu 211189 China (corresponding author). ORCID: https://orcid.org/0000-0001-7461-9226. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

Cited by

  • CrackYOLO: Rural Pavement Distress Detection Model with Complex Scenarios, Electronics, 10.3390/electronics13020312, 13, 2, (312), (2024).
  • Automatic Detection of Pavement Marking Defects in Road Inspection Images Using Deep Learning, Journal of Performance of Constructed Facilities, 10.1061/JPCFEV.CFENG-4619, 38, 2, (2024).
  • CrackTinyNet: A novel deep learning model specifically designed for superior performance in tiny road surface crack detection, IET Intelligent Transport Systems, 10.1049/itr2.12497, (2024).
  • An end-to-end computer vision system based on deep learning for pavement distress detection and quantification, Construction and Building Materials, 10.1016/j.conbuildmat.2024.135036, 416, (135036), (2024).
  • Ground Penetrating Radar Image Recognition for Earth Dam Disease Based on You Only Look Once v5s Algorithm, Water, 10.3390/w15193506, 15, 19, (3506), (2023).
  • Deep transformer networks for precise pothole segmentation tasks, Proceedings of the 16th International Conference on PErvasive Technologies Related to Assistive Environments, 10.1145/3594806.3596560, (596-602), (2023).
  • CCE-YOLOv5s: An Improved YOLOv5 Model for UAV Small Target Detection, 2023 IEEE 5th International Conference on Civil Aviation Safety and Information Technology (ICCASIT), 10.1109/ICCASIT58768.2023.10351744, (824-829), (2023).

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Article
$35.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share