Multiple-Channel Audio Construction Equipment Dataset Preparation for Sound Detection and Localization to Prevent Collision Hazards
Publication: Construction Research Congress 2024
ABSTRACT
Construction workplaces often face unforeseen struck-by equipment hazards, leading to severe injuries and fatalities for workers. Detecting and localizing equipment sounds using multi-channel audio data has drawn interest in research. However, collecting such data for developing sound detection and localization machine learning models is challenging. Physical recordings on site required for deep learning are often infeasible due to the lack of proper sound attribute labels from heterogeneous construction sounds. This paper introduces a novel method for synthesizing overlapping and non-overlapping sound datasets in a three-dimensional space, utilizing Pyroomacoustics. The approach uses single sound data with attributes like start time, end time, azimuth, and elevation as microphone input to generate multi-channel audio output. The study successfully simulates 5,025 distinct scenario audios for both datasets, utilizing seven single-sound audiotapes. The generated large dataset can train neural network models capable of localizing equipment collision hazards in construction sites.
Get full access to this article
View all available purchase options and get full access to this chapter.
REFERENCES
Adavanne, S., Politis, A., Nikunen, J., and Virtanen, T. (2018). Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks. https://doi.org/10.1109/JSTSP.2018.2885636.
Bang, S., Hong, Y., and Kim, H. (2021). Proactive proximity monitoring with instance segmentation and unmanned aerial vehicle-acquired video-frame prediction. Computer-Aided Civil and Infrastructure Engineering, 36(6), 800–816. https://doi.org/10.1111/mice.12672.
Cantzos, D. (2008). Statistical enhancement methods for immersive audio environments and compressed audio [University of Southern California PP - United States -- California]. In ProQuest Dissertations and Theses. http://libproxy.clemson.edu/login?url=https://www.proquest.com/dissertations-theses/statistical-enhancement-methods-immersive-audio/docview/304468450/se-2?accountid=6167.
Chae, S., and Yoshida, T. (2010). Application of RFID technology to prevention of collision accident with heavy equipment. Automation in Construction, 19(3), 368–374. https://doi.org/10.1016/j.autcon.2009.12.008.
Chen, L., Yu, M., Su, D., and Yu, D. (2019). Multi-band PIT and Model Integration for Improved Multi-channel Speech Separation. ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 705–709. https://doi.org/10.1109/ICASSP.2019.8682470.
Cheng, C. F., Rashidi, A., Davenport, M. A., and Anderson, D. V. (2017). Activity analysis of construction equipment using audio signals and support vector machines. Automation in Construction, 81, 240–253. https://doi.org/10.1016/j.autcon.2017.06.005.
Dobie, R. A. (2005). Estimating Noise-Induced Permanent Threshold Shift from Audiometric Shape: The ISO-1999 Model. Ear and Hearing, 26(6), 630–635. https://doi.org/10.1097/01.aud.0000188120.14321.76.
Draxler, C., and Jänsch, K. (2004). SpeechRecorder -A universal platform independent multi-channel audio recording software. Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004, 559–562.
Vinnik, E., Itskov, P. M., and Balaban, E. (2011). Individual Differences in Sound-in-Noise Perception Are Related to the Strength of Short-Latency Neural Responses to Noise. PLOS ONE, 6(2), 1–8. https://doi.org/10.1371/journal.pone.0017266.
Elelu, K., Le, T., and Le, C. (2022). Augmented Hearing of Auditory Safety Cues for Construction Workers: A Systematic Literature Review. Sensors, 22(23). https://doi.org/10.3390/s22239135.
Elelu, K., Le, T., and Le, C. (2023). Collision Hazard Detection for Construction Worker Safety Using Audio Surveillance. Journal of Construction Engineering and Management, 149(1). https://doi.org/10.1061/JCEMD4.COENG-12561.
Gemmeke, J. F., Ellis, D. P. W., Freedman, D., Jansen, A., Lawrence, W., Moore, R. C., Plakal, M., and Ritter, M. (2017). Audio Set: An ontology and human-labeled dataset for audio events. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 776–780. https://doi.org/10.1109/ICASSP.2017.7952261.
Hinze, J. W., and Teizer, J. (2011). Visibility-related fatalities related to construction equipment. Safety Science, 49(5), 709–718. https://doi.org/10.1016/j.ssci.2011.01.007.
Jiang, H., Murdock, C., and Ithapu, V. K. (2022). Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022-June, 10534–10542. https://doi.org/10.1109/CVPR52688.2022.01029.
Lee, J., and Yang, K. (2022). Mobile Device-Based Struck-By Hazard Recognition in Construction Using a High-Frequency Sound. Sensors, 22(9), 3482. https://doi.org/10.3390/s22093482.
Marks, E. D., and Teizer, J. (2013). Method for testing proximity detection and alert technology for safe construction equipment operation. Construction Management and Economics, 31(6), 636–646. https://doi.org/10.1080/01446193.2013.783705.
Padilla-Ortiz, A. L., Machuca-Tzili, F. A., and Ibarra-Zarate, D. (2023). Smartphones, a tool for noise monitoring and noise mapping: an overview. International Journal of Environmental Science and Technology, 20(3), 3521–3536. https://doi.org/10.1007/s13762-022-04240-6.
Scheibler, R., Bezzam, E., and Dokmanic, I. (2018). Pyroomacoustics: A Python Package for Audio Room Simulation and Array Processing Algorithms. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 351–355. https://doi.org/10.1109/ICASSP.2018.8461310.
Sherif, N., Sundelius, N., and Eriksson, L. C. (2022). ROOM MAPPING FOR TUNING OF HIGH FIDELITY SOUND SYSTEMS Examiner: Mikael Ekström.
Simson, W. A. (2022). Physics-Informed Deep Learning for Advanced Medical Ultrasound. https://mediatum.ub.tum.de/doc/1634543/document.pdf.
Tan, T.-H., Lin, Y.-T., Chang, Y.-L., and Alkhaleefah, M. (2021). Sound Source Localization Using a Convolutional Neural Network and Regression Model. Sensors, 21(23). https://doi.org/10.3390/s21238031.
Teizer, J., Allread, B. S., Fullerton, C. E., and Hinze, J. (2010). Autonomous pro-active real-time construction worker and equipment operator proximity safety alert system. Automation in Construction, 19(5), 630–640. https://doi.org/10.1016/j.autcon.2010.02.009.
Virone, G., Istrate, D., Vacher, M., Noury, N., Serignat, J. F., and Demongeot, J. (2003). First steps in data fusion between a multichannel audio acquisition and an information system for home healthcare. Proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (IEEE Cat. No.03CH37439), 1364–1367. https://doi.org/10.1109/IEMBS.2003.1279557.
Wang, Z.-Q., Le Roux, J., and Hershey, J. R. (2018). Multi-Channel Deep Clustering: Discriminative Spectral and Spatial Embeddings for Speaker-Independent Speech Separation. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1–5. https://doi.org/10.1109/ICASSP.2018.8461639.
Zhou, Y., and Wan, H. (2022). Joint Measurement of Multi-channel Sound Event Detection and Localization Using Deep Neural Network. Journal of Physics: Conference Series, 2216(1), 12101. https://doi.org/10.1088/1742-6596/2216/1/012101.
Information & Authors
Information
Published In
History
Published online: Mar 18, 2024
ASCE Technical Topics:
- Artificial intelligence and machine learning
- Business management
- Computer programming
- Computing in civil engineering
- Construction engineering
- Construction equipment
- Construction management
- Construction sites
- Disaster risk management
- Employment
- Engineering fundamentals
- Equipment and machinery
- Infrastructure
- Labor
- Neural networks
- Occupational safety
- Personnel management
- Practice and Profession
- Public administration
- Public health and safety
- Risk management
- Safety
- Traffic engineering
- Traffic management
- Traffic safety
- Transportation engineering
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.