Chapter
Mar 18, 2024

Multiple-Channel Audio Construction Equipment Dataset Preparation for Sound Detection and Localization to Prevent Collision Hazards

Publication: Construction Research Congress 2024

ABSTRACT

Construction workplaces often face unforeseen struck-by equipment hazards, leading to severe injuries and fatalities for workers. Detecting and localizing equipment sounds using multi-channel audio data has drawn interest in research. However, collecting such data for developing sound detection and localization machine learning models is challenging. Physical recordings on site required for deep learning are often infeasible due to the lack of proper sound attribute labels from heterogeneous construction sounds. This paper introduces a novel method for synthesizing overlapping and non-overlapping sound datasets in a three-dimensional space, utilizing Pyroomacoustics. The approach uses single sound data with attributes like start time, end time, azimuth, and elevation as microphone input to generate multi-channel audio output. The study successfully simulates 5,025 distinct scenario audios for both datasets, utilizing seven single-sound audiotapes. The generated large dataset can train neural network models capable of localizing equipment collision hazards in construction sites.

Get full access to this article

View all available purchase options and get full access to this chapter.

REFERENCES

Adavanne, S., Politis, A., Nikunen, J., and Virtanen, T. (2018). Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks. https://doi.org/10.1109/JSTSP.2018.2885636.
Bang, S., Hong, Y., and Kim, H. (2021). Proactive proximity monitoring with instance segmentation and unmanned aerial vehicle-acquired video-frame prediction. Computer-Aided Civil and Infrastructure Engineering, 36(6), 800–816. https://doi.org/10.1111/mice.12672.
Cantzos, D. (2008). Statistical enhancement methods for immersive audio environments and compressed audio [University of Southern California PP - United States -- California]. In ProQuest Dissertations and Theses. http://libproxy.clemson.edu/login?url=https://www.proquest.com/dissertations-theses/statistical-enhancement-methods-immersive-audio/docview/304468450/se-2?accountid=6167.
Chae, S., and Yoshida, T. (2010). Application of RFID technology to prevention of collision accident with heavy equipment. Automation in Construction, 19(3), 368–374. https://doi.org/10.1016/j.autcon.2009.12.008.
Chen, L., Yu, M., Su, D., and Yu, D. (2019). Multi-band PIT and Model Integration for Improved Multi-channel Speech Separation. ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 705–709. https://doi.org/10.1109/ICASSP.2019.8682470.
Cheng, C. F., Rashidi, A., Davenport, M. A., and Anderson, D. V. (2017). Activity analysis of construction equipment using audio signals and support vector machines. Automation in Construction, 81, 240–253. https://doi.org/10.1016/j.autcon.2017.06.005.
Dobie, R. A. (2005). Estimating Noise-Induced Permanent Threshold Shift from Audiometric Shape: The ISO-1999 Model. Ear and Hearing, 26(6), 630–635. https://doi.org/10.1097/01.aud.0000188120.14321.76.
Draxler, C., and Jänsch, K. (2004). SpeechRecorder -A universal platform independent multi-channel audio recording software. Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004, 559–562.
Vinnik, E., Itskov, P. M., and Balaban, E. (2011). Individual Differences in Sound-in-Noise Perception Are Related to the Strength of Short-Latency Neural Responses to Noise. PLOS ONE, 6(2), 1–8. https://doi.org/10.1371/journal.pone.0017266.
Elelu, K., Le, T., and Le, C. (2022). Augmented Hearing of Auditory Safety Cues for Construction Workers: A Systematic Literature Review. Sensors, 22(23). https://doi.org/10.3390/s22239135.
Elelu, K., Le, T., and Le, C. (2023). Collision Hazard Detection for Construction Worker Safety Using Audio Surveillance. Journal of Construction Engineering and Management, 149(1). https://doi.org/10.1061/JCEMD4.COENG-12561.
Gemmeke, J. F., Ellis, D. P. W., Freedman, D., Jansen, A., Lawrence, W., Moore, R. C., Plakal, M., and Ritter, M. (2017). Audio Set: An ontology and human-labeled dataset for audio events. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 776–780. https://doi.org/10.1109/ICASSP.2017.7952261.
Hinze, J. W., and Teizer, J. (2011). Visibility-related fatalities related to construction equipment. Safety Science, 49(5), 709–718. https://doi.org/10.1016/j.ssci.2011.01.007.
Jiang, H., Murdock, C., and Ithapu, V. K. (2022). Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022-June, 10534–10542. https://doi.org/10.1109/CVPR52688.2022.01029.
Lee, J., and Yang, K. (2022). Mobile Device-Based Struck-By Hazard Recognition in Construction Using a High-Frequency Sound. Sensors, 22(9), 3482. https://doi.org/10.3390/s22093482.
Marks, E. D., and Teizer, J. (2013). Method for testing proximity detection and alert technology for safe construction equipment operation. Construction Management and Economics, 31(6), 636–646. https://doi.org/10.1080/01446193.2013.783705.
Padilla-Ortiz, A. L., Machuca-Tzili, F. A., and Ibarra-Zarate, D. (2023). Smartphones, a tool for noise monitoring and noise mapping: an overview. International Journal of Environmental Science and Technology, 20(3), 3521–3536. https://doi.org/10.1007/s13762-022-04240-6.
Scheibler, R., Bezzam, E., and Dokmanic, I. (2018). Pyroomacoustics: A Python Package for Audio Room Simulation and Array Processing Algorithms. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 351–355. https://doi.org/10.1109/ICASSP.2018.8461310.
Sherif, N., Sundelius, N., and Eriksson, L. C. (2022). ROOM MAPPING FOR TUNING OF HIGH FIDELITY SOUND SYSTEMS Examiner: Mikael Ekström.
Simson, W. A. (2022). Physics-Informed Deep Learning for Advanced Medical Ultrasound. https://mediatum.ub.tum.de/doc/1634543/document.pdf.
Tan, T.-H., Lin, Y.-T., Chang, Y.-L., and Alkhaleefah, M. (2021). Sound Source Localization Using a Convolutional Neural Network and Regression Model. Sensors, 21(23). https://doi.org/10.3390/s21238031.
Teizer, J., Allread, B. S., Fullerton, C. E., and Hinze, J. (2010). Autonomous pro-active real-time construction worker and equipment operator proximity safety alert system. Automation in Construction, 19(5), 630–640. https://doi.org/10.1016/j.autcon.2010.02.009.
Virone, G., Istrate, D., Vacher, M., Noury, N., Serignat, J. F., and Demongeot, J. (2003). First steps in data fusion between a multichannel audio acquisition and an information system for home healthcare. Proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (IEEE Cat. No.03CH37439), 1364–1367. https://doi.org/10.1109/IEMBS.2003.1279557.
Wang, Z.-Q., Le Roux, J., and Hershey, J. R. (2018). Multi-Channel Deep Clustering: Discriminative Spectral and Spatial Embeddings for Speaker-Independent Speech Separation. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1–5. https://doi.org/10.1109/ICASSP.2018.8461639.
Zhou, Y., and Wan, H. (2022). Joint Measurement of Multi-channel Sound Event Detection and Localization Using Deep Neural Network. Journal of Physics: Conference Series, 2216(1), 12101. https://doi.org/10.1088/1742-6596/2216/1/012101.

Information & Authors

Information

Published In

Go to Construction Research Congress 2024
Construction Research Congress 2024
Pages: 487 - 496

History

Published online: Mar 18, 2024

Permissions

Request permissions for this article.

ASCE Technical Topics:

Authors

Affiliations

Kehinde Elelu [email protected]
1Glenn Civil Engineering Dept., Clemson Univ. ORCID: https://orcid.org/0000-0001-8185-1314. Email: [email protected]
Tuyen Le, A.M.ASCE [email protected]
2Assitance Professor, Glenn Civil Engineering Dept., Clemson Univ. Email: [email protected]
Chau Le, A.M.ASCE [email protected]
3Assitant Professor, Dept. of Civil, Construction, and Environmental Engineering, North Dakota State Univ. Email: [email protected]

Metrics & Citations

Metrics

Citations

Download citation

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

View Options

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Paper
$35.00
Add to cart
Buy E-book
$190.00
Add to cart

Get Access

Access content

Please select your options to get access

Log in/Register Log in via your institution (Shibboleth)
ASCE Members: Please log in to see member pricing

Purchase

Save for later Information on ASCE Library Cards
ASCE Library Cards let you download journal articles, proceedings papers, and available book chapters across the entire ASCE Library platform. ASCE Library Cards remain active for 24 months or until all downloads are used. Note: This content will be debited as one download at time of checkout.

Terms of Use: ASCE Library Cards are for individual, personal use only. Reselling, republishing, or forwarding the materials to libraries or reading rooms is prohibited.
ASCE Library Card (5 downloads)
$105.00
Add to cart
ASCE Library Card (20 downloads)
$280.00
Add to cart
Buy Single Paper
$35.00
Add to cart
Buy E-book
$190.00
Add to cart

Media

Figures

Other

Tables

Share

Share

Copy the content Link

Share with email

Email a colleague

Share