Potential of Vision-Based 6-DoF Pose Estimation for Cuboid-Shape Objects from Construction Jobsites
Publication: Computing in Civil Engineering 2023
ABSTRACT
Technological advancement transforms construction jobsites into more intelligent systems. Among the technologies, robotics relies on environmental perception, such as the motion and dynamics of interacting objects; a digital twin expands its capabilities by collecting real-time data of the physical twin, including spatial and physical properties. Given booming attention and efforts in such technologies, there lacks a non-invasive approach to collect jobsite objects’ 3D location and orientation, which is a required step for physically based modeling. As an initial effort, this paper proposes a vision-based approach to estimate the 6-DoF object pose of construction jobsite objects from a single image while leveraging deep learning. Tests are performed on a brick and a concrete block of cuboid shape. The evaluation against ground truth data, collected by an RGB-D camera, presents a certain potential for utilizing a non-invasive perception approach to collect jobsite objects’ advanced kinematic data for extended capabilities of intelligent systems.
Get full access to this article
View all available purchase options and get full access to this chapter.
REFERENCES
“BOP: Benchmark for 6D Object Pose Estimation.” n.d. Accessed March 25, 2023. https://bop.felk.cvut.cz/challenges/.
Cao, Y., and T. Li. 2020. “Review of antiswing control of shipboard cranes.” IEEE/CAA Journal of Automatica Sinica, 7 (2): 346–354. https://doi.org/10.1109/JAS.2020.1003024.
Cignoni, P., M. Callieri, M. Corsini, M. Dellepiane, F. Ganovelli, and G. Ranzuglia. 2008. MeshLab: an Open-Source Mesh Processing Tool.
Dimitrov, A., and M. Golparvar-Fard. 2014. “Vision-based material recognition for automated monitoring of construction progress and generating building information modeling from unordered site image collections.” Advanced Engineering Informatics, 28 (1): 37–49. https://doi.org/10.1016/j.aei.2013.11.002.
Fan, Z., Y. Zhu, Y. He, Q. Sun, H. Liu, and J. He. 2023. “Deep Learning on Monocular Object Pose Detection and Tracking: A Comprehensive Overview.” ACM Comput. Surv., 55 (4): 1–40. https://doi.org/10.1145/3524496.
Garrido-Jurado, S., R. Muñoz-Salinas, F. J. Madrid-Cuevas, and M. J. Marín-Jiménez. 2014. “Automatic generation and detection of highly reliable fiducial markers under occlusion.” Pattern Recognition, 47 (6): 2280–2292. https://doi.org/10.1016/j.patcog.2014.01.005.
Ham, N., and S.-H. Lee. 2018. “Empirical Study on Structural Safety Diagnosis of Large-Scale Civil Infrastructure Using Laser Scanning and BIM.” Sustainability, 10 (11): 4024. Multidisciplinary Digital Publishing Institute. https://doi.org/10.3390/su10114024.
Ham, S.-H., M.-I. Roh, H. Lee, and S. Ha. 2015. “Multibody dynamic analysis of a heavy load suspended by a floating crane with constraint-based wire rope.” Ocean Engineering, 109: 145–160. https://doi.org/10.1016/j.oceaneng.2015.08.050.
Hodan, T., P. Haluza, S. Obdrzalek, J. Matas, M. Lourakis, and X. Zabulis. 2017. “T-LESS: An RGB-D Dataset for 6D Pose Estimation of Texture-Less Objects.” 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), 880–888. Santa Rosa, CA, USA: IEEE.
Kim, C., H. Son, and C. Kim. 2013. “Automated construction progress measurement using a 4D building information model and 3D data.” Automation in Construction, 31: 75–82. https://doi.org/10.1016/j.autcon.2012.11.041.
Li, Z., G. Wang, and X. Ji. 2019. “CDPN: Coordinates-Based Disentangled Pose Network for Real-Time RGB-Based 6-DoF Object Pose Estimation.” 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 7677–7686. Seoul, Korea (South): IEEE.
Pagès, J., X. Armangué, J. Salvi, J. Freixenet, and J. Martí. 2001. “A Computer Vision System for Autonomous Forklift Vehicles in Industrial Environments.” Proc. of the 9th Mediterranean Conference on Control and Automation MEDS, 1–6.
Park, K., T. Patten, and M. Vincze. 2019. “Pix2Pose: Pixel-Wise Coordinate Regression of Objects for 6D Pose Estimation.” 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 7667–7676. Seoul, Korea (South): IEEE.
Soltani, M. M., Z. Zhu, and A. Hammad. 2018. “Framework for Location Data Fusion and Pose Estimation of Excavators Using Stereo Vision.” J. Comput. Civ. Eng., 32 (6): 04018045. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000783.
Zhu, Z., X. Ren, and Z. Chen. 2016. “Visual Tracking of Construction Jobsite Workforce and Equipment with Particle Filtering.” J. Comput. Civ. Eng., 30 (6): 04016023. https://doi.org/10.1061/(ASCE)CP.1943-5487.0000573.
Information & Authors
Information
Published In
History
Published online: Jan 25, 2024
ASCE Technical Topics:
- Analysis (by type)
- Automation and robotics
- Concrete
- Concrete blocks
- Construction engineering
- Construction management
- Continuum mechanics
- Data collection
- Dynamic properties
- Dynamics (solid mechanics)
- Engineering fundamentals
- Engineering materials (by type)
- Engineering mechanics
- Materials engineering
- Methodology (by type)
- Models (by type)
- Motion (dynamics)
- Research methods (by type)
- Solid mechanics
- Spatial analysis
- Spatial data
- Structural behavior
- Structural engineering
- Systems engineering
- Three-dimensional models
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.