An Online Housing Reputation Assessment Framework Based on Text Mining and Visualization Technologies
Publication: Journal of Construction Engineering and Management
Volume 150, Issue 8
Abstract
Learning from online comments is essential for enhancing understanding and improving online housing reputation (OHR). However, two significant issues require attention. First, analyzing online comments for reputation information extraction is a labor-intensive and time-consuming task. Second, most existing online housing information platforms lack effective visual aids, merely presenting the average comment ratings or listing comment texts without secondary interpretation. To address these challenges, this study proposes an OHR assessment framework based on text mining and visualization technologies. This study first evaluates the performance of eight sentiment analysis models for analyzing housing comments, and the attention-based BiLSTM model achieved the highest accuracy (83.57%). Additionally, a housing attribute ontology is constructed to reveal eight critical attributes influencing OHR. Finally, a reputation visualization scheme is designed to comprehensively present OHR. A case study for analyzing online comments from three construction enterprises reveals the advantages and feasibility of the proposed framework for assessing OHR. This study contributes to the body of knowledge by establishing the connection between housing comments and OHR, greatly advancing the research in the construction domain’s reputation management. Furthermore, OHR analysis can facilitate decision making optimization for both consumers and managers, which has theoretical and practical significance for the healthy and sustainable development of the online housing market.
Practical Applications
Developing a sound housing reputation assessment method is crucial for the development of the online housing market. Emerging information technologies like natural language processing and deep learning appear promising in the comment-based reputation analysis. This study proposes an online housing reputation assessment framework that combines text mining and visualization technologies to automatically analyze housing comments and generate online housing reputations. The findings of this study establish the connection between housing comments and reputation and reveal eight key attributes influencing online housing reputation, contributing to research on reputation management in the construction field. The potential practical applications of this study are elucidated from the perspectives of consumers, construction enterprises, governments, and the construction industry. For consumers, reputation assessment can optimize the decision making process and promote public supervision to force construction enterprises to improve the quality of their housing products. Construction enterprises can timely improve their product strategies based on reputation information to increase revenue. Governments can strengthen the supervision of low-reputation enterprises and low quality housing products, thereby enhancing government credibility. The proposed framework can encourage technological innovation and digital transformation in the construction industry and even can be extended to other industries for online reputation assessment.
Get full access to this article
View all available purchase options and get full access to this article.
Data Availability Statement
Some or all data, models, or code that support the findings of this study are available from the corresponding author upon reasonable request.
Acknowledgments
This research is supported by the National Key R&D Program of China (2022YFC3801700) and the National Natural Science Foundation of China (Grant Nos. 72271106 and U21A20151).
References
Ali, F., E. K. Kim, and Y.-G. Kim. 2015. “Type-2 fuzzy ontology-based opinion mining and information extraction: A proposal to automate the hotel reservation system.” Appl. Intell. 42 (Apr): 481–500. https://doi.org/10.1007/s10489-014-0609-y.
Bahdanau, D., K. Cho, and Y. Bengio. 2014. “Neural machine translation by jointly learning to align and translate.” Preprint, submitted September 1, 2014. https://arxiv.org/abs/1409.0473.
Bakos, Y., and C. Dellarocas. 2011. “Cooperation without enforcement? A comparative analysis of litigation and online reputation as quality assurance mechanisms.” Manage. Sci. 57 (11): 1944–1962. https://doi.org/10.1287/mnsc.1110.1390.
Blei, D. M., A. Y. Ng, and M. I. Jordan. 2003. “Latent dirichlet allocation.” J. Mach. Learn. Res. 3: 993–1022.
Boumhidi, A., and E. H. Nfaoui. 2021. “Leveraging Lexicon-based and sentiment analysis techniques for online reputation generation.” Int. J. Intell. Eng. Syst. 14 (6): 274. https://doi.org/10.22266/ijies2021.1231.25.
Cabral, L. 2012. “Reputation on the internet.” In The Oxford handbook of the digital economy, 343–354. New York: Oxford University Press.
Chang, T., S. Chi, and S.-B. Im. 2022. “Understanding user experience and satisfaction with urban infrastructure through text mining of civil complaint data.” J. Constr. Eng. Manage. 148 (8): 04022061. https://doi.org/10.1061/(ASCE)CO.1943-7862.0002308.
Chen, R., Y. Zheng, W. Xu, M. Liu, and J. Wang. 2018. “Secondhand seller reputation in online markets: A text analytics framework.” Decis. Support Syst. 108 (Apr): 96–106. https://doi.org/10.1016/j.dss.2018.02.008.
Chevalier, J. A., and D. Mayzlin. 2006. “The effect of word of mouth on sales: Online book reviews.” J. Mark. Res. 43 (3): 345–354. https://doi.org/10.1509/jmkr.43.3.345.
Fan, Y., J. Ju, and M. Xiao. 2016. “Reputation premium and reputation management: Evidence from the largest e-commerce platform in China.” Int. J. Ind. Organ. 46 (May): 63–76. https://doi.org/10.1016/j.ijindorg.2016.01.004.
Fombrun, C. J. 1996. Reputation: Realizing value from the corporate image. Boston, MA: Harvard Business School Press.
Galbreath, J., and P. Shum. 2012. “Do customer satisfaction and reputation mediate the CSR–FP link? Evidence from Australia.” Aust. J. Manage. 37 (2): 211–229. https://doi.org/10.1177/0312896211432941.
Ghose, A., and P. G. Ipeirotis. 2010. “Estimating the helpfulness and economic impact of product reviews: Mining text and reviewer characteristics.” IEEE Trans. Knowl. Data Eng. 23 (10): 1498–1512. https://doi.org/10.1109/TKDE.2010.188.
Goh, Y. M., and C. Ubeynarayana. 2017. “Construction accident narrative classification: An evaluation of text mining techniques.” Accid. Anal. Prev. 108 (Nov): 122–130. https://doi.org/10.1016/j.aap.2017.08.026.
Guo, Y., F. Wang, C. Xing, and X. Lu. 2022. “Mining multi-brand characteristics from online reviews for competitive analysis: A brand joint model using latent Dirichlet allocation.” Electron. Commerce Res. Appl. 53 (May): 101141. https://doi.org/10.1016/j.elerap.2022.101141.
Heng, Y., Z. Gao, Y. Jiang, and X. Chen. 2018. “Exploring hidden factors behind online food shopping from Amazon reviews: A topic mining approach.” J. Retailing Consum. Serv. 42 (May): 161–168. https://doi.org/10.1016/j.jretconser.2018.02.006.
Hochreiter, S., and J. Schmidhuber. 1997. “Long short-term memory.” Neural Comput. 9 (8): 1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735.
Hu, M., and B. Liu. 2004. “Mining and summarizing customer reviews.” In Proc., 10th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, 168–177. New York: Association for Computing Machinery. https://doi.org/10.1145/1014052.1014073.
Huang, B., and K. M. Carley. 2019. “Syntax-aware aspect level sentiment classification with graph attention networks.” Preprint, submitted September 5, 2019. https://arxiv.org/abs/1909.02606.
Jallan, Y., and B. Ashuri. 2020. “Text mining of the securities and exchange commission financial filings of publicly traded construction firms using deep learning to identify and assess risk.” J. Constr. Eng. Manage. 146 (12): 04020137. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001932.
Jiang, H., P. Lin, and M. Qiang. 2016. “Public-opinion sentiment analysis for large hydro projects.” J. Constr. Eng. Manage. 142 (2): 05015013. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001039.
Koufakou, A., J. Gosselin, and D. Guo. 2016. “Using data mining to extract knowledge from student evaluation comments in undergraduate courses.” In Proc., 2016 Int. Joint Conf. on Neural Networks (IJCNN), 3138–3142. New York: IEEE. https://doi.org/10.1109/IJCNN.2016.7727599.
Lazhar, F. 2018. “Mining hidden opinions from objective sentences.” Int. J. Data Min. Model. Manage. 10 (2): 113–126. https://doi.org/10.1504/IJDMMM.2018.092534.
Lee, C.-T., and L.-M. Sun. 2019. “Early warning mechanism of agricultural network public opinion based on text mining.” Rev. Facultad Agronomia Universidad Zulia 36 (2): 359–369. https://doi.org/10.1007/978-981-16-5857-0_108.
Li, H., L. Lv, J. Zuo, L. Su, L. Wang, and C. Yuan. 2020. “Dynamic reputation incentive mechanism for urban water environment treatment PPP projects.” J. Constr. Eng. Manage. 146 (8): 04020088. https://doi.org/10.1061/(ASCE)CO.1943-7862.0001879.
Lin, Z., D. Li, B. Janamanchi, and W. Huang. 2006. “Reputation distribution and consumer-to-consumer online auction market structure: An exploratory study.” Decis. Support Syst. 41 (2): 435–448. https://doi.org/10.1016/j.dss.2004.07.006.
Luca, M. 2016. Reviews, reputation, and revenue: The case of Yelp.com (March 15, 2016). Boston, MA: Harvard Business School.
Mao, Y., and W. Wu. 2011. “Fuzzy real option evaluation of real estate project based on risk analysis.” Syst. Eng. Procedia 1 (Jan): 228–235. https://doi.org/10.1016/j.sepro.2011.08.036.
Mcauliffe, J., and D. Blei. 2007. “Supervised topic models.” In Advances in neural information processing systems, 20. Cambridge, MA: MIT Press. https://doi.org/10.48550/arXiv.1003.0783.
Mehra, P. 2023. “Unexpected surprise: Emotion analysis and aspect based sentiment analysis (ABSA) of user generated comments to study behavioral intentions of tourists.” Tourism Manage. Perspect. 45 (Jan): 101063. https://doi.org/10.1016/j.tmp.2022.101063.
Mitra, S., and M. Jenamani. 2021. “Helpfulness of online consumer reviews: A multi-perspective approach.” Inf. Process. Manage. 58 (3): 102538. https://doi.org/10.1016/j.ipm.2021.102538.
Mohammad, S. M., S. Kiritchenko, and X. Zhu. 2013. “NRC-Canada: Building the state-of-the-art in sentiment analysis of tweets.” Preprint, submitted August 28, 2013. https://arxiv.org/abs/1308.6242.
Moreno, A., and C. Terwiesch. 2014. “Doing business with strangers: Reputation in online service marketplaces.” Inf. Syst. Res. 25 (4): 865–886. https://doi.org/10.1287/isre.2014.0549.
Mullen, T., and N. Collier. 2004. “Sentiment analysis using support vector machines with diverse information sources.” In Proc., 2004 Conf. on Empirical Methods in Natural Language Processing, edited by D. Lin and D. Wu, 412–418. Barcelona, Spain: Association for Computational Linguistics.
Mulliner, E., and M. Algrnas. 2018. “Preferences for housing attributes in Saudi Arabia: A comparison between consumers’ and property practitioners’ views.” Cities 83 (Dec): 152–164. https://doi.org/10.1016/j.cities.2018.06.018.
Oza, K. S., and P. G. Naik. 2016. “Prediction of online lectures popularity: A text mining approach.” Procedia Comput. Sci. 92 (Jan): 468–474. https://doi.org/10.1016/j.procs.2016.07.369.
Pan, M. C., C. Y. Kuo, C. T. Pan, and W. Tu. 2013. “Antecedent of purchase intention: Online seller reputation, product category and surcharge.” Internet Res. 23 (4): 507–522. https://doi.org/10.1108/IntR-09-2012-0175.
Park, D.-H., and J. Lee. 2008. “eWOM overload and its effect on consumer behavioral intention depending on consumer involvement.” Electron. Commerce Res. Appl. 7 (4): 386–398. https://doi.org/10.1016/j.elerap.2007.11.004.
Park, N., and K. M. Lee. 2007. “Effects of online news forum on corporate reputation.” Public Relat. Rev. 33 (3): 346–348. https://doi.org/10.1016/j.pubrev.2007.05.018.
Qiu, G., B. Liu, J. Bu, and C. Chen. 2011. “Opinion word expansion and target extraction through double propagation.” Comput. Ling. 37 (1): 9–27. https://doi.org/10.1162/coli_a_00034.
Rahab, H., A. Zitouni, and M. Djoudi. 2021. “SANA: Sentiment analysis on newspapers comments in Algeria.” J. King Saud Univ. Comput. Inf. Sci. 33 (7): 899–907. https://doi.org/10.1016/j.jksuci.2019.04.012.
Rao, D., and D. Ravichandran. 2009. “Semi-supervised polarity lexicon induction.” In Proc., 12th Conf. of the European Chapter of the ACL (EACL 2009), 675–682. New York: Association for Computational Linguistics. https://doi.org/10.5555/1609067.1609142.
Ren, X., Y. Li, and M. Guo. 2023. “Dynamically identifying and evaluating key barriers to promoting prefabricated buildings: Text mining approach.” J. Constr. Eng. Manage. 149 (9): 04023075. https://doi.org/10.1061/JCEMD4.COENG-13285.
Resnick, P., and R. Zeckhauser. 2002. “Trust among strangers in Internet transactions: Empirical analysis of eBay’s reputation system.” In The economics of the internet and e-commerce, 127–157. Leeds, UK: Emerald Group Publishing.
Roberts, P. W., and G. R. Dowling. 2002. “Corporate reputation and sustained superior financial performance.” Strategic Manage. J. 23 (12): 1077–1093. https://doi.org/10.1002/smj.274.
Rumelhart, D. E., G. E. Hinton, and R. J. Williams. 1986. “Learning representations by back-propagating errors.” Nature 323 (6088): 533–536. https://doi.org/10.1038/323533a0.
Salminen, M., and A. Ainamo. 2015. “Online news and corporate reputation: A neurophysiological investigation.” J. Media Psychol.: Theories, Methods, Appl. 27 (3): 118–133. https://doi.org/10.1027/1864-1105/a000149.
Sen, S., and D. Lerman. 2007. “Why are you telling me this? An examination into negative consumer reviews on the web.” J. Interact. Mark. 21 (4): 76–94. https://doi.org/10.1002/dir.20090.
Sievert, C., and K. Shirley. 2014. “LDAvis: A method for visualizing and interpreting topics.” In Proc., Workshop on Interactive Language Learning, Visualization, and Interfaces, edited by J. Chuang, S. Green, M. Hearst, J. Heer, and P. Koehn, 63–70. New York: Association for Computational Linguistics. https://doi.org/10.3115/v1/W14-3110.
Sun, Y., R. Huang, D. Chen, and H. Li. 2008. “Fuzzy set-based risk evaluation model for real estate projects.” Supplement, Tsinghua Sci. Technol. 13 (S1): 158–164. https://doi.org/10.1016/S1007-0214(08)70143-3.
Wang, X., and J. Li. 2023. “The online reputation governance mechanism of the internet e-commerce platform enterprises: Logical approach and realization path.” Contemp. Econ. Manage. 45 (1): 18–28. https://doi.org/10.13253/j.cnki.ddjjgl.2023.01.003.
Wang, Y., M. Huang, X. Zhu, and L. Zhao. 2016. “Attention-based LSTM for aspect-level sentiment classification.” In Proc., 2016 Conf. on Empirical Methods in Natural Language Processing, edited by J. Su, K. Duh, and X. Carreras, 606–615. Austin, TX: Association for Computational Linguistics. https://doi.org/10.18653/v1/D16-1058.
Wu, D. D., L. Zheng, and D. L. Olson. 2014. “A decision support approach for online stock forum sentiment analysis.” IEEE Trans. Syst. Man Cybern.: Syst. 44 (8): 1077–1087. https://doi.org/10.1109/TSMC.2013.2295353.
Xiao, C., B. Zhong, and H. Luo. 2021. “Research on public participation in project quality supervision mode under reputation mechanism.” J. Eng. Manage. 35 (6): 102–106. https://doi.org/10.13991/j.cnki.jem.2021.06.018.
Xu, G., Y. Meng, X. Qiu, Z. Yu, and X. Wu. 2019. “Sentiment analysis of comment texts based on BiLSTM.” IEEE Access 7 (Apr): 51522–51532. https://doi.org/10.1109/ACCESS.2019.2909919.
Xu, H., and Y. Lv. 2022. “Mining and application of tourism online review text based on natural language processing and text classification technology.” Wireless Commun. Mobile Comput. 2022 (May): 1–13. https://doi.org/10.1155/2022/9905114.
Xue, J., G. Q. Shen, Y. Li, S. Han, and X. Chu. 2021. “Dynamic analysis on public concerns in Hong Kong-Zhuhai-Macao Bridge: Integrated topic and sentiment modeling approach.” J. Constr. Eng. Manage. 147 (6): 04021049. https://doi.org/10.1061/(ASCE)CO.1943-7862.0002066.
Yan, Z., X. Jing, and W. Pedrycz. 2017. “Fusing and mining opinions for reputation generation.” Inf. Fusion 36 (Jul): 172–184. https://doi.org/10.1016/j.inffus.2016.11.011.
Yang, J., X. Zou, W. Zhang, and H. Han. 2021. “Microblog sentiment analysis via embedding social contexts into an attentive LSTM.” Eng. Appl. Artif. Intell. 97 (Jan): 104048. https://doi.org/10.1016/j.engappai.2020.104048.
Zhong, B., X. Pan, P. E. Love, J. Sun, and C. Tao. 2020. “Hazard analysis: A deep learning and text mining framework for accident prevention.” Adv. Eng. Inf. 46 (Oct): 101152. https://doi.org/10.1016/j.aei.2020.101152.
Information & Authors
Information
Published In
Copyright
© 2024 American Society of Civil Engineers.
History
Received: Oct 21, 2023
Accepted: Jan 29, 2024
Published online: May 16, 2024
Published in print: Aug 1, 2024
Discussion open until: Oct 16, 2024
ASCE Technical Topics:
- Case studies
- Computer networks
- Computer vision and image processing
- Computing in civil engineering
- Construction engineering
- Construction industry
- Construction management
- Engineering fundamentals
- Feasibility studies
- Housing
- Infrastructure
- Internet
- Methodology (by type)
- Model accuracy
- Models (by type)
- Research methods (by type)
- Residential construction
- Urban and regional development
Authors
Metrics & Citations
Metrics
Citations
Download citation
If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.