Analyzing sentiments in hotel booking reviews with Natural Language Processing (NLP) Open Access

Authors

  • Shakeel Basheer 1 ORCID Sheezan Farooq 2
Published online: 4 Oct 2025

Abstract

Purpose - This study aims to analyze customer sentiments in hotel reviews to gain insights into guest experiences, service quality, and overall satisfaction. The goal is to demonstrate how sentiment analysis using natural language processing (NLP) can support decision-making, service enhancement, and strategic management in the tourism and hospitality industry.
Methodology/Design/Approach - The research employs two main approaches for sentiment classification—lexicon-based (VADER) and machine learning-based (BERT). Textual hotel reviews are processed and categorized into positive, negative, and neutral sentiments. The performance of both methods is compared to evaluate accuracy and effectiveness in interpreting customer feedback.
Findings - The results reveal that key sentiment drivers include service quality, cleanliness, and value for money. The BERT model outperforms the lexicon-based VADER method in classification accuracy, demonstrating its superior ability to understand contextual nuances in customer reviews. The study confirms that advanced NLP models can provide deeper and more reliable insights for reputation management and marketing strategies.
Originality/Value - This paper contributes to the growing field of artificial intelligence applications in tourism by showcasing how NLP-based sentiment analysis can transform qualitative feedback into actionable intelligence. It highlights the potential of AI in improving customer experience analytics and suggests future research directions in multilingual sentiment analysis and real-time monitoring for dynamic decision support, benefiting hotels, travel agencies, and policymakers.

References

Alaei, A. R., Becken, S., & Stantic, B. (2020). Sentiment analysis in tourism: Capitalizing on big data. Journal of Travel Research, 59(2), 175–191. https://doi.org/10.1177/0047287518824157

Arrieta, A. B., Díaz-Rodríguez, N., Del Ser, J., Bennetot, A., Tabik, S., Barbado, A., … Herrera, F. (2020). Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information Fusion, 58, 82–115. https://doi.org/10.1016/j.inffus.2019.12.012

Basheer, S., Hassan, V., Farooq, S., Ashraf, F., & Reshi, M. (2023). Evaluating e-tourism through bibliometrics: Materials and emerging research trends. Journal of Tourismology, 9(2), 135–146. https://doi.org/10.26650/jot.2023.9.2.1380311

Basheer, S., Tramboo, I., Farooq, S., & Behboodi, Z. (2024). Redefining industry dynamics. In [Book title unavailable] (pp. 176–193). https://doi.org/10.4018/979-8-3693-2248-2.ch008

Cheng, M., & Edwards, D. (2019). Social media in tourism: A visual analytic approach. Current Issues in Tourism, 22(17), 2052–2071. https://doi.org/10.1080/13683500.2017.1369943

Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. Proceedings of NAACL-HLT 2019, 4171–4186. https://doi.org/10.18653/v1/N19-1423

Farooq, S., Farooq, B., Basheer, S., & Walia, S. (2023). Balancing environmental sustainability and privacy: Ethical dilemmas in AI-enabled smart cities. In [Book title unavailable] (pp. 263–286). https://doi.org/10.4018/979-8-3693-0892-9.ch013

García-Pablos, A., Cuadros, M., & Rigau, G. (2018). W2VLDA: Almost unsupervised system for aspect-based sentiment analysis. Expert Systems with Applications, 91, 127–137. https://doi.org/10.1016/j.eswa.2017.08.026

Guo, Y., Barnes, S. J., & Jia, Q. (2017). Mining meaning from online ratings and reviews: Tourist satisfaction analysis using latent Dirichlet allocation. Tourism Management, 59, 467–483. https://doi.org/10.1016/j.tourman.2016.09.009

Hassan, V., & Basheer, S. (2024). Unveiling the entrepreneurial mindset: Behavioural factors and green intentions among university tourism students. In Sustainable tourism (Part A, pp. 23–36). Emerald Publishing Limited. https://doi.org/10.1108/978-1-83797-979-020241002

Hassan, V. I., Basheer, S., & Fayad, S. A. (2024). Trash to treasure: Unveiling the hidden rewards of recycling in the hotel industry. In Sustainable disposal methods of food wastes in hospitality operations (pp. 258–268). IGI Global. https://doi.org/10.4018/979-8-3693-2181-2.ch017

Joshi, A., Bhattacharyya, P., & Carman, M. J. (2017). Automatic sarcasm detection: A survey. ACM Computing Surveys, 50(5), 1–22. https://doi.org/10.1145/3124420

Khan, F. H., Qamar, U., & Bashir, S. (2019). A multiclass, multilingual sentiment analysis model for Arabic, Roman Urdu, and English. Journal of Intelligent & Fuzzy Systems, 36(5), 4861–4871. https://doi.org/10.3233/JIFS-18198

Liang, S., Schuckert, M., Law, R., & Chen, C.-C. (2020). The importance of marketer-generated content to peer-to-peer property rental platforms: Evidence from Airbnb. International Journal of Hospitality Management, 84, 102329. https://doi.org/10.1016/j.ijhm.2019.102329

Liu, B. (2020). Sentiment analysis: Mining opinions, sentiments, and emotions (2nd ed.). Cambridge University Press.

Medhat, W., Hassan, A., & Korashy, H. (2014). Sentiment analysis algorithms and applications: A survey. Ain Shams Engineering Journal, 5(4), 1093–1113. https://doi.org/10.1016/j.asej.2014.04.011

Mohammad Malik, Y., Walia, S., Erkol Bayram, G., Valeri, M., & Basheer, S. (2024). Community attachment, tourist contact, and resident attitudes toward tourism development of Kashmir Valley. Tourism, 72(3). https://doi.org/10.37741/t.72.3.9

Öğüt, H., & Taş, B. K. O. (2012). The influence of internet customer reviews on online sales and prices in the hotel industry. Service Industries Journal, 32(2), 197–214. https://doi.org/10.1080/02642069.2010.529436

Rana, T. A., & Cheah, Y.-N. (2016). Aspect extraction in sentiment analysis: Comparative analysis and survey. Artificial Intelligence Review, 46(4), 459–483. https://doi.org/10.1007/s10462-016-9472-z

Sun, C., Qiu, X., Xu, Y., & Huang, X. (2019). How to fine-tune BERT for text classification? In Chinese computational linguistics (Vol. 11856, pp. 194–206). https://doi.org/10.1007/978-3-030-32381-3_16

Xiang, Z., Schwartz, Z., Gerdes, J. H., & Uysal, M. (2015). What can big data and text analytics tell us about hotel guest experience and satisfaction? International Journal of Hospitality Management, 44, 120–130. https://doi.org/10.1016/j.ijhm.2014.10.013

Permission is granted subject to the terms of the License under which the work was published. Permission will be required if your reuse is not covered by the terms of the License.

How to Cite

Basheer, S., & Farooq, S. (2025). Analyzing sentiments in hotel booking reviews with Natural Language Processing (NLP). Journal of Rural Tourism , 3(1), 30-39. https://doi.org/10.70310/jrt.2025.03010671

Downloads

Download data is not yet available.

Additional Information

  • Sheezan Farooq
    Research scholar at Department of computer science ,  Islamic University Science and Technology Kashmir, India

    Research scholar at Department of computer science ,  Islamic University Science and Technology Kashmir