Application of Classification and Word Embedding Techniques to Evaluate Tourists’ Hotel-revisit Intention

Evripides Christodoulou, Andreas Gregoriades, Maria Pampaka, Herodotos Herodotou

2021

Abstract

Revisit intention is a key indicator of future business performance in the hospitality industry. This work focuses on identifying patterns in user-generated data that explain why tourists may revisit a hotel they stayed at during their holidays, and aims to identify differences between two classes of hotels (4-5 star and 2-3 star). The method utilises data from TripAdvisor retrieved using a scraper application. Topic modelling is initially performed to identify the main themes discussed in each tourist review. Subsequently, reviews are labelled depending on whether their author mentions an intention to revisit the hotel in the future, using an ontology of revisit intention generated with Word2Vec word embedding. The topics identified in the labelled reviews are used to train an Extreme Gradient Boosting (XGBoost) model to predict revisit intention, which is then used to identify topic patterns in reviews that relate to revisit intention. The learned model achieved satisfactory performance, and an explainable machine learning technique was used to identify the most influential topics related to revisit intention and to illustrate visually the rules embedded in the model. The method is applied to reviews from tourists who visited Cyprus between 2009 and 2019. Results highlight that staff professionalism (e.g., politeness, smile) is critical for both classes of hotels; however, its effect is smaller for 2-3 star hotels, where cleanliness has a greater influence on revisiting.
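The labelling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the seed term, the similarity threshold, the three-dimensional toy vectors, and the function names `expand_terms` and `label_review` are all assumptions made for illustration. In practice the vectors would come from a Word2Vec model trained on the review corpus, and the resulting labels would serve as targets for a classifier such as XGBoost trained on per-review topic features.

```python
import math
import re


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def expand_terms(seeds, vectors, threshold=0.8):
    """Grow a seed vocabulary with words whose embedding is close to any seed.

    `threshold` is a hypothetical cut-off, not a value taken from the paper.
    """
    expanded = set(seeds)
    for word, vec in vectors.items():
        if any(s in vectors and cosine(vec, vectors[s]) >= threshold for s in seeds):
            expanded.add(word)
    return expanded


def label_review(text, terms):
    """Return 1 if the review contains any revisit-intention term, else 0."""
    tokens = set(re.findall(r"[a-z]+", text.lower()))
    return int(bool(tokens & terms))


# Toy embeddings, invented for illustration only (stand-ins for Word2Vec output).
TOY_VECTORS = {
    "return": (0.9, 0.1, 0.0),
    "revisit": (0.85, 0.15, 0.05),
    "again": (0.8, 0.2, 0.1),
    "pool": (0.0, 0.9, 0.3),
    "noisy": (0.1, 0.2, 0.9),
}

reviews = [
    "We will definitely revisit next summer.",
    "The pool was small and the room was noisy.",
]

terms = expand_terms({"return"}, TOY_VECTORS)
labels = [label_review(r, terms) for r in reviews]
```

Plain keyword matching of this kind ignores negation ("we will not return"), so a real pipeline would need additional handling for such cases.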


Paper Citation


in Harvard Style

Christodoulou E., Gregoriades A., Pampaka M. and Herodotou H. (2021). Application of Classification and Word Embedding Techniques to Evaluate Tourists’ Hotel-revisit Intention. In Proceedings of the 23rd International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-758-509-8, pages 216-223. DOI: 10.5220/0010453502160223


in Bibtex Style

@conference{iceis21,
author={Evripides Christodoulou and Andreas Gregoriades and Maria Pampaka and Herodotos Herodotou},
title={Application of Classification and Word Embedding Techniques to Evaluate Tourists’ Hotel-revisit Intention},
booktitle={Proceedings of the 23rd International Conference on Enterprise Information Systems - Volume 1: ICEIS},
year={2021},
pages={216-223},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010453502160223},
isbn={978-989-758-509-8},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 23rd International Conference on Enterprise Information Systems - Volume 1: ICEIS
TI - Application of Classification and Word Embedding Techniques to Evaluate Tourists’ Hotel-revisit Intention
SN - 978-989-758-509-8
AU - Christodoulou E.
AU - Gregoriades A.
AU - Pampaka M.
AU - Herodotou H.
PY - 2021
SP - 216
EP - 223
DO - 10.5220/0010453502160223