Tourism Analysis Using User-Generated Content: A Case Study of Foreign Tourists Visiting Japan on TripAdvisor

Suguru Tsujioka, Kojiro Watanabe, Akihiro Tsukamoto


In recent years, online travel service platforms such as TripAdvisor have been actively used by tourists. These services include user-generated content, which is vast and difficult to interpret manually. Several previous studies used user-generated content (e.g., social networking services and TripAdvisor) for tourism analysis. Most of these studies did not perform a systematic text analysis. In this study, we propose a method of analyzing this content to understand the characteristics of sightseeing attractions. Specifically, we analyzed the reviews of foreign tourists who visited Japanese sightseeing attractions. The review data were collected from TripAdvisor. First, a correspondence analysis was conducted to understand the similarities between sightseeing attractions. Next, a co-occurrence network analysis was conducted to derive the theme clusters for understanding the characteristics of sightseeing attractions based on the words in the review. Finally, individual analyses were conducted based on the description of the derived themes at each sightseeing attraction. The results of the analyses demonstrate that the proposed method is effective for comprehending the characteristics of each sightseeing attraction. The proposed method is useful when using user-generated content for tourism analysis.


user-generated content, TripAdvisor, correspondence analysis, co-occurrence network

Full Text:


References (n.d.). Retrieved from

Garrett, J. (2006). KWIC and dirty? Human cognition and the claims of full-text searching. Journal of Electronic Publishing, 9(1).

Higuchi, K. (2001). KHcoder. Retrieved from

Miguéns, J., Baggio, R., & Costa, C. (2008). Social media and tourism destinations: TripAdvisor case study. Advances in Tourism Research, 26(28), 1-6.

O’Connor, P. (2010). Managing a hotel’s image on TripAdvisor. Journal of Hospitality Marketing & Management, 19(7), 754-772.

TripAdvisor LCC. (2017). TripAdvisor. Retrieved from

Tsujioka, S. (2016). Town characteristics estimation using geotagged Twitter data-A case study in the Tokyo area. In Proceedings of International Conference on Civil, Architectural, and Environmental Engineering (pp. 143-147).

Tsujioka, S., Kondo, A., & Watanabe, K. (2016). Estimation of residence information of Twitter users based on their posted messages: Data for tourism development. International Journal of Research in Chemical, Metallurgical, and Civil Engineering, 3(1), 180-183.

Tsunekawa, K. (2019, August 5). The top 30 sightseeing attractions in Japan as voted by international travelers. Retrieved from



  • There are currently no refbacks.

Copyright (c) 2020 Tourism and Sustainable Development Review

License URL:

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.