Tourism Analysis Using User-Generated Content: A Case Study of Foreign Tourists Visiting Japan on TripAdvisor

Suguru Tsujioka, Kojiro Watanabe, Akihiro Tsukamoto


In recent years, online travel service platforms such as TripAdvisor have been actively used by tourists. These services include user-generated content, which is vast and difficult to interpret manually. Several previous studies used user-generated content (e.g., social networking services and TripAdvisor) for tourism analysis. Most of these studies did not perform a systematic text analysis. In this study, we propose a method of analyzing this content to understand the characteristics of sightseeing attractions. Specifically, we analyzed the reviews of foreign tourists who visited Japanese sightseeing attractions. The review data were collected from TripAdvisor. First, a correspondence analysis was conducted to understand the similarities between sightseeing attractions. Next, a co-occurrence network analysis was conducted to derive the theme clusters for understanding the characteristics of sightseeing attractions based on the words in the review. Finally, individual analyses were conducted based on the description of the derived themes at each sightseeing attraction. The results of the analyses demonstrate that the proposed method is effective for comprehending the characteristics of each sightseeing attraction. The proposed method is useful when using user-generated content for tourism analysis.


user-generated content, TripAdvisor, correspondence analysis, co-occurrence network

Full Text:


References (n.d.). Retrieved from

Garrett, J. (2006). KWIC and dirty? Human cognition and the claims of full-text searching. Journal of Electronic Publishing, 9(1).

Higuchi, K. (2001). KHcoder. Retrieved from

Miguéns, J., Baggio, R., & Costa, C. (2008). Social media and tourism destinations: TripAdvisor case study. Advances in Tourism Research, 26(28), 1-6.

O’Connor, P. (2010). Managing a hotel’s image on TripAdvisor. Journal of Hospitality Marketing & Management, 19(7), 754-772.

TripAdvisor LCC. (2017). TripAdvisor. Retrieved from

Tsujioka, S. (2016). Town characteristics estimation using geotagged Twitter data-A case study in the Tokyo area. In Proceedings of International Conference on Civil, Architectural, and Environmental Engineering (pp. 143-147).

Tsujioka, S., Kondo, A., & Watanabe, K. (2016). Estimation of residence information of Twitter users based on their posted messages: Data for tourism development. International Journal of Research in Chemical, Metallurgical, and Civil Engineering, 3(1), 180-183.

Tsunekawa, K. (2019, August 5). The top 30 sightseeing attractions in Japan as voted by international travelers. Retrieved from



  • There are currently no refbacks.

Copyright (c) 2020 Suguru Tsujioka, Kojiro Watanabe, Akihiro Tsukamoto

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.


Program Studi Di luar Kampus Utama (PSDKU) Universitas Padjadjaran  & Research Synergy Press

Tourism and Sustainable Development Review (TSDR)

Mailing Address: 
Program Studi Di luar Kampus Utama (PSDKU) Universitas Padjadjaran
Dsn. Sukamanah Ds. Cintaratu  
Kec. Parigi 
Kab. Pangandaran (46393) - Indonesia. 

Mailing Address: 
Research Synergy Press
Jalan Nyaman no 31 
Komplek Sinergi Antapani 
Bandung 40291 - Indonesia.


The Tourism and Sustainable Development Review (TSDR) is indexed by:
Panduan Google Scholar – LPPM Universitas Stikubank SemarangMengenal Crossref (Cross Reference)Current Indexing | Jurnal Teknologi Informasi dan Pendidikan
 DOAJ (@DOAJplus) / Twitter

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.