Abstract
Topic modelling (TM) is a significant natural language processing (NLP) task and is becoming more popular, especially, in the context of literature synthesis and analysis. Despite the growing volume of studies on the use of and versatility of TM, the knowledge of TM development, especially from the perspective of bibliometrics analysis is limited. To this end, this study evaluated TM research using two techniques namely, bibliometrics analysis and TM itself to provide the current status and the pathway for future studies in the TM field. For this purpose, this study used 16,941 documents collected from Scopus database from 2004 to 2023. Results indicate that the publications on TM have increased over the years, however, the citation impact has declined. Furthermore, the scientific production on TM is concentrated in two countries namely, China and the USA. Our findings showed there are several applications of TM that are understudied, for example, TM for image segmentation and classification. This paper highlighted the future research directions, most importantly, calls for increased multidisciplinary research approaches to fully deploy TM algorithms optimally and thus, increase usage in non-computer science subject areas.
Original language | English |
---|---|
Article number | 74 |
Number of pages | 41 |
Journal | Artificial Intelligence Review |
Volume | 58 |
Issue number | 3 |
DOIs | |
Publication status | Published - 6 Jan 2025 |
Bibliographical note
Publisher Copyright:© The Author(s) 2024.
Keywords
- Artificial intelligence
- Natural language processing
- Topic modelling
- Aspect extraction
- Topic discovery