Abstract

Buzzword analysis is an important research topic in natural language processing, and its results can provide technical support for public opinion analysis. The purpose of extracting media buzzwords is to analyze the patterns and trends of language change within a given scope. Traditional buzzword extraction based on word features suffers from problems such as low accuracy and low coverage. This paper proposes a media buzzword analysis method that combines phrase vectors with a topic model; the core idea is to integrate semantic similarity features and to use visualization technology to show the overall patterns of language change more intuitively. The visualization analysis draws on a large body of corpus statistics: it calculates the distance between words, converts that distance into similarity, uses the word similarity to show the distribution relationships among different words, and finally analyzes them from a quantitative perspective. Our model outperforms the traditional system, and the research results can provide corpus and model support for subsequent research.
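
The distance-to-similarity step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which distance measure or conversion the authors use, so everything here is an assumption made for illustration: Euclidean distance, the mapping 1 / (1 + d), the function names, and the toy vectors are all hypothetical.

```python
import math
from itertools import combinations


def euclidean_distance(u, v):
    """Euclidean distance between two equal-length vectors (assumed metric)."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def distance_to_similarity(d):
    """Map a distance in [0, inf) to a similarity in (0, 1].

    The 1 / (1 + d) mapping is an assumption; the abstract does not
    state the conversion the authors apply.
    """
    return 1.0 / (1.0 + d)


def pairwise_similarity(vectors):
    """Return {(word_a, word_b): similarity} for every unordered word pair."""
    return {
        (a, b): distance_to_similarity(euclidean_distance(vectors[a], vectors[b]))
        for a, b in combinations(sorted(vectors), 2)
    }


# Hypothetical toy vectors standing in for learned word or phrase vectors.
toy_vectors = {
    "alpha": [0.0, 0.0],
    "beta": [3.0, 4.0],
    "gamma": [0.0, 1.0],
}

similarities = pairwise_similarity(toy_vectors)
for pair, score in sorted(similarities.items(), key=lambda kv: -kv[1]):
    print(pair, round(score, 3))
```

With this mapping, identical vectors get similarity 1 and the score decreases toward 0 as the distance grows; a table of such scores is the kind of input a visualization of word distribution could start from.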

Details

Title
Media Buzzword Analysis Integrated with Phrase Vectors and Topic Model
Author
Zhu, Dengyun 1; Gai, Hailong 2; Yu, Hongzhi 3; Jing, Rong 2; Wan, Fucheng 3

1 Key Laboratory of Linguistic and Cultural Computing Ministry of Education, Northwest Minzu University, Lanzhou, Gansu 730030, China; Key Laboratory of China's Ethnic Languages and Intelligent Processing of Gansu Province, Northwest Minzu University, Lanzhou, Gansu, China
2 Key Laboratory of Linguistic and Cultural Computing Ministry of Education, Northwest Minzu University, Lanzhou, Gansu 730030, China
3 Key Laboratory of China's Ethnic Languages and Intelligent Processing of Gansu Province, Northwest Minzu University, Lanzhou, Gansu, China
Pages
745-751
Publication year
2024
Publication date
2024
Publisher
Engineering and Scientific Research Groups
e-ISSN
11125209
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3074172033
Copyright
© 2024. This work is published under https://creativecommons.org/licenses/by/4.0/legalcode (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.