Content area

Abstract

In this research, we address the problem of stopword detection in Classical Chinese Poetry, an area that has not been explored previously. Stopword detection is crucial in text mining tasks, as identifying and removing stopwords is essential for improving the performance of various natural language processing models. Inspired by the TF-IDF method, we propose a novel approach that utilizes external knowledge to reconstruct the Term Weight matrix. Our key finding is that incorporating external knowledge significantly refines the granularity of the term weight, thereby improving the effectiveness of stopword detection. Based on these findings, we conclude that external knowledge can enhance the ability of text representation, especially for the short texts in Classical Chinese Poetry.

Details

1009240
Business indexing term
Identifier / keyword
Title
Detection of Stopwords in Classical Chinese Poetry
Author
Volume
16
Issue
2
Publication year
2025
Publication date
2025
Publisher
Science and Information (SAI) Organization Limited
Place of publication
West Yorkshire
Country of publication
United Kingdom
ISSN
2158107X
e-ISSN
21565570
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
ProQuest document ID
3180200484
Document URL
https://www.proquest.com/scholarly-journals/detection-stopwords-classical-chinese-poetry/docview/3180200484/se-2?accountid=208611
Copyright
© 2025. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-03-25
Database
ProQuest One Academic