Bert-Based Text Keyword Extraction

Abstract

With the explosive growth of network information, in order to obtain the information faster and more accurately, this paper proposes a text keyword extraction method based on Bert. Firstly, the key sentence set is extracted from the background material by Bert model as the information supplement to the text. Then, based on the extended text, TF-IDF, text rank and LDA are combined to extract keywords. The experimental results on real science and technology academic paper data sets show that the performance of the fusion multi type feature combination algorithm is better than that of the traditional single algorithm; and the F value of the algorithm is increased by 1.5% by extracting key sentences from background materials, which further improves the effect of key word extraction.

Details

Title

Bert-Based Text Keyword Extraction

Author

Qian, Yili¹; Jia, Chaochao¹; Liu, Yimei¹

¹ Shanxi University, Taiyuan 030006, China

Publication year

2021

Publication date

Aug 2021

Publisher

IOP Publishing

ISSN

17426588

e-ISSN

17426596

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1088/1742-6596/1992/4/042077

ProQuest document ID

2566504509

© 2021. This work is published under http://creativecommons.org/licenses/by/3.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Bert-Based Text Keyword Extraction

Jump to:

Abstract

Details

Suggested sources