Plný text

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstrakt

The BioBERT Named Entity Recognition (NER) model is a high-performance model designed to identify both known and unknown entities. It surpasses previous NER models utilized by text-mining tools, such as tmTool and ezTag, in effectively discovering novel entities. In previous studies, the Biomedical Entity Recognition and Multi-Type Normalization Tool (BERN) employed this model to identify words that represent specific names, discern the type of the word, and implement it on a web page to offer NER service. However, we aimed to offer a web service that includes Relation Extraction (RE), a task determining the relation between entity pairs within a sentence. First, just like BERN, we fine-tuned the BioBERT NER model within the biomedical domain to recognize new entities. We identified two categories: diseases and genes/proteins. Additionally, we fine-tuned the BioBERT RE model to determine the presence or absence of a relation between the identified gene–disease entity pairs. The NER and RE results are displayed on a web page using the Django web framework. NER results are presented in distinct colors, and RE results are visualized as graphs in NetworkX and Cytoscape, allowing users to interact with the graphs.

Detaily

Název
Web Interface of NER and RE with BERT for Biomedical Text Mining
Autor
Yeon-Ji Park 1 ; Min-a, Lee 1   VIAFID ORCID Logo  ; Geun-Je Yang 1 ; Park, Soo Jun 2 ; Chae-Bong Sohn 1   VIAFID ORCID Logo 

 Department of Electronics and Communications Engineering, Kwangwoon University, Seoul 01897, Republic of Korea; [email protected] (Y.-J.P.); 
 Digital Biomedical Research Division, Electronics and Telecommunications Research Institute, Daejeon 34129, Republic of Korea 
První strana
5163
Rok vydání
2023
Datum vydání
2023
Vydavatel
MDPI AG
e-ISSN
20763417
Typ zdroje
Vědecký časopis
Jazyk publikace
English
ID dokumentu ProQuest
2806475128
Copyright
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.