Content area

Abstract

Being able to correctly model semantic relatedness between texts, and consequently the concepts represented by these texts, has become an important part of many intelligent information retrieval and knowledge processing systems. The need for such systems is especially evident within the biomedical domain, where the sheer amount of scientific publishing contributes to an information overflow. In this paper we present a novel method to approximate semantic relatedness in domain-focused settings. The approach is an extension to a well-known ESA (Explicit Semantic Analysis) method. Our extension successfully leverages the semantics of a domain-specific document corpus. We present the evaluation of the proposed method on a set of reference datasets, that are a de facto reference standard for the task of approximating biomedical semantic relatedness. The proposed method is evaluated in comparison with other state-of-the-art methods, as well as the baselines established with the original ESA method. The results of the experiments suggest that the proposed method combines the semantics of a general and domain-specific corpora to provide significant improvements over the original method.

Details

Title
DomESA: a novel approach for extending domain-oriented lexical relatedness calculations with domain-specific semantics
Author
Rybiński, Maciej 1 ; Aldana Montes, José Francisco 1 

 Departamento de Lenguajes y Ciencias de la Computación, Universidad de Málaga, Malaga, Spain 
Pages
315-331
Publication year
2017
Publication date
Dec 2017
Publisher
Springer Nature B.V.
ISSN
09259902
e-ISSN
15737675
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
1960334516
Copyright
Journal of Intelligent Information Systems is a copyright of Springer, 2017.