Abstract

A major challenge in biomedical text retrieval is identifying the multiple name variants of biomedical entities such as genes and proteins. A biomedical entity can generally be referred to by its full name, an abbreviation, or an alias. Traditional keyword-based text retrieval therefore does not usually perform well in the biomedical domain. We propose a novel concept-based text retrieval method that uses concepts instead of keywords to construct the query. In particular, a novel query expansion algorithm is developed to convert a single name into a concept that contains multiple name variants. In addition, we propose a new method that extracts more related terms from relevance feedback by merging multiple term ranking lists. Extensive experiments are conducted on the 2004 and 2005 TREC Genomics datasets to evaluate performance. We identify the factors that affect retrieval performance and build a new framework for biomedical text retrieval.
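
The two ideas named in the abstract, expanding a single name into a concept and merging several term ranking lists, can be illustrated with a minimal Python sketch. This is not the dissertation's algorithm: the variant table, the function names, the Boolean query form (OR within a concept, AND across concepts), and the Borda-count merging rule are all assumptions chosen for illustration.

```python
from collections import defaultdict

# Hypothetical variant table; a real system would draw on a curated lexicon.
VARIANTS = {
    "tumor protein p53": ["TP53", "p53"],
}


def expand_to_concept(name, variants=VARIANTS):
    """Return the full variant group containing `name`, or [name] if unknown."""
    key = name.lower()
    for canonical, aliases in variants.items():
        group = [canonical] + aliases
        if key in (g.lower() for g in group):
            return group
    return [name]


def concept_query(names):
    """Build a Boolean query: OR within a concept, AND across concepts."""
    clauses = []
    for name in names:
        terms = " OR ".join(f'"{t}"' for t in expand_to_concept(name))
        clauses.append(f"({terms})")
    return " AND ".join(clauses)


def merge_rankings(rankings):
    """Merge ranked term lists with a Borda count (an assumed merging rule)."""
    scores = defaultdict(int)
    for ranking in rankings:
        n = len(ranking)
        for position, term in enumerate(ranking):
            # The top term of a list of length n earns n points, the last earns 1.
            scores[term] += n - position
    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(scores, key=lambda t: (-scores[t], t))
```

Under these assumptions, `concept_query(["p53", "apoptosis"])` produces a query in which any of the three p53 variants can match, while `merge_rankings` combines feedback term lists into a single ordering from which expansion terms could be taken.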

Details

Title: Concept-based biomedical text retrieval
Author: Zhong, Ming
Year: 2007
Publisher: ProQuest Dissertations & Theses
ISBN: 978-0-494-29634-9
Source type: Dissertation or Thesis
Language of publication: English
ProQuest document ID: 304776229
Copyright: Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.