Full text

Turn on search term navigation

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Text mining (TM) is a semi-automatized, multi-step process, able to turn unstructured into structured data. TM relevance has increased upon machine learning (ML) and deep learning (DL) algorithms’ application in its various steps. When applied to biomedical literature, text mining is named biomedical text mining and its specificity lies in both the type of analyzed documents and the language and concepts retrieved. The array of documents that can be used ranges from scientific literature to patents or clinical data, and the biomedical concepts often include, despite not being limited to genes, proteins, drugs, and diseases. This review aims to gather the leading tools for biomedical TM, summarily describing and systematizing them. We also surveyed several resources to compile the most valuable ones for each category.

Details

Title
The Treasury Chest of Text Mining: Piling Available Resources for Powerful Biomedical Text Mining
Author
Rosário-Ferreira, Nícia 1   VIAFID ORCID Logo  ; Marques-Pereira, Catarina 2   VIAFID ORCID Logo  ; Pires, Manuel 3   VIAFID ORCID Logo  ; Ramalhão, Daniel 3   VIAFID ORCID Logo  ; Pereira, Nádia 4   VIAFID ORCID Logo  ; Guimarães, Victor 5   VIAFID ORCID Logo  ; Vítor Santos Costa 5   VIAFID ORCID Logo  ; Irina Sousa Moreira 6   VIAFID ORCID Logo 

 CQC-Coimbra Chemistry Center, Chemistry Department, Faculty of Science and Technology, University of Coimbra, 3004-535 Coimbra, Portugal; CIBB, University of Coimbra, 3000-456 Coimbra, Portugal ; [email protected] (C.M.-P.); [email protected] (M.P.); [email protected] (D.R.); [email protected] (N.P.) 
 CIBB, University of Coimbra, 3000-456 Coimbra, Portugal ; [email protected] (C.M.-P.); [email protected] (M.P.); [email protected] (D.R.); [email protected] (N.P.); IIIs-Institute for Interdisciplinary Research, University of Coimbra, 3000-456 Coimbra, Portugal 
 CIBB, University of Coimbra, 3000-456 Coimbra, Portugal ; [email protected] (C.M.-P.); [email protected] (M.P.); [email protected] (D.R.); [email protected] (N.P.); Department of Sciences, University of Porto, 4169-007 Porto, Portugal; [email protected] 
 CIBB, University of Coimbra, 3000-456 Coimbra, Portugal ; [email protected] (C.M.-P.); [email protected] (M.P.); [email protected] (D.R.); [email protected] (N.P.) 
 Department of Sciences, University of Porto, 4169-007 Porto, Portugal; [email protected]; INESC-TEC-Centre of Advanced Computing Systems, 4169-007 Porto, Portugal 
 Department of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal; CNC-Center for Neuroscience and Cell Biology, CIBB-Center for Innovative Biomedicine and Biotechnology, University of Coimbra, 3004-535 Coimbra, Portugal 
First page
60
Publication year
2021
Publication date
2021
Publisher
MDPI AG
ISSN
26736411
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2656345469
Copyright
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.