Content area

Abstract

The Part-Of-Speech tagging is widely used in the natural language process. There are many statistical approaches in this area. The most popular one is Hidden Markov Model. In this paper, an alternative approach, linear-chain Conditional Random Fields, is introduced. The Conditional Random Fields is a factor graph approach that can naturally incorporate arbitrary, non-independent features of the input without conditional independence among the features or distributional assumptions of inputs. This paper applied the Conditional Random Fields for the car review word Part-Of-Speech tagging and then the feature extraction, which can be used as an input to an opinion mining system. To reduce the computational time, we also proposed applying the Limited-memory BFGS algorithm to train the Conditional Random Fields. Furthermore, this paper evaluated the Conditional Random Fields and the classical graph approach using the car review dataset to demonstrate that the Conditional Random Fields have a more robust result with a smaller training dataset.

Details

10000008
Title
A conditional random field framework for language process in product review mining
Author
Ming, Yue 1 ; Liu, Xiyuan 2   VIAFID ORCID Logo  ; Shen, Gang 3 ; Gao, Di 4 ; Wang, Yu 5 

 Syngenta Seeds, LLC, Basel, Switzerland 
 Louisiana Tech University, Department of Mathematics and Statistics, Ruston, USA (GRID:grid.259237.8) (ISNI:0000000121506076) 
 North Dakota State University, Department of Statistics, Fargo, USA (GRID:grid.261055.5) (ISNI:0000 0001 2293 4611) 
 Sam Houston State University, Department of Mathematics and Statistics, Huntsville, USA (GRID:grid.263046.5) (ISNI:0000 0001 2291 1903) 
 Texas A&M University, Department of Statistics, College Station, USA (GRID:grid.264756.4) (ISNI:0000 0004 4687 2082) 
Publication title
Volume
82
Issue
1
Pages
803-817
Publication year
2023
Publication date
Jan 2023
Publisher
Springer Nature B.V.
Place of publication
Dordrecht
Country of publication
Netherlands
ISSN
13807501
e-ISSN
15737721
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2022-06-10
Milestone dates
2022-05-30 (Registration); 2021-08-01 (Received); 2022-05-30 (Accepted); 2022-01-25 (Rev-Recd)
Publication history
 
 
   First posting date
10 Jun 2022
ProQuest document ID
2758755435
Document URL
https://www.proquest.com/scholarly-journals/conditional-random-field-framework-language/docview/2758755435/se-2?accountid=208611
Copyright
© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2023-11-30
Database
ProQuest One Academic