Content area

Abstract

Amidst the proliferation of social media and online platforms, sentiment analysis stands out as a pivotal tool in Natural Language Processing (NLP), facilitating the categorization of public opinions. The overarching goal of this study is to apply sentiment analysis techniques to assess public perceptions of water supply quality and provide decision-support maps for infrastructure planning. The primary research gap addressed in this study concerns the efficacious integration of spatial statistics methods with sentiment analysis for the purpose of generating zoning maps. This integration, offers a novel approach for understanding public perceptions and sentiments within specific geographical contexts. Sub-objectives of the study include aspects such as the development of a robust meta ensemble learning framework, the utilization of crowdsourced geographic information for sentiment analysis, and the evaluation of text mining techniques specific to water supply concerns. Our approach utilizes comments from subscribers of the Water Organization portal. The meta ensemble learning framework comprises six different combinations, including boosting, bagging, and voting solutions, drawing from various base estimators such as K-Nearest Neighbors (KNN), Decision Tree (DT), Support Vector Machine (SVM), Random Forest (RF), and Artificial Neural Network (ANN), alongside boosting techniques like AdaBoost and XGBoost. Results indicate that aggregating vectors from text feature extraction techniques such as Bag of Words (BoW), N-gram, and TF-IDF yielded optimal pattern recognition. AdaBoost emerged as the most effective model, as determined by metrics like Accuracy, F1-score, and AUC. Unreviewed subscriber comments were fed into the final model to predict unfavorable remarks, subsequently visualized on georeferenced maps. Geostatistical methods within Geographic Information Systems (GIS) were employed, including spatial kernel density, spatial join, natural breaks classification, and hotspot analysis using Getis-Ord Gi* statistics. The approach produced maps illustrating areas with a high density of negative remarks, identifying problematic urban blocks and continuous hotspot areas. Overall, our method demonstrates promising efficiency in assessing water supply situations and informing development planning.

Details

Title
Meta ensemble learning in geospatial sentiment analysis and community survey mapping: a water supply case study
Author
Vahidnia, Mohammad H. 1   VIAFID ORCID Logo 

 Shahid Beheshti University, Center for Remote Sensing and GIS Research, Faculty of Earth Sciences, Tehran, Iran (GRID:grid.412502.0) (ISNI:0000 0001 0686 4748) 
Publication title
Volume
17
Issue
4
Pages
3233-3252
Publication year
2024
Publication date
Aug 2024
Publisher
Springer Nature B.V.
Place of publication
Dordrecht
Country of publication
Netherlands
Publication subject
ISSN
18650473
e-ISSN
18650481
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2024-06-03
Milestone dates
2024-05-26 (Registration); 2024-01-31 (Received); 2024-05-26 (Accepted)
Publication history
 
 
   First posting date
03 Jun 2024
ProQuest document ID
3106868878
Document URL
https://www.proquest.com/scholarly-journals/meta-ensemble-learning-geospatial-sentiment/docview/3106868878/se-2?accountid=208611
Copyright
© The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
Last updated
2024-10-03
Database
ProQuest One Academic