Content area

Abstract

Big data analysis is the process of gathering, managing and analyzing a large volume of data to determine patterns and other valuable information. Agricultural data can be a significant area of big data applications. The big data analysis for agricultural data can comprise the various data from both internal systems and outside sources like weather data, soil data, and crop data. Though big data analysis has led to advances in different industries, it has not yet been extensively used in agriculture. Several machine learning techniques are developed to cluster the data for the prediction of crop yield. However, it has low accuracy and low quality of the clustering. To improve clustering accuracy with less complexity, a Proximity Likelihood Maximization Data Clustering (PLMDC) technique is developed for both sparse and densely distributed agricultural big data to enhance the accuracy of crop yield prediction for farmers. In this process, unnecessary data is cleansed from the sparse and dense based agricultural data using a logical linear regression model. After that, the presented clustering method is executed depending on the similarity and weight-based Manhattan distance. The genetic algorithm (GA) is applied with a good fitness function to select the features from the clustered data. Finally, the decision support system is computed by the A-FP growth algorithm to predict the crop yields according to their selected features such as weather features and crop features. The results of the proposed PLMDC technique are better in case of clustering accuracy of both spare and densely distributed data with minimum time and space complexity. Based on the results observations, the PLMDC technique is more efficient than the existing methods.

Details

Company / organization
Title
Improved data clustering methods and integrated A-FP algorithm for crop yield prediction
Author
Vani, P. Suvitha 1 ; Rathi, S. 2 

 Sri Shakthi Institute of Engineering & Technology, Department of Computer Science and Engineering, Coimbatore, India (GRID:grid.252262.3) (ISNI:0000 0001 0613 6919) 
 Government College of Technology, Department of Computer Science and Engineering, Coimbatore, India (GRID:grid.252262.3) (ISNI:0000 0001 0613 6919) 
Publication title
Volume
41
Issue
1
Pages
117-131
Publication year
2023
Publication date
Jun 2023
Publisher
Springer Nature B.V.
Place of publication
New York
Country of publication
Netherlands
ISSN
09268782
e-ISSN
15737578
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2021-07-15
Milestone dates
2021-06-29 (Registration); 2021-06-29 (Accepted)
Publication history
 
 
   First posting date
15 Jul 2021
ProQuest document ID
3255421025
Document URL
https://www.proquest.com/scholarly-journals/improved-data-clustering-methods-integrated-fp/docview/3255421025/se-2?accountid=208611
Copyright
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2021.
Last updated
2025-09-29
Database
ProQuest One Academic