Algorithm selection using edge ML and case-based

Abstract

In practical data mining, a wide range of classification algorithms is employed for prediction tasks. However, selecting the best algorithm poses a challenging task for machine learning practitioners and experts, primarily due to the inherent variability in the characteristics of classification problems, referred to as datasets, and the unpredictable performance of these algorithms. Dataset characteristics are quantified in terms of meta-features, while classifier performance is evaluated using various performance metrics. The assessment of classifiers through empirical methods across multiple classification datasets, while considering multiple performance metrics, presents a computationally expensive and time-consuming obstacle in the pursuit of selecting the optimal algorithm. Furthermore, the scarcity of sufficient training data, denoted by dimensions representing the number of datasets and the feature space described by meta-feature perspectives, adds further complexity to the process of algorithm selection using classical machine learning methods. This research paper presents an integrated framework called eML-CBR that combines edge edge-ML and case-based reasoning methodologies to accurately address the algorithm selection problem. It adapts a multi-level, multi-view case-based reasoning methodology, considering data from diverse feature dimensions and the algorithms from multiple performance aspects, that distributes computations to both cloud edges and centralized nodes. On the edge, the first-level reasoning employs machine learning methods to recommend a family of classification algorithms, while at the second level, it recommends a list of the top-k algorithms within that family. This list is further refined by an algorithm conflict resolver module. The eML-CBR framework offers a suite of contributions, including integrated algorithm selection, multi-view meta-feature extraction, innovative performance criteria, improved algorithm recommendation, data scarcity mitigation through incremental learning, and an open-source CBR module, reshaping research paradigms. The CBR module, trained on 100 datasets and tested with 52 datasets using 9 decision tree algorithms, achieved an accuracy of 94% for correct classifier recommendations within the top k=3 algorithms, making it highly suitable for practical classification applications.

Details

Title

Algorithm selection using edge ML and case-based reasoning

Author

Ali, Rahman¹; Zada, Muhammad Sadiq Hassan²; Khatak, Asad Masood³; Hussain, Jamil⁴

¹ University of Peshawar, Quaid-e-Azam College of Commerce, Peshawar, Pakistan (GRID:grid.266976.a) (ISNI:0000 0001 1882 0101)
² University of Derby, Derby, United Kingdom (GRID:grid.57686.3a) (ISNI:0000 0001 2232 4004)
³ Zayed University, College of Technological Innovation, Abu Dhabi, UAE (GRID:grid.444464.2) (ISNI:0000 0001 0650 0848)
⁴ Sejong University, Department of Data Science, Seoul, South Korea (GRID:grid.263333.4) (ISNI:0000 0001 0727 6358)

Pages

162

Publication year

2023

Publication date

Dec 2023

Publisher

Springer Nature B.V.

e-ISSN

2192113X

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1186/s13677-023-00542-3

ProQuest document ID

2892160067

© The Author(s) 2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Algorithm selection using edge ML and case-based reasoning

Jump to:

Abstract

Details

Full text options

Suggested sources