Content area

Abstract

Data-driven forecasting is becoming increasingly central to modern energy management, yet nonspecialists without a background in artificial intelligence (AI) face significant barriers to entry. While Python is the dominant machine learning language, R remains a practical and accessible tool for users with expertise in statistics, engineering, or domain-specific analysis. To inform tool selection, we first provide an evidence-based comparison of R with major alternatives before reviewing 49 peer-reviewed articles published between 2020 and 2025 in Science Citation Index Expanded (SCIE)-level journals that utilized R for energy forecasting tasks, including electricity (regional and site-level), solar, wind, thermal energy, and natural gas. Despite such growth, the field still lacks a systematic, cross-domain synthesis that clarifies which R-based methods prevail, how accessible workflows are implemented, and where methodological gaps remain; this motivated our use of text mining. Text mining techniques were employed to categorize the literature according to forecasting objectives, modeling methods, application domains, and tool usage patterns. The results indicate that tree-based ensemble learning models—e.g., random forests, gradient boosting, and hybrid variants—are employed most frequently, particularly for solar and short-term load forecasting. Notably, few studies incorporated automated model selection or explainable AI; however, there is a growing shift toward interpretable and beginner-friendly workflows. This review offers a practical reference for nonexperts seeking to apply R in energy forecasting contexts, emphasizing accessible modeling strategies and reproducible practices. We also curate example R scripts, workflow templates, and a study-level link catalog to support replication. The findings of this review support the broader democratization of energy analytics by identifying trends and methodologies suitable for users without advanced AI training. Finally, we synthesize domain-specific evidence and outline the text-mining pipeline, present visual keyword profiles and comparative performance tables that surface prevailing strategies and unmet needs, and conclude with practical guidance and targeted directions for future research.

Details

1009240
Business indexing term
Title
Beginner-Friendly Review of Research on R-Based Energy Forecasting: Insights from Text Mining
Author
Kim Minjoong 1 ; Kim Hyeonwoo 2 ; Moon Jihoon 3   VIAFID ORCID Logo 

 Department of ICT Convergence, Soonchunhyang University, Asan 31538, Republic of Korea; [email protected] 
 Department of Computer Science and Engineering, Soonchunhyang University, Asan 31538, Republic of Korea 
 Department of Data Science, Duksung Women’s University, Seoul 01369, Republic of Korea 
Publication title
Volume
14
Issue
17
First page
3513
Number of pages
60
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
20799292
Source type
Scholarly Journal
Language of publication
English
Document type
Literature Review
Publication history
 
 
Online publication date
2025-09-02
Milestone dates
2025-07-14 (Received); 2025-08-26 (Accepted)
Publication history
 
 
   First posting date
02 Sep 2025
ProQuest document ID
3249684852
Document URL
https://www.proquest.com/scholarly-journals/beginner-friendly-review-research-on-r-based/docview/3249684852/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-09-15
Database
ProQuest One Academic