Content area

Abstract

This study aims to analyze online job postings using machine learning-based, semantic approaches and to identify the expertise roles and competencies required for big data professions. The methodology of this study employs latent Dirichlet allocation (LDA), a probabilistic topic modeling technique, to reveal hidden semantic structures within a corpus of big data job postings. As a result of our analysis, we have identified seven expertise roles, six proficiency areas, and 32 competencies (knowledge, skills, and abilities) necessary for big data professions. These positions include “developer”, “engineer”, “architect”, “analyst”, “manager”, “administrator”, and “consultant”. The six essential proficiency areas for big data are “big data knowledge”, “developer skills”, “big data analytics”, “cloud services”, “soft skills”, and “technical background”. Furthermore, the top five skills emerged as “big data processing”, “big data tools”, “communication skills”, “remote development”, and “big data architecture”. The findings of our study indicated that the competencies required for big data careers cover a broad spectrum, including technical, analytical, developer, and soft skills. Our findings provide a competency map for big data professions, detailing the roles and skills required. It is anticipated that the findings will assist big data professionals in assessing and enhancing their competencies, businesses in meeting their big data labor force needs, and academies in customizing their big data training programs to meet industry requirements.

Details

1009240
Title
Future-Ready Skills Across Big Data Ecosystems: Insights from Machine Learning-Driven Human Resource Analytics
Author
Gurcan Fatih 1   VIAFID ORCID Logo  ; Gudek Beyza 1   VIAFID ORCID Logo  ; Menekse Dalveren Gonca Gokce 2   VIAFID ORCID Logo  ; Derawi Mohammad 3 

 Department of Management Information Systems, Faculty of Economics and Administrative Sciences, Karadeniz Technical University, Trabzon 61080, Turkey 
 Department of Computer Engineering, Izmir Bakircay University, Izmir 35665, Turkey 
 Department of Electronic Systems, Faculty of Information Technology and Electrical Engineering, Norwegian University of Science and Technology, 7034 Gjøvik, Norway 
Publication title
Volume
15
Issue
11
First page
5841
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
20763417
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-05-22
Milestone dates
2025-04-19 (Received); 2025-05-21 (Accepted)
Publication history
 
 
   First posting date
22 May 2025
ProQuest document ID
3217720926
Document URL
https://www.proquest.com/scholarly-journals/future-ready-skills-across-big-data-ecosystems/docview/3217720926/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-07-23
Database
ProQuest One Academic