Full Text

Turn on search term navigation

Introduction

In Brazil, the hospital bed regulation process of the Brazilian Health System (SUS) plays a fundamental role in the management and distribution of care for patients requiring hospitalization [1, 2]. However, although the National Regulation Policy was instituted more than 15 years ago by Ordinance No. 1,559 of August 2008 [3] and consolidated by Ordinance No. 02 of September 2017 (Brazil, 2017) [4], many regions have difficulties in ensuring correct regulatory conduct.

In this context, in addition to organizational issues, the regulation system in Brazil still faces recurring problems such as the precariousness of hospital infrastructure, overcrowding in health units, an insufficient number of beds, difficulties in integration and communication between the entities involved in the regulatory process, greater transparency in processes and allocation of resources, in addition to not having efficient systems to help the regulation process [1, 5]. In Brazil, due to a non-mandatory recommendation from the Ministry of Health (MoH), the Regulation System—SISREG—is still used in many Brazilian states. This system was created in 2001 and is made available by the Brazilian Health System Informatics Department (DATASUS) [6]. Currently, this system is considered obsolete and inadequate, especially due to the lack of interoperability with the SUS technological ecosystem itself and the lack of transparency [7]. This is a legacy health information system which, although it is still used, is no longer able to play an effective role in the National Policy for the Regulation of Assistance in Access to Health Services in Brazil.

Until April 2020, the center for regulating access to health services in the state of Rio Grande do Norte did not have a platform for regulating hospital beds to systematically organize regulatory conduct within the scope of the SUS in the state. The regulatory flow control measures used were based on spreadsheets, e-mails and telephone communication, and messaging systems [8, 9].

Faced with the serious public health crisis caused by the COVID-19 pandemic, the government of the state of Rio Grande do Norte has set up technical-scientific cooperation between researchers in the field of digital health and the managers and formulators of public health policies at the State Secretariat of Public Health of RN (SESAP/RN). The aim of this technical-scientific cooperation was to formulate and implement a digital health solution that would make it possible to control and monitor the entire process of regulating hospital beds in all the state’s public hospitals online, on time and transparently, totaling 24 public hospitals, with more than 900 beds available.

Based on this technical-scientific cooperation, the RegulaRN Platform for COVID-19 was developed and implemented throughout the state of Rio Grande do Norte, whose initial objective was to monitor and control access to hospital beds in wards and intensive care units (ICUs) for the disease during the pandemic [2, 8, 9]. The state of Rio Grande Norte, which is located in the northeastern region of Brazil, currently has an area of 52,797 km² and a population of approximately 3.5 million inhabitants.

After the implementation of the RegulaRN Platform, it became necessary to expand the digital health solution to the other regulatory specialties. The system is currently responsible for regulating access to beds, vascular surgery, outpatient care, exams, and consultations. In this way, the RegulaRN Platform has become a unique digital health solution for the management of health regulation services in the state of Rio Grande do Norte, an important aspect because it has centralized and integrated, through international interoperability standards, the Health Data Network (RDS) with all the other technologies in the state’s public health ecosystem that are necessary for the process of regulating access to health services.

The health regulation process needs to be carried out in a rigorous, agile and transparent manner, as the incorrect conduct of a regulatory process in public health has intrinsic impacts on waiting times for access to hospital beds, as well as on hospitalization times, which can have negative impacts on the availability of hospital beds and increase the potential for existing problems [5, 10]. In this way, the inefficiency and ineffectiveness of this process can aggravate public health crisis situations, such as the COVID-19 pandemic, as it requires more rational use of health resources [8, 11–16]. Therefore, due to its complexity and the pressures that exist in all segments of the regulatory process, investment in intelligent computer systems can maximize the correct direction and assertive decision-making in healthcare systems [17–22].

Intelligent computer models have demonstrated significant potential in healthcare systems by reducing uncertainties and ambiguities in complex decision-making processes. For example, prior studies in similar healthcare contexts have shown that machine learning models can enhance hospital management by optimizing resource allocation and reducing patient waiting times [23–30]. This study aims to build on these findings to demonstrate the effectiveness of AI-based models specifically in bed regulation in Rio Grande do Norte.

In this context, the aim of this work is to analyze data from the RegulaRN Leitos Gerais Platform and use it to train and validate different machine learning models. Subsequently, to choose the most significant classification model capable of predicting the outcome of patients regulated by the RegulaRN Leitos Gerais platform with greater accuracy, precision, recall, specificity, F1 Score, and ROC-AUC. Furthermore, discuss the main impacts and potential of a digital health solution on the decision-making process of regulatory professionals.

Materials and methods

The methodological bias of this paper consists in two main steps: exploratory data analysis and applying the data to computer models. In the evaluation process, the data was extracted, evaluated, characterized, pre-processed, and correlated. For the application stage, concerning the computational models, four phases were taken into account: 1) definition of evaluation metrics; 2) data balancing and division into training and validation groups; 3) selecting the models for classification, and 4) hyperparameters to choose the best performing model; in line with Barreto et al [2].

Extraction, evaluation, characterization and pre-processing

This study used the database from the RegulaRN Leitos Gerais platform, a system adopted to manage the regulation flow of SUS beds in the state of Rio Grande do Norte. The database covers the period of October 2021 to January 2024, with 47.056 regulations in the two-state centers (Metropolitan and West). From this total, 1,868 regulations were removed because they were linked to newborn regulations, and these have different clinical assessment protocols when compared to adult and pediatric patients. The initial analysis therefore included 45,188 regulation requests. A more detailed descriptive analysis of the data is presented in the results section.

The initial data extraction included 24 features: a) date of request; b) occupancy type; c) case type; d) unified prioritization score (EUP); e) Sequential Organ Failure Assessment (SOFA) scale; f) type of hospital bed requested; g) admission date; h) type of input bed; i) discharge date; j) discharge bed type; k) national health card number; l) gender; m) patient’s municipality; n) patient’s federative unit; o) pregnant woman (yes or no); p) gestational period; q) age; r) regulator identification; s) outcome; t) requesting health unit; u) municipality of the requesting health unit; v) providing health unit; w) municipality of the providing health unit and x) ICD.

Thus, the features that were not associated with the patient’s clinical condition and do not show any impact in the final result, or that relate to the locality record, such as: date of request, national health card number, patient’s federative unit, patient’s municipality, regulator identification, requesting health unit, municipality of the requesting health unit, and municipality of the providing health unit. In addition, features with only one possible record or insufficient information were also removed: type of occupation, type of case, pregnant woman (yes or no), and gestational age.

Consequently, only 12 characteristics were selected, namely: EUP score, SOFA scale, type of hospital bed requested, admission date, admission bed type, discharge date, discharge bed type, gender, age, outcome, providing health unit, and ICD. Using the entry date and exit date features, it was possible to create the patient’s hospitalization time feature. As a result, 11 features were used in the classification process. Table 1 shows the description of all the data types extracted from RegulaRN Leitos Gerais.

[Figure omitted. See PDF.]

After extracting the data, we evaluated the values contained in all the features and in order to guarantee the integrity of the analysis, the lines with blank data or inconclusive information were removed. In addition, the target column “outcome” contained six different values, namely: by discharge, by death, for other reasons, by stay, by delivery procedure, by transfer, etc. As these last four outcomes do not properly indicate a positive or negative closure of the regulation, as well as having a lower number of recurrences, around 7.151 regulations were removed. This maintains a binary classification (by discharge—positive, or by death—negative) for the computer models. Finally, 38.023 effective regulations were selected for application in the artificial intelligence models. Fig 1 shows the design used to process and select the data. Furthermore, in order to enable the reproducibility of this experiment, the final database used is available on the zenodo platform (https://zenodo.org/records/11387710).

[Figure omitted. See PDF.]

Correlation between dataset features

The first task was to perform a pairwise correlation of the features. The objective is to identify features with greater or lesser correspondence with others. As many of these are categorical data, the phik correlation model was implemented in this analysis. Phik is abble to consistently correlate variables from several backgrounds, being categorical, ordinals and intervals a like, turning into a refinement of Pearson [31] hypothesis test.

Definition of evaluation metrics

The overall aim of the study is to classify hospital bed regulation data to predict a patient’s positive or negative outcome. Furthermore, it is important to investigate the models’ performance in situations where predictions are wrong, either due to a high number of false positives or false negatives. Thus, it is necessary to include not only accuracy, but also precision, recall, specificity, F1-Score, and ROC-AUC in a similar way to those found in the works of Iwendi et al [32], Aljameel et al [33] and Endo et al [34].

The accuracy consists in the set of data with correct predictions (true positive and true negative) divided by the sum of all predictions made by the model (true positive, true negative, false positive, false negative) (Eq 1):(1)

Precision consists of dividing the true positive rate by the sum of the true positive and false positive rates (Eq 2).(2)

Recall involves the rate of true positives divided by the rate of true positives plus false negatives (Eq 3).(3)

Specificity refers to the prediction of true negatives divided by the sum of true negatives and false positives (Eq 4).(4)

The F1-score is the harmonic mean between the precision and recall. The formula involves the product of precision and recall divided by the sum of these metrics, multiplied by 2 (Eq 5).(5)

ROC-AUC can be obtained by recall divided by the complementary value of specificity (Eq 6).(6)

Data balancing and splitting into training and validation data

The RegulaRN Leitos Gerais Platform database refers to real-world bed regulation data, in this sense, there is an unbalanced distribution of data when classified by outcome, 82.6% are discharges and 17.4% deaths. The use of an unbalanced database biases the machine learning classifiers, making the algorithms able to identify patterns from the predominant class much better than patterns from the minority class. To mitigate this problem, one of the most common techniques is SMOTE (Synthetic Minority Over-sampling), which works by increasing the number of data points in the minority class [35]. The SMOTE algorithm first identifies the minority class, then in the feature vector space identifies the k nearest neighbors of that class (k is usually equal to 5). Finally, a new instance of the minority class is generated by randomly selecting values in the vector space between an instance of the minority class and the nearest neighbors identified. This process is repeated until the database is completely balanced.

In addition, as for the division of training and validation data, the same segmentation was used as in other studies applying machine learning techniques that use a large volume of data [36–38]. Therefore, 80% of the data was directed to training and the others 20% for validation.

Definition of models for data classification

The selection of classification models was based on their proven ability to handle large volumes of imbalanced healthcare data [37, 39–41]. Decision tree was selected because it is one of the classic models that handles high volumes of data well and has wide application in problems in the health area [42]. Random Forest, on the other hand, was selected for its ability to manage complex decision trees and its resistance to overfitting, particularly in high-dimensional datasets [43]. Gradient Boosting and Adaboost due to their adaptability and efficient ability to capture non-linear relationships [44, 45]. XGBoost, for example, has been shown to perform well in healthcare settings due to its gradient boosting framework, which effectively handles missing data and provides robust performance on tabular datasets [46] and Multi-Layer Perceptron (MLP) has an architecture capable of modeling non-linear relationships and performing gradient learning, adjusting weights efficiently for larger volumes of data [47]. For the MLP models, two different paths were taken, the Stochastic Gradient Descent (SGD) was selected due to its performance and Adam because of its consistency in treating gradient explosion and fading problems [48]. These models were chosen for their complementarity in addressing the specific challenges of bed regulation data in this study.

Hyperparameters to define the best model

After the model selection, it was necessary to define the best combination of hyperparameters to enhance the evaluation metrics of each model. Thus, this section presents which hyperparameters were adopted and which methods were elaborated in the training and validation steps. It is worth mentioning that all computational model development in this research used python’s sckit-learn library [49].

For each selected model, hyperparameters were set aiming to boost the performance metrics. In this regard, the following hyperparameters were selected for Decision Tree: criterion, which measures the quality of node splitting; max depth of tree, which determines the maximum depth of the tree; min samples leaf, which represents the minimum number of samples needed in a leaf; and max features, which considers the maximum number of features analyzed to perform a split. For the Random Forest and Gradient Boosting models, the criterion, max depth of the tree and max features were also used, including the number estimators, which considers the number of trees in the forest. In the Adaboost model, the parameters number estimators, learning rate and algorithm were chosen. The learning rate refers to the learning weight at each iteration, while the algorithm relates to how the model can speed up the convergence of the classifier with the least possible error. For XGBoost: learning rate, number estimators, max depth and colsample by tree. This last hyperparameter is associated with the randomly selected fraction of resources that will be used to train each tree. Finally, for the MLP Adam and MLP SGD models, the following were used: hidden layer size, which represents the number of layers in the model; activation, which represents the model’s activation function and batch size, which represents the size of the minibatches that will be used to help the optimizers.

The grid GridSearchCV functionality, which allows all possible combinations of hyperparameters to be iterated, was applied during the training to find which parameters showed better results in the evaluated metrics [50, 51]. A proportional division of the training and test data was also carried out randomly using the cross validation attribute with a value of 10-folds in the GridSearchCV functionality, as a way of enhancing the model’s learning. In addition, the models were trained five times, similar to that developed by Ahsan et al [52], in order to determine the best set of hyperparameters more accurately. The details of the hyperparameters used and the respective values chosen for each model are shown in Table 2.

[Figure omitted. See PDF.]

Results

General data analysis

Considering the data profile from RegulaRN Leitos Gerais, between October 2021 and January 2024, it was possible to identify that most hospitalizations involve male adults, young people and children, in hospital beds and with lower EUP score and SOFA scale. The details of the values extracted from the database are presented in Table 3, classifying each of the characteristics based on their respective outcome.

[Figure omitted. See PDF.]

In addition, the database contains outcomes by each provider hospital (41), to address which health units had the highest number of requests and their respective outcomes, given that his feature showed a high correlation with several other dataset variables. Each hospital has a different treatment specialty, and thus some receive requests of greater complexity and mortality than others, culminating in different proportions of discharges and deaths. Finally, the data includes around 2055 different diseases classified by the International Classification of Diseases (ICD-10), which were also examined for recurrence.

As for the statistical profile, the average age is 53.38 years, with a standard deviation of 26.82 years and a median of 59 years. The average hospitalization time was 12.96 days, with a standard deviation of 17.67 days and a median of 7 days. The mean EUP score was 3.15 and the median was 3. The mean SOFA scale was 1.2 and the median 1.

Regarding the ICD, Table 4 shows the ten most recurrent ICDs, followed by the municipalities with the highest incidence and the hospitals that treat the most. The state capital, the city of Natal, has the highest number of inhabitants and has the highest incidence of ICDs 6 and 10. In contrast, Mossoró, the second largest municipality in terms of inhabitants, has a higher incidence in four of the ten. The noteworthy point is that there is no significant number of requests for these diseases among Parnamirim, São Gonçalo do Amarante, and Macaíba municipalities, which are the 3rd, 4th and 5th most populous municipalities.

[Figure omitted. See PDF.]

SOFA and EUP are two tools used to evaluate the hospital’s bed priority for each patient, considering that EUP revolves around SOFA, The Charlson Comorbidity Index (CCI) and the Clinical Frailty Scale (CFS). According to the data, it is possible to identify that EUP, has a more normalized aggregation to the outcomes classification. Meanwhile, SOFA = 1, was responsible for characterizing 78% of the data, a similar percentage is presented in the sum of requests with EUP 2 (46.2%), 3 (19.1%), and 4 (14.3%). In other words, while SOFA indicates that 78% of referrals had the same degree of priority, EUP structures the same percentage into three different categories. Given the health sector’s peculiarities, the EUP is an indicator that minimizes the generalization of different clinical conditions.

Another important point to evaluate is the ICD that most frequently resulted in death. Naturally, each ICD has its own intrinsic lethality level, meaning that some diseases kill more than others. However, it is necessary to analyze the frequency of certain occurrences and whether the incidence is local and already expected by public health institutions. Hence, with the data in hand, public health authorities can evaluate and orchestrate future intervention proposals. As shown in Table 5, Unspecified Pneumonia (J18.9) was the disease with the highest frequency (see Table 4) and resulted in the most deaths. Around 24.5% of the patients classified with this disease died, resulting in 15.9% of the total number of deaths. However, Unspecified Septicemia is one of the most lethal diseases and is responsible for the death of 50.5% of patients diagnosed with this disease.

[Figure omitted. See PDF.]

Regarding the data correlation, shown in Fig 2, Phik’s correlation revealed that the features that relate most closely to the outcome are the output bed type, requested bed type, entry bed type, SOFA scale, ICD, age, EUP score, and provider unit. Length of stay and gender did not present any relevant correlation for this topic.

[Figure omitted. See PDF.]

Machine learning model results

Table 2 shows the selection of hyperparameters that indicated the best results for the selected models. The Decision Tree model obtained the best criterion results when the entropy node division strategy was selected, max depth of the tree with a value of 50, min samples leaf with a value of 1 and max feature, square root. Random Forest obtained the best results with entropy (criterion), 50 (max depth of tree), 400 (number estimators) and sqrt (max features). For Gradient Boosting, squared error (criterion), 10 (max depth of tree), 50 (number estimators) and sqrt (max features). In the Adabost model, the best results were: 1.0 (learning rate), 400 (number estimators), and samme.r (algorithm). In XGBoost: 0.1 (learning rate), 200 (number estimators), 50 (max depth) and 1.0 (colsample by tree). The MLP models used the same hyperparameters in SGD and ADAM (hidden layer sizes, activation, batch size) which resulted in the same values: 70, relu and 32.

As for the results obtained by the selected metrics, XGBoost scored highest in accuracy (87.77%) and recall (87.77%). On the other hand, the Random Forest model (87.85%) was the most accurate, i.e. being the model that best classifies the positive outcome. As for the F1-Score value, the Gradient Boosting model had the highest value (87.56%). As for specificity, a parameter that assesses the classification performance of the negative outcome, it can be seen that the multilayer perceptron models outperform the others. The highest score was obtained by the SGD (82.94%). Table 6 presents the performance metrics for each machine learning model, including accuracy, precision, recall, F1-Score, and specificity. Notably, XGBoost outperformed the other models in accuracy and recall, making it a robust choice for predicting patient outcomes in bed regulation. However, the high specificity observed in the MLP models indicates that these models may be more suitable when the goal is to minimize false positives, particularly in critical care cases. For a better comparison of the performance of the models used, Fig 3 presents the values of each metric per computational model.

[Figure omitted. See PDF.]

Based on the results of the models, we performed a chi-square statistical validation to analyze whether the behavior of the models has statistical significance. For this, a contingency table was created with the distribution of real and predicted values of all models and for all cases a p value < 0.01 was obtained.

As for the features that were important for training the models, the most relevant features for classifying the Decision Tree were bed type, age, provider health unit and icd. The non-relevant elements were requested bed type, entry bed type and SOFA. In the Random Forest model, output bed type, age, EUP, provider health unit and icd scored the highest, while sex and SOFA were the least relevant characteristics. For the Gradient Boosting classifier output bed type, age, EUP and provider health unit were the most relevant, while sex, requested bed type, entry bed type were the lowest scorers. Adaboost considered the best characteristics to be length of stay, provider health unit, EUP and age, while the least relevant were sex, requested bed type and entry bed type. XGBoost considered output bed type, EUP and entry bed type as the most important characteristics and sex, SOFA and requested bed type as the least important. For the models that used MLP, Adam considered output bed type, requested bed type, provider health unit and age to be the most relevant, while SGD considered output bed type, provider health unit, age and ICD to be the most significant. The least important features were entry bed type and sex for Adam; and sex and requested bed type for SGD. Figs 4 and 5 show the important features of the machine learning models.

[Figure omitted. See PDF.]

Compared to Phik’s correlation, except for Adabost, all the other classifiers included output bed type as the most important feature in the classification process, which corroborates Phik’s correlation (output bed being the feature with the highest correlation with the outcome) and the weaker correlation, sex was identified as the least relevant feature for Random Forest, Grandient Boosting, Adaboost, XGBoost and SGD, while length of stay, which is another feature that has been shown to have a low correlation with the outcome, was not identified as worse in any of the classifiers, however, for Adaboost this feature was the most significant.

The ROC curve (receiver operating characteristic curve) helps to visualize the performance of classifiers to select an appropriate operating point or decision threshold [53]. The discriminative capacity is usually quantified by the area under the AUC curve when considering the prediction of a binary event. It relates the variation in the rate of true positives and false positives predicted by the models, with results on a scale of 0 to 1. Although there is no definitive consensus in the literature, most studies using this tool consider an AUC between 0.7 and 0.8 to be good and acceptable, and between 0.8 and 0.9 to be very good [54, 55].

Thus, the Decision Tree (AUC = 0.738), XGBoost (AUC = 0.766) and Random Forest (AUC = 0.785) models performed well, while the Adaboost (AUC = 0.804), Adam (AUC = 0.814) and SGD (AUC = 0.821) models performed better, falling into the very good category. Fig 6 shows the results obtained.

[Figure omitted. See PDF.]

Discussion

The use of artificial intelligence and computational methods to solve and predict problems in the health field has been going on for some years now, and although there is a considerable range of solutions in various segments, from predicting diseases by diagnosing medical images [56–60] to the classification of important markers for the prediction of cardiological [25, 61], and ophthalmological diseases [62, 63] or the analysis of data to predict early-stage cancer [64–66]; as well as robotic mechanisms for surgery, for example [67–70]. There are still some sectors that have not been explored or that have made negligible contributions [57, 71–73].

According to Yu, Beam and Kohane [57], the association of artificial intelligence will be able to contribute even more effectively to clinical practices and health management. In this way, healthcare professionals will be able to reduce the time spent on repetitive tasks in order to explore better treatments and clinical solutions aimed at patient care, something that machines cannot do and which require more humanized treatment. According to Valentim et al [8], the use of digital health solutions based on artificial intelligence are already considered relevant tools by healthcare managers, as they help to make decision-making more timely, effective and based on robust scientific evidence.

In the process of regulating hospital beds, the use of artificial intelligence helps to reduce medical subjectivity in the face of the repetitive process of countless daily regulations, tasks that can often become a tiring activity throughout the day. This certainly contributes to minimizing errors in the indication of hospital beds, especially when it comes to public health, since the daily volume of care is extremely high, as is the case in the state of Rio Grande Norte in Brazil, which has a population of approximately 3.5 million inhabitants. This could result in better resolutions for patients, as well as better equity in access to the resources available in the public health system. All of this will lead to a more timely hospitalization process for patients, and consequently to better performance in terms of hospital bed turnover—better average occupancy time for hospital beds across the entire public health network [2]. In general, the use of machine learning tools can optimize the care process, increasing efficacy, efficiency and effectiveness, which induces better resilience of the health system, especially in times of crisis, as was the case during COVID-19 [8, 74, 75].

At the management level, adopting AI-driven systems for bed regulation could lead to significant improvements in resource allocation, reducing patient wait times and optimizing bed occupancy rates. However, implementing these systems at scale presents challenges, such as ensuring adequate training for health professionals and integrating AI tools with existing hospital infrastructure. Addressing these challenges will be critical for maximizing the potential benefits of AI in the public healthcare system [5, 8].

In this study, machine learning techniques were used in different tree and ensemble models, as well as artificial neural network models on hospital bed regulation data, and the aim was to classify the outcome of patients regardless of their ICD, to help the regulating doctor and reduce subjectivity during the hospital bed regulation process.

As for the results of the computer models, XGBoost showed the best accuracy (87.77%) and recall (87.77%) values, i.e. of all the models used, it classifies the data better in general, regardless of the outcome (discharge or death), as well as, given the positive outcome, the proportion that was correctly classified. As for the accuracy indicator, which identifies which proportion of positive outcomes was correct, Random Forest performed best (87.05%). As for the F1-Score, Gradient Boosting has a better harmonic mean between precision and recall, i.e. it has a better balance in the metrics that assess the positive outcome. Regarding specificity, a metric that assesses the classification of the negative outcome, the neural network models showed the best results when compared to the tree and ensemble models, achieving scores of 82.58% (ADAM) and 82.94% (SGD). For the ROC-AUC, the SGD and ADAM models also performed better, because, as they had a more balanced classification of positive and negative outcomes, the ROC-AUC value was in the range of 82.13% and 81.42%, respectively.

Considering these results, the models used in this experiment are not only able to predict which patients are more likely to be discharged or die, but also allow us to understand which samples are being better classified concerning the outcome and the best type of hospital bed according to the clinical conditions of each patient. And so, the main metric analyzed should not only be accuracy; the other metrics that point to a positive outcome (precision, recall and ROC-AUC) should also be maximized [2]. Furthermore, it also has a positive impact on the pace of work of the regulatory professional, given that in situations of high demand and overload of requests, the assertiveness of the regulatory process can be compromised, and so the models contribute to better regulatory conduct [76, 77].

Conclusion

This study used the regulation database of the RegulaRN Leitos Gerais platform between 2021 and 2024 in machine learning models to predict the outcome of discharge and death in different diseases that require hospitalization. The results of this article show that there is no single model that obtains the best accuracy, precision, recall, F1-Score, specificity, and ROC-AUC metrics. Thus, depending on the objectives of the regulatory professionals, it should be observed which model can provide the best result based on the desired metric, i.e. for example, if the regulator’s objective is to observe the best classifications for the positive outcome, it should use XGBoost and Random Forest; If the objective is to evaluate the best classification for the negative outcome, the multilayer perceptron models should be evaluated.

It should be noted that artificial intelligence computer models enhance the activities carried out in the healthcare and management sectors. Research in this area should therefore be increasingly explored in order to minimize the precariousness and weaknesses that exist in the different health segments. In this way, this research also aims to make a positive contribution to the health system such as the SUS, which aims to guarantee universal and comprehensive access to health with equity.

A significant limitation of this study is the incomplete dataset, particularly the absence of detailed information such as pregnancy status and gestational age. This missing information could introduce bias in model predictions, particularly for patient subgroups with different clinical needs. Future work should focus on improving data collection protocols to ensure that such critical variables are recorded, allowing for more nuanced and accurate model predictions across diverse patient groups. During the evaluation of the database, some gaps were found in the data, which is why it was not included for training the models. However, for some diseases, knowing whether the patient is pregnant or not and the appropriate length of pregnancy are essential. In addition, this study considered the same evaluation of hospital outcomes in different diseases with different morbidity scales. Furthermore, another limitation of this work was the non-inclusion of other models widely used in academic literature, such as k-Nearest Neighbors (kNN) and Support Vector Machines (SVM) [78–80], as they were not included within the initial scope of this research. However, it is considered that for future work the scope of computational models can be expanded and these models included. Furthermore, still addressing future work, creating a new feature that can categorize diseases by morbidity could contribute to a more appropriate classification of the models. Furthermore, trying to identify which treatment protocols were used to treat certain diseases can also be a relevant indicator for classifying models.

Acknowledgments

We would like to thank the Public Health Secretariat of Rio Grande do Norte (SESAP/RN), the Health Technological Innovation Laboratory (LAIS) of the Federal University of Rio Grande do Norte (UFRN), the Advanced Innovation Center (NAVI) of the Federal Institute do Rio Grande do Norte (IFRN), to LyRIDS, ECE-Engineering School and the Department of Informatics and Applied Mathematics for the support necessary for the development of this research.

References

1. 1. Bastos LBR, Barbosa MA, Rosso CFW, Oliveira LMdAC, Ferreira IP, Bastos DAdS, et al. Practices and challenges on coordinating the Brazilian Unified Health System. Revista de Saúde Pública. 2020;54:25. pmid:32074220

* View Article

* PubMed/NCBI

* Google Scholar

2. 2. Barreto TdO, Veras NVR, Cardoso PH, Fernandes FRdS, Medeiros LPdS, Bezerra MV, et al. Artificial intelligence applied to analyzes during the pandemic: COVID-19 beds occupancy in the state of Rio Grande do Norte, Brazil. Frontiers in Artificial Intelligence. 2023;6:1290022. pmid:38145230

* View Article

* PubMed/NCBI

* Google Scholar

3. 3. Brasil. Portaria nº 1.559, de 1º de agosto de 2008. Institui a Política Nacional de Regulação do Sistema Único de Saúde-SUS. Diário Oficial da União. 2008;.

4. 4. Brasil. Portaria nº 2, de 28º de setembro de 2017. Consolidação das normas sobre as políticas nacionais de Saú do Sistema Único de Saúde-SUS. Diário Oficial da União. 2017;.

5. 5. Maldonado RN, Savio RO, Feijó VBER, Aroni P, Rossaneis MA, Haddad MdCFL. Hospital indicators after implementation of bed regulation strategies: an integrative review. Revista Brasileira de Enfermagem. 2021;74:e20200022. pmid:34161538

* View Article

* PubMed/NCBI

* Google Scholar

6. 6. Cordeiro MF. SISREG: uma ferramenta de desafios e avanços para a garantia do direito a saúde. 2015;.

7. 7. Junior JRM, de Souza Junior AA, da Luz AEdJ. The impact of COVID-19 on the municipal Regulation System (SISREG) of Rio de Janeiro (RJ). Research, Society and Development. 2024;13(4):e5613445564–e5613445564.

* View Article

* Google Scholar

8. 8. Valentim RAdM, Lima TS, Cortez LR, Barros DMdS, Silva RDd, Paiva JCd, et al. The relevance a technology ecosystem in the Brazilian National Health Service’s Covid-19 response: the case of Rio Grande do Norte, Brazil. Ciência & Saúde Coletiva. 2021;26:2035–2052.

* View Article

* Google Scholar

9. 9. Medina MVB. Análise da utilização da Escala Quick Sequential Organ Failure Assessment para tomada de decisão na regulação de leitos de UTI. Universidade Federal do Rio Grande do Norte; 2023.

10. 10. Konder M, O’Dwyer G. Regulation of access to hospital beds in emergency care and the development of integrated health services. 2019;.

11. 11. Kim M, Lee JY, Park JS, Kim HA, Hyun M, Suh YS, et al. Lessons from a COVID-19 hospital, Republic of Korea. Bulletin of the World Health Organization. 2020;98(12):842. pmid:33293744

* View Article

* PubMed/NCBI

* Google Scholar

12. 12. Lu Y, Guan Y, Zhong X, Fishe JN, Hogan T. Hospital beds planning and admission control policies for COVID-19 pandemic: A hybrid computer simulation approach. In: 2021 IEEE 17th International Conference on Automation SCience and Engineering (CASE). IEEE; 2021. p. 956–961.

13. 13. Pecoraro F, Luzi D, Clemente F. The efficiency in the ordinary hospital bed management: A comparative analysis in four European countries before the COVID-19 outbreak. Plos one. 2021;16(3):e0248867. pmid:33750956

* View Article

* PubMed/NCBI

* Google Scholar

14. 14. Shi F, Li H, Liu R, Liu Y, Liu X, Wen H, et al. Emergency preparedness and management of mobile cabin hospitals in China during the COVID-19 pandemic. Frontiers in Public Health. 2022;9:763723. pmid:35047472

* View Article

* PubMed/NCBI

* Google Scholar

15. 15. Alavinejad M, Mellado B, Asgary A, Mbada M, Mathaha T, Lieberman B, et al. Management of hospital beds and ventilators in the Gauteng province, South Africa, during the COVID-19 pandemic. PLOS global public health. 2022;2(11):e0001113. pmid:36962677

* View Article

* PubMed/NCBI

* Google Scholar

16. 16. Kuzior A, Kashcha M, Kuzmenko O, Lyeonov S, Brożek P. Public health system economic efficiency and COVID-19 resilience: Frontier DEA analysis. International Journal of Environmental Research and Public Health. 2022;19(22):14727. pmid:36429444

* View Article

* PubMed/NCBI

* Google Scholar

17. 17. Taylor CA, Draney MT, Ku JP, Parker D, Steele BN, Wang K, et al. Predictive medicine: computational techniques in therapeutic decision-making. Computer Aided Surgery: Official Journal of the International Society for Computer Aided Surgery (ISCAS). 1999;4(5):231–247. pmid:10581521

* View Article

* PubMed/NCBI

* Google Scholar

18. 18. Shahid N, Rappon T, Berta W. Applications of artificial neural networks in health care organizational decision-making: A scoping review. PloS one. 2019;14(2):e0212356. pmid:30779785

* View Article

* PubMed/NCBI

* Google Scholar

19. 19. Tian S, Yang W, Le Grange JM, Wang P, Huang W, Ye Z. Smart healthcare: making medical care more intelligent. Global Health Journal. 2019;3(3):62–65.

* View Article

* Google Scholar

20. 20. Panagiotou OA, Högg LH, Hricak H, Khleif SN, Levy MA, Magnus D, et al. Clinical application of computational methods in precision oncology: a review. JAMA oncology. 2020;6(8):1282–1286. pmid:32407443

* View Article

* PubMed/NCBI

* Google Scholar

21. 21. Bian J, Modave F. The rapid growth of intelligent systems in health and health care; 2020.

22. 22. Gupta PK, Ramachandran AT, Keerthi AM, Dave PS, Giridhar S, Kallapur SS, et al. An overview of clinical decision support system (CDSS) as a computational tool and its applications in public health. Applications in ubiquitous computing. 2021; p. 81–117.

* View Article

* Google Scholar

23. 23. Moulaei K, Shanbehzadeh M, Mohammadi-Taghiabad Z, Kazemi-Arpanahi H. Comparing machine learning algorithms for predicting COVID-19 mortality. BMC medical informatics and decision making. 2022;22(1):2. pmid:34983496

* View Article

* PubMed/NCBI

* Google Scholar

24. 24. Albuquerque G, Fernandes F, Barbalho IM, Barros DM, Morais PS, Morais AH, et al. Computational methods applied to syphilis: where are we, and where are we going? Frontiers in Public Health. 2023;11:1201725. pmid:37680278

* View Article

* PubMed/NCBI

* Google Scholar

25. 25. Carvalho DRd, Araújo BGd, Lacerda JMT, Dantas MdCR, Hékis HR, Valentim RAdM. An architecture for online transient detection in electrocardiogram signals on the MP-HA protocol. Revista Brasileira de Engenharia Biomédica. 2012;28:346–354.

* View Article

* Google Scholar

26. 26. Levantesi S, Pizzorusso V. Application of machine learning to mortality modeling and forecasting. Risks. 2019;7(1):26.

* View Article

* Google Scholar

27. 27. Shamout F, Zhu T, Clifton DA. Machine learning for clinical outcome prediction. IEEE reviews in Biomedical Engineering. 2020;14:116–126.

* View Article

* Google Scholar

28. 28. Huang Y, Talwar A, Chatterjee S, Aparasu RR. Application of machine learning in predicting hospital readmissions: a scoping review of the literature. BMC medical research methodology. 2021;21:1–14. pmid:33952192

* View Article

* PubMed/NCBI

* Google Scholar

29. 29. Dixit RR. Risk Assessment for Hospital Readmissions: Insights from Machine Learning Algorithms. Sage Science Review of Applied Machine Learning. 2021;4(2):1–15.

* View Article

* Google Scholar

30. 30. Iwase S, Nakada Ta, Shimada T, Oami T, Shimazui T, Takahashi N, et al. Prediction algorithm for ICU mortality and length of stay using machine learning. Scientific reports. 2022;12(1):12912. pmid:35902633

* View Article

* PubMed/NCBI

* Google Scholar

31. 31. Baak M, Koopman R, Snoek H, Klous S. A new correlation coefficient between categorical, ordinal and interval variables with Pearson characteristics. Computational Statistics & Data Analysis. 2020;152:107043.

* View Article

* Google Scholar

32. 32. Iwendi C, Bashir AK, Peshkar A, Sujatha R, Chatterjee JM, Pasupuleti S, et al. COVID-19 patient health prediction using boosted random forest algorithm. Frontiers in public health. 2020;8:357. pmid:32719767

* View Article

* PubMed/NCBI

* Google Scholar

33. 33. Aljameel SS, Khan IU, Aslam N, Aljabri M, Alsulmi ES. Machine Learning-Based Model to Predict the Disease Severity and Outcome in COVID-19 Patients. Scientific programming. 2021;2021(1):5587188.

* View Article

* Google Scholar

34. 34. Endo PT, Santos GL, de Lima Xavier ME, Nascimento Campos GR, de Lima LC, Silva I, et al. Illusion of truth: analysing and classifying COVID-19 fake news in brazilian portuguese language. Big Data and Cognitive Computing. 2022;6(2):36.

* View Article

* Google Scholar

35. 35. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research. 2002;16:321–357.

* View Article

* Google Scholar

36. 36. Maiga J, Hungilo GG, et al. Comparison of machine learning models in prediction of cardiovascular disease using health record data. In: 2019 international conference on informatics, multimedia, cyber and information system (ICIMCIS). IEEE; 2019. p. 45–48.

37. 37. Hamida S, El Gannour O, Cherradi B, Ouajji H, Raihani A. Optimization of machine learning algorithms hyper-parameters for improving the prediction of patients infected with COVID-19. In: 2020 ieee 2nd international conference on electronics, control, optimization and computer science (icecocs). IEEE; 2020. p. 1–6.

38. 38. Papaiz F, Dourado MET Jr, de Medeiros Valentim RA, Pinto R, de Morais AHF, Arrais JP. Ensemble-imbalance-based classification for amyotrophic lateral sclerosis prognostic prediction: identifying short-survival patients at diagnosis. BMC Medical Informatics and Decision Making. 2024;24(1):80. pmid:38504285

* View Article

* PubMed/NCBI

* Google Scholar

39. 39. Divya KS, Bhargavi P, Jyothi S. Machine learning algorithms in big data analytics. Int J Comput Sci Eng. 2018;6(1):63–70.

* View Article

* Google Scholar

40. 40. Liu J, Wang L, Zhang L, Zhang Z, Zhang S. Predictive analytics for blood glucose concentration: an empirical study using the tree-based ensemble approach. Library Hi Tech. 2020;38(4):835–858.

* View Article

* Google Scholar

41. 41. Rahul K, Banyal RK, Goswami P, Kumar V. Machine learning algorithms for big data analytics. In: Computational Methods and Data Engineering: Proceedings of ICMDE 2020, Volume 1. Springer; 2021. p. 359–367.

42. 42. Charbuty B, Abdulazeez A. Classification based on decision tree algorithm for machine learning. Journal of Applied Science and Technology Trends. 2021;2(01):20–28.

* View Article

* Google Scholar

43. 43. Rigatti SJ. Random forest. Journal of Insurance Medicine. 2017;47(1):31–39. pmid:28836909

* View Article

* PubMed/NCBI

* Google Scholar

44. 44. Fafalios S, Charonyktakis P, Tsamardinos I. Gradient boosting trees. Gnosis Data Analysis PC. 2020;1.

* View Article

* Google Scholar

45. 45. Schapire RE. Empirical inference. Berlin, Heidelberg. 2013; p. 37–52.

* View Article

* Google Scholar

46. 46. Sheng C, Yu H. An optimized prediction algorithm based on XGBoost. In: 2022 International Conference on Networking and Network Applications (NaNA). IEEE; 2022. p. 1–6.

47. 47. Popescu MC, Balas VE, Perescu-Popescu L, Mastorakis N. Multilayer perceptron and neural networks. WSEAS Transactions on Circuits and Systems. 2009;8(7):579–588.

* View Article

* Google Scholar

48. 48. Kingma DP, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv:14126980. 2014;.

49. 49. Pedregosa F. Scikit-learn: Machine learning in python Fabian. Journal of machine learning research. 2011;12:2825.

* View Article

* Google Scholar

50. 50. Ensor KB, Glynn PW. Stochastic optimization via grid search. Lectures in Applied Mathematics-American Mathematical Society. 1997;33:89–100.

* View Article

* Google Scholar

51. 51. Bergstra J, Bengio Y. Random search for hyper-parameter optimization. Journal of machine learning research. 2012;13(2).

* View Article

* Google Scholar

52. 52. Ahsan MM, E Alam T, Trafalis T, Huebner P. Deep MLP-CNN model using mixed-data to distinguish between COVID-19 and Non-COVID-19 patients. Symmetry. 2020;12(9):1526.

* View Article

* Google Scholar

53. 53. Bradley AP. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern recognition. 1997;30(7):1145–1159.

* View Article

* Google Scholar

54. 54. De Hond AA, Steyerberg EW, Van Calster B. Interpreting area under the receiver operating characteristic curve. The Lancet Digital Health. 2022;4(12):e853–e855. pmid:36270955

* View Article

* PubMed/NCBI

* Google Scholar

55. 55. Nahm FS. Receiver operating characteristic curve: overview and practical use for clinicians. Korean journal of anesthesiology. 2022;75(1):25–36. pmid:35124947

* View Article

* PubMed/NCBI

* Google Scholar

56. 56. Mohammad-Rahimi H, Nadimi M, Ghalyanchi-Langeroudi A, Taheri M, Ghafouri-Fard S. Application of machine learning in diagnosis of COVID-19 through X-ray and CT images: a scoping review. Frontiers in cardiovascular medicine. 2021;8:638011. pmid:33842563

* View Article

* PubMed/NCBI

* Google Scholar

57. 57. Yu KH, Beam AL, Kohane IS. Artificial intelligence in healthcare. Nature biomedical engineering. 2018;2(10):719–731. pmid:31015651

* View Article

* PubMed/NCBI

* Google Scholar

58. 58. Davenport T, Kalakota R. The potential for artificial intelligence in healthcare. Future healthcare journal. 2019;6(2):94–98. pmid:31363513

* View Article

* PubMed/NCBI

* Google Scholar

59. 59. Owoyemi A, Owoyemi J, Osiyemi A, Boyd A. Artificial Intelligence for Healthcare in Africa. Frontiers in digital health 2: 6; 2020. pmid:34713019

* View Article

* PubMed/NCBI

* Google Scholar

60. 60. Yang CC. Explainable artificial intelligence for predictive modeling in healthcare. Journal of healthcare informatics research. 2022;6(2):228–239. pmid:35194568

* View Article

* PubMed/NCBI

* Google Scholar

61. 61. Cuocolo R, Perillo T, De Rosa E, Ugga L, Petretta M. Current applications of big data and machine learning in cardiology. Journal of geriatric cardiology: JGC. 2019;16(8):601. pmid:31555327

* View Article

* PubMed/NCBI

* Google Scholar

62. 62. Barros DM, Moura JC, Freire CR, Taleb AC, Valentim RA, Morais PS. Machine learning applied to retinal image processing for glaucoma detection: review and perspective. Biomedical engineering online. 2020;19:1–21. pmid:32293466

* View Article

* PubMed/NCBI

* Google Scholar

63. 63. Srivastava O, Tennant M, Grewal P, Rubin U, Seamone M. Artificial intelligence and machine learning in ophthalmology: A review. Indian Journal of Ophthalmology. 2023;71(1):11–17. pmid:36588202

* View Article

* PubMed/NCBI

* Google Scholar

64. 64. Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI. Machine learning applications in cancer prognosis and prediction. Computational and structural biotechnology journal. 2015;13:8–17. pmid:25750696

* View Article

* PubMed/NCBI

* Google Scholar

65. 65. Firmino M, Angelo G, Morais H, Dantas MR, Valentim R. Computer-aided detection (CADe) and diagnosis (CADx) system for lung cancer with likelihood of malignancy. Biomedical engineering online. 2016;15:1–17. pmid:26759159

* View Article

* PubMed/NCBI

* Google Scholar

66. 66. Galvao-Lima L, Morais H, Valentim R, Barreto E. miRNAs as biomarkers for early cancer detection and their application in the development of new diagnostic tools. Biomedical engineering online. 2021;20:1–21. pmid:33593374

* View Article

* PubMed/NCBI

* Google Scholar

67. 67. Zhao B, Waterman R, Urman R, Gabriel RA. A machine learning approach to predicting case duration for robot-assisted surgery. Journal of Medical Systems. 2019;43:1–32. pmid:30612192

* View Article

* PubMed/NCBI

* Google Scholar

68. 68. Panesar S, Cagle Y, Chander D, Morey J, Fernandez-Miranda J, Kliot M. Artificial intelligence and the future of surgical robotics. Annals of surgery. 2019;270(2):223–226. pmid:30907754

* View Article

* PubMed/NCBI

* Google Scholar

69. 69. Zhou XY, Guo Y, Shen M, Yang GZ. Application of artificial intelligence in surgery. Frontiers of medicine. 2020;14:417–430. pmid:32705406

* View Article

* PubMed/NCBI

* Google Scholar

70. 70. Moglia A, Georgiou K, Georgiou E, Satava RM, Cuschieri A. A systematic review on artificial intelligence in robot-assisted surgery. International Journal of Surgery. 2021;95:106151. pmid:34695601

* View Article

* PubMed/NCBI

* Google Scholar

71. 71. Fernandes YYMP, Araújo GTd, Araújo BGd, Dantas MdCR, Carvalho DRd, Valentim RAdM. ILITIA: telehealth architecture for high-risk gestation classification. Research on Biomedical Engineering. 2017;33(3):237–246.

* View Article

* Google Scholar

72. 72. Reddy S. Explainability and artificial intelligence in medicine. The Lancet Digital Health. 2022;4(4):e214–e215. pmid:35337639

* View Article

* PubMed/NCBI

* Google Scholar

73. 73. Schwalbe N, Wahl B. Artificial intelligence and the future of global health. The Lancet. 2020;395(10236):1579–1586. pmid:32416782

* View Article

* PubMed/NCBI

* Google Scholar

74. 74. Ammar W, Kdouh O, Hammoud R, Hamadeh R, Harb H, Ammar Z, et al. Health system resilience: Lebanon and the Syrian refugee crisis. Journal of global health. 2016;6(2). pmid:28154758

* View Article

* PubMed/NCBI

* Google Scholar

75. 75. Massuda A, Hone T, Leles FAG, De Castro MC, Atun R. The Brazilian health system at crossroads: progress, crisis and resilience. BMJ global health. 2018;3(4):e000829. pmid:29997906

* View Article

* PubMed/NCBI

* Google Scholar

76. 76. Muhammad L, Algehyne EA, Usman SS, Ahmad A, Chakraborty C, Mohammed IA. Supervised machine learning models for prediction of COVID-19 infection using epidemiology dataset. SN computer science. 2021;2(1):1–13. pmid:33263111

* View Article

* PubMed/NCBI

* Google Scholar

77. 77. Silva Junior CL, Guabiraba KPdL, Gomes GG, Andrade CLTd, Melo EA. Outpatient regulation in Primary Care in the municipality of Rio de Janeiro, Brazil, based on the local regulatory doctors. Ciência & Saúde Coletiva. 2022;27:2481–2493.

* View Article

* Google Scholar

78. 78. Ali A, Khan Z, Khan DM, Aldahmani S. An Optimal Random Projection k Nearest Neighbours Ensemble via Extended Neighbourhood Rule for Binary Classification. IEEE Access. 2024;.

* View Article

* Google Scholar

79. 79. Ali A, Hamraz M, Gul N, Khan DM, Aldahmani S, Khan Z. A k nearest neighbour ensemble via extended neighbourhood rule and feature subsets. Pattern Recognition. 2023;142:109641.

* View Article

* Google Scholar

80. 80. Vijayarani S, Dhayanand S, Phil M. Kidney disease prediction using SVM and ANN algorithms. International Journal of Computing and Business Research (IJCBR). 2015;6(2):1–12.

* View Article

* Google Scholar

Citation: Barreto TdO, Farias FLdO, Veras NVR, Cardoso PH, Silva GJPC, Pinheiro CdO, et al. (2024) Artificial intelligence applied to bed regulation in Rio Grande do Norte: Data analysis and application of machine learning on the “RegulaRN Leitos Gerais” platform. PLoS ONE 19(12): e0315379. https://doi.org/10.1371/journal.pone.0315379

About the Authors:

Tiago de Oliveira Barreto

Roles: Conceptualization, Data curation, Formal analysis, Methodology, Resources, Writing – original draft, Writing – review & editing

E-mail: [email protected]

Affiliation: Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, Rio Grande do Norte, Brazil

ORICD: https://orcid.org/0000-0002-4399-9518

Fernando Lucas de Oliveira Farias

Roles: Conceptualization, Data curation, Investigation, Writing – original draft, Writing – review & editing

Affiliation: Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, Rio Grande do Norte, Brazil

Nicolas Vinícius Rodrigues Veras

Roles: Data curation, Methodology, Resources, Writing – original draft, Writing – review & editing

Affiliations: Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, Rio Grande do Norte, Brazil, Advanced Nucleus of Technological Innovation (NAVI), Federal Institute of Rio Grande do Norte (IFRN), Natal, Rio Grande do Norte, Brazil

Pablo Holanda Cardoso

Roles: Data curation, Methodology, Resources, Writing – original draft

Gleyson José Pinheiro Caldeira Silva

Roles: Investigation, Methodology, Visualization, Writing – original draft

Affiliation: Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, Rio Grande do Norte, Brazil

Chander de Oliveira Pinheiro

Roles: Funding acquisition, Methodology, Writing – review & editing

Affiliation: Secretary of Public Health of Rio Grande do Norte, Natal, Rio Grande do Norte, Brazil

Maria Valéria Bezerra Medina

Roles: Methodology, Visualization, Writing – review & editing

Affiliation: Secretary of Public Health of Rio Grande do Norte, Natal, Rio Grande do Norte, Brazil

Felipe Ricardo dos Santos Fernandes

Roles: Conceptualization, Investigation, Methodology, Writing – review & editing

Affiliation: Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, Rio Grande do Norte, Brazil

ORICD: https://orcid.org/0000-0003-0805-1796

Ingridy Marina Pierre Barbalho

Roles: Investigation, Methodology, Visualization, Writing – review & editing

Affiliation: Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, Rio Grande do Norte, Brazil

Lyane Ramalho Cortez

Roles: Funding acquisition, Methodology, Validation, Writing – review & editing

Affiliations: Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, Rio Grande do Norte, Brazil, Secretary of Public Health of Rio Grande do Norte, Natal, Rio Grande do Norte, Brazil

João Paulo Queiroz dos Santos

Roles: Methodology, Project administration, Writing – review & editing

Antonio Higor Freire de Morais

Roles: Conceptualization, Methodology, Resources, Writing – review & editing

Gustavo Fontoura de Souza

Roles: Data curation, Methodology, Writing – review & editing

Guilherme Medeiros Machado

Roles: Methodology, Validation, Writing – review & editing

Affiliation: LyRIDS, ECE-Engineering School, Paris, France

Márcia Jacyntha Nunes Rodrigues Lucena

Roles: Investigation, Methodology, Writing – review & editing

Affiliation: Department of Informatics and Applied Mathematics, Federal University of Rio Grande do Norte (UFRN), Natal, Rio Grande do Norte, Brazil

Ricardo Alexsandro de Medeiros Valentim

Roles: Conceptualization, Investigation, Methodology, Writing – original draft, Writing – review & editing

Affiliation: Laboratory of Technological Innovation in Health (LAIS), Federal University of Rio Grande do Norte (UFRN), Natal, Rio Grande do Norte, Brazil

ORICD: https://orcid.org/0000-0002-9216-8593

[/RAW_REF_TEXT]

References

1. Bastos LBR, Barbosa MA, Rosso CFW, Oliveira LMdAC, Ferreira IP, Bastos DAdS, et al. Practices and challenges on coordinating the Brazilian Unified Health System. Revista de Saúde Pública. 2020;54:25. pmid:32074220

2. Barreto TdO, Veras NVR, Cardoso PH, Fernandes FRdS, Medeiros LPdS, Bezerra MV, et al. Artificial intelligence applied to analyzes during the pandemic: COVID-19 beds occupancy in the state of Rio Grande do Norte, Brazil. Frontiers in Artificial Intelligence. 2023;6:1290022. pmid:38145230

3. Brasil. Portaria nº 1.559, de 1º de agosto de 2008. Institui a Política Nacional de Regulação do Sistema Único de Saúde-SUS. Diário Oficial da União. 2008;.

4. Brasil. Portaria nº 2, de 28º de setembro de 2017. Consolidação das normas sobre as políticas nacionais de Saú do Sistema Único de Saúde-SUS. Diário Oficial da União. 2017;.

5. Maldonado RN, Savio RO, Feijó VBER, Aroni P, Rossaneis MA, Haddad MdCFL. Hospital indicators after implementation of bed regulation strategies: an integrative review. Revista Brasileira de Enfermagem. 2021;74:e20200022. pmid:34161538

6. Cordeiro MF. SISREG: uma ferramenta de desafios e avanços para a garantia do direito a saúde. 2015;.

7. Junior JRM, de Souza Junior AA, da Luz AEdJ. The impact of COVID-19 on the municipal Regulation System (SISREG) of Rio de Janeiro (RJ). Research, Society and Development. 2024;13(4):e5613445564–e5613445564.

8. Valentim RAdM, Lima TS, Cortez LR, Barros DMdS, Silva RDd, Paiva JCd, et al. The relevance a technology ecosystem in the Brazilian National Health Service’s Covid-19 response: the case of Rio Grande do Norte, Brazil. Ciência & Saúde Coletiva. 2021;26:2035–2052.

9. Medina MVB. Análise da utilização da Escala Quick Sequential Organ Failure Assessment para tomada de decisão na regulação de leitos de UTI. Universidade Federal do Rio Grande do Norte; 2023.

10. Konder M, O’Dwyer G. Regulation of access to hospital beds in emergency care and the development of integrated health services. 2019;.

11. Kim M, Lee JY, Park JS, Kim HA, Hyun M, Suh YS, et al. Lessons from a COVID-19 hospital, Republic of Korea. Bulletin of the World Health Organization. 2020;98(12):842. pmid:33293744

12. Lu Y, Guan Y, Zhong X, Fishe JN, Hogan T. Hospital beds planning and admission control policies for COVID-19 pandemic: A hybrid computer simulation approach. In: 2021 IEEE 17th International Conference on Automation SCience and Engineering (CASE). IEEE; 2021. p. 956–961.

13. Pecoraro F, Luzi D, Clemente F. The efficiency in the ordinary hospital bed management: A comparative analysis in four European countries before the COVID-19 outbreak. Plos one. 2021;16(3):e0248867. pmid:33750956

14. Shi F, Li H, Liu R, Liu Y, Liu X, Wen H, et al. Emergency preparedness and management of mobile cabin hospitals in China during the COVID-19 pandemic. Frontiers in Public Health. 2022;9:763723. pmid:35047472

15. Alavinejad M, Mellado B, Asgary A, Mbada M, Mathaha T, Lieberman B, et al. Management of hospital beds and ventilators in the Gauteng province, South Africa, during the COVID-19 pandemic. PLOS global public health. 2022;2(11):e0001113. pmid:36962677

16. Kuzior A, Kashcha M, Kuzmenko O, Lyeonov S, Brożek P. Public health system economic efficiency and COVID-19 resilience: Frontier DEA analysis. International Journal of Environmental Research and Public Health. 2022;19(22):14727. pmid:36429444

17. Taylor CA, Draney MT, Ku JP, Parker D, Steele BN, Wang K, et al. Predictive medicine: computational techniques in therapeutic decision-making. Computer Aided Surgery: Official Journal of the International Society for Computer Aided Surgery (ISCAS). 1999;4(5):231–247. pmid:10581521

18. Shahid N, Rappon T, Berta W. Applications of artificial neural networks in health care organizational decision-making: A scoping review. PloS one. 2019;14(2):e0212356. pmid:30779785

19. Tian S, Yang W, Le Grange JM, Wang P, Huang W, Ye Z. Smart healthcare: making medical care more intelligent. Global Health Journal. 2019;3(3):62–65.

20. Panagiotou OA, Högg LH, Hricak H, Khleif SN, Levy MA, Magnus D, et al. Clinical application of computational methods in precision oncology: a review. JAMA oncology. 2020;6(8):1282–1286. pmid:32407443

21. Bian J, Modave F. The rapid growth of intelligent systems in health and health care; 2020.

22. Gupta PK, Ramachandran AT, Keerthi AM, Dave PS, Giridhar S, Kallapur SS, et al. An overview of clinical decision support system (CDSS) as a computational tool and its applications in public health. Applications in ubiquitous computing. 2021; p. 81–117.

23. Moulaei K, Shanbehzadeh M, Mohammadi-Taghiabad Z, Kazemi-Arpanahi H. Comparing machine learning algorithms for predicting COVID-19 mortality. BMC medical informatics and decision making. 2022;22(1):2. pmid:34983496

24. Albuquerque G, Fernandes F, Barbalho IM, Barros DM, Morais PS, Morais AH, et al. Computational methods applied to syphilis: where are we, and where are we going? Frontiers in Public Health. 2023;11:1201725. pmid:37680278

25. Carvalho DRd, Araújo BGd, Lacerda JMT, Dantas MdCR, Hékis HR, Valentim RAdM. An architecture for online transient detection in electrocardiogram signals on the MP-HA protocol. Revista Brasileira de Engenharia Biomédica. 2012;28:346–354.

26. Levantesi S, Pizzorusso V. Application of machine learning to mortality modeling and forecasting. Risks. 2019;7(1):26.

27. Shamout F, Zhu T, Clifton DA. Machine learning for clinical outcome prediction. IEEE reviews in Biomedical Engineering. 2020;14:116–126.

28. Huang Y, Talwar A, Chatterjee S, Aparasu RR. Application of machine learning in predicting hospital readmissions: a scoping review of the literature. BMC medical research methodology. 2021;21:1–14. pmid:33952192

29. Dixit RR. Risk Assessment for Hospital Readmissions: Insights from Machine Learning Algorithms. Sage Science Review of Applied Machine Learning. 2021;4(2):1–15.

30. Iwase S, Nakada Ta, Shimada T, Oami T, Shimazui T, Takahashi N, et al. Prediction algorithm for ICU mortality and length of stay using machine learning. Scientific reports. 2022;12(1):12912. pmid:35902633

31. Baak M, Koopman R, Snoek H, Klous S. A new correlation coefficient between categorical, ordinal and interval variables with Pearson characteristics. Computational Statistics & Data Analysis. 2020;152:107043.

32. Iwendi C, Bashir AK, Peshkar A, Sujatha R, Chatterjee JM, Pasupuleti S, et al. COVID-19 patient health prediction using boosted random forest algorithm. Frontiers in public health. 2020;8:357. pmid:32719767

33. Aljameel SS, Khan IU, Aslam N, Aljabri M, Alsulmi ES. Machine Learning-Based Model to Predict the Disease Severity and Outcome in COVID-19 Patients. Scientific programming. 2021;2021(1):5587188.

34. Endo PT, Santos GL, de Lima Xavier ME, Nascimento Campos GR, de Lima LC, Silva I, et al. Illusion of truth: analysing and classifying COVID-19 fake news in brazilian portuguese language. Big Data and Cognitive Computing. 2022;6(2):36.

35. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research. 2002;16:321–357.

36. Maiga J, Hungilo GG, et al. Comparison of machine learning models in prediction of cardiovascular disease using health record data. In: 2019 international conference on informatics, multimedia, cyber and information system (ICIMCIS). IEEE; 2019. p. 45–48.

37. Hamida S, El Gannour O, Cherradi B, Ouajji H, Raihani A. Optimization of machine learning algorithms hyper-parameters for improving the prediction of patients infected with COVID-19. In: 2020 ieee 2nd international conference on electronics, control, optimization and computer science (icecocs). IEEE; 2020. p. 1–6.

38. Papaiz F, Dourado MET Jr, de Medeiros Valentim RA, Pinto R, de Morais AHF, Arrais JP. Ensemble-imbalance-based classification for amyotrophic lateral sclerosis prognostic prediction: identifying short-survival patients at diagnosis. BMC Medical Informatics and Decision Making. 2024;24(1):80. pmid:38504285

39. Divya KS, Bhargavi P, Jyothi S. Machine learning algorithms in big data analytics. Int J Comput Sci Eng. 2018;6(1):63–70.

40. Liu J, Wang L, Zhang L, Zhang Z, Zhang S. Predictive analytics for blood glucose concentration: an empirical study using the tree-based ensemble approach. Library Hi Tech. 2020;38(4):835–858.

41. Rahul K, Banyal RK, Goswami P, Kumar V. Machine learning algorithms for big data analytics. In: Computational Methods and Data Engineering: Proceedings of ICMDE 2020, Volume 1. Springer; 2021. p. 359–367.

42. Charbuty B, Abdulazeez A. Classification based on decision tree algorithm for machine learning. Journal of Applied Science and Technology Trends. 2021;2(01):20–28.

43. Rigatti SJ. Random forest. Journal of Insurance Medicine. 2017;47(1):31–39. pmid:28836909

44. Fafalios S, Charonyktakis P, Tsamardinos I. Gradient boosting trees. Gnosis Data Analysis PC. 2020;1.

45. Schapire RE. Empirical inference. Berlin, Heidelberg. 2013; p. 37–52.

46. Sheng C, Yu H. An optimized prediction algorithm based on XGBoost. In: 2022 International Conference on Networking and Network Applications (NaNA). IEEE; 2022. p. 1–6.

47. Popescu MC, Balas VE, Perescu-Popescu L, Mastorakis N. Multilayer perceptron and neural networks. WSEAS Transactions on Circuits and Systems. 2009;8(7):579–588.

48. Kingma DP, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv:14126980. 2014;.

49. Pedregosa F. Scikit-learn: Machine learning in python Fabian. Journal of machine learning research. 2011;12:2825.

50. Ensor KB, Glynn PW. Stochastic optimization via grid search. Lectures in Applied Mathematics-American Mathematical Society. 1997;33:89–100.

51. Bergstra J, Bengio Y. Random search for hyper-parameter optimization. Journal of machine learning research. 2012;13(2).

52. Ahsan MM, E Alam T, Trafalis T, Huebner P. Deep MLP-CNN model using mixed-data to distinguish between COVID-19 and Non-COVID-19 patients. Symmetry. 2020;12(9):1526.

53. Bradley AP. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern recognition. 1997;30(7):1145–1159.

54. De Hond AA, Steyerberg EW, Van Calster B. Interpreting area under the receiver operating characteristic curve. The Lancet Digital Health. 2022;4(12):e853–e855. pmid:36270955

55. Nahm FS. Receiver operating characteristic curve: overview and practical use for clinicians. Korean journal of anesthesiology. 2022;75(1):25–36. pmid:35124947

56. Mohammad-Rahimi H, Nadimi M, Ghalyanchi-Langeroudi A, Taheri M, Ghafouri-Fard S. Application of machine learning in diagnosis of COVID-19 through X-ray and CT images: a scoping review. Frontiers in cardiovascular medicine. 2021;8:638011. pmid:33842563

57. Yu KH, Beam AL, Kohane IS. Artificial intelligence in healthcare. Nature biomedical engineering. 2018;2(10):719–731. pmid:31015651

58. Davenport T, Kalakota R. The potential for artificial intelligence in healthcare. Future healthcare journal. 2019;6(2):94–98. pmid:31363513

59. Owoyemi A, Owoyemi J, Osiyemi A, Boyd A. Artificial Intelligence for Healthcare in Africa. Frontiers in digital health 2: 6; 2020. pmid:34713019

60. Yang CC. Explainable artificial intelligence for predictive modeling in healthcare. Journal of healthcare informatics research. 2022;6(2):228–239. pmid:35194568

61. Cuocolo R, Perillo T, De Rosa E, Ugga L, Petretta M. Current applications of big data and machine learning in cardiology. Journal of geriatric cardiology: JGC. 2019;16(8):601. pmid:31555327

62. Barros DM, Moura JC, Freire CR, Taleb AC, Valentim RA, Morais PS. Machine learning applied to retinal image processing for glaucoma detection: review and perspective. Biomedical engineering online. 2020;19:1–21. pmid:32293466

63. Srivastava O, Tennant M, Grewal P, Rubin U, Seamone M. Artificial intelligence and machine learning in ophthalmology: A review. Indian Journal of Ophthalmology. 2023;71(1):11–17. pmid:36588202

64. Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI. Machine learning applications in cancer prognosis and prediction. Computational and structural biotechnology journal. 2015;13:8–17. pmid:25750696

65. Firmino M, Angelo G, Morais H, Dantas MR, Valentim R. Computer-aided detection (CADe) and diagnosis (CADx) system for lung cancer with likelihood of malignancy. Biomedical engineering online. 2016;15:1–17. pmid:26759159

66. Galvao-Lima L, Morais H, Valentim R, Barreto E. miRNAs as biomarkers for early cancer detection and their application in the development of new diagnostic tools. Biomedical engineering online. 2021;20:1–21. pmid:33593374

67. Zhao B, Waterman R, Urman R, Gabriel RA. A machine learning approach to predicting case duration for robot-assisted surgery. Journal of Medical Systems. 2019;43:1–32. pmid:30612192

68. Panesar S, Cagle Y, Chander D, Morey J, Fernandez-Miranda J, Kliot M. Artificial intelligence and the future of surgical robotics. Annals of surgery. 2019;270(2):223–226. pmid:30907754

69. Zhou XY, Guo Y, Shen M, Yang GZ. Application of artificial intelligence in surgery. Frontiers of medicine. 2020;14:417–430. pmid:32705406

70. Moglia A, Georgiou K, Georgiou E, Satava RM, Cuschieri A. A systematic review on artificial intelligence in robot-assisted surgery. International Journal of Surgery. 2021;95:106151. pmid:34695601

71. Fernandes YYMP, Araújo GTd, Araújo BGd, Dantas MdCR, Carvalho DRd, Valentim RAdM. ILITIA: telehealth architecture for high-risk gestation classification. Research on Biomedical Engineering. 2017;33(3):237–246.

72. Reddy S. Explainability and artificial intelligence in medicine. The Lancet Digital Health. 2022;4(4):e214–e215. pmid:35337639

73. Schwalbe N, Wahl B. Artificial intelligence and the future of global health. The Lancet. 2020;395(10236):1579–1586. pmid:32416782

74. Ammar W, Kdouh O, Hammoud R, Hamadeh R, Harb H, Ammar Z, et al. Health system resilience: Lebanon and the Syrian refugee crisis. Journal of global health. 2016;6(2). pmid:28154758

75. Massuda A, Hone T, Leles FAG, De Castro MC, Atun R. The Brazilian health system at crossroads: progress, crisis and resilience. BMJ global health. 2018;3(4):e000829. pmid:29997906

76. Muhammad L, Algehyne EA, Usman SS, Ahmad A, Chakraborty C, Mohammed IA. Supervised machine learning models for prediction of COVID-19 infection using epidemiology dataset. SN computer science. 2021;2(1):1–13. pmid:33263111

77. Silva Junior CL, Guabiraba KPdL, Gomes GG, Andrade CLTd, Melo EA. Outpatient regulation in Primary Care in the municipality of Rio de Janeiro, Brazil, based on the local regulatory doctors. Ciência & Saúde Coletiva. 2022;27:2481–2493.

78. Ali A, Khan Z, Khan DM, Aldahmani S. An Optimal Random Projection k Nearest Neighbours Ensemble via Extended Neighbourhood Rule for Binary Classification. IEEE Access. 2024;.

79. Ali A, Hamraz M, Gul N, Khan DM, Aldahmani S, Khan Z. A k nearest neighbour ensemble via extended neighbourhood rule and feature subsets. Pattern Recognition. 2023;142:109641.

80. Vijayarani S, Dhayanand S, Phil M. Kidney disease prediction using SVM and ANN algorithms. International Journal of Computing and Business Research (IJCBR). 2015;6(2):1–12.

Word count: 10748

Show less

© 2024 Barreto et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

Bed regulation within Brazil’s National Health System (SUS) plays a crucial role in managing care for patients in need of hospitalization. In Rio Grande do Norte, Brazil, the RegulaRN Leitos Gerais platform was the information system developed to register requests for bed regulation for COVID-19 cases. However, the platform was expanded to cover a range of diseases that require hospitalization. This study explored different machine learning models in the RegulaRN database, from October 2021 to January 2024, totaling 47,056 regulations. From the data obtained, 12 features were selected from the 24 available. After that, blank and inconclusive data were removed, as well as the outcomes that had values other than discharge and death, rendering a binary classification. Data was also correlated, balanced, and divided into training and test portions for application in machine learning models. The results showed better accuracy (87.77%) and recall (87.77%) for the XGBoost model, and higher precision (87.85%) and F1-Score (87.56%) for the Random Forest and Gradient Boosting models, respectively. As for Specificity (82.94%) and ROC-AUC (82.13%), the Multilayer Perceptron with SGD optimizer obtained the highest scores. The results evidenced which models could adequately assist medical regulators during the decision-making process for bed regulation, enabling even more effective regulation and, consequently, greater availability of beds and a decrease in waiting time for patients.

Details

Title

Artificial intelligence applied to bed regulation in Rio Grande do Norte: Data analysis and application of machine learning on the “RegulaRN Leitos Gerais” platform

Author

de Oliveira Barreto, Tiago

; Fernando Lucas de Oliveira Farias; Nicolas Vinícius Rodrigues Veras; Cardoso, Pablo Holanda; Gleyson José Pinheiro Caldeira Silva; Chander de Oliveira Pinheiro; Bezerra Medina, Maria Valéria; Felipe Ricardo dos Santos Fernandes

; Ingridy Marina Pierre Barbalho; Lyane Ramalho Cortez; João Paulo Queiroz dos Santos; Antonio Higor Freire de Morais; Gustavo Fontoura de Souza; Guilherme Medeiros Machado; Márcia Jacyntha Nunes Rodrigues Lucena; Ricardo Alexsandro de Medeiros Valentim

First page

e0315379

Section

Research Article

Publication year

2024

Publication date

Dec 2024

Publisher

Public Library of Science

e-ISSN

19326203

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1371/journal.pone.0315379

ProQuest document ID

3150325581

Artificial intelligence applied to bed regulation in Rio Grande do Norte: Data analysis and application of machine learning on the “RegulaRN Leitos Gerais” platform

Jump to:

Full Text

Introduction

Materials and methods

Extraction, evaluation, characterization and pre-processing

Correlation between dataset features

Definition of evaluation metrics

Data balancing and splitting into training and validation data

Definition of models for data classification

Hyperparameters to define the best model

Results

General data analysis

Machine learning model results

Discussion

Conclusion

Acknowledgments

References

Abstract

Details

Suggested sources