A Novel Stacking-Based Deterministic Ensemble Model for Infectious Disease Prediction
Article
Subjects > Engineering
Europe University of Atlantic > Research > Scientific Production
Universidad Internacional do Cuanza > Research > Articles and books
Abierto
Inglés
Infectious Disease Prediction aims to anticipate the aspects of both seasonal epidemics and future pandemics. However, a single model will most likely not capture all the dataset’s patterns and qualities. Ensemble learning combines multiple models to obtain a single prediction that uses the qualities of each model. This study aims to develop a stacked ensemble model to accurately predict the future occurrences of infectious diseases viewed at some point in time as epidemics, namely, dengue, influenza, and tuberculosis. The main objective is to enhance the prediction performance of the proposed model by reducing prediction errors. Autoregressive integrated moving average, exponential smoothing, and neural network autoregression are applied to the disease dataset individually. The gradient boosting model combines the regress values of the above three statistical models to obtain an ensemble model. The results conclude that the forecasting precision of the proposed stacked ensemble model is better than that of the standard gradient boosting model. The ensemble model reduces the prediction errors, root-mean-square error, for the dengue, influenza, and tuberculosis dataset by approximately 30%, 24%, and 25%, respectively
metadata
Mahajan, Asmita and Sharma, Nonita and Aparicio Obregón, Silvia and Alyami, Hashem and Alharbi, Abdullah and Anand, Divya and Sharma, Manish and Goyal, Nitin
mail
UNSPECIFIED, UNSPECIFIED, silvia.aparicio@uneatlantico.es, UNSPECIFIED, UNSPECIFIED, divya.anand@uneatlantico.es, UNSPECIFIED, UNSPECIFIED
(2022)
A Novel Stacking-Based Deterministic Ensemble Model for Infectious Disease Prediction.
Mathematics, 10 (10).
p. 1714.
ISSN 2227-7390
Text
mathematics-10-01714-v2.pdf - Published Version Available under License Creative Commons Attribution. Download (1MB) |
Abstract
Infectious Disease Prediction aims to anticipate the aspects of both seasonal epidemics and future pandemics. However, a single model will most likely not capture all the dataset’s patterns and qualities. Ensemble learning combines multiple models to obtain a single prediction that uses the qualities of each model. This study aims to develop a stacked ensemble model to accurately predict the future occurrences of infectious diseases viewed at some point in time as epidemics, namely, dengue, influenza, and tuberculosis. The main objective is to enhance the prediction performance of the proposed model by reducing prediction errors. Autoregressive integrated moving average, exponential smoothing, and neural network autoregression are applied to the disease dataset individually. The gradient boosting model combines the regress values of the above three statistical models to obtain an ensemble model. The results conclude that the forecasting precision of the proposed stacked ensemble model is better than that of the standard gradient boosting model. The ensemble model reduces the prediction errors, root-mean-square error, for the dengue, influenza, and tuberculosis dataset by approximately 30%, 24%, and 25%, respectively
Item Type: | Article |
---|---|
Uncontrolled Keywords: | autoregressive integrated moving average; epidemiology; exponential smoothing; ensemble; gradient boosting; infectious disease; neural network autoregression; pandemic; stacking |
Subjects: | Subjects > Engineering |
Divisions: | Europe University of Atlantic > Research > Scientific Production Universidad Internacional do Cuanza > Research > Articles and books |
Date Deposited: | 31 May 2022 18:14 |
Last Modified: | 11 Jul 2023 23:30 |
URI: | https://repositorio.unic.co.ao/id/eprint/2118 |
Actions (login required)
View Item |
<a class="ep_document_link" href="/10290/1/Influence%20of%20E-learning%20training%20on%20the%20acquisition%20of%20competences%20in%20basketball%20coaches%20in%20Cantabria.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
The main aim of this study was to analyse the influence of e-learning training on the acquisition of competences in basketball coaches in Cantabria. The current landscape of basketball coach training shows an increasing demand for innovative training models and emerging pedagogies, including e-learning-based methodologies. The study sample consisted of fifty students from these courses, all above 16 years of age (36 males, 14 females). Among them, 16% resided outside the autonomous community of Cantabria, 10% resided more than 50 km from the city of Santander, 36% between 10 and 50 km, 14% less than 10 km, and 24% resided within Santander city. Data were collected through a Google Forms survey distributed by the Cantabrian Basketball Federation to training course students. Participation was voluntary and anonymous. The survey, consisting of 56 questions, was validated by two sports and health doctors and two senior basketball coaches. The collected data were processed and analysed using Microsoft® Excel version 16.74, and the results were expressed in percentages. The analysis revealed that 24.60% of the students trained through the e-learning methodology considered themselves fully qualified as basketball coaches, contrasting with 10.98% of those trained via traditional face-to-face methodology. The results of the study provide insights into important characteristics that can be adjusted and improved within the investigated educational process. Moreover, the study concludes that e-learning training effectively qualifies basketball coaches in Cantabria.
Josep Alemany Iturriaga mail josep.alemany@uneatlantico.es, Álvaro Velarde-Sotres mail alvaro.velarde@uneatlantico.es, Javier Jorge mail , Kamil Giglio mail ,
Alemany Iturriaga
<a href="/14584/1/s41598-024-73664-6.pdf" class="ep_document_link"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
The evolution of the COVID-19 pandemic has been associated with variations in clinical presentation and severity. Similarly, prediction scores may suffer changes in their diagnostic accuracy. The aim of this study was to test the 30-day mortality predictive validity of the 4C and SEIMC scores during the sixth wave of the pandemic and to compare them with those of validation studies. This was a longitudinal retrospective observational study. COVID-19 patients who were admitted to the Emergency Department of a Spanish hospital from December 15, 2021, to January 31, 2022, were selected. A side-by-side comparison with the pivotal validation studies was subsequently performed. The main measures were 30-day mortality and the 4C and SEIMC scores. A total of 27,614 patients were considered in the study, including 22,361 from the 4C, 4,627 from the SEIMC and 626 from our hospital. The 30-day mortality rate was significantly lower than that reported in the validation studies. The AUCs were 0.931 (95% CI: 0.90–0.95) for 4C and 0.903 (95% CI: 086–0.93) for SEIMC, which were significantly greater than those obtained in the first wave. Despite the changes that have occurred during the coronavirus disease 2019 (COVID-19) pandemic, with a reduction in lethality, scorecard systems are currently still useful tools for detecting patients with poor disease risk, with better prognostic capacity.
Pedro Ángel de Santos Castro mail , Carlos del Pozo Vegas mail , Leyre Teresa Pinilla Arribas mail , Daniel Zalama Sánchez mail , Ancor Sanz-García mail , Tony Giancarlo Vásquez del Águila mail , Pablo González Izquierdo mail , Sara de Santos Sánchez mail , Cristina Mazas Pérez-Oleaga mail cristina.mazas@uneatlantico.es, Irma Dominguez Azpíroz mail irma.dominguez@unini.edu.mx, Iñaki Elío Pascual mail inaki.elio@uneatlantico.es, Francisco Martín-Rodríguez mail ,
de Santos Castro
<a class="ep_document_link" href="/14915/1/s41598-024-74357-w.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
Diabetes is a persistent health condition led by insufficient use or inappropriate use of insulin in the body. If left undetected, it can lead to further complications involving organ damage such as heart, lungs, and eyes. Timely detection of diabetes helps obtain the right medication, diet, and exercise plan to lead a healthy life. ML approach has been utilized to obtain rapid and reliable diabetes detection, however, existing approaches suffer from the use of limited datasets, lack of generalizability, and lower accuracy. This study proposes a novel feature extraction approach to overcome these limitations by using an ensemble of convolutional neural network (CNN) and long short-term memory (LSTM) models. Multiple datasets are combined to make a larger dataset for experiments and multiple features are utilized for investigating the efficacy of the proposed approach. Features from the extra tree classifier, CNN, and LSTM are also considered for comparison. Experimental results reveal the superb performance of CNN-LSTM-based features with random forest model obtaining a 0.99 accuracy score. This performance is further validated by comparison with existing approaches and k-fold cross-validation which shows the proposed approach provides robust results.
Furqan Rustam mail , Ahmad Sami Al-Shamayleh mail , Rahman Shafique mail , Silvia Aparicio Obregón mail silvia.aparicio@uneatlantico.es, Rubén Calderón Iglesias mail ruben.calderon@uneatlantico.es, J. Pablo Miramontes Gonzalez mail , Imran Ashraf mail ,
Rustam
<a href="/14934/1/s41598-024-77427-1.pdf" class="ep_document_link"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
A deep learning approach to optimize remaining useful life prediction for Li-ion batteries
Accurately predicting the remaining useful life (RUL) of lithium-ion (Li-ion) batteries is vital for improving battery performance and safety in applications such as consumer electronics and electric vehicles. While the prediction of RUL for these batteries is a well-established field, the current research refines RUL prediction methodologies by leveraging deep learning techniques, advancing prediction accuracy. This study proposes AccuCell Prodigy, a deep learning model that integrates auto-encoders and long short-term memory (LSTM) layers to enhance RUL prediction accuracy and efficiency. The model’s name reflects its precision (“AccuCell”) and predictive strength (“Prodigy”). The proposed methodology involves preparing a dataset of battery operational features, split using an 80–20 ratio for training and testing. Leveraging 22 variations of current (critical parameter) across three Li-ion cells, AccuCell Prodigy significantly reduces prediction errors, achieving a mean square error of 0.1305%, mean absolute error of 2.484%, and root mean square error of 3.613%, with a high R-squared value of 0.9849. These results highlight its robustness and potential for advancing battery health management.
Mahrukh Iftikhar mail , Muhammad Shoaib mail , Ayesha Altaf mail , Faiza Iqbal mail , Santos Gracia Villar mail santos.gracia@uneatlantico.es, Luis Alonso Dzul López mail luis.dzul@uneatlantico.es, Imran Ashraf mail ,
Iftikhar
<a class="ep_document_link" href="/14949/1/s41598-024-76213-3.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
Nanotechnology has opened new avenues for advanced research in various fields of soft materials. Materials scientists, chemists, physicists, and computational mathematicians have begun to take a keen interest in soft materials due to their potential applications in nanopatterning, membrane separation, drug delivery, nanolithography, advanced storage media, and nanorobotics. The unique properties of soft materials, particularly self-assembly, have made them useful in fields ranging from nanotechnology to biomedicine. The discovery of new morphologies in the diblock copolymer system in curved geometries is a challenging problem for mathematicians and theoretical scientists. Structural frustration under the effects of confinement in the system helps predict new structures. This mathematical study evaluates the effects of confinement and curvature on symmetric diblock copolymer melt using a cell dynamic simulation model. New patterns in lamella morphologies are predicted. The Laplacian involved in the cell dynamic simulation model is approximated by generating a 17-point stencil discretized to a polar grid by the finite difference method. Codes are programmed in FORTRAN to run the simulation, and IBM open DX is used to visualize the results. Comparison of computational results with existing studies validates this study and identifies defects and new patterns.
Muhammad Javed Iqbal mail , Inayatullah Soomro mail , Mirza Abdur Razzaq mail , Erislandy Omar-Martinez mail , Zaily Leticia Velázquez Martínez mail zaily.velazquez@unini.edu.mx, Imran Ashraf mail ,
Iqbal