Prediction β-Thalassemia carriers using complete blood count features
Artículo
Materias > Ingeniería
Universidad Europea del Atlántico > Investigación > Producción Científica
Fundación Universitaria Internacional de Colombia > Investigación > Producción Científica
Universidad Internacional Iberoamericana México > Investigación > Producción Científica
Universidad Internacional Iberoamericana Puerto Rico > Investigación > Producción Científica
Universidad Internacional do Cuanza > Investigación > Artículos y libros
Abierto
Inglés
β-Thalassemia is one of the dangerous causes of the high mortality rate in the Mediterranean countries. Substantial resources are required to save a β-Thalassemia carriers’ life and early detection of thalassemia patients can help appropriate treatment to increase the carrier’s life expectancy. Being a genetic disease, it can not be prevented however the analysis of several indicators in parents’ blood can be used to detect disorders causing Thalassemia. Laboratory tests for Thalassemia are time-consuming and expensive like high-performance liquid chromatography, Complete Blood Count (CBC) with peripheral smear, genetic test, etc. Red blood indices from CBC can be used with machine learning models for the same task. Despite the available approaches for Thalassemia carriers from CBC data, gaps exist between the desired and achieved accuracy. Moreover, the data imbalance problem is studied well which makes the models less generalizable. This study proposes a highly accurate approach for β-Thalassemia detection using red blood indices from CBC augmented by supervised machine learning. In view of the fact that all the features do not carry predictive information regarding the target variable, this study employs a unified framework of two features selection techniques including Principal Component Analysis (PCA) and Singular Vector Decomposition (SVD). The data imbalance between β-Thalassemia carrier and non-carriers is handled by Synthetic Minority Oversampling Technique (SMOTE) and Adaptive Synthetic (ADASYN). Extensive experiments are performed using many state-of-the-art machine learning models and deep learning models. Experimental results indicate the superiority of the proposed approach over existing approaches with an accuracy score of 0.96.
metadata
Rustam, Furqan; Ashraf, Imran; Jabbar, Shehbaz; Tutusaus, Kilian; Mazas Pérez-Oleaga, Cristina; Pascual Barrera, Alina Eugenia y de la Torre Diez, Isabel
mail
SIN ESPECIFICAR, SIN ESPECIFICAR, SIN ESPECIFICAR, kilian.tutusaus@uneatlantico.es, cristina.mazas@uneatlantico.es, alina.pascual@unini.edu.mx, SIN ESPECIFICAR
(2022)
Prediction β-Thalassemia carriers using complete blood count features.
Scientific Reports, 12 (1).
ISSN 2045-2322
Texto
s41598-022-22011-8.pdf Available under License Creative Commons Attribution. Descargar (2MB) |
Resumen
β-Thalassemia is one of the dangerous causes of the high mortality rate in the Mediterranean countries. Substantial resources are required to save a β-Thalassemia carriers’ life and early detection of thalassemia patients can help appropriate treatment to increase the carrier’s life expectancy. Being a genetic disease, it can not be prevented however the analysis of several indicators in parents’ blood can be used to detect disorders causing Thalassemia. Laboratory tests for Thalassemia are time-consuming and expensive like high-performance liquid chromatography, Complete Blood Count (CBC) with peripheral smear, genetic test, etc. Red blood indices from CBC can be used with machine learning models for the same task. Despite the available approaches for Thalassemia carriers from CBC data, gaps exist between the desired and achieved accuracy. Moreover, the data imbalance problem is studied well which makes the models less generalizable. This study proposes a highly accurate approach for β-Thalassemia detection using red blood indices from CBC augmented by supervised machine learning. In view of the fact that all the features do not carry predictive information regarding the target variable, this study employs a unified framework of two features selection techniques including Principal Component Analysis (PCA) and Singular Vector Decomposition (SVD). The data imbalance between β-Thalassemia carrier and non-carriers is handled by Synthetic Minority Oversampling Technique (SMOTE) and Adaptive Synthetic (ADASYN). Extensive experiments are performed using many state-of-the-art machine learning models and deep learning models. Experimental results indicate the superiority of the proposed approach over existing approaches with an accuracy score of 0.96.
Tipo de Documento: | Artículo |
---|---|
Palabras Clave: | Computational biology andbioinformatics; Health care |
Clasificación temática: | Materias > Ingeniería |
Divisiones: | Universidad Europea del Atlántico > Investigación > Producción Científica Fundación Universitaria Internacional de Colombia > Investigación > Producción Científica Universidad Internacional Iberoamericana México > Investigación > Producción Científica Universidad Internacional Iberoamericana Puerto Rico > Investigación > Producción Científica Universidad Internacional do Cuanza > Investigación > Artículos y libros |
Depositado: | 05 Dic 2022 23:30 |
Ultima Modificación: | 17 Jul 2023 23:30 |
URI: | https://repositorio.unic.co.ao/id/eprint/4905 |
Acciones (logins necesarios)
Ver Objeto |
<a href="/10290/1/Influence%20of%20E-learning%20training%20on%20the%20acquisition%20of%20competences%20in%20basketball%20coaches%20in%20Cantabria.pdf" class="ep_document_link"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
The main aim of this study was to analyse the influence of e-learning training on the acquisition of competences in basketball coaches in Cantabria. The current landscape of basketball coach training shows an increasing demand for innovative training models and emerging pedagogies, including e-learning-based methodologies. The study sample consisted of fifty students from these courses, all above 16 years of age (36 males, 14 females). Among them, 16% resided outside the autonomous community of Cantabria, 10% resided more than 50 km from the city of Santander, 36% between 10 and 50 km, 14% less than 10 km, and 24% resided within Santander city. Data were collected through a Google Forms survey distributed by the Cantabrian Basketball Federation to training course students. Participation was voluntary and anonymous. The survey, consisting of 56 questions, was validated by two sports and health doctors and two senior basketball coaches. The collected data were processed and analysed using Microsoft® Excel version 16.74, and the results were expressed in percentages. The analysis revealed that 24.60% of the students trained through the e-learning methodology considered themselves fully qualified as basketball coaches, contrasting with 10.98% of those trained via traditional face-to-face methodology. The results of the study provide insights into important characteristics that can be adjusted and improved within the investigated educational process. Moreover, the study concludes that e-learning training effectively qualifies basketball coaches in Cantabria.
Josep Alemany Iturriaga mail josep.alemany@uneatlantico.es, Álvaro Velarde-Sotres mail alvaro.velarde@uneatlantico.es, Javier Jorge mail , Kamil Giglio mail ,
Alemany Iturriaga
<a href="/14584/1/s41598-024-73664-6.pdf" class="ep_document_link"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
The evolution of the COVID-19 pandemic has been associated with variations in clinical presentation and severity. Similarly, prediction scores may suffer changes in their diagnostic accuracy. The aim of this study was to test the 30-day mortality predictive validity of the 4C and SEIMC scores during the sixth wave of the pandemic and to compare them with those of validation studies. This was a longitudinal retrospective observational study. COVID-19 patients who were admitted to the Emergency Department of a Spanish hospital from December 15, 2021, to January 31, 2022, were selected. A side-by-side comparison with the pivotal validation studies was subsequently performed. The main measures were 30-day mortality and the 4C and SEIMC scores. A total of 27,614 patients were considered in the study, including 22,361 from the 4C, 4,627 from the SEIMC and 626 from our hospital. The 30-day mortality rate was significantly lower than that reported in the validation studies. The AUCs were 0.931 (95% CI: 0.90–0.95) for 4C and 0.903 (95% CI: 086–0.93) for SEIMC, which were significantly greater than those obtained in the first wave. Despite the changes that have occurred during the coronavirus disease 2019 (COVID-19) pandemic, with a reduction in lethality, scorecard systems are currently still useful tools for detecting patients with poor disease risk, with better prognostic capacity.
Pedro Ángel de Santos Castro mail , Carlos del Pozo Vegas mail , Leyre Teresa Pinilla Arribas mail , Daniel Zalama Sánchez mail , Ancor Sanz-García mail , Tony Giancarlo Vásquez del Águila mail , Pablo González Izquierdo mail , Sara de Santos Sánchez mail , Cristina Mazas Pérez-Oleaga mail cristina.mazas@uneatlantico.es, Irma Dominguez Azpíroz mail irma.dominguez@unini.edu.mx, Iñaki Elío Pascual mail inaki.elio@uneatlantico.es, Francisco Martín-Rodríguez mail ,
de Santos Castro
<a href="/14206/1/mnm_2024_17-3_mnm-17-3-mnm240038_mnm-17-mnm240038.pdf" class="ep_document_link"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
Exploring body composition and somatotype profiles among youth professional soccer players
OBJECTIVE: This study aimed to analyze the body composition and somatotype of professional soccer players, investigating variations across categories and playing positions. METHODS: An observational, cross-sectional, and analytical study was conducted with 51 male professional soccer players in the U-19 and U-20 categories. Data about sex, age, height, and weight were collected between March and May 2023. Body composition analysis utilized the ISAK protocol for the restricted profile, while somatotype categorization employed the Heath and Carter formula. Statistical analysis was performed using IBM SPSS Statistics V.26, which involved the application of Mann-Whitney and Kruskal-Wallis tests to discern differences in body composition variables and proportionality based on categories and playing positions. The Dunn test further identified specific positions exhibiting significant differences. RESULTS: The study encompassed 51 players, highlighting meaningful differences in body composition. The average body mass in kg was 75.8 (±6.9) for U-20 players and 70.5 (±6.1) for U-19 players. The somatotype values were 2.6-4.6-2.3 for U-20 players and 2.5-4.3-2.8 for U-19 players, with a predominance of muscle mass in all categories, characterizing them as balanced mesomorphs. CONCLUSIONS: Body composition and somatotype findings underscore distinctions in body mass across categories and playing positions, with notably higher body mass and muscle mass predominance in elevated categories. However, the prevailing skeletal muscle development establishes a significant semblance with the recognized somatotype standard for soccer.
Raynier Zambrano-Villacres mail , Evelyn Frias-Toral mail , Emily Maldonado-Ponce mail , Carlos Poveda-Loor mail , Paola Leal mail , Álvaro Velarde-Sotres mail alvaro.velarde@uneatlantico.es, Alice Leonardi mail , Bruno Trovato mail , Federico Roggio mail , Alessandro Castorina mail , Xu Wenxin mail , Giuseppe Musumeci mail ,
Zambrano-Villacres
<a href="/14482/1/sensors-24-06325.pdf" class="ep_document_link"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
Smart Physiotherapy: Advancing Arm-Based Exercise Classification with PoseNet and Ensemble Models
Telephysiotherapy has emerged as a vital solution for delivering remote healthcare, particularly in response to global challenges such as the COVID-19 pandemic. This study seeks to enhance telephysiotherapy by developing a system capable of accurately classifying physiotherapeutic exercises using PoseNet, a state-of-the-art pose estimation model. A dataset was collected from 49 participants (35 males, 14 females) performing seven distinct exercises, with twelve anatomical landmarks then extracted using the Google MediaPipe library. Each landmark was represented by four features, which were used for classification. The core challenge addressed in this research involves ensuring accurate and real-time exercise classification across diverse body morphologies and exercise types. Several tree-based classifiers, including Random Forest, Extra Tree Classifier, XGBoost, LightGBM, and Hist Gradient Boosting, were employed. Furthermore, two novel ensemble models called RandomLightHist Fusion and StackedXLightRF are proposed to enhance classification accuracy. The RandomLightHist Fusion model achieved superior accuracy of 99.6%, demonstrating the system’s robustness and effectiveness. This innovation offers a practical solution for providing real-time feedback in telephysiotherapy, with potential to improve patient outcomes through accurate monitoring and assessment of exercise performance.
Shahzad Hussain mail , Hafeez Ur Rehman Siddiqui mail , Adil Ali Saleem mail , Muhammad Amjad Raza mail , Josep Alemany Iturriaga mail josep.alemany@uneatlantico.es, Álvaro Velarde-Sotres mail alvaro.velarde@uneatlantico.es, Isabel De la Torre Díez mail , Sandra Dudley mail ,
Hussain
<a class="ep_document_link" href="/14281/1/s41598-024-69663-2.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>
en
open
An enhanced approach for predicting air pollution using quantum support vector machine
The essence of quantum machine learning is to optimize problem-solving by executing machine learning algorithms on quantum computers and exploiting potent laws such as superposition and entanglement. Support vector machine (SVM) is widely recognized as one of the most effective classification machine learning techniques currently available. Since, in conventional systems, the SVM kernel technique tends to sluggish down and even fail as datasets become increasingly complex or jumbled. To compare the execution time and accuracy of conventional SVM classification to that of quantum SVM classification, the appropriate quantum features for mapping need to be selected. As the dataset grows complex, the importance of selecting an appropriate feature map that outperforms or performs as well as the classification grows. This paper utilizes conventional SVM to select an optimal feature map and benchmark dataset for predicting air quality. Experimental evidence demonstrates that the precision of quantum SVM surpasses that of classical SVM for air quality assessment. Using quantum labs from IBM’s quantum computer cloud, conventional and quantum computing have been compared. When applied to the same dataset, the conventional SVM achieved an accuracy of 91% and 87% respectively, whereas the quantum SVM demonstrated an accuracy of 97% and 94% respectively for air quality prediction. The study introduces the use of quantum Support Vector Machines (SVM) for predicting air quality. It emphasizes the novel method of choosing the best quantum feature maps. Through the utilization of quantum-enhanced feature mapping, our objective is to exceed the constraints of classical SVM and achieve unparalleled levels of precision and effectiveness. We conduct precise experiments utilizing IBM’s state-of-the-art quantum computer cloud to compare the performance of conventional and quantum SVM algorithms on a shared dataset.
Omer Farooq mail , Maida Shahid mail , Shazia Arshad mail , Ayesha Altaf mail , Faiza Iqbal mail , Yini Airet Miro Vera mail , Miguel Angel Lopez Flores mail , Imran Ashraf mail ,
Farooq