Advancement in medical report generation: current practices, challenges, and future directions

Article Subjects > Engineering Europe University of Atlantic > Research > Scientific Production
Ibero-american International University > Research > Scientific Production
Universidad Internacional do Cuanza > Research > Articles and books
University of La Romana > Research > Scientific Production
Cerrado Inglés The correct analysis of medical images requires the medical knowledge and expertise of radiologists to understand, clarify, and explain complex patterns and diagnose diseases. After analyzing, radiologists write detailed and well-structured reports that contribute to the precise and timely diagnosis of patients. However, manually writing reports is often expensive and time-consuming, and it is difficult for radiologists to analyze medical images, particularly images with multiple views and perceptions. It is challenging to accurately diagnose diseases, and many methods are proposed to help radiologists, both traditional and deep learning-based. Automatic report generation is widely used to tackle this issue as it streamlines the process and lessens the burden of manual labeling of images. This paper introduces a systematic literature review with a focus on analyses and evaluating existing research on medical report generation. This SLR follows a proper protocol for the planning, reviewing, and reporting of the results. This review recognizes that the most commonly used deep learning models are encoder-decoder frameworks (45 articles), which provide an accuracy of around 92–95%. Transformers-based models (20 articles) are the second most established method and achieve an accuracy of around 91%. The remaining articles explored in this SLR are attention mechanisms (10), RNN-LSTM (10), Large language models (LLM-10), and graph-based methods (4) with promising results. However, these methods also face certain limitations such as overfitting, risk of bias, and high data dependency that impact their performance. The review not only highlights the strengths and challenges of these methods but also suggests ways to handle them in the future to increase the accuracy and timely generation of medical reports. The goal of this review is to direct radiologists toward methods that lessen their workload and provide precise medical diagnoses. metadata Rehman, Marwareed and Shafi, Imran and Ahmad, Jamil and Osorio García, Carlos Manuel and Pascual Barrera, Alina Eugenia and Ashraf, Imran mail UNSPECIFIED, UNSPECIFIED, UNSPECIFIED, carlos.osorio@uneatlantico.es, alina.pascual@unini.edu.mx, UNSPECIFIED (2024) Advancement in medical report generation: current practices, challenges, and future directions. Medical & Biological Engineering & Computing. ISSN 0140-0118

Full text not available from this repository.

Abstract

The correct analysis of medical images requires the medical knowledge and expertise of radiologists to understand, clarify, and explain complex patterns and diagnose diseases. After analyzing, radiologists write detailed and well-structured reports that contribute to the precise and timely diagnosis of patients. However, manually writing reports is often expensive and time-consuming, and it is difficult for radiologists to analyze medical images, particularly images with multiple views and perceptions. It is challenging to accurately diagnose diseases, and many methods are proposed to help radiologists, both traditional and deep learning-based. Automatic report generation is widely used to tackle this issue as it streamlines the process and lessens the burden of manual labeling of images. This paper introduces a systematic literature review with a focus on analyses and evaluating existing research on medical report generation. This SLR follows a proper protocol for the planning, reviewing, and reporting of the results. This review recognizes that the most commonly used deep learning models are encoder-decoder frameworks (45 articles), which provide an accuracy of around 92–95%. Transformers-based models (20 articles) are the second most established method and achieve an accuracy of around 91%. The remaining articles explored in this SLR are attention mechanisms (10), RNN-LSTM (10), Large language models (LLM-10), and graph-based methods (4) with promising results. However, these methods also face certain limitations such as overfitting, risk of bias, and high data dependency that impact their performance. The review not only highlights the strengths and challenges of these methods but also suggests ways to handle them in the future to increase the accuracy and timely generation of medical reports. The goal of this review is to direct radiologists toward methods that lessen their workload and provide precise medical diagnoses.

Item Type: Article
Uncontrolled Keywords: Automated medical health services; Deep learning; Image processing; Public health; Report generation
Subjects: Subjects > Engineering
Divisions: Europe University of Atlantic > Research > Scientific Production
Ibero-american International University > Research > Scientific Production
Universidad Internacional do Cuanza > Research > Articles and books
University of La Romana > Research > Scientific Production
Date Deposited: 22 Jan 2025 23:30
Last Modified: 22 Jan 2025 23:30
URI: https://repositorio.unic.co.ao/id/eprint/16269

Actions (login required)

View Item View Item

en

close

Single-cell omics for nutrition research: an emerging opportunity for human-centric investigations

Understanding how dietary compounds affect human health is challenged by their molecular complexity and cell-type–specific effects. Conventional multi-cell type (bulk) analyses obscure cellular heterogeneity, while animal and standard in vitro models often fail to replicate human physiology. Single-cell omics technologies—such as single-cell RNA sequencing, as well as single-cell–resolved proteomic and metabolomic approaches—enable high-resolution investigation of nutrient–cell interactions and reveal mechanisms at a single-cell resolution. When combined with advanced human-derived in vitro systems like organoids and organ-on-chip platforms, they support mechanistic studies in physiologically relevant contexts. This review outlines emerging applications of single-cell omics in nutrition research, emphasizing their potential to uncover cell-specific dietary responses, identify nutrient-sensitive pathways, and capture interindividual variability. It also discusses key challenges—including technical limitations, model selection, and institutional biases—and identifies strategic directions to facilitate broader adoption in the field. Collectively, single-cell omics offer a transformative framework to advance human-centric nutrition research.

Producción Científica

Manuela Cassotta mail manucassotta@gmail.com, Yasmany Armas Diaz mail , Danila Cianciosi mail , Bei Yang mail , Zexiu Qi mail , Ge Chen mail , Santos Gracia Villar mail santos.gracia@uneatlantico.es, Luis Alonso Dzul López mail luis.dzul@uneatlantico.es, Giuseppe Grosso mail , José L. Quiles mail , Jianbo Xiao mail , Maurizio Battino mail maurizio.battino@uneatlantico.es, Francesca Giampieri mail francesca.giampieri@uneatlantico.es,

Cassotta

<a class="ep_document_link" href="/17862/1/sensors-25-06419.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>

en

open

Edge-Based Autonomous Fire and Smoke Detection Using MobileNetV2

Forest fires pose significant threats to ecosystems, human life, and the global climate, necessitating rapid and reliable detection systems. Traditional fire detection approaches, including sensor networks, satellite monitoring, and centralized image analysis, often suffer from delayed response, high false positives, and limited deployment in remote areas. Recent deep learning-based methods offer high classification accuracy but are typically computationally intensive and unsuitable for low-power, real-time edge devices. This study presents an autonomous, edge-based forest fire and smoke detection system using a lightweight MobileNetV2 convolutional neural network. The model is trained on a balanced dataset of fire, smoke, and non-fire images and optimized for deployment on resource-constrained edge devices. The system performs near real-time inference, achieving a test accuracy of 97.98% with an average end-to-end prediction latency of 0.77 s per frame (approximately 1.3 FPS) on the Raspberry Pi 5 edge device. Predictions include the class label, confidence score, and timestamp, all generated locally without reliance on cloud connectivity, thereby enhancing security and robustness against potential cyber threats. Experimental results demonstrate that the proposed solution maintains high predictive performance comparable to state-of-the-art methods while providing efficient, offline operation suitable for real-world environmental monitoring and early wildfire mitigation. This approach enables cost-effective, scalable deployment in remote forest regions, combining accuracy, speed, and autonomous edge processing for timely fire and smoke detection.

Producción Científica

Dilshod Sharobiddinov mail , Hafeez Ur Rehman Siddiqui mail , Adil Ali Saleem mail , Gerardo Méndez Mezquita mail , Debora L. Ramírez-Vargas mail debora.ramirez@unini.edu.mx, Isabel de la Torre Díez mail ,

Sharobiddinov

<a href="/17863/1/v16p4316.pdf" class="ep_document_link"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>

en

open

Divulging Patterns: An Analytical Review for Machine Learning Methodologies for Breast Cancer Detection

Breast cancer is a lethal carcinoma impacting a considerable number of women across the globe. While preventive measures are limited, early detection remains the most effective strategy. Accurate classification of breast tumors into benign and malignant categories is important which may help physicians in diagnosing the disease faster. This survey investigates the emerging inclination and approaches in the area of machine learning (ML) for the diagnosis of breast cancer, pointing out the classification techniques based on both segmentation and feature selection. Certain datasets such as the Wisconsin Diagnostic Breast Cancer Dataset (WDBC), Wisconsin Breast Cancer Dataset Original (WBCD), Wisconsin Prognostic Breast Cancer Dataset (WPBC), BreakHis, and others are being evaluated in this study for the demonstration of their influence on the performance of the diagnostic tools and the accuracy of the models such as Support vector machine, Convolutional Neural Networks (CNNs) and ensemble approaches. The main shortcomings or research gaps such as prejudice of datasets, scarcity of generalizability, and interpretation challenges are highlighted. This research emphasizes the importance of the hybrid methodologies, cross-dataset validation, and the engineering of explainable AI to narrow these gaps and enhance the overall clinical acceptance of ML-based detection tools.

Producción Científica

Alveena Saleem mail , Muhammad Umair mail , Muhammad Tahir Naseem mail , Muhammad Zubair mail , Silvia Aparicio Obregón mail silvia.aparicio@uneatlantico.es, Rubén Calderón Iglesias mail ruben.calderon@uneatlantico.es, Shoaib Hassan mail , Imran Ashraf mail ,

Saleem

<a href="/17871/1/ijph-70-1608318.pdf" class="ep_document_link"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>

en

open

Unhealthy Ultra-Processed Food Consumption in Children and Adolescents Living in the Mediterranean Area: The DELICIOUS Project

Objectives: This study addressed the consumption of ultra-processed foods (UPFs) formulated with excess of energy/fats/sugars (hence deemed as unhealthy) and factors associated with it in children and adolescents living in 5 Mediterranean countries participating to the DELICIOUS (UnDErstanding consumer food choices & promotion of healthy and sustainable Mediterranean diet and LIfestyle in Children and adolescents through behavIOUral change actionS) project.Methods: A total of 2011 parents of children and adolescents (6–17 years) participated in a survey exploring their children’s frequency consumption of unhealthy UPFs and demographic, eating, and lifestyle habits.Results: Most children consumed unhealthy UPFs daily: higher intake was associated with being older and with obesity, as well as higher parental education and younger age. Children eating more frequently out of home and with a higher number of meals were also more likely to consume unhealthier UPF. Moreover, more screen time and a lower healthy lifestyle score were associated with higher unhealthy UPF consumption.Conclusion: consumption of unhealthy UPFs seems to be preeminent in children and adolescents living in the Mediterranean area and associated with an overall unhealthy lifestyle.

Producción Científica

Alice Rosi mail , Francesca Giampieri mail francesca.giampieri@uneatlantico.es, Osama Abdelkarim mail , Mohamed Aly mail , Achraf Ammar mail , Evelyn Frias-Toral mail , Juancho Pons mail , Laura Vázquez-Araújo mail , Alessandro Scuderi mail , Nunzia Decembrino mail , Alice Leonardi mail , Fernando Maniega Legarda mail , Lorenzo Monasta mail , Ana Mata mail , Adrián Chacón mail , Pablo Busó mail , Giuseppe Grosso mail ,

Rosi

<a class="ep_document_link" href="/17849/1/1-s2.0-S2590005625001043-main.pdf"><img class="ep_doc_icon" alt="[img]" src="/style/images/fileicons/text.png" border="0"/></a>

en

open

Ultra Wideband radar-based gait analysis for gender classification using artificial intelligence

Gender classification plays a vital role in various applications, particularly in security and healthcare. While several biometric methods such as facial recognition, voice analysis, activity monitoring, and gait recognition are commonly used, their accuracy and reliability often suffer due to challenges like body part occlusion, high computational costs, and recognition errors. This study investigates gender classification using gait data captured by Ultra-Wideband radar, offering a non-intrusive and occlusion-resilient alternative to traditional biometric methods. A dataset comprising 163 participants was collected, and the radar signals underwent preprocessing, including clutter suppression and peak detection, to isolate meaningful gait cycles. Spectral features extracted from these cycles were transformed using a novel integration of Feedforward Artificial Neural Networks and Random Forests , enhancing discriminative power. Among the models evaluated, the Random Forest classifier demonstrated superior performance, achieving 94.68% accuracy and a cross-validation score of 0.93. The study highlights the effectiveness of Ultra-wideband radar and the proposed transformation framework in advancing robust gender classification.

Producción Científica

Adil Ali Saleem mail , Hafeez Ur Rehman Siddiqui mail , Muhammad Amjad Raza mail , Sandra Dudley mail , Julio César Martínez Espinosa mail ulio.martinez@unini.edu.mx, Luis Alonso Dzul López mail luis.dzul@uneatlantico.es, Isabel de la Torre Díez mail ,

Saleem