FIGURE 9. The layout diagram for U-Net architecture.

PAM-UNet: Enhanced Retinal Vessel Segmentation Using a Novel Plenary Attention Mechanism

Article

Full-text available

Jun 2024

Retinal vessel segmentation is critical for diagnosing related diseases in the medical field. However, the complex structure and variable size and shape of retinal vessels make segmentation challenging. To enhance feature extraction capabilities in existing algorithms, we propose PAM-UNet, a U-shaped network architecture incorporating a novel Plenary Attention Mechanism (PAM). In the BottleNeck stage of the network, PAM identifies key channels and embeds positional information, allowing spatial features within significant channels to receive more focus. We also propose a new regularization method, DropBlock_Diagonal, which discards diagonal regions of the feature map to prevent overfitting and enhance vessel feature learning. Within the decoder stage of the network, features from each stage are merged to enhance the segmentation accuracy of the final vessel. Experimental validation on two retinal image datasets, DRIVE and CHASE_DB1, shows that PAM-UNet achieves 97.15%, 83.16%, 98.45%, 83.15%, 98.66% and 97.64%, 85.82%, 98.46%, 82.56%, 98.95% on Acc, Se, Sp, F1, AUC, respectively, outperforming UNet and most other retinal vessel segmentation algorithms.

A Microvascular Segmentation Network Based on Pyramidal Attention Mechanism

Article

Full-text available

Jun 2024
SENSORS-BASEL

The precise segmentation of retinal vasculature is crucial for the early screening of various eye diseases, such as diabetic retinopathy and hypertensive retinopathy. Given the complex and variable overall structure of retinal vessels and their delicate, minute local features, the accurate extraction of fine vessels and edge pixels remains a technical challenge in the current research. To enhance the ability to extract thin vessels, this paper incorporates a pyramid channel attention module into a U-shaped network. This allows for more effective capture of information at different levels and increased attention to vessel-related channels, thereby improving model performance. Simultaneously, to prevent overfitting, this paper optimizes the standard convolutional block in the U-Net with the pre-activated residual discard convolution block, thus improving the model’s generalization ability. The model is evaluated on three benchmark retinal datasets: DRIVE, CHASE_DB1, and STARE. Experimental results demonstrate that, compared to the baseline model, the proposed model achieves improvements in sensitivity (Sen) scores of 7.12%, 9.65%, and 5.36% on these three datasets, respectively, proving its strong ability to extract fine vessels.

Efficient Implementation of AI Algorithms on an FPGA-based System for Enhancing Blood Vessel Segmentation

Preprint

Full-text available

Apr 2024

Machine learning offers the potential to enhance real-time image analysis in surgical operations. This paper presents results from the implementation of machine learning algorithms targeted for an intelligent image processing system comprising a custom CMOS image sensor and field programmable gate array (FPGA). A novel method is presented for efficient image segmentation and minimises energy usage and requires low memory resources, which makes it suitable for implementation. Using two eigenvalues of the enhanced Hessian image, simplified traditional machine learning and deep learning methods are employed to learn the prediction of blood vessels. Quantitative comparisons are provided between different machine learning models based on accuracy, resource utilisation, throughput, and power usage. It is shown how a gradient boosting decision tree with 1000 times fewer parameters can achieve comparable state-of-the-art performance whilst only using a much smaller proportion of the resources and producing a 200 MHz design that operates at 1,779 frames per second at 3.85 W, making it highly suitable for the proposed system. A methodology for implementing the AI algorithms onto FPGA is presented and then used to provide additional results by extending the original work to a 512 × 512 image size along with more detailed analysis.

FRD-Net: a full-resolution dilated convolution network for retinal vessel segmentation

Article

Full-text available

Apr 2024

Accurate and automated retinal vessel segmentation is essential for performing diagnosis and surgical planning of retinal diseases. However, conventional U-shaped networks often suffer from segmentation errors when dealing with fine and low-contrast blood vessels due to the loss of continuous resolution in the encoding stage and the inability to recover the lost information in the decoding stage. To address this issue, this paper introduces an effective full-resolution retinal vessel segmentation network, namely FRD-Net, which consists of two core components: the backbone network and the multi-scale feature fusion module (MFFM). The backbone network achieves horizontal and vertical expansion through the interaction mechanism of multi-resolution dilated convolutions while preserving the complete image resolution. In the backbone network, the effective application of dilated convolutions with varying dilation rates, coupled with the utilization of dilated residual modules for integrating multi-scale feature maps from adjacent stages, facilitates continuous learning of multi-scale features to enhance high-level contextual information. Moreover, MFFM further enhances segmentation by fusing deeper multi-scale features with the original image, facilitating edge detail recovery for accurate vessel segmentation. In tests on multiple classical datasets,compared to state-of-the-art segmentation algorithms, FRD-Net achieves superior performance and generalization with fewer model parameters.

EAMR-Net: A multiscale effective spatial and cross-channel attention network for retinal vessel segmentation

Article

Full-text available

Feb 2024
MATH BIOSCI ENG

Delineation of retinal vessels in fundus images is essential for detecting a range of eye disorders. An automated technique for vessel segmentation can assist clinicians and enhance the efficiency of the diagnostic process. Traditional methods fail to extract multiscale information, discard unnecessary information, and delineate thin vessels. In this paper, a novel residual U-Net architecture that incorporates multi-scale feature learning and effective attention is proposed to delineate the retinal vessels precisely. Since drop block regularization performs better than drop out in preventing overfitting, drop block was used in this study. A multi-scale feature learning module was added instead of a skip connection to learn multi-scale features. A novel effective attention block was proposed and integrated with the decoder block to obtain precise spatial and channel information. Experimental findings indicated that the proposed model exhibited outstanding performance in retinal vessel delineation. The sensitivities achieved for DRIVE, STARE, and CHASE_DB datasets were 0.8293, 0.8151 and 0.8084, respectively.

UC-Stack: a deep learning computer automatic detection system for diabetic retinopathy classification

Article

Full-text available

Feb 2024
PHYS MED BIOL

Object: The existing diagnostic paradigm for diabetic retinopathy (DR) greatly relies on subjective assessments by medical practitioners utilizing optical imaging, introducing susceptibility to individual interpretation. This work presents a novel system for the early detection and grading of DR, providing an automated alternative to the manual examination. Approach: First, we use advanced image preprocessing techniques, specifically contrast-limited adaptive histogram equalization and Gaussian filtering, with the goal of enhancing image quality and module learning capabilities. Second, a deep learning-based automatic detection system is developed. The system consists of a feature segmentation module, a deep learning feature extraction module, and an ensemble classification module. The feature segmentation module accomplishes vascular segmentation, the deep learning feature extraction module realizes the global feature and local feature extraction of retinopathy images, and the ensemble module performs the diagnosis and classification of DR for the extracted features. Lastly, nine performance evaluation metrics are applied to assess the quality of the model’s performance. Main results: Extensive experiments are conducted on four retinal image databases (APTOS 2019, Messidor, DDR, and EyePACS). The proposed method demonstrates promising performance in the binary and multi-classification tasks for DR, evaluated through nine indicators, including AUC and quadratic weighted Kappa score. The system shows the best performance in the comparison of three segmentation methods, two convolutional neural network architecture models, four Swin Transformer structures, and the latest literature methods. Significance: In contrast to existing methods, our system demonstrates superior performance across multiple indicators, enabling accurate screening of DR and providing valuable support to clinicians in the diagnostic process. Our automated approach minimizes the reliance on subjective assessments, contributing to more consistent and reliable DR evaluations.

Study on Limited Data Problem - Semantic Segmentation of Retinal Fundus Images

Article

Jan 2024

Medical images captured through various imaging modalities are key to the diagnosis and treatment of diseases, and these images serve as a major input to AI driven models but due to the scarcity of medical experts and their quality research time, the quantum of data required for AI models is limited. Moreover, medical image analysis is a laborious and error prone task. Early detection of retinal diseases by means of AI based semantic segmentation model has been boon to the diagnostic system. However, performing semantic segmentation with limited data of retinal fundus image is quite challenging. The present study is primarily focused on investigation of effectiveness of data augmentation with gamma corrected images to alleviate the problem of limited annotated data in deep learning models. The most widely accepted U-Net model in medical domain for image segmentation is trained employing retinal fundus image dataset augmented with gamma corrected images on all image channels and then tested against major publicly available datasets. The proposed method has outperformed other complex contemporary methods in terms of sensitivity and has also shown better generalizability across datasets from different institutions.

Level-set based adaptive-active contour segmentation technique with long short-term memory for diabetic retinopathy classification

Article

Full-text available

Dec 2023

Diabetic Retinopathy (DR) is a major type of eye defect that is caused by abnormalities in the blood vessels within the retinal tissue. Early detection by automatic approach using modern methodologies helps prevent consequences like vision loss. So, this research has developed an effective segmentation approach known as Level-set Based Adaptive-active Contour Segmentation (LBACS) to segment the images by improving the boundary conditions and detecting the edges using Level Set Method with Improved Boundary Indicator Function (LSMIBIF) and Adaptive-Active Counter Model (AACM). For evaluating the DR system, the information is collected from the publically available datasets named as Indian Diabetic Retinopathy Image Dataset (IDRiD) and Diabetic Retinopathy Database 1 (DIARETDB 1). Then the collected images are pre-processed using a Gaussian filter, edge detection sharpening, Contrast enhancement, and Luminosity enhancement to eliminate the noises/interferences, and data imbalance that exists in the available dataset. After that, the noise-free data are processed for segmentation by using the Level set-based active contour segmentation technique. Then, the segmented images are given to the feature extraction stage where Gray Level Co-occurrence Matrix (GLCM), Local ternary, and binary patterns are employed to extract the features from the segmented image. Finally, extracted features are given as input to the classification stage where Long Short-Term Memory (LSTM) is utilized to categorize various classes of DR. The result analysis evidently shows that the proposed LBACS-LSTM achieved better results in overall metrics. The accuracy of the proposed LBACS-LSTM for IDRiD and DIARETDB 1 datasets is 99.43% and 97.39%, respectively which is comparably higher than the existing approaches such as Three-dimensional semantic model, Delimiting Segmentation Approach Using Knowledge Learning (DSA-KL), K-Nearest Neighbor (KNN), Computer aided method and Chronological Tunicate Swarm Algorithm with Stacked Auto Encoder (CTSA-SAE).

Multi-Layer Preprocessing and U-Net with Residual Attention Block for Retinal Blood Vessel Segmentation

Article

Full-text available

Nov 2023

Retinal blood vessel segmentation is a valuable tool for clinicians to diagnose conditions such as atherosclerosis, glaucoma, and age-related macular degeneration. This paper presents a new framework for segmenting blood vessels in retinal images. The framework has two stages: a multi-layer preprocessing stage and a subsequent segmentation stage employing a U-Net with a multi-residual attention block. The multi-layer preprocessing stage has three steps. The first step is noise reduction, employing a U-shaped convolutional neural network with matrix factorization (CNN with MF) and detailed U-shaped U-Net (D_U-Net) to minimize image noise, culminating in the selection of the most suitable image based on the PSNR and SSIM values. The second step is dynamic data imputation, utilizing multiple models for the purpose of filling in missing data. The third step is data augmentation through the utilization of a latent diffusion model (LDM) to expand the training dataset size. The second stage of the framework is segmentation, where the U-Nets with a multi-residual attention block are used to segment the retinal images after they have been preprocessed and noise has been removed. The experiments show that the framework is effective at segmenting retinal blood vessels. It achieved Dice scores of 95.32, accuracy of 93.56, precision of 95.68, and recall of 95.45. It also achieved efficient results in removing noise using CNN with matrix factorization (MF) and D-U-NET according to values of PSNR and SSIM for (0.1, 0.25, 0.5, and 0.75) levels of noise. The LDM achieved an inception score of 13.6 and an FID of 46.2 in the augmentation step.

LMBiS-Net: A Lightweight Multipath Bidirectional Skip Connection based CNN for Retinal Blood Vessel Segmentation

Preprint

Full-text available

Sep 2023

Blinding eye diseases are often correlated with altered retinal morphology, which can be clinically identified by segmenting retinal structures in fundus images. However, current methodologies often fall short in accurately segmenting delicate vessels. Although deep learning has shown promise in medical image segmentation, its reliance on repeated convolution and pooling operations can hinder the representation of edge information, ultimately limiting overall segmentation accuracy. In this paper, we propose a lightweight pixel-level CNN named LMBiS-Net for the segmentation of retinal vessels with an exceptionally low number of learnable parameters \textbf{(only 0.172 M)}. The network used multipath feature extraction blocks and incorporates bidirectional skip connections for the information flow between the encoder and decoder. Additionally, we have optimized the efficiency of the model by carefully selecting the number of filters to avoid filter overlap. This optimization significantly reduces training time and enhances computational efficiency. To assess the robustness and generalizability of LMBiS-Net, we performed comprehensive evaluations on various aspects of retinal images. Specifically, the model was subjected to rigorous tests to accurately segment retinal vessels, which play a vital role in ophthalmological diagnosis and treatment. By focusing on the retinal blood vessels, we were able to thoroughly analyze the performance and effectiveness of the LMBiS-Net model. The results of our tests demonstrate that LMBiS-Net is not only robust and generalizable but also capable of maintaining high levels of segmentation accuracy. These characteristics highlight the potential of LMBiS-Net as an efficient tool for high-speed and accurate segmentation of retinal images in various clinical applications.

The layout diagram for U-Net architecture.

Context in source publication

Citations