Introduction

A brain tumor is an abnormal growth of tissues that appears in the brain and can affect its function. The number of new brain-tumor cases in the United States in 2019 was 23,820, and there were an estimated 17,760 deaths from the condition in that year, according to the American Cancer Society’s Cancer Statistics Center1,2. Furthermore, the National Brain Tumor Foundation has announced that the number of individuals in advanced nations who die because of a cerebral tumor in recent decades has increased by 300%3. Gliomas are the best-known kind of brain tumor, and these can be divided into high- and low-grade gliomas (HGGs and LGGs); HGGs tend to result in a 2-year life expectancy, whereas people with LGGs can have a life expectancy of several years or more. The most common brain-tumor treatment methods that are applied to reduce tumor growth are surgery, chemotherapy, and radiotherapy4.

Magnetic resonance imaging (MRI) is a high-quality imaging technique that can provide substantial information about brain tissue; as such, it has been widely used for automatic tumor diagnosis5. MRI can provide highly detailed images of the features of brain tumors and, consequently, can make additional therapeutic options available to a patient. The technique can also give information about the physiology, metabolism, and hemodynamics of certain tumors. MRI scans are ideal for soft-tissue imaging. Because of the prevalence of brain tumors, a large amount of brain-tumor MRI data is generated; developing an automated brain-tumor diagnostic system with acceptable performance is thus critical6. The use of a computer-aided diagnosis (CADx) system is essential for detecting brain tumors quickly and without the need for human interaction. The treatment for a brain tumor will vary depending on the type of tumor and its size and location. Brain-tumor classification and segmentation are thus crucial tasks for diagnosing tumors, determining treatment decisions, and increasing the likelihood of recovery7,8.

Automatic brain-tumor detection using MRI scans can significantly improve diagnosis, therapy, and growth-rate prediction. Automatic tumor diagnosis includes a pipeline of three processes: detection, segmentation, and classification. Detection aims to classify MRI images with respect to the presence or absence of a tumor into abnormal and normal images, respectively9. Segmentation aims to recognize the tumor zone and delineate the boundaries of its regions: necrotic tissue, active tumor tissue, and edema (swelling near the tumor). Classification aims to categorize MRI images of gliomas into HGGs and LGGs10.

Segmentation is achieved by identifying regions that appear different from ordinary tissue. While some tumors—for example, meningiomas—are easily segmented, others—such as gliomas—are fundamentally harder to segment. These tumors tend to have edema and extended limb-like structures, are very often diffuse, and provide limited image contrast, which makes the segmentation procedure difficult. Their boundaries are frequently hazy and difficult to distinguish from healthy tissues. Moreover, they can appear in any region of the brain and can have a range of different shapes and sizes11. As a result, the problem of brain-tumor diagnosis can be seen as a difficult image-classification task.

In general, machine learning, and particularly deep-learning algorithms, can greatly assist with the detection, segmentation, classification, and registration of brain tumors. Recently, deep-learning techniques have gained greater research attention12,13. Deep-learning methods can be trained in their convolution layers using either unsupervised or supervised training methods14. The findings of recent studies have shown that deep-learning approaches outperform traditional learning methods in brain-tumor diagnosis15, and several systems have been designed for brain-tumor detection, classification, and segmentation based on deep learning.

Because of the high incidence of brain tumors, a vast amount of MRI data has been collected. Gliomas, with their irregular shapes and ambiguous boundaries, are the most challenging tumors to identify. The detection and classification of brain tumors are critical steps that are dependent on a diagnosing physician’s expertise and knowledge. Although various studies have focused on the use of deep-learning approaches to brain-tumor diagnosis, no comprehensive system for automatic tumor detection, classification, and segmentation is currently available, and a complete technique for automatic brain-tumor diagnosis has not yet been published in the literature. Furthermore, accurately integrating tumor segmentation, classification, and detection inside a single system is still an open problem. The presence of the brain-tumor detection and classification phases before tumor segmentation in a single system will result in normal images being excluded from the segmentation phase. This allows for real-time deployment of automatic tumor-diagnosis systems, saving time and computing power that would otherwise be used in attempting to identify tumors in normal images. An intelligent strategy for detecting and classifying brain tumors is essential for supporting clinicians, and it could be a beneficial tool in hospital emergency rooms when examining the MRI scans of patients, since it would allow for speedier diagnosis. For example, such a system could help doctors by examining MRI images before processing to identify whether they are normal or contain an HGG or LGG. Once an image has been automatically identified as abnormal, a physician can easily detect and segment the brain tumor to estimate its size using the segmentation stage.

In this study, the aforementioned challenges have been addressed by developing an automatic CADx system for brain-tumor detection, classification, and segmentation. In the first stage, we introduce a convolutional neural network (CNN) architecture for detection and classification. In the second stage, a CNN architecture for segmentation is introduced. These systems use state-of-the-art CNN architectural components and training procedures, including the U-Net, residual units, batch normalization (BN), dropout regularization, the parametric rectified linear unit (PReLU), and skip connections. Additionally, both the contexts and local shapes of tumors are taken into consideration. The proposed technique overcomes the issue of performing pixel-wise labeling without considering the local dependencies of labels.

The proposed system’s accuracy was assessed using MRI images derived from a brain-tumor segmentation database. In addition to improving the overall structure of the system, the major research contributions can be summarized as follows.

  1. A new and fully automatic brain-tumor diagnosis structure has been built by combining the detection and classification phases prior to segmentation, saving time and computing power. This can support clinicians and be a beneficial tool for real-time deployment in hospital emergency rooms.

  2. A new detection and classification CNN has been constructed by combining two parallel paths, using residual units, and selecting the best values of convolutional-layer filters to help extract more features.

  3. The proposed segmentation model has been designed using two asymmetric parallel paths with two U-Net architectures in series and a residual encoder and decoder for brain-tumor segmentation.

  4. Both local and global features are considered, including local details of the brain, to improve the detection, classification, and segmentation performances.

  5. The use of feature fusion further enhances the extraction of multi-scale features.

  6. The proposed brain-tumor diagnosis system was optimized in several respects, including in terms of the accuracy and speed of diagnosis.

  7. Comparisons were made among the results of different experiments to obtain a model with the highest accuracy and Dice score, resulting in a model for brain-tumor detection, classification, and segmentation that is superior to any previously published.

The remainder of this paper is structured as follows. The next section ("Related work") explores works related to the detection, classification, and segmentation of brain tumors. The structure of the proposed system is then presented in "Materials and methods". Subsequently, the implementation and performance-evaluation processes are described in "Simulations and evaluation criteria". A discussion of the outcomes and research findings is then given in "Results and discussion". Finally, the conclusions of the research are presented in "Conclusions and future work".

Related work

Over recent decades, the diagnosis of brain tumors has gained considerable research interest, and many diagnosis techniques have been introduced. The performance of tumor diagnosis has been improved by applying several different automatic brain-tumor detection techniques. In these approaches, machine learning is crucial for the identification, classification, and segmentation of brain tumors, and several machine-learning algorithms have recently been developed to this end, as summarized in Table 1.

El-Dahshan et al.16 used the discrete wavelet transform (DWT) as a feature extractor, principal component analysis (PCA) for feature minimization and selection, and a three-layer artificial neural network (ANN) for the actual tumor detection. Abd-Ellah et al.3 combined morphological filters, the DWT, PCA, and a kernel support vector machine (KSVM) for brain-tumor detection from MRI images. Using a twofold classifier, the authors of Ref.17 categorized images as benign (noncancerous) or malignant (cancerous). Using the DWT as a feature extractor, Zhang et al.18 detected brain tumors in MRI images; the features were reduced from 65,536 to 1024 using a three-level decomposition with Haar wavelets, and the reduced characteristics were then fed into a back-propagation neural-network classifier. Devasena et al.19 demonstrated a CADx system for MRI-based tumor diagnosis that employs a hybrid abnormal detection algorithm.

Patil et al.20 applied a probabilistic neural network (PNN) for feature extraction to detect brain tumors; the k-nearest neighbors (k-NN) algorithm, an ANN, and an SVM have also been applied to detect and recognize various types of tumor21. Goswami et al.22 also proposed a brain-tumor classification system using MRI images. Noise filtering, edge detection, and histogram equalization are used in the preprocessing stage of their system; then, independent component analysis is applied for feature extraction, and classification is performed using a self-organized map. Deepa and Devi23 used a combination of feature extraction, segmentation, and tumor classification to diagnose brain tumors. In their approach, optimal texture characteristics are extracted using statistical features, and a radial basis function neural network and a back-propagation neural network are used in the segmentation and classification stages, respectively.

Sarith et al.24 presented a technique for detecting brain tumors from MRI images that uses wavelet entropy-based spiderweb plots for feature extraction and a PNN for tumor identification. Yang et al.25 suggested a technique using 2D-DWT and Haar-wavelet feature extraction for early brain-tumor detection from MRI images with a KSVM as a classifier. Kalbkhani et al.26 also used 2D-DWT and modeled the sub-bands of detail coefficients using a generalized autoregressive conditional heteroscedasticity model. Linear discriminant analysis was used to extract 61,440 features, which were then reduced to 24 using PCA; the k-NN and SVM methods were employed for the actual tumor detection. The main goal of Mudda et al.27 was to detect whether a brain has a tumor or is healthy; gray-level run-length matrix (GLRLM) texture characteristics were employed to extract features for efficient brain-tumor diagnosis using neural-network methods. In 2023, Asiri et al.28 employed six machine-learning methods for brain-tumor detection: SVM, neural networks, random forest (RF), CN2 rule induction (CN2), naive Bayes (NB), and decision trees. The SVM was found to surpass the other methods, achieving 95.3% accuracy.

Abd-Ellah et al.5 created a deep-learning technique for detecting and classifying brain tumors from MRI images. As a feature extractor, they employed the AlexNet CNN, and at the classification step, an error-correcting output codes SVM was applied. For feature extraction and selection, Heba et al.29 used the DWT with PCA; a seven-layer deep neural network (DNN) was used to classify the retrieved characteristics. Tazin et al.30 employed a CNN and a transfer-learning technique to determine whether or not a brain tumor was present in X-ray images; VGG19, InceptionV3, and MobileNetV2 were used for deep feature extraction. MobileNetV2 was found to be 92% accurate, InceptionV3 was 91% accurate, and VGG19 was 88% accurate. Alsaif et al.31 applied multiple CNN models based on data augmentation to identify brain tumors using MRI images. VGG16, VGG19, ResNet-50, ResNet-101, Inception-V3, and DenseNet121 were applied to distinguish between normal samples and those containing a tumor. The best accuracy, 96%, was obtained with VGG16.

Techniques for semi-automatic and automated brain-tumor segmentation can be broadly split into discriminative and generative models32. Brain-tumor segmentation approaches based on generative models require prior knowledge of the shape, size, and appearance of both tumor and normal tissues, which may be obtained from probabilistic image atlases33,34; in these approaches, tumor segmentation is modeled as a detection problem over a probabilistic image. A generative framework for tumor segmentation using outlier detection in MRI is presented in Ref.35. Based on a probabilistic technique, Prastawa et al.36 initialized active contours on a brain atlas and iterated until the probability was below a specific threshold. Different methods based on active contours have been presented37,38. Because aligning a large brain tumor onto a model is a challenging task, some techniques apply registration alongside tumor segmentation, as in Ref.39.

Brain-tumor segmentation methods based on discriminative models use the extraction of image features to classify image voxels as normal or tumor tissues. The performance of a discriminative model depends on the extracted features and classification techniques. Different image features have been considered in brain-tumor-segmentation techniques, including image textures40,41, local histograms42, alignment-based features such as symmetry analysis, region-shape differences, inter-image gradients, tensor eigenvalues43, raw-input pixel values44,45, and discriminative-learning techniques such as decision forests42,43 and SVMs46,47.

Techniques based on deep learning have arisen as an effective alternative to traditional machine learning, as they have impressive capacity to learn discriminative features, outperforming models using pre-defined and hand-crafted features. More recently, deep-learning techniques have achieved success in general image-analysis studies, including object detection48, image classification49, and semantic segmentation50,51,52, and this success has led the approach to be applied to brain-tumor segmentation.

Specifically, CNNs were used in the BraTS 2014 challenge as a promising approach to segmenting brain tumors53,54,55. Additional brain-tumor-segmentation techniques using deep learning were introduced for BraTS 2015, and various deep-learning methods were presented, such as stacked denoising autoencoders, convolutional restricted Boltzmann machines, and CNNs56,57,58,59,60. Deep-learning brain-tumor segmentation methods that build upon CNNs have been found to achieve the best performance. In particular, both 2D-CNNs4,53,54,58,59,60,61,62,63 and 3D-CNNs55,64,65 have been adopted to develop brain-tumor-segmentation techniques.

Wang et al.66 presented a modality-pairing learning approach for segmenting brain tumors. Two parallel branches were created to leverage the properties of various modalities, and several layer interconnections were used to capture complicated interactions and extract a wealth of information. To reduce the prediction variation between the branches, they applied a consistency loss. Furthermore, a learning-rate warmup technique was used to deal with the problem of training instability. The authors of Ref.67 introduced a cross-modality deep-feature-learning approach for segmenting brain tumors. This system was made up of two procedures: cross-modality feature fusion (CMFF) and cross-modality feature transition (CMFT); they used the BraTS 2017 and 2018 datasets. Remya et al.68 demonstrated tumor segmentation based on a fuzzy c-means (FCM) technique and an improved noise-filtering algorithm. They modified the noise filter to isolate the correct tumor region, and performance was improved by upgrading the threshold function. Following filtering, segmentation was performed using Otsu's method and the FCM approach. The Jaccard and Dice coefficients were 0.5304 and 0.6893, respectively, according to the findings.

Most of these methods were trained using small image patches to classify images into various classes, and the classified patches are used to label their center pixels to achieve segmentation. However, most of these methods assume that each voxel's label is independent of its neighbors, and they do not consider spatial consistency and appearance. Havaei et al.4 constructed a cascaded two-pathway architecture, and they provided the probabilities of pixel-wise segmentation results acquired from the first CNN architecture as an extra input to the second CNN to take into account the local dependencies of labels. Additionally, spatial consistency and appearance can be taken into consideration, as in Ref.64, and conditional random fields (CRFs) and Markov random fields can be combined with CNN segmentation methods as a post-processing step or formulated as neural networks, as in Refs.51,52. Myronenko69 proposed a semantic segmentation network to segment tumor subregions from 3D MRI images using an encoder–decoder architecture. In this system, an autoencoder branch reconstructs the input image, regularizing the shared encoder and imposing additional constraints on its layers. This approach won first place in the BraTS 2018 challenge.

Fritscher et al.70 presented a network architecture that feeds 3D input patches to three convolutional pathways in the coronal, sagittal, and axial views, which are merged by fully connected layers. Pereira et al.61 proposed a tumor-segmentation technique that applies intensity normalization in the preprocessing stage. Later, they presented a hierarchical study based on a fully convolutional network (FCN) and histograms using MRI images62. Zhao et al.71 combined CRFs and an FCN to segment brain tumors in three training stages. Dong et al.72 proposed a fully automatic 2D method using a U-Net approach with comprehensive data augmentation, and a 3D U-Net network was applied for tumor segmentation, providing shortcut connections between the upsampling and downsampling paths. Various additional approaches for detecting, segmenting, and classifying brain tumors are outlined in Ref.73.

Table 1 Comparison of different classification and segmentation methods from the literature.
Figure 1

Flowchart depicting the two phases and illustrating the two distinct types of CNN employed. The two-pathway with residual-based deep convolutional neural network (TRDCNN) architecture is used in the detection and classification phase, and the two parallel cascaded U-Nets with an asymmetric residual (TPCUAR-Net) architecture is employed in the segmentation phase.

Materials and methods

As outlined in the previous section, deep-learning methods have attracted considerable research interest because they provide the ability to process a huge number of MRI images with great efficiency5. The proposed diagnosis system has two stages. The first stage focuses on tumor detection and classification by categorizing images into normal, HGG, and LGG. The second stage converts tumor segmentation into a classification problem. Figure 1 shows a flowchart of the developed system. MRI images are provided to the architecture’s input, and preprocessing is applied. Then, a CNN is used to extract and classify the features of each MRI image to determine if it is normal or abnormal. To achieve accurate results, the parallel architecture of the CNN is first trained, and a second CNN is applied in the second-stage segmentation. The system was evaluated using the Reference Image Database to Evaluate Therapy Response (RIDER) and the BraTS 2017 dataset as standard reference MRI datasets.

Preprocessing stage

Image preprocessing is vital in diagnostic tasks, as it enhances the quality of images and prepares them for precise and efficient diagnosis. The BraTS 2017 database has 3D MRI volumes with various spacings in the three dimensions. In the present work, detection, classification, and segmentation were applied to slices with various image modalities (FLAIR, T2, T1, and T1C). In the preprocessing stage of the proposed approach, each volume is cropped to remove the unwanted background, which saves computational power. Then, each slice (excluding the ground truth) is normalized by subtracting the mean and dividing by the standard deviation. After that, the top and bottom one percent of pixel intensities are clipped. All normal slices are ignored in the segmentation stage. The axes are swapped so that the modalities lie along axis 0 and the slices along axis 1, and all slices are randomly shuffled.
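The following is a minimal NumPy sketch of the per-slice normalization and clipping described above; the function name, the small epsilon term, and the use of the 1st/99th percentiles as the one-percent clipping bounds are illustrative assumptions rather than the exact implementation.

```python
import numpy as np

def preprocess_slice(slice_2d):
    """Illustrative per-slice preprocessing: z-score normalization followed by
    clipping of the top and bottom one percent of intensities (assumed here to
    mean the 1st and 99th percentiles)."""
    x = slice_2d.astype(np.float32)
    x = (x - x.mean()) / (x.std() + 1e-8)   # subtract the mean, divide by the std
    lo, hi = np.percentile(x, [1, 99])      # assumed clipping bounds
    return np.clip(x, lo, hi)
```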

In the first stage, the slices are resized to \(256 \times 256\) because of the variation in MRI slice size. In the segmentation stage, patches of size \(128 \times 128\) are generated; patches are randomly shuffled and randomly selected. By rescaling or resizing the images to a standardized format, we can ensure consistent performance of the diagnosis algorithm across different images; resizing also reduces computational complexity. Data augmentation can play a vital role in diagnostic tasks by generating additional training samples through the application of various transformations to an existing dataset. By introducing variations in the training data, data augmentation enhances a model's resilience and ability to handle a wide range of scenarios. Samples of preprocessed images are shown in Fig. 2.
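The exact augmentation transformations are not specified here, so the Keras sketch below only illustrates the kind of pipeline meant by the paragraph above; the transformation types and their ranges are assumptions.

```python
from tensorflow.keras import layers, models

# Hypothetical augmentation pipeline; the transformations and ranges are
# assumptions, not the settings used in the published experiments.
augment = models.Sequential([
    layers.RandomFlip("horizontal"),
    layers.RandomRotation(0.05),            # up to roughly 18 degrees
    layers.RandomTranslation(0.05, 0.05),   # small random shifts
])

# Example usage: augmented = augment(batch_of_slices, training=True)
```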

Figure 2

Samples of preprocessed images.

The main CNN architectures

Convolutional layers

The convolutional layers are the fundamental building blocks of the CNN architecture; they provide feature maps from input maps, except for the first convolutional layer, for which the input is taken directly from an input image. The feature map \(M_{s}\) is computed as in Eq. (1):

$$\begin{aligned} M_{s}= b_{s} + \sum _{r} W_{sr} * X_{r} \end{aligned}$$
(1)

where \(W_{sr}\) denotes the input-channel sub-kernel, \(X_r\) is the rth input channel, and \(b_s\) is a bias term. The convolutional layers separately learn the biases and weights of each feature map, acting as data-driven, customized, and task-specific dense feature extractors. A nonlinear activation function is then applied to the result.
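As an illustration of Eq. (1), the sketch below sums the per-channel correlations of a multi-channel input with the corresponding sub-kernels and adds the bias; it uses cross-correlation, which is how deep-learning frameworks typically implement the convolution operator, and the shapes and example values are arbitrary.

```python
import numpy as np
from scipy.signal import correlate2d

def conv_feature_map(X, W, b):
    """Eq. (1) for one output map: M_s = b_s + sum_r W_sr * X_r.
    X: (R, H, W) input channels; W: (R, k, k) sub-kernels; b: scalar bias."""
    return b + sum(correlate2d(X[r], W[r], mode="valid") for r in range(X.shape[0]))

# Tiny example with 2 input channels and a 3x3 kernel (values are arbitrary).
X = np.random.rand(2, 8, 8)
W = np.random.rand(2, 3, 3)
print(conv_feature_map(X, W, b=0.1).shape)   # (6, 6)
```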

Rectified linear unit (ReLU) and parametric ReLU (PReLU) layers

ReLU activation functions are frequently used for the hidden layers. When the input is greater than 0, the output y is the same as the input x; otherwise, the output is zero, as in Eq. (2):

$$\begin{aligned} y = \max (x,0) \end{aligned}$$
(2)

PReLU introduces a small number of learnable parameters \(\alpha\), which increases accuracy80, as in Eq. (3):

$$\begin{aligned} y = \max (x,0) + \alpha \min (x,0) \end{aligned}$$
(3)
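A direct NumPy rendering of Eqs. (2) and (3) follows; the default value of \(\alpha\) is only a placeholder, since in a PReLU layer it is learned during training.

```python
import numpy as np

def relu(x):
    # Eq. (2): keep positive inputs, zero out the rest
    return np.maximum(x, 0)

def prelu(x, alpha=0.25):
    # Eq. (3): negative inputs are scaled by a learnable parameter alpha
    return np.maximum(x, 0) + alpha * np.minimum(x, 0)
```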

Max-pooling layers

Max-pooling minimizes the input dimension by down-sampling; the number of learned parameters is thus decreased, reducing cost, improving performance, and mitigating overfitting. A non-overlapping max filter is applied to subregions, taking each \(N \times N\) region and producing a single value5.

Residual blocks

Residual blocks have skip connections that enable information to be transmitted both forward and backward directly between layers as in Eqs. (4)–(5):

$$\begin{aligned} y_{i}= & {} h(m_{i})+f(m_{i},W_{i}) \end{aligned}$$
(4)
$$\begin{aligned} m_{i+1}= & {} f(y_{i}) \end{aligned}$$
(5)

where \(m_{i}\) is the input to the ith block and \(m_{i+1}\) is its output, \(W_{i}\) is the set of weights, \(h(m_{i})=m_{i}\) denotes attaching an identity skip connection, and \(m_{i+1}=y_{i}\) when f is an identity81.

A residual block stacks several layers: a convolutional layer, PReLU, and BN, which normalizes its input; PReLU is then attached again and followed by another convolutional layer, and these layers are repeated. The block's input and output are summed, generating a direct connection. The residual decoding block additionally includes a convolutional layer in the direct route, as shown in the bottom left of Fig. 5.
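Below is a hedged Keras sketch of the two residual blocks described above. The layer ordering follows the text loosely, and the filter count, kernel size, and the assumption that the encoding block's input already has `filters` channels are placeholders, not the published configuration.

```python
from tensorflow.keras import layers

def residual_enc_block(x, filters=64, kernel_size=3):
    """Residual encoding block sketch: convolution/BN/PReLU stack repeated,
    with an identity skip connection. Assumes `x` already has `filters` channels."""
    shortcut = x
    y = layers.Conv2D(filters, kernel_size, padding="same")(x)
    y = layers.BatchNormalization()(y)
    y = layers.PReLU(shared_axes=[1, 2])(y)
    y = layers.Conv2D(filters, kernel_size, padding="same")(y)
    y = layers.BatchNormalization()(y)
    y = layers.PReLU(shared_axes=[1, 2])(y)
    return layers.Add()([shortcut, y])        # identity skip connection

def residual_dec_block(x, filters=64, kernel_size=3):
    """Residual decoding block sketch: the direct (skip) route also passes
    through a convolution, as noted in the text."""
    shortcut = layers.Conv2D(filters, 1, padding="same")(x)
    y = layers.Conv2D(filters, kernel_size, padding="same")(x)
    y = layers.BatchNormalization()(y)
    y = layers.PReLU(shared_axes=[1, 2])(y)
    y = layers.Conv2D(filters, kernel_size, padding="same")(y)
    return layers.Add()([shortcut, y])
```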

Regularization and loss function

The cross-entropy loss is applied to assess the performance of the classification stage. It is computed from the predicted label probability as in Eq. (6):

$$\begin{aligned} L_{\textrm{ce}}(H, \hat{H})=H\log \left( \dfrac{H}{\hat{H}}\right) +(1-H)\log \left( \dfrac{1-H}{1-\hat{H}}\right) , \end{aligned}$$
(6)

where H is the desired output, \(L_{\textrm{ce}}(H, \hat{H})\) is the cross-entropy error, and \(\hat{H}\) is the predicted output82.

BN is used to regularize the network by normalizing the provided activations, as in Eq. (7):

$$\begin{aligned} \hat{S}= \dfrac{S-\textrm{E}[S]}{\sqrt{\textrm{Var}[S] }} \end{aligned}$$
(7)

where S, \(\hat{S}\), \(\textrm{Var}[S]\), and E[S] are the input layer, the normalized activations, the unbiased variance estimate, and the expectation value, respectively83. BN allows the CNN to be trained with stable gradients, smoothing the optimization landscape, making weight initialization easier, and reaching optimal values more rapidly with more viable activation functions. The weighted loss and regularization function reduce the problem of becoming stuck in local minima and enhance the performance of the model. Another regularization technique is dropout, which randomly ignores selected neurons during training, preventing their weights from updating62.

Figure 3

Proposed TRDCNN structure for tumor detection and classification.

Softmax layer

Softmax is a widely used layer in CNNs for multi-class classification tasks. It is normally positioned at the end of the network and generates a probability distribution over the classes. The softmax layer normalizes the outputs of the preceding layer to indicate the probability of each class. The softmax function calculates the probability of each class by exponentiating its input score and dividing it by the sum of all exponentiated values, as in Eq. (8):

$$\begin{aligned} Y_{j}(x)= \dfrac{\exp {x_{j}}}{\sum _{i}\exp {x_{i}} } \end{aligned}$$
(8)

where \(Y_{j}(x)\) is the probability that the input instance belongs to class j, and \(x_{j}\) is the jth element of the input vector x.
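A numerically stable NumPy version of Eq. (8) is sketched below; subtracting the maximum score before exponentiation is a standard implementation detail, not part of the equation.

```python
import numpy as np

def softmax(x):
    # Eq. (8): exponentiate each score and normalize by the sum of exponentials.
    e = np.exp(x - np.max(x))   # max-subtraction for numerical stability
    return e / e.sum()

print(softmax(np.array([2.0, 1.0, 0.1])))   # approximately [0.66, 0.24, 0.10]
```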

The first stage: the proposed brain glioma detection and classification architecture

In this work, we examined different architectures by applying feature-map concatenation between different numbers of layers when composing CNNs. This operation produced various architecture designs with different computational routes. We now present the best architecture that was found during this exploration process.

Two-pathway with residual-based deep convolutional neural network architecture (TRDCNN)

Detection and classification of glioblastomas are conducted using the proposed TRDCNN. A schematic of the proposed structure is provided in Fig. 3. The input is MRI images; as noted, these are subjected to preprocessing to decrease the computational complexity and to speed up processing. The proposed structure comprises four paths: the global, local, merged, and output paths. The local and global paths have their own responsibilities in feature extraction; each begins with a convolutional (CONV) layer, a ReLU layer, and a max-pooling layer, followed by six stages of residual blocks with max-pooling. The first CONV layer uses a small filter of \(5 \times 5\) pixels in the local path and a large filter of \(12 \times 12\) pixels in the global path. The global and local paths are connected in parallel, and they are then combined into a merged path that includes BN, ReLU, and fully connected layers, followed by a dropout layer. The final path is the output path, which contains a classification layer with a softmax function; this classifies features into normal and glioblastoma (HGG or LGG) images.
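The following Keras functional-API sketch compresses the two-pathway idea described above: a local path whose first convolution uses \(5 \times 5\) filters and a global path whose first convolution uses \(12 \times 12\) filters, merged and classified into three classes (normal, HGG, LGG). The number of residual stages, filter counts, and dense-layer sizes are placeholders (the paper uses six residual stages per path), and the sketch reuses `residual_enc_block` from the residual-block sketch above.

```python
from tensorflow.keras import layers, Model

def pathway(x, first_kernel, stages=2, filters=32):
    # First CONV + ReLU + max-pooling, then residual stages with pooling.
    # `stages` and `filters` are placeholders for the published configuration.
    y = layers.Conv2D(filters, first_kernel, padding="same", activation="relu")(x)
    y = layers.MaxPooling2D()(y)
    for _ in range(stages):
        y = layers.Conv2D(filters, 1, padding="same")(y)   # match channels for the skip
        y = residual_enc_block(y, filters)
        y = layers.MaxPooling2D()(y)
    return y

inputs = layers.Input(shape=(256, 256, 1))
local_path = pathway(inputs, first_kernel=5)     # small filters: local detail
global_path = pathway(inputs, first_kernel=12)   # large filters: global context
merged = layers.Concatenate()([local_path, global_path])
merged = layers.BatchNormalization()(merged)
merged = layers.Activation("relu")(merged)
merged = layers.Flatten()(merged)
merged = layers.Dense(128, activation="relu")(merged)    # fully connected layer
merged = layers.Dropout(0.5)(merged)
outputs = layers.Dense(3, activation="softmax")(merged)  # normal, HGG, LGG
trdcnn_sketch = Model(inputs, outputs)
```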

The second stage: brain-tumor segmentation architectures

A CNN usually has a huge number of parameters; patch-based training can be applied to train a DCNN with enough samples58. As such, the segmentation problem can be handled as a classification problem. Image patches are selected regions that describe a central pixel, and they are labeled with the label of their center pixel. Millions of image patches can be generated to train a CNN. In the testing process, all the extracted patches are classified by the trained network, and the results are assembled into a segmented image. In this setting, CNN techniques segment images slice by slice4. Furthermore, the locations of training patches and the number for each class are controlled by changing the patch-selection scheme.
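A minimal sketch of the patch-based labeling idea described above: square patches are sampled from a slice and each is labeled with the class of its center pixel. The \(128 \times 128\) patch size matches the text, while the function name and the uniform random sampling are simplifications of the actual patch-selection scheme.

```python
import numpy as np

def sample_patches(image, label_map, patch_size=128, n_patches=100, rng=None):
    """Sample square patches and label each with the class of its center pixel."""
    rng = rng or np.random.default_rng()
    half = patch_size // 2
    h, w = image.shape[:2]
    patches, labels = [], []
    for _ in range(n_patches):
        cy = int(rng.integers(half, h - half))
        cx = int(rng.integers(half, w - half))
        patches.append(image[cy - half:cy + half, cx - half:cx + half])
        labels.append(label_map[cy, cx])        # center-pixel label
    return np.stack(patches), np.array(labels)
```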

Figure 4

Left to right: the four MRI modalities—T1, T2, T1-C, and FLAIR—and the ground truth. The latter shows the following regions: enhancing tumor, necrosis, edema, and non-enhancing tumor. This is taken from the authors’ own work (Ref.2).

Two parallel cascaded U-Nets with an asymmetric residual (TPCUAR-Net)

Figure 5 shows the TPCUAR-Net architecture. The input to this structure is an MRI image formed by merging the different MRI modalities (Fig. 4). The noise is reduced and image quality is improved by preprocessing. The structure has four paths: the upper, lower, merged, and output paths. The upper and lower paths each comprise two U-Nets in series with various depths, which are used in feature extraction to elicit the local and global features. These include up- and down-sampling processes and two different residual blocks, called Residual Enc. and Residual Dec., as in Fig. 5. The convolutional layer has a two-step stride in the down-sampling path and a one-step stride in the up-sampling path. The merged path combines the upper and lower paths in parallel form; it comprises concatenation, BN, PReLU, and convolutional layers. The concatenation layer receives a group of inputs with the same shape and combines them, providing one path. The output path produces an image segmented with a softmax function.

Figure 5

TPCUAR-Net architecture for brain-tumor segmentation.

Simulations and evaluation criteria

This section describes simulations conducted to evaluate the proposed system using several different metrics. The approach was developed using Jupyter Notebook, and the Keras and TensorFlow toolkits were used to code the suggested architecture. The computer used for this task had a 3.2 GHz Intel Core i7 processor and 24 GB of RAM, and it ran Windows 7 64-bit as its operating system.

Figure 6

Sample images used for the experimental work. From top to bottom, the rows show normal, HGG, and LGG brain images.

Imaging data

The images used in this manuscript were extracted from the BraTS 2017 dataset. This provides images from 285 glioma patients, 210 with HGG and 75 with LGG. The data were scanned using different clinical protocols and scanners from different institutions, including Heidelberg University, the University of Alabama, the University of Bern, the National Institutes of Health, the University of Debrecen, and The Center for Biomedical Image Computing and Analytics. Multimodal scans are available for each patient, including T1, T1ce, T2, and FLAIR volumes32.

The data used during BraTS 2014, 2015, and 2016 (from The Cancer Imaging Archive) were discarded, as their ground-truth labels were generated from the highest-ranking methods during BraTS 2012 and 2013. However, expert radiologists were involved in BraTS 2017, which led us to use these data instead of other BraTS datasets. The training and testing datasets are listed in Table 2, and Fig. 6 presents a sample of the MRI images used in the investigation.

Table 2 Training and testing dataset distributions.

Training and testing

The proposed model was trained using the image database for 100 epochs using the SGD optimizer and a learning rate of 0.08. The batch size was 16, and the error loss was calculated using the mean squared error. All details of the training and testing parameters are listed in Table 3. Typically, a grid or random search is performed in hyperparameter space, followed by training the proposed models for a predetermined number of epochs for each hyperparameter choice. The best hyperparameter is then determined by selecting the value with the highest validation accuracy. Here, the parameters were chosen based on a trial-and-error method.
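A sketch of the training configuration listed above and in Table 3 (SGD, learning rate 0.08, batch size 16, 100 epochs, mean-squared-error loss) is shown below. The dummy arrays exist only to make the snippet self-contained; the real inputs are the preprocessed slices and their one-hot labels, and `trdcnn_sketch` refers to the illustrative model from the earlier architecture sketch.

```python
import numpy as np
from tensorflow.keras.optimizers import SGD

# Placeholder data standing in for the preprocessed 256x256 slices and labels.
x_train = np.random.rand(32, 256, 256, 1).astype("float32")
y_train = np.eye(3)[np.random.randint(0, 3, 32)]

trdcnn_sketch.compile(optimizer=SGD(learning_rate=0.08),   # values from the text/Table 3
                      loss="mean_squared_error",
                      metrics=["accuracy"])
trdcnn_sketch.fit(x_train, y_train, batch_size=16, epochs=100,
                  validation_split=0.1)                    # validation fraction assumed
```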

Table 3 Training parameters.

Evaluation metrics

Tumor detection and classification

The metrics used for evaluation were as follows: sensitivity (SV), which indicates the proportion of correctly classified positives; specificity (SP), which is the proportion of correctly classified negatives; and accuracy (AC), which represents the proportion of both true negatives and true positives. These are calculated as in Eqs. (9)–(11), respectively5:

$$\begin{aligned} {{\textbf {Sensitivity (SV)}}}= & {} \frac{\sigma }{\sigma + \Phi } \end{aligned}$$
(9)
$$\begin{aligned} {{\textbf {Specificity (SP)}}}= & {} \frac{\eta }{\eta + \Psi } \end{aligned}$$
(10)
$$\begin{aligned} {{\textbf {Accuracy (AC)}}}= & {} \frac{\sigma + \eta }{\sigma + \eta + \Phi + \Psi } \end{aligned}$$
(11)

where \(\sigma\) is the number of true positives, \(\Phi\) is the number of false negatives, \(\eta\) is the number of true negatives, and \(\Psi\) is the number of false positives. The intersection over union (IoU) is a popular metric for measuring localization accuracy in binary classification; it is calculated as in Eq. (12)84:

$$\begin{aligned} {{\textbf {Intersection over Union (IoU)}}}= \frac{\sigma }{\sigma + \Phi + \Psi } \end{aligned}$$
(12)
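A NumPy sketch that computes Eqs. (9)–(12) directly from binary prediction and ground-truth arrays follows; the variable names mirror the symbols defined above.

```python
import numpy as np

def detection_metrics(pred, truth):
    """Sensitivity, specificity, accuracy and IoU per Eqs. (9)-(12).
    `pred` and `truth` are binary arrays (1 = positive, 0 = negative)."""
    pred, truth = np.asarray(pred, bool), np.asarray(truth, bool)
    tp = np.sum(pred & truth)      # sigma: true positives
    fn = np.sum(~pred & truth)     # Phi:   false negatives
    tn = np.sum(~pred & ~truth)    # eta:   true negatives
    fp = np.sum(pred & ~truth)     # Psi:   false positives
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "iou": tp / (tp + fn + fp),
    }
```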

Tumor segmentation

The system’s performance on the test set was determined by comparing the prediction output with the ground truth supplied by expert radiologists. Tumor structures are categorized into three distinct regions, primarily for practical clinical applications: the whole tumor (which includes all tumor tissue types), the core tumor (which includes all types except edema), and the enhancing tumor (which includes only the enhancing tissue). The Dice coefficient, sensitivity, and specificity were computed for each tumor region as in Eqs. (13)–(15):

$$\begin{aligned} {{\textbf {Dice}}}= & {} \dfrac{|O\cap G|}{(|O|+|G|)/2} \end{aligned}$$
(13)
$$\begin{aligned} {{\textbf {Sensitivity}}}= & {} \dfrac{|O\cap G|}{|G|} \end{aligned}$$
(14)
$$\begin{aligned} {{\textbf {Specificity}}}= & {} \dfrac{|O_{0}\cap G_{0}|}{|G_{0}|} \end{aligned}$$
(15)

where O is the positive segmented regions, \(O_0\) is the negative segmented regions, G is the positive regions of the ground truth, and \(G_0\) is the negative (background) regions of the ground truth. The intersection between O and G is \(|O\cap G|\)4. Another popular performance metric for determining the separation between two point sets is the average Hausdorff distance85, as in Eq. (16):

$$\begin{aligned} {\textbf {Hausdorff}}(X,Y) = \dfrac{1}{|X|}\sum _{x\in X}\min _{y\in Y}d(x,y)+\dfrac{1}{|Y|}\sum _{y\in Y}\min _{x\in X}d(x,y) \end{aligned}$$
(16)
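The sketch below implements the Dice score of Eq. (13) and the average Hausdorff distance of Eq. (16) on binary masks; the use of SciPy's `cdist` over foreground-voxel coordinates is an implementation choice for illustration, not part of the paper's pipeline.

```python
import numpy as np
from scipy.spatial.distance import cdist

def dice_score(pred, truth):
    # Eq. (13): 2|O ∩ G| / (|O| + |G|)
    pred, truth = np.asarray(pred, bool), np.asarray(truth, bool)
    return 2.0 * np.sum(pred & truth) / (pred.sum() + truth.sum())

def average_hausdorff(pred, truth):
    # Eq. (16): sum of the mean nearest-neighbour distances in both directions,
    # computed over the coordinates of the foreground voxels of each mask.
    x = np.argwhere(np.asarray(pred, bool))
    y = np.argwhere(np.asarray(truth, bool))
    d = cdist(x, y)
    return d.min(axis=1).mean() + d.min(axis=0).mean()
```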

Results and discussion

Tumor detection accuracy analysis

Multiple tests were conducted to confirm the suggested network’s performance in achieving the tumor-detection goal. Table 4 shows a comparison with various methods reported in the literature. The empirical results demonstrate that the suggested TRDCNN structure outperforms existing techniques in terms of accuracy, sensitivity, and specificity. Without taking network training time into account, the average testing duration per image was 0.28 s. Figure 7 shows the improvement TRDCNN achieves over the training and validation images.

Table 4 Comparison of the proposed approach against some methods in the literature.
Figure 7

Plots of accuracy and loss for the classification of the training and validation images, as obtained by TRDCNN.

Analysis of tumor-segmentation accuracy

Table 5 presents the performance of TPCUAR-Net in comparison with different investigations found in the literature that used the U-Net network. Simulation tests were conducted to demonstrate the performance of the presented system in fulfilling the segmentation task. A comparison with other investigations in the literature is given in Table 6, and Fig. 9 presents the segmentation findings. Without taking network training time into account, the average testing duration per image was 0.08 s. Figure 8 presents boxplots for the test dataset. In this figure, our proposed method ranks first among competing results for complete, core, and enhancing tumors, with fewer outliers than the other techniques.

Table 5 Comparison of the proposed TPCUAR-Net against some methods based on U-Net in terms of Dice score.
Table 6 Comparison of the proposed TPCUAR-Net against some state-of-the-art methods.
Figure 8

Dice scores and sensitivity; the median is shown in red, and outliers are in blue.

Figure 9

Visual results from TPCUAR-Net from the axial view. From left to right, the columns show the FLAIR modality of HG and LG tumors, the ground truth, and the predicted image.

Discussion

The detection and classification of brain tumors are important steps that rely on the expertise and knowledge of a physician, and an intelligent method for detecting and classifying brain tumors is vital for assisting clinicians. Gliomas, with their uneven shapes and uncertain boundaries, are the most difficult tumors to diagnose. Image segmentation presents substantial issues in terms of categorization, image processing, object detection, and explanation. For example, whenever an image-classification model is developed, it must be able to work with great precision even when subjected to occlusion, lighting variations, changes in viewing angle, and other factors.

In this study, we designed a deep CNN diagnosis model based on parallel paths. This can provide various benefits over and above those of a standard CNN, including increased model capacity and feature diversity, ensemble learning, multi-resolution analysis, efficient information flow, and regularization capabilities. These advantages can boost the network’s performance, its capacity to learn complex patterns, and its generalization ability. We then incorporated residual blocks into our model to achieve improved performance, providing benefits such as increased model capacity, improved gradient flow, enhanced feature reuse, regularization for reduced overfitting, efficient training, and flexibility in network design. The addition of residual blocks can help to train deep models faster and more efficiently. Skip connections are used to provide faster convergence by giving direct access to lower-level features. This minimizes the number of weight updates necessary for information propagation, resulting in faster training overall. Efficiency in training is especially important when dealing with huge datasets and computationally expensive models. The proposed detection and classification model was found to achieve an accuracy of 98.88%, which is greater than the accuracy of other models reported in the literature. In the segmentation, our architecture achieved a Dice score of 0.91, which is greater than those of other comparable models reported in the literature. Therefore, increasing the depth of U-Net by using cascaded and parallel paths, with the application of some useful preprocessing techniques, can improve the segmentation performance.

The processing time is another important factor for evaluating the proposed model. The training time was not considered, because the parameters were kept unchanged after training. The technique used for measuring processing time involved transmitting all of the images into the proposed system, recording the associated calculation time for each stage for each individual image, and computing the average value to reflect the times used by different stages. The average testing duration per image was 0.41 s.

Our innovative complete CADx approach has the potential to play an important role in the early detection and diagnosis of brain tumors; it can be applied as a useful tool in hospital emergency units during the examination of patients with MRI scans because of the greater possible speed of diagnosis. Our method allows doctors to analyze MRI images before processing to determine whether they are normal or contain an HGG or LGG. Once an image is recognized as abnormal, clinicians can detect and segment the brain tumor to determine its size.

Conclusions and future work

Herein, we have described a deep-learning-based technique for detecting, classifying, and segmenting glioblastoma brain tumors using MRI images. The primary goal of this work was to merge the detection, classification, and segmentation processes into a single fully automated system. The first phase includes a deep CNN architecture for brain-tumor identification and classification from MRI scans, which classifies the images as normal, HGG, or LGG using the TRDCNN architecture. This was assessed using the BraTS 2017 database, with 1350 and 450 images used for training and testing, respectively. The TRDCNN architecture was found to produce encouraging results in terms of accuracy, sensitivity, and specificity, with values of 98.88%, 98.66%, and 99.60%, respectively.

For the second phase, a deep-CNN-based automatic brain-tumor segmentation approach from MRI images was described. Various structures with varying depths were studied and investigated. The second phase was assessed using a database derived from the BraTS 2017 dataset, with 62,000 and 43,400 images used for training and testing, respectively. The TPCUAR-Net was found to produce the best results for the total, core, and enhancing tumor areas, with a maximum Dice score of 0.91 and a testing duration of 0.45 s per image. The suggested method’s superiority stems from various advantages, including its ability to evaluate both local and global aspects to learn both high-level and low-level information at the same time. The use of a fully connected layer, residual blocks, and skip connections could help to solve the vanishing-gradient problem while also speeding up both training and testing. A cascaded network can efficiently train a CNN when the label distribution is imbalanced. In the two parallel networks, the proposed technique integrates both global and local features.

One limitation of this work is the lack of defined procedures for evaluating CADx systems in a real clinical setting. The image formats used to train the models were those of the AI research area (PNG) rather than those of the radiology field (DICOM, NIfTI), which is significant. Furthermore, the analysis would benefit from authors with clinical backgrounds; as such, the engagement of doctors in the process may improve the study’s relevance and the acceptance of its findings. Our planned future work involves implementing complete CADx systems within clinical practice. The existing design will be improved in the future, with a possible expansion to a 3D CNN architecture, enabling 3D brain-tumor diagnosis from MRI and other scans.