Introduction

Autism spectrum disorder (ASD) is a neurodevelopmental disorder affecting 1 in 44 children1. Patients with ASD are genetically heterogeneous and present diverse behavioral characteristics and varying degrees of intellectual performance; thus, clinical diagnosis of ASD remains challenging. Current clinical practice relies mainly on subjective personal characteristic (PC) assessment by physicians and/or neuroimaging data, e.g., fMRI. PC assessment covers social interaction, language skills, IQ, and stereotypical behaviors, whereas functional connectivity (FC) extracted from fMRI data reflects the interrelationship and temporal connectivity among different brain regions. While both are useful to some extent for diagnosis, few studies have leveraged both data types in clinical diagnosis.

Deep learning has gained tremendous attention in recent years, and the application of deep learning algorithms (e.g., graph neural networks) to clinical disease diagnosis has become a popular approach in computer-aided diagnosis (CADx) studies2, such as for Alzheimer’s disease3,4,5,6 and autism7,8. Recently, a number of deep learning algorithms have been applied to ASD diagnosis, especially using MRI data. For example, Y. Kong et al.9 considered the connectivity between each pair of regions of interest (ROIs), evaluated on T1-weighted MRI images with a deep neural network classifier. Wang et al.10 used multilayer perceptron and ensemble learning methods with multi-graph features for ASD identification. Since medical data of different modalities typically provide complementary information, multimodal data integration has also been attempted for disease diagnosis. To learn the complex relationships and information in multimodal data, graph structures have been employed in the medical field11,12. For example, Parisot et al.13 constructed a demographic information graph using non-imaging features and added imaging features for classification. Hypergraphs, as an extension of general graphs, are particularly efficient for handling multi-relational and high-order relationships. In a hypergraph, relationships between nodes are not restricted to binary connections, so intricate associations between entities can be expressed more faithfully. For instance, Di et al.14 employed hypergraph learning to identify and classify COVID-19 patients, and Xiao et al.15 constructed hypergraph-based representations of fMRI data to explore the classification of neurodegenerative diseases. Despite these advancements in disease diagnosis using multi-modal data, privacy regulations can be an obstacle when collecting large-scale multi-modal data for model training, because such datasets are typically integrated from multiple institutions. The limitations faced in multimodal deep learning for ASD diagnosis highlight the demand for more effective solutions that leverage the benefits of multi-center data while addressing the challenges of data heterogeneity and privacy concerns. Recently, federated learning16,17,18 has been proposed to address local data management and privacy protection by collaboratively training models and transferring only model parameters without exchanging the data itself, so that clinical data do not have to be stored centrally. Recent studies indicate notable achievements of federated learning in medical image diagnosis, disease prediction, drug development, and related fields. For instance, the PriMIA19 framework proposed by Kaissis et al. realized differential privacy, securely aggregated federated learning, and encrypted inference to protect sensitive medical imaging data without requiring data transmission. Other applications of federated learning to COVID-1920,21,22 diagnosis also demonstrate its immense potential.

In this study, we propose a deep learning framework for autism prediction using multimodal feature fusion and hypergraph neural networks (HGNN) in federated learning (FedHNN). FedHNN uses a hypergraph to fuse functional neuroimaging data with PC data and capture interrelationships within the multimodal data. Graph structures are powerfully expressive for modeling relationships between patients23. In multimodal feature fusion, hypergraphs learn multivariate relationships more accurately than ordinary graph structures, facilitating multimodal fusion and expansion24 and building networks of relationships between patients more effectively.

We applied FedHNN to ASD diagnosis, where it demonstrated superior performance compared with other deep learning models and with models trained on single-site datasets. Extensive model evaluation experiments demonstrated that FedHNN can be used effectively for privacy-preserving ASD diagnosis tasks. This study also indicates the high potential of federated learning for achieving large-scale precision medicine.

Results

Building a privacy-preserving multimodal deep learning framework

The experimental data used in this study were obtained from the Autism Brain Imaging Data Exchange (ABIDE I)25 dataset. The ABIDE dataset comprises 17 international acquisition sites that publicly share resting-state functional magnetic resonance imaging (R-fMRI), anatomical, and phenotypic data from 1112 subjects, with sample sizes varying across clinical sites. To ensure that the deep learning model could be executed on a single site, the ROI fMRI time series were downloaded from the preprocessed ABIDE dataset, and a total of 449 subjects (206 ASD subjects and 243 typical control (TC) subjects) from the four largest sites (New York University (NYU); University of California, Los Angeles (UCLA); University of Michigan (UM); and University of Utah School of Medicine (USM)) were selected for this study. The demographic information of each site is summarized in Table 1.

Table 1 Data summary of the dataset used in our study

As shown in Fig. 1, we applied a federated learning framework to train on data collected from different clinical sites with privacy protection. For the data from each clinical site, a local multimodal HGNN was trained with feature fusion of PC data and fMRI data. Only encrypted model parameters collected from each local model were transmitted to the global model, thus preserving patient privacy at each clinical site. We applied this framework to the four ABIDE sites mentioned above. The model was trained on all samples from the four sites, alleviating the small-sample problem faced when training each center’s dataset independently, while preserving patient privacy through the federated learning approach. More details of the model framework can be found in Methods.

Fig. 1: The model framework and workflow.

Multimodal data, including fMRI scan data and phenotype data, are used to develop local models for ASD diagnostic tasks. a Federated learning framework. Each local model is trained on protected private data and communicates with the global model at a specific frequency, uploading only encrypted model parameters when communicating (left). The global model uses the averaging strategy to update the model parameters and distributes them to each local model (right). b The local model uses the multimodal data to construct hyperedge groups separately and generates the hypergraph by connecting the hyperedge groups (hypergraph generation). The hypergraph and the fused node features are jointly input to the hypergraph neural network for multi-layer hypergraph convolution to finally obtain the classification results.

Assessment for confounding

We first assessed the differences between data types across medical sites and between subjects with different disease states (Fig. 2). The age distribution differs across sites. There is no obvious difference between sites in the three intelligence characteristics (FIQ, VIQ, and PIQ), but autism patients showed a markedly lower mean IQ than normal subjects (Fig. 2d). Next, we applied UMAP to project the phenotypic data and the three brain atlas features separately into a one-dimensional feature space, to assess potential confounding relationships between disease states, site differences, and the different data features. We noticed that the three brain atlas features differed between sites (Fig. 2e), probably because different medical sites use scanners from different manufacturers, or because the calibration methods and acquisition protocols differ from site to site. For example, the NYU site uses a 3 Tesla Allegra scanner, whereas UM uses a 3 Tesla GE Signa scanner. At the NYU site, all subjects completed at least one simulated scan prior to the actual scan, and most participants were asked to keep their eyes open and relax while a white crosshair was projected on a black background during the resting-state fMRI scan; at the USM site, subjects did not undergo any simulated scanning procedure prior to the scan, and images were acquired from participants approximately every 2–3 years. Through multimodal feature fusion, data from different modalities might provide complementary information to each other, and the federated learning framework also eliminates data differences between medical sites to some extent.

Fig. 2: Overview of brain atlas visualization and data distribution variance.

a Different brain atlases define the regions of interest (ROIs) from the blood oxygenation level-dependent (BOLD) signal to establish the low-order functional connectivity (LOFC) matrix. b Visualization of a brain atlas, using the Automated Anatomical Labeling (AAL) atlas as an example. c Only the top 1% of edge strengths were retained to map the functional brain connectome, using the AAL atlas as an example. d Visualization of differences between sites in terms of age, FIQ, VIQ, and PIQ in the phenotypic data. e Differences in the distribution of the different data types across centers, covering the three brain atlases (AAL, Harvard-Oxford (HO), and Craddock200 (CC200)) and the phenotypic data.

FedHNN using multi-site data outperforms the model using single-site data

To assess the performance of FedHNN, we randomly split the samples into training and test sets. Using stratified 5-fold cross-validation, we demonstrated that our model classified autism vs. healthy individuals with 73.52% accuracy. In addition, we compared the performance of federated learning with other non-federated learning strategies on the ASD identification task. Table 2 presents the evaluation metrics obtained on the test set for the different learning strategies. Among the four single-site models, the two sites with larger data sizes (NYU and UM) obtained higher AUCs than the other sites (0.6900 and 0.7037), with accuracies of 0.7125 and 0.7194, precisions of 0.7689 and 0.7272, recalls of 0.7272 and 0.8257, specificities of 0.6000 and 0.5818, and F1-scores of 0.7696 and 0.7664, respectively. In comparison, the proposed FedHNN obtained an AUC of 0.7110, an accuracy of 0.7352, a precision of 0.7319, a recall of 0.8204, a specificity of 0.6028, and an F1-score of 0.7598, outperforming all single-site trained models. Thus, our proposed privacy-preserving federated learning model can improve upon the learning performance of individual sites by extending the amount of training data through federating each site.

Table 2 FedHNN model performance

However, the federated learning process requires each site to train the model on its private data and to encrypt the trained model parameters before transferring them to the global model, which introduces some communication loss. Without using the federated learning strategy to protect data privacy, the All-HNN strategy combined data from all sites, extending the amount of data; its AUC on the test set was 0.7776, much higher than both the federated learning strategy and the single-site independent training strategy. This demonstrates that accurate ASD identification benefits from sharing a large amount of medical data. Nevertheless, federated learning under a privacy-preserving strategy can still effectively collect data information from each site and improve the accuracy of ASD identification.

Benchmark against other deep learning strategies

Based on the privacy-preserving strategy, we compared different deep learning models to verify that the HGNN-based FedHNN model proposed in this study achieves the best performance. We used different graph learning models and a convolutional network as baseline approaches under the federated learning strategy and compared them in the same experimental setup: (1) Graph Convolutional Network (GCN)26, (2) Graph Attention Network (GAT)27, (3) Sample and aggreGatE (GraphSAGE)28, and (4) Convolutional Neural Network (CNN).

FedHNN based on the HGNN obtained the best AUC of 0.7110. Table 2 presents the evaluation metrics obtained on the test set for the different deep learning models under the federated learning strategy. The results illustrate that, compared with ordinary graph structures, hypergraphs perform much better at establishing multivariate relationships between modalities, conveying complex higher-order correlations in the data, and facilitating the fusion and extension of different modalities. The hypergraph structure can fuse multimodal information into the same graph structure through its flexible hyperedges, which allows for more efficient construction of relational networks between patients. In addition, the combination of hypergraph structure and hyperedge convolution performed better than the other model combinations.

Data types and hyperparameters of FedHNN

To evaluate the contribution of multimodal feature fusion to the classification results, we tested model performance using different combinations of input data. Comparing the accuracy results, we found that the three fMRI brain atlases used in combination with the phenotypic data yielded the best classification performance. All three fMRI brain atlases played important roles in the ASD identification task relative to the phenotypic data, and fusion with the non-imaging data further improved the classification performance. The combination of the three brain atlases also yielded better classification results than any single atlas, suggesting that the different brain atlases may provide complementary information for ASD diagnosis (Table 3).

Table 3 Performance of different data type combinations

Next, we explored the impact of two important hyperparameters of FedHNN on the ASD identification task: the number of hypergraph neighbor nodes (kneigs) and the update speed of the global model (pace). To determine the optimal number of neighbor nodes, we varied kneigs from 5 to 25 nearest neighbors (including the central vertex itself), input the resulting hypergraph structures into the model for training, and evaluated the combined results. The best accuracy was achieved at kneigs = 10 for both the NYU and USM sites (Fig. 3c), and the highest average accuracy across sites was also achieved with kneigs = 10 (Fig. 3a). Because communication between the local models and the global model is costly, we further investigated the effect of pace on the accuracy of the ASD identification task to identify the optimal update rate of the global model. The USM site showed a clear advantage at pace = 20, achieving an accuracy of 0.82 (Fig. 3d). Combining the results from all sites, we chose the hypergraph constructed with kneigs = 10 as the model’s graph input and performed communication between the local and global models every 20 steps (pace = 20) to achieve the best performance in federated learning (Fig. 3a, b).

Fig. 3: The impact of hyperparameters of FedHNN.

a–d Investigating the effect of the number of neighbor nodes kneigs and the communication pace on the accuracy of the ASD identification task. a, b The effect of the two variables on the aggregate accuracy over all sites, shown as box plots. c, d The effect of different kneigs and pace values on each site’s accuracy, shown as histograms.

Discussion

Automating ASD diagnosis remains a challenging task. In this study, we demonstrate the possibility of using federated learning to combine data from different sites for automatic/computational ASD diagnosis in a way that protects data privacy across healthcare institutions. Obtaining sufficient data remains a major challenge in CADx research, not only in collecting data but also in requiring collaboration among medical institutions to solve the problem of medical data annotation. In the federated learning setup, participants retain local data and execute distributed computations rather than transferring data directly to a centralized data warehouse to build machine learning models. Thus, federated learning addresses privacy issues and encourages multi-institutional collaboration.

We show in a proof of concept that a federated deep learning model based on hypergraph fusion of multimodal features achieves high accuracy in ASD diagnosis tasks, where the multimodal features combine multi-scale brain FC data and PC data obtained from ABIDE’s large heterogeneous dataset. This dataset collected data from 17 sites, and we chose only the data from the four largest sites for this study to ensure that the deep learning model could be executed on a single site. Federated learning outperformed the individual models because it ensembled and aggregated multiple local models in a privacy-preserving manner and was thereby trained on a larger and more diverse dataset. In the federated learning framework, multiple individual models were first trained on local data from different sources or devices and then aggregated by some strategy without leaking privacy. This multi-center data distribution increased data diversity and scale compared with training on a single centralized dataset, allowing the federated model to capture more comprehensive and representative underlying data and thereby achieve better generalization and improved performance. However, due to information loss during the aggregation of local models, the model trained on the directly aggregated data from all sites, without protecting privacy, outperformed federated learning.

The number of participants selected for medical analysis in this study was much smaller than in most other federated learning applications, which may play a role, especially when using averaging strategies to update model parameters. Moreover, we found that compared with the “All” strategy (training with all data stored centrally), adding the federated learning strategy reduces ASD identification accuracy to some extent, probably because the update strategy used by the current model is not optimal. In the future, more efficient model update strategies may improve the effectiveness of communication between local and global models and decrease the loss incurred by intercommunication16,29.

In conclusion, the FedHNN model proposed in this study can be used effectively in ASD diagnosis tasks, enabling collaborative training across multiple sites to address the data isolation and privacy-preservation challenges of training accurate deep learning models. The proposed method could be applied to many applications; for example, it could facilitate the clinical diagnosis of various diseases such as depression, Alzheimer’s disease, and COVID-19. In the field of clinical diagnosis, data isolation and the emphasis on data privacy have emerged as prominent challenges, and traditional centralized data analysis methods may suffer from problems of data security, privacy protection, and data sharing. In this context, our proposed approach introduces a potential solution for clinical disease diagnosis. By enabling collaborative training across multiple institutions without sharing the data directly, the federated learning-based method addresses the concerns associated with sharing sensitive clinical data. As a result, it holds the potential for novel research experiments and commercial opportunities, ultimately contributing to the enhancement of global patient care. This study therefore particularly addresses the challenges in scenarios where data are too sensitive to share and privacy regulation limits the development of large artificial intelligence models.

Methods

Federated learning process

During model training, we set up a central server as the global model to calculate the updated model weight information, and all medical sites used the same deep learning framework to accomplish the same task. We trained a local model at every single site and uploaded the model weight information to the global model at a certain frequency. The weights shared by every site were encrypted by adding random noise to protect the data from being leaked through inverse processing. The global model aggregated the parameters from all local models and distributed the processed weights back to the individual medical sites. Each local model then continued to perform internal optimization based on the updated parameter information.

As shown in Fig. 1, formally, we consider S medical sites (S = 4 in this study) participating in federated learning, with Ns the number of patients at data site s. At the beginning of each federated training round (epoch), each local model was randomly initialized with local model parameters \({{\boldsymbol{w}}}_{{\boldsymbol{s}}}^{{\boldsymbol{(}}{\boldsymbol{0}}{\boldsymbol{)}}}\). Defining R as the number of optimization iterations of the local model, each local site s trained its model parameters \({{\boldsymbol{w}}}_{{\boldsymbol{s}}}^{{\boldsymbol{(}}{\boldsymbol{r}}{\boldsymbol{)}}}\) over r rounds using local data Xs and uploaded the encrypted model parameters \({\widetilde{{\boldsymbol{w}}}}_{{\boldsymbol{s}}}={w}_{s}+{\varepsilon }_{s}\) to the global model at a fixed frequency pace. The global model collected the model parameters uploaded by all local sites, averaged them using the averaging strategy30, and then deployed the updated weights \(\bar{{\boldsymbol{w}}}\), given in Eq. (1), to each local model; each local model then continued local optimization in the next round r + 1. This process was repeated until the global model converged.

$$\bar{{\boldsymbol{w}}}=\frac{{\sum }_{s=1}^{S}{\widetilde{{\boldsymbol{w}}}}_{{\boldsymbol{s}}}}{S}$$
(1)
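To make the aggregation above concrete, the following is a minimal PyTorch-style sketch of one communication round, assuming each local model is an ordinary nn.Module; the noise scale and the `train_fn` callbacks are illustrative placeholders rather than the exact implementation used in the paper.

```python
import copy
import torch

def encrypt_weights(state_dict, noise_std=1e-3):
    """Perturb local weights with random noise before upload (w_s + eps_s).
    The noise scale is illustrative; the paper does not specify its magnitude."""
    return {k: v + noise_std * torch.randn_like(v) if v.is_floating_point() else v
            for k, v in state_dict.items()}

def federated_average(local_state_dicts):
    """Average the (encrypted) parameters uploaded by all S local sites (Eq. 1)."""
    avg = copy.deepcopy(local_state_dicts[0])
    for key in avg:
        for sd in local_state_dicts[1:]:
            avg[key] = avg[key] + sd[key]
        avg[key] = avg[key] / len(local_state_dicts)
    return avg

def communication_round(local_models, local_train_fns, pace=20):
    """One round: each site trains `pace` local steps, uploads noisy weights,
    the server averages them and redistributes the result to every site."""
    uploads = []
    for model, train_fn in zip(local_models, local_train_fns):
        for _ in range(pace):          # local optimization between communications
            train_fn(model)
        uploads.append(encrypt_weights(model.state_dict()))
    global_weights = federated_average(uploads)
    for model in local_models:         # broadcast updated weights to every site
        model.load_state_dict(global_weights)
```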

Network architecture

The complete framework of our proposed local model in FedHNN is shown in Fig. 1b. We adopted HGNN, which generalizes the GCN to hypergraphs, as the deep learning framework in each local model. As a representation learning method, HGNN uses the hypergraph structure, which offers more powerful modeling capacity. We adapted the hypergraph model to the ASD recognition task with an HGNN-based feature fusion strategy, in which the different modal features obtained in the preprocessing step are used to generate hyperedges.

In each local model, we define the hypergraph \({\mathscr{G}}{\mathscr{=}}{\mathscr{(}}{\mathscr{V}}{\mathscr{,}}{\mathscr{E}}{\mathscr{,}}{\bf{W}})\), which includes a vertex set \({\mathscr{V}}\) and a hyperedge set \({\mathscr{E}}\), with each hyperedge assigned a weight by W, a diagonal matrix of edge weights. Unlike GCN, which uses the adjacency matrix A to represent the graph structure, HGNN uses the incidence matrix H (size: \({\mathscr{V}}{\mathscr{\times }}{\mathscr{E}}\)) to represent the hypergraph structure, with entries defined as

$$h\left(v,e\right)=\left\{\begin{array}{c}1,{\rm{if}}\,{v}\in e\\ 0,{\rm{if}}\,{v}\notin e\end{array}\right.$$
(2)

For each site, we generated the multimodal data \({\bf{X}}=[{{\bf{X}}}_{1},{{\bf{X}}}_{2},\ldots ,{{\bf{X}}}_{{\rm{C}}}]\in {{\mathbb{R}}}^{{\rm{n}}\times {{\rm{d}}}_{{in}}}\) after data preprocessing, where C is the number of data modalities and \({d}_{{in}}\) is the input dimension of the data X fed into the model.

As shown in the model framework figure (Fig. 1), we constructed a hyperedge group (hypergraph) for each modality and then concatenated the hyperedge groups to generate the multi-modality hypergraph. We adopted the following steps to construct the hypergraph for each modality: (1) we represented each sample/patient as an embedding vector, which is the representation of that modality; (2) the Euclidean distance between the representation vectors of every two samples was used as the similarity between the two samples; (3) during hypergraph construction, each vertex represents one sample, and each vertex is connected to its kneigs nearest neighbors (the top kneigs most similar samples in terms of distance) to generate a hyperedge \(e\in {\mathscr{E}}\). As a result, there are n hyperedges, each connecting kneigs vertices in the hypergraph. Compared with the general graph structure, the constructed hypergraph structure can describe and mine nonlinear high-order relationships between data samples, making it more flexible for multimodal and heterogeneous data and more convenient for the integration and expansion of multiple modalities. Therefore, the constructed hypergraph enables the model to better capture the similarities and correlations among samples in the multi-modality data.
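The per-modality hyperedge construction in steps (1)–(3) can be sketched as follows with NumPy and SciPy; the function names and the default kneigs = 10 are illustrative assumptions rather than the authors’ code.

```python
import numpy as np
from scipy.spatial.distance import cdist

def modality_incidence_matrix(features, kneigs=10):
    """Build the n x n incidence matrix for one modality.

    features: (n_samples, d) embedding of every subject for this modality.
    Each column j is a hyperedge connecting sample j to its kneigs nearest
    neighbours in Euclidean distance (the vertex itself is included, since
    its distance to itself is zero)."""
    dist = cdist(features, features)                 # pairwise Euclidean distances
    H = np.zeros_like(dist)
    for j in range(dist.shape[0]):
        neighbours = np.argsort(dist[:, j])[:kneigs] # kneigs closest samples
        H[neighbours, j] = 1.0
    return H

def multimodal_incidence(feature_list, kneigs=10):
    """Concatenate the hyperedge groups of all C modalities: H = [H_1, ..., H_C]."""
    return np.concatenate(
        [modality_incidence_matrix(f, kneigs) for f in feature_list], axis=1)
```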

Upon constructing the hypergraph, we used Eq. (2) to obtain the incidence matrix \({{\bf{H}}}_{{\boldsymbol{i}}}\in {{\mathbb{R}}}^{{\rm{n}}\times n}\). The element at the ith row and jth column of the matrix is 1, indicating that vertex vi belongs to the jth hyperedge and is therefore connected to vertex vj through that hyperedge; the other elements are set to 0, representing no connection between the corresponding vertices and hyperedges. The fused incidence matrix \({\bf{H}}{\boldsymbol{=}}\left[{{\bf{H}}}_{1}{\boldsymbol{,}}{{\bf{H}}}_{2}{\boldsymbol{,}}{\boldsymbol{\ldots }}{\boldsymbol{,}}{{\bf{H}}}_{{\rm{C}}}\right]\) is then obtained by concatenating the incidence matrices of all modalities, and the hypergraph convolution operation is executed, which can be formulated as

$${\bf{Y}}={{\bf{D}}}_{v}^{-\frac{{\bf{1}}}{{\bf{2}}}}{\bf{HW}}{{\bf{D}}}_{e}^{-{\bf{1}}}{{\bf{H}}}^{{\bf{T}}}{{\bf{D}}}_{v}^{-\frac{{\bf{1}}}{{\bf{2}}}}{\bf{X}}{\boldsymbol{\Theta }}$$
(3)

where De and Dv denote the diagonal matrices of hyperedge degrees and vertex degrees, with each hyperedge degree defined as \(\delta (e)=\sum _{v\in {\mathscr{V}}}h(v,e)\) and each vertex degree defined as \(d\left(v\right)=\sum _{e\in {\mathscr{E}}}\omega \left(e\right)h(v,e)\); the role of De and Dv can be summarized as normalizing the incidence matrix H. \({\boldsymbol{\Theta }}\in {{\mathbb{R}}}^{{d}_{{in}}\times {d}_{{out}}}\) is the trainable parameter matrix, which extracts dout-dimensional features from the initial X. \({\bf{Y}}\in {{\mathbb{R}}}^{n\times {d}_{{out}}}\) is the output of the convolution operation, which can be used for classification.

The complete hypergraph convolution layer was obtained by the above hypergraph convolution operation plus a nonlinear activation function, which can be formulated as

$${{\bf{X}}}^{\left(l+1\right)}={{\sigma }}\left({{\bf{D}}}_{v}^{-\frac{1}{2}}{\bf{HW}}{{\bf{D}}}_{e}^{-1}{{\bf{H}}}^{{\rm{T}}}{{\bf{D}}}_{v}^{-\frac{1}{2}}{{\bf{X}}}^{\left(l\right)}{{\boldsymbol{\Theta }}}^{\left(l\right)}\right)$$
(4)

where \({{\bf{X}}}^{(l+1)}\) is the output of the lth layer and σ is the ReLU function used for nonlinear activation.
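A minimal PyTorch sketch of the hypergraph convolution in Eqs. (3) and (4), stacked into a two-layer local model with dropout as described in Methods, is given below. It assumes unit hyperedge weights (W = I), dense matrices, and illustrative hidden-dimension and dropout values; it is not the authors’ exact implementation.

```python
import torch
import torch.nn as nn

class HypergraphConv(nn.Module):
    """One hypergraph convolution (Eq. 3), assuming unit hyperedge weights W = I."""
    def __init__(self, d_in, d_out):
        super().__init__()
        self.theta = nn.Linear(d_in, d_out, bias=False)    # trainable Theta

    def forward(self, X, H):
        # H: (n_vertices, n_hyperedges) incidence matrix from Eq. (2)
        Dv = H.sum(dim=1).clamp(min=1e-8)                  # vertex degrees d(v)
        De = H.sum(dim=0).clamp(min=1e-8)                  # hyperedge degrees delta(e)
        Dv_inv_sqrt = torch.diag(Dv.pow(-0.5))
        De_inv = torch.diag(De.pow(-1.0))
        # D_v^{-1/2} H D_e^{-1} H^T D_v^{-1/2} X Theta
        L = Dv_inv_sqrt @ H @ De_inv @ H.t() @ Dv_inv_sqrt
        return L @ self.theta(X)

class LocalHGNN(nn.Module):
    """Two-layer HGNN local model with ReLU activation (Eq. 4) and dropout."""
    def __init__(self, d_in, d_hidden=64, n_classes=2, p_drop=0.5):
        super().__init__()
        self.conv1 = HypergraphConv(d_in, d_hidden)
        self.conv2 = HypergraphConv(d_hidden, n_classes)
        self.drop = nn.Dropout(p_drop)

    def forward(self, X, H):
        h = self.drop(torch.relu(self.conv1(X, H)))
        return self.conv2(h, H)                            # class logits per subject
```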

In the GCN approach, the graph structure is usually constructed from single-modal features, because the ordinary graph structure uses an adjacency matrix as the input for graph learning, which restricts each edge to connecting only two nodes. In multimodal feature fusion, the complex heterogeneous relationships mean that ordinary graph structures often lose considerable information when constructed. The hypergraph model used in this study can perform node-edge-node transformation by exploiting the higher-order correlations between data. The hypergraph structure allows better characterization, enables more accurate modeling of multivariate relationships, and facilitates the fusion and extension of multiple modalities, thus building the relationship network between patients more efficiently. Therefore, we used a hypergraph to fuse functional neuroimaging data with PC data, which achieves more accurate ASD identification.

Data processing

In this study, the fMRI scan data were obtained from the Configurable Pipeline for the Analysis of Connectomes (C-PAC)31 in the Preprocessed Connectomes Project, using the AAL atlas32, the Harvard-Oxford (HO) atlas33, and the Craddock200 (CC200) atlas34. Each of the three atlases defines a different set of ROIs and uses BOLD signal imaging, which indirectly reflects the metabolism of brain activity. The Pearson correlation coefficient (PCC) is typically used to assess the synchronization of two signals: if the BOLD signal changes at two brain locations are highly synchronized, a strong functional connection exists between the two locations. We visualize the brain regions defined by the AAL atlas as well as the functional brain connectome retaining only the top 1% of edge strengths (Fig. 2b, c).

In the local model, we form a symmetric LOFC matrix (Fig. 2a) by computing the PCC between the BOLD time series of every pair of ROIs defined by each of the three atlases, and we extract its upper triangle as the original feature representation of that atlas.

$${\rm{PCC}}\left({r}_{i},{r}_{j}\right)=\frac{E\left({r}_{i}{r}_{j}\right)-E\left({r}_{i}\right)E\left({r}_{j}\right)}{\sqrt{E\left({{r}_{i}}^{2}\right)-{E}^{2}\left({r}_{i}\right)}\sqrt{E\left({{r}_{j}}^{2}\right)-{E}^{2}\left({r}_{j}\right)}}$$
(5)

where ri and rj denote the time series of brain regions i and j, respectively, and E(∙) denotes the mathematical expectation.
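For illustration, the LOFC feature vector for one subject and one atlas could be computed from the ROI time series as sketched below (assuming a timepoints × ROIs array; np.corrcoef is the sample estimate of Eq. (5)). The function name is illustrative.

```python
import numpy as np

def lofc_features(timeseries):
    """timeseries: (n_timepoints, n_rois) BOLD series for one subject and one atlas.

    Returns the upper triangle (excluding the diagonal) of the symmetric
    ROI-by-ROI Pearson correlation (LOFC) matrix, flattened into a feature vector."""
    fc = np.corrcoef(timeseries.T)        # n_rois x n_rois PCC matrix (Eq. 5)
    iu = np.triu_indices_from(fc, k=1)    # upper-triangle indices
    return fc[iu]
```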

In addition, five phenotypic features, namely gender, age, FIQ, VIQ, and PIQ, were extracted in this study35. The processed phenotypic data are used together with the original feature representations obtained from the three brain atlases mentioned above as the feature representation for the subsequent input.

Training strategy and data splitting

We evaluate the model using a stratified 5-fold cross-validation approach. The data from all sites are randomly split into five equal parts, with four folds used for training and the remaining fold for testing. The performance metrics of all strategies are reported as the mean over the five folds. The proposed FedHNN model applies two layers of HGNN and uses dropout to avoid overfitting. In each round of collaborative training, the local model is optimized using its internal dataset. We applied backpropagation to update the parameters and minimize a cross-entropy loss with a learning rate of 1e-5. The hypergraph model can perform the node-edge-node transform, which refines the features using the hypergraph structure24.
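As an illustration of this evaluation protocol, a sketch of the stratified 5-fold split with scikit-learn is shown below; `train_and_eval_fn` is a hypothetical placeholder standing in for the full federated training and test routine.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold

def cross_validate(features, labels, train_and_eval_fn, seed=0):
    """Stratified 5-fold CV over all subjects; each fold keeps the ASD/TC ratio.

    train_and_eval_fn(train_idx, test_idx) is a placeholder that trains the
    site-wise local models federatedly on the training indices and returns a
    dict of test metrics (accuracy, AUC, ...)."""
    skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=seed)
    fold_metrics = [train_and_eval_fn(train_idx, test_idx)
                    for train_idx, test_idx in skf.split(features, labels)]
    # report each metric as the mean over the five folds
    return {k: np.mean([m[k] for m in fold_metrics]) for k in fold_metrics[0]}
```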