January 10.1016/j.bpsc.2022.10.009 A Comparative Machine Learning Study of Connectivity-Based Biomarkers of Schizophrenia Victoria Shevchenko 2 R. Austin Benn 2 Robert Scholz 1 2 5 Wei Wei 2 Carla Pallavicini 2 Ulysse Klatzmann 2 Francesco Alberti 2 Theodore D. Satterthwaite 4 Demian Wassermann 3 Pierre-Louis Bazin 0 Daniel S. Margulies 2 Affiliations: . Full brain picture Analytics , Leiden , The Netherlands . Max Planck School of Cognition , Leipzig , Germany . Université Paris Cité, CNRS, Integrative Neuroscience and Cognition Center , Paris , France . Université Paris-Saclay , Inria, CEA, Paris , France . University of Pennsylvania Perelman School of Medicine , Philadelphia, PA , US . Wilhelm Wundt Institute for Psychology, Leipzig University , Leipzig , Germany 2024 6 2024 1 14

Functional connectivity holds promise as a biomarker of psychiatric disorders. Yet, its high dimensionality, combined with small sample sizes in clinical research, increases the risk of overfitting when the aim is prediction. Recently, low-dimensional representations of the connectome such as macroscale cortical gradients and gradient dispersion have been proposed, with studies noting consistent gradient and dispersion differences in psychiatric conditions. However, it is unknown which of these derived measures has the highest predictive capacity and how they compare to raw connectivity. Our study evaluates which connectome features - functional connectivity, gradients, or gradient dispersion - best identify schizophrenia. Figure 1 summarizes this work.

-

are the ones connecting primary sensory regions. controls, SCZ: patients with schizophrenia.

I. Introduction

Functional connectivity holds promise as a potential biomarker for psychiatric disorders1,2, as evidenced by a robust body of literature that highlights distinct functional profiles between patients with schizophrenia and neurotypical individuals. Prior studies have reported lower connectivity across regions, reduced small-worldness of the resting state networks, and lower functional network segregation3–5. However, the high dimensionality of connectivity data, combined with small sample sizes in clinical research, poses a risk of overfitting when the aim is prediction6–8.

Recently, low-dimensional representations of the connectome such as macroscale cortical gradients9,10 and gradient dispersion11,12 have been proposed. The gradients are derived from functional connectivity matrices through dimensionality reduction with the aim to maximize the cumulative amount of variance explained by the resulting components. The first component (or the principal gradient) reflects the functional hierarchy of the cortex9,10,13, spanning from the primary sensory (unimodal) regions to higher-order (transmodal) regions. It has been demonstrated to be consistent across individuals14.

Dong et al.15 revealed that the principal gradient is contracted in schizophrenia, i.e., the primary sensory regions were closer to the higher-order regions in terms of their functional profile, as indicated by their principal gradient values. This finding indicates lower functional differentiation between uni- and transmodal regions. Based on the gradient framework, more differences in the cortical functional hierarchy between neurotypical individuals and subjects with neurodevelopmental and psychiatric disorders have been reported12,14,16–18.

As an extension of the gradient framework, gradient dispersion further quantifies the density of local connectivity, known as functional modularity. Specifically, higher dispersion would indicate lower functional modularity within the cluster, whereas lower dispersion would mean higher modularity. Thus, gradient dispersion characterizes functional differentiation across the cortical hierarchy. This measure is of interest to us since changes in the functional hierarchy of the cortex are reported to be idiosyncratic to schizophrenia2,15,19,20.

Nevertheless, it is unknown if these connectivity derivatives can discriminate between patients with schizophrenia and neurotypical individuals, or if they outperform raw connectivity. Here, we attempt to identify the features with the largest biomarker potential from a large set of features including functional connectivity, gradients, and gradient dispersion. In addition, we explore the impact of the number of features on the choice of classifier. We also seek to address the question germane to neuroscience and computational psychiatry: when one has a limited number of subjects and a disproportionately rich set of independent variables, how does one justify the choice of features? Additionally, we elaborate on putative functional underpinnings of schizophrenia based on the features with the highest predictive potential.

II. Methods Data & Preprocessing

The present study's sample was derived from three open-source datasets: COBRE21, LA5c study from UCLA Consortium for Neuropsychiatric Phenomics22, and SRPBS multidisorder MRI dataset23. All data were acquired in accordance with the Declaration of Helsinki. The links to the datasets are provided in the data availability statement below.

The initial cohort for our investigation consisted of 996 individuals; subsequently, 40 subjects were excluded from analysis due to substantial motion artifacts (average framewise displacement (FD) > 0.5mm). The analyzed sample comprised 248 patients with schizophrenia (SCZ) and 688 neurotypical controls (NC; see Table 1). The scanning parameters of each dataset are reported in Supplementary Table 1.

Preprocessing of MRI data was done using fMRIPrep 20.2.124 which is based on Nipype 1.5.125 (Supplementary Methods 1). The preprocessed BOLD time series were parcellated with the Schaefer parcellation (1000 parcels, 7 Yeo networks)26. Then, we computed a connectivity matrix (Pearson correlation) for each subject. displacement. literature10,27,28. We thresholded the connectivity matrices by discarding 90% of the lowest correlation values including negative values. We used Procrustes alignment to align the gradients of all subjects29. To avoid introducing dataset-specific bias to the alignment, we used the gradients computed from the group connectivity matrix of the Human Connectome Project (HCP)30 as reference gradients. Figure 2B displays mean variance explained across all subjects for 200 gradients. On average, the principal gradient accounted for ~6% of variance of thresholded connectivity matrices. Collectively, 200 gradients accounted for ~80% of variance. produce a 1000 x 1000 connectivity matrix. Principal component analysis (PCA) was applied to the thresholded matrix to extract 200 gradients. B: Variance explained by 200 gradients, mean across subjects ±1 s.d.

Centroid Dispersion

Gradient dispersion has been quantified before with different approaches11,12,32. These prior investigations, akin to the present study, used dispersion to operationalize functional modularity across the cortex. However, these methods necessitated the identification of centroids of networks for which dispersion was computed relative to the regions encompassed within the network. Thus, dispersion values were only assigned to centroids and categorized as within- or between-network dispersion.

Following this procedure, we computed within- and between-network dispersion for seven Yeo networks31 in the 3-dimensional gradient space. Within-network dispersion was quantified as the sum of squares of Euclidean distances from the centroid of the network to the rest of its regions. For the gradient values belonging to a given network, the centroid was defined as three median values of the 11 first three cortical gradients, as in Bethlehem et al. ; the position of the centroid in the 3D latent gradient space is defined by these three values. Between-network dispersion was defined as the Euclidean distance between the network centroids. Thus, centroid dispersion amounted to 28 values per subject: 7 values for within-network and 21 values for between-network dispersion. Centroid dispersion is schematically illustrated in Figure 4B (right).

Neighborhood dispersion

Centroid dispersion can vary depending on how the networks are delineated (e.g., if a different number of networks is used). Hence, we computed neighborhood dispersion with the aim to circumvent this potential confounding factor. Specifically, we calculated dispersion for individual regions which enabled us to maintain the spatial resolution congruent with the gradients (1000 regions per measure).

To compute neighborhood dispersion for every region, we identified K closest neighboring regions (Figure 3) via the K-Nearest Neighbors (KNN) algorithm. Then, we computed the mean Euclidean distance between the focal region and its designated neighboring regions in the gradient space. The resulting value quantified dispersion for that specific region. Neighborhood dispersion was computed for combinations of gradients spanning from 1 to 200.

Unlike previous investigations where the primary source of variability in dispersion originated from network delineation, our study has its own unique challenge in determining the number of the nearest neighbors (K). To address this issue, we included all dispersion values computed based on a range of nearest neighbors from 10 to 170 with a step size of 40. Basing the calculation of dispersion on differing sets of gradients (each including up to 200 gradients), we derived 1000 x 200 x 5 = 1,000,000 dispersion values, which we then used as input to our analytic workflow (along with flattened connectivity matrices and gradients).

Analytic Workflow

The objective of our workflow was to identify the features with the largest predictive capacity from an extensive array of connectivity-based features. Our dataset included vectorized connectivity matrices (N = 499,500), 200 gradients (N = 200,000; 200 values for each region), neighborhood dispersion (N = 1,000,000) and centroid dispersion (N = 28). The schematic representation of the feature selection procedure is depicted in Figure 4. For each participant, we combined all features into one flat vector. Then, we concatenated the vectors for all 936 participants, resulting in a matrix of size [936 x 1,699,528] (Figure 4B, 1). Next, we applied principal component analysis (PCA) to each type of features separately and retained 20% of variance for each type (effectively compressing the feature dimension), except for the centroid dispersion for which all variance was included (Figure 4B, 2). The aim of the decomposition was to retain the same amount of variance for all feature types and to ensure that for each feature type more than one component is extracted when applying PCA. We retained 149 components in total (Nconn = 7, Ngrad = 72, Ncentroid_disp = 28, Ncortex_disp = 42; Figure 4B, 2.). The decomposed dataset was divided into the train (75% of participants) and holdout (25%) sets, with the ratio of NSCZ/NNC = 0.35 in both.

The purpose of the following steps was to quantify the importance of each component for classification performance. We fitted an L2-regularized logistic regression on the train set (Figure 4B, 3). Logistic regression was determined as the best model to compute component importance since the resulting coefficients are interpretable and the logic behind their computation is well understood. We used permutation feature importance (Figure 4C, 1) as the measure of the contribution of each component and feature to classification performance. Initially conceived for random forests33,34, it allows us to estimate the importance of the features in a classifier-agnostic way using the holdout set. First, the classifier is trained on the train set and the baseline accuracy on the holdout set is obtained. Second, each feature of the holdout set is randomly shuffled Nperm times and at each shuffle the permutation accuracy is computed. Permutation feature importance is the difference between the baseline accuracy and the permutation accuracy at each permutation. This importance metric can be estimated for any classifier; it quantifies the extent to which classification performance deteriorates or improves as every feature is shuffled. The feature with the largest permutation importance contributes the most to classification performance (Figure 4C, 1). We computed mean permutation importance for each component (Nperm = 10,000) and inverse transformed it based on the components’ projection matrices to obtain feature importance in the feature space (Figure 4C, 2). Permutation feature importance enabled the selection of features for further assessment of their utility for classification.

Classifier Analysis

Finally, we assessed the predictive capacity of each feature type in a classifier-agnostic manner so as to prevent the results from being driven by the choice of a specific classifier. To this end, we selected 936 features with the largest permutation feature importance from each type and we trained and tested 13 distinct classifiers on them:

Logistic regression (L2-regularized, LR).

K-Neighbors Classifier (KN).

Naïve Bayes (NB).

Decision Tree Classifier (DT).

Support Vector Machine (SVM).

Ridge Classifier (Ridge).

Random Forest Classifier (RF).

Ada Boost Classifier (AB).

Gradient Boosting Classifier (GB).

Light Gradient Boosting Machine (LGB).

Linear Discriminant Analysis (LDA).

Extra Trees Classifier (ET).

Quadratic Discriminant Analysis (QDA).

To verify that the difference in classification performance between feature types persists for larger numbers of features, we repeated this analysis for a range of 100 to 10,000 features with the largest permutation feature importance from each type.

For each feature count, we identified the best classifier based on its mean cross-validation (CV) accuracy across 10 folds. Next, we tested the best classifier on the holdout set. All data transformations were done within Scikit-Learn pipelines. The multi-classifier assessment was done using Pycaret (https://github.com/pycaret/pycaret). Age, sex, dataset of origin, and framewise displacement (FD) were always included as covariates.

For classifier performance, we report both accuracy and the F1-score. The F1-score, calculated using the Scikit-Learn package in Python, represents the harmonic mean of precision and recall. This metric is particularly useful in cases where class distribution is imbalanced, as it accounts for both false positives and false negatives. baselines, namely:

The CV and test performance across all classifiers and feature subsets was compared to two 1) Test performance on the PCA dataset (all 149 components) of the logistic regression. 2) the dummy classifier which randomly picks the class for each sample. It is frequently used as a baseline in machine learning research35,36.

Note that in this study the accuracy of the dummy classifier consistently remained at 73.7%. In contrast, the F1 score for this classifier is always 0, meaning that it does not differentiate between the two classes. These values constitute our chance reference.

III. Results Permutation Feature Importance & Classification Performance

Upon visual inspection, we observed that the principal components of functional connectivity had the largest permutation feature importance, followed by gradients, centroid dispersion, and neighborhood dispersion (Figure 5A). We sought to verify that permutation feature importance indeed reflects an advantage in classification performance regardless of classifier. To this end, we selected the 936 features (as many as subjects) with the largest permutation feature importance for each type and trained and tested 13 classifiers on them (Figure 5). Connectivity outperformed the other feature types. This conclusion was supported by the Mann-Whitney U test (Figure 5B): when trained on connectivity, the classifiers had significantly higher F1-score than when trained on the 1st (principal) gradient (U = 164, p = 0.003), neighborhood dispersion (U = 169, p = 0.001) and centroid dispersion (U = 156, p = 0.009). The other contrasts did not survive the correction for multiple comparisons (Bonferroni: α = 0.013).

Multi-classifier analyses further emphasized the superior predictive capacity of functional connectivity compared to the other feature types. Here we report CV and test accuracy and F1-score across all classifiers and for the best classifier (Figure 5C). Connectivity edges with the largest permutation feature importance consistently outperformed logistic regression fitted on all feature components. The other feature types performed substantially worse.

In addition, we examined the impact of the number of best features on the choice of the best classifier (fits on all feature types are considered). Figure 5D illustrates the evolution of the best classifier with the increasing number of features as the change in relative density of instances where the classifiers were identified as best. Three main patterns can be noted. Firstly, several classifiers clearly performed better with N_features < 3000: GB, SVM, RF, KN and Ridge. Secondly, ET and LGB had a relatively stable winning rate across all feature subsets. Thirdly, LR and LDA had an increased performance when N_features > 3000. However, LDA rarely outperformed the other classifiers, whereas LR and LGB often emerged as optimal.

The Features with The Largest Permutation Feature Importance

To get a better understanding of the contribution of the most important connectivity features to the diagnosis prediction, we conducted an additional exploratory analysis. For three feature subsets including 500, 1000 and 5000 connectivity edges with the largest permutation feature importance, we computed weighted degree centrality. For each subset, the selected edges were transformed back to a 1000x1000 connectivity matrix and for each region the sum of edges was computed, each edge weighted by its correlation value. Thus, for each region, we sought to quantify its connectivity strength to the other regions given the selected edges.

For each edge subset, we plotted the difference in group degree centrality between SCZ and NC (Figure 6). Degree centrality was overall lower in patients with schizophrenia which is in line with previous accounts of lower overall functional connectivity characteristic of this disease37–42. This finding lends support to the dysconnectivity hypothesis43,44.

Secondly, the spatial pattern illustrated in Figure 6 indicated that the significant differences in degree centrality were concentrated in the primary regions for all edge subsets displayed here. Put differently, the edges with the largest permutation feature importance reflect connectivity strength in the primary regions, indicating that their connectivity profiles contribute the most to classification performance. with the largest permutation feature importance. Inset violinplots display weighted degree centrality averaged across regions for the two groups. SCZ: patients with schizophrenia, NC: neurotypical controls, WDG: weighted degree centrality.

IV. Discussion

Here, we extend the effort to determine the optimal connectivity-based predictors of behavior (R. Kong et al., 2023) to psychopathology by attempting to benchmark connectivity-based features against each other for the prediction of the diagnosis of schizophrenia. To this end, we applied a feature selection workflow based on permutation feature importance to a large dataset comprising connectivity, macroscale cortical gradients and gradient dispersion. Our analysis revealed that, despite growing interest in cortical gradients and gradient dispersion12,15,18,45, functional connectivity holds superior predictive potential for schizophrenia over its low-dimensional derivatives (Figure 5B, C). Additionally, we demonstrated that the connectivity edges connecting the primary sensory regions have the largest permutation feature importance. This result indicates that variations in connectivity strength of these regions encapsulate critical information for distinguishing patients from controls. This finding, however, does not discard the relevance of the gradients for the studies looking specifically into functional hierarchical variations. Regarding our use case, the diminished predictive capacity of the gradients could stem from the conservative matrix thresholding applied prior to dimensionality reduction. Future studies are needed to test this hypothesis.

To gain a deeper understanding of the connectivity patterns associated with schizophrenia, we conducted an exploratory analysis whereby we compared weighted degree centrality for the regions linked by the 500, 1000 and 5000 most important edges. Firstly, degree centrality appeared to be lower in patients with schizophrenia, corroborating previous evidence of overall hypoconnectivity typical for the disorder37–42. Secondly, for the edges with the largest importance, these differences were predominantly concentrated in the sensorimotor, auditory, and visual cortex. These findings appear to contrast with the studies highlighting discrepancies predominantly in higher-order areas, such as the default mode network (DMN) 20,46–49

. However, the differences we observed need to be contextualized as the ones relevant for classification performance: while they may or may not constitute the neurological basis of schizophrenia, they are most informative for the differentiation of the two groups. Viewed from this perspective, our results resonate with the research demonstrating that across individuals the measurements are most reliable in the primary, unimodal areas50,51. In addition, one study reported more accurate surface registration for the primary sensory areas52. It has been postulated that this stability can be attributed to the fact that the primary areas are phylogenetically the most ancient cortical areas and are therefore considered as evolutionary anchors around which most of the cortical expansion in humans unfolded31,51,53–55. The largest feature importance in the primary and most stable regions may indicate that the classifiers rely on those individual differences which are replicable across individuals of the same group. In such a setting, intra-individual variability cannot be accounted for. This shortcoming can be potentially addressed by studies focusing on datasets featuring hours of scanning time per individual 56–58 .

We were also interested in how the ranking of the 13 classifiers tested in this work evolved with the number of best features selected based on permutation feature importance. As depicted in Figure 5D, some classifiers’ performance relative to each other fluctuated considerably depending on the size of the feature subset. Overall, no classifier outperformed others at all times for any number of features. It is a clear indication of the necessity of empirical testing of the candidate classifiers. However, it appears that as the number of features increases, the performance of simpler classifiers — such as logistic regression and LDA — improves.

Regarding limitations, a certain degree of caution is warranted when interpreting our results. Firstly, we did not account for the effect of medication in our study since the medication data were missing for a large number of patients. Excluding these subjects would have resulted in a drastically decreased sample size and, more importantly, in an increased imbalance of classes.

Secondly, most classifiers in this study were fitted on datasets with more features than observations which increases the risk of overfitting. Nonetheless, we believe to have addressed this issue by employing a 10-fold CV for each classifier. In addition, we have also witnessed that for some classifiers test performance exceeded CV performance (Figure 5C) which is not characteristic of overfitting.

In addition, we did not consider the issue of comorbidity and transdiagnostic phenomena across psychiatric disorders. While this topic lies beyond the purview of the present study, it is imperative to acknowledge its critical significance. The inquiry into transdiagnostic classification and symptom prediction holds potential to profoundly transform the paradigms governing the diagnosis and treatment of patients. Indeed, as of late, the research looking into transdiagnostic effects has garnered substantial momentum5,59–64.

The emergence of novel connectivity-based methods broadens our toolkit for predicting psychiatric disorders, introducing a necessity for empirical validation. Our findings indicate that functional connectivity outperforms its more recent, low-dimensional derivatives such as cortical gradients and gradient dispersion in predicting schizophrenia. Additionally, in this study, the connectivity within the primary sensory regions showed the highest discrimination capabilities, possibly due to the reduced anatomical and functional variability of those regions. We anticipate, however, that it is also informative for a broad spectrum of major psychiatric disorders. The exploration of this latter possibility warrants thorough examination in future work.

Acknowledgements

We thank Joshua Vogelstein for methodological guidance and Vincent Kreft for contributing to the conceptual development of the study. This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant agreement No. 866533) awarded to D.S.M. This project was also supported by the contrat doctoral spécifique normalien (CDSN) awarded to V.S. by the Ministry of Higher Education and Research of France. The computational aspects of this research were supported by the Wellcome Trust Core Award Grant Number 203141/Z/16/Z and the NIHR Oxford BRC. The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health.

Author contributions

V.S. researched the data, conducted the analyses and wrote the manuscript. C.P., D.W., P-L.B., and D.S.M provided methodological expertise. C.P., R.A.B, R.S., W.W., U.K., F.A., D.W., T.D.S., PL.B., and D.S.M. assisted in interpretation of findings and manuscript revision.

Competing interests

The authors report no competing interests.

Data availability

The data used in this work are in open access, conditioned upon the signature of a data sharing agreement; for

COBRE: http://schizconnect.org/,

SPBRS-1600: https://bicrresource.atr.jp/srpbsopen/, and LA5c: https://openfmri.org/dataset/ds000030/.

Computer code

The code used to perform the analyses and to produce the figures featured in this study is available Github: https://github.com/victoris93/feature-selection-scz.

97 (2018).

Mikolas, P. et al. Machine learning classification of first-episode schizophrenia spectrum disorders and controls using whole brain white matter fractional anisotropy. BMC Psychiatry 18,

Yang, G. J. et al. Functional hierarchy underlies preferential connectivity disturbances in

Li, S. et al. Dysconnectivity of Multiple Brain Networks in Schizophrenia: A Meta-Analysis of Resting-State Functional Connectivity. Front. Psychiatry 10, 482 (2019).

Lynall, M.-E. et al. Functional connectivity and brain networks in schizophrenia. J. Neurosci. 30, 9477–9487 (2010).

Ma, Q. et al. Transdiagnostic Dysfunctions in Brain Modules Across Patients with Schizophrenia, Bipolar Disorder, and Major Depressive Disorder: A Connectome-Based Study. Schizophr. Bull. 46, 699–712 (2020).

Cearns, M., Hahn, T. & Baune, B. T. Recommendations and future directions for supervised machine learning in psychiatry. Transl. Psychiatry 9, 271 (2019).

Cui, Z. & Gong, G. The effect of machine learning regression algorithms and sample size on individualized behavioral prediction with functional connectivity features. Neuroimage 178, 622–637 (2018).

Serin, E., Zalesky, A., Matory, A., Walter, H. & Kruschwitz, J. D. NBS-Predict: A predictionbased extension of the network-based statistic. Neuroimage 244, 118625 (2021).

Jung, H., Wager, T. D. & Carter, R. M. Novel Cognitive Functions Arise at the Convergence of Macroscale Gradients. J. Cogn. Neurosci. 34, 381–396 (2022). 10. Margulies, D. S. et al. Situating the default-mode network along a principal gradient of macroscale cortical organization. Proc. Natl. Acad. Sci. U. S. A. 113, 12574–12579 (2016). 11. Bethlehem, R. A. I. et al. Dispersion of functional gradients across the adult lifespan.

Neuroimage 222, 117299 (2020). Depressive

Disorder.

Biol

Psychiatry

Cogn

Neurosci

Neuroimaging (2022) 13. Paquola, C. et al. Shifts in myeloarchitecture characterise adolescent development of cortical gradients. Elife 8, (2019).

discovery. Neuroimage 223, 117322 (2020). 14. Hong, S.-J. et al. Toward a connectivity gradient-based framework for reproducible biomarker 16. Wang, J., Zhou, Y., Ding, J. & Xiao, J. Functional gradient alteration in individuals with cognitive vulnerability to depression. J. Psychiatr. Res. 144, 338–344 (2021). 17. Xia, M. et al. Connectome gradient dysfunction in major depression and its association with gene expression profiles and treatment outcomes. Mol. Psychiatry 27, 1384–1393 (2022). 18. Xiao, Y. et al. Charting the dorsal-medial functional gradient of the default mode network in major depressive disorder. J. Psychiatr. Res. 153, 1–10 (2022). 19. Bassett, D. S. et al. Hierarchical organization of human cortical networks in health and schizophrenia. J. Neurosci. 28, 9239–9248 (2008). processing. Med. Hypotheses 72, 740–744 (2009).

Neuroinformatics 15, 343–364 (2017).

3, 160110 (2016). 20. Krishnan, R. R., Keefe, R. & Kraus, M. Schizophrenia is a disorder of higher order hierarchical 21. Aine, C. J. et al. Multimodal Neuroimaging in Schizophrenia: Description and Dissemination. 22. Poldrack, R. A. et al. A phenome-wide examination of neural and cognitive function. Sci. Data 23. Tanaka, S. C. et al. A

multi-site, multi-disorder resting-state magnetic resonance image database. Sci Data 8, 227 (2021).

16, 111–116 (2019).

Wang, C. & Mahadevan, S. Manifold alignment using Procrustes analysis. in Proceedings of the 25th international conference on

Machine learning - ICML ’08 (ACM

Press, 2008).

Neuroimage 62, 2222–2231 (2012). 30. Van Essen, D. C. et al. The Human Connectome Project: a data acquisition perspective. 31. Yeo, B. T. T. et al. The organization of the human cerebral cortex estimated by intrinsic functional connectivity. J. Neurophysiol. 106, 1125–1165 (2011). 32. Huang, Z., Mashour, G. A. & Hudetz, A. G. Functional geometry of the cortex encodes dimensions of consciousness. Nat. Commun. 14, 72 (2023). 33. Breiman, L. Random Forests. Mach. Learn. 45, 5–32 (2001). Regression Trees. Biometrics 40, 874 (1984). 35. Alzaid, M. A comparative study between classical feature engineering and RNNs for seizure detection in imbalanced data. in 2020 IEEE Signal Processing in Medicine and Biology Symposium (SPMB) (IEEE, 2020). doi:10.1109/spmb50085.2020.9353616. 36. Viglione, A. et al. Behavioral impulsivity is associated with pupillary alterations and 40. van Dellen, E. et al. Functional brain networks in the schizophrenia spectrum and bipolar disorder with psychosis. NPJ Schizophr. 6, 22 (2020). brain networks. Neuroimage 53, 1197–1207 (2010).

Psychiatry 17, 45 (2017).

Schizophr. Res. 176, 83–94 (2016). 42. Zhuo, C. et al. Brain structural and functional dissociated patterns in schizophrenia. BMC (2019).

16, 921547 (2022). 44. Zhou, Y., Fan, L., Qiu, C. & Jiang, T. Prefrontal cortex and the dysconnectivity hypothesis of schizophrenia. Neurosci. Bull. 31, 207–219 (2015). 45. Hong, S.-J. et al. Atypical functional connectome hierarchy in autism. Nat. Commun. 10, 1022 46. Kong, M. et al. Abnormal network homogeneity of default-mode network and its relationships with clinical symptoms in antipsychotic-naïve first-diagnosis schizophrenia. Front. Neurosci. 47. Roiser, J. P. et al. Dysconnectivity in the frontoparietal attention network in schizophrenia.

Front. Psychiatry 4, (2013). network: aberrant within- and between-network connectivity of the frontoparietal network in

Wang, H. et al. Evidence of a dissociation pattern in default mode subnetwork functional brain. Neuron 77, 586–595 (2013). cortical areas. Neuroimage 111, 107–122 (2015).

brain. Trends Cogn. Sci. 17, 648–665 (2013). 52. Tardif, C. L. et al. Multi-contrast multi-scale surface registration for improved alignment of 53. Buckner, R. L. & Krienen, F. M. The evolution of distributed association networks in the human (2022). 55. Heuer, K. et al. Evolution of neocortical folding: A phylogenetic comparative analysis of MRI from 34 primate species. Cortex 118, 275–291 (2019). prediction of BOLD fMRI signal. in 2022

Conference on

Cognitive

Computational

Neuroscience (Cognitive Computational Neuroscience, 2022). doi:10.32470/ccn.2022.1228-0. 57. Rastegarnia, S., St-Laurent, M., DuPre, E., Pinsard, B. & Bellec, P. Brain decoding of the Human Connectome Project tasks in a dense individual fMRI dataset. Neuroimage 283, 120395

schizophrenia. Proc. Natl. Acad. Sci . U. S. A. 113 , E219 - 28 ( 2016 ). 12. Pasquini , L. et al. Dysfunctional Cortical Gradient Topography in Treatment-Resistant Major 15 . Dong , D. et al. Compressed sensorimotor-to-transmodal hierarchical organization in 24 . Esteban , O. et al. fMRIPrep: a robust preprocessing pipeline for functional MRI . Nat. Methods 25 . Gorgolewski , K. et al. Nipype: a flexible, lightweight and extensible neuroimaging data processing framework in python . Front. Neuroinform . 5 , 13 ( 2011 ). functional connectivity MRI. Cereb. Cortex 28 , 3095 - 3114 ( 2018 ). 26 . Schaefer , A. et al. Local-global parcellation of the human cerebral cortex from intrinsic 27 . Girn , M. et al. Serotonergic psychedelic drugs LSD and psilocybin reduce the hierarchical differentiation of unimodal and transmodal cortex . Neuroimage 256 , 119220 ( 2022 ). 28 . Huntenburg , J. M. , Bazin , P.-L. & Margulies , D. S. Large-Scale Gradients in Human Cortical Organization . Trends Cogn. Sci . 22 , 21 - 31 ( 2018 ). 34. Gordon , A. D. , Breiman , L. , Friedman , J. H. , Olshen , R. A. & Stone , C. J. Classification and hyperactivity in CDKL5 mutant mice . Hum. Mol. Genet . 31 , 4107 - 4120 ( 2022 ). 37 . Cai , M. et al. Disrupted local functional connectivity in schizophrenia: An updated and extended 38 . Du , X. et al. Unraveling schizophrenia replicable functional connectivity disruption patterns meta-analysis . Schizophrenia (Heidelb) 8 , 93 ( 2022 ). across sites . Hum. Brain Mapp . 44 , 156 - 169 ( 2023 ). 39. Liu , Y. et al. Disrupted small-world networks in schizophrenia . Brain 131 , 945 - 961 ( 2008 ). 41. Zalesky , A. , Fornito , A. & Bullmore , E. T. Network-based statistic: identifying differences in 43 . Friston, K. , Brown , H. R. , Siemerkus , J. & Stephan , K. E. The dysconnection hypothesis ( 2016 ). 48. Tu , P.-C., Lee , Y.-C. , Chen , Y.-S. , Li , C.-T. & Su , T.-P. Schizophrenia and the brain's control schizophrenia . Schizophr. Res . 147 , 339 - 347 ( 2013 ). connectivity in schizophrenia . Sci. Rep . 5 , 14655 ( 2015 ). 50 . Gratton , C. et al. Functional Brain Networks Are Dominated by Stable Group and Individual Factors, Not Cognitive or Daily Variation . Neuron 98 , 439 - 452 . e5 ( 2018 ). 51 . Mueller , S. et al. Individual variability in functional connectivity architecture of the human 54 . Preuss , T. M. & Wise , S. P. Evolution of prefrontal cortex . Neuropsychopharmacology 47 , 3 - 19 56. Paugam , F. , Pinsard , B. , Lajoie , G. & Bellec , P. Individual auto-regressive models for long-term 58 . Pinho , A. L. et al. Individual Brain Charting dataset extension, second release of high-resolution fMRI data for cognitive mapping . Sci. Data 7 , 353 ( 2020 ). 59 . Buckholtz , J. W. & Meyer-Lindenberg , A. Psychopathology and the human connectome: toward 60 . Hettwer , M. D. et al. Coordinated cortical thickness alterations across six neurodevelopmental and psychiatric disorders . Nat. Commun . 13 , 6851 ( 2022 ). 61 . Elliott , M. L. , Romer , A. , Knodt , A. R. & Hariri , A. R. A Connectome-wide Functional Signature of Transdiagnostic Risk for Mental Illness . Biol. Psychiatry 84 , 452 - 459 ( 2018 ). 62. Park , B.-Y. et al. Multiscale neural gradients reflect transdiagnostic effects of major psychiatric conditions on cortical morphology . Commun Biol 5 , 1024 ( 2022 ). 63 . Segal , A. et al. Regional, circuit and network heterogeneity of brain abnormalities in psychiatric disorders . Nat. Neurosci . 26 , 1613 - 1629 ( 2023 ). networks. Nat. Commun . 9 , 3003 ( 2018 ). 64 . Xia , C. H. et al. Linked dimensions of psychopathology and connectivity in functional brain