Skip to main content

Functional and structural MRI based obsessive-compulsive disorder diagnosis using machine learning methods

Abstract

Background

The success of neuroimaging in revealing neural correlates of obsessive-compulsive disorder (OCD) has raised hopes of using magnetic resonance imaging (MRI) indices to discriminate patients with OCD and the healthy. The aim of this study was to explore MRI based OCD diagnosis using machine learning methods.

Methods

Fifty patients with OCD and fifty healthy subjects were allocated into training and testing set by eight to two. Functional MRI (fMRI) indices, including amplitude of low-frequency fluctuation (ALFF), fractional ALFF (fALFF), regional homogeneity (ReHo), degree of centrality (DC), and structural MRI (sMRI) indices, including volume of gray matter, cortical thickness and sulcal depth, were extracted in each brain region as features. The features were reduced using least absolute shrinkage and selection operator regression on training set. Diagnosis models based on single MRI index / combined MRI indices were established on training set using support vector machine (SVM), logistic regression and random forest, and validated on testing set.

Results

SVM model based on combined fMRI indices, including ALFF, fALFF, ReHo and DC, achieved the optimal performance, with a cross-validation accuracy of 94%; on testing set, the area under the receiver operating characteristic curve was 0.90 and the validation accuracy was 85%. The selected features were located both within and outside the cortico-striato-thalamo-cortical (CSTC) circuit of OCD. Models based on single MRI index / combined fMRI and sMRI indices underperformed on the classification, with a largest validation accuracy of 75% from SVM model of ALFF on testing set.

Conclusion

SVM model of combined fMRI indices has the greatest potential to discriminate patients with OCD and the healthy, suggesting a complementary effect of fMRI indices on the classification; the features were located within and outside the CSTC circuit, indicating an importance of including various brain regions in the model.

Peer Review reports

Background

Obsessive-compulsive disorder (OCD) is a common psychiatric condition with a lifetime prevalence between 1 and 3% in the general population [1, 2]. It is characterized by obsessions and/or compulsions with a continuous course if untreated [3]. OCD is among the most disabling psychiatric disorders [4], which leads to an enormous impairment in quality of life [5, 6] and constitutes a major health-economic burden on society [2, 7]. In recent years, magnetic resonance imaging (MRI) studies have provided numerous evidence of functional and structural abnormalities in various brain regions in OCD, mainly within the cortico-striato-thalamo-cortical (CSTC) circuit [8,9,10,11,12]. The success of neuroimaging in revealing the neural correlates of OCD has raised hopes of using MRI indices to discriminate OCD patients and the healthy.

In order to identify the neuroimaging biomarkers and implement the diagnosis classification, machine learning methods have been introduced. Machine learning methods have the advantage of being able to detect subtle and spatially distributed effects of neuroimaging data [13] and allow inference at the individual level rather than the group [14]. For example, support vector machine (SVM), one of the most widely used machine learning models, has been applied to build the diagnosis model of psychiatric disorders, such as schizophrenia [15, 16], depression [17, 18] and autism [19, 20]. Therefore, in this study, we applied MRI data and machine learning methods, including SVM and other two widely used classifiers, that is logistic regression (LR) and random forest (RF), to discriminate OCD patients and healthy subjects.

To be specific, amplitude of low-frequency fluctuation (ALFF), fractional ALFF (fALFF), regional homogeneity (ReHo) and degree of centrality (DC) were extracted from resting-state functional MRI (rs-fMRI) as the functional neuroimaging markers; volume of gray matter (VGM), cortical thickness and sulcal depth were extracted from T1-weighted images as the structural neuroimaging markers. These MRI indices were used because previous studies have successfully revealed altered ALFF [21,22,23], fALFF [24,25,26,27], ReHo [28,29,30,31], DC [32, 33], VGM [34,35,36] and cortical thickness [37,38,39] in various brain regions, including the traditional CSTC circuit and newly reported regions, such as the occipital, parietal, and temporal lobes and the cerebellum, in patients with OCD. Sulcal depth may provide valuable information for classification since previous studies indicate the association of altered sulcus morphology and psychotic disease [40,41,42]. In addition, these functional MRI (fMRI) and structural MRI (sMRI) indices can be easily calculated and explained, with no need for a priori selection of brain region like functional connectivity, as such has an advantage of clinical application.

Furthermore, it is unknown which index can achieve the optimal performance among these fMRI and sMRI indices, and whether a combination of multilevel MRI indices can improve the classification performance. At present, only a small number of studies applied fMRI or sMRI data to discriminate patients with OCD and healthy subjects, which mainly focused on separate index [43,44,45,46,47,48]. For example, one study found training of an SVM classifier to distinguish OCD patients from healthy subjects achieved excellent performance when using ALFF maps and good performance when using ReHo maps, with cross-validation accuracy of 95.37% and 86.11%, respectively [43]. Another study applied VGM to discriminate OCD patients and healthy subjects, achieving a cross-validation accuracy of 75.76% [44].

It is also important to explain the contribution of the MRI markers when constructing the OCD diagnosis models. However, due to the “black box” problem of machine learning models, such as SVM, previous studies seldom explored the contribution of the MRI markers used in the classification models. The Shapley value is a fair profit allocation among many stakeholders depending on their contribution and was derived from the name of the economist who introduced it. By using the idea of the Shapley value, approaches were proposed to interpret the predictions from any “black box” model [49, 50]. The key component of general explanations is the contributions (equivalent to the Shapley value) of individual input features. A prediction is explained by assigning to each feature a number which denote its influence. For each feature, such contributions can be aggregated to plot the feature’s average contribution against the feature’s value. This provides an overview of the model and explanation of the predictions.

Therefore, the aim of this study was to build and validate classification models based on fMRI and sMRI indices to discriminate patients with OCD and healthy subjects. Specifically, first, to investigate which MRI index achieves the optimal performance; second, to investigate whether a combination of multilevel MRI indices improves the performance of the classification; and third, to investigate the contribution of the MRI markers on classification.

Methods

Subjects

Fifty patients with OCD and 50 healthy control subjects (HCS) were included in this study. The patients and healthy subjects were recruited from Beijing Anding Hospital and local community, respectively. All the participants were diagnosed and classified using Structured Clinical Interview for Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition (DSM-IV) Axis I Disorders (SCID), Patient Edition and Nonpatient Edition. The inclusion criteria for patients with OCD were as follows: (1) right-hand, (2) 18–60 years old, (3) Yale-Brown Obsessive-Compulsive Scale (Y-BOCS) score ≥ 16, (4) 17-item Hamilton Depression Rating Scale (HAMD-17) score < 18. The exclusion criteria for patients with OCD were as follows: (1) having taken or were taking psychiatric or psychological treatment, (2) having other mental disorders, neurological illnesses or major physical diseases. The inclusion criteria for healthy subjects were as follows: (1) right-hand, (2) 18–60 years old, (3) no history of any mental disorders, neurological illnesses, or major physical diseases.

MRI acquisition

All the subjects were scanned on a Siemens Trio 3-Tesla scanner with a 32-channel head coil. T1-weighted images were obtained using a sagittal 3D magnetization prepared rapid gradient echo (MP-RAGE) sequence: slice number = 144, time repetition (TR) = 2530 ms, time echo (TE) = 3.39 ms, flip angle (FA) = 7°, slice thickness = 1.33 mm, field of view (FOV) = 256 × 256 mm2, in-plane resolution = 256 × 256, time to inversion (TI) = 1100 ms, voxel size = 1 × 1 × 1.33 mm3. The resting-state functional images were obtained using an echo-planar imaging (EPI) sequence (8 min): slice number = 33, thickness/gap = 3.5/0.6 mm, TR = 2000 ms, TE = 30 ms, FOV = 200 × 200 mm2, in-plane resolution = 64 × 64, FA = 90°, and 200 volumes.

Preprocessing and feature extraction for rs-fMRI data

DPABI [51] was used for rs-fMRI data preprocessing with the following steps: removing the first 10 time points; slice timing; head motion correction; co-registered T1 images to functional images; segmentation with DARTEL; nuisance covariates regression (Friston 24, white matter and cerebrospinal fluid); normalization by DARTEL; and detrend. All the subjects met the head motion criteria of less than 2° of maximal rotation and 2 mm of maximal translation. ALFF [52], fALFF [53], ReHo [54] and DC [55] were extracted using DPABI. After preprocessing and smoothing, mean ALFF and fALFF maps were generated. ALFF detects the neural fluctuations within 0.01–0.08 Hz, reflecting intensity of regional spontaneous brain activity. fALFF is a ratio of amplitude within 0.01–0.08 Hz to the total amplitude within the full frequency band, indicating the relative contribution of spontaneous brain activity. After preprocessing and filter, mean ReHo and mean weighted DC with correlation coefficient > 0.25 were calculated and smoothed with a Gaussian kernel of 4 mm. ReHo reflects regional brain activity by calculating the Kendall coefficient of concordance between a particular voxel and its nearby neighbors (26 voxels). Weighted DC is defined as the sum of weights from edges connecting to a node and represents the node strength. We eventually obtained 116 ALFF, 116 fALFF, 116 ReHo and 116 DC features by averaging all the voxels in each region of interest based on Anatomical Automatic Labeling (AAL) atlas. Thus, a total of 464 fMRI features were extracted.

Preprocessing and feature extraction for sMRI data

CAT12 toolbox for SPM12 was used for sMRI data preprocessing with the following steps: segmentation with a prior tissue probability map; calculating total intracranial volume (TIV) in native-space; registering native-space segmentations to a standard Montreal Neurological Institute (MNI) template; modulation to mitigate volume changes caused by spatial normalization. Finally, VGM, cortical thickness, and sulcal depth were extracted using CAT12. A total of 412 sMRI features were extracted based on the atlas. Among of them, 116 VGM features were obtained based on AAL atlas, 148 cortical thickness and 148 sulcal depth features were obtained based on Destrieux aparc.a2009s atlas [56].

Feature selection

The whole dataset was divided into training set and testing set according to the number sequence of the subjects. The first 80% of the participants (including 40 patients with OCD and 40 HCS) were selected as the training set and the last 20% of the participants (including 10 patients with OCD and 10 HCS) were selected as the testing set. We used the same training set to select features and train classifiers. The testing set was used to validate the performance of the classification models.

Least absolute shrinkage and selection operator (LASSO) logistic regression was conducted to select features. LASSO is a popular method for regression that uses an L1 penalty to achieve a sparse solution and shrinks the coefficient estimates toward zero, with the degree of shrinkage dependent on an additional penalty parameter, lambda (λ) [57, 58]. The L1 penalty is the sum of the absolute coefficients (\({w}_{j}\)): \({\left|w\right|}_{1}={\sum }_{j=1}^{p}\left|{w}_{j}\right|\). LASSO uses this L1 penalty by adding λ to control the penalization: \(\widehat{w}={arg}\underset{w}{{min}}{\sum }_{i=1}^{n}{({y}_{i}-{\sum }_{j}{x}_{ij}{w}_{j})}^{2}+\lambda {\sum }_{j=1}^{p}\left|{w}_{j}\right|.\) Firstly, LASSO regression was conducted to select inputting variables for single MRI index models. Then, the selected features of each index were reduced again by LASSO for the combined MRI indices models.

We utilized software R (version 4.3.1), package “glmnet” (version 4.1-7) to fit the LASSO regression. The optimal λ was determined through leave-one-out cross-validation (LOOCV), which means leaving one subject out as the testing data, and the others as the training data. The binomial deviance was computed as the measure to be minimized when cross-validating the selected model. Model with a lower deviance fits better. The cross-validation produces two optional λ values, λmin and λ1se. λmin minimizes the average binomial deviance of LOOCV and λ1se represents largest λ that is still within one standard error of the minimum binomial deviance. λmin was considered as the optimal λ in this study because it results in weaker penalty than λ1se, thus allowing us to include enough features. After obtaining the optimal λ, the LASSO regression was fitted again for all the subjects using the selected λ and features with nonzero coefficient were retained.

Classification and validation

Radial basis function (RBF) kernel SVM, LR and RF classifiers were used to build the classification models on training set (including 40 patients with OCD and 40 HCS). Classification models based on single MRI index and combined MRI indices were established. We utilized software R (version 4.3.1), package “caret” (version 6.0–94) to fit the RBF kernel SVM, LR and RF models.

There are two tuning parameters for RBF kernel SVM in “caret”, sigma and cost (C). We used on the function “sigest” from the package “kernlab” (version 0.9–32) to estimate the range of values for the sigma parameter which would return good results when used with SVM models [59]. The estimation is based upon the 0.1 and 0.9 quantile of \(||x - x'|{|^2}\). Basically, any value in between those two bounds will produce good results. Then sigma and C were tuned by the grid search through LOOCV. The grid search contained 54 combinations with six sigma values ranging from minimum sigma to maximum sigma generated by function “sigest”, and nine C values (0.25, 0.5, 1, 2, 4, 8, 16, 32, 64). The hyperparameter (mtry) of RF was tuned by the grid search of 1 to k (k = number of features) through LOOCV. Area under the receiver operating characteristic (ROC) curve (AUC) was used to select the optimal model using the largest value of AUC.

Average accuracy, sensitivity and specificity of LOOCV were calculated to show the performance of fitted models on training set. Then, we validated the performance of the classification models on testing set (including 10 patients with OCD and 10 HCS). Accuracy, sensitivity and specificity were calculated and ROC curves with AUC values were drawn for the classification models on testing set.

The contribution of the features

We investigated the contribution of features for the classification model that obtained optimal performance. We used software R (version 4.3.1), package “fastshap” (version 0.1.0) to compute fast approximate Shapley values of features for each individual, and visualized the mean absolute Shapley value of all the subjects to show the contribution of the feature. A higher mean absolute Shapley value indicates a larger contribution to the classification.

Other statistical analyses

Independent sample t test was used to compare the characteristics of subjects between OCD and HCS group. A two-tailed P value < 0.05 was considered significant. software R (version 4.3.1) was used to conduct these statistical analyses. BrainNet viewer [60] was used to visualize the brain regions.

Results

Basic information of participants

Basic information of subjects is shown in Table 1. A total of 40 patients with OCD and 40 HCS were included in the training set; a total of 10 patients with OCD and 10 HCS were included in the testing set. There were no statistically significant differences in age, sex, head motion or TIV between patients with OCD and HCS (P > 0.05) in both training and testing set. The mean Y-BOCS and HAMD score of patients with OCD in training set was 23.85 and 6.60, respectively; the mean Y-BOCS and HAMD score of patients with OCD in testing set was 23.50 and 6.80, respectively.

Table 1 Basic information of participants

Features selection by LASSO

Figure 1 shows the optimal λ selection of LASSO through LOOCV. There were 27, 1, 12, 2, 12, 29, and 35 features selected for ALFF, fALFF, ReHo, DC, sulcal depth, combined fMRI indices (including 19 ALFF, 1 fALFF, 7 ReHo and 2 DC features) and combined fMRI and sMRI indices (including 16 ALFF, 1 fALFF, 8 ReHo, 1 DC, and 9 sulcal depth features). There were no VGM or cortical thickness features selected by LASSO. The optimal λ values used in LASSO for feature selection are shown in supplementary materials file 1. The selected features are shown in supplementary materials file 2.

Fig. 1
figure 1

Optimal lambda selection of LASSO through cross-validation. Notes: ALFF, Amplitude of Low Frequency Fluctuation; DC, Degree of Centrality; fALFF, Fractional Amplitude of Low Frequency Fluctuation; fMRI, Functional Magnetic Resonance Imaging; ReHo, Regional Homogeneity; sMRI, Structural Magnetic Resonance Imaging. The longitudinal coordinate represents the binomial deviance and the upper and lower standard deviation. The left dashed line points to λmin and the right dashed line points to λ1se

The performance of classification models based on single MRI index

Table 2 exhibits the AUC, accuracy, sensitivity and specificity of classification models based on single MRI index. Figure 2 shows ROC curves of these classification models on testing set. The optimal hyperparameters used in SVM and RF models are shown in supplementary materials file 3. For single MRI index models, SVM model of ALFF achieved the best performance. On training set, the cross-validation AUC and accuracy was 0.98 and 93%, respectively; on testing set, the AUC was 0.87 and the accuracy was 75%. Although SVM model of ReHo and SVM model of sulcal depth achieved good performance by LOOCV on training set, with AUC values larger than 0.90, the performance on testing set was poor. Classification models of fALFF / DC yielded poor performance.

Table 2 Performance of classification models based on single MRI index
Fig. 2
figure 2

ROC curves of classification models based on single MRI index on testing set. Notes: ALFF, Amplitude of Low Frequency Fluctuation; AUC, Area Under the Curve; DC, Degree of Centrality; fALFF, Fractional Amplitude of Low Frequency Fluctuation; LR, Logistic Regression; ReHo, Regional Homogeneity; RF, Random Forest; SVM, Support Vector Machine

The performance of classification models based on combined MRI indices

Table 3 exhibits the AUC, accuracy, sensitivity and specificity of classification models based on combined MRI indices. Figure 3 shows ROC curves of these classification models on testing set. The optimal hyperparameters used in SVM and RF models are shown in supplementary materials file 3. SVM model of combined fMRI indices achieved excellent cross-validation performance on training set and good performance on testing set. On training set, the cross-validation AUC and accuracy was 0.99 and 94%, respectively; on testing set, the AUC was 0.90 and the accuracy was 85%. Classification models based on combined fMRI and sMRI indices underperformed on testing set. We did not construct combined sMRI indices models because only sulcal depth features were selected during the feature selection procedure.

Table 3 Performance of classification models based on combined MRI indices
Fig. 3
figure 3

ROC curves of classification models based on combined MRI indices on testing set. Notes: AUC, Area Under the Curve; fMRI, Functional Magnetic Resonance Imaging; LR, Logistic Regression; RF, Random Forest; sMRI, Structural Magnetic Resonance Imaging; SVM, Support Vector Machine. fMRI indices included ALFF, fALFF, ReHo and DC; fMRI + sMRI indices included ALFF, fALFF, ReHo, DC and sulcal depth

The contribution of features in SVM model based on combined fMRI indices

Figure 4 shows the location of the features included in SVM model of combined fMRI indices. Figure 5 shows the contribution of these features in the model. The top ten contribution came from ALFF of right Cerebelum_9_R, ALFF of left cuneus, ALFF of Vermis_6, DC of left temporal pole middle temporal gyrus, fALFF of right anterior cingulate and paracingulate gyri, ReHo of left superior parietal gyrus, ALFF of right thalamus, ReHo of right middle frontal gyrus orbital part, ALFF of right Cerebelum_6_R, and ALFF of left middle occipital gyrus.

The middle ten contribution came from DC of left superior frontal gyrus orbital part, ALFF of left Cerebelum_6_L, ALFF of right fusiform gyrus, ALFF of left inferior frontal gyrus triangular part, ReHo of left posterior cingulate gyrus, ALFF of right middle frontal gyrus orbital part, ReHo of right caudate nucleus, ReHo of Vermis_3; ALFF of left posterior cingulate gyrus, and ALFF of Vermis_1_2.

The bottom nine contribution came from ReHo of right gyrus rectus, ALFF of right amygdala, ALFF of right Rolandic operculum, ALFF of right lingual gyrus, ALFF of right olfactory cortex, ALFF of right superior frontal gyrus orbital part, ALFF of left temporal pole middle temporal gyrus, ReHo of right middle frontal gyrus, and ALFF of left inferior frontal gyrus opercular part.

Fig. 4
figure 4

Features included in SVM model of combined fMRI indices. Notes: ALFF, Amplitude of Low Frequency Fluctuation; DC, Degree of Centrality; fALFF, Fractional Amplitude of Low Frequency Fluctuation; ReHo, Regional Homogeneity

Fig. 5
figure 5

Contribution of features in SVM model of combined fMRI indices. Notes: ALFF, Amplitude of Low Frequency Fluctuation; DC, Degree of Centrality; fALFF, Fractional Amplitude of Low Frequency Fluctuation; ReHo, Regional Homogeneity

Discussion

To the best of our knowledge, this is the first study to comprehensively utilize multilevel MRI indices to distinguish OCD patients and healthy subjects. We applied different classifiers to build the classification models. SVM model based on combined fMRI indices, including ALFF, fALFF, ReHo and DC, exhibited optimal classification performance.

For single MRI index, ALFF achieved better performance than fALFF, ReHo, DC and sulcal depth. Bu et al., (2019) compared the classification performance of different rs-fMRI index and found that SVM model using voxel-wise ALFF map achieved the best performance on OCD diagnosis [43]. They argued that ALFF directly correlates with the intensity of spontaneous neural activity in the resting-state and is related to the rate of regional glucose metabolism, which could make ALFF more sensitive to detect dysfunctional neural activity than the other functional parameters [61].

Although previous studies have reported VGM [34,35,36] and cortical thickness [37,38,39] changes in patients with OCD compared with healthy controls, the VGM and cortical thickness features were not retained during the feature selection stage in our study. Some sulcal depth features were selected, however, the classification models of sulcal depth underperformed on testing set. Previous studies also showed limited accuracy when using structural neuroanatomy and machine learning methods to build diagnosis model of OCD [44, 47]. Considering the current results and previous findings, the value of using single sMRI index to distinguish OCD and the healthy needs to be further verified.

Although ALFF exhibited stronger classification power than other MRI indices, the performance of ALFF models on testing set was less than satisfactory. The results indicate that single MRI index cloud not provide enough information to discriminate patients with OCD and the healthy. ALFF, fALFF, ReHo, DC and sulcal depth reflect the neuroimaging changes of OCD from different perspective. ALFF reflects intensity of regional spontaneous brain activity [52]; fALFF reflects the relative contribution of the oscillations [53]; ReHo represents local coherence of spontaneous brain activity [54]; weighted DC shows functional connectivity strength of a certain brain region to the whole brain [55]; sulcal depth has been widely used to study the morphological characteristic of the cerebral folding [62, 63]. It seems to be a reasonable hypothesis that the combination of multilevel MRI indices can improve the classification performance.

As expected, a combination of multilevel fMRI indices improved the classification performance compared with single MRI index, indicating a complementary effect of ALFF, fALFF, ReHo and DC on the classification of OCD and HCS. Unfortunately, classification models based on combined fMRI and sMRI indices underperformed on testing set. The results suggest that sMRI index (that is sulcal depth in this study) may have an interference effect on fMRI indices for the classification of OCD and HCS.

For different classifiers, SVM exhibited superior performance than LR and RF in this study. SVM aims to classify data points by maximizing the margin between classes in a high dimensional space [14]. Evidence of comparison among machine learning approaches showed that SVM helps weigh down the effect of noisy features that are highly correlated with each other when there are a large number of features [64] and outperforms other machine learning classifiers on MRI-based brain tumor [65] and autism classification [20].

In SVM model of combined fMRI indices, the features were located both within the traditional CSTC circuit of OCD (such as anterior cingulate cortex, thalamus, orbitofrontal cortex, prefrontal cortex, caudate nucleus), and outside the CSTC circuit (distributing among parietal, temporal, occipital cortex and cerebellum). It is consistent with researches which investigated the neuroimaging alterations in patients with OCD. The dysfunction of CSTC circuit has been classically considered to underpin the clinical manifestations of OCD [9, 10, 12]. However, in recent years, evidence has been accumulating pointing to other regions outside the CSTC circuit, such as parietal, temporal, occipital cortex and cerebellum [27, 66, 67]. Furthermore, in top / middle / bottom contribution group, the features were also located both within and outside the CSTC circuit of OCD. The results suggest that future researches need to consider the role of indicators in various brain regions, as well as to further explore the neurophysiological mechanisms of OCD.

Limitation and future direction

Despite the novel findings, it is important to note limitations within this study. First, this study was designed and analyzed on relatively small, single center data. Second, we included patients with diagnostic OCD without any comorbidities or prior treatment history, which may decrease the generalizability of our findings. In the real world, the diagnosis of OCD is much more complicated. Some individuals may experience obsessive-compulsive symptom, however, not meet the diagnostic criteria. Therefore, the explanation of the models in this study should be limited to diagnostic OCD without comorbidities and treatment history. Therefore, including a larger, multicenter sample with different clinical characteristics to validate the generalizability of the model is needed.

For the future direction, developing a computer-aid neuroimaging automated diagnosis system to implement outputs of diagnosis from inputs of MRI images is worth exploring. The procedure should include MRI scanning, raw images preprocessing, feature extracting, representation inputting and diagnosis outputting. Some challenges and restrictions should be considered. First, the patients have to take MRI scanning using the same sequence with the same parameter setting; second, current preprocessing and features extracting are based on manual manipulation in the software and usually time-consuming; third, it is better to develop an automated diagnosis software which can connect the different steps to simplify the computer-aid diagnosis process.

Conclusion

In summary, SVM model of combined fMRI indices, including ALFF, fALFF, ReHo and DC, exhibited good performance on discriminating OCD patients and the healthy. The selected brain regions in SVM model of combined fMRI indices were located both within and outside the traditional CSTC circuit of OCD.

Data Availability

The datasets generated and / or analyzed during the current study are not publicly available due to exposition of subjects’ private information, but are available from the corresponding author on reasonable request.

Abbreviations

AAL atlas:

Anatomical Automatic Labeling atlas

ALFF:

Amplitude of low-frequency fluctuation

AUC:

Area under the curve

CSTC circuit:

Cortico-striato-thalamo-cortical circuit

DC:

Degree of centrality

fALFF:

Fractional amplitude of low-frequency fluctuation

fMRI:

Functional magnetic resonance imaging

HAMD:

Hamilton depression rating scale

HCS:

Healthy control subjects

LASSO:

Least absolute shrinkage and selection operator

LR:

Logistic regression

LOOCV:

Leave-one-out cross-validation

MNI:

Montreal Neurological Institute

MRI:

Magnetic resonance imaging

OCD:

Obsessive-compulsive disorder

RBF:

Radial basis function

ReHo:

Regional homogeneity

RF:

Random forest

ROC curve:

Receiver operating characteristic curve

rs-fMRI:

Resting-state functional magnetic resonance imaging

SD:

Standard deviation

sMRI:

Structural magnetic resonance imaging

SVM:

Support vector machine

TIV:

Total intracranial volume

Y-BOCS:

Yale-brown obsessive-compulsive scale

VGM:

Volume of gray matter

References

  1. Fawcett EJ, Power H, Fawcett JM. Women are at greater risk of OCD than men: a meta-analytic review of OCD prevalence worldwide. J Clin Psychiatry. 2020;81(4):19r13085.

    Article  PubMed  Google Scholar 

  2. Ruscio AM, Stein DJ, Chiu WT, Kessler RC. The epidemiology of obsessive-compulsive disorder in the National Comorbidity Survey Replication. Mol Psychiatry. 2010;15(1):53–63.

    Article  CAS  PubMed  Google Scholar 

  3. Fineberg NA, Hengartner MP, Bergbaum C, Gale T, Rössler W, Angst J. Remission of obsessive-compulsive disorders and syndromes; evidence from a prospective community cohort study over 30 years. Int J Psychiatry Clin Pract. 2013;17(3):179–87.

    Article  PubMed  Google Scholar 

  4. Whiteford HA, Degenhardt L, Rehm J, Baxter AJ, Ferrari AJ, Erskine HE, et al. Global burden of Disease attributable to mental and substance use disorders: findings from the global burden of Disease Study 2010. Lancet. 2013;382(9904):1575–86.

    Article  PubMed  Google Scholar 

  5. Macy AS, Theo JN, Kaufmann SC, Ghazzaoui RB, Pawlowski PA, Fakhry HI, et al. Quality of life in obsessive compulsive disorder. CNS Spectr. 2013;18(1):21–33.

    Article  PubMed  Google Scholar 

  6. Remmerswaal KCP, Batelaan NM, Hoogendoorn AW, van der Wee NJA, van Oppen P, van Balkom A. Four-year course of quality of life and obsessive-compulsive disorder. Soc Psychiatry Psychiatr Epidemiol. 2020;55(8):989–1000.

    Article  PubMed  Google Scholar 

  7. Lenhard F, Aspvall K, Andersson E, Ahlen J, Serlachius E, Lavner M, et al. The cost of obsessive-compulsive disorder in Swedish youth. Child Psychiatry Hum Dev. 2023;54(1):248–54.

    Article  PubMed  Google Scholar 

  8. de Wit SJ, Alonso P, Schweren L, Mataix-Cols D, Lochner C, Menchón JM, et al. Multicenter Voxel-based morphometry mega-analysis of structural brain scans in obsessive-compulsive disorder. Am J Psychiatry. 2014;171(3):340–9.

    Article  PubMed  Google Scholar 

  9. Gürsel DA, Avram M, Sorg C, Brandl F, Koch K. Frontoparietal areas link impairments of large-scale intrinsic brain networks with aberrant fronto-striatal interactions in OCD: a meta-analysis of resting-state functional connectivity. Neurosci Biobehav Rev. 2018;87:151–60.

    Article  PubMed  Google Scholar 

  10. Liu J, Cao L, Li H, Gao Y, Bu X, Liang K, et al. Abnormal resting-state functional connectivity in patients with obsessive-compulsive disorder: a systematic review and meta-analysis. Neurosci Biobehav Rev. 2022;135:104574.

    Article  PubMed  Google Scholar 

  11. Picó-Pérez M, Moreira PS, de Melo Ferreira V, Radua J, Mataix-Cols D, Sousa N, et al. Modality-specific overlaps in brain structure and function in obsessive-compulsive disorder: Multimodal meta-analysis of case-control MRI studies. Neurosci Biobehav Rev. 2020;112:83–94.

    Article  PubMed  Google Scholar 

  12. Soriano-Mas C. Functional brain imaging and OCD. Curr Top Behav Neurosci. 2021;49:269–300.

    Article  PubMed  Google Scholar 

  13. Lao Z, Shen D, Xue Z, Karacali B, Resnick SM, Davatzikos C. Morphological classification of brains via high-dimensional shape transformations and machine learning methods. NeuroImage. 2004;21(1):46–57.

    Article  PubMed  Google Scholar 

  14. Orrù G, Pettersson-Yeo W, Marquand AF, Sartori G, Mechelli A. Using support Vector Machine to identify imaging biomarkers of neurological and psychiatric Disease: a critical review. Neurosci Biobehav Rev. 2012;36(4):1140–52.

    Article  PubMed  Google Scholar 

  15. Jin K, Xu D, Shen Z, Feng G, Zhao Z, Lu J, et al. Distinguishing hypochondriasis and schizophrenia using regional homogeneity: a resting-state fMRI study and support vector machine analysis. Acta Neuropsychiatr. 2021;33(4):182–90.

    Article  PubMed  Google Scholar 

  16. Lee J, Chon MW, Kim H, Rathi Y, Bouix S, Shenton ME, et al. Diagnostic value of structural and diffusion imaging measures in schizophrenia. Neuroimage Clin. 2018;18:467–74.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Gao Y, Wang X, Xiong Z, Ren H, Liu R, Wei Y, et al. Abnormal fractional amplitude of low-frequency fluctuation as a potential imaging biomarker for first-episode major depressive disorder: a resting-state fMRI study and support vector machine analysis. Front Neurol. 2021;12:751400.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Yang J, Zhang M, Ahn H, Zhang Q, Jin TB, Li I, et al. Development and evaluation of a multimodal marker of major depressive disorder. Hum Brain Mapp. 2018;39(11):4420–39.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Anderson JS, Nielsen JA, Froehlich AL, DuBray MB, Druzgal TJ, Cariello AN, et al. Functional connectivity magnetic resonance imaging classification of autism. Brain. 2011;134(Pt 12):3742–54.

    Article  PubMed  Google Scholar 

  20. Wang C, Xiao Z, Wu J. Functional connectivity-based classification of autism and control using SVM-RFECV on rs-fMRI data. Phys Med. 2019;65:99–105.

    Article  PubMed  Google Scholar 

  21. Hou J, Wu W, Lin Y, Wang J, Zhou D, Guo J, et al. Localization of cerebral functional deficits in patients with obsessive-compulsive disorder: a resting-state fMRI study. J Affect Disord. 2012;138(3):313–21.

    Article  PubMed  Google Scholar 

  22. Long J, Luo L, Guo Y, You W, Li Q, Li B, et al. Altered spontaneous activity and effective connectivity of the anterior cingulate cortex in obsessive-compulsive disorder. J Comp Neurol. 2021;529(2):296–310.

    Article  PubMed  Google Scholar 

  23. Xu Y, Zheng R, Wei Y, Chen J, Guo H, Wen B, et al. Static and temporal dynamic changes of intrinsic brain activity in pediatric and adults OCD. J Affect Disord. 2022;311:416–24.

    Article  CAS  PubMed  Google Scholar 

  24. Cheng Y, Xu J, Nie B, Luo C, Yang T, Li H, et al. Abnormal resting-state activities and functional connectivities of the anterior and the posterior cortexes in medication-naïve patients with obsessive-compulsive disorder. PLoS ONE. 2013;8(6):e67478.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Gao J, Zhou Y, Yang X, Luo J, Meng F, Zheng D, et al. Abnormalities within and beyond the cortico-striato-thalamo-cortical circuitry in medication-free patients with OCD revealed by the fractional amplitude of low-frequency fluctuations and resting-state functional connectivity. Neurosci Lett. 2019;712:134449.

    Article  CAS  PubMed  Google Scholar 

  26. Giménez M, Guinea-Izquierdo A, Villalta-Gil V, Martínez-Zalacaín I, Segalàs C, Subirà M, et al. Brain alterations in low-frequency fluctuations across multiple bands in obsessive compulsive disorder. Brain Imaging Behav. 2017;11(6):1690–706.

    Article  PubMed  Google Scholar 

  27. Qiu L, Fu X, Wang S, Tang Q, Chen X, Cheng L, et al. Abnormal regional spontaneous neuronal activity associated with symptom severity in treatment-naive patients with obsessive-compulsive disorder revealed by resting-state functional MRI. Neurosci Lett. 2017;640:99–104.

    Article  CAS  PubMed  Google Scholar 

  28. Chen Y, Meng X, Hu Q, Cui H, Ding Y, Kang L, et al. Altered resting-state functional organization within the central executive network in obsessive-compulsive disorder. Psychiatry Clin Neurosci. 2016;70(10):448–56.

    Article  PubMed  Google Scholar 

  29. Niu Q, Yang L, Song X, Chu C, Liu H, Zhang L, et al. Abnormal resting-state brain activities in patients with first-episode obsessive-compulsive disorder. Neuropsychiatr Dis Treat. 2017;13:507–13.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Qing X, Gu L, Li D. Abnormalities of localized connectivity in obsessive-compulsive disorder: a voxel-wise meta-analysis. Front Hum Neurosci. 2021;15:739175.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Yang XY, Sun J, Luo J, Zhong ZX, Li P, Yao SM, et al. Regional homogeneity of spontaneous brain activity in adult patients with obsessive-compulsive disorder before and after cognitive behavioural therapy. J Affect Disord. 2015;188:243–51.

    Article  PubMed  Google Scholar 

  32. Göttlich M, Krämer UM, Kordon A, Hohagen F, Zurowski B. Resting-state connectivity of the amygdala predicts response to cognitive behavioral therapy in obsessive compulsive disorder. Biol Psychol. 2015;111:100–9.

    Article  PubMed  Google Scholar 

  33. Li P, Yang X, Greenshaw AJ, Li S, Luo J, Han H, et al. The effects of cognitive behavioral therapy on resting-state functional brain network in drug-naive patients with obsessive-compulsive disorder. Brain Behav. 2018;8(5):e00963.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Hu X, Du M, Chen L, Li L, Zhou M, Zhang L, et al. Meta-analytic investigations of common and distinct grey matter alterations in youths and adults with obsessive-compulsive disorder. Neurosci Biobehav Rev. 2017;78:91–103.

    Article  PubMed  Google Scholar 

  35. Kodancha PG, Shivakumar V, Jose D, Venkatasubramanian G, Reddy YCJ, Narayanaswamy JC. Gray matter volume abnormalities and clinical correlates in OCD with exclusive washing dimension. Asian J Psychiatr. 2020;54:102343.

    Article  PubMed  Google Scholar 

  36. Tao Q, Dang J, Niu X, Gao X, Zhang M, Yang Z, et al. White matter microstructural abnormalities and gray matter volume alterations in obsessive-compulsive disorder: a coordinate-based meta-analysis. J Affect Disord. 2023;320:751–61.

    Article  PubMed  Google Scholar 

  37. Fouche JP, du Plessis S, Hattingh C, Roos A, Lochner C, Soriano-Mas C, et al. Cortical thickness in obsessive-compulsive disorder: multisite mega-analysis of 780 brain scans from six centres. Br J Psychiatry. 2017;210(1):67–74.

    Article  PubMed  Google Scholar 

  38. Liu W, Gan J, Fan J, Zheng H, Li S, Chan RCK, et al. Associations of cortical thickness, surface area and subcortical volumes with insight in drug-naïve adults with obsessive-compulsive disorder. Neuroimage Clin. 2019;24:102037.

    Article  PubMed  PubMed Central  Google Scholar 

  39. Zhou C, Xu J, Ping L, Zhang F, Chen W, Shen Z, et al. Cortical thickness and white matter integrity abnormalities in obsessive-compulsive disorder: a combined multimodal surface-based morphometry and tract-based spatial statistics study. Depress Anxiety. 2018;35(8):742–51.

    Article  PubMed  Google Scholar 

  40. Asal N, Bayar Muluk N, Inal M, Şahan MH, Doğan A, Buturak SV. Olfactory bulbus volume and olfactory sulcus depth in psychotic patients and patients with anxiety disorder/depression. Eur Arch Otorhinolaryngol. 2018;275(12):3017–24.

    Article  PubMed  Google Scholar 

  41. Takahashi T, Nakamura M, Sasabayashi D, Nishikawa Y, Takayanagi Y, Furuichi A, et al. Association between olfactory sulcus morphology and olfactory functioning in schizophrenia and psychosis high-risk status. Heliyon. 2019;5(10):e02642.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Takahashi T, Nishikawa Y, Yücel M, Whittle S, Lorenzetti V, Walterfang M, et al. Olfactory sulcus morphology in patients with current and past major depression. Psychiatry Res Neuroimaging. 2016;255:60–5.

    Article  PubMed  Google Scholar 

  43. Bu X, Hu X, Zhang L, Li B, Zhou M, Lu L, et al. Investigating the predictive value of different resting-state functional MRI parameters in obsessive-compulsive disorder. Transl Psychiatry. 2019;9(1):17.

    Article  PubMed  PubMed Central  Google Scholar 

  44. Hu X, Liu Q, Li B, Tang W, Sun H, Li F, et al. Multivariate pattern analysis of obsessive-compulsive disorder using structural neuroanatomy. Eur Neuropsychopharmacol. 2016;26(2):246–54.

    Article  CAS  PubMed  Google Scholar 

  45. Hu X, Zhang L, Bu X, Li H, Li B, Tang W, et al. Localized connectivity in obsessive-compulsive disorder: an investigation combining univariate and multivariate pattern analyses. Front Behav Neurosci. 2019;13:122.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Yan H, Shan X, Li H, Liu F, Guo W. Abnormal spontaneous neural activity in hippocampal-cortical system of patients with obsessive-compulsive disorder and its potential for diagnosis and prediction of early treatment response. Front Cell Neurosci. 2022;16:906534.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Zhou C, Cheng Y, Ping L, Xu J, Shen Z, Jiang L, et al. Support vector machine classification of obsessive-compulsive disorder based on whole-brain volumetry and diffusion tensor imaging. Front Psychiatry. 2018;9:524.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Yang X, Hu X, Tang W, Li B, Yang Y, Gong Q, et al. Multivariate classification of drug-naive obsessive-compulsive disorder patients and healthy controls by applying an SVM to resting-state functional MRI data. BMC Psychiatry. 2019;19(1):210.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Lundberg SM, Lee SI. A unified approach to interpreting model predictions. In Adv Neural Inf Process Syst. 2017; 4765–74.

  50. Strumbelj E, Kononenko I. Explaining prediction models and individual predictions with feature contributions. Knowl Inf Syst. 2014;41(3):647–65.

    Article  Google Scholar 

  51. Yan CG, Wang XD, Zuo XN, Zang YF. DPABI: data processing & analysis for (resting-state) brain imaging. Neuroinformatics. 2016;14(3):339–51.

    Article  PubMed  Google Scholar 

  52. Zang YF, He Y, Zhu CZ, Cao QJ, Sui MQ, Liang M, et al. Altered baseline brain activity in children with ADHD revealed by resting-state functional MRI. Brain Dev. 2007;29(2):83–91.

    Article  PubMed  Google Scholar 

  53. Zou QH, Zhu CZ, Yang Y, Zuo XN, Long XY, Cao QJ, et al. An improved approach to detection of amplitude of low-frequency fluctuation (ALFF) for resting-state fMRI: fractional ALFF. J Neurosci Methods. 2008;172(1):137–41.

    Article  PubMed  PubMed Central  Google Scholar 

  54. Zang Y, Jiang T, Lu Y, He Y, Tian L. Regional homogeneity approach to fMRI data analysis. NeuroImage. 2004;22(1):394–400.

    Article  PubMed  Google Scholar 

  55. Zuo XN, Ehmke R, Mennes M, Imperati D, Castellanos FX, Sporns O, et al. Network centrality in the human functional connectome. Cereb Cortex. 2012;22(8):1862–75.

    Article  PubMed  Google Scholar 

  56. Destrieux C, Fischl B, Dale A, Halgren E. Automatic parcellation of human cortical gyri and sulci using standard anatomical nomenclature. NeuroImage. 2010;53(1):1–15.

    Article  PubMed  Google Scholar 

  57. Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc Series B. 1996; 58(1).

  58. Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. 2010;33(1):1–22.

    Article  PubMed  PubMed Central  Google Scholar 

  59. Caputo B, Sim K, Furesjo F, Smola A. Appearance-based object recognition using SVMs: which kernel should I use? In: Proc. of neural information processing systems workshop on statistical methods for computational experiments in visual processing and computer vision. Whistler, 2002.

  60. Xia M, Wang J, He Y. BrainNet Viewer: a network visualization tool for human brain connectomics. PLoS ONE. 2013;8(7):e68910.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Nugent AC, Martinez A, D’Alfonso A, Zarate CA, Theodore WH. The relationship between glucose metabolism, resting-state fMRI BOLD signal, and GABAA-binding potential: a preliminary study in healthy subjects and those with temporal lobe Epilepsy. J Cereb Blood Flow Metab. 2015;35(4):583–91.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  62. Im K, Lee JM, Seo SW, Hyung Kim S, Kim SI, Na DL. Sulcal morphology changes and their relationship with cortical thickness and gyral white matter volume in mild cognitive impairment and Alzheimer’s Disease. NeuroImage. 2008;43(1):103–13.

    Article  PubMed  Google Scholar 

  63. Yun HJ, Im K, Jin-Ju Y, Yoon U, Lee JM. Automated sulcal depth measurement on cortical surface reflecting geometrical properties of sulci. PLoS ONE. 2013;8(2):e55977.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  64. Pereira F, Mitchell T, Botvinick M. Machine learning classifiers and fMRI: a tutorial overview. NeuroImage. 2009;45(1 Suppl):199–209.

    Article  Google Scholar 

  65. Kang J, Ullah Z, Gwak J. MRI-based Brain Tumor classification using ensemble of deep features and machine learning classifiers. Sens (Basel). 2021;21(6):2222.

    Article  Google Scholar 

  66. Fan J, Zhong M, Gan J, Liu W, Niu C, Liao H, et al. Spontaneous neural activity in the right superior temporal gyrus and left middle temporal gyrus is associated with insight level in obsessive-compulsive disorder. J Affect Disord. 2017;207:203–11.

    Article  PubMed  Google Scholar 

  67. Zhao HZ, Wang CH, Gao ZZ, Ma JD, Huang P, Li HF, et al. Effectiveness of cognitive-coping therapy and alteration of resting-state brain function in obsessive-compulsive disorder. J Affect Disord. 2017;208:184–90.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

This work was supported in part by National Natural Science Foundation of China (grant numbers 81271493, 82171542), Natural Science Foundation of Henan Province (grant number 232300420267), Key Scientific Research Project of Colleges and Universities in Henan Province (grant number 23A330005) and Key Specialized Research and Development Breakthrough in Henan Province (grant number 212102310583).

Author information

Authors and Affiliations

Authors

Contributions

FFH performed experiment design, statistical analysis, model establishment, and draft writing. XYY, JL, XJY, FQM and PCW were involved in participants recruitment, scale assessment and MRI scanning. ZJL was the administrator of the project and substantively revised the draft. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Zhan-Jiang Li.

Ethics declarations

Ethics approval and consent to participate

This study was performed in accordance with the Declaration of Helsinki and approved by the Research Ethics Committee of Beijing Anding Hospital, Capital Medical University. Written informed consent was obtained from each participant.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Supplementary Material 3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Huang, FF., Yang, XY., Luo, J. et al. Functional and structural MRI based obsessive-compulsive disorder diagnosis using machine learning methods. BMC Psychiatry 23, 792 (2023). https://doi.org/10.1186/s12888-023-05299-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12888-023-05299-2

Keywords