Skip to main content

Machine learning-enabled detection of attention-deficit/hyperactivity disorder with multimodal physiological data: a case-control study

Abstract

Background

Attention-Deficit/Hyperactivity Disorder (ADHD) is a multifaceted neurodevelopmental psychiatric condition that typically emerges during childhood but often persists into adulthood, significantly impacting individuals’ functioning, relationships, productivity, and overall quality of life. However, the current diagnostic process exhibits limitations that can significantly affect its overall effectiveness. Notably, its face-to-face and time-consuming nature, coupled with the reliance on subjective recall of historical information and clinician subjectivity, stand out as key challenges. To address these limitations, objective measures such as neuropsychological evaluations, imaging techniques and physiological monitoring of the Autonomic Nervous System functioning, have been explored.

Methods

The main aim of this study was to investigate whether physiological data (i.e., Electrodermal Activity, Heart Rate Variability, and Skin Temperature) can serve as meaningful indicators of ADHD, evaluating its utility in distinguishing adult ADHD patients. This observational, case-control study included a total of 76 adult participants (32 ADHD patients and 44 healthy controls) who underwent a series of Stroop tests, while their physiological data was passively collected using a multi-sensor wearable device. Univariate feature analysis was employed to identify the tests that triggered significant signal responses, while the Informative k-Nearest Neighbors (KNN) algorithm was used to filter out less informative data points. Finally, a machine-learning decision pipeline incorporating various classification algorithms, including Logistic Regression, KNN, Random Forests, and Support Vector Machines (SVM), was utilized for ADHD patient detection.

Results

Results indicate that the SVM-based model yielded the optimal performance, achieving 81.6% accuracy, maintaining a balance between the experimental and control groups, with sensitivity and specificity of 81.4% and 81.9%, respectively. Additionally, integration of data from all physiological signals yielded the best results, suggesting that each modality captures unique aspects of ADHD.

Conclusions

This study underscores the potential of physiological signals as valuable diagnostic indicators of adult ADHD. For the first time, to the best of our knowledge, our findings demonstrate that multimodal physiological data collected via wearable devices can complement traditional diagnostic approaches. Further research is warranted to explore the clinical applications and long-term implications of utilizing physiological markers in ADHD diagnosis and management.

Peer Review reports

Introduction

Attention Deficit Hyperactivity Disorder (ADHD) is a multifaceted neurodevelopmental psychiatric disorder typically emerging during childhood. Individuals with ADHD exhibit distinct personality traits and cognitive characteristics that impede their ability to regulate attention, behavior, and emotional responses [1]. The symptoms often manifest themselves as a persistent challenge to maintain focus, accompanied by hyperactivity or impulsivity, resulting in impairments in cognition and various life domains, including social, educational, and vocational among others [2, 3]. Although ADHD primarily arises early in childhood, in most cases, it often persists into adulthood, exerting long-term effects on individuals’ functioning, productivity, and overall quality of life [4, 5]. Although extensive research efforts have focused on childhood ADHD, where approximately 8% of children and 6% of adolescents worldwide are affected [6], researchers have only recently started investigating the disorder in adults [7,8,9,10,11], with an estimated global prevalence of 6.76% as reported by a 2022 study [12].

Although increased self-control in adulthood can help mitigate some of the symptoms associated with hyperactivity, difficulties in regulating emotional responses, maintaining focus, inferiority and feelings of impulsivity often persist beyond childhood [5, 13]. Inattention in adults with ADHD may lead to a slower pace of thinking and decision-making, as they often become entangled in irrelevant details [14]. Meanwhile, hyperactivity is linked to an inner sense of restlessness, with symptoms such as talking too much or too loudly, pacing up and down, or even experiencing muscle strain when seated [14]. Overall, adults with ADHD encounter several challenges in their personal, social, academic and vocational lives [2, 14, 15]. Their struggle to maintain focus and complete tasks renders them less productive at work [16], leading to work loss and substantial economic repercussions [17]. Furthermore, adults with ADHD are more prone to addictive behaviors (e.g. substance abuse disorders) [18, 19], while they also have a higher risk of injury and serious accidents, particularly stemming from risky driving behavior [20, 21].

In light of the profound impact ADHD has on individuals throughout their lives, diagnosing ADHD in adults is a crucial step towards managing and treating the disorder, ultimately enhancing their overall well-being and quality of life. The current standard for diagnosing ADHD in adults relies on a combination of psychometric questionnaires, in-person interviews with the individual and/or their parents to gather a comprehensive clinical history, and clinical assessments to identify the presence of specific symptoms [22]. Although widely used, this process has several limitations that can significantly affect its overall effectiveness [1]. Firstly, the current diagnostic process primarily relies on face-to-face assessments, which presents a significant limitation in terms of accessibility. Additionally, it can be time-consuming and economically burdensome, as multiple sessions may be required [22]. Furthermore, obtaining ancillary information to establish ADHD onset in childhood, which in most cases extends to adulthood, relies on subjective recall and may not always be available [23]. Moreover, recognizing the occurrence of symptoms is subject to the clinician’s interpretation, introducing potential bias into the diagnosis [24, 25]. Finally, ADHD symptomatology does not uniquely correlate with an ADHD condition and can overlap with symptoms of other psychiatric conditions [26]. It is estimated that approximately three out of four adults with ADHD suffer from at least one additional mental disorder such as depression, anxiety, personality disorders, or substance abuse [14, 24, 25]. The non-distinct nature of ADHD symptoms and the presence of psychiatric comorbidities introduce an additional level of uncertainty and complexity into the current diagnostic process [24, 26, 27].

In an effort to address some of the limitations associated with the standard of care interview-based diagnostic approaches, clinical experts working with ADHD patients have sought additional tools to provide objective data and facilitate more informed decisions [28, 29]. In this context, evaluations of the neuropsychological and neurophysiological aspects of the disorder have emerged as promising tools, though they have attracted varying levels of scientific interest. On the one hand, neuropsychological evaluations, which are more popular, have been employed to uncover impairments in various cognitive functions [29], serving as objective indicators of ADHD [28]. One such evaluation method is the Continuous Performance Tests (CPTs), which measure sustained attention and vigilance in a task-oriented computerized setting [1, 28, 30, 31]. Additionally, Stroop tests are commonly utilized to assess selective attention, interference and inhibitory control [31,32,33,34]. While the value of CPTs has been demonstrated in child populations [35, 36] and several studies [37,38,39] have highlighted the potential of Stroop tests for adults with ADHD, the utility of neuropsychological evaluations for diagnosing ADHD in adults is limited, as conclusive results cannot always be obtained [28, 29, 40,41,42,43]. The primary drawback of this approach is that it targets specific cognitive deficits, which may fail to capture impairments when multiple cognitive domains are affected, as is often the case for ADHD [28, 44].

On the other hand, clinicians have recently focused on the neurophysiological impact of the disorder, including brain and Autonomic Nervous System (ANS) functioning [5, 45,46,47,48,49,50,51]. Research has shown that ADHD is associated with alterations in brain function that may lead to cognitive impairment [49]. These alterations can be captured by imaging techniques capable of measuring functional brain activity (e.g. Electroencephalography (EEG), Magnetoencephalography, and Magnetic Resonance Imaging (MRI)) [52,53,54,55,56,57]. Recent studies [52,53,54,55,56,57] have underscored the correlation between functional brain activity, primarily captured by Functional Magnetic Resonance Imaging and EEG methods, and an underlying ADHD condition. However, such methods can not be used for large-scale testing and deployment, as MRI and EEG setups tend to be expensive, time-consuming, and obtrusive data collection approaches. Consequently, integrating these methods into current clinical practice may prove inefficient and impractical [58].

Besides its impact on brain function, studies have demonstrated that ADHD also affects the functioning of the Autonomic Nervous System (ANS) [50], which controls involuntary physiological processes [59, 60]. Within the context of ADHD, several studies [5, 46,47,48, 50] have suggested dysregulation in the part of ANS responsible for controlling arousal. This dysregulation may be linked to the behavioral challenges experienced by individuals with ADHD [47, 50]. Arousal of the ANS can be gauged in real-time by measuring physiological data [61], such as electrodermal activity (EDA), heart rate (HR), heart rate variability (HRV) and skin temperature (ST) [13, 62,63,64,65,66]. Several studies have proposed that these data modalities [45, 67,68,69,70,71,72,73,74,75] may carry useful information and offer valuable insights into an underlying ADHD condition. However, there has been limited focus on adult ADHD [45, 67,68,69,70, 72], with most studies involving relatively complicated setups [45, 68, 71,72,73,74,75] that hinder scalability and fail to leverage wearable sensors that enable multimodal data capturing in an unobtrusive and continuous manner [58].

In summary, neuropsychological and neurophysiological evaluations provide complementary and valuable information, offering different insights into ADHD conditions. Both approaches appear beneficial for clinicians and have the potential to address the limitations and complement the current symptom-based diagnostic process for adults with ADHD. Given that that research has demonstrated that ADHD is characterized by dysregulation in controlling physiological arousal, exploring the distinct physiological expressions of adult ADHD patients during potentially arousing conditions, such as neuropsychological tests, is of particular interest. Neuropsychological tests can reliably elicit ANS responses related to attention, cognitive control, and emotional regulation [76,77,78,79,80]. Therefore, the main focus of this study is to assess the potential capability of physiological data (i.e. EDA, HRV, ST) to distinguish ADHD adults, when collected during a series of neuropsychological evaluations (i.e. Stroop tests).

The selection of these specific data modalities serves multiple purposes. Firstly, we aim to explore the potential of physiological markers as standalone indicators for ADHD. To the best of our knowledge, there is limited research exploring the connection between physiological data and adult ADHD compared to neuropsychological assessments. Secondly, we seek to assess the utility of unobtrusively and continuously collected data, using scalable wearable technology. Such approaches would be more feasible for real-world applications and more likely to be integrated into current clinical practice compared to brain activity evaluations. Lastly, even though including additional data modalities (e.g. metrics from neuropsychological evaluations) in our analysis would be interesting, at the same time, it would significantly increase the complexity and dimensionality of our models, potentially compromising the robustness of the results and necessitating a more extensive dataset.

Therefore, this study employs EDA, HRV and ST data collected from a wrist-worn sensor during neuropsychological evaluations to investigate their potential utility in the diagnostic process for adult ADHD. For this purpose, we leverage the Feel Digital Precision Medicine Platform (DPMP), which enables continuous, real-time, and unobtrusive data capturing through a single wearable device. Furthermore, the platform offers signal processing and feature extraction capabilities, which are also utilized in this work. The main aims of this work are to: (i) evaluate the feasibility and utility of integrating physiological data, captured through a cost-effective and unobtrusive wearable sensor, into the current diagnostic framework for adult ADHD, (ii) develop and validate the performance and diagnostic capability of a machine learning (ML)-based pipeline in distinguishing ADHD adults, leveraging our DPMP and (iii) assess the complementary value of physiological data in providing a comprehensive understanding of ADHD symptoms.

Methods

Study design and participant recruitment

This was an observational, case-control study that included two groups of participants, an experimental and a control one. Participants for this study were recruited in collaboration with the First Department of Psychiatry of the Athens School of Medicine. The experimental group (EG) consisted of adult patients diagnosed with ADHD who were evaluated by healthcare professionals of the clinic. The assessment process for potential candidates in the EG included completing a questionnaire that consisted of questions collecting demographic, educational, occupational, and clinical data. This was followed by a battery of screening instruments, including a modified version of the Barkley Adult ADHD Rating Scale (BAARS) [81], the Autism-Spectrum Quotient (AQ) [82] and the Empathy Quotient (EQ) [83, 84]. Additionally, the semi-structured Diagnostic Interview for ADHD in Adults (DIVA) was administered to all patients [85, 86]. Complementary information was also collected from relatives. The second step of the assessment process entailed a comprehensive two-hour psychiatric examination by an experienced psychiatrist in the Psychiatry Department. This examination aimed to explore the presence of lifetime psychopathology using the Mini-International Neuropsychiatric Interview (M.I.N.I.) Greek version [87], which consists of a short structured interview assessing patient symptoms and signs against diagnostic criteria outlined in the Diagnostic and Statistical Manual of Mental Disorders (DSM-5) [3]. Overall, the main inclusion criteria for the EG candidates were: (i) ADHD diagnosis; (ii) age\(\ge\)18 years; (iii) IQ>70; (iv) proficient in Greek; and (v) willing and able to give informed consent for participation. On the other hand, the main exclusion criteria were: (i) major psychiatric disorder (e.g., schizophrenia, bipolar disorder); (ii) significant neurological conditions (e.g., epilepsy, traumatic brain injury); and (iii) severe learning disabilities. The final diagnosis regarding the presence of ADHD was based on DSM-5 criteria. Furthermore, candidates for the control group (CG) included neurotypical adults who had not undergone any screening for the presence of ADHD. Control participants were required to meet the same inclusion and exclusion criteria with EG participants, apart from the ADHD diagnosis.

Eligible candidates for both groups were then informed about the study scope, aims, and experimental process and were invited to participate. Those who expressed interest in joining had to select their preferred time slot and schedule an in-person experimental session. During this session, held at the First Department of Psychiatry of the Athens School of Medicine, participants were provided the Feel physiological data monitoring device (see Data acquisition, preprocessing and feature extraction infrastructure section). This device unobtrusively and continuously collected participants’ data, as they progressed through the various steps of experimental protocol. The flowchart of the participant screening and recruitment process is illustrated in Fig. 1.

Fig. 1
figure 1

Flowchart for participant screening and recruitment

Experimental setup

The experimental protocol for both the EG and CG, consisted of a series of six computerized Stroop tests during which participants were required to answer as quickly and accurately as possible: three color-word Stroop tests, two number Stroop tests, and one emotion Stroop test. At the same time, their physiological data was collected via the Feel Monitoring Device. The Stroop tasks were developed in Python programming language [88] and were presented using a Windows personal computer and a 17-inch flat screen. Participants provided their responses for the number and emotion Stroop tests using the computer keyboard, while the number of correct and wrong responses for the color-word Stroop tests were manually recorded by the researcher executing the experiment. The start and end times for each test were manually input by the researcher through the data collection mobile application (see Data acquisition, preprocessing and feature extraction infrastructure section). This ensured that the physiological data recorded by the wearable device was synchronized with the task events. In more detail, each test was designed as follows:

  • Color-Word Stroop: This step consisted of three different tests labeled as C1, C2 and C3. In C1, participants were instructed to name a color word depicted in a neutral color (e.g. the word “red” presented in a black-colored font). In C2, participants were presented with differently colored symbols and asked to identify the color of the font (e.g. symbol XXX presented in a blue-colored font). In C3, color words were depicted in a different color, which the subjects were asked to identify, ignoring the word itself (e.g. the word “red” presented in a blue-colored font). Each of the color-word Stroop tests lasted approximately one minute, during which participants were prompted to give as many answers as possible from a list of 200 words. The total duration of the session was 3 minutes.

  • Number Stroop: This part consisted of two tests labeled as N-S (Number-Size) and N-V (Number-Value). In both tests, participants were presented with pairs of numbers, each one displayed in a different font size. In the congruent condition, the number with the higher value was displayed in the larger font size, while in the incongruent condition, the number with the higher value was displayed in the smaller font size. During the N-S test, participants were asked to identify the number with the larger font size, while in N-V, they were asked to identify the number with the larger value. Each test included 100 pairs of numbers. This series of tests did not have a fixed duration and test duration depended on the time required by each participant to respond to all pairs of numbers. The average duration for each test was 3.9 minutes, with the total duration of this session being on average 7.8 minutes.

  • Emotion Stroop: This part consisted of a single test labeled as E. In the Emotion Stroop task, participants were instructed to name as quickly and accurately as possible the color of the presented words, while ignoring their meaning, which could be either neutral (e.g. tower, fork, etc.) or negative (e.g. crime, traitor, annoying, etc.). This session included 30 neutral and 30 negative words, presented in 4 different colors. The total number of trials was 240. The number of words displayed was fixed, and like the Number Stroop step, the total test time varied among participants. The duration of this session was on average 4.3 minutes.

Data acquisition, preprocessing and feature extraction infrastructure

For the purposes of this study, we have leveraged the capabilities and functionalities of our proprietary DPMP. The DPMP is a remote patient monitoring platform designed for unobtrusive, passive, and continuous monitoring and analysis of neurological and psychiatric patient data. The platform facilitates the collection, curation, and processing of multimodal data, with a focus on the discovery, extraction, and validation of metrics and biomarkers for various use cases in the fields of neurology and psychiatry. In this study, the following components of the platform were employed:

  • Feel Monitoring Device: The wrist-worn device features five embedded sensors and connects to the user’s smartphone via Bluetooth to accommodate continuous and passive data collection and transmission (Fig. 2A). The device collects psychophysiological, activity, and ambient conditions data, which have proven to be relevant and highly valuable for many neurological and psychiatric applications [89,90,91]. More specifically, the following data modalities are captured by the wearable device: EDA, HR, HRV, ST, 9-axis Inertial Measurement Unit, ambient temperature, and humidity. The focus of this work is on the first three data modalities.

  • Feel Mobile app: The Mobile app (Fig. 2B) connects to the Feel Monitoring Device, collects data, and transfers it from the device to the secure cloud-based processing infrastructure. The app allows users to control the start/stop of the data acquisition session and annotate specific timestamps during the acquisition with custom labels, indicating the start and end of each session step. These labels are used to identify data segments associated with each Stroop test. The Feel Mobile App is available for Android and iOS platforms.

  • Digital Endpoints Development & Biomarker Discovery Infrastructure: This sophisticated infrastructure integrates data curation, signal processing, time series analysis, and pattern recognition tools and frameworks for noisy artifact detection and denoising, signal annotation and segmentation, and feature extraction purposes. The processing pipeline for collected time series commences with the denoising step, where noisy signal parts are identified and the impact of noise artifacts is appropriately mitigated. Subsequently, time segments of the collected signals corresponding to each of the six Stroop tests are identified, using the timestamp annotations that have been inserted from the Feel mobile app. Finally, a wide variety of proprietary features are calculated for each segment of the acquired time series. These features range from simple statistical metrics to more complex and highly nonlinear ones reflecting the morphological, frequency, repeatability, and predictability characteristics of the signals. Specifically, 103 features have been extracted from the EDA signals, 77 from the HRV signals, and 11 from the ST signals, resulting in a feature set of 191 features available for our analysis.

Fig. 2
figure 2

The feel monitoring device (A) and the feel mobile app (B)

Statistical analysis and decision pipeline

We employed descriptive statistics to analyze the demographic characteristics of the two groups. Additionally, we compared the age distribution of the two groups using an independent t-test [92] and assessed the gender ratio using the \(\chi ^2\) test [93]. Leveraging the features extracted by the Biomarker Discovery infrastructure, we conducted a two-step univariate feature analysis to identify the best subset of the collected data that, when fed into an ML pipeline, would be able to distinguish between the experimental and control groups. An overview of the data processing and decision pipeline is shown in Fig. 3.

Fig. 3
figure 3

Overview of the data processing and decision pipeline

Firstly, within each group (i.e. EG and CG), we investigated the variance of each feature at the individual level. To accomplish this, we performed a series of nonparametric pairwise statistical tests for every pair of Stroop tests using the Wilcoxon signed-rank test [94]. Utilizing this method, a univariate analysis was performed in order to determine the proportion of total features showing a statistically significant difference between pairs of Stroop tests. A low ratio suggests minimal impact at the individual level for the specific combination of tests, allowing paired data to be treated as uncorrelated. Consequently, the feature data from two Stroop tests could be combined and analyzed together under a common test label. The second step of our statistical analysis involved identifying the data subsets that yielded more profound differences across features between the two groups (CG and EG). To achieve this, for each type of test (either the original tests or the bundled ones determined by the first level of our analysis), we conducted a univariate feature analysis using Kolmogorov-Smirnov statistical tests [95]. Subsequently, the subset of our dataset corresponding to the Stroop tests that yielded a higher percentage of features with statistically significant differences between the two groups were selected as inputs to the learning algorithms. For both levels of the statistical analysis, the cutoff P-value used for statistical significance was set at 0.1. For both statistical tests, we utilized the corresponding implementations in the SciPy Python package [96].

Having built a more informative data subset, the decision pipeline consists of three main parts: i) informative point selection, ii) feature selection, and iii) classification. The input of this pipeline is a dataset that includes the features identified as the most informative from the previously discussed statistical analysis. Separate datasets were constructed both for each of the three signals (i.e. EDA, HRV, and ST) and for their fusion. In the latter case, the feature space is constructed by concatenating the features from all signals.

Informative point selection

We enhance the separability of the two groups by identifying and discarding the least informative points [97]. Towards this, we utilize a modified version of the k-Nearest Neighbors algorithm (KNN), namely the Informative KNN (i-KNN) [98]. The least informative points are defined as data points that are (isolated) instances of one class in a n-dimensional feature space that reside in a neighborhood with a high density of points from the opposite class. For each data point \(\textbf{x}_i\) with class \(y_i\) we select k nearest neighbors and calculate for each neighbor \(\textbf{x}_j\), the associated informativeness given by Eq. (1) [98]:

$$\begin{aligned} P\left( \textbf{x}_j|\textbf{x}_i\right) =\frac{1}{C_i}d\left( \textbf{x}_j,\textbf{x}_i\right) ^{\eta \left( \textbf{x}_j,\textbf{x}_i\right) }\Lambda \left( \textbf{x}_j\right) ^{1-\eta \left( \textbf{x}_j,\textbf{x}_i\right) } \end{aligned}$$
(1)

where d is a distance function defined as \(d(\textbf{x}_j,\textbf{x}_i)=e^{-|\textbf{x}_j-\textbf{x}_i|^2}\), \(\eta\) represents the ratio of neighbors that have the same class \(y_j\) as \(\textbf{x}_j\) and \(C_i\) is a normalization factor such that \(\sum \nolimits _j P(\textbf{x}_j|\textbf{x}_i)=1\). The function \(\Lambda (\textbf{x}_j)\) can be interpreted as a weighting parameter, which quantifies how far apart the point \(\textbf{x}_j\) lies from the rest of the \(k-1\) neighbors of \(\textbf{x}_i\) which have a different label than \(y_j\). This weighting parameter is defined in Eq. (2).

$$\begin{aligned} \Lambda \left( \textbf{x}_j,\textbf{x}_i\right) \prod \limits _{n=1}^{k}\left( 1-d\left( \textbf{x}_j,\textbf{x}_n\right) \left( 1-\delta _{y_j,y_n}\right) \right) \end{aligned}$$
(2)

with \(\delta _{y_j,y_n}\) being the Kronecker delta which equals 1 only if \(y_j=y_n\) and 0 otherwise. We rank the k neighbors according to their informativeness (Eq. (1)) and classify \(\textbf{x}_i\) using the majority vote of the labels from the M neighbors with the highest informativeness. This classification decision is denoted as \(\tilde{y}_i\). We perform this process for \(k\in \{3,5,7,10\}\) and for each k, we use \(M\in [1,k]\) (25 possible combinations in total). For each value l for k and m for M, we compare the real label \(y_i\) with the decision \(\tilde{y}_i\) from the most informative neighbors and assign a binary score \(s(\textbf{x}_i,l,m) = 1-\delta _{y_i,\tilde{y}_i}\). We rank every vector \(\textbf{x}_i\) using the average \(\bar{s}(\textbf{x}_i)\) of the scores \(s(\textbf{x}_i,l,m)\), which is defined as shown in Eq. (3). A higher value of \(\bar{s}(\textbf{x}_i)\) implies that \(\textbf{x}_i\) is less informative for its corresponding class \(y_i\) and could thus be discarded. In this study, we examined two input datasets to the decision pipeline: i) the full dataset and ii) the reduced dataset after discarding the worst 5% of the data points, as described above. The implementation of this algorithm was performed in Python utilizing the respective packages [99].

$$\begin{aligned} \bar{s}(\textbf{x}_i)=\frac{1}{25}\sum \limits _{l=1}^{l=k}\sum \limits _{m=1}^{m=k}s(\textbf{x}_i,l,m) \end{aligned}$$
(3)

Feature selection

Regarding feature selection, we utilize the relative Median Absolute Deviation (rMAD, Eq. (4)) and the relative Interquartile Range (rIQR, Eq. (5)) as lower bounds to discard low-variance features that have minimal impact on the separability of the two groups. We have selected these two quantities as thresholds since they are robust to outlier values.

In Eqs. (4) and (5), \(\textbf{f}_i\) refers to a feature vector, i.e. a list of the values of the feature i. Furthermore, \(\text {med}(\textbf{f}_i)\) in Eq. (4) is the median and \(p(\textbf{f}_i,q)\) in Eq. (5) represents the q-th percentile of \(\textbf{f}_i\).

$$\begin{aligned} \text {rMAD}\left( \textbf{f}_i\right) =\frac{\text {med}\left( |\textbf{f}_i-\text {med}\left( \textbf{f}_i\right) |\right) }{\text {med}\left( \textbf{f}_i\right) } \end{aligned}$$
(4)
$$\begin{aligned} \text {rIQR}\left( \textbf{f}_i\right) =\frac{\left| p\left( \textbf{f}_i,0.75\right) -p\left( \textbf{f}_i,0.25\right) \right| }{p\left( \textbf{f}_i,0.25\right) } \end{aligned}$$
(5)

Both rMAD and rIQR have been normalized respectively. For rIQR if \(p(\textbf{f}_i,0.25)=0\), then \(p(\textbf{f}_i,0.75)\) is used in the denominator.

Classification

For the classification step, we first utilize Principal Component Analysis (PCA) to construct linear combinations of the features that best capture the variability, and thus the useful information, of the dataset. Then, we employ four widely used classification algorithms: Logistic Regression (LR), KNN, Random Forests (RF) and Support Vector Machines (SVM) [99]. For each algorithm, a set of classifier-specific hyperparameters along with the rMAD and rIQR thresholds are tuned to achieve the best accuracy. This tuning process is conducted using a 100-fold cross-validation scheme, where we split the data into a 70%-30% train-test stratified split. It is noteworthy that whether the full or reduced dataset was used was not treated as a hyperparameter, ensuring that each algorithm utilizes the same data points across all validation folds. For each input dataset, we have the option to keep all input data, or discard the least informative points and evaluate the output of the decision pipeline on the obtained accuracy, sensitivity and specificity of the test sets.

Ethical considerations

This observational study was conducted in accordance with the Declaration of Helsinki and was approved by the Ethics Committee of the National and Kapodistrian University of Athens, 1st Department of Psychiatry, Eginition Hospital. Prior to their participation in the study, all individuals were fully informed about the study scope, objectives, methodology, and components, and they provided written informed consent. They were also informed that their participation was voluntary and they could withdraw from the study at any time. Each participant was assigned a unique identifier upon providing informed consent. The collected data was pseudonymized, and no personal identifiers were used during data processing and reporting of results. Furthermore, participants did not receive any monetary compensation for their involvement in the study.

Results

Sample characteristics

A total of 95 individuals participated in this study, with 58 in the control group and 37 in the ADHD group. There were no difference in the age (\(35.18\pm 11.14\) vs \(32.58\pm 11.39\), \(t(93)=1.06\), \(P=.29\)) and gender ratio (Male/Female, 32/26 vs. 24/13, \(\chi ^2(1, 95)=0.5221\), \(P=.46\)) between the two groups. Out of the 95 participants who joined the study, 4 did not complete the experimental process due to time limitations or personal discomfort issues. Additionally, physiological data was not retrievable for 15 participants, mainly due to protocol execution errors, mobile app malfunctions, and internet connectivity issues. The remaining participants (i.e. 44 CG and 32 EG participants) successfully completed the experimental process. For this group of participants, there were no significant differences in the age distributions (36.23 ± 9.36 vs. 33.26 ± 12.18, \(t(74)=1.18\), \(P=.24\)), as well as for the gender ratio (Male/Female 29/15 vs. 21/11, \(\chi ^{2}(1,76)<10^{-4}\), \(P=.97\)). Finally, 23 out of 32 EG participants were receiving ADHD medication.

Feature analysis

Firstly, we explore the variability of the extracted features within the CG and EG for all combinations of Stroop tests in the experimental process, employing the Wilcoxon signed-rank test. In Fig. 4, we showcase the ratio of the total features that show a statistically significant change between every pair of tests for the CG and EG in the left and right plots, respectively. The x and y-axis in each plot indicate the two tests involved in the comparison. For example, the element in the first row and second column corresponds to the output of the comparison between the C1 and C2 Stroop tests. Accordingly, the elements on the diagonal are all equal to 0, since these correspond to the output of the comparison of a Stroop test with itself. It becomes evident that a clear pattern exists involving Stroop tests of the same type (i.e. Color-Word, Number, or Emotion). For such cases, the percentage of features showing a statistically significant change between the corresponding Stroop tests within both CG and EG is much lower (light-colored regions) than for cases with Stroop tests of different types being compared (dark-colored regions). Therefore, data from tests C1, C2, and C3, as well as N-Size and N-Value, can be bundled together under the new label “C” and “N”, respectively.

Fig. 4
figure 4

Percentage of features that differ significantly (\(P\le .1\)) within the control (CG; left) and experimental (EG; right) groups, per every couple of Stroop Tests

After bundling together the Color-Word and Number Stroop tests, we now have three types of tests: Color (C), Number (N), and Emotion (E). As the next step of the analysis, we explore how much the extracted features differ between the CG and EG for the three types of tests. For each case, the percentage of total features that showed a statistically significant difference between the CG and EG is illustrated in Fig. 5A. Moreover, the respective percentages per data modality (i.e. EDA, HRV and ST) are shown in Fig. 5B. As can be seen, the dataset consisting of N tests shows a much larger percentage of features differing between the two groups compared to the C and E tests, with more than half of the features being significantly different. Therefore, in the rest of this work, we focus solely on the subset of our dataset that includes features only from the N test.

Fig. 5
figure 5

Percentage of features showing statistically significant differences (\(P\le .1\)) between the control (CG) and experimental (EG) groups for the Emotion (E), Color-Word (C), and Number (N) Stroop tests in total (A) and per data modality (B)

Decision pipeline

For the evaluation of the performance of the decision pipeline, three metrics have been utilized: accuracy, sensitivity, and specificity. The former corresponds to the ratio of the correctly identified instances (true CG + true EG) to the total number of instances and reflects how many instances were correctly classified in total. Sensitivity assesses the capability of the model to correctly identify positive (true EG) instances and is calculated as the ratio of correctly identified EG instances to their total number. Similarly, specificity evaluates the same aspects for the CG instances and is extracted as the ratio of correctly identified CG instances to their total number. Tables 1, 2, 3 and 4 present these metrics for the different classification models employed for each physiological signal (i.e. EDA, HRV, ST) and their fusion, utilizing data captured during the Number Stroop tests. In all cases, the results for both the full and the reduced datasets are provided.

Electrodermal activity (EDA)

Table 1 displays the accuracy, sensitivity, and specificity of different classification models using both the original and reduced feature sets from EDA data. It should be noted that in most cases, all metrics improve when the least informative points are discarded, and the reduced dataset is utilized. More specifically, all performance metrics improve for the LR and RF algorithms, while only accuracy and sensitivity are enhanced for KNN and SVM. Notably, for SVM, the capability to identify CG instances is significantly reduced (specificity=40%). Interestingly, noisy data points appear to play a crucial role in constructing the support vectors, as the performance of the SVM model in terms of sensitivity and specificity is reversed when these points are discarded. The LR model achieves the highest accuracy and sensitivity but performs relatively poorly in identifying CG data points, with specificity below 60%. On the other hand, the RF classifier achieves a more balanced performance, with all performance metrics ranging from 70% to 74%.

Table 1 Accuracy, sensitivity and specificity for the LR, RF, KNN and SVM classification models using the original and reduced feature set from EDA data

Heart rate variability (HRV)

Similar to the previous case, the performance of all models is generally enhanced when the most informative data points are retained (Table 2). This becomes also evident for the SVM case, at least with regard to the accuracy and sensitivity metrics, which are significantly improved. This time, the KNN algorithm achieves the most balanced performance, with all metrics hovering around 78%. The RF algorithm also demonstrates a balanced performance, albeit with slightly lower metrics ranging from 73% to 77%. On the other hand, the capability of the LR model to identify the CG instances is the best among all models with a specificity of 82.2%, while also showing similar performance to the RF and KNN models with respect to the other metrics. Finally, the SVM achieves the best sensitivity, which is close to that of the KNN, but shows poorer performance in identifying CG data points. Similar to the EDA case, the SVM is notably influenced by the presence of the least informative data points.

Table 2 Accuracy, sensitivity and specificity for the LR, RF, KNN and SVM classification models using the original and reduced feature set from HRV data

Skin temperature (ST)

In the case of the ST signal (Table 3), the LR algorithm achieves an accuracy of 72% with a sensitivity of 89%. However, there is a notable increase in misclassification of CG participants, with a specificity close to 55%. The KNN and RF models exhibit similar performance, with more balanced metrics compared to the other models. On the other hand, the SVM algorithm demonstrates the least balanced performance, successfully identifying most of the EG data (sensitivity: 85%) but misclassifying the majority of CG data (specificity: 40%). Retaining the most informative data points affects the performance of all algorithms, with the SVM being particularly impacted.

Table 3 Accuracy, sensitivity and specificity for the LR, RF, KNN and SVM classification models using the original and reduced feature set from ST data

Data modalities fusion

Finally, when fusing data from all physiological signals together, we effectively expand the feature space and utilize the information from each signal simultaneously. As shown in Table 4, this leads to improved performance for all classifiers. Interestingly, the SVM classifier, which exhibited the least balanced performance when each signal was utilized separately, now shows almost identical capability to identify both the CG and EG data well, with sensitivity and specificity close to 82%. The rest of the classifiers demonstrate similarly balanced performance, but with slightly inferior performance metrics, except for the sensitivity of the LR model, which reaches sensitivity values up to 82.4%. Finally, discarding some of the noisy data points results in a boost of the performance for all classifiers, with the respective performance metrics for the SVM model ranging from 81.4%-81.9%.

Table 4 Accuracy, sensitivity and specificity for the LR, RF, KNN and SVM classification models using the original and reduced feature set from fused EDA, HRV and ST data

Discussion

In this study, we investigated the potential of utilizing physiological data for the detection of adult ADHD. Specifically, we focused on three physiological signals: EDA, HRV, and ST, acquired using a wrist-worn sensor during a series of Stroop tests. Our main hypothesis was that these tests would elicit ANS responses that differ between neurotypical adults and adults with an ADHD condition. One of our primary objectives was to evaluate the performance of unimodal models using each physiological signal separately. Additionally, we explored the potential of the complementarity of information, provided by these data modalities and constructed a multimodal model combining them simultaneously, to enhance the overall model capabilities. Using our proprietary data processing infrastructure, we extracted a series of features for each physiological signal and employed them in ML algorithms for a classification task aimed at identifying neurotypical and ADHD populations. Our results supported that these physiological signals carry significant information to be correlated with underlying ADHD conditions. Specifically, we developed multimodal ML models that achieved up to approximately 82% sensitivity and specificity. In the following, we will delve into the interpretation and implications of these results.

Early in our investigation, we explored the ANS responses, as captured by the available physiological data, across the six different Stroop tests, and between the two participant groups. The aim was to identify the subset of our data that could yield optimal separability between the ADHD and the neurotypical populations. Initially, we analyzed each of the two participant groups separately. Through the use of Wilcoxon signed-rank tests, we found that each type of Stroop test (C, N and E) yields sufficiently distinct responses. More specifically, within the three Color-Word and the two Number Stroop tests, the average variation of the feature distributions was significantly lower than comparing feature distributions from different types of Stroop tests (i.e. Color-Word vs Number, Number vs Emotion and Color-Word vs Emotion). Therefore, for the Color-Word and Number Stroop tests, the impact of the specific test at the individual level was minimal, and thus, we merged the corresponding data under a single unique test label. Utilizing these new groupings, we gained a second insight regarding the efficiency of the Stroop tests in triggering ANS responses sufficient to distinguish between the two groups. Our results suggest that the Number Stroop tests yield far more features showing a statistically significant (\(P\le .1\)) difference between the CG and EG, than Color-Word or Emotion Stroop tests. In more detail, for the Number Stroop, approximately 52% of the total features exhibit a significant difference, while for the Color-Word and the Emotion Stroop, approximately only 10.4% and 3.6% of the total features, respectively, show significant differences. What is more, the percentage of statistically significant features is similar across the different data modalities, indicating their potential value in distinguishing between the CG and the EG. Specifically, statistically significant differences were primarily observed in the statistical time domain and morphological features of the EDA (e.g. mean, standard deviation, first difference, number of EDA responses, EDA responses characteristics, etc.), time and frequency domain features of the HRV (e.g. SDNN, RMSSD, pNN50, power in low and high frequency bands, etc.) and the time domain features of the ST (e.g. standard deviation, first and second difference, etc.).

The sensitivity of the Number Stroop test to group differences in ANS response compared to other Stroop tasks is an intriguing finding that warrants further discussion. This phenomenon can potentially be attributed to the inherent complexity of the numerical Stroop task, which involves comparing two stimuli, the numerical value, and the physical size, exhibiting higher cognitive load and complexity than the color-word Stroop task [100]. Therefore, higher ANS activation is expected during numerical tasks since they require substantial working memory and executive function resources [101]. Additionally, previous research suggests that the processing of numerical and physical magnitudes relies on a semantic abstract and non-verbal magnitude representation, which may facilitate or interfere with cognitive processes without the confounding effects of reading skill and articulatory speed commonly associated with ADHD [102]. Consequently, numerical Stroop tasks may be more effective in triggering cognitive challenges associated with ADHD conditions. Finally, the nature of the stimuli in the numerical Stroop task, where both relevant and irrelevant tasks involve the comparison of magnitudes, may lead to similar brain activations and require constant active control mechanisms to inhibit the irrelevant task [103,104,105]. As a result, it may be more difficult to modulate the task conflict, leading to increased ANS activation.

A later step of our analysis identified data points carrying significant information for their respective classes. This data mining procedure [106] has become essential when dealing with real-world data [106] and aims to filter out the least informative data points, thereby improving input quality for prediction models [106]. In this study, we specifically considered outlier data points from the neurotypical population. Given that participants in the CG were not previously assessed for potential ADHD symptomatology, ADHD-like characteristics in their physiological data might have emerged during the Stroop tests, despite their labeling as CG. In contrast, participants in the EG underwent assessment by a clinical expert, during which their inclusion in the EG group was verified, and, thus, their physiological expressions were representative of an ADHD condition. Among various methods for data filtering [106], we employed an extension of the k-Nearest Neighbors algorithm (i-KNN), which incorporates a distance metric that considers the class labels of the neighboring points [97, 98]. The impact of this filtering method was assessed by comparing the performance of prediction algorithms separately for the full and the reduced dataset. Based on the nature of the i-KNN algorithm, we anticipated a larger impact on prediction algorithms exploiting the geometrical properties of data [106]. To demonstrate the effect on the prediction capabilities of the different models, we utilized balanced accuracy, defined as the mean value of sensitivity and specificity. Our results indicate a consistently positive mean relative change in balanced accuracy across all signals separately and their fusion when using the reduced dataset. Notably, the relative change is more pronounced for the RF (15.21%), KNN (14.23%), and SVM (17.72%) models, which leverage the high-dimensional structure of the data. In contrast, the LR models, which rely on statistical properties, exhibit a smaller mean relative change (9.57%). Therefore, the rest of this section focuses on the reduced dataset.

The primary focus of this work was to explore the performance of models leveraging each physiological signal separately (i.e., unimodal), as well as the fusion of them (i.e. multimodal). In the following, we discuss the findings regarding the performance of these models and compare their effectiveness. Starting from the unimodal models, significant variations in performance were observed across different physiological signals and performance metrics. Figure 6 illustrates the accuracy, sensitivity, and specificity of each physiological signal and their fusion for each tested algorithm, focusing on the reduced dataset. Notably, EDA and ST exhibit equal or better sensitivity compared to HRV, while HRV shows higher specificity. The most interesting observation occurs when the fusion of signals is utilized. In this case, more complex models can be built by exploiting information from all signals simultaneously. Therefore, multimodal models demonstrate more balanced performance, also achieving higher sensitivity and specificity across all algorithms except for KNN (for the HRV dataset), compared to the unimodal ones, which exhibited either high sensitivity or high specificity. This suggests that combining information from multiple signals simultaneously improves model performance, highlighting the importance of considering multimodal approaches in such studies. Independent of the classification algorithm, we can argue that multimodal information from EDA, HRV and ST yields at least equal or better performance than unimodal for both neurotypical and ADHD adults. This could be attributed to the fact that the selected data modalities convey information about various aspects of the ANS [5, 13, 61, 63,64,65,66] and better capture the multifaceted effect of ADHD in autonomic functioning [107]. Further investigation into the underlying mechanisms driving these observations could provide valuable insights into the pathophysiology of ADHD and inform the development of more effective diagnostic and therapeutic strategies.

Fig. 6
figure 6

Test accuracy, sensitivity and specificity of the LR (top left), RF (top right), KNN (bottom left) and SVM (bottom right) models for EDA (orange line), HRV (light blue line), ST (light green line) and Fusion (black line), when the reduced dataset is utilized

Focusing on the multimodal models, a comparison across different algorithms was conducted to evaluate their performance. In Fig. 7, we present the accuracy, sensitivity, and specificity obtained for the LR, RF, KNN, and SVM algorithms for the multimodal case. Our analysis reveals that the SVM algorithm consistently outperforms the others, particularly in terms of accuracy and specificity. Although it is only slightly worse than LR in terms of sensitivity, SVM achieves the most balanced result overall. The superior performance of SVM in this context can be attributed to its ability to effectively utilize multimodal data and learn from its complex structure. By optimizing the utilization of multimodal information, SVM demonstrates proficiency in identifying both neurotypical and ADHD adults.

Fig. 7
figure 7

Test accuracy, sensitivity and specificity for the LR (red line), RF (green line), KNN (blue line) and SVM (orange line) models, when using all data modalities

Overall, the findings of this study validate our hypothesis regarding ANS responses and their association with an underlying ADHD condition. Specifically, we demonstrated that ANS responses triggered by neuropsychological evaluations and captured by the EDA, HRV and ST physiological signals can effectively distinguish between ADHD and neurotypical adults. The most promising results were observed when following a multimodal data approach combining all signals captured during the execution of the Number Stroop tests. The SVM classification model exhibited the most balanced performance with regards to sensitivity and specificity (81.4% and 81.9%, respectively) highlighting the potential to recognize both ADHD and neurotypical adults. Future studies could further explore the potential of integrating additional sensor-collected data modalities (e.g. ambient conditions, accelerometer data, etc.), along with participants’ performance metrics from the Stroop tests (e.g. correct and wrong responses, response times, etc.) in the ADHD detection process. Academic research has recognized the potential contribution of these data sources to improve the efficiency of early ADHD diagnosis, but very limited clinical evidence is available [58].

Our findings contribute to addressing the lack of biomarkers for the detection of complex conditions like ADHD [24, 108]. Even though the current diagnostic process for adult ADHD has been widely used for decades, several limitations including accessibility, subjectiveness of the diagnosis, comorbidities, time and financial requirements, may hinder its efficiency and effectiveness. By leveraging passively collected multimodal physiological data, our study provides valuable insights into the pathophysiology of ADHD and presents a scalable solution for ADHD screening. The unobtrusive data collection method combined with the low cost of associated equipment render our platform a promising tool for assisting specialists during ADHD screening, supplementing traditional and resource-intensive methods. Furthermore, the use of physiological data obtained from wrist-worn devices opens avenues for continuous evaluation, enabling clinicians to monitor symptoms progression in near-real-time and obtain a more accurate clinical image of patients. This continuous monitoring could also lead to timely interventions and improved patient outcomes.

Limitations

While this study provides valuable insights into ADHD detection using multimodal physiological data, a few study limitations should be acknowledged. The relatively small sample size restricts the complexity of the ML models built and the robustness of results obtained, highlighting the need for larger samples to enable more comprehensive analyses. This work relies on in-sample model validation methods (i.e. cross-validation), which may lead to over-optimistic performance estimates. Larger sample sizes will enable the utilization of out-of-sample validation methods, fortifying the generalizability and robustness of the study outcomes. Additionally, the study did not control for co-occurring disorders and medication usage among participants, potentially confounding the observed ANS responses and ADHD detection accuracy. Future research should address these confounding factors to improve the validity of findings. Moreover, the study focused on detecting ADHD as a general condition and did not differentiate between ADHD subtypes, warranting exploration of subtype-specific detection methods. Furthermore, incorporating a wider range of continuous performance tests beyond Stroop tests targeting other ADHD-related characteristics could enhance performance, while also improving the generalizability of findings. Finally, confirming the profiles of the control group participants through clinical evaluations, in order to ensure that only neurotypical individuals are included, would further enhance our understanding of the potential of ADHD detection using physiological data.

Conclusion

In conclusion, this study has shed light on the potential of utilizing physiological data, including EDA, HRV, and ST for the detection of adult ADHD. By investigating ANS responses elicited during Stroop tests, we have demonstrated significant differences between neurotypical adults and those with ADHD, supporting the feasibility of using physiological signals as biomarkers for ADHD detection. Our analysis revealed that multimodal models, combining information from all physiological signals, outperformed unimodal ones, highlighting the importance of considering multimodal approaches in ADHD research. The SVM classification model emerged as the most effective in distinguishing between ADHD and neurotypical adults, achieving a balanced performance in terms of sensitivity and specificity (81.4% and 81.9%, respectively).

However, the relatively small sample size and the use of in-sample model validation methods pose challenges to the generalizability of our findings. Future research should aim to address these limitations by incorporating larger and more diverse samples, as well as employing out-of-sample validation methods to enhance the robustness of the results. Despite these limitations, our findings contribute to the growing body of literature on ADHD detection and underscore the potential of physiological data as valuable tools in clinical practice. Moving forward, further investigation into the underlying mechanisms driving ANS responses in ADHD and the integration of additional data modalities could provide deeper insights into the pathophysiology of the disorder and inform the development of more effective diagnostic and therapeutic strategies.

Availability of data and materials

The original contributions presented in the study are included in the article; Further inquiries can be directed to the corresponding authors.

Abbreviations

ADHD:

Attention-deficit/hyperactivity disorder

ANS:

Autonomic nervous system

AQ:

Autism-spectrum quotient

BAARS:

Barkley adult ADHD rating scale–IV

CG:

Control group

CPT:

Continuous performance test

DIVA:

Diagnostic interview for ADHD in adults

DPMP:

Digital precision medicine platform

DSM-5:

Diagnostic and statistical manual of mental disorders

EDA:

Electrodermal activity

EEG:

Electroencephalography

EG:

Experimental group

EQ:

Empathy quotient

HR:

Heart rate

HRV:

Heart rate variability

i-KNN:

Informative KNN

KNN:

k-Nearest neighbors

LR:

Logistic regression

M.I.N.I.:

Mini-international neuropsychiatric interview

MRI:

Magnetic resonance imaging

PCA:

Principal component analysis

RF:

Random forests

rIQR:

Relative interquartile range

rMAD:

Relative median absolute deviation

ST:

Skin temperature

SVM:

Support vector machines

References

  1. Gualtieri CT, Johnson LG. ADHD: Is Objective Diagnosis Possible? Psychiatry (Edgmont). 2005;2:44–53.

  2. Ginsberg Y, Quintero J, Anand E, Casillas M, Upadhyaya HP. Underdiagnosis of attention-deficit/hyperactivity disorder in adult patients: a review of the literature. Prim Care Companion CNS Disord. 2014;27(1). https://doi.org/10.4088/PCC.13r01600.

  3. Association AP. Diagnostic and Statistical Manual of Mental Disorders (DSM-5®). American Psychiatric Publishing; 2013. Available from: https://www.appi.org/products/dsm.

  4. Sroubek A, Kelly M, Li X. Inattentiveness in attention-deficit/hyperactivity disorder. Neurosci Bull. 2013;29:103–10. https://doi.org/10.1007/s12264-012-1295-6.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Faraone SV, Biederman J, Mick E. The age-dependent decline of attention deficit hyperactivity disorder: a meta-analysis of follow-up studies. Psychol Med. 2006;36(2):159–65. https://doi.org/10.1017/S003329170500471X.

    Article  PubMed  Google Scholar 

  6. Salari N, Ghasemi H, Abdoli N, et al. The global prevalence of ADHD in children and adolescents: a systematic review and meta-analysis. Ital J Pediatr. 2023;49. https://doi.org/10.1186/s13052-023-01456-1.

  7. Bálint S, Czobor P, Komlósi S, Meszaros A, Simon V, Bitter I. Attention deficit hyperactivity disorder (ADHD): gender-and age-related differences in neurocognition. Psychol Med. 2009;39(8):1337–45. https://doi.org/10.1017/S0033291708004236.

    Article  PubMed  Google Scholar 

  8. Pehlivanidis A, Spyropoulou A, Galanopoulos A, Papachristou CA, Papadimitriou GN. Attention deficit hyperactivity disorder (ADHD) in adults: Clinical recognition, diagnosis and therapeutic interventions. Arch Hell Med. 2012;29:562–76.

    Google Scholar 

  9. Volkow ND, Swanson JM. Clinical practice: Adult attention deficit-hyperactivity disorder. N Engl J Med. 2013;369(20):1935–44. https://doi.org/10.1056/NEJMcp1212625.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Pettersson R, Söderström S, Nilsson KW. Diagnosing ADHD in adults: an examination of the discriminative validity of neuropsychological tests and diagnostic assessment instruments. J Atten Disord. 2018;22(11):1019–31. https://doi.org/10.1177/1087054715618788.

    Article  PubMed  Google Scholar 

  11. Yoshimasu K, Barbaresi WJ, Colligan RC, Voigt RG, Killian JM, Weaver AL, et al. Adults with persistent ADHD: Gender and psychiatric comorbidities-A population-based longitudinal study. J Atten Disord. 2018;22(6):535–46. https://doi.org/10.1177/1087054716676342.

    Article  PubMed  Google Scholar 

  12. Song P, Zha M, Yang Q, Zhang Y, Li X, I R. The prevalence of adult attention-deficit hyperactivity disorder: A global systematic review and meta-analysis. J Glob Health. 2021;11(11). https://pubmed.ncbi.nlm.nih.gov/33692893/.

  13. Tan CL, Knight ZA. Regulation of Body Temperature by the Nervous System. Neuron. 2018;98(1):31–48. https://doi.org/10.1016/j.neuron.2018.02.022.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Kooij JJS, et al. Updated European Consensus Statement on diagnosis and treatment of adult ADHD. Eur Psychiatry. 2019;56(1):14–34. https://doi.org/10.1016/j.eurpsy.2018.11.001.

    Article  CAS  PubMed  Google Scholar 

  15. Barkley RA, Murphy KR, Fischer M. ADHD in Adults: What the Science Says. New York: Guilford Press; 2010.

  16. Secnik K, Swensen A, Lage MJ. Comorbidities and costs of adult patients diagnosed with attention-deficit hyperactivity disorder. Pharmacoeconomics. 2005;23. https://doi.org/10.2165/00019053-200523010-00008.

  17. Biederman J, Faraone SV. The effects of attention-deficit/hyperactivity disorder on employment and household income. MedGenMed. 2006;8.

  18. Knecht C, de Alvaro R, Martinez-Raga J, Balanza-Martinez V. Attention-deficit hyperactivity disorder (ADHD), substance use disorders, and criminality: a difficult problem with complex solutions. Int J Adolesc Med Health. 2015;27(2):163–75. https://doi.org/10.1515/ijamh-2015-5007.

    Article  PubMed  Google Scholar 

  19. Martinez-Raga J, Knecht C, de Alvaro R, Szerman N, Ruiz P. Addressing Dual Diagnosis Patients Suffering from Attention-Deficit Hyperactivity Disorders and Comorbid Substance Use Disorders: A Review of Treatment Considerations. Addict Disord Their Treat. 2013;12(4):213–30. https://doi.org/10.1097/ADT.0b013e318277060c.

    Article  Google Scholar 

  20. Brunkhorst-Kanaan N, Libutzki B, Reif A, Larsson H, McNeill RV, Kittel-Schneider S. ADHD and accidents over the life span - A systematic review. Neurosci Biobehav Rev. 2021;125:582–91. https://doi.org/10.1016/j.neubiorev.2021.02.002.

    Article  PubMed  Google Scholar 

  21. Dalsgaard S, Østergaard SD, Leckman JF, Mortensen PB, Pedersen MG. Mortality in children, adolescents, and adults with attention deficit hyperactivity disorder: a nationwide cohort study. Lancet. 2015;385(9983):2190–6. https://doi.org/10.1016/S0140-6736(14)61684-6.

    Article  PubMed  Google Scholar 

  22. Children and Adults with Attention-Deficit/Hyperactivity Disorder (CHADD). The ADHD Diagnostic Process. https://chadd.org/for-professionals/the-adhd-diagnostic-process/.  Accessed 3 Mar 2024.

  23. Waltereit R, Ehrlich S, Roessner V. First-time diagnosis of ADHD in adults: challenge to retrospectively assess childhood symptoms of ADHD from long-term memory. Eur Child Adolesc Psychiatry. 2023;32:1333–5.

  24. Abdelnour E, Jansen MO, Gold JA. ADHD Diagnostic Trends: Increased Recognition or Overdiagnosis? Mo Med. 2022;119(5):467–73.

  25. Johnson J, Morris S, George S. Misdiagnosis and missed diagnosis of adult attention-deficit hyperactivity disorder. BJPsych Adv. 2021;27(1):60–1. https://doi.org/10.1192/bja.2020.34.

    Article  Google Scholar 

  26. Culpepper L, Mattingly G. Challenges in identifying and managing attention-deficit/hyperactivity disorder in adults in the primary care setting: a review of the literature. Prim Care Companion J Clin Psychiatry. 2010;12(6). https://doi.org/10.4088/PCC.10r00951pur.

  27. Reinecke MA, Beebe DW, Stein MA. The third factor of the WISC-III: It’s (probably) not freedom from distractibility. J Am Acad Child Adolesc Psychiatry. 1999;38(2). https://doi.org/10.1017/S003329170500471X.

  28. Baggio S, Hasler R, Giacomini V, El-Masri H, Weibel S, Perroud N, et al. Does the Continuous Performance Test Predict ADHD Symptoms Severity and ADHD Presentation in Adults? J Atten Disord. 2020;24(6):840–8. https://doi.org/10.1177/1087054718822060.

    Article  PubMed  Google Scholar 

  29. Guo N, Fuermaier ABM, Koerts J, et al. Neuropsychological functioning of individuals at clinical evaluation of adult ADHD. Neural Transm. 2021;128. https://doi.org/10.1007/s00702-020-02281-0.

  30. Fasmer OB, Mjeldheim K, Førland W. Linear and non-linear analyses of Conner’s Continuous Performance Test-II discriminate adult patients with attention deficit hyperactivity disorder from patients with mood and anxiety disorders. BMC Psychiatry. 2016;284. https://doi.org/10.1186/s12888-016-0993-4.

  31. Nasiri E, Khalilzad M, Hakimzadeh Z, et al. A comprehensive review of attention tests: can we assess what we exactly do not understand? Egypt J Neurol Psychiatry Neurosurg. 2023;59(26). https://doi.org/10.1186/s41983-023-00628-4.

  32. Assef ECDS, Capovilla AGS, Capovilla FC. Computerized Stroop Test to Assess Selective Attention in Children with Attention Deficit Hyperactivity Disorder. Span J Psychol. 2007;10(1):33–40. https://doi.org/10.1017/S1138741600006296.

    Article  PubMed  Google Scholar 

  33. King JA, Colla M, Brass M, Heuser I, von Cramon D. Inefficient cognitive control in adult ADHD: evidence from trial-by-trial Stroop test and cued task switching performance. Behav Brain Funct. 2007;3(42). https://pubmed.ncbi.nlm.nih.gov/17708762/.

  34. Young S, Bramham J, Tyson C, Morris R. Inhibitory dysfunction on the Stroop in adults diagnosed with attention deficit hyperactivity disorder. Personal Individ Differ. 2006;41(8):1377–84. https://doi.org/10.1016/j.paid.2006.01.010.

    Article  Google Scholar 

  35. Hall CL, Valentine AZ, Groom MJ, et al. The clinical utility of the continuous performance test and objective measures of activity for diagnosing and monitoring ADHD in children: a systematic review. Eur Child Adolesc Psychiatry. 2016;25. https://doi.org/10.1007/s00787-015-0798-xi.

  36. Erdodi LA, Lajiness-O’Neill R. Time-Related Changes in Conners’ CPT-II Scores: A Replication Study. Appl Neuropsychol Adult. 2014;21(1):43–50. https://doi.org/10.1080/09084282.2012.724036.

    Article  PubMed  Google Scholar 

  37. Boonstra AM, Oosterlaan J, Sergeant JA, Buitelaar JK. Executive functioning in adult ADHD: a meta-analytic review. Psychol Med. 2005;35(8):1097–108. https://doi.org/10.1017/s003329170500499x.

    Article  PubMed  Google Scholar 

  38. Frazier TW, Demaree HA, Youngstrom EA. Meta-analysis of intellectual and neuropsychological test performance in attention-deficit/hyperactivity disorder. Neuropsychology. 2004;18(3):543–55. https://doi.org/10.1037/0894-4105.18.3.543.

    Article  PubMed  Google Scholar 

  39. Hervey AS, Epstein JN, Curry JF. Neuropsychology of adults with attention-deficit/hyperactivity disorder: a meta-analytic review. Neuropsychology. 2004;18(3):485–503. https://doi.org/10.1037/0894-4105.18.3.485.

    Article  PubMed  Google Scholar 

  40. Pagán AF, Huizar YP, Schmidt AT. Conner’s Continuous Performance Test and Adult ADHD: A Systematic Literature Review. J Atten Disord. 2023;27(3):231–49. https://doi.org/10.1177/10870547221142455.

    Article  PubMed  Google Scholar 

  41. Homack S, Riccio CA. A meta-analysis of the sensitivity and specificity of the Stroop Color and Word Test with children. Arch Clin Neuropsychol Off J Natl Acad Neuropsychologists. 2004;19(6):725–43. https://doi.org/10.1016/j.acn.2003.09.003.

    Article  Google Scholar 

  42. van Mourik R, Oosterlaan J, Sergeant JA. The Stroop revisited: a meta-analysis of interference control in AD/HD. J Child Psychol Psychiatry Allied Discip. 2005;46(2):150–65. https://doi.org/10.1111/j.1469-7610.2004.00345.x.

    Article  Google Scholar 

  43. Schwartz K, Verhaeghen P. ADHD and Stroop interference from age 9 to age 41 years: a meta-analysis of developmental effects. Psychol Med. 2008;38(11):1607–16. https://doi.org/10.1017/S003329170700267X.

    Article  CAS  PubMed  Google Scholar 

  44. Casal PV, Esposito FL, Martínez IM, Capdevila A, Puig MS, de la Osa N, et al. Clinical Validation of Eye Vergence as an Objective Marker for Diagnosis of ADHD in Children. J Atten Disord. 2019;23(6):599–614. https://doi.org/10.1177/1087054717749931.

    Article  Google Scholar 

  45. Bellato A, Arora I, Kochhar P, Hollis C, Groom MJ. Indices of Heart Rate Variability and Performance During a Response-Conflict Task Are Differently Associated With ADHD and Autism. J Atten Disord. 2022;26(3):434–46. https://doi.org/10.1177/1087054720972793.

    Article  PubMed  Google Scholar 

  46. Hanć T, Cortese S. Attention deficit/hyperactivity-disorder and obesity: A review and model of current hypotheses explaining their comorbidity. Neurosci Biobehav Rev. 2018;92:16–28. https://doi.org/10.1016/j.neubiorev.2018.05.017.

    Article  PubMed  Google Scholar 

  47. Geissler J, Romanos M, Hegerl U, et al. Hyperactivity and sensation seeking as autoregulatory attempts to stabilize brain arousal in ADHD and mania? ADHD Atten Def Hyp Disord. 2014;6. https://doi.org/10.1007/s12402-014-0144-z.

  48. Hvolby A. Associations of sleep disturbance with ADHD: implications for treatment. ADHD Atten Def Hyp Disord. 2015;7. https://doi.org/10.1007/s12402-014-0151-0.

  49. Albrecht B, Uebel-von Sandersleben H, Gevensleben H, Rothenberger A. Pathophysiology of ADHD and associated problems-starting points for NF interventions? Front Hum Neurosci. 2015;9. https://doi.org/10.3389/fnhum.2015.00359.

  50. Bellato A, Arora I, Hollis C, Groom MJ. Is autonomic nervous system function atypical in attention deficit hyperactivity disorder (ADHD)? A systematic review of the evidence. Neurosci Biobehav Rev. 2020;108:182–206. https://doi.org/10.1016/j.neubiorev.2019.11.001.

    Article  CAS  PubMed  Google Scholar 

  51. da Silva BS, Grevet EH, Silva L, et al. An overview on neurobiology and therapeutics of attention-deficit/hyperactivity disorder. Discov Ment Health. 2023;3(2). https://doi.org/10.1007/s44192-022-00030-1.

  52. Riaz A, Asad M, Alonso E, Slabaugh G. DeepFMRI: End-to-end deep learning for functional connectivity and classification of ADHD using fMRI. J Neurosci Methods. 2020;335:108506. https://doi.org/10.1016/j.jneumeth.2019.108506.

    Article  PubMed  Google Scholar 

  53. Kaiser A, Aggensteiner PM, Baumeister S, Holz NE, Banaschewski T, Brandeis D. Earlier versus later cognitive event-related potentials (ERPs) in attention-deficit/hyperactivity disorder (ADHD): A meta-analysis. Neurosci Biobehavi Rev. 2020;112:117–34. https://doi.org/10.1016/j.neubiorev.2020.01.019.

    Article  Google Scholar 

  54. González JJ, Alba G, Mañas S, González A, Pereda E. Assessment of ADHD Through Electroencephalographic Measures of Functional Connectivity. New Dir Diagn Treat. 2015;1. https://doi.org/10.5772/60559.

  55. Khoshnoud S, Nazari MA, Shamsi M. Functional brain dynamic analysis of ADHD and control children using nonlinear dynamical features of EEG signals. J Integr Neurosci. 2018;17(1):17–30. https://doi.org/10.3233/JIN-170033.

    Article  Google Scholar 

  56. Sen B, Borle NC, Greiner R, Brown MRG. A general prediction model for the detection of ADHD and Autism using structural and functional MRI. PLoS ONE. 2018;13(4):1–28. https://doi.org/10.1371/journal.pone.0194856.

    Article  CAS  Google Scholar 

  57. Wang XH, Jiao Y, Li L. Identifying individuals with attention deficit hyperactivity disorder based on temporal variability of dynamic functional connectivity. Sci Rep. 2018;8(1):1–12. https://doi.org/10.1038/s41598-018-30308-w.

    Article  CAS  Google Scholar 

  58. Loh HW, Ooi CP, Barua PD, Palmer EE, Molinari F, Acharya UR. Automated detection of ADHD: Current trends and future perspective. Comput Biol Med. 2022;146:105525. https://doi.org/10.1016/j.compbiomed.2022.105525.

    Article  PubMed  Google Scholar 

  59. Waxenbaum JA, Reddy V, Varacallo M. Anatomy, Autonomic Nervous System. StatPearls Publishing; 2024. https://www.statpearls.com/.

  60. Subramanian S, Barbieri R, Brown EN. Point process temporal structure characterizes electrodermal activity. Proc Natl Acad Sci. 2020;117(42):26422–8. https://doi.org/10.1073/pnas.2004403117.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Debnath S, Levy TJ, Bellehsen M, et al. A method to quantify autonomic nervous system function in healthy, able-bodied individuals. Bioelectron Med. 2021;7. https://doi.org/10.1186/s42234-021-00075-7.

  62. Faraone SV, Bonvicini C, Scassellati C. Biomarkers in the diagnosis of ADHD-promising directions. Current psychiatry reports. 2014;16(497). https://doi.org/10.1007/s11920-014-0497-1.

  63. Koenig J, Rash JA, Kemp AH, Buchhorn R, Thayer JF, Kaess M. Resting state vagal tone in attention deficit (hyperactivity) disorder: A meta-analysis. World J Biol Psychiatry. 2017;18(4):256–67. https://doi.org/10.3109/15622975.2016.1174300.

    Article  PubMed  Google Scholar 

  64. Berntson GG, Thomas Bigger Jr J, Eckberg DL, Grossman P, Kaufmann PG, Malik M, et al. Heart rate variability: Origins, methods, and interpretive caveats. Psychophysiology. 1997;34(6):623–48. https://doi.org/10.1111/j.1469-8986.1997.tb02140.x.

    Article  CAS  PubMed  Google Scholar 

  65. Critchley H, Nagai Y. In: Electrodermal Activity (EDA). New York: Springer New York; 2013. pp. 666–9. https://doi.org/10.1007/978-1-4419-1005-9_13.

  66. Kushki A, Drumm E, Pla Mobarak M, Tanel N, Dupuis A, Chau T, et al. Investigating the Autonomic Nervous System Response to Anxiety in Children with Autism Spectrum Disorders. PLoS ONE. 2013;8(4):1–8. https://doi.org/10.1371/journal.pone.0059730.

    Article  CAS  Google Scholar 

  67. Robe A, Dobrean A, Cristea IA, Păsărelu CR, Predescu E. Attention-deficit/hyperactivity disorder and task-related heart rate variability: A systematic review and meta-analysis. Neurosci Biobehav Rev. 2019;99:11–22. https://doi.org/10.1016/j.neubiorev.2019.01.022.

    Article  PubMed  Google Scholar 

  68. Rukmani MR, Seshadri SP, Thennarasu K, Raju TR, Sathyaprabha TN. Heart Rate Variability in Children with Attention-Deficit/Hyperactivity Disorder: A Pilot Study. Ann Neurosci. 2016;23(2):81–8. https://doi.org/10.1159/000443574.

    Article  PubMed  PubMed Central  Google Scholar 

  69. Yuksel T, Ozcan O. Heart rate variability as an indicator of autonomous nervous system activity in children with attention deficit hyperactivity disorder. Alpha Psychiatry. 2018;19. https://doi.org/10.5455/apd.288995.

  70. Kvadsheim E, Fasmer OB, Osnes B, Koenig J, Adolfsdottir S, Eichele H, et al. Lower Cardiac Vagal Activity Predicts Self-Reported Difficulties With Emotion Regulation in Adolescents With ADHD. Front Psychiatry. 2020;11. https://doi.org/10.3389/fpsyt.2020.00244.

  71. Chua RN, Hau YW, Tiew CM, Hau WL. Investigation of Attention Deficit/Hyperactivity Disorder Assessment Using Electro Interstitial Scan Based on Chronoamperometry Technique. IEEE Access. 2019;7:144679–90. https://doi.org/10.1109/ACCESS.2019.2938095.

    Article  Google Scholar 

  72. Du Rietz E, James SN, Banaschewski T, Brandeis D, Asherson P, Kuntsi J. Autonomic arousal profiles in adolescents and young adults with ADHD as a function of recording context. Psychiatry Res. 2019;275:212–20. https://doi.org/10.1016/j.psychres.2019.03.039.

    Article  PubMed  PubMed Central  Google Scholar 

  73. Lazzaro I, Gordon E, Li W, Lim CL, Plahn M, Whitmont S, et al. Simultaneous EEG and EDA measures in adolescent attention deficit hyperactivity disorder. Int J Psychophysiol. 1999;34(2):123–34. https://doi.org/10.1016/S0167-8760(99)00068-9.

    Article  CAS  PubMed  Google Scholar 

  74. Hermens DF, Williams LM, Lazzaro I, Whitmont S, Melkonian D, Gordon E. Sex differences in adult ADHD: a double dissociation in brain activity and autonomic arousal. Biol Psychol. 2004;66(3):221–33. https://doi.org/10.1016/j.biopsycho.2003.10.006.

    Article  PubMed  Google Scholar 

  75. Bijlenga D, Van Someren EJW, Gruber R, Bron TI, Kruithof IF, Spanbroek ECA, et al. Body temperature, activity and melatonin profiles in adults with attention-deficit/hyperactivity disorder and delayed sleep: a case-control study. J Sleep Res. 2013;22(6):607–16. https://doi.org/10.1111/jsr.12075.

    Article  PubMed  Google Scholar 

  76. Svetlak M, Bob P, Cernik M, Kukleta M. Electrodermal complexity during the Stroop Colour Word Test. Auton Neurosci Basic Clin. 2010;152(1):101–7. https://doi.org/10.1016/j.autneu.2009.10.003.

    Article  Google Scholar 

  77. Hoshikawa Y, Yamamoto Y. Effects of Stroop color-word conflict test on the autonomic nervous system responses. Am J Physiol Heart Circ Physiol. 1997;272(3):H1113–21. https://doi.org/10.1152/ajpheart.1997.272.3.H1113.

    Article  CAS  Google Scholar 

  78. Causse M, Sénard JM, Démonet JF, Pastor J. Monitoring Cognitive and Emotional Processes Through Pupil and Cardiac Response During Dynamic Versus Logical Task. Appl Psychophysiol Biofeedback. 2010;35(2):115–23. https://doi.org/10.1007/s10484-009-9115-0.

    Article  PubMed  Google Scholar 

  79. Brekke JS, Raine A, Ansel M, Lencz T, Bird L. Neuropsychological and Psychophysiological Correlates of Psychosocial Functioning in Schizophrenia. Schizophr Bull. 1997;23(1):19–28. https://doi.org/10.1093/schbul/23.1.19.

    Article  CAS  PubMed  Google Scholar 

  80. Levy B. Autonomic nervous system arousal and cognitive functioning in bipolar disorder. Bipolar Disord. 2013;15(1):70–9. https://doi.org/10.1111/bdi.12028.

    Article  PubMed  Google Scholar 

  81. Barkley RA, Murphy KR. Comorbid Psychiatric Disorders in Adults with ADHD. ADHD Rep. 2007;15(2):1–7. https://doi.org/10.1521/adhd.2007.15.2.1.

    Article  Google Scholar 

  82. Baron-Cohen S, Wheelwright S, Skinner R, et al. The Autism-Spectrum Quotient (AQ): Evidence from Asperger Syndrome/High-Functioning Autism, Males and Females, Scientists and Mathematicians. J Autism Dev Disord. 2001;31. https://doi.org/10.1023/A:1005653411471.

  83. Baron-Cohen S, Wheelwright S. The empathy quotient: an investigation of adults with Asperger syndrome or high functioning autism, and normal sex differences. J Autism Dev Disord. 2004;34. https://doi.org/10.1023/b:jadd.0000022607.19833.00.

  84. Baron-Cohen S, Wheelwright S, Robinson J, Woodbury-Smith M. The Adult Asperger Assessment (AAA): a diagnostic method. J Autism Dev Disord. 2005;35. https://doi.org/10.1007/s10803-005-0026-5.

  85. Pehlivanidis A, Papanikolaou K, Korobili K, Kalantzi E, Mantas V, Pappa D, et al. Trait-Based Dimensions Discriminating Adults with Attention Deficit Hyperactivity Disorder (ADHD), Autism Spectrum Disorder (ASD) and, Co-occurring ADHD/ASD. Brain Sciences. 2021;11(1). https://doi.org/10.3390/brainsci11010018.

  86. Kooij J, Francken M. Diagnostic interview for ADHD in adults 2.0 (DIVA 2.0). DIVA Foundation. 2010.

  87. Sheehan D, Lecrubier Y, Harnett SK, Javans J, Weiller E. The MINI international psychiatric interview (MINI): the development and validation of a structured diagnostic psychiatric interview for DSMIV and ICD10. J Clin Psychiatry. 1997;59:22–33.

  88. van Rossum G. Python tutorial. Amsterdam: Centrum voor Wiskunde en Informatica (CWI); 1995. CS-R9526.

  89. Sheikh M, Qassem M, Kyriacou PA. Wearable, Environmental, and Smartphone-Based Passive Sensing for Mental Health Monitoring. Front Digit Health. 2021;3. https://doi.org/10.3389/fdgth.2021.662811.

  90. Singh J, Sharma D. Automated detection of mental disorders using physiological signals and machine learning: A systematic review and scientometric analysis. Multimed Tools Appl. 2023. https://doi.org/10.1007/s11042-023-17504-1.

  91. Hickey BA, Chalmers T, Newton P, Lin CT, Sibbritt D, McLachlan CS, et al. Smart Devices and Wearable Technologies to Detect and Monitor Mental Health Conditions and Stress: A Systematic Review. Sensors. 2021;21(10). https://doi.org/10.3390/s21103461.

  92. Kalpić D, Hlupić N, Lovrić M. In: Student’s t-Tests. Berlin, Heidelberg: Springer Berlin Heidelberg; 2011. pp. 1559–63. https://doi.org/10.1007/978-3-642-04898-2_641.

  93. Wuensch KL. In: Chi-Square Tests. Berlin, Heidelberg: Springer Berlin Heidelberg; 2011. pp. 252–3. https://doi.org/10.1007/978-3-642-04898-2_173.

  94. Rey D, Neuhäuser M. In: Wilcoxon-Signed-Rank Test. Berlin, Heidelberg: Springer Berlin Heidelberg; 2011. pp. 1658–9. https://doi.org/10.1007/978-3-642-04898-2_616.

  95. Yadolah D. In: Kolmogorov–Smirnov Test. New York: Springer New York; 2008. pp. 283–7. https://doi.org/10.1007/978-0-387-32833-1_214.

  96. Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat Methods. 2020;17:261–72. https://doi.org/10.1038/s41592-019-0686-2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  97. Huawen L, Shichao Z. Noisy data elimination using mutual k-nearest neighbor for classification mining. J Syst Softw. 2012;85(5):1067–74. https://doi.org/10.1016/j.jss.2011.12.019.

    Article  Google Scholar 

  98. Song Y, Huang J, Zhou D, Zha H, Giles CL. IKNN: Informative K-Nearest Neighbor Pattern Classification. In: Kok JN, Koronacki J, Lopez de Mantaras R, Matwin S, Mladenic D, Skowron A, editors. Knowledge Discovery in Databases: PKDD 2007. Berlin, Heidelberg: Springer Berlin Heidelberg; 2007. pp. 248–64.

  99. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine Learning in Python. J Mach Learn Res. 2011;12:2825–30.

    Google Scholar 

  100. Sweller J, Ayres P, Kalyuga S. In: Cognitive Load Theory. Springer New York; 2011. https://doi.org/10.1007/978-1-4419-8126-4.

  101. Critchley HD. Review: Electrodermal Responses: What Happens in the Brain. Neuroscientist. 2002;8(2):132–42. https://doi.org/10.1177/107385840200800209.

    Article  PubMed  Google Scholar 

  102. Dehaene S, Cohen L. Towards an anatomical and functional model of number processing. Math Cogn. 1995;1(1):83–120.

    Google Scholar 

  103. Cohen Kadosh R, Henik A. A common representation for semantic and physical properties: A cognitive-anatomical approach. Exp Psychol. 2006;53(2):87–94. https://doi.org/10.1027/1618-3169.53.2.87.

    Article  PubMed  Google Scholar 

  104. Cohen Kadosh R, Henik A. When a line is a number: Color yields magnitude information in a digit-color synesthete. Neuroscience. 2006;137(1):3–5. https://doi.org/10.1016/j.neuroscience.2005.08.057.

    Article  CAS  PubMed  Google Scholar 

  105. Pinel P, Piazza M, Le Bihan D, Dehaene S. Distributed and Overlapping Cerebral Representations of Number, Size, and Luminance during Comparative Judgments. Neuron. 2004;41(6):983–93. https://doi.org/10.1016/S0896-6273(04)00107-2.

    Article  CAS  PubMed  Google Scholar 

  106. Gupta S, Gupta A. Dealing with Noise Problem in Machine Learning Data-sets: A Systematic Review. Procedia Comput r Sci. 2019;161:466–74. https://doi.org/10.1016/j.procs.2019.11.146. The Fifth Information Systems International Conference, 23-24 July 2019, Surabaya, Indonesia.

  107. Idrees I, Bellato A, Cortese S, Groom MJ. The effects of stimulant and non-stimulant medications on the autonomic nervous system (ANS) functioning in people with ADHD: A systematic review and meta-analysis. Neurosci Biobehav Rev. 2023;144:104968. https://doi.org/10.1016/j.neubiorev.2022.104968.

    Article  CAS  PubMed  Google Scholar 

  108. Michelini G, Norman LJ, Shaw P, et al. Treatment biomarkers for ADHD: Taking stock and moving forward. Transl Psychiatry. 2022;12. https://doi.org/10.1038/s41398-022-02207-2.

Download references

Acknowledgements

Not applicable.

Funding

This research received no external funding.

Author information

Authors and Affiliations

Authors

Contributions

AP, CP, CT and GV contributed to the conceptualization and methodology of the study, and AP, CP, DA, and GV contributed to the coordination, monitoring and execution of the study. CT, DA, and PF designed and performed the data curation, processing and statistical analyses. The original draft was prepared by CT, DA, GV, and PF and all authors provided feedback, edited multiple drafts and reviewed the final manuscript.

Corresponding author

Correspondence to Dimitrios Andrikopoulos.

Ethics declarations

Ethics approval and consent to participate

The studies involving human participants were reviewed and approved by the Ethics Committee of the National Kapodistrian University of Athens, First Department of Psychiatry, Eginition Hospital. The patients/participants provided their written informed consent to participate in this study.

Consent for publication

Not applicable.

Competing interests

CT is employed by Feel Therapeutics Inc, receives a salary, and owns a large share of the company stocks. DA and PF are employed by Feel Therapeutics Inc, receive a salary, and own options in the company. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Andrikopoulos, D., Vassiliou, G., Fatouros, P. et al. Machine learning-enabled detection of attention-deficit/hyperactivity disorder with multimodal physiological data: a case-control study. BMC Psychiatry 24, 547 (2024). https://doi.org/10.1186/s12888-024-05987-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12888-024-05987-7

Keywords