The biological classification of mental disorders (BeCOME) study: a protocol for an observational deep-phenotyping study for the identification of biological subtypes
BMC Psychiatry volume 20, Article number: 213 (2020)
A major research finding in the field of Biological Psychiatry is that symptom-based categories of mental disorders map poorly onto dysfunctions in brain circuits or neurobiological pathways. Many of the identified (neuro) biological dysfunctions are “transdiagnostic”, meaning that they do not reflect diagnostic boundaries but are shared by different ICD/DSM diagnoses. The compromised biological validity of the current classification system for mental disorders impedes rather than supports the development of treatments that not only target symptoms but also the underlying pathophysiological mechanisms. The Biological Classification of Mental Disorders (BeCOME) study aims to identify biology-based classes of mental disorders that improve the translation of novel biomedical findings into tailored clinical applications.
BeCOME intends to include at least 1000 individuals with a broad spectrum of affective, anxiety and stress-related mental disorders as well as 500 individuals unaffected by mental disorders. After a screening visit, all participants undergo in-depth phenotyping procedures and omics assessments on two consecutive days. Several validated paradigms (e.g., fear conditioning, reward anticipation, imaging stress test, social reward learning task) are applied to stimulate a response in a basic system of human functioning (e.g., acute threat response, reward processing, stress response or social reward learning) that plays a key role in the development of affective, anxiety and stress-related mental disorders. The response to this stimulation is then read out across multiple levels. Assessments comprise genetic, molecular, cellular, physiological, neuroimaging, neurocognitive, psychophysiological and psychometric measurements. The multilevel information collected in BeCOME will be used to identify data-driven biologically-informed categories of mental disorders using cluster analytical techniques.
The novelty of BeCOME lies in the dynamic in-depth phenotyping and omics characterization of individuals with mental disorders from the depression and anxiety spectrum of varying severity. We believe that such biology-based subclasses of mental disorders will serve as better treatment targets than purely symptom-based disease entities, and help in tailoring the right treatment to the individual patient suffering from a mental disorder. BeCOME has the potential to contribute to a novel taxonomy of mental disorders that integrates the underlying pathomechanisms into diagnoses.
Retrospectively registered on June 12, 2019 on ClinicalTrials.gov (TRN: NCT03984084).
The lack of biological validity of the current classification systems of mental disorders, namely the World Health Organization’s (WHO) International Classification of Diseases (ICD-10)  and the American Psychiatric Association’s Diagnostic and Statistical Manual of Mental Disorders (DSM-5) , is considered to be one of the major reasons why psychiatry has made little progress in translating biomedical research findings into clinical practice. The past three decades have been marked by tremendous technological advances in basic scientific disciplines such as genomics and imaging. However, unlike in other medical disciplines (e.g., oncology), these new developments have not yet reached patients with mental health problems. The rapid gain in biological knowledge has not improved the understanding, diagnosis or treatment of mental disorders (e.g. ).
Why is it so difficult for psychiatry to translate new (neuro)-biological research findings into clinical applications? It has been argued that our current diagnostic classification approach and the way it defines mental disorders hinders the translation of biological knowledge into the clinic . Currently, the diagnosis of a mental disorder is based on predominantly self-reported symptoms (e.g., feeling sad). It does not rely on any biological or etiological information. Diagnostic criteria only require the presence of a certain number of symptoms over a defined period of time and that the symptoms cause clinically significant impairment in daily life functioning.
There are several attributes of these symptom-based disease categories that hinder causal research for mental disorders: 1) Disorder categories are not clearly separated from each other. Comorbidity is the rule and not the exception. In the Netherlands Study of Depression and Anxiety (NESDA), for example, 75% of individuals with a lifetime diagnosis of depression also met criteria for an anxiety disorder and among those with an anxiety disorder, 81% fulfilled criteria for depression  raising the question of whether these two disorders share common causes. 2) Different symptom profiles can lead to the same diagnosis. A patient complaining about markedly diminished interest, weight gain, hypersomnia, fatigue and diminished ability to concentrate would receive a diagnosis of major depression, as would a patient presenting an almost opposite pattern of symptoms (depressed mood, weight loss, insomnia, psychomotor agitation, feelings of inappropriate guilt, suicide attempt). 3) Assigning a mental diagnosis is a categorical yes/no decision that depends on partly arbitrary thresholds. A person reporting only four impairing depressive symptoms (instead of the required five) would not be diagnosed with depression but nevertheless share many similarities with a person meeting the threshold diagnosis. 4) Symptoms overlap between different classes of disorders. For example, psychotic features can be part of schizophrenia, major depression or bipolar disorder but the underlying pathophysiological mechanism may be the same. In sum, the high degree of comorbidity, heterogeneity, categorical threshold definitions and overlapping symptoms across disorders underline that symptom-based disorder categories do not constitute useful concepts for biomedical research into the causes of mental disorders (for an in-depth review of diagnostic systems see ).
Therefore, the National Institute of Mental Health (NIMH) as well as the biomedical workgroup of the EC-funded Roadmap for mental health (ROAMER) have called for new paradigms and ways to classify mental disorders in biomedical research in order to promote the goal of precision or stratified medicine in psychiatry [4, 7]. Transdiagnostic approaches in psychiatry such as the NIMH Research Domain Criteria (RDoC) have been developed in response to this call . Instead of grouping patients into classes by their reported symptoms, RDoC classifies according to dysregulations in major domains of human functioning that are highly relevant for psychopathology (e.g. motivational, emotional, cognitive systems as well as social behavior) (RDoC webpage: ).
The term RDoC is a direct reference to RDC, the Research Diagnostic Criteria that revolutionized psychiatric classification systems in the late seventies. When the RDC were proposed , the most pressing problem of psychiatry was that mental diagnoses were not clearly defined. Through specifying observable symptoms and criteria for mental disorders, RDC, as all editions of the DSM thereafter [2, 11,12,13,14], helped to create a common language in psychiatry. In fact, these descriptive classification systems laid the foundation for the scientific investigation of mental disorders. Before the DSM-III , the first DSM edition incorporating RDC, mental disorders were ill-defined and hardly investigable phenomena. Nonetheless, the expectation that a purely descriptive classification approach that does not rely on a specific theory would lead to the identification of the neurobiological underpinnings of mental disorders has not been fulfilled, as almost four decades of research have shown. Numerous biological correlates of mental disorders are now known, but symptom-based categories of mental disorders map poorly onto dysfunctions in brain circuits or neurobiological pathways. Patients with different disease mechanisms requiring different treatments may be diagnosed with the same disorder and patients with different diagnoses may share the same pathophysiology.
While the RDC aimed at overcoming the lack of reliability of mental diagnoses, the aim of RDoC-like approaches such as the Biological Classification of Mental Disorders (BeCOME) study is to identify biology-based disease categories. The long-term aim of BeCOME is to contribute to a novel taxonomy of mental disorders that integrates the underlying pathomechanisms into diagnoses. In BeCOME, patients with various affective, anxiety and stress-related mental disorders and controls are comprehensively (neuro) biologically characterized on two consecutive days. Assessments comprise genetic, molecular, physiological, neuroimaging, neurocognitive, psychophysiological and psychometric measurements. This multilevel information set will be used to identify data-driven biologically-informed categories of mental disorders . The focus of BeCOME lies on stress-related mental disorders.
Overview and setting
BeCOME is an observational and exploratory study that was initiated in 2015 by the Max Planck Institute of Psychiatry (MPIP) in Munich, Germany. It collates the Institute’s translational and clinical research infrastructure for the combined collection of deep phenotype and omics data in order to gain a better understanding of the biological basis of mental disorders. The core of the study consists of the comprehensive cross-sectional characterization of patients with depressive, anxiety and stress-related mental disorders and healthy individuals in basic motivational, emotional, cognitive and regulatory processes as well as stress arousal on two consecutive days. Patients who continue treatment at the MPIP (making up approximately one third of the total patient sample), are asked to take part in brief follow-up examinations around study days 14, 28 and 56, when they come in for their regular appointments, in order to evaluate the stability of basic phenotypic and biological parameters. External patients and healthy participants are not prospectively examined. For them the study ends at study day 2. The study was approved by the local Ethics Committee of the Ludwig Maximilians University, Munich, Germany, and written informed consent is obtained from all participants. Data have so far been collected only at the MPIP, but further sites may be invited to participate. The study is conducted in accordance with the Declaration of Helsinki.
In order to include individuals with varying degrees and a broad range of mental disorders from the anxiety and depression spectrum and to achieve adequate participant enrolment to reach target sample size, we employ the following recruitment strategies:
Eligible patients seeking treatment in one of our outpatient clinics are asked to participate and informed about the study by the treating physician.
Additional patients are recruited through a cooperation network with surrounding psychiatric and psychotherapy practices. Our collaboration partners either distribute study flyers in their waiting rooms and/or actively address the study to eligible patients.
Patients and healthy volunteers are also recruited though the MPIP website providing information about BeCOME and other ongoing studies that need participants.
We also approach potential patients and healthy volunteers through advertisements in print/social media and distribute flyers at local mental health events, meetings of anxiety and depression self-help groups, pharmacies etc.
The way of recruitment is documented which allows for the determination of the sampling probability for each group of participants as well as the examination of systematic biases between differently recruited groups of participants. In the case of systematic biases, we will adapt our recruiting strategy accordingly and for example additionally approach individuals randomly selected from the residents’ registration office. The current sample size comprises a total of 307 participants who went through study days 1 and 2. The proportion of patients recruited through our outpatient clinic amounts to 22.5% (N = 91). Independent of the participants’ self-referral as patient or healthy volunteer, case status is ascertained with a fully standardized diagnostic interview (DIA-X/M-CIDI). Approximately 16% of the sample do not meet criteria for DIA-X/M-CIDI diagnosis at a subthreshold or threshold level and can be considered as super healthy controls .
First contact with the study team is made either through self-referral in response to advertisements (e.g. webpage, flyers etc.) or through referral of a clinician from the MPIP clinic or a collaborating mental health practice. All patients and healthy volunteers expressing interest in participating receive (usually via email) the informed consent form which contains a detailed description of the study assessments and are asked to complete an online screening questionnaire checking eligibility criteria via secure email transfer. Since the major study assessments are rather sensitive to the influence of psychotropic substances (e.g. psychophysiological, neuroimaging and omics markers) and the study was designed to focus on neurobiological dysregulations associated with the disease status, inclusion criteria had to be kept strict in regard to the consumption of psychotropic substances (see Table 1). Any individuals with acute schizophrenia or psychotic symptoms as well as current eating disorders are excluded from the study, in order to minimize the confounding influence of medical conditions other than affective and anxiety disorders. If all criteria for study entry are met (see Table 1) and the consent form has been read, participants are invited to further appointments at the MPIP and receive their study time schedule via email.
Study time schedule
Figure 1 (left side) depicts the time schedule for a BeCOME participant. Study participation consists of three visits at the MPIP. The inclusion visit takes part approximately one to 2 weeks before study day 1. The assessment times on study day 1 and 2 are the same for all participants, while the schedule of the inclusion visit can be flexibly arranged depending on the participant’s availabilities. Approximately 2 days before each study visit, participants receive an email reminder or a phone call.
Inclusion visit (D0)
During the inclusion visit (D0), participants are informed in detail about the study by a staff psychologist or physician. They receive information on the purpose of the research, the study assessments, handling of data and their right to withdraw from study participation at any time. A staff physician examines whether there are any magnet resonance imaging (MRI) contraindications. When all study-related questions are clarified and the willingness for participation still exists, participants are asked for their informed written consent. For study inclusion, they need to agree to being informed in the case of medically relevant incidental blood or MRI findings that are indicative of a brain disorder, general medical problems, or an increased neurovascular risk. Study participation is not associated with any risks and not expected to produce any harms. Some of the study procedures (e.g. questions related to trauma exposure or stress test on day 2) may lead to emotional distress. Participants are informed about this and informed that they can consult a doctor or psychologist from the outpatient clinic during their study visits in case they need it (in addition to the psychiatric consultation on study day 2). Research assistants with direct patient contact are trained in how to respond to an emotional crisis and equipped with an emergency plan. In case of an emotional crisis, the participant is brought to the outpatient clinic. After participants have given their written informed consent, they are asked to wear an actigraph until the morning of study day 2. They also receive two paper and pencil questionnaires (for the assessment of sociodemographic factors and loss events) and an online link to a battery of computerized questionnaires with the request to complete these questionnaires by study day 1. The duration of the informed consent procedure is about 1 h. After a break, the fully structured diagnostic interview (DIAX/M-CIDI) is administered by a trained study assistant (approximately duration 2 to 3 h). Depending on the time schedule of the participant, the interview can also be moved to the end of study day 2.
Study days 1 and 2 (D1 and D2)
The major study assessments and experimental procedures take place on two consecutive days called study day 1 (D1) and study day 2 (D2). D1 takes from 08:15 am until 03:00 pm and consists of the following parts (see also Fig. 1):
Blood draw (fasting) for clinical laboratory parameters as well as plasma and blood containing the anticoagulant ethylenediamine tetraacetic acid (EDTA) for the extraction of DNA, RNA and for the isolation of peripheral blood mononuclear cells (PBMCs). PBMCs are stored so that they can be used for the generation of induced pluripotent stem cells;
Measurement of weight, height, waist-hip-ratio, blood pressure;
Psychophysiology (part 1);
Neuroimaging (part 1);
For the detection of heartrate variability, a mini-electrocardiogram is applied for 18 h.
On D2 participants undergo part 2 of psychophysiology and neuroimaging. D2 ends with an optional psychiatric consultation in the MPIP outpatient clinic. All participants receive a compensation of 150 Euros for their time and effort and get vouchers for breakfast and lunches on D1 and D2.
Only patients who are treated at the MPIP come in for repeated blood draws (for DNA, RNA, plasma) and psychometric assessments on study days 14, 28 and 56. At 4 and 12 months after study inclusion, MPIP patients are asked for a follow up of psychometric assessments.
The study combines psychometric data with biological measures. Particular emphasis is placed on the investigation of pathophysiological changes connected to the stress response in mental disorders. Biological markers include the study of DNA, RNA and proteins (extraction from blood sampling) as well as the performance in functional MRI (fMRI), psychophysiological and neuropsychological tasks. The following assessment domains are covered in BeCOME and described in detail below:
In this study, we plan to include a number of omics-based biomarkers from peripheral blood for patient stratification and grouping. The possible levels of investigation include genetics, epigenetic measures with DNA methylation, non-coding RNAs but also other epigenetics marks such as histone modifications as well as proteomics, gene expression and metabolomics. For these measures, we carry out blood draws at baseline as well as at follow-up visits that include EDTA blood for DNA extraction (genome-wide genotyping and DNA methylation), Pax-Gene RNA tubes for messenger RNA (mRNA) expression as well as small non-coding RNAs, serum and plasma for proteomics and metabolomics. Plasma can also be used to assess microRNAs circulating in exosomes. Finally, only at the baseline visit, we collect peripheral blood mononuclear cells using Biocoll separation. At least 30 Mio. cells are stored for each individual and these can be used as a source tissue for induced pluripotent stem cell programming as well as functional assays in live mononuclear cells. Cells are stored with a protective medium containing 10% dimethyl sulfoxide (DMSO) at cryogenic temperatures (storage below − 130 °C) in the gas phase of nitrogen.
All samples are stored in our biobanking unit (http://www.psych.mpg.de/1495662/bioprep). As omics assays evolve rapidly, the exact method for each of the assessments will only be selected once assays can be run collectively in several hundred individuals. Processing large number of samples together will reduce batch effects.
The acquisition of neuroimaging data in the BeCOME study is organized in two sessions that take place on D1 and D2 in a 3 Tesla MRI scanner (Discovery MR750, General Electric, Milwaukee, U.S.A.) using a 32-channel coil (see Fig. 2 for overview of the MRI procedures and Table 2 for sequence details). Both MRI sessions begin at defined times to minimize circadian influences. For paradigms that require active participation, explanations and instructions are given by experienced MR technicians, including a training session on a personal computer outside the MRI scanner. Instructions are standardized, but may be varied and expanded to guarantee that the paradigm and task has been understood. For the MRI session, all paradigms were programmed and delivered using Presentation (Neurobehavioral Systems, Berkeley, USA), displayed an MRI-compatible 30″ LCD (OptoStim, 1680 × 1050 pixels, 250 Hz, medical research GmbH, Cologne, Germany), and combined with a Lumina response box (LS-Line, Cedrus, San Pedro, U.S.A.). Participants wear noise cancellation headphones and ear plugs during the whole MRI session (Optoacoustics, Moshav Mazor, Israel).
The course of the MRI measurements on D1 (total average time spent in the scanner on day 1: ~ 70 min) is characterized by alternating functional and anatomical acquisitions, in order to grant pauses in-between functional scans and to minimize task carry-over effects. All day 1 fMRI measurements are accompanied by eye-tracking/pupillometry (EyeLink 1000 Plus, SR Research, Ottawa, Canada) (250 Hz sampling rate) for which a short calibration session is required after the participant’s positioning in the scanner. In addition, respiration and heart rate are monitored using GE’s respiration belt and plethysmography. Shortly before tasks are started, participants are verbally reminded of the instructions through the headphones. Sequences of day 1 are as follows:
High resolution T1-weighted image: This sequence serves as a main anatomical reference and as basis for morphometric studies.
Resting state functional MRI (rs-fMRI) using a whole brain echo planar imaging (EPI) sequence over 6:28 min, with a black fixation cross against a light-grey screen. The instruction for this task is: “Please lie as still as possible and fixate on the crosshairs. Try not to fall asleep. Eye-blinking is allowed.”
High contrast single spin-echo planar imaging to support optimal spatial post-processing of functional time series.
Time estimation task: This fMRI task focuses on processing of positive, negative and ambiguous feedback [19, 20]. It requires the participant to repeatedly estimate a time span of one second, starting when a fixation cross disappears, and then to press a response button. During this paradigm a total of 84 stimuli is presented in about 9:20 min, depending on the participant’s performance. The participant’s response triggers the immediate presentation of one of three graphical feedback symbols which are isoluminant and shown for 1500 ms: The feedback comprises either a green tick for ‘correct’, indicating sufficiently accurate approximation of one second inside the target time window, or a red cross for ‘incorrect’, indicating an answer outside the target window, or a black question mark for ‘uncertain’, concealing a clear feedback. The uncertain feedback is given in 30% of the trials. After each trial, the duration of the target time window is adapted to achieve a balanced proportion of successful and unsuccessful trials. The intertrial interval varies between 1800 and 2000 ms during which an empty screen is shown. The fixation cross is shown for 400 to 600 ms.
Axial FLAIR sequence is acquired to support the screening for incidental findings and to allow for the segmentation of WM lesions.
Reward anticipation task based on a monetary incentive delay task . Again, stimuli, which are isoluminant to the background, are used to avoid the light reflex for pupillometric readouts. One trial consists of the projection of one of three graphical, abstract symbols for 6 s. These symbols announce three different types of trials: type 1, a trial that allows to win a small amount of money; type 2, a trial with verbal feedback on the performance but no monetary incentive; type 3, a trial with no response requested. After the two reward conditions (type 1 and 2), the symbols are followed by a short white screen flash (100 ms) to which the participant should react as fast as possible by pressing a button. After another 1000 ms, a symbol indicating either win of money (€), fast performance (✓) or too slow response (X) is shown for 1500 ms, followed by an update of the gain balance shown for 2000 ms. An adaptive algorithm adjusting the allowed response time window ensures that participants will succeed in approximately 50% of their responses across the session. Intertrial intervals vary between 3000 and 6000 ms during which a fixation cross is shown. For a full description of our adaption of the original task by Knutson et al. , see Schneider et al. .
Whole brain diffusion tensor imaging (DTI) and auxiliary files to allow for distortion correction procedures are acquired as basis for microstructural analyses.
Verbal n-back task: This task reliably elicits working memory circuits in neuroimaging studies [23, 24]. A total of 8 randomized blocks (each comprising 16 stimuli over 40 s) of the type 0-back, 1-back, 2-back and fixation are displayed. Before each block, the respective instruction is displayed for 6 s. Stimuli are small and capital letters displayed for 500 ms with an additional 1000 ms during which answers are collected. A pause of 1 s is interposed before the next stimulus.
On day 2, participants perform two further functional tasks, again after specific instructions and preparations. Saliva is sampled upon arrival at the MR scanner and repeatedly during the second paradigm to assess the endocrine stress response. In addition, a peripheral venous cannula is set up in a subset of participants to repeatedly sample blood during the second paradigm. After positioning on the MRI table, magnetic resonance compatible sensors for electrocardiogram (Ag/AgCl multitrodes, EasyCap, Herrsching, Germany), pulse plethysmography (Nonin 8604D pulse oximeter, Nonin Medical Inc., Plymouth MN, USA) and SCL (two Ag/AgCl electrodes; left index and middle finger; Brain Products GmbH, Gilching, Germany) are attached and calibrated in order to record autonomous nervous system signals (sampled and stored using the BrainVision ExG AUX Box, BrainVision ExG MR Amplifier, and BrainVision Recorder software 1.0, Brain Products GmbH, Gilching, Germany). Of note, the instructions for the second paradigm, a psychosocial stress test, only contains vague information that the task is to perform mental arithmetic while being monitored and evaluated. Details on the aversive feedback elements are not communicated prior to the experiment. In consequence, a detailed debriefing is performed after the entire experiment during which the false and knowingly aversive elements of the feedback (mainly under-average performance) are uncovered. Functional tasks of day 2:
Face matching task: This task is an adaptation of the paradigm originally presented by Hariri et al.  that is widely used to study emotional face processing . We implemented a version with a mixed block/event-related design, consisting of 8 blocks over a total of 9:35 min, during which either faces need to be matched according to their emotional expression, or, as a control condition, simple geometrical objects need to be matched according to their shape. As face stimuli we use examples from the Ekman faces collection  with neutral, fearful, sad, angry and happy facial expressions, adjusted graphically in terms of size, average luminosity, and with hairstyle or jewelry being masked.
Imaging stress test (IST): This paradigm is an adaptation of the Montreal IST [28, 29]. Before the actual stress experiment, a 6:28 min baseline resting state fMRI is collected. The stress experiment includes three fixed phases of 8 min each referred to as pre-stress, stress and post-stress phase. During each phase, 5 blocks of arithmetic tasks (each 55 s) followed by 45 s of fixation cross are presented. The main instruction is to solve arithmetic problems (including simple additions and multiplications, presented for ~ 4 s [adaptive]) as fast and accurate as possible and to select the correct answer (a number between 0 and 9 on a dial) by using the response box. During the stress phase, psychosocial stress is exerted by aversive feedback (announcements of being watched and monitored; screen elements indicating ostensible under-average performance; repeated, scripted negative verbal feedback). Following the submission of an answer, a mathematically true feedback is printed on the screen after 1.5–3 s, before the next calculus appears. After the 24-min stress experiment, a 30-min recovery phase is appended, with the participant resting on the MRI table outside of the magnet, but autonomous nervous system recordings are continued. Eventually, the participant is positioned back inside the MRI scanner and second resting state fMRI of 6:28 min is acquired. After completing the IST, all devices are removed and the participant is debriefed as explained above. Figure 2 details the time points of parallel blood or saliva collections and subjective ratings.
Anatomical images (sagittal T1-weighted images, axial FLAIR images) are screened by an experienced MRI reader and verified by a board certified radiologist. Standard operation procedures exist how participants and their general practitioners are informed about incidental findings. To meet ethical standards, the general threshold for informing participants is kept low. Only a small percentage of cases with incidental findings (currently< 10%) are excluded from the study post-hoc, e. g. cases with distinct hints towards a neurodegenerative disorder.
Psychophysiology: tasks and procedures
The psychophysiological measurements occur on 2 days. On D1 (~ 10 AM), after the set-up with electrodes and sensors, participants undergo a habituation session. With the simple instruction that this session is a brief habituation session, a startling noise is presented four times and three visual stimuli (geometric forms) are each presented three times. In the fear conditioning session, two conditioned stimuli are followed by aversive, unconditioned stimuli (US) during conditioning (CS+, 75% reinforcement schedule), whereas a safety stimulus (CS–) is not followed by an US. The US follows at stimulus offset and comprise either an electrical shock to the back of the right wrist or an air puff to the larynx , dependent on the preceding CS+. Participants receive the following instruction: “The geometric forms may be followed by mild electrical shocks or air puffs”. During the fear extinction session that immediately follows, the CS+ paired with the electrical shocks is presented without any following shocks, interspersed with CS–. On day 2 (~ 9 AM), all three stimuli are presented again in the recall session: the safety stimulus, the extinguished stimulus and the un-extinguished stimulus (CS+ followed by air puffs); again neither electrical shocks nor air puffs are administered during this run, as its sole focus is on memory recall (of the extinction, fear and safety memory). Participants will receive the same instruction as on the first test day. After the recall session, one unsignaled air puff is administered and the session is continued without any US. This allows the evaluation of any possible differences in the return of fear through reinstatement.
The stimuli consist of simple geometric shapes and are presented for 4 s each, with inter-stimulus-intervals jittered between 12 and 16 s, displayed in a pseudorandom order (≤ 2 consecutive presentations of the same stimulus). Seventy-five percent of trials contain an auditory startle probe at 3.0 or 3.5 s, (104 dB white noise delivered via head phones), whereas behavioral ratings occur before, during and after the psychophysiology recordings on day 1 and day 2. One US consists of electrical shocks which are pulses of 20 ms duration with typical intensities between 3 and 25 mA, generated by a Digitimer Stimulator (Model DS7, Digitimer Ltd., Hertfordshire, United Kingdom). Stimulation intensity is individually titrated before the psychophysiology measurement following a staircase protocol. The initial shock intensity is set at 0.5 mA and 0.5 mA increments are used to find the level at which shocks will be uncomfortable but not painful. The other US is a 9 bar airblast of 250 ms duration (see ) delivered to the larynx from a distance of approximately 1–2 cm.
Skin conductance level and responses (SCR) and heart rate (variability) are assessed with the wireless Electrodermal Activity and Pulse Plethysmogram BioNomadix module with Biopac’s MP150 system. SCR electrodes are placed on the palm of the left hand, the pulse plethysmogram sensor is placed on the left thumb. Participants enter the behavioral ratings to stimuli on a computer keyboard with their right hand.
Eyeblink electromyographic (EMG) responses to startle sounds are measured by two electrodes on the skin surface overlaying the left orbicularis oculi muscle, after thorough preparation of the surface of the skin. A ground electrode is placed behind the left ear. The EMG signal is conducted with a dual Wireless EMG BioNomadix Pair to the MP150 system. The software package Acknowledge 4.1 is used for recording (1000 Hz) and initial preparation of data before export to MATLAB.
Pupillometry is assessed with the Eyelink 1000Plus system via the desktop mount with head support in the blinded laboratory room (artificial light only). After optimizing the pupil detection, initial calibration and validation of eye gaze (to minimize the fixation error), data is recorded of the right eye with a sampling rate of 250 Hz. Participants are instructed to fixate on the fixation cross whenever one is present and stimuli appear small and centered on the screen to minimize differences in pupil size due to gaze direction.
Actigraphy is measured between the screening visit and the morning of D2 (approximately 14–28 days) by ActiSleep Monitors (Actigraph, Pensacola FL) or Daqtometers (Versions 2.3 and 2.4, Daqtix, Germany), which are placed on the non-dominant wrist. Devices are set to sample acceleration every second and to store activity counts every 30s as the mean of all samples within the storage interval , the result of which can be compared with subjective daily sleep logs.
Heart rate variability (HRV) measurements
A bipolar, portable mini-electrocardiogram device (Faros 180°, Biosign GmbH, Ottenhofen, Germany) is applied after the end of the fMRI measurements on D1. Controlled recordings are obtained during resting (5 min) and deep breathing (1 min). The Valsalva and orthostasis tests are optional. The device is worn by the participant until the next morning (~for 18 h overnight). The device is removed and the data are read out on a standard PC in the morning of D2. Electrocardiogram samples have a resolution of 500 Hz, allowing for precise peak detection, and accompanying accelerometer data have a resolution of 50 Hz. Data analysis is programmed in-house in Matlab (Pan Tompkin’s algorithm) and R, supported by open-source packages (HRVR), and covers semi-automated artifact correction, automated peak detection and generation of basic standard heart rate variability parameters.
Neuropsychology: tasks and procedures
The cognitive test battery includes tests from the Test of Attentional Performance (TAP) , the “Materialien und Normwerte für die neuropsychologische Diagnostik” (MNND) , the Wechsler Adult Intelligence Scale (WAIS-IV) , as well as the Trail Making Test (TMT) , the d2-R , and the “Mehrfachwahl-Wortschatztest” (MWTB) . Three basic domains of attention, executive functioning, and memory are assessed with the following tests:
Episodic memory (MNND): A brief text is read out loud to the subject, who is instructed to memorize the content. The subject is required to reproduce the text in as much detail as possible. Reproduced contents (verbatim and analogous) are assessed immediately after the presentation (short delay) as well as 30 min later (long delay).
Working memory (2-back task, TAP): A series of numbers are presented on a screen. The participant is required to respond with a button press whenever a number is the same as the number second to last. Reaction time, omission and commission errors are recorded.
Inhibitory control (GoNogo task, TAP): In this task, the symbols “X” and “+” are presented in alternating sequence on a screen. The participant is required to respond with a button press upon appearance of “X” (Go trial), but not “+” (Nogo trial).
Cognitive flexibility (TAP): A pair of stimuli, consisting of a number and a letter, is presented on a screen. The subject is required to respond with either a button press on the right or the left side depending on the required response. For example, for the first pair, the button needs to be pressed on the side of the letter, for the next pair on the side of the number, and so forth. Errors and reaction time are assessed.
Word fluency (MNND): The participant is asked to produce as many words as possible beginning with the letter “S” during three minutes.
Sensitivity to interference (Stroop, MNND): The participant is presented with three consecutive templates. The first template shows colored circles and the participant is required to name the color of each circle as fast as possible. On the second template, the participant is shown words in differently colored font and is asked to name the color of each word as fast as possible. On the third template, colored words (e.g., “blue” or “green”) are presented to the participant in differently colored fonts. Again, the participant is required to name the color of the word as fast as possible. Processing time, Stroop mistakes (reading the color word on the third template) and other mistakes are assessed.
Attention/flexibility (TMT): In this test, 25 circles are distributed over a sheet of paper. In the first part (TMT-A), the circles are numbered from 1 to 25 and the participant is required to draw lines to connect the numbers in ascending order. In the second part (TMT-B), the circles include both numbers and letters and the subject is required to connect the circles in alternating ascending order (1-A-2-B etc.). Processing time for TMT-A and TMT-B as well as the ratio (TMT-B/A) are registered.
Attention (d2-R): This is a measure to assess the ability to concentrate. The participant is presented with 14 lines (each 20 s) of characters, of which one character (the letter “d” with two dashes) needs to be marked. All other characters (b, q, p) or the letter “d” associated with one or three dashes need to be omitted. Concentration ability is calculated from the number of processed items and processing speed.
Crystallized intelligence (MWTB): The MWTB is a choice vocabulary test of verbal intelligence, developed to match the construct of crystallized intelligence. Participants have to indicate the real word in a row of five words. The number of correctly answered items (37 in total) provides an estimate of crystallized intelligence.
The neuropsychological test battery is administered by trained research personnel (Psychology students and medical technical assistants). The training consists of observations of five test sessions and being observed in five test session by trained research personnel. Each assistant is then observed at least once by a trained neuropsychologist before being admitted as assessor of neuropsychology data.
Probabilistic social reward learning task
In order to assess implicit social learning and cue integration, we employ an established probabilistic social reward learning task . In this computational modeling task, participants decide between one of two cards with varying winning probabilities. At the center of the screen, a face of a computer-generated avatar is presented. At the beginning of each trial, the face looks towards one of the two cards. The probability of the gaze providing a helpful advice is systematically manipulated independent of the changing winning probabilities of the cards. After the saccade, the participant is asked to choose a card and wait for the feedback in which the outcome (correct/wrong) is presented. Both cards are associated with reward values. When a choice was correct, the reward value of the chosen card is added onto a cumulative score, which is updated in the feedback phase. In the instruction, the participants are informed that the winning probabilities of each card would change during the experiment. However, no explicit information is given with respect to the social cue. Behavioral responses of this task will be modelled using hierarchical generative models  in order to estimate the hidden states that govern the learning process about the card and gaze probabilities, respectively. Applying the model also allows to individually estimate the extent to which participants are integrating the social information during their decision making process. After the screening visit, participants receive an online link in order to get to an internet-based version of the paradigm, which was implemented in PsyToolkit [39, 40] (https://www.psytoolkit.org). This task consists of 120 successive trials followed by a brief questionnaire to measure the subjective exploitation of social and non-social information.
Self-report measures: interviews, rating scales and questionnaires
A shortened and slightly modified lifetime and 12-month version of the computer-assisted Munich-Composite International Diagnostic Interview (DIA-X/M-CIDI) [41, 42] is used to assess symptoms, syndromes and diagnoses of the following mental disorders according to DSM-IV : nicotine use and dependence (section B), anxiety disorders (panic attacks, panic disorder, agoraphobia, specific phobias, generalized anxiety disorder; section D), depressive episodes and dysthymia (section E), mania and bipolar disorders (section F), psychoses (section G), alcohol use/disorders (section I), obsessive-compulsive disorders (section K), illegal substance use/disorders (section L) and post-traumatic stress disorder (section N). The M-CIDI additionally collects information on onset, duration and severity of mental disorders. The M-CIDI was developed on the basis of the World Health Organization’s CIDI version 1.2  to additionally cover ICD-10 criteria. Psychometric properties of the DIA-X/M-CIDI have been reported elsewhere [44,45,46]. The interview is conducted face-to-face by trained study assistants who are regularly supervised by a staff psychologist and undergo repeated training in the administration of the DIA-X/M-CIDI. Each interview undergoes a plausibility check according to a standard procedure.
Observer-administered rating scales
Past-week severity of depressive symptoms is rated with the Montgomery-Åsberg Depression Rating scale (MADRS) (, German version: ). In order to ensure interrater reliability, raters use a structured interview guide (, own translation into German) for gathering the information needed for coding the MADRS.
The Panic and Agoraphobia Scale (PAS) by Bandelow  is administered to collect information on past-week occurrence of panic attacks, agoraphobic avoidance, anticipatory anxiety, disability due to panic and agoraphobia and functional avoidance. Each assessor is receiving training in the administration of the PAS scale.
In order to continuously ensure the quality of data from observer-based ratings, raters undergo regular training sessions and are supervised by a staff psychologist. Before being allowed to conduct the first interview by themselves, trainees are required to observe an experienced rater administering the interview and to conduct an interview with an experienced rater sitting in the session. In both situations, their coded sum score needs to fall within 2 points of the total score of the experienced rater. The training is continued until the criterion is met. Questions and difficulties arising from the application of the instruments are addressed in weekly meetings or immediately by contacting the supervisor. Deviations from the rating protocol are checked in regular interrater reliability meetings and booster training sessions.
Socioeconomic status (SES): Education, occupation, current employment status, household composition/income and social class status of participants and their spouses is assessed with a questionnaire that we developed on the basis of the recommendations by the German Statistical Federal Office . It includes a section for the assessment of the SES of the participant’s family of origin to determine the SES during the forming years of childhood and youth. The questionnaire additionally includes the German version of the Quality of Marriage Index (QMI-D) (, original version: ) and allows for the determination of a social prestige/status index according to international standards .
The following depression- and anxiety-related symptom measures are used for a deeper dimensional characterization of participants:
The State-Trait-Anxiety Inventory (STAI , German version: ) with the sum score of the state anxiety scale (A-State, form X1) representing an intensity measure of a transient emotional state characterized by tension, uneasiness, nervousness, fear of future events and arousal of the autonomic nervous system and the sum score of trait anxiety (A-Trait, form X2) indicating a general anxiety predisposition that is stable over time.
The Body Sensations Questionnaire (BSQ) and the Agoraphobic Cognitions Questionnaire (ACQ) developed by Chambless et al.  for the assessment of the fear of fear (German version: ). The ACQ comprises three subscales: agoraphobic cognitions, loss of control and physical concerns.
The Intolerance-of-Uncertainty Scale (IUS) (, German version: ) assesses reactions to ambiguous situations, uncertainty and future events and is listed in the RDoC matrix as a self-report measure for potential threat.
The Behavioral Inhibition and Behavioral Approach System (BIS/BAS) scales to assess approach motivation and individual differences in the sensitivity to reward and punishment (, original version: ). The newest version of the RDoC matrix lists BIS also as self-report measure for the RDoC domain potential threat.
A four-item questionnaire (IE-4) is used to measure locus of control (LOC) , the personal belief about whether life is controllable by own actions (internal LOC) or by external factors outside one’s influence (external LOC). LOC is assessed because it might have a strong influence on reward learning and valuation.
For a categorical and dimensional self-assessment of DSM-IV personality disorders, we use the “Assessment of DSM-IV Personality Disorders” (ADP-IV) by Schotte et al. (, German version: ). The ADP-IV specifically asks for personality features that cause stress, problems and social conflicts.
(d) Aspects of social processes are measured with the following questionnaires:
The Relationship Scales Questionnaire (RSQ) (, German version: ) was chosen for the assessment of attachment style because it operationalizes several theoretical concepts of attachment theory and it can be completed by singles. The RSQ allows for the dimensional assessment of four attachment prototypes (secure, preoccupied, dismissing-avoidant, fearful-avoidant) according to the four-category model of Bartholomew and Horowitz . The RSQ comprises the 18 items of the Adult Attachment Scale (AAS ) from which the three subscales closeness, dependency and anxiety in relationships can be derived. The RSQ can also be scored in regard to the two dimensions attachment-related avoidance and attachment-related anxiety [74, 75].
Interpersonal pleasure is measured with the German Translation of the Anticipatory and Consummatory Interpersonal Pleasure Scale (ACIPS) [Skala der erwarteten und vollendeten zwischenmenschlichen Freude, translated by K. Kirst, University of Wisconsin-Madison] . Four factors (general social interactions, close relationships, shared interests and experiences as well as family-related interactions) can be derived from the ACIPS .
The size of the social network is estimated with the Social Network Questionnaire (SNQ)  that has been derived from the social contact circle interview . The SNQ asks participants to list the names of all individuals (household members, family members, friends, colleagues, neighbors, others) with whom they were in contact within the past 4 weeks.
Empathy Quotient (EQ) and Autism Quotient (AQ) are measured with the questionnaires originally developed by Baron-Cohen et al. (EQ: , German translation: , AQ: , German translation: ).
Exposure to trauma, critical life events and coping strategies: The German version of the childhood trauma questionnaire (CTQ)  by Wingenfeld et al.  is used to assess sexual, physical and emotional abuse as well as physical and emotional neglect experiences during childhood. The CTQ is listed as a self-report measure for the RDoC domain “sustained threat” . For collecting more detailed information on traumatization during childhood and adolescence (up to age 18) in terms of exposure frequency during certain age periods and past/current impact of the event, we additionally administer the short inventory for the assessment of early traumatic life events (K-IFTL). The K-IFTL is a modified version of the Early Trauma Inventory (ETI , German version: ) and covers the domains physical punishment/assaults, unwanted sexual experiences and general traumas (e.g., death of a close friend). We additionally added a section on emotional traumas to the K-IFTL, as emotional traumas are the most frequent childhood traumas in our patient population. Lifetime exposure to accidents, natural disasters, criminal acts, physical assaults and sexual traumas together with information on age at traumatization and exposure duration is assessed within the post-traumatic stress disorder (PTSD) section of the DIA-X/M-CIDI [41, 42]. The trauma list N1 was modified to include occurrence frequency, duration age at exposure for each trauma category. Information on current (past 6 months) exposure to negative life events is collected with a modified version of the Munich event list (MEL by , described in detail in ). Since the RDoC domain “loss” was not completely covered by our environmental measures but is of importance for the development of depressive and anxiety disorders, we developed a questionnaire for assessing experiences of loss over lifetime. This self-developed loss event questionnaire (LEQ) assesses the frequency, past/current impact and age at occurrence of several loss events such as death of parents, close attachment figures, siblings, spouses, children, close friends, separation/divorce of parents, separation/divorce from spouse/partner as well as severe (life-threatening) illnesses of loved ones.
The following stress coping strategies are assessed with a validated German stress coping inventory (Stressverarbeitungsfragebogen-78 item version, SVF78, ): Play down, guilt denial, substitutional satisfaction, situation control, reaction control, positive self-instruction, need for social support, active avoidance, flight tendency, rumination, resignation, self-accusation. The SVF78 additionally allows for the differentiation between stress-reducing (positive) and stress-augmenting (negative) strategies. For the measurement of psychosocial stress-resistance we used the 11 item version of the resilience scale (RS-11) (, German version: ). The RS-11 is a one-dimensional scale that conceptualizes resilience as a protective personality factor.
Each questionnaire (except for the loss event and the SES questionnaire) was computerized and collected with an online survey tool (collector 2015.Q2 by survalyzer). The battery of questionnaires is split into three packages to reduce the load associated with filling in the questionnaires. For each package, participants receive a separate online link after the screening visit.
In MPIP patients, the MADRS, BDI-II, PAS and STAI-X1 are repeatedly assessed at each in-house study visit (days 14, 28 and 56) to assess the course of symptoms over time.
A biomedical research portal (software CentraXX by Kairos, Germany) will be used for the secure storage and management of data. The software includes a biobanking and a clinical trial management system supporting a quality-assured execution of the study. Data are entered in eCRFs (electronic case report forms) with automatic checks whether values fall within the valid range. Incomplete eCRFs are flagged. Every eCRF is checked by a second data entry assistant in regard to transferring errors and inconsistencies. Only complete and quality controlled data (flagged green) are released.
Data confidentiality and security are enforced through several mechanisms. Access to the database is password secured and restricted by role-based rights. The visibility of confidential data and the type of activity an individual user is allowed to undertake is regulated by his/her role in the study and privileges associated with the personal login data. The security architecture of the research portal implements German data protection laws.
The principal investigator (EBB) and coordinators of the study (TB, VS, PS, SL, AE) have full access to the dataset. Access to the data for research purposes is granted by the PI upon request. Topics for publication need to be approved by the PI and communicated with the coordination team who oversee authorship regulations. Study participants, who signed that they want to be informed about publications arising from the study, receive an email notification when a publication appears. For data protection reasons, the full data set including omics data cannot be made publicly available. Access to data of published results will be granted upon request.
The overall aim of the study is to integrate multi-level information across all assessment domains and to yield biology-based subclasses of mental disorders by using cluster analytical techniques. Previous attempts to identify biomarkers and biotypes of mental disorders have failed to identify reproducible biomarkers. So far, these studies have included fewer assessment domains, such as only neuroimaging, neurotransmitter measurements or self-report data (see e.g. [94,95,96]). Given the complexity of mental disorders that associate with domains like personality, life history as well as environmental, behavioral, neurobiological, and genetic factors, it is reasonable to assume that heterogeneity in mental disorders can be best understood by considering as wide a range of modalities and data sources as possible. The comparative utility of different data modalities has been examined in studies seeking to predict disease outcomes (e.g. ), but has not been adequately addressed with regard to more general biomarker discovery for mental disorders. To ensure that the rich data collected in this study will be used in the most beneficial way, the core analyses of this study will be planned and carried out in alignment with expert recommendations regarding development of biomarkers for psychiatric disorders [98, 99].
Two of the primary concerns that need to be addressed in our analysis approach are the integration of information from multiple data sources and the ratio of observations to possible variables to be examined. To this end, the core assessment domains - omics, neuroimaging, psychophysiology, neuropsychology, and self-report measures – will first be analyzed separately to reduce the dimensionality of the feature space. These partial analyses will include feature extraction (e.g. PCA) and feature selection steps that will inform the input into a multi-modal unsupervised learning analysis. For example, data-driven approaches will be combined with expert-based feature selection (e.g. evidence from the literature, meta-analyses) in order to include domain knowledge in the selection process. The goal of these dimensionality reduction steps is to reduce the number of input variables from multiple assessment domains to a small set of validated and generalizable features. These reduced feature set will be used as input into an unsupervised learning pipeline incorporating subsampling and cross-validation steps to guard against overfitting. An appropriate clustering approach will then be chosen based on the outcome from the previous data integration steps. Depending on the dimensionality of the final feature set, ensemble learning will be considered as a possibility for separating the contribution of the different assessment domains. This may be particularly beneficial when considering that previous research has shown domains such as self-report questionnaire assessments to account for a much larger portion of variance than omics or neuroimaging data in explaining behavioral or disease phenotypes (e.g. in alcohol use: [97, 100, 101]). By constructing separate models for different data domains, a “drowning out” of weaker effects that nevertheless contribute unique variance to a model may be counteracted.
While the prior reduction of the feature space is an important step in reducing the risk of overfitting, the suitability of our sample size for recovering existing effects is nevertheless of concern . Thus far, no results from comparable studies examining affective or anxiety disorders have been reported, making it difficult to estimate possible effect sizes and determine case numbers accordingly. Therefore the determination of the case number was guided by studies with a comparable research agenda (e.g. B-SNIP study ). We plan to include at least 1000 patients or individuals affected by a mental disorder and 500 individuals without a mental disorder according to the DIA-X/M-CIDI. These sample sizes are comparable to the case numbers Clementz et al. used for the identification of psychosis biotypes  and are also informed by examinations of the changes in retrievable effects in prediction studies with low signal-to-noise data and varying sample and feature set sizes . The feature reduction steps are used to keep the feature-to-observation ratio in a balance. Since the partial analyses and other projects resulting from data collected in this study will precede our main analysis, we will track the participants included in each prior examination of the data and use this information to partition the dataset into a training set including all participants previously examined and a test set including only those participants untouched in prior analyses. Consequently, any models developed based on data collected in this study will be tested on a held-over sample to avoid overfitting due to in-sample model validation .
Only few studies exist that have tried to identify biological subtypes of affective and anxiety disorders. Most of these studies were limited by relatively small sample sizes and a more narrow focus on only one specific biological factor (for a review see ). The BeCOME study tries to overcome the shortcomings of previous subtyping research by
the recruitment of a much larger sample of patients as well as controls;
using a multilevel assessment and in depth-phenotyping approach (including a broad range of omics, neuroimaging, psychophysiological, cognitive and self-report data)
and by the application of experimental paradigms for the induction of responses in basic behavioral, emotional or motivational systems known to be disturbed in affective and anxiety disorders.
The activation of disorder-related systems allows for the collection of dynamic data which will carry more information about pathophysiological processes underlying the disorder than a baseline only characterization. Table 3 illustrates how the instruments and measures of the BeCOME study have been selected to characterize basic domains of functioning aligned with RDoC domains and constructs that are relevant for the psychopathology of stress-related mental disorders. The BeCOME study was explicitly designed as a multilevel assessment and deep-phenotyping study which resulted in an extensive and time-intensive research project. Participants need to invest approximately 16 h of their time for completing all study assessments and need to be free on two consecutive weekdays. These high time demands and the strict inclusion criteria likely leads to a selection bias which limits the generalizability of results. Another challenge of the study is the integration and reduction of data across multiple assessment levels. Below, we first discuss how each level of assessment can contribute to this task and then give examples for how different measures map to RDoC-like construct and the horizontal integration of RDoC constructs across tasks and measurement levels.
We will perform genome-wide genotyping followed by genotype imputation  of all participants and this information can be used to determine polygenic risk scores for each individual based on large genome-wide association studies (GWAS) of common psychiatric disorders, but also medical conditions or physiological, biological or lab measures, including for example scores derived from the GWAS from large imaging studies in ENIGMA, immune parameters or body mass index. In addition, we plan to map functional genetic scores based on expression or methylation or other quantitative trait loci, as available from public databases (e.g., https://gtexportal.org/home/) or our own resources . Polygenic risk scores may support novel patient stratification by testing overlapping or divergent genetic risk across current diagnostic categories. Previous studies have shown that a higher polygenic risk score for schizophrenia is associated with more mood incongruent psychotic symptoms in patients with bipolar disorder  or that patients with features of atypical depression have a polygenic score predictive of higher BMI . Besides genetics, we also aim to use other omics data to refine patient stratification. Metabolomics in peripheral blood, both at baseline as well as with antidepressant treatment have already been shown to associate with disease severity and future treatment response in depression [111, 112]. Measure of epigenetic features, such as DNA methylation may allow assessment of features driven by both, genetic as well as environmental factors . Peripheral blood microRNAs are emerging as promising biomarkers and some may correlate with the function in specific neurotransmitter systems  or exposure to chronic stress . Methodological developments, e.g., for proteomics, will allow the assessment of these parameters at higher sensitivity, decreased cost and input requirement.
Finally, the development of machine learning tools allowing the integration of a number of biomarker measures will be necessary to fully leverage the power of these methods. Such methods are increasingly used in other areas of medicine  but also in psychiatry .
Structural and functional neuroimaging
The structural and functional MRI assay has been composed mainly under two perspectives: First, to provide a broad coverage of brain imaging data suitable to study and deep-phenotype the depression/anxiety spectrum, and second, feasibility of the protocol, particularly in terms of its tolerability by patients. Both aspects have led to a protocol that is distributed over two sessions on two different days at baseline. Here, the first, longer session is characterized by an alternation between structural and functional measurements to allow the participant for sufficient pauses between task engagements.
The MRI assay delivers a robust basis for an extraction of macrostructural, microstructural and functional markers in the sense of a circuit-directed phenotyping . It comprises state-of-the art anatomical sequences including diffusion tensor imaging for morphometric studies and to derive measures of structural connectivity (‘structural connectomics’). The task battery has been aligned to RDoC schemes in that key functional domains are covered: Brain circuits underlying positive and negative valence are addressed by several paradigms. First, an established face matching task [25, 26] for which a large body of literature has demonstrated relevance to affective and anxiety disorders (e.g. ) and that is suitable to map the amygdala-centered fear circuitry. Second, reward processing is approached by an incentive monetary delay task that focusses on reward anticipation and robustly recruits ventral striatal and salience network responses . Third, neural responses to positive, negative and ambiguous feedback are reliably elicited by the time estimation task that originates in the feedback-related negativity, a known event-related potential in response to negative feedback . Negative and ambiguous feedback can trigger self-related processing of the brain, and abnormal responses to performance feedback were found to predict neuroticism traits  and anhedonia . With regard to cognitive domains, working memory (WM) functions can be robustly mapped using a verbal n-back paradigm . WM deficits are seen under acute stress  and are common in depression where they tend to persist beyond acute episodes [121, 122]. WM dysfunction broadly impacts other executive functions such as planning, decision making and problem solving , and the elucidation of its neurobiological basis is thus important. Eventually, arousal and stress regulatory systems are studied explicitly by the IST during which psychosocial stress (operationalized as aversive evaluation during performance of mental arithmetic) is exerted. The IST is a multimodal advancement of the Montreal IST [28, 29], characterized by pre-stress, stress and post-stress phase to allow for studies on the deflection and recovery of the stress response system at different levels (fMRI, endocrine, autonomous nervous system, and molecular such as gene expression). Its multiple layers can be exploited for combined analyses (e.g., autonomous nervous system-informed fMRI analyses, imaging genomics) or deliver separate, complementary readouts.
Resting state fMRI, an area that has strongly expanded over the last 10 years since its entry into the neuroimaging field (see reviews [123, 124]), is obtained on D1 and D2, here before and after the IST. This allows for extracting functional connectivity (FC) measures including within-subject-stability measures and stress-induced changes. Homogenization and in part automatization of fMRI preprocessing are first steps towards a systematic data-to-information breakdown. Next, the extraction of carefully selected, task-specific, regional BOLD amplitude contrasts is needed, before aggregation with other modalities and data-driven definition of clinical subtypes can be applied. Given the rich phenotypes and depth within each domain and per subject, a large sample is needed to avoid overfitting.
The psychophysiological tests addressing fear learning and recall will include eye blink startle EMG and SCR to relate our findings to other samples and the current body of literature. In addition to that we have incorporated pupillometry into our analyses as a sensitive readout for differential fear learning , and we have validated this comparatively novel readout against the state-of-the-art in an initial study on healthy participants . Incidentally, this work revealed that pupil dilation closely matched subjective US expectancy ratings and did not habituate like startle EMG or SCR. Slow pupil dilations seem to more closely reflect valence-unspecific emotional arousal like SCR , whereas studies on startle EMG indicate that this measure captures valence-aspects of fear learning to a larger extent . The recall session on D2 will allow the evaluation of fear and extinction memory consolidation, with an additional procedure of presenting two unsignaled USs to assess reinstatement of fear. With these readouts and sessions, we aim to disentangle various physiological and cognitive/affective processes, including non-associative learning (habituation), differential fear learning and uncertainty. Classical and Bayesian statistical analyses will be applied in order to optimize robust extraction of parameters for such processes at the individual level, which can then be related to the other data modalities. As an outlook, the sessions will be extended to a virtual reality (VR) setting in order to assess fear behavior in addition to physiological and subjective responses.
Cognitive functions are known to be impaired in most patients with psychiatric disorders . Cognitive deficits can precede the onset of psychiatric symptoms and often persist beyond acute episodes, indicating at least partial independence from other symptoms. For example, cognitive dysfunctions were found to persist in patients whose depressive symptoms had remitted [130,131,132,133,134,135]. Notably, however, different psychiatric diagnoses are associated with similar patterns of impairment, which underlines the lack of biological validity of current classification systems. Dysfunctions mostly occur in executive functions , but also arise in attention and memory functions . They have been suggested to be mediated by stress-related neuropeptides  and shown to be associated with network dysfunctions [137,138,139]. However, it appears that cognitive changes can also co-vary with affective changes. For example, improvements in verbal memory, verbal fluency and psychomotor speed. Improvements in mood were most closely related to improvements in verbal memory, verbal fluency and psychomotor speed appeared most closely related to improvements in mood, whereas attention and executive function remained impaired across treatment . Importantly, cognitive impairments have been shown to affect the return to work and relapse frequency.
Only few studies to date have investigated cognitive functions as possible predictors of severity of illness and therapy outcome, as well as how they could contribute to optimal therapy selection [141, 142].
Different mechanisms might be at play in younger and older populations. Late-life depression, for example, is related to possible transition to neurodegenerative disorders . Furthermore, depression predicts the worsening of cognitive function in older age and better pre-treatment cognition, particularly verbal memory, is associated with better treatment response in late-life depression . We aim to elucidate the role of cognitive functions, alone and in conjunction with the broad variety of other measures applied.
Computational modeling task for assessing social reward learning
We employ a probabilistic social reward learning task, which involves learning about the changing winning probabilities of two cards. In addition, a social cue in form of a computer-generated face is presented which would, in every trial, give advice on which card to choose. We are interested in the computations that drive the learning and decision-making process for these two sources of information. The framework of predictive coding or Bayesian inference provides generative learning models of how the brain combines previously learned information with newly observed data in a Bayesian optimal manner . Fitting such computational models to behavioral data acquired in our task, allows us to estimate individual approximations to Bayes-optimality, that is the estimation of subject specific parameter values that govern the learning process and extent to which participants are integrating the social information during their decision making process. Recent work in the burgeoning field of computational psychiatry has pointed to an aberrant learning process that manifests itself in error-prone inferences or conclusions when we are trying to make sense of the world. For instance, Browning, Behrens, Jocham, O'Reilly & Bishop  showed that trait anxiety was associated with impaired learning about environmental volatility, i.e., the change in probabilistic relationships over time. DeBerker et al.  applied the Hierarchical Gaussian Filters  to demonstrate that emotional and physiological stress reactions are tightly linked to the subjective uncertainty when learning about the probabilistic relationship between visual cues and electric shocks. Applying similar modelling approaches, we hope to gain a better understanding of the computational commonalities and or differences associated with different patient cohorts with respect to social and non-social information processing.
The selection of self-report measures for BeCOME was guided by three principles: (1) A part of the measures should cover the major domains of human behavior and functioning that have been specified by RDoC and ROAMER as useful constructs for biomedical research in psychiatry (negative/positive valence, cognitive, social processes and arousal/regulatory systems) [7, 148]. However, until today it remains unclear which questionnaires are appropriate measures that reflect specific RDoC domains [7, 148]. In 2015, when BeCOME started, only very few self-report measures had been proposed by the RDoC initiative and some of these measures were later criticized for being misplaced within the original RDoC matrix (e.g., the STAI taps into potential rather than acute threat)  or even removed from the RDoC matrix (e.g., BAS scales). The final report of a more recent workgroup on tasks and measures for RDoC aiming at compiling a set of standardized behavioral tasks and self-report instruments for the assessment of RDoC constructs again lacks recommendations for self-report measures . For some RDoC constructs, appropriate self-report instruments simply do not exist yet. Over the course of the study, we intend to adapt psychometric measurements in BeCOME to new developments in the field.
(2) Despite the study’s focus on (neuro) biological dysregulations, a second guiding principle was that the selected instruments should still allow for a comprehensive symptomatic characterization of patients. We administer a structured diagnostic interview in order to be able to contrast new biological subtype findings with traditional DSM−/ICD-10 diagnoses . For the discovery of new phenotypic targets for biomarker research, we intend to perform data-driven clustering and other sub-phenotyping strategies on self-report measures and integrate the findings with biological results. RDoC has already stimulated research into symptom-based alternatives to traditional DSM categories, which led to the identification of phenotypes that show stronger links with biomarker findings. For example, using a data-driven approach, Grisanzio et al. (2018) identified symptom subtypes that map onto cognitive functions, electroencephalographic and behavioral measures . By referring to the RDC criteria for subtyping schizophrenic, schizoaffective and bipolar patients, Allardyce et al.  showed a dose-response relationship between polygenic risk for schizophrenia and the presence of mood-incongruent psychotic symptoms.
(3) Another major focus of BeCOME is the determination of lifetime exposure to traumatic and adverse events in terms of frequency, duration, subjective impact/severity and age at exposure. When necessary, we adopted existing environmental risk measures to include temporal and severity aspects. The environmental information will be related to epigenetic data in order to identify gene-environment signatures.
Horizontal integration within functional domains across tasks: an outlook
An important asset of our multilevel investigation is the opportunity to examine RDoC-like domains not only within one type of measures but across several types of assessments allowing for a horizontal integration of these constructs across tasks and measurement levels. Three examples of such possible horizontal integration are given below.
The fourth version of the RDoC matrix, published in May 2018, (https://www.nimh.nih.gov/research-priorities/rdoc/constructs/rdoc-snapshot-version-4-saved-5-30-18.shtml) categorizes fear learning paradigms and social stress tests as part of the acute threat construct of the negative valence domain. In our study, fear learning and recall tasks are administered in the psychophysiology laboratory whereas the adaptation of the Montreal IST takes place within the MR-scanner (with continuous blood and/or saliva sampling). This allows within-subject comparisons of psychophysiological anticipatory responses to acute threat within a fear learning framework (electric shocks, air puffs) to the more social evaluative acute threat during the stress test on multiple levels. Some measurements such as pulse plethysmography and skin conductance response recordings are obtained during all of these tasks - providing a direct connection, whereas the tasks combined provide a coherent multi-level assessment of the acute stress construct from circuits to physiology, behavior and self-report. In all tasks, we can also map the correlation with polygenic or epigenetic (DNA methylation or microRNA) measures associated with this construct in prior human or animal experiments, allowing integration of molecular information.
The construct of the arousal and regulatory systems domain will also be captured throughout various tasks and levels, resulting in multiple parameter estimates that can be hierarchically and individually related. Arousal-related processes, for instance hyperarousal, can be quantified by a reduced habituation in the standard psychophysiological measures (e.g., startle responses, SCR) in the psychophysiology lab , but also by pupil fluctuations during resting-state fMRI, which are robustly related to activity in the so-called salience network (dACC and bilateral insula, among others) . Such parameter estimates across domains can be related to other relevant levels, including sleep fragmentation as assessed by actigraphy or hyperarousal as obtained through the structured CIDI-interview.
The cognitive domain provides another instance of horizontal integration, as working memory is assessed both in the MR-scanner and in the neuropsychology laboratory, resulting in behavioral and imaging readouts. Moreover, in the scanner we employ simultaneous pupillometry to assess cognitive load and relate this to a cognitive load specific activity patterns. These physiological, imaging and behavioral data can be related to self-report measures regarding behavioral inhibition and approach as well as to other cognitive control tasks (e.g., response inhibition). These can also be mapped to relevant polygenic risk scores, such as for hippocampal volume for example.
Summary and outlook
The information collected in BeCOME spans many levels from omics, cellular and imaging data to psychophysiological parameters as well as self-reported symptoms of mental disorders, personality traits and lifetime exposure to trauma and other environmental risk factors. For a more dynamic in-depth phenotyping, BeCOME additionally applies several validated paradigms to experimentally induce a response in a basic system of human functioning (e.g., fear system, stress or reward system) that is frequently disturbed in patients with affective, anxiety and other stress-related mental disorders. The response to these stimulations is read out across multiple levels (mostly with psychophysiological, imaging and stress measurements). The dynamic read-outs can again be related back to basic omics data, environmental risk experiences and other behavioral data.
Extracting information from these multilevel measures using big data approaches, including machine learning methods, may lead to the identification of biology informed diagnoses that could convey information on the specific therapeutic needs of an individual according to their specific dysfunctional pattern. The clinical implication of such results can be illustrated by the previously described example of two patients with opposite symptom patterns but with the same major depression diagnosis. In the future, these two major depression cases might fall within two different disorder classes, one that is mainly characterized by dysregulations in the reward system and another one that is driven by dysfunctions in stress response and the fear system. Such differentiated patient profiles transport more information for an optimized treatment selection than the rather imprecise DSM-based diagnosis of major depression. Besides optimizing the allocation of existing treatments to an individual patient, biology-based subtypes may also lead to the discovery of novel treatment targets and stimulate the development of new pharmaceutical treatments. The overall aim of BeCOME is to contribute to a biology-informed taxonomy of mental disorders that points out the underlying disease mechanism and with it the treatment target. For example, first analyses of the BeCOME reward task and pupillometry data suggest that physiological disturbances in arousal upregulation, when anticipating a reward, might constitute an underlying pathophysiological process of depression symptomatology. Hence, the upregulation and maintenance of arousal during reward anticipation might be a translational process that could prove relevant for stratification of patients, treatment development, and tracking of drug target engagement (Schneider M, Elbau IG, Nantawisarakul T, Pöhlchen D, Brückl T, Erhardt A, et al: The eyes are a window to depression: reduced pupil dilation during reward anticipation in depression, under review). A further gain in knowledge of the biological profile of mental disorders might pave the way for an objective assessment of mental disorders with a validated array of omics and behavioral measures.
Recruitment for the BeCOME study was ongoing at the time this manuscript was submitted.
Availability of data and materials
The datasets generated and/or analysed during the current study contain clinical data and are not publicly available due to the protection of participants’ rights to privacy and data protection but are available from the corresponding author on reasonable request.
Alcohol use disorder identification test – consumption questions
Biological classification of mental disorders
Diagnostic and statistical manual of mental disorders
Functional magnet resonance imaging
International classification of diseases
Imaging stress test
Max Planck Institute of Psychiatry
Magnet resonance imaging
Research diagnostic criteria
Research domain criteria
Skin conductance responses
Time estimation task
World Health Organization. The ICD-10 classification of mental and behavioural disorders: clinical descriptions and diagnostic guidelines. Geneva: World Health Organization; 1992.
American Psychiatric Association. Diagnostic and statistical manual of mental disorders (5th ed.; DSM-5). Washington, DC: American Psychiatric Association; 2013.
Kapur S, Phillips AG, Insel TR. Why has it taken so long for biological psychiatry to develop clinical tests and what to do about it? Mol Psychiatry. 2012;17(12):1174–9.
Insel T, Cuthbert B, Garvey M, Heinssen R, Pine DS, Quinn K, et al. Research domain criteria (RDoC): toward a new classification framework for research on mental disorders. Am J Psychiatry. 2010;167(7):748–51.
Lamers F, van Oppen P, Comijs HC, Smit JH, Spinhoven P, van Balkom AJ, et al. Comorbidity patterns of anxiety and depressive disorders in a large cohort study: the Netherlands study of depression and anxiety (NESDA). J Clin Psychiatry. 2011;72(3):341–8.
Clark LA, Cuthbert B, Lewis-Fernandez R, Narrow WE, Reed GM. Three approaches to understanding and classifying mental disorder: ICD-11, DSM-5, and the National Institute of Mental Health's research domain criteria (RDoC). Psychol Sci Public Interest. 2017;18(2):72–145.
Schumann G, Binder EB, Holte A, de Kloet ER, Oedegaard KJ, Robbins TW, et al. Stratified medicine for mental disorders. Eur Neuropsychopharmacol. 2014;24(1):5–50.
Cuthbert BN. Research domain criteria: toward future psychiatric nosologies. Dialogues Clin Neurosci. 2015;17(1):89–97.
NIH. Research Domain Criteria (RDoC). Retrieved March 18, 2019, from https://www.nimh.nih.gov/research/research-funded-by-nimh/rdoc/index.shtml. 2019.
Spitzer RL, Endicott J, Robins E. Research diagnostic criteria: rationale and reliability. Arch Gen Psychiatry. 1978;35(6):773–82.
American Psychiatric Association. Diagnostic and statistical manual of mental disorders (3rd ed.; DSM-III). Washington, DC: American Psychiatric Association; 1980.
American Psychiatric Association. Diagnostic and statistical manual of mental disorders (3rd ed., revised; DSM-III-R). Washington, DC: American Psychiatric Association; 1987.
American Psychiatric Association. Diagnostic and statistical manual of mental disorders (4th ed.; DSM-IV). Washington, DC: American Psychiatric Association; 1994.
American Psychiatric Association. Diagnostic and statistical manual of mental disorders (4th ed., text rev.; DSM-IV-R). Washington, DC: American Psychiatric Association; 2000.
Insel TR, Cuthbert BN. Brain disorders? Precisely Sci. 2015;348(6234):499–500.
Pöhlchen D, Leuchs L, Binder FP, Blaskovich B, Nantawisarakul T, Topalidis P, Brückl TM, Norrholm SD, Jovanovic T, BeCOME working group, Spoormaker VI. No robust differences in fear conditioning between patients with fear-realted disorders and healthy controls. Behav Res Ther. 2020;129:103610.
Bush K, Kivlahan DR, McDonell MB, Fihn SD, Bradley KA, Project ACQI. The AUDIT alcohol consumption questions (AUDIT-C) - an effective brief screening test for problem drinking. Arch Intern Med. 1998;158(16):1789–95.
Gual A, Segura L, Contel M, Heather N, Colom J. Audit-3 and audit-4: effectiveness of two short forms of the alcohol use disorders identification test. Alcohol Alcohol. 2002;37(6):591–6.
Becker MPI, Nitsch AM, Miltner WHR, Straube T. A single-trial estimation of the feedback-related negativity and its relation to BOLD responses in a time-estimation task. J Neurosci. 2014;34(8):3005–12.
Hirsh JB, Inzlicht M. The devil you know: neuroticism predicts neural response to uncertainty. Psychol Sci. 2008;19(10):962–7.
Knutson B, Fong GW, Adams CM, Varner JL, Hommer D. Dissociation of reward anticipation and outcome with event-related fMRI. Neuroreport. 2001;12(17):3683–7.
Schneider M, Leuchs L, Czisch M, Samann PG, Spoormaker VI. Disentangling reward anticipation with simultaneous pupillometry / fMRI. NeuroImage. 2018;178:11–22.
Karlsgodt KH, Bachman P, Winkler AM, Bearden CE, Glahn DC. Genetic influence on the working memory circuitry: behavior, structure, function and extensions to illness. Behav Brain Res. 2011;225(2):610–22.
Owen AM, McMillan KM, Laird AR, Bullmore E. N-back working memory paradigm: a meta-analysis of normative functional neuroimaging studies. Hum Brain Mapp. 2005;25(1):46–59.
Hariri AR, Tessitore A, Mattay VS, Fera F, Weinberger DR. The amygdala response to emotional stimuli: a comparison of faces and scenes. NeuroImage. 2002;17(1):317–23.
Fusar-Poli P, Placentino A, Carletti F, Landi P, Allen P, Surguladze S, et al. Functional atlas of emotional faces processing: a voxel-based meta-analysis of 105 functional magnetic resonance imaging studies. J Psychiatry Neurosci. 2009;34(6):418–32.
Ekman P, Friesen WV. Pictures of Facial Affect. Palo Alto: Consulting Psychologists Press; 1976.
Dedovic K, Renwick R, Mahani NK, Engert V, Lupien SJ, Pruessner JC. The Montreal imaging stress task: using functional imaging to investigate the effects of perceiving and processing psychosocial stress in the human brain. J Psychiatry Neurosci. 2005;30(5):319–25.
Pruessner JC, Declovic K, Khalili-Mahani N, Engert V, Pruessner M, Buss C, et al. Deactivation of the limbic system during acute psychosocial stress: evidence from positron emission tomography and functional magnetic resonance imaging studies. Biol Psychiatry. 2008;63(2):234–40.
Jovanovic T, Keyes M, Fiallos A, Myers KM, Davis M, Duncan EJ. Fear potentiation and fear inhibition in a human fear-potentiated startle paradigm. Biol Psychiatry. 2005;57(12):1559–64.
Winnebeck EC, Fischer D, Leise T, Roenneberg T. Dynamics and Ultradian structure of human sleep in real life. Curr Biol. 2018;28(1):49–59 e5.
Zimmermann P, Fimm B. A test battery for attentional performance. In: Leclercq M, Zimmernann P, editors. Applied Neuropsychology of Attention Theory, Diagnosis and Rehabilitation; 2002. p. 110–51.
Balzer C, Berger J-M, Caprez G, Gosner A, Gutbrod K, Keller M. Materialien und Normwerte für die neuropsychologische Diagnostik MNND. Rheinfelden: Normdaten; 2011.
Wechsler D. WAIS-IV administration and scoring manual. San Antonio: Psychological Corporation; 2008.
Brickenkamkamp R, Schmidt-Atzert L, Liepmann D. d2-R. test d2 - revision. Göttingen: Hogrefe; 2010.
Lehrl S. Mehrfachwahl-Wortschatz-Intelligenztest MWT-B (5.). Balingen: Spitta Verlag; 2005.
Sevgi M, Diaconescu AO, Henco L, Tittgemeyer M, Schilbach L. Social Bayes: using Bayesian modeling to study autistic trait-related differences in social cognition. Biol Psychiatry. 2020;87(2):185–93.
Mathys CD, Lomakina EI, Daunizeau J, Iglesias S, Brodersen KH, Friston KJ, et al. Uncertainty in perception and the hierarchical Gaussian filter. Front Hum Neurosci. 2014;8:825.
Stoet G. PsyToolkit: a software package for programming psychological experiments using Linux. Behav Res Methods. 2010;42(4):1096–104.
Stoet G. PsyToolkit: a novel web-based method for running online questionnaires and reaction-time experiments. Teach Psychol. 2017;44(1):24–31.
Wittchen HU, Garczynski E, Holly A, Lachner G, Perkonigg A, Pfütze E-M, et al. Münchener composite international diagnostic interview (M-CIDI) (version 2.2 / 2 / 95). München: Max-Planck-Institut für Psychiatrie, Klinische Psychologie und Epidemiologie; 1995.
Wittchen HU, Pfister H. DIA-X-Interview. Instruktionsmanual zur Durchführung von DIA-X-Interviews. Frankfurt: Swets & Zeitlinger; 1997.
World Health Organization. Composite international diagnostic interview (CIDI). Genf: World Health Organization; 1990.
Wittchen HU, Lachner G, Wunderlich U, Pfister H. Test-retest reliability of the computerized DSM-IV version of the Munich composite international diagnostic interview (M-CIDI). Soc Psychiatry Psychiatr Epidemiol. 1998;33(11):568–78.
Reed V, Gander F, Pfister H, Steiger A, Sonntag H, Trenkwalder C, et al. To what degree does the composite international diagnostic interview (CIDI) correctly identify DSM-IV disorders? Testing validity issues in a clinical sample. Int J Methods Psychiatr Res. 1998;7:142–55.
Wittchen HU. Reliability and validity studies of the who composite international diagnostic interview (Cidi) - a critical-review. J Psychiatr Res. 1994;28(1):57–84.
Montgomery SA, Åsberg M. A new depression scale designed to be sensitive to change. Br J Psychiatry. 1979;134:382–9.
Schmidtke A, Fleckenstein P, Moises W, Beckmann H. Studies of the reliability and validity of the German version of the Montgomery-Asberg Depression Rating Scale (MADRS). Schweiz Arch Neurol Psychiatr (1985). 1988;139(2):51–65.
Williams JB, Kobak KA. Development and reliability of a structured interview guide for the Montgomery Asberg depression rating scale (SIGMA). Br J Psychiatry. 2008;192(1):52–8.
Bandelow B. Assessing the efficacy of treatments for panic disorder and agoraphobia. II. The panic and agoraphobia scale. Int Clin Psychopharmacol. 1995;10(2):73–81.
Bundesamt S. Statistik und Wissenschaft. Demographische Standards. Ausgabe 2010. Eine gemeinsame Empfehlung des ADM Arbeitskreis Deutscher Markt- und Sozialforschungsinitiative e.V., der Arbeitsgemeinschaft Sozialwissenschaftlicher Institute e.V. Wiesbaden: (ASI) und des Statistischen Bundesamtes; 2010. Available from: https://www.destatis.de/DE/Methoden/StatistikWissenschaftBand17.pdf?__blob=publicationFile.
Zimmermann T. Questionnaire for partnership quality: quality of marriage index - German deutsche version ( QMI- D). Verhaltenstherapie. 2015;25(1):51–3.
Norton R. Measuring marital quality - a critical-look at the dependent variable. J Marriage Fam. 1983;45(1):141–51.
Hoffmeyer-Zlotnik JHP. “Status in occupation” as a substitute for an occupational classification to determine social prestige ZUMA. Nachrichten. 2003;27(53):114–27.
Beck AT, Steer RA, Brown GK. Manual for the Beck depression inventory second edition (BDI-II). San Antonio: Psychological Corporation; 1996.
Hautzinger M, Keller F, Kühner C. BDI-II. Das Beck depressions-Inventar II. Revision. Manual: Pearson Deutschland; 2006.
Spielberger CD, Gorsuch RL, Lushene RE. Manual for the state-trait-anxiety inventory. Palo Alto: Consulting Psychologists; 1970.
Laux L, Glanzmann P, P. S, Spielberger CD. STAI. Das State-Trait-Angstinventar. Weinheim: Beltz Test; 1981.
Chambless DL, Caputo GC, Bright P, Gallagher R. Assessment of fear of fear in agoraphobics: the body sensations questionnaire and the agoraphobic cognitions questionnaire. J Consult Clin Psychol. 1984;52(6):1090–7.
Ehlers A, Margraf J. AKV. Fragebogen zu körperbezogenen Ängsten, Kognitionen und Vermeidung. 2., überarbeitete und neunormierte Auflage. Göttingen: Beltz Test GmbH; 2001.
Freeston MH, Rheaume J, Letarte H, Dugas MJ, Ladouceur R. Why do people worry? Pers Indiv Differ. 1994;17(6):791–802.
Gerlach AL, Andor T, Patzelt J. The significance of intolerance of uncertainty in generalized anxiety disorder: possible models and development of a German version of the intolerance of uncertainty scale. Zeitschrift fur Klinische Psychologie und Psychotherapie: Forschung und Praxis. 2008;37(3):190–9.
Cloninger CR, Przybeck TR, Svrakic DM. The tridimensional personality questionnaire - United-States normative data. Psychol Rep. 1991;69(3):1047–57.
Weyers P, Krebs H, Janke W. Reliability and construct-validity of the German version of Cloningers tridimensional personality questionnaire. Pers Indiv Differ. 1995;19(6):853–61.
Strobel A, Beauducel A, Debener S, Brocke B. Eine deutschsprachige Version des BIS/BAS-Fragebogens von Carver und White [A German version of Carver and White's BIS/BAS scales]. Zeitschrift fur Differentielle und Diagnostische Psychologie. 2001;22(3):216-27.
Carver CS, White TL. Behavioral-inhibition, behavioral activation, and affective responses to impending reward and punishment - the BIS BAS scales. J Pers Soc Psychol. 1994;67(2):319–33.
Kovaleva A. The IE-4: Construction and Validation of a Short Scale for the Assessment of Locus of Control. (Ed.): G-L-IfS, editor. Köln: GESIS-Schriftenreihe; 2012. p. 9.
Schotte CKW, De Doncker D, Vankerckhoven C, Vertommen H, Cosyns P. Self-report assessment of the DSM-IV personality disorders. Measurement of trait and distress characteristics: the ADP-IV. Psychol Med. 1998;28(5):1179–88.
Doering S, Renn D, Hofer S, Rumpold G, Smrekar U, Janecke N, et al. Validation of the German version of the Assessment of DSM-IV Personality Disorders (ADP-IV) Questionnaire. Z Psychosom Med Psyc. 2007;53(2):111–28.
Griffin D, Bartholomew K. Models of the self and other - fundamental dimensions underlying measures of adult attachment. J Pers Soc Psychol. 1994;67(3):430–45.
Steffanowski A, Oppl M, Meyerberg J, Schmidt J, Wittmann WW, Nübling R. Psychometrische Überprüfung einer deutschsprachigen Version des Relationship Scales Questionnaire (RSQ). In: Bassler M, editor. Störungsspezifische Therapieansätze - Konzepte und Ergebnisse. Gießen: Psychosozial; 2001. p. 320–42.
Bartholomew K, Horowitz LM. Attachment styles among young adults: a test of a four-category model. J Pers Soc Psychol. 1991;61(2):226–44.
Collins NL, Read SJ. Adult attachment, working models, and relationship quality in dating couples. J Pers Soc Psychol. 1990;58(4):644–63.
Roisman GI, Holland A, Fortuna K, Fraley RC, Clausell E, Clarke A. The adult attachment interview and self-reports of attachment style: an empirical rapprochement. J Pers Soc Psychol. 2007;92(4):678–97.
Simpson JA, Rholes WS, Nelligan JS. Support seeking and support giving within couples in an anxiety-provoking situation - the role of attachment styles. J Pers Soc Psychol. 1992;62(3):434–46.
Gooding DC, Pflum MJ. The assessment of interpersonal pleasure: introduction of the anticipatory and Consummatory interpersonal pleasure scale (ACIPS) and preliminary findings. Psychiatry Res. 2014;215(1):237–43.
Gooding DC, Pflum MJ. Further validation of the ACIPS as a measure of social hedonic response. Psychiatry Res. 2014;215(3):771–7.
Preller KH, Hulka LM, Vonmoos M, Jenni D, Baumgartner MR, Seifritz E, et al. Impaired emotional empathy and related social network deficits in cocaine users. Addict Biol. 2014;19(3):452–66.
Linden M, Lischka A-M, Popien C, Golombek J. The Multidimensional Social Contact Circle--An interview for the assessment of the social network in clinical practical. Zeitschrift fur Medizinische Psychologie. 2007;16(3):135–43.
Baron-Cohen S, Wheelwright S. The empathy quotient: an investigation of adults with Asperger syndrome or high functioning autism, and normal sex differences. J Autism Dev Disord. 2004;34(2):163–75.
de Haen J. Deutsche Version der Cambridge Behaviour Scale. Bochum: Autismo Praxis Autismus Therapie; 2006.
Baron-Cohen S, Wheelwright S, Skinner R, Martin J, Clubley E. The autism-spectrum quotient (AQ): Evidence from Asperger Syndrome/high-functioning autism, males and females, scientists and mathematicians (vol 31, pg 5, 2001). J Autism Dev Disord. 2001;31(6):603.
Freitag CM, Retz-Junginger P, Retz W, Seitz C, Palmason H, Meyer J, et al. Evaluation der deutschen version des Autismus-Spektrum-Quotienten (AQ) - die Kurzversion AQ-k [German adaptation of the autism-Spectrum quotient (AQ): evaluation and short version AQ-k]. Zeitschrift fuer Klinische Psychologie und Psychotherapie. 2007;36:280–9.
Bernstein DP, Stein JA, Newcomb MD, Walker E, Pogge D, Ahluvalia T, et al. Development and validation of a brief screening version of the childhood trauma questionnaire. Child Abuse Negl. 2003;27(2):169–90.
Wingenfeld K, Spitzer C, Mensebach C, Grabe HJ, Hill A, Gast U, et al. The German version of the childhood trauma questionnaire (CTQ): preliminary psychometric properties. Psychother Psych Med. 2010;60(11):442–50.
National Institute of Mental Health. Behavioral assessment methods for RDoC constructs: a report by the National Advisory Mental Health Council Workgroup on tasks and measures for Research Domain Criteria (RDoC). Bethesda: National Institute of Mental Health; 2016.
Bremner JD, Vermetten E, Mazure CM. Development and preliminary psychometric properties of an instrument for the measurement of childhood trauma: the early trauma inventory. Depress Anxiety. 2000;12(1):1–12.
Wingenfeld K, Driessen M, Mensebach C, Rullkoetter N, Schaffrath C, Spitzer C, et al. The early trauma inventory: initial psychometric characteristics of the German version. Diagnostica. 2011;57(1):27–38.
Maier-Diewald W, Wittchen HU, Werner-Eilert K. Die Münchner Ereignisliste (MEL) - Anwendungsmanual. Munich: Max Planck Institute of Psychiatry; 1983.
Friis RH, Wittchen HU, Pfister H, Lieb R. Life events and changes in the course of depression in young adults. Eur Psychiatry. 2002;17(5):241–53.
Ising M, Weyers P, Janke W, Erdmann G. The psychometric properties of the SVF78 by Janke and Erdmann, a short version of the SVF120. Zeitschrift fur Differentielle und Diagnostische Psychologie. 2001;22(4):279–89.
Wagnild GM, Young HM. Development and psychometric evaluation of the Resilience Scale. J Nurs Meas. 1993;1(2).
Schumacher J, Leppert K, Gunzelrnann T, Straus B, Brahler E. Die Resilienzskala - Ein Fragebogen zur Erfassung der psychischen Widerstandsfähigkeit als Personmerkmal [The Resilience Scale - A questionnaire to assess resilience as a personality characteristic]. Zeitschrift fur Klin Psychol Psychiatr Psychother. 2005;53(1):16–39.
Beijers L, Wardenaar KJ, van Loo HM, Schoevers RA. Data-driven biological subtypes of depression: systematic review of biological approaches to depression subtyping. Mol Psychiatry. 2019;24(6):888–900.
Dinga R, Schmaal L, Penninx BWJH, van Tol MJ, Veltman DJ, van Velzen L, et al. Evaluating the evidence for biotypes of depression: Methodological replication and extension of Drysdale et al. (2017). Neuroimage-Clin. 2019;22:101796.
Drysdale AT, Grosenick L, Downar J, Dunlop K, Mansouri F, Meng Y, et al. Resting-state connectivity biomarkers define neurophysiological subtypes of depression. Nat Med. 2017;23(1):28–38.
Whelan R, Watts R, Orr CA, Althoff RR, Artiges E, Banaschewski T, et al. Neuropsychosocial profiles of current and future adolescent alcohol misusers. Nature. 2014;512(7513):185.
Jollans L, Whelan R. Neuromarkers for mental disorders: harnessing population neuroscience. Front Psychiatry. 2018;9:242.
Woo CW, Chang LJ, Lindquist MA, Wager TD. Building better biomarkers: brain models in translational neuroimaging. Nat Neurosci. 2017;20(3):365–77.
Afzali MH, Sunderland M, Stewart S, Masse B, Seguin J, Newton N, et al. Machine-learning prediction of adolescent alcohol use: a cross-study, cross-cultural validation. Addiction. 2019;114(4):662–71.
Squeglia LM, Ball TM, Jacobus J, Brumback T, McKenna BS, Nguyen-Louie TT, et al. Neural predictors of initiating alcohol use during adolescence. Am J Psychiatr. 2017;174(2):172–85.
Button KS, Ioannidis JP, Mokrysz C, Nosek BA, Flint J, Robinson ES, et al. Power failure: why small sample size undermines the reliability of neuroscience. Nat Rev Neurosci. 2013;14(5):365–76.
Tamminga CA, Ivleva EI, Keshavan MS, Pearlson GD, Clementz BA, Witte B, et al. Clinical phenotypes of psychosis in the bipolar-schizophrenia network on intermediate phenotypes (B-SNIP). Am J Psychiatr. 2013;170(11):1263–74.
Clementz BA, Sweeney JA, Hamm JP, Ivleva EI, Ethridge LE, Pearlson GD, et al. Identification of distinct psychosis biotypes using brain-based biomarkers. Am J Psychiatr. 2016;173(4):373–84.
Jollans L, Boyle R, Artiges E, Banaschewski T, Desrivieres S, Grigis A, et al. Quantifying performance of machine learning methods for neuroimaging data. NeuroImage. 2019;199:351–65.
Poldrack RA, Huckins G, Varoquaux G. Establishment of best practices for evidence for prediction: a review. JAMA Psychiatry. 2019:3671.
Marchini J, Howie B, Myers S, McVean G, Donnelly P. A new multipoint method for genome-wide association studies by imputation of genotypes. Nat Genet. 2007;39(7):906–13.
Arloth J, Bogdan R, Weber P, Frishman G, Menke A, Wagner KV, et al. Genetic differences in the immediate Transcriptome response to stress predict risk-related brain function and psychiatric disorders. Neuron. 2015;86(5):1189–202.
Allardyce J, Leonenko G, Hamshere M, Pardinas AF, Forty L, Knott S, et al. Association between schizophrenia-related polygenic liability and the occurrence and level of mood-incongruent psychotic symptoms in bipolar disorder. JAMA Psychiatry. 2018;75(1):28–35.
Milaneschi Y, Lamers F, Peyrot WJ, Abdellaoui A, Willemsen G, Hottenga JJ, et al. Polygenic dissection of major depression clinical heterogeneity. Mol Psychiatry. 2016;21(4):516–22.
Gupta M, Neavin D, Liu D, Biernacka J, Hall-Flavin D, Bobo WV, et al. TSPAN5, ERICH3 and selective serotonin reuptake inhibitors in major depressive disorder: pharmacometabolomics-informed pharmacogenomics. Mol Psychiatry. 2016;21(12):1717–25.
Liu D, Ray B, Neavin DR, Zhang J, Athreya AP, Biernacka JM, et al. Beta-defensin 1, aryl hydrocarbon receptor and plasma kynurenine in major depressive disorder: metabolomics-informed genomics. Transl Psychiatry. 2018;8(1):10.
Klengel T, Binder EB. Epigenetics of stress-related psychiatric disorders and gene x environment interactions. Neuron. 2015;86(6):1343–57.
Issler O, Haramati S, Paul ED, Maeno H, Navon I, Zwang R, et al. MicroRNA 135 is essential for chronic stress resiliency, antidepressant efficacy, and intact serotonergic activity. Neuron. 2014;83(2):344–60.
Volk N, Pape JC, Engel M, Zannas AS, Cattane N, Cattaneo A, et al. Amygdalar MicroRNA-15a is essential for coping with chronic stress. Cell Rep. 2016;17(7):1882–91.
Ding MQ, Chen LJ, Cooper GF, Young JD, Lu XH. Precision oncology beyond targeted therapy: combining Omics data with machine learning matches the majority of Cancer cells to effective therapeutics. Mol Cancer Res. 2018;16(2):269–78.
Yahata N, Kasai K, Kawato M. Computational neuroscience approach to biomarkers and treatments for mental disorders. Psychiatry Clin Neurosci. 2017;71(4):215–37.
Etkin A, Wager TD. Functional neuroimaging of anxiety: a meta-analysis of emotional processing in PTSD, social anxiety disorder, and specific phobia. Am J Psychiatry. 2007;164(10):1476–88.
Mies GW, Van den Berg I, Franken IH, Smits M, Van der Molen MW, Van der Veen FM. Neurophysiological correlates of anhedonia in feedback processing. Front Hum Neurosci. 2013;7:96.
Luethi M, Meier B, Sandi C. Stress effects on working memory, explicit memory, and implicit memory for neutral and emotional stimuli in healthy men. Front Behav Neurosci. 2008;2:5.
Cremaschi L, Penzo B, Palazzo M, Dobrea C, Cristoffanini M, Dell'Osso B, et al. Assessing working memory via N-back task in euthymic bipolar I disorder patients: a review of functional magnetic resonance imaging studies. Neuropsychobiology. 2013;68(2):63–70.
Kerestes R, Ladouceur CD, Meda S, Nathan PJ, Blumberg HP, Maloney K, et al. Abnormal prefrontal activity subserving attentional control of emotion in remitted depressed patients during a working memory task with emotional distracters. Psychol Med. 2012;42(1):29–40.
Brakowski J, Spinelli S, Dorig N, Bosch OG, Manoliu A, Holtforth MG, et al. Resting state brain network function in major depression - depression symptomatology, antidepressant treatment effects, future research. J Psychiatr Res. 2017;92:147–59.
Greicius M. Resting-state functional connectivity in neuropsychiatric disorders. Curr Opin Neurol. 2008;21(4):424–30.
Reinhard G, Lachnit H, Konig S. Tracking stimulus processing in Pavlovian pupillary conditioning. Psychophysiology. 2006;43(1):73–83.
Leuchs L, Schneider M, Spoormaker VI. Measuring the conditioned response: a comparison of pupillometry, skin conductance, and startle electromyography. Psychophysiology. 2019;56(1):e13283.
Leuchs L, Schneider M, Czisch M, Spoormaker VI. Neural correlates of pupil dilation during human fear learning. NeuroImage. 2017;147:186–97.
Lonsdorf TB, Menz MM, Andreatta M, Fullana MA, Golkar A, Haaker J, et al. Don’t fear ‘fear conditioning’: methodological considerations for the design and analysis of studies on human fear acquisition, extinction, and return of fear. Neurosci Biobehav Rev. 2017;77:247–85.
Millan MJ, Agid Y, Brune M, Bullmore ET, Carter CS, Clayton NS, et al. Cognitive dysfunction in psychiatric disorders: characteristics, causes and the quest for improved therapy. Nat Rev Drug Discov. 2012;11(2):141–68.
Doumas M, Smolders C, Brunfaut E, Bouckaert F, Krampe RT. Dual task performance of working memory and postural control in major depressive disorder. Neuropsychology. 2012;26(1):110–8.
Gohier B, Ferracci L, Surguladze SA, Lawrence E, El Hage W, Kefi MZ, et al. Cognitive inhibition and working memory in unipolar depression. J Affect Disord. 2009;116(1–2):100–5.
Reppermund S, Ising M, Lucae S, Zihl J. Cognitive impairment in unipolar depression is persistent and non-specific: further evidence for the final common pathway disorder hypothesis. Psychol Med. 2009;39(4):603–14.
Reppermund S, Zihl J, Lucae S, Horstmann S, Kloiber S, Holsboer F, et al. Persistent cognitive impairment in depression: the role of psychopathology and altered hypothalamic-pituitary-adrenocortical (HPA) system regulation. Biol Psychiatry. 2007;62(5):400–6.
Rock PL, Roiser JP, Riedel WJ, Blackwell AD. Cognitive impairment in depression: a systematic review and meta-analysis. Psychol Med. 2014;44(10):2029–40.
Snyder HR. Major depressive disorder is associated with broad impairments on neuropsychological measures of executive function: a meta-analysis and review. Psychol Bull. 2013;139(1):81–132.
Bangasser DA, Kawasumi Y. Cognitive disruptions in stress-related psychiatric disorders: a role for corticotropin releasing factor (CRF). Horm Behav. 2015;76:125–35.
Gotlib IH, Joormann J. Cognition and depression: current status and future directions. Annu Rev Clin Psycho. 2010;6:285–312.
Gyurak A, Patenaude B, Korgaonkar MS, Grieve SM, Williams LM, Etkin A. Frontoparietal activation during response inhibition predicts remission to antidepressants in patients with major depression. Biol Psychiatry. 2016;79(4):274–81.
Rogers MA, Kasai K, Koji M, Fukuda R, Iwanami A, Nakagome K, et al. Executive and prefrontal dysfunction in unipolar depression: a review of neuropsychological and imaging evidence. Neurosci Res. 2004;50(1):1–11.
Douglas KM, Porter RJ. Longitudinal assessment of neuropsychological function in major depression. Aust Nz J Psychiat. 2009;43(12):1105–17.
Beaudreau SA, Rideaux T, O'Hara R, Arean P. Does cognition predict treatment response and remission in psychotherapy for late-life depression? Am J Geriat Psychiat. 2015;23(2):215–9.
Johnco C, Wuthrich VM, Rapee RM. The influence of cognitive flexibility on treatment outcome and cognitive restructuring skill acquisition during cognitive behavioural treatment for anxiety and depression in older adults: results of a pilot study. Behav Res Ther. 2014;57:55–64.
Singh-Manoux A, Dugravot A, Fournier A, Abell J, Ebmeier K, Kivimaki M, et al. Trajectories of depressive symptoms before diagnosis of dementia a 28-year follow-up study. JAMA Psychiatry. 2017;74(7):712–8.
Story TJ, Potter GG, Attix DK, Welsh-Bohmer KA, Steffens DC. Neurocognitive correlates of response to treatment in late-life depression. Am J Geriat Psychiat. 2008;16(9):752–9.
Stephan KE, Mathys C. Computational approaches to psychiatry. Curr Opin Neurobiol. 2014;25:85–92.
Browning M, Behrens TE, Jocham G, O'Reilly JX, Bishop SJ. Anxious individuals have difficulty learning the causal statistics of aversive environments. Nature neuroscience. 2015;18(4):590.
de Berker AO, Rutledge RB, Mathys C, Marshall L, Cross GF, Dolan RJ, et al. Computations of uncertainty mediate acute stress responses in humans. Nat Commun. 2016;7:10996.
Cuthbert BN. The RDoC framework: facilitating transition from ICD/DSM to dimensional approaches that integrate neuroscience and psychopathology. World Psychiatry. 2014;13(1):28–35.
Watson D, Stanton K, Clark LA. Self-report indicators of negative valence constructs within the research domain criteria (RDoC): a critical review. J Affect Disord. 2016;216:58–69.
Ivleva EI, Clementz BA, Dutcher AM, Arnold SJM, Jeon-Slaughter H, Aslan S, et al. Brain structure biomarkers in the psychosis biotypes: findings from the bipolar-schizophrenia network for intermediate phenotypes. Biol Psychiatry. 2017;82(1):26–39.
Grisanzio KA, Goldstein-Piekarski AN, Wang MY, Rashed Ahmed AP, Samara Z, Williams LM. Transdiagnostic symptom clusters and associations with brain, behavior, and daily function in mood, anxiety, and trauma disorders. JAMA psychiatry. 2018;75(2):201–9.
Schneider M, Hathway P, Leuchs L, Samann PG, Czisch M, Spoormaker VI. Spontaneous pupil dilations during the resting state are associated with activation of the salience network. NeuroImage. 2016;139:189–201.
We would like to thank Stephanie Alam, Miriam El-Mahdi, Gertrud Ernst-Jansen, Carolin Haas, Karin Hofer, Elisabeth Kappelmann, Rebecca Meissner for their help with data collection, study management, the recruitment and screening of participants, Ines Eidner und Anna Hetzel for their assistance with MRI scanning, Sanja Ilic-Cocic and Boris Schmalz for help with informed consent consultations, and Sanja Ilic-Cocic also for helping with specific study assessments. We thank Benedikt Brücklmeier for supporting the implementation of the IST and developing analysis software for IST and mini-ECG analysis, and Pamela Hathway for help in implementing the Reward Anticipation Task, and Aaron Prosser and Yorick Peterse for help in implementing the Time Estimation Task. We would also like to thank Tanja Jovanovic and Seth Norrholm for their contribution to the airblast sartle EMG set-up and Jessica Keverne for language editing. Our special thanks go to all study participants for bringing up the time for participation in the BeCOME study.
The study is funded by the Max Planck Institute of Psychiatry. The funder had no role in the design of the study, the collection of data and in writing the manuscript. The funder will not have any role in the execution of the study, the collection, analyses, interpretation of the data, or decision to submit results.
Ethics approval and consent to participate
The study was approved by the ethics committee of the Ludwig Maximilian University, in Munich, Germany, under the reference number 350–14, and written informed consent was obtained from all participants after participants have read the information material. Protocol modifications (amendments) are reported to the ethics committee of the Ludwig Maximilian University prior to implementation. Biosamples are integrated into the Max Planck Biobank, which has been approved from the same ethics committee, under the reference number 338–15.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Brückl, T.M., Spoormaker, V.I., Sämann, P.G. et al. The biological classification of mental disorders (BeCOME) study: a protocol for an observational deep-phenotyping study for the identification of biological subtypes. BMC Psychiatry 20, 213 (2020). https://doi.org/10.1186/s12888-020-02541-z
- Research domain criteria (RDoC)
- Biology-based taxonomy