Skip to main content

The Binge Eating Genetics Initiative (BEGIN): study protocol



The Binge Eating Genetics Initiative (BEGIN) is a multipronged investigation examining the interplay of genomic, gut microbiota, and behavioral factors in bulimia nervosa and binge-eating disorder.


1000 individuals who meet current diagnostic criteria for bulimia nervosa or binge-eating disorder are being recruited to collect saliva samples for genotyping, fecal sampling for microbiota characterization, and recording of 30 days of passive data and behavioral phenotyping related to eating disorders using the app Recovery Record adapted for the Apple Watch.


BEGIN examines the interplay of genomic, gut microbiota, and behavioral factors to explore etiology and develop predictors of risk, course of illness, and response to treatment in bulimia nervosa and binge-eating disorder. We will optimize the richness and longitudinal structure of deep passive and active phenotypic data to lay the foundation for a personalized precision medicine approach enabling just-in-time interventions that will allow individuals to disrupt eating disorder behaviors in real time before they occur.

Trial registration

The identifier is NCT04162574. November 14, 2019, Retrospectively Registered.

Peer Review reports


Bulimia nervosa (BN: lifetime prevalence of 1.5% in women and 0.5% in men) and binge-eating disorder (BED: lifetime prevalence of 3.5% in women and 2% in men) are common debilitating eating disorders [1]. BN is marked by uncontrollable eating episodes coupled with compensatory behaviors, whereas BED includes similarly defined binge episodes only in the absence of regular compensatory behaviors. Both disorders are highly heritable (41–82% [2,3,4,5,6,7]), carry high psychiatric and somatic comorbidity, and have high medication and healthcare utilization, whether or not comorbid obesity is present [8,9,10,11]. Suicide risk is significantly elevated in both disorders [12].

The Binge Eating Genetics Initiative (BEGIN) is a multipronged research study that 1) examines the interplay of genomic, gut microbiota, and behavioral factors to explore etiology and develop predictors of risk, course of illness, and response to treatment in BN and BED; and 2) optimizes the richness and longitudinal structure of deep passive and active phenotypic data to lay the foundation for a personalized precision medicine approach enabling just-in-time interventions that will allow individuals to disrupt eating disorder behaviors in real time before they occur.


Despite their prevalence and the attendant personal and social costs, research into the genetic underpinnings of BN and BED is essentially absent. BEGIN represents the first contribution to a global effort to amass an adequate sample size to conduct a genome-wide association study of BN and BED in collaboration with the Eating Disorders Working Group of the Psychiatric Genomics Consortium (PGC-ED). The PGC-ED has rapidly advanced the study of the genomics of anorexia nervosa [13, 14] identifying eight significant loci and reporting a panel of genetic correlations suggesting that anorexia nervosa may have both psychiatric and metabolic etiological underpinnings. BEGIN will further the mission of the PGC-ED by launching a parallel investigation into BN and BED.

Intestinal microbiota

Inspired by reports of associations between enteric microbes, host metabolism, and host behavior [15,16,17] along with reported differences between gut microbiota composition from patients with anorexia nervosa and healthy individuals [18,19,20,21,22,23,24,25], we have incorporated the study of the intestinal microbiota into BEGIN. In addition to characterizing the biogeography of the human microbiome (cumulative genomes of the microbiota) of BEGIN participants, our analyses will identify associations between genes and microbial composition in BN/BED. The intention is to better understand the biological mechanisms of these illnesses in an effort to help identify potential drug targets and opportunities for novel interventions.

Deep phenotyping

We are capturing real-time longitudinal digital phenotypic data on individuals with BN/BED that reflect the true complexity of human behavior. Using Apple Watch and iPhone devices, we are collecting active data on binge-eating, purging, nutrition, mood, and cognitions with a widely-used cognitive-behavioral based eating disorder app Recovery Record [26] and passive sensor data via native applications collected over a 30-day period. We will combine active Recovery Record-based measures and passively collected, continuous, sensor-based measurements of autonomic nervous system (ANS) activity and actigraphy to characterize patterns of when and where individuals are more/less likely to binge and/or purge in their daily lives. Finally, across and within individuals, we will identify low-risk and high-risk passive data patterns that will facilitate the prediction of transitions to high risk states signaling impending binge or purge episodes (time-stamped by active app monitoring). This work has the potential to transform the standard of care for BN and BED by transcending current cognitive-behavioral therapy (CBT) approaches typically dependent on retrospective self-report and giving patients a tailored tool that will help them intervene when they need help the most.


Specific aims genomics and microbiota

In 1000 individuals with BN/BED, we will,

Aim 1: Contribute genomic data to the next genome-wide association study (GWAS) conducted by the Eating Disorders Working Group of the Psychiatric Genomics Consortium (PGC-ED) of BN/BED.

Aim 2: Comprehensively characterize the biogeography of the human microbiome using high-throughput sequencing of the microbial 16S rRNA gene and shallow shotgun sequencing.

Aim 3: Employ novel and develop new analytic methods to integrate GWAS, gut microbiota, and phenotypic data that will result in predictive algorithms that index risk, course of illness, severity, disordered eating episodes, and treatment response.

Specific aims digital longitudinal phenotyping

Aim 1: Conduct longitudinal deep phenotyping of 1000 individuals with BN/BED using Recovery Record and Apple Watch.

Aim 2: Predict the occurrence of binge eating and purging (vomiting) episodes in individuals with BN/BED using passive sensor data.

Aim 3: Test theoretically derived regulatory models of binge eating and purging behaviors as reflected in differences in temporal patterns.

Aim 4: Refine our capacity to predict binge and purge episodes by augmenting passive data with contextual factors collected by Recovery Record.


We are recruiting 1000 individuals with BN or BED.

Inclusion criteria

  1. 1)

    Currently meets Diagnostic and Statistical Manual for Mental Disorders -5th Edition (DSM-5 [27]) criteria for BN or BED (confirmed via validated questionnaire in screening instrument—see Measures)

  2. 2)

    Resident of US

  3. 3)

    All sexes

  4. 4)

    Age 18–45 years

  5. 5)

    Reads, speaks English

  6. 6)

    Existing iPhone user with iPhone 5 or later

  7. 7)

    Willing/able to wear Apple Watch for entire study period

  8. 8)

    Willing/able to use Recovery Record for the entire study period

  9. 9)

    Provides informed consent to have activity and self-reported Recovery Record data harvested

  10. 10)


Exclusion criteria

  1. 1)

    Currently pregnant or breastfeeding

  2. 2)

    Bariatric surgery due to the impact on eating patterns, including the following: (Roux-en-Y gastric bypass, laparoscopic adjustable gastric banding, sleeve gastrectomy, duodenal switch with biliopancreatic diversion, gastric balloon, AspireAssist)

  3. 3)

    Current use of hormone therapy

  4. 4)

    Inpatient treatment or hospitalization for eating disorders in the 2-weeks prior to study enrollment

  5. 5)

    Suicidality at screening

  6. 6)

    Antibiotic or probiotic use in the past 30 days (related to fecal sampling).


We are recruiting cases nationally from diverse geographical, socioeconomic, racial, and ethnic backgrounds via Recovery Record, social media and National Eating Disorders Association. Specifically, we launch tweets and Facebook posts that direct potential participants to the BEGIN url where they can take a preliminary screen. In addition, Recovery Record pushes notifications about BEGIN to users. Recruitment flow is detailed in Fig. 1.

Fig. 1
figure 1

BEGIN study recruitment and sampling flow


Informed consent is obtained digitally via the Recovery Record app. Participants complete an eating disorders diagnostic questionnaire. Those who screen case positive and meet all inclusion criteria are offered the opportunity to participate in the full study (with a second digital informed consent). All responses to questionnaires are encrypted and sent to a secure research server at the UNC Sheps Center for Health Services Research using secure transfer methodologies, who compile and house the data in servers specifically designed for Protected Health Information. Data are de-identified (Sheps Center maintains the key to match records). Study data from Recovery Record and the Apple Watch are maintained by Recovery Record and only includes passive and active sources necessary for analyses, minimizing exposure of protected health information. To ensure that a high level of security is maintained, data transfer from Recovery Record occurs with end-to-end encryption and authentication protocols. Records are only identified with a second study number that can be linked using the data from the Sheps Center. Eligible participants are mailed a package containing a description of the study, saliva collection kit, microbiome collection kit, and an Apple Watch. Saliva kits are returned directly to RUCDR Infinite Biologics where they are stored awaiting DNA extraction and genotyping. In Phase 1, participants returned microbiome kits to uBiome for sequencing; in Phase 2, kits are returned to the Carroll lab. Barcodes ensure accurate identification and coordination with phenotypic data. After enrollment and completion of the baseline survey, participants use the Apple Watches and Recovery Record for 30 days and complete midpoint and end-of-study surveys at 14 days and 30 days post-enrollment, respectively, to track progress of eating disorder pathology, including binge eating and purging behaviors.


Deep Phenotyping

Using the Recovery Record app and the Apple Watch over a 30-day period for each individual, we conduct active and passive data capture to fully characterize disordered eating behaviors, physical activity, nutrition, gastrointestinal distress, sleep, and heart rate. This generates exceptional data to enable deep characterization of the course of BN/BED. We expect that the likelihood of an event (i.e., binge/purge) will decrease over the course of 30-days and build this expectation into our statistical models. We further expect that although the likelihood of events will change over time, the dynamics of the events will not. These data can be broken down into four categories. First, self-report questionnaires are collected consisting of scales well established to relate to BN and BED (see Self-report questionnaires), measured prior to enrollment or three times across the study. Second, stratified sample intensive measurements consisting of daily mood and meal records are measured 6 times daily. Third, event contingent intensive measurements ask participants to log binge and purge episodes. Finally, continuous passive data collection captures real-time physiological and movement data. These different data will be integrated through multilevel modeling and systems continuous time modeling procedures [28, 29].

Active data collection

Self-report questionnaires

All BEGIN study participants are screened for eligibility and consented using the Recovery Record iPhone app, which is free for users to download and is HIPAA compliant ( All questionnaires are completed from within the Recovery Record app.

ED100K [30]

The ED100K questionnaire is a self-report, eating disorders assessment based on the Structured Clinical Interview for DSM-5, Eating Disorders Module, administered prior to enrollment. Items assess DSM-5 criteria for anorexia nervosa, BN, BED, and other specified feeding and eating disorders. The ED100K-v1 was found to be a valid measure of eating disorders and behaviors [30]. Positive predictive values indicating that among those who had a positive screening test, anorexia nervosa Criterion B, Criterion C, and binge eating ranged from 88 to 100%. Among women who had a negative screen, the probability of not having these criteria or behaviors ranged from 72 to 100%. The correlation between questionnaire and interview for lowest illness-related BMI was r = 0.91.

Eating disorders examination questionnaire (EDE-Q) [31]

The EDE-Q is a widely used, validated questionnaire capturing eating disorders pathology, including the frequency and severity of binge episodes. The EDE-Q is administered at baseline, midpoint, and endpoint of the 30-day period.

The Patient Health Questionnaire (PHQ-9) [32]

Is a 9-item, self-administered version of the PRIME-MD diagnostic instrument for common mental disorders. The nine items are based on the nine DSM-IV criteria for major depressive disorder and are scored as “0” (not at all) to “3” (nearly every day). The PHQ-9 has been found to be a reliable and valid measure of depression severity. The PHQ-9 is administered at baseline, midpoint, and endpoint of the 30-day period.

The Generalized Anxiety Disorder 7 (GAD-7) [33]

Is a 7-item, self-report questionnaire to screen for generalized anxiety disorder. Each symptom is scored on a 3-point scale: “not at all” (0), “several days” (1), or “more than half the days” (2). Items are then summed to create a symptom severity score. The GAD-7 is a reliable and valid measure of anxiety. The GAD-7 is administered at baseline, midpoint, and endpoint of the 30-day period.

ADHD self-report scale (ASRS) [34]

Is an 18-item questionnaire that assess symptoms associated with attention-deficit/hyperactivity disorder. Items are scored on a 5-point scale. The assessment has high internal consistency and validity [35]. The ASRS is administered at baseline.

Rome III [36]

To assess adult GI symptoms of the stomach and intestines, the relevant section (items 17–67) of the ROME III is administered at baseline.

Stratified sampled intensive measurements

Daily mood and meal records

These data are collected inside the Recovery Record iPhone app that primarily targets adherence to meal monitoring tasks. Participants are prompted with a push notification six times per day corresponding to meal and snack times to complete an evidence-based CBT-style question set (what was eaten, with whom, where, and what behaviors were used) in addition to optional symptom-focused questions including current emotional state, urges to engage in eating disorder behaviors, sleeping patterns, hunger levels, gastrointestinal problems, and intrusive thoughts.

Event contingent intensive measurements

Binge and purge records

Participants are instructed to launch the Recovery Record Apple Watch app if they have experienced a binge or purge episode (Fig. 2). Action buttons are used to quickly identify the relevant symptom and how long ago it occurred, with response options in five-minute increments ranging from “Right now” to “30 mins ago”. If an urge to engage in a behavior is identified, participants are additionally asked to rate the urge strength with response options: “Not at all”, “Slight”, “Moderate”, “Strong”, and “Overbearing”. Actively monitored mood, meal, binge and purge records and their respective timestamps are collected on the Recovery Record platform and shared with the research team via encrypted authenticated TLS. Ecological momentary assessment-based logging has shown moderate to strong concordance with retrospective self-report of binge eating and purging [37].

Fig. 2
figure 2

Recovery Record for Apple Watch screen examples. Image Action Buttons include icons created and owned by Recovery Record, Inc. (J. Tregarthen, author, CEO). Image Distractions features Relaxed Corgi GIF uploaded by GIPHY, 27 June 2016, These images were made available for the purpose of this research per our subcontract agreement with Recovery Record, Inc. (J. Tregarthen, Principal Investigator and author). The image titled Distractions features Relaxed Corgi GIF uploaded by GIPHY, 27 June. 2016, The GIF was accessed utilizing the Recovery Record, Inc. GIPHY account and made available under a license agreement between Recovery Record, Inc. and GIPHY

Continuous passive data collection

Apple watch

The number and timing of the steps (physical activity) as well as 5-min epoch heart rate are passively collected for each study participant using the Apple Watch and harvested by the Recovery Record app using Apple’s Application Program Interface (API). The Apple Watch activates the sensor approximately every 5 min to record heart rate based on 100 Hz using photo plethysmography. Built in signal processing algorithms are used to aggregate measurements to approximately 5-min intervals, a rate consistent with current methodological guidelines (e.g., Berntson [38]). To minimize data loss, these variables are uploaded to the Recovery Record server each time the Recovery Record app is opened on the iPhone while the Apple Watch is nearby, or at least once per day.

Biological sampling

Saliva sampling and genotyping

Saliva samples are collected with RUCDR Infinite Biologics saliva collection kits. GWAS profiling will be performed together with additional samples collected by the PGC-ED using the optimal platform at the time of genotyping, most likely a version of the Illumina Global Screening Array (GSA).

Fecal sampling and sequencing

In Phase 1, as recipients of a scientific in-kind grant from the now defunct company uBiome, we collected stool samples. uBiome comprehensively characterized the biogeography of the human microbiome using high-throughput sequencing of the microbial 16S rRNA gene and released all data to UNC for analysis. After their company dissolved in Oct 2019, all processing transferred to the Carroll Lab at UNC (I. Carroll, Director). In order to obtain high-resolution taxonomic and functional microbiome data, we will perform whole genome shotgun sequencing. Raw sequence data will be quality filtered and trimmed to remove bases with Phred quality scores less than 20. Downstream bioinformatics analysis will consist of: i) taxonomic composition; ii) functional composition; iii) alpha diversity (as measured by absolute numbers of sequence variants and the Shannon index of diversity) and (quantified by Bray Curtis and UniFrac metrics); and iv) computing descriptive statistics and identifying groups within the data, as well as performing statistical analyses between subgroups using additional metadata, where available [39]. Since the sequencing technology and bioinformatics tools are rapidly advancing, we will utilize the most suitable methods and tools available at the time of analysis.

Planned data analysis

Genomics and microbiota aims

We will combine BEGIN samples with other samples in the PGC-ED for meta-analysis. We will conduct cross-disorder analyses to identify loci that cut across diagnostic categories by leveraging existing high-quality results for anorexia nervosa, major depressive disorder, schizophrenia, bipolar disorder, and other psychiatric and metabolic phenotypes. We will use advanced methods [40] to compute SNP heritabilities and genetic correlations across psychiatric and metabolic traits. We will calculate metabolic and psychiatric trait & disorder polygenic scores (PGS) using PRSice, ( A leave-one-sample-out process will be carried out to calculate BN/BED PGSs. The calculated PGS will be the weighted numbers of risk alleles carried by each case and control. This aim will illuminate, from a fundamental perspective, the genetic architecture of BN/BED and its relation to other psychiatric disorders and metabolic conditions.

We will compare taxonomic composition and diversity of the gut microbiota for BEGIN participants, compare BN with BED, and both to a reference control panel. We will control for multiple covariates in all analyses (e.g., obesity).

We can now rapidly do GWAS on multiple phenotypes – e.g., GWAS for 22 K transcriptomic, 8 K proteomic, or 1 K metabolomic measures. (1) We will adapt and extend these methods to evaluate host genomic-microbiota interactions by conducting ~ 15 K GWAS for species-level microbial measures while controlling for multiple comparisons. (2) We will generate microbiome “modules”, clusters of species with high intra-group correlations and low inter-group correlations. We will then do a GWAS for these modules. (3) For all analyses, we will pay particular attention to the genomic regions highlighted in the prior literature (e.g., MHC, autoimmunity, gut barrier, inflammatory bowel disease). (4) We will utilize publicly available databases of summary statistics across a range of psychiatric, personality, metabolic, and physical activity phenotypes and employ both trait-specific polygenic scores (PGS) and multi-polygenic scores (MPS) to predict outcomes. We will use a novel MPS approach developed by collaborator Breen and colleagues [41], that exploits genetic correlations between the outcome trait and a multitude of traits by using the joint predictive power of multiple polygenic scores in one regression model. We will select relevant GWAS from a centralized repository of summary statistics to predict BN, BED, severity, treatment outcome. Using repeated cross-validation, we will train and validate the prediction models using elastic net regularized regression, which is a multiple regression model suited to deal with a large number of correlated predictors while preventing overfitting [42]. We will then add microbiota and phenotyping variables into the model to improve predictive accuracy.

Digital longitudinal phenotyping aims

Our dynamic systems approach capitalizes on a combination of the passive and active data collection to address all three of the longitudinal phenotyping aims. Each stable state can be thought of as having homeostatic properties that are reflected in associations between different levels of derivatives (i.e., change in value with respect to time). For example, the relationship between changes in heart rate from one moment to the next and values of heart rate at the previous moment characterize how heart rate fluctuates homeostatically about a “set point.” This set point represents the heart rate value to which the individual’s body returns when the person is at rest [43]. Not only does this association characterize the homeostatic heart rate value itself but also the rate of return to the set point when a person’s heart rate is perturbed (i.e., experiencing distress prior to a binge/purge episode, physical load creating during exercise, etc.). Higher order derivatives and accounting for more variables simultaneously allows for testing more complex homeostatic patterns (e.g., cycles), while including this concept of rate of return to set point (i.e., systemic stability).

Aim 2 will be tested by first depicting the dynamics that lead up to a binge or purge event in a multilevel model. Aim 3 will be addressed by depicting the dynamics once a binge or purge event has occurred. In this case, analyses will focus on the 2 h after binge/purge events (but not within an hour of a future event), again modeling changes in heart rate and steps as a function of current levels in heart rate and steps. Aim 4 will require depicting each instance in time in terms of risk for being in one of the temporal states associated with subsequent binge eating and/or purging. To do so, we will utilize the posterior probabilities from a latent mixture model where each pattern is differentiated by associations amongst different levels of change. Mixture modeling is a taxonomic approach where timepoints within and between individuals can be grouped together as a function of a model. In this case, the model will differentiate groups of data as a function of the dynamic properties.

To help ensure reproducibility, the sample will be split in half with each half used as confirmation on the other half generating competing models. Under large data circumstances such as these, rather than power, the primary concern is a combination of overfitting and gaining a proper gauge of an effect. Generating competing models allows each of the samples to function as confirmation of the other with the better fitting model on both samples providing the more generalizable solution.


As a multi-pronged investigation, BEGIN will have broad impact across various dimensions in the eating disorders field. First, in the biological domain, BEGIN will allow us to identify genetic and gut microbiota contributors to disorder risk and maintenance and identify genomic, enteric microbes, and behavioral predictors of outcome. Second, in the behavioral domain, BEGIN will allow us to build algorithms that predict behavioral events (e.g., impending binges or purges) to enable real-time intervention via wearable technology.

We intend this to be a transformative study in the field of eating disorders. Through deep longitudinal phenotyping via the Apple Watch, we designed BEGIN to rapidly accelerate progress toward personalized precision medicine for BN and BED. Advances in eating disorders treatment have been slow and incremental. In our Agency for Healthcare Quality and Research review of treatments for BED [44], we noted that the evidence base was challenged by small samples and single studies introducing small variations on core therapeutic approaches with little or no additive efficacy. Wearable sensors, such as the Apple Watch with the adapted Recovery Record app, offer us the opportunity to develop a transformative improvement in BN and BED treatment. BN and BED are model disorders with discrete and measurable pathognomonic unhealthy behaviors. By applying dynamical systems models to the passive and active data that we collect, we will bypass historical one-size-fits-all CBT interventions for BN and BED and immediately enter the era of personalized interventions for eating disorders. Not only will we be able to build models that predict binge and purge episodes within the acute phase of the illness, but personalized extensions of these models will allow us to identify and alert individuals to impending slips and relapses after recovery.

Although traditional CBT interventions that rely on in-session retrospective recall will never be entirely obsolete, we expect that the just-in-time approach afforded by our dynamical systems models will render ours a central feature in the future treatment of BN/BED. Results from BEGIN will set the stage for subsequent studies in which we will have achieved the ability to discriminate across types of events (e.g., exercise, meal, binge, purge) that will allow us to build in accurate push notifications when an individual’s passive and active data signal an impending binge or purge—truly tailoring treatment and delivering it in-the-moment. Moreover, Recovery Record already has a clinician interface, and we predict that we will be able to incorporate our models into provider interfaces such that clinicians will be able to view and interact with the alerts that emerge, thus supporting the provision of data-informed care.

Ultimately, we foresee that this study will advance both cognitive-behavioral approaches to understanding and treating eating disorders and dynamical systems theory of behavior change to incorporate both intensive longitudinal behavioral and physiological data. Although our focus is on eating disorders, we intend our models to be readily adaptable for other psychiatric (and somatic) conditions that have identifiable measurable indices in order to usher us more rapidly toward individualized interventions that attend to the psychology, the biology, and the dynamic environment of the individual.

Availability of data and materials

Our liberal data and analysis sharing principles will make genomic, microbiota, and phenotypic data and scripts widely available for access by other scientists to maximize utility of our investigation.

The datasets generated and/or analyzed during the current study will be available in the National Data Archive ( and on Open Science Framework ( DOI

DNA samples will be available from the NIMH Repository and Genomics Resource (



Attention deficit hyperactivity disorder


Autonomic nervous system


ADHD Self-Report Scale


Application programming interface


Binge-eating disorder


Binge Eating Genetics Initiative


Bulimia nervosa


Cognitive-behavioral therapy


Deoxyribonucleic acid


Diagnostic and Statistical Manual of Mental Disorders 4th Edition

DSM 5:

Diagnostic and Statistical Manual of Mental Disorders 5th Edition


Eating Disorders 100,000 Questionnaire


Eating Disorders Examination-Questionnaire


Generalized Anxiety Disorder-7


Global Screening Array


Genome-wide association study


Health Insurance Portability and Accountability Act


Multi-polygenic score


Eating Disorders Working Group of the Psychiatric Genomics Consortium


Polygenic score


Patient Health Questonnaire-9


Rutgers University Cell and DNA Repository


Single nucleotide polymorphism


University of North Carolina


  1. Hudson JI, Hiripi E, Pope HG Jr, Kessler RC. The prevalence and correlates of eating disorders in the National Comorbidity Survey Replication. Biol Psychiatry. 2007;61:348–58.

    Article  PubMed  Google Scholar 

  2. Hudson J, Lalonde J, Pindyck L, Bulik C, Crow S, McElroy S, et al. Familial aggregation of binge-eating disorder. Arch Gen Psychiatry. 2006;63:313–9.

    Article  PubMed  Google Scholar 

  3. Javaras KN, Laird NM, Reichborn-Kjennerud T, Bulik CM, Pope HG Jr, Hudson JI. Familiality and heritability of binge eating disorder: results of a case-control family study and a twin study. Int J Eat Disord. 2008;41(2):174–9.

    Article  PubMed  Google Scholar 

  4. Root TL, Thornton LM, Lindroos AK, Stunkard AJ, Lichtenstein P, Pedersen NL, et al. Shared and unique genetic and environmental influences on binge eating and night eating: a Swedish twin study. Eat Behav. 2010;11(2):92–8.

    Article  PubMed  Google Scholar 

  5. Reichborn-Kjennerud T, Bulik C, Tambs K, Harris J. Genetic and environmental influences on binge eating in the absence of compensatory behaviours: a population-based twin study. Int J Eat Disord. 2004;36:307–14.

    Article  PubMed  Google Scholar 

  6. Mitchell KS, Neale MC, Bulik CM, Aggen SH, Kendler KS, Mazzeo SE. Binge eating disorder: a symptom-level investigation of genetic and environmental influences on liability. Psychol Med. 2010;40(11):1899–906.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Bulik C, Sullivan P, Kendler K. Heritability of binge-eating and broadly defined bulimia nervosa. Biol Psychiatry. 1998;44(12):1210–8.

    Article  CAS  PubMed  Google Scholar 

  8. Welch E, Jangmo A, Thornton L, Herman B, Pawaskar M, Larsson H, et al. Treatment-seeking patients with binge-eating disorder in the Swedish national registers: clinical course and psychiatric comorbidity. BMC Psychiatry. 2016;16:163.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Thornton LM, Watson HJ, Jangmo A, Welch E, Wiklund C, von Hausswolff-Juhlin Y, et al. Binge-eating disorder in the Swedish national registers: somatic comorbidity. Int J Eat Disord. 2017;50(1):58–65.

    Article  PubMed  Google Scholar 

  10. Watson HJ, Jangmo A, Smith T, Thornton LM, von Hausswolff-Juhlin Y, Madhoo M, et al. A register-based case-control study of health care utilization and costs in binge-eating disorder. J Psychosom Res. 2018;108:47–53.

    Article  PubMed  Google Scholar 

  11. Watson HJ, Jangmo A, Munn-Chernoff MA, Thornton LM, Welch E, Wiklund C, et al. A register-based case-control study of prescription medication utilization in binge-eating disorder. Prim Care Companion CNS Disord. 2016;18(4).

  12. Pisetsky EM, Thornton LM, Lichtenstein P, Pedersen NL, Bulik CM. Suicide attempts in women with eating disorders. J Abnorm Psychol. 2013;122(4):1042–56.

    Article  PubMed  Google Scholar 

  13. Duncan L, Yilmaz Z, Gaspar H, Walters R, Goldstein J, Anttila V, et al. Significant locus and metabolic genetic correlations revealed in genome-wide association study of anorexia nervosa. Am J Psychiatry. 2017;174:850–8.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Watson HJ, Yilmaz Z, Thornton LM, Hübel C, Coleman JR, Gaspar HA, et al. Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa. Nat Genet. 2019;51:1207–14.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Blanton LV, Charbonneau MR, Salih T, Barratt MJ, Venkatesh S, Ilkaveya O, et al. Gut bacteria that prevent growth impairments transmitted by microbiota from malnourished children. Science. 2016;351(6275):aad3311.

    Article  PubMed  CAS  Google Scholar 

  16. Fouladi F, Brooks AE, Fodor AA, Carroll IM, Bulik-Sullivan EC, Tsilimigras MC, et al. The role of the gut microbiota in sustained weight loss following roux-en-Y gastric bypass surgery. Obesity Surg. 2019;29(4):1259–67.

    Article  Google Scholar 

  17. Sharon G, Cruz NJ, Kang D-W, Gandal MJ, Wang B, Kim Y-M, et al. Human gut microbiota from autism spectrum disorder promote behavioral symptoms in mice. Cell. 2019;177(6):1600–18 e17.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Armougom F, Henry M, Vialettes B, Raccah D, Raoult D. Monitoring bacterial community of human gut microbiota reveals an increase in lactobacillus in obese patients and methanogens in anorexic patients. PLoS One. 2009;4(9):e7125.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  19. Hanachi M, Manichanh C, Schoenenberger A, Pascal V, Levenez F, Cournède N, et al. Altered host-gut microbes symbiosis in severely malnourished anorexia nervosa (AN) patients undergoing enteral nutrition: an explicative factor of functional intestinal disorders? Clin Nutr. 2019;38(5):2304–10.

    Article  PubMed  Google Scholar 

  20. Kleiman SC, Watson HJ, Bulik-Sullivan EC, Huh EY, Tarantino LM, Bulik CM, et al. The intestinal microbiota in acute anorexia nervosa and during renourishment: relationship to depression, anxiety, and eating disorder psychopathology. Psychosom Med. 2015;77:969.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Kleiman SC, Glenny EM, Bulik-Sullivan EC, Huh EY, Tsilimigras MCB, Fodor AA, et al. Daily changes in composition and diversity of the intestinal microbiota in patients with anorexia nervosa: a series of three cases. Eur Eat Disord Rev. 2017;25(5):423–7.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Mack I, Cuntz U, Gramer C, Niedermaier S, Pohl C, Schwiertz A, et al. Weight gain in anorexia nervosa does not ameliorate the faecal microbiota, branched chain fatty acid profiles, and gastrointestinal complaints. Sci Rep. 2016;6:26752.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Morkl S, Lackner S, Muller W, Gorkiewicz G, Kashofer K, Oberascher A, et al. Gut microbiota and body composition in anorexia nervosa inpatients in comparison to athletes, overweight, obese, and normal weight controls. Int J Eat Disord. 2017;50(12):1421–31.

    Article  PubMed  Google Scholar 

  24. Morita C, Tsuji H, Hata T, Gondo M, Takakura S, Kawai K, et al. Gut dysbiosis in patients with anorexia nervosa. PLoS One. 2015;10(12):e0145274.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  25. Pfleiderer A, Lagier JC, Armougom F, Robert C, Vialettes B, Raoult D. Culturomics identified 11 new bacterial species from a single anorexia nervosa stool sample. Eur J Clin Microbiol Infect Dis. 2013;32(11):1471–81.

    Article  CAS  PubMed  Google Scholar 

  26. Tregarthen JP, Lock J, Darcy AM. Development of a smartphone application for eating disorder self-monitoring. Int J Eat Disord. 2015;48(7):972–82.

    Article  PubMed  Google Scholar 

  27. American PA. Diagnostic and statistical manual of mental disorders (5th ed.). Arlington: American Psychiatric Publishing; 2013.

    Google Scholar 

  28. Deboeck P, Preacher K. No need to be discrete: a method for continuous time mediation analysis. Struct Equ Modeling. 2016;23:61–75.

    Article  Google Scholar 

  29. Beltz AM, Wright AG, Sprague BN, Molenaar PC. Bridging the nomothetic and idiographic approaches to the analysis of clinical data. Assessment. 2016;23(4):447–58.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Thornton L, Munn-Chernoff M, Baker J, Juréus A, Parker R, Henders A, et al. The anorexia nervosa genetics initiative (ANGI): overview and methods. Contemp Clin Trials. 2018;74:61–9.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Fairburn CG, Beglin SJ. Assessment of eating disorders: interview or self-report questionnaire? Int J Eat Disord. 1994;16(4):363–70.

    CAS  PubMed  Google Scholar 

  32. Kroencke K, Spitzer R, Williams J. The PHQ-9: validity of a brief depression severity measure [electronic version]. J Gen Int Med. 2001;16(9):606–13.

    Article  Google Scholar 

  33. Lowe B, Decker O, Muller S, Brahler E, Schellberg D, Herzog W, et al. Validation and standardization of the generalized anxiety disorder screener (GAD-7) in the general population. Med Care. 2008;46(3):266–74.

    Article  PubMed  Google Scholar 

  34. Adler L, Kessler RC, Spencer T. Adult ADHD self-report scale-v1. 1 (ASRS-v1. 1) symptom checklist. New York: World Health Organization; 2003.

    Google Scholar 

  35. Adler LA, Spencer T, Faraone SV, Kessler RC, Howes MJ, Biederman J, et al. Validity of pilot adult ADHD self-report scale (ASRS) to rate adult ADHD symptoms. Ann Clin Psychiatry. 2006;18(3):145–8.

    Article  PubMed  Google Scholar 

  36. Drossman DA, Dumitrascu DL. Rome III: New standard for functional gastrointestinal disorders. J Gastrointestin Liver Dis. 2006;15(3):237.

    PubMed  Google Scholar 

  37. Wonderlich JA, Lavender JM, Wonderlich SA, Peterson CB, Crow SJ, Engel SG, et al. Examining convergence of retrospective and ecological momentary assessment measures of negative affect and eating disorder behaviors. Int J Eat Disord. 2015;48(3):305–11.

    Article  PubMed  Google Scholar 

  38. Berntson GG, Bigger JT Jr, Eckberg DL, Grossman P, Kaufmann PG, Malik M, et al. Heart rate variability: origins, methods, and interpretive caveats. Psychophysiol. 1997;34(6):623–48.

    Article  CAS  Google Scholar 

  39. Österlund T, Jonsson V, Kristiansson E. HirBin: high-resolution identification of differentially abundant functions in metagenomes. BMC Genomics. 2017;18(1):316.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  40. Bulik-Sullivan BK, Loh PR, Finucane HK, Ripke S, Yang J. Schizophrenia working Group of the Psychiatric Genomics C, et al. LD score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015;47:291–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Krapohl E, Patel H, Newhouse S, Curtis CJ, von Stumm S, Dale PS, et al. Multi-polygenic score approach to trait prediction. Mol Psychiatry. 2018;23(5):1368–74.

    Article  CAS  PubMed  Google Scholar 

  42. Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Stat Soc Ser B Stat Methodol. 2005;67:301–20.

    Article  Google Scholar 

  43. Butner JE, Gagnon KT, Geuss MN, Lessard DA, Story TN. Utilizing topology to generate and test theories of change. Psychol Methods. 2015;20(1):1–25.

    Article  PubMed  Google Scholar 

  44. Brownley K, Berkman N, Peat C, Lohr K, Cullen K, Bulik C. Binge-eating disorder in adults: a systematic review and meta-analysis. Ann Int Med. 2016;165:409–20.

    Article  PubMed  Google Scholar 

Download references


The authors wish to thank Apple, Inc. for providing Apple Watches to the University of North Carolina at Chapel Hill.


Foundation of Hope, Raleigh North Carolina (Bulik, PI); National Eating Disorders Association (Bulik and Tregarthen, PIs); Brain and Behavior Research Foundation (BBRF: NARSAD Distinguished Investigator Grant; Bulik, PI); National Institute of Mental Health (NIMH: R01MH119084, Bulik/Butner, MPIs; U01 MH109528, Sullivan PI, Bulik Co-I), uBiome (services grant, Bulik, PI). No funding bodies were involved in the design of the study and collection, analysis, interpretation of data, or writing the manuscript. The NIMH and BBRF peer reviewed the study protocol. Open access funding provided by Karolinska Institute

Author information

Authors and Affiliations



Conception/design of the work: LMT, TS, JT, JEB, IC, BRB, PRDB, CMB. Data acquisition/analysis: REF, LMT, JT, JEB, IC, BRB, PRDB, CMB. Creation of software: JT. Drafted/revised the work: REF, LMT, JEB, CMB. Approved submitted version: REF, LMT, TS, HM, JT, JEB, IC, BRB, PRDB, CMB. Agreed both to be personally accountable for the author’s own contributions and to ensure that questions related to the accuracy or integrity of any part of the work, even ones in which the author was not personally involved, are appropriately investigated, resolved, and the resolution documented in the literature: REF, LMT, TS, HM, JT, JEB, IC, BRB, PRDB, CMB.

Corresponding author

Correspondence to Cynthia M. Bulik.

Ethics declarations

Ethics approval and consent to participate

BEGIN was approved by the University of North Carolina Biomedical Institutional Review Board (IRB) Protocol # 17–0242. All participants provided informed written online consent to participate. The electronic consent process was approved by the IRB.

Consent for publication

Not applicable.

Competing interests

C.M. Bulik reports: Shire (grant recipient, Scientific Advisory Board member); Idorsia (consultant); Pearson (author, royalty recipient). I. Carroll has previously served as consultants for Salix Pharmaceuticals. J. Tregarthen reports: Recovery Record (shareholder, employee).

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bulik, C.M., Butner, J.E., Tregarthen, J. et al. The Binge Eating Genetics Initiative (BEGIN): study protocol. BMC Psychiatry 20, 307 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: