Predicting inpatient violence using an extended version of the Brøset-Violence-Checklist: instrument development and clinical application
© Abderhalden et al; licensee BioMed Central Ltd. 2006
Received: 26 December 2005
Accepted: 25 April 2006
Published: 25 April 2006
Patient aggression is a common problem in acute psychiatric wards and calls for preventive measures. The timely use of preventive measures presupposes a preceded risk assessment. The Norwegian Brøset-Violence-Checklist (BVC) is one of the few instruments suited for short-time prediction of violence of psychiatric inpatients in routine care. Aims of our study were to improve the accuracy of the short-term prediction of violence in acute inpatient settings by combining the Brøset-Violence-Checklist (BVC) with an overall subjective clinical risk-assessment and to test the application of the combined measure in daily practice.
We conducted a prospective cohort study with two samples of newly admitted psychiatric patients for instrument development (219 patients) and clinical application (300 patients). Risk of physical attacks was assessed by combining the 6-item BVC and a 6-point score derived from a Visual Analog Scale. Incidents were registered with the Staff Observation of Aggression Scale-Revised SOAS-R. Test accuracy was described as the area under the receiver operating characteristic curve (AUCROC).
The AUCROC of the new VAS-complemented BVC-version (BVC-VAS) was 0.95 in and 0.89 in the derivation and validation study respectively.
The BVC-VAS is an easy to use and accurate instrument for systematic short-term prediction of violent attacks in acute psychiatric wards. The inclusion of the VAS-derived data did not change the accuracy of the original BVC.
Patient aggression is a common problem in acute psychiatric wards. Violent outbursts threaten the health, safety and well-being of other patients and staff. Psychiatric nurses are at particularly high risk of being victimized. However, psychiatric staff is not only a passive target of potential patient violence. Violence management is a key component of clinical practice, and psychiatric staff performs a wide range of interventions to modulate the context and the interaction with potentially violent patients. Preventive measures are of special importance. The timely use of preventive measures presupposes a preceded risk assessment. Therefore, accurate risk prediction to allow targeted interventions is of paramount importance .
Several attempts have been made to introduce accurate measures for risk prediction . Generally spoken fall into two categories: actuarial methods and prediction models derived from acute patient observation [2–4]. Actuarial models predict risk from the presence of statistically derived risk factors like age, gender, psychopathological state, diagnosis etc. Most studies using this method found that patients who had exhibited violent behavior in the past were substantially more likely to become aggressive during a new hospitalization than those with no history of aggressive behavior [5, 6]. The main criticisms advanced towards actuarial methods is a) that they discard the experience of the staff currently dealing with the patient, b) that they perform less well in non-forensic or acute settings [5, 7] and c) that they require the collection of data that may not be readily available in acutely admitted patients [1, 8].
Clinical prediction models based on acute patient observation use different approaches, considering factors as e.g. psychopathological states. One approach is based on overt patient behavior. A recently published method is the Brøset Violence Checklist (BVC), which has been validated in Norwegian and German [9–11]. The BVC assesses the presence of six observable patient behaviors namely whether the patient is confused, irritable, boisterous, verbally threatening, physically threatening, and attacking objects. The reported discriminatory ability is good with a correct prediction rate around 85% . Another clinical model emphasizes the staff's ability to judge the risk by integrating all available information into a formal subjective risk prediction statement. This subjective prediction is operationalized by likert-type scales or Visual Analogue Scales [12–16]. Investigators applying this approach found correct prediction rates of 75% . The limitation to either approach is a considerable residual risk of false positives.
Aim of the study
The aim of the present study was to ascertain whether combining both methods would yield improved risk prediction over either method alone. The study comprised two independent patient samples from different hospitals. The first patient sample served as a derivation dataset to identify the optimal algorithm for combining the BVC and the subjective prediction. The second patient sample served as the validation dataset, in which the prediction method was applied to clinical practice.
Two independent prospective cohort studies were conducted. The first served to develop the risk assessment instrument (derivation sample). The second patient sample tested the clinical application of the method (validation sample).
The study protocols were reviewed and approved by the research ethics boards of the Cantons Zurich (E-016/2001), Appenzell AR (10/01) and Berne (24.12.2001/IH/Hz/EW).
Setting and sample
Both studies were conducted in acute psychiatric wards in the German speaking part of Switzerland. All participating wards were closed admission wards providing comprehensive psychiatric service to the respective catchment areas. The first sample (derivation dataset) consisted of 219 consecutively admitted patients to six wards within three hospitals during a two-moth period. The number of beds in each ward ranged from 15 to 19. The second sample (validation dataset) consisted of 300 consecutively admitted patients to two wards during a six-month period. These two 12 bed wards were situated in two different hospitals in different cantons (one rural area, one urban area) to assure independence from the derivation dataset.
During instrument development psychiatric nurses responsible for the care of the patient provided an assessment during admission and twice daily (10 a.m. and 6 p.m.) at admission day and during the next three days or until discharge/transferral. Therefore, the maximum number of ratings per patients was 9 in the case of an admission time earlier than the regular rating at 11 a.m. Lower numbers of ratings resulted from missing items and when patients were discharged from the ward prior to the third day after hospitalization. Assessment forms contained the German research version of the BVC and a Visual Analogue Scale (VAS) of 10 cm length. Nurses were asked to indicate the presence or absence of the six behaviors constituting the BVC. In addition, nurses encoded their subjective perception of risk for a physical attack within the next 12 hours on the VAS. The endpoints of the VAS were marked as "no risk" and "very high risk". The data collection form was also used to gather information about any preventive measures taken since the last rating. No clues were provided about the interpretation of the BVC or the VAS. From these data, the final instrument (BVC-VAS) was developed as described in the statistical analysis section. The objective of this instrument to be developed was to integrate the findings from the BVC and the Visual Analogue Scale to a summary score. Crafting an instrument that would be compatible with routine use required graphic refinement of the BVC as well as a simple method to translate VAS-readings into scoring points. The latter was achieved by constructing a slide rule that resembled the VAS on the front side and provided the VAS score reading on the backside. The final instrument was pre-tested in a different ward before application in the validation study.
The new instrument (BVC-VAS) was integrated into clinical routine in two admission wards in two hospitals. To test the instrument during practical application, staff was aware about the interpretation of the obtained scores. Like in the derivation sample, nurses assessed the risk of newly admitted patients on the day of admission and the following three days twice daily.
The main outcome measure was the occurrence of physical attacks on persons during the next shift following assessment. The severity of the aggressive event was recorded using the Staff Observation of Aggression Scale Revised (SOAS-R) [17–19]. Test accuracy was described as the area under the receiver operating characteristic curve . A secondary outcome was the implementation of intense preventive measures such as seclusion or forced injection of psychotropic drugs. While this outcome may not be regarded as independent from the prediction, it allows the evaluation of false positive cases, i. e. to examine whether patients were unable to perpetrate violent attacks because of intense preventive measures. Thus, some of the false positive predictions may in fact be a consequence of effective prevention [13, 21].
The overall aim of the development of the BVC-VAS was to arrive at a simple number scoring system with presentation of risk as natural frequencies (e.g. 1 out of 10 patients with this score will attack). Such presentation of results is believed to provide a better framework to base actions than simple categorization as low or high risk. The statistical analysis consisted of two steps: First, an optimized prediction score was derived from the derivation dataset with the aim to provide four distinct risk strata: high, moderate risk, low risk and very low risk. Second, the application of the scoring system was tested under realistic conditions in a validation sample.
During derivation we employed independent logistic regression analyses with attack, aggression and coercive measures as the binary outcome variable. To account for possible non-linear relation between risk and individual BVC items, we performed additional analyses by entering each item as individual variable and by recoding numbers of BVC items into dummy variables. Second, we explored the relation between the VAS-distance measured in mm and the occurrence of physical attacks by independent logistic regression analyses. Within the constraints from the small dataset, these analyses did not suggest superiority of the single coding of symptoms over the simpler adding of symptoms. Next, several transformations of the raw VAS score (logarithmic, quadratic) were carried out, of which the logarithmic transformation yielded the highest discriminatory power. Because replacing the log-transformed VAS with the scoring points did not alter the predictive accuracy, we proceeded to adding the BVC and the VAS to a common summary score. We checked which combination of BVC scores and VAS scores would yield the best performing model, by testing different weights of the two scores. However, due to the small number of observed events, logistic regression analyses failed to ascertain with statistical significance whether non-balanced weighing would have yielded improved diagnostic performance over giving equal weights to the subjective assessment and the BVC. Therefore, we proceeded with equal weights. Thus, the final scale consisted of 12 score points, of which up to 6 were contributed from the BVC and up to 6 from a logarithmic transformation of the VAS. Finally, we calculated multilevel likelihood ratios for ranges of the revised BVC score, to be able to enumerate risk rather than expressing risk with ambiguous wordings. For practicability we chose four risk segments, corresponding to very low risk, low risk, moderate risk and high risk. In the validation dataset we elucidated the discriminatory performance of the total score and each subscore by independent analyses with the respective outcomes (attack/attack or intense preventive measures) as the outcome variables. To compare models, we used the area under the receiver operating characteristic curve (AUC-ROC). The AUC-ROC is determined from plotting sensitivity against 1-specificity for all possible cut-offs, in case of the combined BVC-AUC score for values ranging from 0 to 12. An area of 1 indicates a perfect prediction; an area of 0.5 is a chance result. Few clinical scores achieve AUCs ranging above 0.75, tests with an AUC of 0.95 are considered excellent . Analyses were carried out in SPSS version 10 (SPSS inc, Chicago, Illinois) for obtaining confidence intervals for area under the receiver operating characteristic curves and in SAS (version 8.2, SAS institute, Cary, North Carolina) for model development.
Age (mean± SD)
p = 0.002*
ICD-10 F1 (Alcohol and drug use disorders)
ICD-10 F2 (Schizophrenic or delusional disorder)
ICD-10 F3 (Affective disorder)
ICD-10 F4/6 (Neurotic, stress-related or somatoform disorder/personality disorder)
13.6 % %
p = 0.042**
ICD-10 (Other diagnoses)
Median length of stay (days)
Patients involved in ≥ 1 attack
10 (4.6 %)
27 (9.0 %)
Severity of attacks (SOAS-R) (mean; range)
12.7 (5 – 18)
13.4 (4 – 20)
Intense preventive measures
Patients with ≥ 1≥ intense preventive measure
28 (12.8 %)
41 (13.7 %)
Instrument development phase
Transformation of VAS-data into 6-point-scale
1 – 5
6 – 10
11 – 20
21 – 40
41 – 80
81 – 100
Interpretation of the extended version of the BVC, obtained from the derivation data-set
Odds ratio (95% CI) validation sample
0 – 3 Points
1.0 (reference group)
Very low risk (< 1 of 300 patients will attack a person)
4 – 6 Points
Low risk (about 1 out of 100 patients will attack a person)
7 – 9 Points
Moderate risk (about 1 out of 10 patients will attack a person)
High risk (about 1 out of 4 to 5 patients risk will attack a person)
Because we were interested in testing the performance of the instrument in routine application we provided recommendations in addition to the risk enumeration. The scoring form suggested discussing the risk within the nursing team for patients scoring between 7 and 9 (moderate risk) and to consider the implementation of preventive measures from a list provided with the instrument. A score of 10 or more (high risk) constituted the obligation to discuss the risk AND to plan and implement preventive measures from the same list of possible preventive measures (see appendix).
Summary of the areas under the receiver operating characteristic curves (95% Confidence interval)
Physical attack within next shift
Physical attack OR intense preventive measure within next shift
Ratings and accuracy of predictions
Outcome within the next shift
physical attack (n = 14)
physical attack (n = 37)
physical attack OR intense preventive measure ** (n = 121)
Prediction method and cut-off points
BVC >= 3
BVC-VAS >= 7
VAS >= 4
BVC >= 3
BVC-VAS >= 7
VAS >= 4
BVC >= 3
BVC-VAS >= 7
VAS >= 4
Sensitivity/Specificity in %
In the validation dataset, the Spearman's Rho correlation coefficient between the VAS and the BVC was r = 0.59, as compared to r = 0.50 in the derivation dataset.
The first aim of the study was to develop an extended version of the Brøset-Violence-Checklist that includes both the structured clinical assessment of observable patient behavior as well as the unaided subjective clinical assessment of psychiatric nurses on the patient's risk of perpetrating a violent attack. The second aim of the study was to test the instruments test accuracy and application in clinical practice. To this end, we conducted a prospective cohort study involving separate samples for instrument development (derivation sample) and clinical application (validation sample). The main findings of the study were that the visual analogue scale slightly improved the diagnostic accuracy in the derivation dataset (where no interpretation was provided), but that this effect was not retained in the validation dataset (where interpretation of the score was available). In the validation dataset the test accuracy of the VAS was significantly lower than in the derivation dataset. In contrast, the performance of the BVC was identical in both samples.
What are the clinical implications of these findings? The original BVC checklist proved to be remarkably stable in the independent dataset. Apparently, the BVC checklist combines the virtues of a structured clinical method by inquiring about specific patient behaviors. While it is still left to the discretion of the rater to decide, whether a specific behavior is actually present or not (e.g. being boisterous). Such subjective decisions may be more reliable than the subjective overall assessment provided in a Visual Analogue Scale. Moreover, we cannot rule out that providing the interpretation of the score affected the ratings. Of the two assessment methods, the BVC score is closer to resembling the practice of actuarial scores. The replication of almost identical test accuracy to the original Norwegian study in two independent samples underscores the possible generalizability of the instrument. Notwithstanding these encouraging findings, a relevant issue remains the limited positive predictive value in our settings with a low prevalence of physical attacks. This underscores the need for cautious interpretation of positive results and reporting of multilevel likelihood ratios. The satisfactory test accuracy (AUCROC = 0.90) of the combined instrument when using the composite endpoint emphasizes the applicability in daily routine. Our data do not support the presumption that the test accuracy improved to a relevant extent by including the subjective element of the visual analogue scale. We hesitate to recommend to solely using the VAS, for three reasons: First, in the derivation dataset nurses were unaware of the interpretation of the VAS rating and its clinical implication. A significantly lower test accuracy of the VAS was observed in the validation dataset, were scoring mattered – suggesting possible assessment biases. Second, a checklist of observable behaviour is not only helpful for less experienced staff, but also facilitates communication. Third, the VAS-results has to be regarded as product of the hidden process of clinical reasoning (black-box). However, the nurses' feedback on the user friendliness of the combined instrument as compared to our previous experience when using the BVC alone suggested an increased compliance and acceptance of the instrument. Therefore, we have opted for using the combined instrument in the ongoing randomized controlled trial evaluating the efficacy of systematic prediction on occurrence rates of violent attacks and intense coercive measures.
Several caveats of the study must be acknowledged. A purist approach to the validation study would have mandated employing exactly the same presentation and forms as used for the derivation set in the validation dataset. Instead we skipped this step and moved directly to the clinical application of a practicable and user-friendly form along with recommendations as to the consequences of the ratings to be considered. This design feature inhibits clearer delineation, whether the observed differences in the VAS performance were due to the different sample, differing professional experience amongst staff, the alteration of the design (scale versus ruler), the immediate feedback of the result on the score or the provided recommendations. A related problem is the lack of information on the factors considered by the nurses when rating the VAS. A second limitation is the small number of events that prevented the calculation of more elaborate statistical models accounting for other patient covariates such as diagnosis or demographic variables. We are currently addressing the first problem by means of a qualitative research project, in which nursing staff is interrogated about the thoughts and considerations leading to a specific subjective risk assessment. This project will reveal whether subjective risk assessment is actually incorporating actuarial data such as knowledge about prior patient behavior. Finally, providing an interpretation and suggestion for action with the score result partially violates the condition of independence between outcome and prediction. If only the occurrence of attacks is considered as an outcome event, cases of attacks prevented by interventions initiated as a consequence of the rating may inflate the false positive rate. In contrast, the composite outcome (attacks and interventions initiated following the rating) overestimates the true positive rate. It is reassuring that the area under the Receiver Operating Characteristic curve using either outcome definition differed only by a small margin (0.90 versus 0.86). It should also be noted that the performance of the VAS in the validation dataset was similar to that of earlier reports from other investigations .
In summary, we ascertained satisfactory performance of the BVC in an independent dataset where multilevel likelihood ratio based interpretations and action plans were provided. Adding a visual analogue scale for subjective risk assessment appeared to improve the compliance of the staff with systematic risk prediction but did not result in improved test accuracy in the validation dataset. The considerable difference in test performance for the visual analogue scale between the application within a research framework (derivation dataset) and the use in daily practice warrant further scrutiny. The combined instrument is currently been tested in a multi-center randomized controlled trial to assess the efficacy of systematic risk assessment. Until these data are available the recommendation for routine use cannot be extended from the BVC risk assessment to the combined BVC-VAS instrument. Finally, it should be born in mind that attacks are rare events. Even the use of the BVC-VAS may imply that about half of the attacks will not be properly predicted and that only about 1 in 10 of all patients classified as moderate or high risk would indeed have proceeded to commit an attack.
The BVC-VAS is an easy to use and accurate instrument for systematic short-term prediction of violent attacks in acute psychiatric wards. The inclusion of the VAS-derived data did not change the accuracy of the original BVC. Further research is needed on the factors considered by the nurses when rating the VAS and on the preventive efficacy of using the BVC-VAS.
No specific measures to prevent an attack
General conversation (directed to reduce aggression)
Walk outdoors 1:1 (directed to reduce aggression)
Walk outdoors in a group (directed to reduce aggression)
Reduction of demands (e.g. participation in activities)
Confrontation with ward rules
Discussion of risk with patient
Talk-down (to deescalate)
Transfer to intensive area within ward
1:1-observation for several hours
Increase of medication dosage
PRN-medication per os (psychotropic drugs)
Open isolation in the patients own room (time out)
Preventive seclusion (closed seclusion room)
Injection of psychotropic drugs (forced/voluntary)
Physical restraint (indicate nr. of points)
This study was supported by the European Violence in Psychiatry Research Group EViPRG and by the Swiss Academy of Medical Sciences. We also want to thank the staff of the participating wards for their patient and helpful collaboration in the collection of data.
- Allen J: Assessing and managing risk of violence in the mentally disordered. J Psychiatr Ment Health Nurs. 1997, 4: 369-378. 10.1046/j.1365-2850.1997.00068.x.View ArticlePubMed
- Rice ME, Harris GT, Quinsey VL: The Appraisal of Violence Risk. Curr Opin Psychiatry. 2002, 15: 589-593. 10.1097/00001504-200211000-00005.View Article
- Dolan M, Doyle M: Violence risk prediction. Clinical and actuarial measures and the role of the Psychopathy Checklist. Br J Psychiatry. 2000, 177: 303-311. 10.1192/bjp.177.4.303.View ArticlePubMed
- Doyle M, Dolan M: Violence risk assessment: combining actuarial and clinical information to structure clinical judgements for the formulation and management of risk. J Psychiatr Ment Health Nurs. 2002, 9: 649-657. 10.1046/j.1365-2850.2002.00535.x.View ArticlePubMed
- Steinert T: Prediction of inpatient violence. Acta Psychiatr Scand Suppl. 2002, 133-141. 10.1034/j.1600-0447.106.s412.29.x.
- Ruesch P, Miserez B, Hell D: Gibt es ein Täterprofil des aggressiven Psychiatriepatienten? [A risk profile of the aggressive psychiatric inpatient: can it be identified?]. Nervenarzt. 2003, 74: 259-265. 10.1007/s00115-003-1475-8.View ArticlePubMed
- Palmstierna T, Wistedt B: Risk factors for aggressive behaviour are of limited value in predicting the violent behaviour of acute involuntarily admitted patients. Acta Psychiatr Scand. 1990, 81 (2): 152-155.View ArticlePubMed
- Steadman HJ, Silver E, Monahan J, Appelbaum PS, Robbins PC, Mulvey EP, Grisso T, Roth LH, Banks S: A classification tree approach to the development of actuarial violence risk assessment tools. Law Hum Behav. 2000, 24: 83-100. 10.1023/A:1005478820425.View ArticlePubMed
- Abderhalden C, Needham I, Almvik R, Miserez B, Dassen T, Haug H, Fischer J: Predicting inpatient violence in acute psychiatric wards using the Brøset-Violence-Checklist: A multi-centre prospective cohort study. J Psychiatr Ment Health Nurs. 2004, 11: 422-427. 10.1111/j.1365-2850.2004.00733.x.View ArticlePubMed
- Almvik R, Woods P, Rasmussen K: The Broset Violence Checklist: Sensitivity, Specificity and Interrater Reliability. J Interpersonal Violence. 2000, 15: 1284-1296.View Article
- Almvik R, Woods P: Short-term risk prediction: the Broset Violence Checklist. J Psychiatr Ment Health Nurs. 2003, 10: 236-238.View ArticlePubMed
- McNiel D, Binder R: Correlates of accuracy in the assessment of psychiatric inpatients' risk of violence. Am J Psychiatry. 1995, 152: 901-906.View ArticlePubMed
- McNiel D: Clinical assessment of the risk of violence among psychiatric inpatients. Am J Psychiatry. 1991, 148: 1317-1321.View ArticlePubMed
- Nijman H, Merckelbach H, Evers C, Palmstierna T, a Campo J: Prediction of aggression on a locked psychiatric admissions ward. Acta Psychiatr Scand. 2002, 105: 390-395. 10.1034/j.1600-0447.2002.0o426.x.View ArticlePubMed
- Rabinowitz J, Garelik-Wyler R: Accuracy and confidence in clinical assessment of psychiatric inpatients risk of violence. Int J Law Psychiatry. 1999, 22: 99-106. 10.1016/S0160-2527(98)00032-6.View ArticlePubMed
- Haim R, Rabinowitz J, Lereya J, Fennig S: Predictions made by psychiatrists and psychiatric nurses of violence by patients. Psychiatr Serv. 2002, 53: 622-624. 10.1176/appi.ps.53.5.622.View ArticlePubMed
- Nijman HLI, Muris P, Merckelbach HLGJ, Palmstierna T, Wistedt B, Vos AM, van Rixtel A, Allertz WWF: The Staff Observation Aggression Scale – Revised (SOAS-R). Aggress Behav. 1999, 25: 197-209. 10.1002/(SICI)1098-2337(1999)25:3<197::AID-AB4>3.0.CO;2-C.View Article
- Nijman H, Palmstierna T: Measuring aggression with the staff observation aggression scale – revised. Acta Psychiatr Scand Suppl. 2002, 101-102. 10.1034/j.1600-0447.106.s412.21.x.
- Palmstierna T, Wistedt B: Staff observation aggression scale, SOAS: Presentation and evaluation. Acta Psychiatr Scand. 1987, 76: 657-663.View ArticlePubMed
- Swets JA: Measuring the accuracy of diagnostic systems. Science. 1988, 240: 1285-1293.View ArticlePubMed
- Werner PD, Rose TL, Yesavage JA: Reliability, accuracy, and decision-making strategy in clinical predictions of imminent dangerousness. J Consult Clin Psychol. 1983, 51: 815-825. 10.1037//0022-006X.51.6.815.View ArticlePubMed
- Szmukler G: Violence risk prediction in practice. Br J Psychiatry. 2001, 178: 84-85. 10.1192/bjp.178.1.84.View ArticlePubMed
- Bingley W: Assessing dangerousness: Protecting the interests of patients. Brit J Psychiatry. 1997, 170: 28-29.
- The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-244X/6/17/prepub