The severity of psychiatric disorders

The severity of psychiatric disorders

World Psychiatry. 2018 Oct; 17(3): 258–275.
Published online 2018 Sep 7. doi: 10.1002/wps.20569
PMCID: PMC6127765
PMID: 30192110

Abstract

The issue of the severity of psychiatric disorders has great clinical importance. For example, severity influences decisions about level of care, and affects decisions to seek government assistance due to psychiatric disability. Controversy exists as to the efficacy of antidepressants across the spectrum of depression severity, and whether patients with severe depression should be preferentially treated with medication rather than psychotherapy. Measures of severity are used to evaluate outcome in treatment studies and may be used as meaningful endpoints in clinical practice. But, what does it mean to say that someone has a severe illness? Does severity refer to the number of symptoms a patient is experiencing? To the intensity of the symptoms? To symptom frequency or persistence? To the impact of symptoms on functioning or on quality of life? To the likelihood of the illness resulting in permanent disability or death? Putting aside the issue of how severity should be operationalized, another consideration is whether severity should be conceptualized similarly for all illnesses or be disorder specific. In this paper, we examine how severity is characterized in research and contemporary psychiatric diagnostic systems, with a special focus on depression and personality disorders. Our review shows that the DSM‐5 has defined the severity of various disorders in different ways, and that researchers have adopted a myriad of ways of defining severity for both depression and personality disorders, although the severity of the former was predominantly defined according to scores on symptom rating scales, whereas the severity of the latter was often linked with impairments in functioning. Because the functional impact of symptom‐defined disorders depends on factors extrinsic to those disorders, such as self‐efficacy, resilience, coping ability, social support, cultural and social expectations, as well as the responsibilities related to one’s primary role function and the availability of others to assume those responsibilities, we argue that the severity of such disorders should be defined independently from functional impairment.

 

Keywords: Severity, psychiatric disorders, functional impairment, symptoms, depression, personality disorders, transdiagnostic models, HiTOP, DSM‐5, ICD‐10

The determination of illness severity has important clinical implications. Depending on the disorder, severity affects decisions to seek treatment, the type and intensity of treatment, and whether to continue or stop treatment. Severity also impacts expectations in the fulfillment of role function and disability status. Measures of severity are used to evaluate outcome in treatment studies and may be used as meaningful endpoints in clinical practice.

But, what does it mean to say that someone has a severe illness? Of the various dictionary definitions of “severe”, the one that is most relevant to the characterization of illness is “of great degree”. This definition, however, does not convey what is meant when an illness is considered “severe”. Does severity refer to the number of symptoms a patient is experiencing? To the intensity of the symptoms? To symptom frequency or persistence? To the impact of symptoms on functioning or quality of life? To the likelihood of the illness resulting in permanent disability or death?

Some of these questions about the meaning of severity can be further elaborated. For example, with regards to the prediction of mortality, does severity allude to imminent death, death in the near future, or death at any time in the future? Also, should the impact of intervention be considered? That is, is an illness severe only when death is likely if the illness is left untreated, or only if death is likely regardless of intervention?

Perhaps severity determinations should be independent of functional impact or prognosis and instead should be based on structural or morphological changes and damage to the diseased organ. To be sure, this is not relevant for many illnesses, but, when it can be measured, should this be the guiding principle for rating illness severity?

Putting aside the issue of how severity should be operationalized, another consideration is whether severity should be conceptualized similarly for all illnesses or be disorder specific. Should the severity of heart failure, rheumatoid arthritis, diabetes, an acute upper respiratory tract infection, and a headache be judged according to a common standard or metric, or should each disorder have its own respective guidelines for rating severity?

In this paper, we examine how severity is characterized in psychiatric research and contemporary psychiatric diagnostic systems. To illustrate some of the issues and controversies in determining the severity of psychiatric disorders, we focus on depression and personality disorders (PDs). The clinical significance of considering the severity of depression is reflected in official treatment guidelines wherein recommendations are based on illness severity1, 2. The importance of considering the severity of PDs is reflected by the ICD‐11 proposal to replace the specified criteria for different disorders by a single personality disorder category that is graded according to levels of severity3, 4.

Before discussing the issue of severity of psychiatric disorders, we present a brief overview of how severity has been conceptualized, assessed and measured for various physical illnesses, highlighting the variability of approaches.

SEVERITY OF PHYSICAL ILLNESSES

There is no consensus or uniform overriding principle in distinguishing between levels of severity of physical illnesses. In some cases, severity is defined by the degree of structural damage to the diseased organ. For example, the severity of rheumatoid arthritis has been defined according to radiographic evidence of joint damage5. The severity of diabetic retinopathy has been graded according to the degree of retinal damage assessed in a direct clinical eye exam6. In a related manner, physiological measures representing the impact of disease on the organ have been used to characterize the severity of some diseases. For example, left ventricular ejection fraction has been used as an index of the severity of cardiovascular disease7, 8, 9, 10. Forced expiratory volume has been used as index of severity of cystic fibrosis11. Aminotransferase and bilirubin levels have been used to assess the severity of hepatitis12.

Sometimes severity is defined by a disorder‐specific clinical examination. For example, not only have radiographic assessments been used to evaluate the severity of rheumatoid arthritis, but severity has additionally been defined according to a count of the number of swollen and painful joints13.

Illness severity has also been defined more broadly to encompass indices of the diseased organ as well as related and downstream effects. In a study of the prognostic implications of post‐cardiac arrest illness severity, severity scores were based on cardiopulmonary dysfunction and neurologic status14, 15. The severity of sickle cell disease has been based on the presence and frequency of complications such as renal failure, necrosis of hips and shoulders, and gallstones16. In studies of the severity of chronic obstructive pulmonary disease, the BODE index (B, body mass index; O, obstruction of airways as measured by forced expiratory volume in one second; D, dyspnea scale; E, exercise capacity as measured by a six‐minute walk test) includes and goes beyond a direct, specific, assessment of pulmonary damage and has been found to be a better predictor of mortality, hospitalization, quality of life, and depression than forced expiratory volume alone17. The Unified Parkinson’s Disease Rating Scale contains four subscales assessing mental state, activities of daily living, motor examination, and complications18, 19.

Moving further away from a direct or physiological assessment of the diseased organ, the New York Heart Association Functional Classification is a measure of cardiac disease severity based on limitations in physical activities and the presence of physical symptoms associated with varying degrees of activity20.

In contrast to disorder‐specific physical and physiological indicators of severity, there are composite measures of overall illness severity, such as the Acute Physiology and Chronic Health Evaluation (APACHE) scores and the Simplified Acute Physiology Score (SAPS), based on non‐specific clinical and biological indicators of health status such as body temperature, age, history of organ failure, electrolytes, and hematocrit21, 22. These illness severity measures have been used to predict mortality in heterogeneous and single disorder samples of acutely ill emergency department and hospitalized patients23, 24.

Finally, self‐report questionnaires have been developed to assess the severity of some physical illnesses. The severity of benign prostatic hypertrophy as assessed by the American Urological Association Symptom Index is based on the frequency of symptoms25. The Tinnitus Severity Index is based on the frequency of functional impairment or psychological symptoms due to tinnitus26. The Bowel Symptom Severity Scale assesses the frequency, distress and disability of symptoms associated with irritable bowel syndrome27. The severity of headaches as measured by the Headache Impact Questionnaire is a composite measure of headache frequency, the average pain intensity of headaches, and the impairment resulting from headaches28. The Liverpool Seizure Severity Scale assesses perceptions of seizure control and severity of ictal and postictal symptoms29.

Clark et al30 summarized the approach taken to develop self‐report measures of illness severity for six disease states studied in the Veterans Health Study. They defined illness severity in terms of patients’ perceptions of the magnitude of symptoms or complications of the illness that are associated with reductions in health‐related quality of life or health status. They distinguished disease severity from the impact of disease (e.g., impairment, life satisfaction, well‐being), because the impact of disease is often mediated by personal characteristics (e.g., resiliency, self‐efficacy) and social context.

SEVERITY OF PSYCHIATRIC DISORDERS AS DESCRIBED IN DSM‐5

In contrast to some physical illnesses, there are no specific or non‐specific biomarkers of psychiatric disorders that validly characterize the severity of the disorder. In the absence of such biological or structural indicators, researchers and clinicians are left to assess the epiphenomena of a psychiatric disorder to judge its severity.

Discussions of resource allocation in the public health sector often focus on patients with severe mental illness, though there is no consensus in how to define such an illness31, 32. The DSM‐533, like its immediate predecessors, defines severity for only some disorders. Table ​Table11 lists the DSM‐5 disorders with defined levels of severity.

Table 1

Characterization of disorder severity in DSM‐5

DSM‐5 disorder Features used to define severity
Major depressive disorder Number of symptoms, level of distress caused by intensity of symptoms, and impairment in social and occupational functioning
Mania, hypomania Same as major depressive disorder
Alcohol use disorder Number of criteria
Drug use disorder Number of criteria
Bulimia nervosa Frequency of compensatory behaviors per week
Anorexia nervosa Body mass index
Binge eating disorder Frequency of eating binges
Learning disorders Severity of deficit in learning skills and likelihood of learning the skills with or without intervention
Attention‐deficit/hyperactivity disorder Number of symptoms, severity of individual symptoms, or level of impairment caused by the symptoms
Intellectual disability Level of adaptive functioning
Autism spectrum disorder Degree of impairment in functioning due to deficits in verbal and nonverbal communication, inflexibility of behavior, difficulty coping with change, or restricted/repetitive behaviors
Stereotypic movement disorder The ease by which the symptoms can be suppressed and the need for intervention to prevent serious injury
Psychotic disorders Quantitative assessment on 5‐point scale of primary feature of the psychosis (delusions, hallucinations, disorganized speech, abnormal psychomotor behavior, and negative symptoms). Rating is based on symptom intensity or subjective distress due to symptom
Reactive attachment disorder Only the severe type is defined. Severe is defined as all criteria met at a high level
Disinhibited social engagement disorder Only the severe type is defined. Severe is defined as all criteria met at a high level
Somatic symptom disorder Number of criteria and somatic complaints
Psychological factors affecting other medical conditions Degree of impact on medical condition or medical risk
Hypersomnolence disorder Number of days per week with difficulty maintaining daytime alertness
Narcolepsy Frequency of cataplexy and responsiveness of cataplexy to medication, number of naps per day, degree of disturbance of nocturnal sleep
Obstructive sleep apnea/hypopnea Apnea/hypopnea index score
Nightmare disorder Frequency of nightmares per week
Sexual disorders Degree of distress related to symptoms
Premature ejaculation Time to ejaculation
Substance/medication‐induced sexual dysfunction Percentage of occasions of sexual activity that dysfunction occurs
Oppositional defiant disorder Number of settings in which the symptoms occur
Conduct disorder Number of conduct problems or the degree of harm caused to others
Neurocognitive disorders Degree of difficulty with instrumental activities of daily living

The DSM‐5 approach towards defining severity varies across disorders. The four severity levels of intellectual disability (mild, moderate, severe, profound) are the most elaborately defined, with three pages of descriptions of the adaptive functioning deficits characteristic of each level of severity. DSM‐5 notes that severity was defined according to adaptive functioning level rather than IQ scores because the former is a better determinant of the level of supports that are needed. Similarly, the level of deficits and functional impairment defining the severity of autism spectrum disorders is linked to the supports required. The severity of learning disorders refers to the difficulties in learning skills as well as the likelihood of learning those skills with or without intervention. For example, DSM‐5 defines severe impairment of a learning disorder as “severe difficulties learning skills, affecting several academic domains, so that the individual is unlikely to learn those skills without ongoing intensive individualized and specialized teaching for most of the school years”. For these disorders, then, the severity specifier is explicitly linked to suggested levels of intervention.

Depression and mania are classified as mild, moderate or severe according to the number of symptoms, the level of distress caused by the intensity of the symptoms, and the degree of impairment in social and occupational functioning. The severity of alcohol and drug use disorders is based on the number of criteria that are met (mild: 2 or 3 criteria; moderate: 4 or 5 criteria; severe: 6 or more criteria). The severity of attention‐deficit/hyperactivity disorder is based on the number of symptoms, severity of individual symptoms, or level of impairment caused by the symptoms. The severity of bulimia nervosa is operationalized according to the number of inappropriate compensatory behaviors per week (mild: 1‐3; moderate: 4‐7; severe: 8‐13; extreme: 14 or more), though the severity designation could be increased to reflect other symptoms or level of functional impairment. For anorexia nervosa, severity is defined according to body mass index, and for binge eating disorder it is defined by the number of binge eating episodes per week, though, similar to bulimia nervosa, the severity designation can be increased to reflect other symptoms or degree of functional impairment. Severity of sexual disorders is based on the level of distress regarding the symptoms, except for premature ejaculation, for which severity is based on the time to ejaculation. The severity of cataplexy is based, in part, on lack of responsiveness to medication.

This brief overview illustrates the variability in the approaches taken in the DSM‐5 towards defining degrees of severity, with some definitions emphasizing the number of criteria met, some others emphasizing the core feature of the disorder, some based on level of distress, and some focusing on response to intervention and prediction of course. In contrast to many physical illnesses, none of the definitions of severity refer to the likelihood of imminent or distal mortality, and most definitions do not refer to prognosis or future course. Rather, most definitions of severity in DSM‐5 refer to the number of symptoms or criteria of the disorder, the frequency of symptoms, and the level of impairment or distress.

SEVERITY OF DEPRESSION

We focus on the severity of depression because it has received the most extensive research. While the research has not been entirely consistent, the severity of depression has been associated with health‐related quality of life34, functional impairment35, 36, suicidality37, 38, 39, longitudinal course40, 41, 42, 43, and several biological variables44, 45, 46. Moreover, the severity of depression has been at the core of controversies regarding the efficacy of treatment and whether certain forms of treatment should be recommended as first line interventions. Almost all research on severity is based on scores on depression symptom scales, though most scales have been developed without consideration as to how to best conceptualize and assess the severity of depression.

Severity levels of depression in DSM‐5 and ICD‐10

Three elements are used to define the severity levels of depression in DSM‐5: the number of symptoms, the level of distress caused by the intensity of the symptoms, and the degree of impairment in social and occupational functioning. The severity categorization applies to all depressive disorders, not just major depressive disorder (MDD). Mild depression is specified when “few, if any, symptoms in excess of those required to make the diagnosis are present, the intensity of the symptoms is distressing but manageable, and the symptoms result in minor impairment in social or occupational functioning”. Severe depression is specified when “the number of symptoms is substantially in excess of that required to make the diagnosis, the intensity of the symptoms is seriously distressing and unmanageable, and the symptoms markedly interfere with social and occupational functioning”. The DSM‐5 does not explicitly define moderate depression other than to say that the number of symptoms, intensity of symptoms, and/or functional impairment are between mild and severe.

There are some problems with the DSM‐5 specification of severity levels. The same definition of the severity specifier is used for MDD and persistent depressive disorder. This is a problem, because persistent depressive disorder requires fewer symptoms than does MDD to meet the DSM‐5 diagnostic threshold. Thus, a patient with persistent depressive disorder who experiences the same number of symptoms as a patient with MDD, and with similar levels of functional impairment and distress, may be classified as more severe because the symptom count may be “substantially in excess” of the diagnostic threshold for persistent depressive disorder but not for MDD.

Another problem with the DSM‐5 severity specifier is that the definition of functional impairment is limited to social or occupational functioning. This is inconsistent with the wording of the impairment criterion for the diagnosis of MDD and persistent depressive disorder, which refers to impairment in social, occupational, or other important areas of functioning. Thus, individuals who maintain social contacts, are not expected to be employed, but are unable to function as students or full‐time parents, could be misclassified as less severe than they actually are.

While moderate severity is not specifically defined, the internal logic of the wording of the moderate severity description has a minor flaw. Mild depression requires low levels of symptoms, distress and functional impairment. Conversely, severe depression requires high levels of all three. Thus, moderate depression should be defined as lying between the mild and severe levels in symptoms, distress or functional impairment (not and/or as DSM‐5 defines it).

Finally, two other variables often considered important in discussions about depression severity – suicidality and need for hospitalization – are not considered in DSM‐5’s definition of severity.

What evidence supports the validity of the DSM‐5 approach towards defining severity in this manner? One study from a population‐based registry of twins who experienced a major depressive episode in the year prior to the interview found that the three aspects of the severity specifier – number of symptoms, severity of symptoms, and degree of functional impairment – were significantly, albeit only modestly, correlated47. The authors concluded that the DSM severity construct was multifaceted and heterogeneous.

A study of psychiatric outpatients with a mood disorder48, 84% of whom were in a major depressive episode, found that the number of DSM‐IV symptoms of MDD was weakly correlated with clinicians’ ratings on the Clinical Global Impression (CGI)49 and the Global Assessment of Functioning (GAF)50. Moreover, the severity ratings of some individual symptoms of depression were as highly correlated with CGI and GAF scores as was the total number of depressive symptoms. A small study of psychiatric inpatients with MDD found that the number of MDD criteria was weakly correlated with the Global Assessment Scale51. Kessler et al52 analyzed data from the National Comorbidity Study (NCS) and found that, compared to individuals who reported five or six MDD criteria during their worst episode of depression, individuals who reported seven to nine MDD criteria experienced more psychosocial impairment, more episodes of depression, and greater chronicity. Wakefield and Schmitz53, 54 examined the NCS database as well as another epidemiological survey and suggested that the number of depressive symptoms was less important than the type of depressive symptoms and other features of complicated depression in predicting future occurrence of a major depressive episode, seeking professional help for depression, a history of suicide attempt, and a history of psychiatric hospitalization. Thus, symptom count does not seem to be an adequate indicator of depression severity.

The ICD‐1055 designates three levels of severity – mild, moderate and severe – based on number of symptoms, severity of symptoms, functional impairment, level of distress and, indirectly, type of symptoms. In contrast to DSM‐5, there is no symmetry in the descriptions of the three levels of severity. Mild depression refers to the presence of two or three symptoms that are distressing though the patient is likely to be able to continue with most activities. Moderate depression requires four or more symptoms with the patient having great difficulty to continue with ordinary activities. Severe depression requires “several symptoms that are marked and distressing, typically loss of self‐esteem and ideas of worthlessness or guilt. Suicidal thoughts and acts are common and a number of ‘somatic’ symptoms are usually present”.

As with the definition of the DSM‐5 severity specifier, little research has been done on the ICD‐10 severity specifier, perhaps because the reliability of making the severity distinctions is poor56. Poor reliability is not surprising, due to the impreciseness of the severity level definitions57.

The severity definitions in the official diagnostic systems have not been used in treatment studies. Rather, in almost all those studies, severity is designated by a score on a symptom rating instrument – usually the Hamilton Depression Rating Scale (HAMD)58 or the Montgomery‐Åsberg Depression Rating Scale (MADRS)59. Thus, treatment studies generally do not consider other factors that have been used to characterize severity, such as level of functional impairment, degree of suicidality, or depressive subtype (i.e., presence of melancholic features or psychotic symptoms)60, 61.

 

Scales measuring the severity of depression

The severity of depression has been most frequently quantified on paper‐and‐pencil and clinician‐administered rating scales. There is variability amongst the instruments in the time frame covered (the two most common time frames being the past one or two weeks), rating guidelines (most scales use Likert‐type ratings based on symptom frequency, persistence or intensity), and item content.

Little research has examined which parameters provide the most valid indicator of depression severity. Is the severity of depression best conceptualized as the number of symptoms (i.e., present or absent), frequency of symptoms (e.g., every day vs. half the days vs. few days), persistence of symptoms (e.g., always present vs. often present vs. sometimes present), or intensity of symptoms (e.g., severe vs. moderate vs. mild)? Williams et al62, in standardizing the scoring of the HAMD, created a grid scoring format to incorporate information regarding symptom frequency/persistence and intensity in the ratings. The only study to examine whether it is important to consider both intensity and frequency constructs found that symptom intensity was a better indicator of severity than symptom frequency63. In developing the Patient‐Reported Outcomes Measurement Information System (PROMIS) depression scale, Pilkonis et al64 reviewed studies comparing alternative response options and concluded that frequency scaling outperformed intensity ratings, though these were not studies of depression ratings. Thus, the most valid rating format of depression severity scales is unsettled, and has been little studied.

Should the content of a severity scale be based on the diagnostic criteria for the disorder, include other symptoms of depression that are not components of the diagnostic criteria (e.g., low motivation), or include symptoms that are frequent in depressed patients but are defining features of other disorders (e.g., anxiety, irritability)? And by what standard should one judge whether one approach or scale is a more valid indicator of severity? Statistical approaches such as item response theory have been used to construct scales65, 66. While instruments derived from this approach may be psychometrically superior to measures based on the diagnostic criteria for MDD, such measures do not include symptoms that have long been considered to be core components of depression, such as appetite and sleep disturbances or suicidality. If a measure of severity is to be utilized for clinical purposes, and not just for administrative outcome measurement, it is important to include vegetative symptoms, as the presence of these symptoms affects medication selection67, and to assess suicidality because of safety concerns.

While there are differences amongst the scales in how they were constructed, their intended purpose, item coverage, and rating guidelines, the one commonality is that the overall severity of depression is represented by the sum of the ratings of the individual items. For all but a few scales, all items on the scale are rated similarly and contribute equally to the total score. A notable exception is the HAMD58, which includes some items rated 0 to 2, and some others rated 0 to 4. To be sure, measures differ in their emphasis on different content domains of depression68. Some measures have been criticized as being multidimensional, because a unidimensional construct of depression severity is better able to demonstrate treatment effects69. However, all scales, even multidimensional measures which yield subscale scores, as well as instruments that were initially intended to screen for depression rather than being used as indicators of severity, derive a total score that has been used to denote the severity of depression.

The score summation approach is based on some assumptions that have not been empirically supported. Adding up item scores to yield a total score as an indicator of overall depression severity assumes that all symptoms are equal indicators of the severity of depression. However, the different symptoms of depression are not similarly correlated with clinicians’ global ratings of severity48. From the psychometric perspective, the rating options of individual items should convey valid information across the entire spectrum of severity70. Thus, severely depressed patients should more frequently receive the highest rating of a symptom than a low or zero rating, whereas mildly depressed patients should more frequently receive ratings indicating mild severity than the highest rating of a symptom. Santor and Coyne70, using item response theory data analytic techniques, demonstrated that some of the items of the HAMD do not meet these assumptions.

In fact, scales based on item frequency ratings are unlikely to meet these assumptions and therefore may not be good measures of severity. For example, the items on the 9‐item Patient Health Questionnaire (PHQ‐9) are rated on a four‐point scale of symptom frequency during the past two weeks: (0=not at all, 1=several days; 2=more than half the days; 3=nearly every day)71. Patients with MDD would be expected to score a 3 for most of the symptoms that are present, because the definition of MDD requires symptom presence for at least two weeks. Because of the ceiling effect, a patient with MDD seen in primary care who continues to work would score similarly to a depressed patient who is hospitalized because of difficulties with self‐care. While there are several studies of the PHQ‐9 using an item response theory approach, these have been of heterogeneous non‐depressed psychiatric, medical or community samples72, 73, 74, 75, 76, 77, 78. We are unaware of any studies evaluating the performance of the PHQ‐9 items in a sample of depressed patients presenting for treatment. We would predict that, in such a sample, some – perhaps many – items of the PHQ‐9 would be highly skewed because of the aforementioned ceiling effect. No studies have examined the impact of different rating guidelines on the operating characteristics of items on a depression scale.

Implicit in the score summation approach is that low level ratings across many symptoms reflect equal severity to high ratings across a fewer number of symptoms. For example, someone who indicates that, in the past week, he/she has infrequently experienced low mood, insomnia, low self‐esteem, guilt, reduced concentration, fatigue, psychomotor slowing, insomnia, reduced appetite, reduced concentration, impaired decision making, and reduced interest in usual activities would be considered at the same level of severity as someone who reports daily depressed mood, guilt, feelings of inferiority, and suicidal thoughts, but denies all somatic and vegetative symptoms of depression. Likewise, when item ratings are based on symptom intensity, a mild intensity rating of many symptoms is considered the same as a severe intensity rating of a more limited number of symptoms.

The score summation approach, in which all items are weighted equally, is not grounded in a specific overriding conceptualization of severity. If illness severity is conceptualized in terms of mortality risk, then one would expect a measure of depression severity to weight more heavily item ratings of suicidal thoughts, hopelessness and psychomotor agitation than ratings of impaired concentration and fatigue. On the other hand, if illness severity is conceptualized in terms of functional impairment, then one might expect items assessing impaired concentration and fatigue to be weighted more heavily than items assessing appetite reduction or guilt. To be sure, some measures assess functional impairment along with symptomatology63, 71, 79, 80, 81. No symptom‐based measure, however, has been constructed by examining the association of individual items with indices of functional impairment and including on the scale only those items that are independently associated with impairment.

Few studies have examined the association between severity ratings of individual symptoms of depression and multiple external indicators of severity. Faravelli et al48 found marked differences among symptoms in their association with CGI and GAF ratings. Moreover, the symptoms with the highest correlations with CGI ratings – such as depressed mood, psychic retardation, impaired concentration, and anhedonia – tended to have the highest correlations with GAF scores.

Most discussions of the problems with depression scales have focused on their limitations as outcome measures69, 82, 83. However, different aspects of outcome measurement may be of interest, and these differences might result in different approaches towards scale construction. Some measures of the severity of depression have been specifically designed to be sensitive to treatment effects59, 84. Some measures are linked to the symptom criteria that are used to diagnose depression71, 79, 85, 86, whereas others assess a broad range of features that patients indicate are most important in measuring outcome80 or assess a range of diagnostic and associated symptoms of depression87. Descriptions of scale construction typically focus on the content of the measure and rarely discuss the reason for choosing the rating format. For example, in developing the Multidimensional Depression Assessment Scale, Cheung and Power68 reviewed the content of fifteen depression scales and how their scale would address a content gap. There was no discussion, however, of rating formats and why a symptom frequency format was chosen for their measure rather than a rating format assessing symptom intensity.

One of the commonly used clinician rated measures of severity, the MADRS, was designed to be particularly sensitive to change in treatment trials59. Items were selected if they were prevalent in the patients at the beginning of treatment (i.e., prevalence greater than 70%), showed the greatest change from baseline to week 4 of treatment, and change in scores from baseline to week 4 on the symptom showed the greatest correlation with change in total scores on the measure. While there is nothing inherently wrong with constructing a measure in this manner for this purpose, this should not be the basis for selecting items on a measure of depression severity, as the resulting scale can be biased towards the inclusion of items that are particularly sensitive to change for the medication(s) studied. The construction of the MADRS was based on response to mianserin, maprotiline, amitriptyline, and clomipramine – medications that are not commonly used today. Using the same approach to construct a measure today, when different medications are prescribed, might produce a scale that only partially overlaps with the items included on the MADRS. In the same vein, the HAMD, which was published more than 50 years ago, has been criticized for including items that are most responsive to the effects of sedating medications such as tricyclic antidepressants88.

So, while there are many rating scales of depression, and several studies examining them, questions remain as to how to judge if one measure is a more valid indicator of depression severity than another measure. Should it be based on psychometric analyses indicating unidimensionality? Would a “better” measure of severity be more highly correlated with indices of impairment? Be more highly correlated with current suicidal ideation? Be more highly predictive of future suicidal behavior? Be more highly predictive of future mortality in general? Be more highly predictive of future course? Be better able to distinguish depressed patients who do and do not require hospitalization? Demonstrate a larger effect size in a treatment study? Have greater discriminative ability between depression and anxiety, and thus be a “purer” measure of depression?

 

A problem with depression scales: uncertain validity of cutoffs to define severity groupings

Putting aside the question of how to best conceptualize severity and construct a scale, a problem with the existing literature on depression severity is the inconsistency in the cutoff scores on symptom scales used to demarcate levels of severity, particularly severe depression. The use of various cutoff scores to define severity groups makes it difficult to compare the studies on the treatment implications of severity.

DeRubeis et al89 conducted a mega‐analysis of four studies comparing cognitive‐behavioral therapy and medication, and defined severe depression as a cutoff of 20 or more on the 17‐item HAMD. Likewise, the recent mega‐analysis of placebo‐controlled trials of fluoxetine and venlafaxine used a cutoff of 20 to define severe depression90. Both of these studies cited the landmark study by Elkin et al91 to justify their definition of severe depression. However, Elkin et al did not cite empirical evidence for this cutoff and, in fact, did not refer to the patients scoring above 20 on the HAMD in absolute terms (i.e., having severe depression), but instead referred to these patients in relative terms (i.e., having more severe depression than the patients scoring 20 and below).

In Kirsch et al’s92 meta‐analysis of the impact of severity on antidepressant‐placebo differences, the authors noted that the mean baseline HAMD scores of the antidepressant efficacy trials were in the very severe range (i.e., > 23) based on the American Psychiatric Association (APA)’s Handbook of Psychiatric Measures93 for all but two of the 35 studies included in the analysis. In a prior analysis of antidepressant efficacy studies in the Food and Drug Administration (FDA) data base, Khan et al94 divided the studies into three groups based on pre‐treatment HAMD scores (<24, 25‐27, >28) without indicating the basis for using these cutoff scores to define the groups. Fournier et al95 used the thresholds recommended in the APA’s Handbook of Psychiatric Measures93 to define grades of severity on the HAMD (mild to moderate: <18; severe: 19 to 22; very severe: >23). In contrast to these studies, and the APA guidelines, most pharmacotherapy studies have used a cutoff of 25 on the 17‐item HAMD to define severe depression96, 97, 98, 99, 100, 101 and this cutoff has been recommended by several experts102, 103, 104. Thus, severe depression has not been consistently defined.

Fundamental to studies on the treatment implications of severity levels is the validity of the cutoffs on the HAMD to define the severity categories. In none of the discussion sections of the meta‐analyses and pooled analyses of the reports on severity and treatment outcome were questions raised about the cutoffs used to define the grades of severity. The APA’s Handbook of Psychiatric Rating Scales93 cited only two small studies in support of the cutoff scores to identify severity subtypes, and neither study provided support for the APA guidelines. One was a study examining the validity of deriving a HAMD equivalent score on the Schedule for Affective Disorders and Schizophrenia105. This study did not attempt to determine the cutoff scores on the HAMD indicating grades of severity. The second study examined the association between HAMD scores and global ratings of severity in 59 depressed inpatients106. The authors did not derive (or recommend) cutoff scores corresponding to severity levels. Thus, it is unclear why a cutoff of 19 was recommended in the APA Handbook to identify severe depression. The UK National Institute for Health and Clinical Excellence (NICE) guidelines recommended a cutoff of 23 to identify severe depression on the HAMD, though no research was cited to support this recommendation107.

Because of the limited amount of empirical research establishing cutoff scores for bands of severity on the HAMD, and the significance accorded to severity by treatment guidelines, our clinical research group also examined this issue in 627 psychiatric outpatients with MDD who were rated on the CGI108. The cutoff score on the HAMD that maximized the sum of sensitivity and specificity was 17 for the comparison of mild vs. moderate depression and 24 for the comparison of moderate vs. severe depression. Based on a review of the available evidence, as well as the recommendations that a cutoff of 7 be used to define remission, we recommended the following severity ranges for the 17‐item HAMD: no depression (0‐7); mild depression (8‐16); moderate depression (17‐23); and severe depression (>24).

Each of the above studies derived cutoff scores based on clinicians’ global judgments of severity. A limitation of these studies is that it is not known on what basis the global judgments of severity were made. Were some symptoms of depression considered better indicators of severity than other symptoms? For example, are symptoms characteristic of melancholic or endogenous depression given greater weight in clinicians’ CGI ratings? Are clinicians’ global ratings disproportionately influenced by degree of suicidality? Do clinicians consider psychosocial impairment in making their CGI ratings? We are unaware of any studies that have attempted to derive severity ranges on the HAMD, or any other depression scale for that matter, based on degree of impairment or level of suicidality.

 

Another problem with depression symptom scales: different scales classify patients into different severity groups

In clinical practice, self‐report questionnaires are preferable to clinician‐rated scales because they take less time to administer. If self‐report scales are to be used to classify patients into severity categories, and if treatment recommendations are to be based, in part, on severity classification, then it is important for different scales to classify individuals similarly. However, because the content of measures differ, it would not be surprising if there were significant differences between measures.

Cameron et al109 compared the PHQ‐9 and the Hospital Anxiety and Depression Scale (HADS) severity classifications in a sample of primary care patients referred by their general practitioners in the UK to a mental health worker110. No information was provided regarding the patients’ psychiatric diagnoses. They found that the PHQ‐9 overclassified severity compared to the HADS, with twice as many patients classified in the severe range. Other studies comparing the PHQ‐9 and the HADS in medical patients found similar results111, 112. However, these studies lack an external validator and it is therefore unclear if the PHQ‐9 overclassifies, or the HADS underclassifies, severity. A second study by Cameron et al107 included the second edition of the Beck Depression Inventory (BDI‐II)113 along with the PHQ‐9 and HADS, and also assessed the patients with the HAMD. The participants were primary care patients who had been diagnosed by their general practitioner with depression. Both the PHQ‐9 and BDI‐II overclassified severity compared to the HAMD, whereas the HADS underclassified severity.

We are aware of only one study that compared self‐report scales in a sample of psychiatric outpatients with MDD114. Our clinical research group compared severity classification on three measures that assess the DSM‐IV/DSM‐5 symptom criteria for MDD: the Clinically Useful Depression Outcome Scale (CUDOS)79, the Quick Inventory of Depressive Symptomatology (QIDS)85, and the PHQ‐971. The patients were also rated on the 17‐item HAMD. In a study of depressed outpatients, we found that the correlations between the HAMD and all three self‐report scale scores were nearly identical, and the average correlation among the three self‐report scales was .73. However, the scales significantly differed in their distribution of patients into severity categories. Approximately one‐third of the patients scored in the mild range on the HAMD and CUDOS, whereas approximately 10% of the patients were mildly depressed according to the PHQ‐9 and QIDS. On the CUDOS and HAMD, moderate depression was the most frequent severity category, whereas on the PHQ‐9 and QIDS the majority of the patients were classified as severe. The majority of the patients in the moderate range on the HAMD were in the severe range on the PHQ‐9 and QIDS. Significantly fewer patients were classified as severely depressed on the CUDOS compared to the PHQ‐9 and QIDS.

With the three self‐report measures being highly correlated with each other, and equally correlated with the HAMD, what, then, might account for the marked differences between scales of similar content in the distribution of patients into severity groups?

The cutoffs on the three scales to define the severity groups were derived in different ways, and this was likely responsible for the differences between the scales in severity classification. For example, Kroenke et al71 indicated that the cutoff scores on the PHQ‐9 were chosen for the pragmatic reason of making them easier for clinicians to recall. They also noted that alternative cutoffs did not increase the association between increasing PHQ‐9 severity and indices of construct validity. When selecting the cutoff scores to define the severity ranges on the PHQ‐9, the developers of this questionnaire did not consider the potential impact of the broadness by which severity ranges were defined and how this might impact on treatment recommendations of official treatment guidelines.

Kroenke et al71 indicated that, when severity groupings based on different cutoffs are equally associated with external variables, then the cutoffs can be chosen based on their ease of recall. We disagree with this reasoning. For all scales measuring the severity of depressive symptoms, the thresholds distinguishing patients with mild, moderate and severe depression do not represent well‐demarcated lines separating the severity subtypes. As with other areas of psychopathology, the severity of depression better corresponds to a dimensional than a categorical model of classification115. Thus, alternative cutoffs to categorize severity groupings are likely to also be valid when the groupings are compared on an external variable such as psychosocial functioning. However, one should not be cavalier about the choice of cutoffs, because they impact on the relative broadness of each of the severity categories.

If clinicians are to follow official treatment guidelines’ recommendations and base initial treatment selection on the severity of depression, then it is important to have a consistent method of determining depression severity. The marked disparity between standardized self‐administered scales in the classification of depressed outpatients into severity groups indicates that there is a problem with the use of such instruments to classify depression severity. If official treatment guideline recommendations were followed, then use of measures such as the QIDS and PHQ‐9, which broadly define the severe category, would result in greater reliance on medication in preference to psychotherapy as the first line treatment option for MDD. Caution is thus warranted in the use of these scales to guide treatment selection until the thresholds to define severity ranges have been better established empirically.

 

The importance of severity of depression in treatment: official guideline recommendations

Notwithstanding the aforementioned problems with conceptualizing the severity of depression, and defining the cutoffs on scales for severity levels, depression severity is an important consideration in treatment decision‐making. The severity of depression has influenced treatment recommendations in official guidelines. The third edition of the APA’s guidelines for the treatment of MDD recommend both psychotherapy and pharmacotherapy as monotherapies for depression of mild and moderate severity, and pharmacotherapy (with or without psychotherapy) for severe depression1. The NICE updated guidelines for the treatment and management of depression discourage the use of antidepressant medication as the initial treatment option for mild depression, and recommend medication together with empirically supported psychotherapy for moderate and severe depression2. As reported by van der Lem et al116, treatment guidelines in the Netherlands also recommend pharmacotherapy as the first treatment option for severely depressed patients, and either pharmacotherapy or psychotherapy for mildly and moderately depressed patients. While the recommendations in these guidelines are not entirely consistent, they are unanimous in recommending medication as the treatment of choice for severe depression.

The treatment significance of severity has been studied in several different ways. There are controlled studies, effectiveness studies, pooled analyses, and meta‐analyses examining the impact of severity on particular treatments117, 118, 119, 120, 121, 122, comparing treatments across a range of severity99, 123, 124, 125, 126, 127, comparing medication and placebo across a range of severity128, 129, comparing psychotherapy and control groups across a range of severity130, 131, comparing treatments amongst severely depressed patients96, 101, 102, 132, and examining whether severity predicts short‐term outcome42, 133, 134, 135, treatment resistance136, longer‐term outcome40, 137, 138, 139, and relapse38.

 

Severity of depression and pharmacotherapy

In the past decade, questions have been raised whether selective serotonin reuptake inhibitors (SSRIs) and other new generation antidepressants are effective in non‐severe depression. Khan et al94 analyzed 45 clinical trials in the FDA database and found that in studies with a mean baseline 17‐item HAMD score of 24 or less there was little evidence that antidepressant medication was superior to placebo, whereas in studies with a mean baseline HAMD score of 28 or greater there was clear evidence that medication was superior to placebo. Kirsch et al92 similarly examined the FDA database, and they also examined the efficacy of antidepressants as a function of mean baseline HAMD score in the trial. Their results largely replicated the findings of Khan et al94 that drug‐placebo differences were largest in the studies with the highest baseline severity (i.e., HAMD >28). Kirsch et al92 found that antidepressants were significantly more effective than placebo in the less severe cohorts, but they considered the difference in response to be modest and clinically insignificant.

In contrast to the analyses of the FDA database by Kirsch et al92 and Khan et al94, Fournier et al95 pooled individual patient data from six published studies. Kirsch et al and Khan et al used aggregated mean scores for an entire study as the unit of analysis. That is, they compared studies with different mean severity scores at baseline. The problem with this approach is that a group of patients with a mean score in the severe range will also include some patients in the mild and moderate severity ranges. Likewise, a group of patients with a mean score in the mild or moderate severity range will include some patients scoring in the severe range. Pooling individual patient data avoids the problem of severity group misclassification at the individual patient level. Fournier et al95 replicated the finding that drug‐placebo differences were clinically significant only for severely depressed patients, and found only a small effect size for mildly and moderately depressed patients.

More recently, other pooled analyses of patient level data (rather than aggregated data from a trial) have been conducted. Using pharmaceutical company data bases, these analyses included all studies of a product, thereby avoiding the bias inherent in examining only published studies140. The results of three large, pooled analyses of published and unpublished studies, which included between 4,000 and 10,000 subjects each, indicated that antidepressants are effective across a range of severity90, 129, 141. These analyses, and the controversy that has been stirred regarding the efficacy of antidepressants, highlights the impact that considerations of severity might have on clinical practice.

 

Severity of depression and medication or psychotherapy as first line treatment

A second important severity related treatment question is whether the severity of depression should be used as the basis for recommending medication or psychotherapy as first line treatment. More specifically, the question is whether patients with severe depression should preferentially be treated with medication. A related question is whether psychotherapy is beneficial for severely depressed patients.

Symptom severity as a moderator of treatment response has been the subject of ongoing debate since the publication of the results from the US National Institute of Mental Health Treatment of Depression Collaborative Research Program (TDCRP), suggesting that psychotherapy was not as effective as medication in the acute treatment of severe depression91, 142. The first meta‐analysis of studies directly comparing psychotherapy and pharmacological interventions included 30 published studies of more than 3,000 patients143. A meta‐regression analysis examining whether effect sizes were associated with mean baseline scores on the HAMD or BDI found no evidence that baseline severity was associated with differential treatment outcome. A comparison of effect sizes in studies with baseline HAMD scores below 20 vs. 20 and above also found no differences.

A meta‐analysis of 132 controlled psychotherapy studies of more than 10,000 patients found that greater mean baseline symptom severity did not predict poorer response130. More recently, Weitz et al144 pooled individual patient data from 16 studies comparing antidepressants and cognitive behavior therapy. They defined the severe group according to the APA (HAMD ≥19) and NICE (HAMD >23) recommendations. Increased severity was associated with significantly lower remission rates (but not response rates) in both the medication and psychotherapy treatment conditions. Severity was not associated with differential treatment outcome, thus confirming the results of a prior pooled analysis based on a smaller number of studies89. In a follow‐up study, the authors conducted a pooled analysis focused on the five studies that used placebo as the control condition131. The results were consistent with the larger pooled analysis: baseline symptom severity was not associated with change in symptom severity scores from baseline to endpoint between the cognitive behavior therapy and pill placebo groups.

The results of these more recent meta‐analyses, based on severity classification according to symptom rating scales, are thus not consistent with official treatment guidelines which recommend medication as the first line treatment for severe depression.

SEVERITY OF PERSONALITY DISORDERS

Severity is clearly of import to PDs, though the current diagnostic systems do not include any formal severity ratings. PD patients identified as “severe” are more likely to exhibit high comorbidity with other psychiatric diagnoses, particularly mood, anxiety, substance use145, and other PDs146. So‐called “severe” cases are often in treatment for protracted periods of time147, 148, 149, exhibit higher rates of hospitalization and suicide attempts150, and self‐injure with greater frequency151. They are likely to be incarcerated, unable to hold down a job, and have failed relationships152. It is generally agreed that they may present a public health burden, and therefore should be identified early and get treated often3, 4, 153.

Nonetheless, the question remains: what is meant by “severe” PD? Severity has been assessed by counting the number of comorbid PD diagnoses overall, with higher comorbidity indicating higher severity152, 154, 155, 156. However, this may better reflect the severity of overall personality pathology rather than the severity of a particular PD. More severe cases of personality pathology may further be identified by case complexity and specific comorbidity patterns. The main section of DSM‐5 (i.e., Section II) identifies PDs as occurring in one of three clusters. Tyrer and Johnson157 proposed that individuals with comorbid PDs from more than one cluster are more severe than those with comorbid PDs from the same cluster. The authors further identify antisocial PD as the most severe PD based on risk to others. Therefore, the most severe cases must be diagnosed with antisocial PD as well as PDs from other clusters. Using this model, severity of PD was associated with conduct disorder, criminal behavior, homelessness, institutionalization, unemployment, and delinquent behavior in childhood.

Severity of a specific PD may be measured by counting the number of criteria met. For example, cases of borderline PD for which nine criteria are endorsed would be viewed as more severe than patients endorsing only five criteria147. However, results from our clinical research group did not support this hypothesis, finding no differences in comorbidity or psychosocial functioning based on criteria count for patients diagnosed with borderline PD158. Alternatively, severity can be defined by the frequency of symptoms. For instance, patients with borderline PD who engage in self‐injury multiple times daily would be more severe than those reporting only monthly self‐injury151.

Specific PDs have even been identified as more or less severe than others. Kernberg and Caligor159 organized PDs into a hierarchy ranging from “more severe” (e.g., borderline PD) to less severe (e.g., obsessive compulsive PD, dependent PD). There has also been a strong push for conceptualizing PDs using constellations of pathology personality traits. From this perspective, a “severe” PD symptom or trait may be defined as one that is statistically extreme, or existing in only a very small proportion of the population160.

Treatment research of “severe” personality disorders primarily emphasizes symptom characteristics (frequency, persistence, intensity) and functional impairment (social/occupational, or outcomes such as imprisonment)161, 162, 163. Maden and Tyrer162 identify a category of “dangerous and severe” PD, which is characterized by having a high risk of causing unrecoverable harm to others. Confusingly, the first criterion for having a “dangerous and severe” PD is already being diagnosed with a “severe disorder of personality” which remains undefined itself. The authors do not clarify what severity means at the criterion level, although it appears this definition is legal in origin, and refers primarily to psychopathy and not to PDs as they are traditionally defined.

 

Severity of personality disorders and functioning

Although severity has been defined in various ways in the PD literature, a general consensus appears to have emerged that PD severity is inherently linked with level of maladaptive functioning164, 165, 166, 167, 168, 169. It is widely acknowledged that extreme trait or symptom variation is insufficient to diagnose PDs or to dictate diagnostic severity. Rather, the emphasis lies in having extreme personality traits in the presence of impairment associated with those traits. Unlike physical illnesses, or even depression, which are more focused on symptom presentation, personality diagnoses are intertwined with adaptive functioning. Like depression, PDs by definition must result in “distress or impairment” to be diagnosed33. In contrast to depression, however, the symptom criteria for diagnosing PDs include both affective/cognitive/emotional and functional components. For example, impoverished occupational and financial functioning is included in symptom criteria for antisocial PD, and failure to engage in social and leisure activities is part of the criteria for obsessive‐compulsive PD.

The interrelationship between functional impairment and personality leads many to conclude that PD severity is a combination of extreme personality disturbance and maladaptive functioning associated with that disturbance165, 169. In fact, functioning is so fundamental to determining PD presence and severity that some authors argue that assessing extreme traits/symptoms is unnecessary170, 171, 172, 173. Thus, one need not demonstrate symptom severity if sufficient impairment is judged to be present. However, the dysfunction must be determined as due to the presence of the personality features, even if they are not extreme. For example, using the multiaxial DSM‐IV, Livesley174 proposed defining PD as present diagnostically on Axis I, and coding personality traits separately on Axis II. Widiger and Trull169 proposed a similar model, only using the GAF score on Axis V as a stand in for severity.

Taken together, these models converge on defining severity as a generalized, adaptive failure of an intrapsychic system required to fulfill daily life tasks166. Although specific areas of impairment differ, there is convergence on impairment in three broad areas: identity formation, self‐control (or direction), and interpersonal relationships164. However, some research indicates that pathological personality traits and functioning are so closely intertwined that they may not represent distinct domains175.

 

Severity of personality disorders as described in DSM‐5 and ICD‐10

There is no clear mention of severity with respect to PDs in the main section II of DSM‐533. However, the overall description of PDs includes severity indicators common to other disorders. For example, PDs are specifically noted to be inflexible, maladaptive, pervasive, and associated with “clinically significant” functional impairment or subjective distress. Functional impairment is an indicator of severity in many physical and psychiatric disorders; pervasiveness is a severity indicator for depression; and subjective distress is identified as indicating a “severe case” for disorders of mood and sexual function. As it stands, there is no official method for indicating PD severity in DSM‐5.

Section III (Emerging Measures and Models) of DSM‐5 includes an alternative model for diagnosing PDs. Diagnosis is defined via a combination of severity levels of dysfunction and elevated personality traits, and severity is determined principally by dysfunction associated with elevated traits33. This model does not designate a measure for overall severity, but “moderate or greater impairment” is required for diagnosis. Impairment is operationalized as falling into one of five levels, with the extreme end indicative of severe personality pathology. The Level of Personality Functioning Scale (LPFS) is proposed to rate impairments in functioning, and therefore also PD severity. Ratings are made for self (identity and self‐direction) and interpersonal (empathy and intimacy) functioning. Levels include: 0 (little or no impairment), 1 (some impairment), 2 (moderate impairment), 3 (severe impairment), 4 (extreme impairment). Individuals with extreme impairment are described as having an impoverished, unclear identity and self‐direction with maladaptive self‐concept, and completely lacking capacity to engage interpersonally.

Interestingly, DSM‐5 Section III also includes discussion of an additional measure of personality traits, the Personality Inventory for DSM‐5176. The items are clearly trait content related; however, the measure provides an overall summed score identified as measuring “overall personality dysfunction”. The identification of extreme traits as indicative of dysfunction is curious, but not inconsistent with the significant overlap between functioning and PD traits/symptoms found elsewhere in the literature175. Nonetheless, this suggests that extreme traits are at least indicative of extreme dysfunction, which is the primary index of severity in this model.

Similar to the DSM‐5, the ICD‐10 does not make mention of severity in PD classifications. However, several papers have been published on changes proposed for ICD‐11, which are substantial. Most notably, the primary classification of PDs will change to one based on severity of personality disturbance. Description of PD traits or features is optional but will not be required for diagnosis3, 4.

Consistent with the larger literature, the proposed changes to the ICD‐11 conceptualize severity primarily as dysfunction, or the personality‐related problems experienced by the individual. Again, five levels of severity are proposed, though they vary slightly from those in the DSM. Summed together, severity levels are dictated first by pervasiveness of the impairment (across situations or limited), and secondarily by the number of problematic personality traits (multiple or single). At the highest level of severity, risk to self or others is also assessed. Thus, the most severe cases are identified by functioning above all else. Symptoms/traits and risk of harm are secondary, but also considered. Unlike the DSM‐5 alternative model proposal, dysfunction in self and identity is not included in severity ratings3, 4. At the time of this writing, the ICD‐11 has not yet been published, and therefore these definitions should be considered provisional. Nonetheless, the emphasis on functioning via severity ratings has been criticized for insufficient research establishing its reliability and validity177.

 

Measures of personality severity

As early as 1996, Tyrer and Johnson157 developed a five‐point scale assessing disorder severity similar to that in the ICD‐11 proposal. Ratings were made based on information derived from a trait personality measure, the Personality Assessment Schedule (PAS)153. Thus, severity was weighted more towards extremity on traits than on functioning. The PAS has also been used to classify individuals into the four PD categories proposed by Tyrer and Johnson157: no PD, personality difficulty, simple PD, complex PD. PAS severity designations are primarily based on the frequency of DSM‐IV and ICD‐10 categories, and have been used in studies predicting treatment outcomes, albeit with mixed findings178. The General Assessment of Personality Disorder179 has been used as an index of severity in multiple studies, and provides two main scales of severity – self‐pathology and interpersonal problems – both of which reflect functional impairment as defined by the DSM‐5164, 180, 181. Similarly, the Severity Indices of Personality Problems173 defines severity as a combination of impoverished self and interpersonal functioning.

Relatively few measures of severity exist for individual PDs, and these largely focus on borderline PD. For example, the Borderline Personality Disorder Severity Index (BPDSI)151, 182 is a semi‐structured clinical interview that operationalizes severity primarily by frequency of borderline PD symptom behaviors over the preceding three months. Frequency of symptoms is rated from 0 (never) to 10 (daily). Severity is averaged across these scores, yielding severity scores for individual borderline PD criteria as well as the diagnosis overall. Thus, the BPDSI largely measures severity as a function of symptom frequency, though many of the items also ask about behaviors that have implied functional consequences (e.g., going out instead of working).

Consistent with the severity of personality pathology often being linked with impairments in functioning, PD treatment outcome research has often focused on the degree to which various treatment approaches (e.g., dialectical behavioral therapy, mentalization‐based treatments, transference‐focused psychotherapy) improve day‐to‐day functioning and reduce specific, concrete maladaptive behaviors147, 183, 184. For instance, in the extensive borderline PD treatment literature, change in personality pathology is often assessed using measures such as the Zanarini Rating Scale for Borderline Personality Disorder185 and the Barratt Impulsiveness Scale186. However, reduction in suicide attempts, self‐harm behavior, and reliance on psychiatric emergency treatment services are often primary treatment outcome measures, as are improvements in maintaining meaningful relationships and improving workplace functioning147, 183, 184, 187, 188.

Although the PD treatment literature has focused primarily on the treatment of borderline PD, other PDs also have received some attention, with functional impairment being identified as central to treatment outcomes. For instance, transference‐focused psychotherapy has demonstrated some benefit for patients with comorbid narcissistic and borderline PD, and this treatment approach emphasizes interpersonal functioning in personal and workplace relationships when assessing outcome189. Treatment research on antisocial PD has focused on subsequent substance use and arrests190. Thus, across the treatment of various PDs, treatment outcome and a reduction in “severity” is understood not just as symptom reduction, but also reduction in specific deleterious behaviors (e.g., self‐harm) and the promotion of interpersonal functioning and specific prosocial behaviors (e.g., maintaining employment).

TRANSDIAGNOSTIC MODELS AND SEVERITY: THE EMERGENCE OF PSYCHOPATHOLOGY SPECTRA

Many of the questions asked above as to how to compare the validity of depression scales in measuring severity also apply to determining if different diagnoses confer differential levels of severity. The likelihood of meeting criteria for different diagnoses confers standing on underlying genetic liabilities191, 192. This is important to consider given that individuals who meet criteria for one diagnosis are very likely to meet criteria for multiple other diagnoses193, such that various diagnoses may be thought to be manifestations of underlying spectra (e.g., antisocial PD, narcissistic PD and substance use all reflect an underlying externalizing spectrum).

Research examining the relations amongst various internalizing diagnoses characterized by subjective distress and fear suggests that it may be “easier” for individuals to meet criteria for diagnoses such as MDD than for more “severe” disorders such as generalized anxiety and panic disorders194. Put differently, meeting criteria for generalized anxiety or panic disorder reflect higher standing on the internalizing dimension than would simply meeting criteria for MDD. Interestingly, Krueger and Finger194 also found that high standing on the internalizing dimension was linked robustly to lifetime number of inpatient hospitalizations and past month psychosocial functioning.

Other more recent research has also linked “severity” on the internalizing spectrum to key outcomes. For instance, Eaton et al195 found that the likelihood of meeting criteria for various depressive disorders, anxiety disorders, and bipolar disorders can be represented by an underlying continuum. Individuals with high scores on this dimension, who would be characterized as having more “severe” levels of internalizing psychopathology, would thus be likely to meet criteria for many diagnoses and to report a broad range of symptoms (e.g., depressed mood, worry, concentration difficulties, irritability) characterizing the various DSM diagnoses defining this dimension.

Eaton et al195 presented evidence indicating that scores on the internalizing spectrum predicted outcomes such as the future occurrence of internalizing symptoms (e.g., depressed mood, worry), suicide attempts, angina/chest pain, and ulcers. Moreover, standing on this underlying dimensionally‐based internalizing spectrum predicted these outcomes much more strongly than did DSM‐based conceptualizations of various internalizing disorders (e.g., MDD, generalized anxiety disorder), thereby providing evidence for the utility of this approach in capturing severity as it relates to important outcomes such as suicidality and physical health concerns195.

In regard to other forms of psychopathology, Krueger et al196 presented evidence indicating that symptoms and behaviors defining personality and substance use disorders can be captured by an underlying externalizing dimension. Other research also supports the presence of this underlying latent externalizing dimension, which explains why antisocial behaviors (e.g., various unlawful behaviors) and traits (e.g., impulsivity, callousness) and substance use issues are likely to co‐occur191, 197. Carragher et al197 presented findings suggesting that meeting criteria for some disorders (e.g., cocaine dependence) confers higher standing and severity on this underlying externalizing dimension than other “less severe” disorders (e.g., nicotine and alcohol dependence). Similarly, overlap in disorders such as schizophrenia and schizotypal PD appears to be reflected by a thought disorder spectrum191, 198. Standing on this spectrum has been linked to functional impairment and illness course198.

Going forward, it will continue to be important for future research to further explicate how level of severity (i.e., how likely an individual is to meet criteria for different disorders and to meet criteria for “difficult” disorders such as cocaine dependence in the case of the externalizing spectrum) captured by broad internalizing, externalizing, and thought disorder dimensions predicts illness course and other key outcomes related to morbidity and mortality. These dimensions account for diagnostic comorbidity amongst various disorders and have been shown to predict various outcomes more strongly than diagnostic status on various DSM disorders, suggesting important merits to this approach191, 195. In this regard, the Hierarchical Taxonomy of Psychopathology (HiTOP) has emerged as a dimensionally‐based alternative to the DSM‐5191, 199. Thus, it will be important to determine the degree to which this framework adequately captures psychopathology “severity”, however severity is defined, and is useful for researchers and practitioners.

CONCLUSIONS

The issue of severity has great clinical importance. Severity influences decisions about level of care and affects decisions to seek government assistance due to psychiatric disability. In outpatient settings, the importance of severity is reflected in the controversy about the efficacy of antidepressants across the spectrum of depression severity, and whether patients with severe depression should be preferentially treated with medication rather than psychotherapy.

We began this paper with a series of questions as to how the severity of psychopathology should be conceptualized. Some authors have suggested that the core indicator of the severity of mental illness is functional disability200. The DSM‐5 has defined the severity of different disorders in different ways. Our review of the literature for depression and PDs demonstrated that researchers have adopted a myriad of ways of defining severity. The severity of depression has predominantly been defined according to scores on symptom rating scales. To be sure, there is some variability in how items are rated (i.e., symptom intensity vs. symptom frequency vs. symptom persistence), as well as some variability in the range of symptoms assessed by different measures of depression. Irrespective of the precise manner by which symptom severity is determined, most of the literature on the severity of depression is based on the parameters of symptoms. By contrast, the core of personality pathology is intertwined with its impact on functioning. Distinguishing extreme variants of personality traits from functioning has been challenging, therefore functional impairment has been fundamental to conceptualizing the severity of PDs.

Because the functional impact of symptom‐defined disorders such as MDD depends on factors unrelated to the disorder such as self‐efficacy, resilience, coping ability, social support, cultural and social expectations, as well as the responsibilities related to one’s primary role function and the availability of others to assume those responsibilities, we would argue that the severity of such disorders should be defined independently from functional impairment. To those who would disagree, consider the following scenario: two individuals have an upper respiratory tract infection. They have the same elevation in body temperature, sneeze and cough with the same frequency, have the same level of mucus production and nasal discharge, and the same viral load. And the symptoms last for the same number of days. In short, they have the same intensity, frequency, and persistence of symptoms. Yet one person misses work for a week and the other does not miss work. Does the person who missed work have a more severe upper respiratory tract infection?

A distinction could be made between defining severity at the level of a disorder vs. overall global illness severity. As stated, at the level of disorder, severity should be determined by the factors that are intrinsic to the disorder. Thus, the severity of depression should be determined by the intensity, frequency, and/or persistence of the depressive symptoms. And the same is true for other disorders such as generalized anxiety disorder, post‐traumatic stress disorder, mania/hypomania, and tic disorder. The severity of panic disorder should be based on the intensity and frequency of panic attacks. The severity of premature ejaculation should be based on time to ejaculation, the severity of hypoactive sexual desire based on the intensity (or lack thereof) of desire, the severity of binge eating disorder on the frequency and intensity of binges, etc.. The episodic nature of some psychiatric disorders and symptoms presents some measurement challenges. There may be day‐to‐day variability in symptom intensity as well as symptom persistence through the course of the day. Symptom frequency varies by disorder. Too little research has compared the validity of symptom intensity, frequency, and persistence assessments.

Severity, however, can be considered from another perspective: at the level of overall illness. A patient with depression, borderline PD, some anxiety disorders, substance use disorder and an eating disorder has a severe illness. It would likely be difficult to parse the levels of functional impairment to the separate disorders. The severity of the symptoms of depression may not be high, but the patient is nonetheless severely ill. How to take into account comorbidity when determining the severity of individual disorders is not clear. A global rating of overall illness severity was included in DSM‐III through DSM‐IV, but dropped from DSM‐5. The global rating of illness severity can be considered to be akin to the composite measures of physical illness severity, described in the introduction, that have been used to predict mortality in emergency room and hospitalized patients. The problem with the GAF was that it was a single rating that required consideration of multiple constructs, including symptom frequency, type of symptom, level of impairment, suicidality, ability to care for oneself, and psychosis. Because of its complexity, there were problems with the reliability of its ratings201. Perhaps the dimensionally based measures of psychopathology articulated in HiTOP will yield clinically meaningful and useful approaches towards characterizing overall severity.

In the future, research on severity needs to be clear as to what correlates of a measure are expected. We noted above that too little research has compared the validity of symptom intensity, frequency, and persistence assessments. The question is how to evaluate validity. Should severity be a predictor of outcome? Should it help match patients to appropriate treatments or appropriate levels of care? Should it predict mortality? Should it reflect underlying pathophysiology? Should it confer genetic risk? Should it be used to guide the allocation of finite resources at either the insurance company or governmental funding agency level?

There are a wealth of papers in the psychiatric, medical and epidemiological literatures that refer to depression severity in the title and examine the correlates of a measure of depressive symptoms. But how to best measure severity has largely not been the subject of study. Numerous scales have been developed that purport to measure the severity of depression. When the authors of these scales discuss the reason behind developing their measure, the explanation usually focuses on item content and rarely discusses the reason for choosing a particular rating approach. Perhaps it does not make a meaningful difference how items are scaled. Perhaps the exact content of a scale does not make a meaningful difference either. Perhaps simplicity and clinical utility should trump any minor incremental validity that one measure shows over another.

However, some research suggests otherwise. The ability to detect differences between medication and placebo may be related to the content of the measure used202. Scales differ in severity classification111, 112, 114, and treatment guidelines suggest that severity be used to select among treatment alternatives1, 2. Thus, severity has real world implications in both the research and clinical communities. It is our hope that this paper stimulates more consideration and research into the issue of how to best conceptualize and measure the severity of psychiatric disorders.

References

1. American Psychiatric Association . Practice guideline for the treatment of patients with major depressive disorder, 3rd ed. Washington: American Psychiatric Association, 2010. []
2. National Collaborating Centre for Mental Health . Depression: the treatment and management of depression in adults. London: National Institute for Health and Clinical Excellence, 2009. []
3. Tyrer P, Crawford M, Mulder R et al. Reclassifying personality disorders. Lancet 2011;377:1814‐5. [PubMed] []
4. Tyrer P, Crawford M, Mulder R et al. The rationale for the reclassification of personality disorder in the 11th revision of the International Classification of Diseases (ICD‐11). Personal Ment Health 2011;5:246‐59. []
5. Minichiello E, Semerano L, Boissier MC. Time trends in the incidence, prevalence, and severity of rheumatoid arthritis: a systematic literature review. Joint Bone Spine 2016;83:625‐30. [PubMed] []
6. Hirai FE, Tielsch JM, Klein BE et al. Relationship between retinopathy severity, visual impairment and depression in persons with long‐term type 1 diabetes. Ophthalmic Epidemiol 2012;19:196‐203. [PMC free article] [PubMed] []
7. Greco A, Steca P, Pozzi R et al. Predicting depression from illness severity in cardiovascular disease patients: self‐efficacy beliefs, illness perception, and perceived social support as mediators. Int J Behav Med 2014;21:221‐9. [PubMed] []
8. Steca P, Greco A, Monzani D et al. How does illness severity influence depression, health satisfaction and life satisfaction in patients with cardiovascular disease? The mediating role of illness perception and self‐efficacy beliefs. Psychol Health 2013;28:765‐83. [PubMed] []
9. Pelletier R, Lavoie KL, Bacon SL et al. Depression and disease severity in patients with premature acute coronary syndrome. Am J Med 2014;127:87‐93. [PubMed] []
10. Carels RA. The association between disease severity, functional status, depression and daily quality of life in congestive heart failure patients. Qual Life Res 2004;13:63‐72. [PubMed] []
11. Snell C, Fernandes S, Bujoreanu IS et al. Depression, illness severity, and healthcare utilization in cystic fibrosis. Pediatr Pulmonol 2014;49:1177‐81. [PubMed] []
12. Deterding K, Gruner N, Buggisch P et al. Symptoms of anxiety and depression are frequent in patients with acute hepatitis C and are not associated with disease severity. Eur J Gastroenterol Hepatol 2016;28:187‐92. [PubMed] []
13. Euesden J, Matcham F, Hotopf M et al. The relationship between mental health, disease severity, and genetic risk for depression in early rheumatoid arthritis. Psychosom Med 2017;79:638‐45. [PMC free article] [PubMed] []
14. Reynolds JC, Rittenberger JC, Toma C et al. Risk‐adjusted outcome prediction with initial post‐cardiac arrest illness severity: implications for cardiac arrest survivors being considered for early invasive strategy. Resuscitation 2014;85:1232‐9. [PubMed] []
15. Coppler PJ, Elmer J, Calderon L et al. Validation of the Pittsburgh Cardiac Arrest Category illness severity score. Resuscitation 2015;89:86‐92. [PMC free article] [PubMed] []
16. Schaeffer JJ, Gil KM, Burchinal M et al. Depression, disease severity, and sickle cell disease. J Behav Med 1999;22:115‐26. [PubMed] []
17. Kim KU, Park HK, Jung HY et al. Association of depression with disease severity in patients with chronic obstructive pulmonary disease. Lung 2014;192:243‐9. [PubMed] []
18. van Dijk JP, Havlikova E, Rosenberger J et al. Influence of disease severity on fatigue in patients with Parkinson’s disease is mainly mediated by symptoms of depression. Eur Neurol 2013;70:201‐9. [PubMed] []
19. Goetz CG, Tilley BC, Shaftman SR et al. Movement Disorder Society‐sponsored revision of the Unified Parkinson’s Disease Rating Scale (MDS‐UPDRS): scale presentation and clinimetric testing results. Mov Disord 2008;23:2129‐70. [PubMed] []
20. Bennett JA, Riegel B, Bittner V et al. Validity and reliability of the NYHA classes for measuring research outcomes in patients with cardiac disease. Heart Lung 2002;31:262‐70. [PubMed] []
21. Knaus WA, Zimmerman JE, Wagner DP et al. APACHE‐acute physiology and chronic health evaluation: a physiologically based classification system. Crit Care Med 1981;9:591‐7. [PubMed] []
22. Le Gall JR, Lemeshow S, Saulnier F. A new Simplified Acute Physiology Score (SAPS II) based on a European/North American multicenter study. JAMA 1993;270:2957‐63. [PubMed] []
23. de Groot B, de Deckere ER, Flameling R et al. Performance of illness severity scores to guide disposition of emergency department patients with severe sepsis or septic shock. Eur J Emerg Med 2012;19:316‐22. [PubMed] []
24. Schneider AG, Lipcsey M, Bailey M et al. Simple translational equations to compare illness severity scores in intensive care trials. J Crit Care 2013;28:e881‐8. [PubMed] []
25. Barry MJ, Fowler FJ Jr, O’Leary MP et al. The American Urological Association symptom index for benign prostatic hyperplasia. J Urol 2017;197:S189‐97. [PubMed] []
26. Folmer RL, Shi YB. SSRI use by tinnitus patients: interactions between depression and tinnitus severity. Ear Nose Throat J 2004;83:107‐8. [PubMed] []
27. Gros DF, Antony MM, McCabe RE et al. Frequency and severity of the symptoms of irritable bowel syndrome across the anxiety disorders and depression. J Anxiety Disord 2009;23:290‐6. [PubMed] []
28. Stewart WF, Lipton RB, Simon D et al. Validity of an illness severity measure for headache in a population sample of migraine sufferers. Pain 1999;79:291‐301. [PubMed] []
29. Baker GA, Smith DF, Jacoby A et al. Liverpool Seizure Severity Scale revisited. Seizure 1998;7:201‐5. [PubMed] []
30. Clark JA, Spiro A, Miller DR et al. Patient‐based measures of illness severity in the Veterans Health Study. J Ambul Care Manage 2005;28:274‐85. [PubMed] []
31. Slade M, Powell R, Strathdee G. Current approaches to identifying the severely mentally ill. Soc Psychiatry Psychiatr Epidemiol 1997;32:177‐84. [PubMed] []
32. Ruggeri M, Leese M, Thornicroft G et al. Definition and prevalence of severe and persistent mental illness. Br J Psychiatry 2000;177:149‐55. [PubMed] []
33. American Psychiatric Association . Diagnostic and statistical manual of mental disorders, 5th ed. Arlington: American Psychiatric Association, 2013. []
34. Fattori A, Neri L, Bellomo A et al. Depression severity and concentration difficulties are independently associated with HRQOL in patients with unipolar depressive disorders. Qual Life Res 2017;26:2459‐69. [PubMed] []
35. Goethe JW, Fischer EH, Wright JS. Severity as a key construct in depression. J Nerv Ment Dis 1993;181:718‐24. [PubMed] []
36. Luty SE, Joyce PR, Mulder RT et al. Social adjustment in depression: the impact of depression severity, personality, and clinic versus community sampling. J Affect Disord 2002;70:143‐54. [PubMed] []
37. Bradvik L, Mattisson C, Bogren M et al. Long‐term suicide risk of depression in the Lundby cohort 1947‐1997 – severity and gender. Acta Psychiatr Scand 2008;117:185‐91. [PubMed] []
38. Kessing LV. Severity of depressive episodes according to ICD‐10: prediction of risk of relapse and suicide. Br J Psychiatry 2004;184:153‐6. [PubMed] []
39. Wang YY, Jiang NZ, Cheung EF et al. Role of depression severity and impulsivity in the relationship between hopelessness and suicidal ideation in patients with major depressive disorder. J Affect Disord 2015;183:83‐9. [PubMed] []
40. Katon W, Unutzer J, Russo J. Major depression: the importance of clinical characteristics and treatment response to prognosis. Depress Anxiety 2010;27:19‐26. [PubMed] []
41. Keller MB, Lavori PW, Mueller TI et al. Time to recovery, chronicity, and levels of psychopathology in major depression. A 5‐year prospective follow‐up of 431 subjects. Arch Gen Psychiatry 1992;49:809‐16. [PubMed] []
42. Meyers BS, Sirey JA, Bruce M et al. Predictors of early recovery from major depression among persons admitted to community‐based clinics: an observational study. Arch Gen Psychiatry 2002;59:729‐35. [PubMed] []
43. Melartin T, Rytsala H, Leskela U et al. Severity and comorbidity predict episode duration and recurrence of DSM‐IV major depressive disorder. J Clin Psychiatry 2004;65:810‐9. [PubMed] []
44. Berent D, Zboralski K, Orzechowska A et al. Thyroid hormones association with depression severity and clinical outcome in patients with major depressive disorder. Mol Biol Rep 2014;41:2419‐25. [PMC free article] [PubMed] []
45. de Diego‐Adelino J, Pires P, Gomez‐Anson B et al. Microstructural white‐matter abnormalities associated with treatment resistance, severity and duration of illness in major depression. Psychol Med 2014;44:1171‐82. [PubMed] []
46. Zimmerman M, Coryell W, Pfohl B. The validity of the dexamethasone suppression test as a marker for endogenous depression. Arch Gen Psychiatry 1986;43:347‐55. [PubMed] []
47. Lux V, Aggen SH, Kendler KS. The DSM‐IV definition of severity of major depression: inter‐relationship and validity. Psychol Med 2010;40:1691‐701. [PMC free article] [PubMed] []
48. Faravelli C, Servi P, Arends J et al. Number of symptoms, quantification, and qualification of depression. Compr Psychiatry 1996;37:307‐15. [PubMed] []
49. Guy W. ECDEU assessment manual for psychopharmacology. Rockville: National Institute of Mental Health, 1976. []
50. American Psychiatric Association . Diagnostic and statistical manual of mental disorders, 4th ed. Washington: American Psychiatric Association, 1994. []
51. Kitamura T, Nakagawa Y, Machizawa S. Grading depression severity by symptom scores: is it a valid method for subclassifying depressive disorders? Compr Psychiatry 1993;34:280‐3. [PubMed] []
52. Kessler RC, Zhao S, Blazer DG et al. Prevalence, correlates, and course of minor depression and major depression in the National Comorbidity Survey. J Affect Disord 1997;45:19‐30. [PubMed] []
53. Wakefield JC, Schmitz MF. Severity of complicated versus uncomplicated subthreshold depression: new evidence on the “monotonicity thesis” from the National Comorbidity Survey. J Affect Disord 2017;212:101‐9. [PubMed] []
54. Wakefield JC, Schmitz MF. Symptom quality versus quantity in judging prognosis: using NESARC predictive validators to locate uncomplicated major depression on the number‐of‐symptoms severity continuum. J Affect Disord 2017;208:325‐9. [PubMed] []
55. World Health Organization . International classification of diseases and related health problems, 10th revision. Geneva: World Health Organization, 2016. []
56. Hiller W, Dichtl G, Hecht H et al. Evaluating the new ICD‐10 categories of depressive episode and recurrent depressive disorder. J Affect Disord 1994;31:49‐60. [PubMed] []
57. Montgomery S. Are the ICD‐10 or DSM‐5 diagnostic systems able to define those who will benefit from treatment for depression? CNS Spectr 2016;21:283‐8. [PubMed] []
58. Hamilton M. A rating scale for depression. J Neurol Neurosurg Psychiatry 1960;23:56‐62. [PMC free article] [PubMed] []
59. Montgomery SA, Åsberg M. A new depression scale designed to be sensitive to change. Br J Psychiatry 1979;134:382‐9. [PubMed] []
60. Nemeroff CB. The burden of severe depression: a review of diagnostic challenges and treatment alternatives. J Psychiatr Res 2007;41:189‐206. [PubMed] []
61. Sonawalla SB, Fava M. Severe depression: is there a best approach? CNS Drugs 2001;15:765‐76. [PubMed] []
62. Williams JB, Kobak KA, Bech P et al. The GRID‐HAMD: standardization of the Hamilton Depression Rating Scale. Int Clin Psychopharmacol 2008;23:120‐9. [PubMed] []
63. Parker G, Hadzi‐Pavlovic D, Sengoz A et al. A brief self‐report depression measure assessing mood state and social impairment. J Affect Disord 1994;30:133‐42. [PubMed] []
64. Pilkonis PA, Choi SW, Reise SP et al. Item banks for measuring emotional distress from the Patient‐Reported Outcomes Measurement Information System (PROMIS): depression, anxiety, and anger. Assessment 2011;18:263‐83. [PMC free article] [PubMed] []
65. Balsamo M, Giampaglia G, Saggino A. Building a new Rasch‐based self‐report inventory of depression. Neuropsychiatr Dis Treat 2014;10:153‐65. [PMC free article] [PubMed] []
66. Vaccarino AL, Evans KR, Kalali AH et al. The Depression Inventory Development Workgroup: a collaborative, empirically driven initiative to develop a new assessment tool for major depressive disorder. Innov Clin Neurosci 2016;13:20‐31. [PMC free article] [PubMed] []
67. Zimmerman M, Posternak M, Friedman M et al. Which factors influence psychiatrists’ selection of an antidepressant? Am J Psychiatry 2004;161:1285‐9. [PubMed] []
68. Cheung HN, Power MJ. The development of a new multidimensional depression assessment scale: preliminary results. Clin Psychol Psychother 2012;19:170‐8. [PubMed] []
69. Licht RW, Qvitzau S, Allerup P et al. Validation of the Bech‐Rafaelsen Melancholia Scale and the Hamilton Depression Scale in patients with major depression; is the total score a valid measure of illness severity? Acta Psychiatr Scand 2005;111:144‐9. [PubMed] []
70. Santor D, Coyne J. Examining symptom expression as a function of symptom severity: item performance on the Hamilton Rating Scale for Depression. Psychol Assess 2001;13:127‐39. [PubMed] []
71. Kroenke K, Spitzer R, Williams J. The PHQ‐9. Validity of a brief depression severity measure. J Gen Int Med 2001;16:606‐13. [PMC free article] [PubMed] []
72. Downey L, Hayduk LA, Curtis JR et al. Measuring depression‐severity in critically ill patients’ families with the Patient Health Questionnaire (PHQ): tests for unidimensionality and longitudinal measurement invariance, with implications for CONSORT. J Pain Sympt Manage 2016;51:938‐46. [PMC free article] [PubMed] []
73. Fischer HF, Tritt K, Klapp BF et al. How to compare scores from different depression scales: equating the Patient Health Questionnaire (PHQ) and the ICD‐10‐Symptom Rating (ISR) using item response theory. Int J Methods Psychiatr Res 2011;20:203‐14. [PubMed] []
74. Adler M, Hetta J, Isacsson G et al. An item response theory evaluation of three depression assessment instruments in a clinical sample. BMC Med Res Methodol 2012;12:84. [PMC free article] [PubMed] []
75. Barthel D, Barkmann C, Ehrhardt S et al. Screening for depression in pregnant women from Cote d’Ivoire and Ghana: psychometric properties of the Patient Health Questionnaire‐9. J Affect Disord 2015;187:232‐40. [PubMed] []
76. Pedersen SS, Mathiasen K, Christensen KB et al. Psychometric analysis of the Patient Health Questionnaire in Danish patients with an implantable cardioverter defibrillator (The DEFIB‐WOMEN study). J Psychosom Res 2016;90:105‐12. [PubMed] []
77. Umegaki Y, Todo N. Psychometric properties of the Japanese CES‐D, SDS, and PHQ‐9 depression scales in university students. Psychol Assess 2017;29:354‐9. [PubMed] []
78. Zhong Q, Gelaye B, Fann JR et al. Cross‐cultural validity of the Spanish version of PHQ‐9 among pregnant Peruvian women: a Rasch item response theory analysis. J Affect Disord 2014;158:148‐53. [PMC free article] [PubMed] []
79. Zimmerman M, Chelminski I, McGlinchey JB et al. A clinically useful depression outcome scale. Compr Psychiatry 2008;49:131‐40. [PubMed] []
80. Zimmerman M, Galione J, Attiullah N et al. Depressed patients perspectives of two measures of outcome: the Quick Inventory of Depressive Symptomatology (QIDS) and the Remission from Depression Questionnaire (RDQ). Ann Clin Psychiatry 2011;23:208‐12. [PubMed] []
81. Bentley KH, Gallagher MW, Carl JR et al. Development and validation of the Overall Depression Severity and Impairment Scale. Psychol Assess 2014;26:815‐30. [PubMed] []
82. Bagby RM, Ryder AG, Schuller DR et al. The Hamilton Depression Rating Scale: has the gold standard become a lead weight? Am J Psychiatry 2004;161:2163‐77. [PubMed] []
83. Zimmerman M, Posternak M, Chelminski I. Is it time to replace the Hamilton Depression Rating Scale as the primary outcome measure in treatment studies of depression? J Clin Psychopharmacol 2005;25:105‐10. [PubMed] []
84. Tiplady B. A self‐rating scale for depression designed to be sensitive to change. Neuropharmacology 1980;19:1211‐2. [PubMed] []
85. Rush A, Trivedi M, Ibrahim H et al. The 16‐item Quick Inventory of Depressive Symptomatology (QIDS), clinician rating (QIDS‐C), and self‐report (QIDS‐SR): a psychometric evaluation in patients with chronic major depression. Biol Psychiatry 2003;54:573‐83. [PubMed] []
86. Olsen LR, Jensen DV, Noerholm V et al. The internal and external validity of the Major Depression Inventory in measuring severity of depressive states. Psychol Med 2003;33:351‐6. [PubMed] []
87. Rush AJ, Gullion CM, Basco MR et al. The Inventory of Depressive Symptomatology (IDS). Psychol Med 1996;26:477‐86. [PubMed] []
88. Moller HJ. Methodological aspects in the assessment of severity of depression by the Hamilton Depression Scale. Eur Arch Psychiatry Clin Neurosci 2001;251(Suppl. 2):13‐20. [PubMed] []
89. DeRubeis RJ, Gelfand LA, Tang TZ et al. Medications versus cognitive behavior therapy for severely depressed outpatients: mega‐analysis of four randomized comparisons. Am J Psychiatry 1999;156:1007‐13. [PubMed] []
90. Gibbons R, Hur K, Brown C et al. Benefits from antidepressants: synthesis of 6‐week patient‐level outcomes from double‐blind placebo‐controlled randomized trials of fluoxetine and venlafaxine. Arch Gen Psychiatry 2012;69:572‐9. [PMC free article] [PubMed] []
91. Elkin I, Gibbons R, Shea M et al. Initial severity and differential treatment outcome in the National Institute of Mental Health Treatment of Depression Collaborative Research Program. J Consult Clin Psychol 1995;63:841‐7. [PubMed] []
92. Kirsch I, Deacon BJ, Huedo‐Medina TB et al. Initial severity and antidepressant benefits: a meta‐analysis of data submitted to the Food and Drug Administration. PLoS Med 2008;5:260‐8. [PMC free article] [PubMed] []
93. Rush AJ, First MB, Blacker D. Handbook of psychiatric measures, 2nd ed. Washington: American Psychiatric Publishing, 2008. []
94. Khan A, Leventhal RM, Khan SR et al. Severity of depression and response to antidepressants and placebo: an analysis of the Food and Drug Administration database. J Clin Psychopharmacol 2002;22:40‐5. [PubMed] []
95. Fournier JC, DeRubeis RJ, Hollon SD et al. Antidepressant drug effects and depression severity: a patient‐level meta‐analysis. JAMA 2010;303:47‐53. [PMC free article] [PubMed] []
96. Dunner D, Lipschitz A, Pitts C et al. Efficacy and tolerability of controlled‐release paroxetine in the treatment of severe depression: post hoc analysis of pooled data from a subset of subjects in four double‐blind clinical trials. Clin Ther 2005;27:1901‐11. [PubMed] []
97. Kasper S. Efficacy of antidepressants in the treatment of severe depression: the place of mirtazapine. J Clin Psychopharmacol 1997;17(Suppl. 1):19S‐28S. [PubMed] []
98. Montgomery S, Ferguson J, Schwartz G. The antidepressant efficacy of reboxetine in patients with severe depression. J Clin Psychopharmacol 2003;23:45‐50. [PubMed] []
99. Schmitt AB, Bauer M, Volz HP et al. Differential effects of venlafaxine in the treatment of major depressive disorder according to baseline severity. Eur Arch Psychiatry Clin Neurosci 2009;259:329‐39. [PubMed] []
100. Shelton RC, Prakash A, Mallinckrodt CH et al. Patterns of depressive symptom response in duloxetine‐treated outpatients with mild, moderate or more severe depression. Int J Clin Pract 2007;61:1337‐48. [PubMed] []
101. Versiani M, Moreno R, Ramakers‐van Moorsel C et al. Comparison of the effects of mirtazapine and fluoxetine in severely depressed patients. CNS Drugs 2005;19:137‐46. [PubMed] []
102. Hirschfeld R. Efficacy of SSRIs and newer antidepressants in severe depression: comparison with TCAs. J Clin Psychiatry 1999;60:326‐35. [PubMed] []
103. Montgomery S, Lecrubier Y. Is severe depression a separate indication? Eur Neuropsychopharmacol 1999;9:259‐64. [PubMed] []
104. Schatzberg AF. Antidepressant effectiveness in severe depression and melancholia. J Clin Psychiatry 1999;60(Suppl. 4):14‐21. [PubMed] []
105. Endicott J, Cohen J, Nee J et al. Hamilton depression rating scale. Arch Gen Psychiatry 1981;38:98‐103. [PubMed] []
106. Kearns NP, Cruickshank CA, McGuigan KJ et al. A comparison of depression rating scales. Br J Psychiatry 1982;141:45‐9. [PubMed] []
107. Cameron IM, Cardy A, Crawford JR et al. Measuring depression severity in general practice: discriminatory performance of the PHQ‐9, HADS‐D, and BDI‐II. Br J Gen Pract 2011;61:e419‐26. [PMC free article] [PubMed] []
108. Zimmerman M, Martinez JH, Young D et al. Severity classification on the Hamilton Depression Rating Scale. J Affect Disord 2013;150:384‐8. [PubMed] []
109. Zigmond AS, Snaith RP. The Hospital Anxiety and Depression Scale. Acta Psychiatr Scand 1983;67:361‐70. [PubMed] []
110. Cameron IM, Crawford JR, Lawton K et al. Psychometric comparison of PHQ‐9 and HADS for measuring depression severity in primary care. Br J Gen Pract 2008;58:32‐6. [PMC free article] [PubMed] []
111. Hansson M, Chotai J, Nordstom A et al. Comparison of two self‐rating scales to detect depression: HADS and PHQ‐9. Br J Gen Pract 2009;59:e283‐8. [PMC free article] [PubMed] []
112. Reddy P, Philpot B, Ford D et al. Identification of depression in diabetes: the efficacy of PHQ‐9 and HADS‐D. Br J Gen Pract 2010;60:e239‐45. [PMC free article] [PubMed] []
113. Beck A, Steer R, Brown G. The Beck Depression Inventory, 2nd ed. San Antonio: The Psychological Corporation, 1996. []
114. Zimmerman M, Martinez J, Friedman M et al. How can we use depression severity to guide treatment selection when measures of depression categorize patients differently? J Clin Psychiatry 2012;73:1287‐91. [PubMed] []
115. Ruscio J, Zimmerman M, McGlinchey JB et al. Diagnosing major depressive disorder XI: a taxometric investigation of the structure underlying DSM‐IV symptoms. J Nerv Ment Dis 2007;195:10‐9. [PubMed] []
116. van der Lem R, van der Wee NJ, van Veen T et al. The generalizability of antidepressant efficacy trials to routine psychiatric out‐patient practice. Psychol Med 2011;41:1353‐63. [PubMed] []
117. Bielski RJ, Friedel RO. Prediction of tricyclic antidepressant response: a critical review. Arch Gen Psychiatry 1976;33:1479‐89. [PubMed] []
118. Grammer GG, Kuhle AR, Clark CC et al. Severity of depression predicts remission rates using transcranial magnetic stimulation. Front Psychiatry 2015;6:114. [PMC free article] [PubMed] []
119. Jones NP, Siegle GJ, Thase ME. Effects of rumination and initial severity on remission to cognitive therapy for depression. Cogn Ther Res 2008;32:591‐604. [PMC free article] [PubMed] []
120. Lisanby SH, Husain MM, Rosenquist PB et al. Daily left prefrontal repetitive transcranial magnetic stimulation in the acute treatment of major depression: clinical predictors of outcome in a multisite, randomized controlled clinical trial. Neuropsychopharmacology 2009;34:522‐34. [PubMed] []
121. Bower P, Kontopantelis E, Sutton A et al. Influence of initial severity of depression on effectiveness of low intensity interventions: meta‐analysis of individual patient data. BMJ 2013;346:f540. [PMC free article] [PubMed] []
122. Sugawara Y, Higuchi H, Yoshida K et al. Response rate obtained using milnacipran depending on the severity of depression in the treatment of major depressive patients. Clin Neuropharmacol 2006;29:6‐9. [PubMed] []
123. Friedman ES, Davis LL, Zisook S et al. Baseline depression severity as a predictor of single and combination antidepressant treatment outcome: results from the CO‐MED trial. Eur Neuropsychopharmacol 2012;22:183‐99. [PMC free article] [PubMed] []
124. Kennedy S, Andersen H, Lam R. Efficacy of escitalopram in the treatment of major depressive disorder compared with conventional selective serotonin reuptake inhibitors and venlafaxine SR: a meta‐analysis. J Psychiatry Neurosci 2006;31:122‐31. [PMC free article] [PubMed] []
125. Angst J, Amrein R, Stabl M. Moclobemide and tricyclic antidepressants in severe depression: meta‐analysis and prospective studies. J Clin Psychopharmacol 1995;15:16S‐23S. [PubMed] []
126. Wiles NJ, Mulligan J, Peters TJ et al. Severity of depression and response to antidepressants: GENPOD randomised controlled trial. Br J Psychiatry 2011;200:130‐6. [PubMed] []
127. Kilts CD, Wade AG, Andersen HF et al. Baseline severity of depression predicts antidepressant drug response relative to escitalopram. Exp Opin Pharmacother 2009;10:927‐36. [PubMed] []
128. Khan A, Sambunaris A, Edwards J et al. Vilazodone in the treatment of major depressive disorder: efficacy across symptoms and severity of depression. Int Clin Psychopharmacol 2014;29:86‐92. [PMC free article] [PubMed] []
129. Mosca D, Zhang M, Prieto R et al. Efficacy of desvenlafaxine compared with placebo in major depressive disorder patients by age group and severity of depression at baseline. J Clin Psychopharmacol 2017;37:182‐92. [PubMed] []
130. Driessen E, Cuijpers P, Hollon SD et al. Does pretreatment severity moderate the efficacy of psychological treatment of adult outpatient depression? A meta‐analysis. J Consult Clin Psychol 2010;78:668‐80. [PubMed] []
131. Furukawa TA, Weitz ES, Tanaka S et al. Initial severity of depression and efficacy of cognitive‐behavioural therapy: individual‐participant data meta‐analysis of pill‐placebo‐controlled trials. Br J Psychiatry 2017;210:190‐6. [PMC free article] [PubMed] []
132. Guelfi JD, Ansseau M, Timmerman L et al. Mirtazapine versus venlafaxine in hospitalized severely depressed patients with melancholic features. J Clin Psychopharmacol 2001;21:425‐31. [PubMed] []
133. Henkel V, Seemuller F, Obermeier M et al. Relationship between baseline severity of depression and antidepressant treatment outcome. Pharmacopsychiatry 2011;44:27‐32. [PubMed] []
134. Hirschfeld RM, Russell JM, Delgado PL et al. Predictors of response to acute treatment of chronic and double depression with sertraline or imipramine. J Clin Psychiatry 1998;59:669‐75. [PubMed] []
135. Madhoo M, Levine SZ. Initial severity effects on residual symptoms in response and remission: a STAR*D study during and after failed citalopram treatment. J Clin Psychopharmacol 2015;35:450‐3. [PubMed] []
136. Souery D, Oswald P, Massat I et al. Clinical factors associated with treatment resistance in major depressive disorder: results from a European multicenter study. J Clin Psychiatry 2007;68:1062‐70. [PubMed] []
137. Brown C, Schulberg HC, Prigerson HG. Factors associated with symptomatic improvement and recovery from major depression in primary care patients. Gen Hosp Psychiatry 2000;22:242‐50. [PubMed] []
138. Enns MW, Cox BJ. Psychosocial and clinical predictors of symptom persistence vs. remission in major depressive disorder. Can J Psychiatry 2005;50:769‐77. [PubMed] []
139. Sargeant JK, Bruce ML, Florio LP et al. Factors associated with 1‐year outcome of major depression in the community. Arch Gen Psychiatry 1990;47:519‐26. [PubMed] []
140. Turner EH, Matthews AM, Linardatos E et al. Selective publication of antidepressant trials and its influence on apparent efficacy. N Engl J Med 2008;358:252‐60. [PubMed] []
141. Rabinowitz J, Werbeloff N, Mandel FS et al. Initial depression severity and response to antidepressants vs. placebo: patient‐level data analysis from 34 randomised controlled trials. Br J Psychiatry 2016;209:427‐8. [PubMed] []
142. Elkin I, Shea M, Watkins J et al. NIMH treatment of depression collaborative research program: general effectiveness of treatments. Arch Gen Psychiatry 1989;46:971‐82. [PubMed] []
143. Cuijpers P, van Straten A, van Oppen P et al. Are psychological and pharmacologic interventions equally effective in the treatment of adult depressive disorders? A meta‐analysis of comparative studies. J Clin Psychiatry 2008;69:1675‐85. [PubMed] []
144. Weitz ES, Hollon SD, Twisk J et al. Baseline depression severity as moderator of depression outcomes between cognitive behavioral therapy vs. pharmacotherapy: an individual patient data meta‐analysis. JAMA Psychiatry 2015;72:1102‐9. [PubMed] []
145. Links PS, Eynan R. The relationship between personality disorders and Axis I psychopathology: deconstructing comorbidity. Annu Rev Clin Psychol 2013;9:529‐54. [PubMed] []
146. Lenzenweger MF, Lane MC, Loranger AW et al. DSM‐IV personality disorders in the National Comorbidity Survey Replication. Biol Psychiatry 2007;62:553‐64. [PMC free article] [PubMed] []
147. Bateman AW, Fonagy P. The effectiveness of partial hospitalization in the treatment of borderline personality disorder – a randomised controlled trial. Am J Psychiatry 1999;156:1563‐9. [PubMed] []
148. National Collaborating Centre for Mental Health . Borderline personality disorder: recognition and management. London: Department of Health, 2009. []
149. Blum N, St John D, Pfohl B et al. Systems Training for Emotional Predictability and Problem Solving (STEPPS) for outpatients with borderline personality disorder: a randomized controlled trial and 1‐year follow‐up. Am J Psychiatry 2008;165:468‐78. [PMC free article] [PubMed] []
150. Zanarini MC, Yong L, Frankenburg FR et al. Severity of childhood sexual abuse and its relationship to severity of borderline psychopathology and psychosocial impairment among borderline inpatients. J Nerv Ment Dis 2002;190:381‐7. [PubMed] []
151. Giesen‐Bloo JH, Wachters LM, Schouten E et al. The borderline personality disorder severity index‐IV: psychometric evaluation and dimensional structure. Pers Ind Diff 2010;49:136‐41. []
152. Yang M, Coid J, Tyrer P. Personality pathology recorded by severity: national survey. Br J Psychiatry 2010;197:193‐9. [PubMed] []
153. Tyrer P, Alexander MS, Cicchetti D et al. Reliability of a schedule for rating personality disorders. Br J Psychiatry 1979;135:168‐74. [PubMed] []
154. Dolan B, Evans C, Norton K. Multiple axis‐II diagnoses of personality disorder. Br J Psychiatry 1995;166:107‐12. [PubMed] []
155. Oldham JM, Skodol AE, Kellman HD et al. Diagnosis of DSM‐III‐R personality disorders by two structured interviews: patterns of comorbidity. Am J Psychiatry 1992;149:213‐20. [PubMed] []
156. Zimmerman M, Galione JN, Chelminski I et al. Does the diagnosis of multiple Axis II disorders have clinical significance? Ann Clin Psychiatry 2012;24:195‐201. [PubMed] []
157. Tyrer P, Johnson T. Establishing the severity of personality disorder. Am J Psychiatry 1996;153:1593‐7. [PubMed] []
158. Asnaani A, Chelminski I, Young D et al. Heterogeneity of borderline personality disorder: do the number of criteria met make a difference? J Pers Disord 2007;21:615‐25. [PubMed] []
159. Kernberg OF, Caligor E. A psychoanalytic theory of personality disorders In: Clarkin JF, editor; , Lenzenweger MF, editor. (eds). Major theories of personality disorder. New York: Guilford, 2005:114‐56. []
160. Paris J. Dimensional diagnosis and the DSM‐5. J Clin Psychiatry 2005;72:1340. [PubMed] []
161. Moran P. Dangerous severe personality disorder – bad tidings from the UK. Int J Soc Psychiatry 2001;48:6‐10. [PubMed] []
162. Maden A, Tyrer P. Dangerous and severe personality disorders: a new personality concept from the United Kingdom. J Pers Disord 2003;17:489‐96. [PubMed] []
163. Tyrer P, Cooper S, Rutter D et al. The assessment of dangerous and severe personality disorder: lessons from a randomised controlled trial linked to qualitative analysis. Forensic Psychol Psychiatry 2009;20:132‐46. []
164. Berghuis H, Kamphuis JH, Verheul R. Specific personality traits and general personality dysfunction as predictors of the presence and severity of personality disorders in a clinical sample. J Pers Assess 2014;96:410‐6. [PubMed] []
165. Crawford MJ, Koldobsky N, Mulder R et al. Classifying personality disorder according to severity. J Pers Disord 2011;25:321‐30. [PubMed] []
166. Livesley WJ. Practical management of personality disorders. New York: Guilford, 2003. []
167. Widiger TA, Costa PT, McCrae RR. A proposal for Axis II: diagnosing personality disorders using the five factor model In: Costa PT, editor; , Widiger TA, editor. (eds). Personality disorders and the five factor model of personality. Washington: American Psychological Association, 2002:431‐52. []
168. Widiger TA, Mullins‐Sweatt SN. Five‐factor model of personality disorder: a proposal for DSM‐V. Annu Rev Clin Psychol 2009;5:197‐220. [PubMed] []
169. Widiger TA, Trull TJ. Plate tectonics in the classification of personality disorder: shifting to a dimensional model. Am Psychol 2007;62:71‐83. [PubMed] []
170. Livesley W, Schroeder M, Jackson D et al. Categorical distinctions in the study of personality disorder: implications for classification. J Abnorm Psychol 1994;103:6‐17. [PubMed] []
171. Parker G, Hadzi‐Pavlovic D, Both L et al. Measuring disordered personality functioning: to love and to work reprised. Acta Psychiatr Scand 2004;110:230‐9. [PubMed] []
172. Trull TJ. Dimensional models of personality disorder: coverage and cutoffs. J Pers Disord 2005;19:262‐82. [PubMed] []
173. Verheul R, Andrea H, Berghout CC et al. Severity Indices of Personality Problems (SIPP‐118): development, factor structure, reliability, and validity. Psychol Assess 2008;20:23‐34. [PubMed] []
174. Livesley WJ. Suggestions for a framework for an empirically based classification of personality disorder. Can J Psychiatry 1998;43:137‐47. [PubMed] []
175. Ro E, Clark LA. Interrelations between psychosocial functioning and adaptive‐ and maladaptive‐range personality traits. J Abnorm Psychol 2013;122:822‐35. [PubMed] []
176. Krueger RF, Derringer J, Markon KE et al. Initial construction of a maladaptive personality trait model and inventory for DSM‐5. Psychol Med 2012;42:1879‐90. [PMC free article] [PubMed] []
177. Gunderson J, Zanarini MC. Commentary: deceptively simple – or radical shift? Pers Ment Health 2011;5:260‐2. []
178. Kelly BD, Nur UA, Tyrer P et al. Impact of severity of personality disorder on the outcome of depression. Eur Psychiatry 2009;24:322‐6. [PubMed] []
179. Livesley WJ. General Assessment of Personality Disorder (GAPD). Vancouver: University of British Columbia, 2006. []
180. Berghuis H, Kamphuis JH, Verheul R. Core features of personality disorder: differentiating general personality dysfunctioning from personality traits. J Pers Disord 2012;26:704‐16. [PubMed] []
181. Berghuis H, Kamphuis JH, Verheul R et al. The General Assessment of Personality Disorder (GAPD) as an instrument for assessing the core features of personality disorders. Clin Psychol Psychother 2013;20:544‐57. [PubMed] []
182. Arntz A, van den Hoorn M, Cornelis J et al. Reliability and validity of the borderline personality disorder severity index. J Pers Disord 2003;17:45‐59. [PubMed] []
183. Clarkin JF, Levy KN, Lenzenweger MF et al. Evaluating three treatments for borderline personality disorder: a multiwave study. Am J Psychiatry 2007;164:922‐8. [PubMed] []
184. Linehan MM, Comtois KA, Murray AM et al. Two‐year randomized controlled trial and follow‐up of dialectical behavior therapy vs. therapy by experts for suicidal behaviors and borderline personality disorder. Arch Gen Psychiatry 2006;63:757‐66. [PubMed] []
185. Zanarini MC, Vujanovic AA, Parachini EA et al. Zanarini Rating Scale for Borderline Personality Disorder (ZAN‐BPD): a continuous measure of DSM‐IV borderline psychopathology. J Pers Disord 2003;17:233‐42. [PubMed] []
186. Patton JH, Stanford MS, Barratt ES. Factor structure of the Barratt Impulsiveness Scale. J Clin Psychol 1995;51:768‐74. [PubMed] []
187. Doering S, Horz S, Rentrop M et al. Transference‐focused psychotherapy v. treatment by community psychotherapists for borderline personality disorder: randomised controlled trial. Br J Psychiatry 2010;196:389‐95. [PubMed] []
188. McMain SF, Links PS, Gnam WH et al. A randomized trial of dialectical behavior therapy versus general psychiatric management for borderline personality disorder. Am J Psychiatry 2009;166:1365‐74. [PubMed] []
189. Diamond D, Yeomans FE, Stern B et al. Transference focused psychotherapy for patients with comorbid narcissistic and borderline personality disorder. Psychoanal Inq 2013;33:527‐51. []
190. Messina NP, Wish ED, Hoffman JA et al. Antisocial personality disorder and treatment outcomes. Am J Drug Alcohol Abuse 2002;28:197‐212. [PubMed] []
191. Kotov R, Krueger RF, Watson D et al. The Hierarchical Taxonomy of Psychopathology (HiTOP): a dimensional alternative to traditional nosologies. J Abnorm Psychol 2017;126:454‐77. [PubMed] []
192. Stanton K, Rozek DC, Stasik‐O’Brien SM et al. A transdiagnostic approach to examining the incremental predictive power of emotion regulation and basic personality dimensions. J Abnorm Psychol 2016;125:960‐75. [PubMed] []
193. Zimmerman M. A review of 20 years of research on overdiagnosis and underdiagnosis in the Rhode Island Methods to Improve Diagnostic Assessment and Services (MIDAS) project. Can J Psychiatry 2016;61:71‐9. [PMC free article] [PubMed] []
194. Krueger RF, Finger MS. Using item response theory to understand comorbidity among anxiety and unipolar mood disorders. Psychol Assess 2001;13:140‐51. [PubMed] []
195. Eaton NR, Krueger RF, Markon KE et al. The structure and predictive validity of the internalizing disorders. J Abnorm Psychol 2013;122:86‐92. [PMC free article] [PubMed] []
196. Krueger RF, Markon KE, Patrick CJ et al. Externalizing psychopathology in adulthood: a dimensional‐spectrum conceptualization and its implications for DSM‐V. J Abnorm Psychol 2005;114:537‐50. [PMC free article] [PubMed] []
197. Carragher N, Krueger RF, Eaton NR et al. ADHD and the externalizing spectrum: direct comparison of categorical, continuous, and hybrid models of liability in a nationally representative sample. Soc Psychiatry Psychiatr Epidemiol 2014;49:1307‐17. [PMC free article] [PubMed] []
198. Kotov R, Chang SW, Fochtmann LJ et al. Schizophrenia in the internalizing‐externalizing framework: a third dimension? Schizophr Bull 2011;37:1168‐78. [PMC free article] [PubMed] []
199. Kotov R, Krueger RF, Watson D. A paradigm shift in psychiatric classification: the Hierarchical Taxonomy Of Psychopathology (HiTOP). World Psychiatry 2018;17:24‐5. [PMC free article] [PubMed] []
200. Gaebel W, Zaske H, Baumann AE. The relationship between mental illness severity and stigma. Acta Psychiat Scand 2006;113(Suppl. 429):41‐5. [PubMed] []
201. Grootenboer EM, Giltay EJ, van der Lem R et al. Reliability and validity of the Global Assessment of Functioning Scale in clinical outpatients with depressive disorders. J Eval Clin Pract 2012;18:502‐7. [PubMed] []
202. Bech P, Boyer P, Germain JM et al. HAM‐D17 and HAM‐D6 sensitivity to change in relation to desvenlafaxine dose and baseline depression severity in major depressive disorder. Pharmacopsychiatry 2010;43:271‐6. [PubMed] []

Articles from World Psychiatry are provided here courtesy of The World Psychiatric Association

Comments are closed.