Scholarly article on topic 'Multidimensional endotyping in patients with severe asthma reveals inflammatory heterogeneity in matrix metalloproteinases and chitinase 3–like protein 1'

Multidimensional endotyping in patients with severe asthma reveals inflammatory heterogeneity in matrix metalloproteinases and chitinase 3–like protein 1 Academic research paper on "Clinical medicine"

CC BY
0
0
Share paper
OECD Field of science
Keywords
{Asthma / cytokines / eosinophils / neutrophils / phenotype / endotype / heterogeneity / "matrix metalloproteinase" / "chitinase 3–like protein 1" / "topological data analysis"}

Abstract of research paper on Clinical medicine, author of scientific article — Timothy S.C. Hinks, Tom Brown, Laurie C.K. Lau, Hitasha Rupani, Clair Barber, et al.

Background Disease heterogeneity in patients with severe asthma and its relationship to inflammatory mechanisms remain poorly understood. Objective We aimed to identify and replicate clinicopathologic endotypes based on analysis of blood and sputum parameters in asthmatic patients. Methods One hundred ninety-four asthmatic patients and 21 control subjects recruited from 2 separate centers underwent detailed clinical assessment, sputum induction, and phlebotomy. One hundred three clinical, physiologic, and inflammatory parameters were analyzed by using topological data analysis and Bayesian network analysis. Results Severe asthma was associated with anxiety and depression, obesity, sinonasal symptoms, decreased quality of life, and inflammatory changes, including increased sputum chitinase 3–like protein 1 (YKL-40) and matrix metalloproteinase (MMP) 1, 3, 8, and 12 levels. Topological data analysis identified 6 clinicopathobiologic clusters replicated in both geographic cohorts: young, mild paucigranulocytic; older, sinonasal disease; obese, high MMP levels; steroid resistant TH2 mediated, eosinophilic; mixed granulocytic with severe obstruction; and neutrophilic, low periostin levels, severe obstruction. Sputum IL-5 levels were increased in patients with severe particularly eosinophilic forms, whereas IL-13 was suppressed and IL-17 levels did not differ between clusters. Bayesian network analysis separated clinical features from intricately connected inflammatory pathways. YKL-40 levels strongly correlated with neutrophilic asthma and levels of myeloperoxidase, IL-8, IL-6, and IL-6 soluble receptor. MMP1, MMP3, MMP8, and MMP12 levels were associated with severe asthma and were correlated positively with sputum IL-5 levels but negatively with IL-13 levels. Conclusion In 2 distinct cohorts we have identified and replicated 6 clinicopathobiologic clusters based on blood and induced sputum measures. Our data underline a disconnect between clinical features and underlying inflammation, suggest IL-5 production is relatively steroid insensitive, and highlight the expression of YKL-40 in patients with neutrophilic inflammation and the expression of MMPs in patients with severe asthma.

Academic research paper on topic "Multidimensional endotyping in patients with severe asthma reveals inflammatory heterogeneity in matrix metalloproteinases and chitinase 3–like protein 1"

Multidimensional endotyping in patients with severe asthma reveals inflammatory heterogeneity in matrix metalloproteinases and chitinase 3-like protein 1

Timothy S. C. Hinks, MD,a b Tom Brown, MD,c Laurie C. K. Lau, PhD,a b Hitasha Rupani, MD,a b Clair Barber, BSc,b

Scott Elliott, BSc,c Jon A. Ward, BSc,a,b Junya Ono, BSc,d Shoichiro Ohta, MD,e Kenji Izuhara, MD,f

Ratko Djukanovic, MD,a,b Ramesh J. Kurukulaaratchy, MD,g Anoop Chauhan, MD,c and Peter H. Howarth, MDa,b

Southampton and Portsmouth, United Kingdom, and Kanagawa and Saga, Japan

Background: Disease heterogeneity in patients with severe asthma and its relationship to inflammatory mechanisms remain poorly understood.

Objective: We aimed to identify and replicate clinicopathologic endotypes based on analysis of blood and sputum parameters in asthmatic patients.

Methods: One hundred ninety-four asthmatic patients and 21 control subjects recruited from 2 separate centers underwent detailed clinical assessment, sputum induction, and phlebotomy. One hundred three clinical, physiologic, and inflammatory parameters were analyzed by using topological data analysis and Bayesian network analysis.

From aClinical and Experimental Sciences, University of Southampton Faculty of Medicine, Sir Henry Wellcome Laboratories, Southampton University Hospital; bthe NIHR Southampton Respiratory Biomedical Research Unit, Southampton University Hospital; cPortsmouth Hospitals NHS Trust; dShino-Test Corporation, Kanagawa; ethe Department of Laboratory Medicine and fthe Department of Biomolecular Sciences, Saga Medical School; and gthe Department of Respiratory Medicine, Southampton General Hospital.

Supported by the Medical Research Council (G0800649). T.S.C.H. was supported by a Wellcome Trust Clinical Research Fellowship (088365/z/09/z) and by the Academy of Medical Sciences. Infrastructure support was funded by the National Institute for Health Research (NIHR) Southampton Respiratory Biomedical Research Unit. We acknowledge the support of the NIHR through the Primary Care Research Network and through an Academic Clinical Fellowship (to T.S.C.H.).

Disclosure of potential conflict of interest: T. S. C. Hinks has received research support from Wellcome Trust. T. Brown has received speakers' fees from Chiesi and Novartis and has received travel support from Chiesi. S. Elliott and A. Chauhan have received research support from the Medical Research Council (MRC) UK. K. Izuhara has received research support from Shino-Test and has received consultancy fees from Chugai Pharmaceutical Co Ltd and AQUA Therapetics. R. Djukanovic has received research support through a personal Clinical Training Fellowship from the Wellcome Trust and the IMI-funded EU project UBIOPRED and an MRC-funded project on COPD: COPD-MAP, has received consultancy fees from Teva Pharmaceuticals, has received lecture fees from Novartis, has received travel support from Boehringer Ingelheim, and owns stock in Synairgen. R. Kurukulaaratchy has received research support from the MRC (G0800649). P. H. Howarth has received research support from the MRC UK (Wessex Severe Asthma Cohort) and National Institute of Health Research UK (Respiratory Biomedical Research Unit) and is on the advisory boards for Novartis, Roche, Johnson & Johnson, and Aventis. The rest of the authors declare that they have no relevant conflicts of interest.

Received for publication July 11, 2015; revised October 6,2015; accepted for publication November 20, 2015.

Corresponding author: Timothy S. C. Hinks, MD, Clinical and Experimental Sciences, University of Southampton Faculty of Medicine, Sir Henry Wellcome Laboratories, Southampton University Hospital, Southampton SO16 6YD, United Kingdom. E-mail: t.hinks@soton.ac.uk. 0091-6749

© 2015 The Authors. Published by Elsevier Inc. on behalf of the American Academy of Allergy, Asthma & Immunology. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).

http://dx.doi.org/10.1016/jjaci.2015.11.020

Results: Severe asthma was associated with anxiety and depression, obesity, sinonasal symptoms, decreased quality of life, and inflammatory changes, including increased sputum chitinase 3-like protein 1 (YKL-40) and matrix metalloproteinase (MMP) 1, 3, 8, and 12 levels. Topological data analysis identified 6 clinicopathobiologic clusters replicated in both geographic cohorts: young, mild paucigranulocytic; older, sinonasal disease; obese, high MMP levels; steroid resistant TH2 mediated, eosinophilic; mixed granulocytic with severe obstruction; and neutrophilic, low periostin levels, severe obstruction. Sputum IL-5 levels were increased in patients with severe particularly eosinophilic forms, whereas IL-13 was suppressed and IL-17 levels did not differ between clusters. Bayesian network analysis separated clinical features from intricately connected inflammatory pathways. YKL-40 levels strongly correlated with neutrophilic asthma and levels of myeloperoxidase, IL-8, IL-6, and IL-6 soluble receptor. MMP1, MMP3, MMP8, and MMP12 levels were associated with severe asthma and were correlated positively with sputum IL-5 levels but negatively with IL-13 levels.

Conclusion: In 2 distinct cohorts we have identified and replicated 6 clinicopathobiologic clusters based on blood and induced sputum measures. Our data underline a disconnect between clinical features and underlying inflammation, suggest IL-5 production is relatively steroid insensitive, and highlight the expression of YKL-40 in patients with neutrophilic inflammation and the expression of MMPs in patients with severe asthma. (J Allergy Clin Immunol 2016;hh:hh-hh.)

Key words: Asthma, cytokines, eosinophils, neutrophils, phenotype, endotype, heterogeneity, matrix metalloproteinase, chitinase 3-like protein 1, topological data analysis

Asthma is a chronic inflammatory disorder of the airways characterized by variable airflow obstruction and airway remodeling and mediated by a variety of inflammatory mediators and cells, including mast cells, T cells, eosinophils, and neutrophils.1 There is now recognition of considerable disease heterogeneity within the spectrum of clinical asthma, the precise nature of which remains to be defined, and this currently constitutes a significant barrier to research.2 It is postulated that distinct subgroups of asthma exist, which have been termed endotypes, meaning ''a subtype of a condition defined by distinct pathophysiological mechanisms.''3 A better understanding of such endotypes and their relationship to distinct underlying disease mechanisms should enable identification of novel therapeutic

2 HINKSETAL j allergy clin Immunol

■■■ 2016

Abbreviations used

ACQ Asthma Control Questionnaire

ECP Eosinophil cationic protein

Feno Fraction of exhaled nitric oxide

FGF Fibroblast growth factor

GINA Global Initiative for Asthma

HAD Hospital Anxiety and Depression

ICS Inhaled corticosteroid

K-S Kolmogorov-Smirnov

MMP Matrix metalloproteinase

TDA Topological data analysis

YKL-40 Chitinase 3-like protein 1

targets and facilitate the aim of stratified medicine (ie, the efficient targeting of specific therapies to subgroups of subjects likely to benefit most).

To date, several groups have reported cluster analyses of patient cohorts to investigate possible disease endotypes.4-9 However, these are often limited by a lack of robust statistical validation in replication cohorts or have generated clusters the identity of which is dominated by predominantly clinical parameters, such as pulmonary physiology or participants' demographics, without providing significant insight into the underlying pathophysiology. Furthermore, techniques like principle component analysis tend to accentuate separation between clusters, which might in reality represent groupings within a continuum of disease rather than clear-cut entities.9 Recently, we have piloted a new analytic approach to such large clinical data sets by using network analyses that allow truly multidimensional analysis of clusters and provide visual representations of the data that reveal continuities within data sets.10

The aim of this study was to identify and independently replicate distinct multidimensional clinicopathobiologic clusters of severe asthma from the participants in the Wessex Severe Asthma cohort who had induced sputum and peripheral blood biomarker measures, as well as detailed clinical characterization. We aimed to cluster participants using only parameters that could be available to a clinician in tertiary care with access to sputum induction facilities and then to investigate the disease mechanisms of airway inflammation in each of these clusters by using more advanced immunologic assays.

METHODS

Southampton participants (derivation cohort)

The derivation cohort comprised 213 adult participants (18-70 years) enrolled for clinical phenotyping in the Wessex Severe Asthma Cohort, at the NIHR Southampton Respiratory Biomedical Research Unit. Five were excluded because of alternative diagnoses of bronchiectasis (n = 3), interstitial lung disease (n = 1), and gastroesophageal reflux without asthma. One hundred forty-five participants underwent successful sputum induction, with the emphasis on severe asthma (n = 121) and inclusion of 8 healthy nonatopic participants, 9 patients with mild asthma receiving b2-agonists alone, and 7 patients with moderate asthma receiving inhaled corticosteroids (ICSs). Thirty-eight of the 121 patients with severe asthma with persistent symptoms despite high-dose ICSs and other therapy were also receiving daily oral corticosteroids (Table I and see the Methods section and Fig E1 in this article's Online Repository at www.jacionline.org).

Portsmouth participants (validation cohort)

The validation cohort comprised 108 adult participants (18-70 years) enrolled by a separate study team from outpatient clinics at Queen Alexandra Hospital, Portsmouth. Seventy-one participants underwent successful sputum induction : 13 healthy nonatopic participants, 1 patient with mild asthma, 6 patients with moderate asthma, and 50 patients with severe asthma with persistent symptoms despite high-dose ICSs (n = 32) and oral corticosteroids (n = 18, Table II and see Fig E1).

Study procedures

Participants were assessed based on history; examination; questionnaires, including the Asthma Control Questionnaire (ACQ),11 Asthma Quality of Life Questionnaire,12 Hospital Anxiety and Depression (HAD) Scale,13 Sino-Nasal Outcome Test 20, and Short-Form 36 Health Survey ; skin prick tests with common aeroallergens; spirometry with albuterol reversibility; exhaled nitric oxide measurements; the University of Pennsylvania smell identification test16; and serum IgE and urinary cotinine measurements. Sputum samples were obtained by means of hypertonic saline induction and processed as previously described.17 Fifty-five different inflammatory mediators were measured in serum and sputum by using ELISAs or cytokine bead arrays (see the Methods section in this article's Online Repository).

The study was approved by the Southampton and South West Hampshire Research Ethics Committee A (09/H0502/37). All participants provided informed consent.

Statistical analysis

Data were analyzed initially by using topological data analysis (TDA) to define multidimensional clusters in the derivation and validation cohorts separately. Standard statistical methods were then applied to define the features of these clusters. In a separate analysis to define relationships between these parameters, Bayesian network analysis was then applied to all the pathobiologic and clinical features on the highest quality data from both cohorts combined.

Data are expressed as medians with interquartile ranges, unless stated otherwise. Data were logarithmically transformed if they were not normally distributed. For all analyses, 2-tailed P values of less than .05 were considered significant. Data were compared between the healthy and control groups by using Mann-Whitney U or Student t tests and between each asthma severity group and control subjects by using the Kruskal-Wallis test or ANOVA, depending on data distribution. For the latter, an overall 5% significance level was adjusted for multiple comparisons by using the Bonferroni method. Correlations were tested with the Spearman r statistic. Kolmogorov-Smirnov (K-S) tests identified significant differences between distributions within a single cluster. Data were analyzed with Prism 6.0 (GraphPad Software, San Diego, Calif) and SPSS 21.0 (IBM, Armonk, NY) software.

Network analyses (TDA and Bayesian network analysis) were performed, as previously described.10 Networks were generated from all participants with the most complete data (n = 145 for the derivation data set and n = 70 for the validation data set) after missing data (6.1% of data set) were imputed by using the mean of 5 multiple imputations. Subsequent analyses of sputum parameters used only data from the highest quality sputum samples (n = 118 for the derivation data set and n = 55 for the validation data set) and without imputation. Terms used to generate the networks are described in Tables E1 and E2 in this article's Online Repository at www.jacionline.org.

To identify multidimensional features within the data sets, which might not be apparent by using traditional methods, we used TDA. This is particularly suited to complex biological data sets, representing a high-dimensional data set as a structured 3-dimensional network. Each node comprises participants similar to each other in multiple dimensions. Edges connect nodes that contain shared data points. Statistical tests can then be performed on groups or features that emerge from the inherent structure of the data set. This technique provides

TABLE I. Demographics of clusters in the derivation cohort

Healthy

control

Parameters subjects A B C D E F G H

No.* 8 30 7 13 4 13 17 37 19

Demographics

Sex (male/female), 3 (38)/5 (72) 9 (30)/21 (70) 4 (57)/3 (43) 3 (23)/10 (77) 2 (50)/2 (50) 6 (46)/7 (54) 9 (53)/8 (47) 13 (35)/24 (65) 9 (47)/10 (53;

no. (%)

Age (y), median 33.5 (21-53) 38 (22-65) 60 (39-67) 44 (21-57) 54.5 (23-61) 45 (26-62) 57 (29-68) 51 (23-69) 58 (43-71)

(range)

Pulmonary function

FEV1 (% predicted, 89 (84-98) 91 (83-105) 58 (53-64) 80 (72-94) 99 (63-115) 64 (58-83) 40 (31-64) 74 (55-84) 44 (35-59)

pre-BD)|

FEV1 reversibility (%) 0 (0.0-1.8) 6.8 (4.1-9.3) 16 (5.7-23) 5.8 (2.6-11) 4.7 (0.68-11) 8.7 (2.7-20) 12 (8.2-24) 11 (3.3-20) 14 (3.3-25)

FEV1 (% predicted, 92 (84-98) 96 (87-112) 67 (63-76) 85 (70-104) 112 (71-122) 75 (64-87) 52 (30-77) 82 (66-88) 54 (44-66)

post-BD)

Exhaled nitric oxide 13 (11-17) 26 (12-52) 25 (19-45) 11 (9.5-19) 20 (16-35) 33 (11-73) 22 (17-46) 19 (10-29) 17 (10-30)

(ppb, at 50 L/s)

Clinical

Atopy (positive skin 4 (50)/4 (50) 22 (73)/8 (27) 4 (57)/3 (43) 5 (38)/8 (62) 3 (75)/1 (25) 11 (82)/2 (18) 12 (71)/5 (29) 22 (59)/15 (41) 13 (68)/6 (32)

test response,

yes/no), no. (%)

No. of allergens 1 (0-23) 3 (0-5) 3 (0-4) 0 (0-4) 2 (1-4) 2 (1-5) 3 (0-6) 2 (0-4) 2 (0-3)

eliciting positive

skin test responses

Peripheral eosinophil 0.1 (0.1-0.3) 0.2 (0.1-0.4) 0.2 (0.1-0.4) 0.3 (0.1-0.6) 0.2 (0.1-1.3) 0.6 (0.4-0.7) 0.4 (0.1-0.7) 0.2 (0.1-0.2) 0.2 (0.1-0.3)

count (109/L)

Total IgE (IU/mL) 230 (79-280) 120 (30-260) 88 (23-210) 34 (11-110) 130 (37-190) 68 (13-812) 380 (110-1400) 92 (12-260) 130 (28-290)

Body mass index 23.5 31.3 28.0 36.4 (32.4-41.7) 34.6 25.9 (23.3-29.0) 28 (25.8-37.2) 25.5 30.9

(kg/m2) (22.4-25.6) (26.7-35.7) (27.6-35.5) (25.6-37.9) (24.3-29.8) (28.6-36.5)

Smoking status

Never, no. (%) 5(72) 15 (50) 5(71) 4(31) 4 (100) 7(54) 10 (59) 17 (46) 10 (53)

Former, no. 3 (38 [3.3]) 12 (40 [14]) 2 (29 [29]) 7 (54 [16]) 0 (0) 6 (46 [3]) 6 (35 [22]) 14 (38 [16]) 5 (26 [13])

(% [mean

pack years])

Current, no. 0 (0) 3 (10 [13]) 0 (0) 2 (15 [23]) 0 (0) 0 (0) 1 (5.9 [6.5]) 6 (16 [28]) 4 (21 [35])

(% [mean

pack years])

Cluster

Duration of NA 19 (5-31) 30 (15-49) 21 (6-32) 12 (9.3-20) 29 (21-42) 34 (25-47) 29 (18-44) 43 (23-47)

asthma (y)

ACQ7 score NA 1.6 (0.9-2.7) 2.3 (1.7-4.1) 2.7 (2.1-3.7) 2.1 (0.43-2.7) 2.9 (1.7-4.0) 3.1 (2.3-3.9) 3.3 (2.4-3.9) 3.3 (2.5-4.2)

Treatment

Inhaled steroid dose 0 1240 (0-2160) 2400 1440 1640 1600 2000 1640 1600

(equivalent mg (1600-2400) (1220-2080) (400-1860) (800-1840) (1760-2000) (1280-2080) (920-2300)

of BDP)

Maintenance oral 0 (0)/8 (0) 5 (17)/25 (83) 0 (0)/7 (100) 5 (38)/8 (62) 2 (50)/2 (50) 6 (46)/7 (54) 5 (29)/12 (71) 12 (32)/25 (68) 3 (16)/16 (84)

corticosteroids

(yes/no), no. (%)

Inflammatory subtype, no. (%)

Neutrophilic 0 (0) 0 (0) 3 (43) 3 (23) 0 (0) 0 (0) 6(35) 8 (22) 13 (68)

Eosinophilic 1 (13) 6 (20) 3(43) 3 (23) 2(50) 0(0) 4 (24) 10 (27) 1 (5)

Mixed granulocytic 0 (0) 0(0) 1 (14) 1 (8) 0 (0) 9 (69) 4 (24) 3 (8) 2(10)

Paucigranulocytic 7 (87) 24 (80) 0(0) 6 (46) 2(50) 4(31) 3 (18) 16 (43) 3 (16)

Sputum cell differential (%)

Macrophages 70 (58-85) 70 (57-76) 17 (7.7-28) 44 (28-59) 64 (26-68) 41 (25-51) 13 (5.3-48) 36 (26-55) 25 (9.7-29)

Neutrophils 12 (7.7-30) 19 (13-28) 66 (57-84) 52 (40-61) 21 (22-27) 30 (22-41) 65 (47-91) 53 (34-64) 71 (64-88)

Eosinophils 0.75 (0.60-1.3) 1.3 (0.0-2.6) 5.3 (0.3-13) 0.25 (0.0-1.3) 2 (0.13-17) 14 (1.8-43) 2.8 (1.2-8.1) 1 (0.3-6.0) 0.75 (0.06-1.38)

Lymphocytes 0.0 (0.0-0.0) 0.0 0.0 (0.0-0.0) 0.0 (0.0-0.25) 0.0 (0.0-0.06) 0.0 (0.0-0.25) 0 (0.0-0.31) 0.15 (0.0-0.30) 0.0 (0.0-0.19)

(0.0-0.38)

Epithelial 2.4 (1.5-11) 8.0 (2.5-12) 1.3 (0.9-7.0) 4.0 (1.0-6.0) 9.1 (0.38-14) 3.4 (2.2-4.9) 1.3 (0.25-2.4) 2.8 (1.5-9.8) 1.2 (0.31-3.7)

The inflammatory subtype is based on sputum differentials by using the following cut points: neutrophilic, greater than 61%; eosinophilic, greater than 3%. Percentages shown are derived from those subjects with valid data.

ACQ, Asthma Control Questionnaire11; BD, bronchodilator; BDP, beclomethasone dipropionate; CT, computed tomography; FVC, forced vital capacity; G1NA, Global Initiative for Asthma; NA, not available; PEFR, peak expiratory flow rate.

*Because some subjects were outliers, not all are assigned to clusters A through H. fValues are medians with interquartile ranges, unless stated otherwise.

4 HINKSETAL j allergy clin Immunol

■■■ 2016

TABLE II. Demographics of clusters in the validation cohort

Healthy Cluster control

Parameters subjects a b c e f h i

No.* 13 4 9 7 5 19 9 5 Demographics

Sex (male/female), 5 (8)/8 (62) 3 (75)/1 (25) 7 (78)/2 (22) 3 (43)/4 (57) 1 (20)/4 (80) 12 (63)/7 (37) 2 (22)/7 (72) 3 (60)/2 (40) no. (%)

Age (y), median (range) 34 (18-53) 34 (23-51) 61 (29-79) 44 (30-62) 61 (45-71) 51 (29-79) 57 (30-73) 45 (41-50) Pulmonary function

FEV1 (% predicted, 104 (96-108) 103 (95-109) 57 (52-62) 73 (68-78) 50 (49-52) 60 (51-78) 48 (44-69) 75 (75-85) pre-BD)f

FEV1 reversibility (%) 0(0.0-0.0) 2.1 (-0.3-4.7) 8.4(1.2-17) 11(5.3-13) 5.8(2.6-17) 13.3(5.7-15) 7.5(6.2-14) 9.3(5.0-9.4)

FEV1 (% predicted, 104 (96-108) 105 (97-111) 61 (53-73) 78 (73-86) 53 (51-62) 70 (58-83) 53 (50-73) 81 (80-87)

post BD)

Exhaled nitric oxide 14 (11-18) 39 (28-72) 32 (17-68) 26 (17-53) 72 (17-98) 32 (18-64) 27 (16-52) 28 (14-51)

(ppb, at 50 L/s) Clinical

Atopy (positive skin 4 (31)/9 (69) 3 (75)/1 (25) 8 (89)/1 (11) 6 (83)/1 (17) 4 (80)/1 (20) 13 (68)/6 (32) 5 (56)/44 (44) 4 (80)/1 (20) test response, yes/no), no. (%)

No. of allergen eliciting 0 (0-0) 3.5 (2-6) 3 (1-4) 2 (2-4) 2 (1-3) 3 (0-4.5) 1 (0-4) 5 (5-5)

positive skin test

responses

Peripheral eosinophil 0.1 (0.1-0.1) 0.1 (0.1-0.2) 0.5 (0.3-0.5) 0.3 (0.1-0.4) 0.2 (0.1-0.3) 0.5 (0.2-0.9) 0.3 (0.0-0.4) 0.5 (0.5-0.6)

count (109/L)

Total IgE (IU/mL) 21 (8.8-52) 77 (34-130) 116 (69-136) 145 (79-1500) 149 (100-860) 130 (54-170) 73 (32-540) 266 (140-400)

Body mass index (kg/m2) 24.3 (21.9-28.4) 29.4 (26.1-31.5) 26.4 (25.9-29.0) 30.9 (27.8-33.5) 27.3 (26.4-28.4) 29.1 (26.4-32.1) 26.2 (41.0-29.1) 32.1 (27.2-34.2)

Smoking status

Never, no. (%) 11 (85) 1 (25) 2 (22) 4(57) 2 (40) 10 (53) 6(67) 2 (60)

Former, no. 2 (15 [2.5]) 3 (75 [6]) 7 (78 [25]) 3 (43 [19]) 2 (40% [25]) 8 (42 [20]) 3 (33 [17]) 3 (40% [16])

(% [mean pack years])

Current, no. 0 (0) 0 (0) 0(0) 0 (0) 1 (20 [32]) 1 (5.3 [32]) 0 (0) 0 (0)

(% [mean pack years])

Duration of asthma (y) NA 16 (12-22) 26 (14-38) 30 (19-47) 13 (5-16) 33 (18-46) 22 (6-41) 41 (21-41)

ACQ7 score NA 0.76 (0.43-1.2) 3.4 (2.9-4.0) 2.3 (1.9-3.2) 3.6 (3.0-4.1) 2.9 (2.4-3.6) 3.4 (2.7-4.0) 3.1 (3.0-3.3)

Treatment

Inhaled steroid dose 0 3280 (2280-3940) 1600 (1600-2000) 2880 (1840-4440) 2000 (2000-2880) 1600 (1600-2000) 2240 (1270-2850) 2000 (2000-2000)

(equivalent mg of BDP)

Maintenance oral 0 (0)/13 (100) 0 (0)/4 (100) 4 (44)/5 (56) 2 (29)/5 (71) 2 (40)/3 (60) 6 (32)/13 (68) 3 (33)/6 (66) 3 (60)/2 (40)

corticosteroids

(yes/no), no. (%)

Inflammatory subtype, no. (%)

Neutrophilic 0 (0) 0 (0) 1 (11) 2 (29) 0 (0) 6(32) 3 (33) 0 (0)

Eosinophilic 0(0) 0(0) 5(56) 2 (29) 3(60) 9 (47) 2 (22) 2 (40)

Mixed granulocytic 0 (0) 0 (0) 1 (11) 0 (0) 0 (0) 1 (5) 3 (33) 0 (0)

Paucigranulocytic 13 (100) 4 (100) 2 (22) 3(43) 2 (40) 3 (16) 1 (11) 3(60)

Sputum cell differential (%)

Macrophages 82 (69-89) 70 (63-78) 26 (19-54) 34 (25-39) 52 (12-67) 30 (22-51) 24 (17-31) 84 (75-89)

Neutrophils 18 (11-33) 26 (17-34) 50 (42-50) 45 (39-53) 18 (9.9-23) 50 (32-70) 67 (51-76) 16 (3.8-17)

Eosinophils 0.0 (0.0-0.2) 0.0 (0.0-0.13) 11 (3.8-25) 0.69 (0.38-30) 42 (5.4-79) 4.9 (2.0-13) 7.4 (1.8-19) 3.3 (0.29-6.7)

Lymphocytes 0.2 (0.0-0.63) 0.1 (0.05-0.68) 0.94 (0.5-1.2) 0.69 (0.25-1.5) 0.25 (0.0-0.53) 0.63 (0.5-1.3) 1.4 (1.0-1.6) 0.38 (0.19-0.50)

Epithelial 0.1 (0.0-0.3) 1.0 (1.0-1.5) 0.5 (0.22-1.0) 0.69 (0.16-0.94) 0.69 (0.0-2.1) 1.8 (0.38-2.0) 0.5 (0.25-0.76) 0.19 (0.10-1.6)

The inflammatory subtype is based on sputum differentials by using the following cut points: neutrophilic, greater than 61%; eosinophilic, greater than 3%. Percentages given are derived from those subjects with valid data.

ACQ, Asthma Control Questionnaire11; BD, bronchodilator; BDP, beclomethasone dipropionate; CT, computed tomography; FVC, forced vital capacity; G1NA, Global Initiative for

Asthma; NA, not available; PEFR, peak expiratory flow rate.

*Because some subjects were outliers, not all are assigned to clusters a through i.

"Values are medians with interquartile ranges, unless stated otherwise.

a geometric representation of the data,18,19 is independent of prior hypotheses, and detects multidimensional features within the data that become apparent on visualization. As a consequence, topological networks capture interesting structure, even in very small data sets.

TDA was performed, as previously described,10,19 by using Ayasdi Core 1.59 (Ayasdi, Menlo Park, Calif), constructing networks with the 29 parameters listed in Table E1. Variance-normalized Euclidean distance was

used as a distance metric with 2 filter functions: principal and secondary metric singular value decomposition. Resolution was set at 30 and gain at 3 (derivation) or 4 (validation) and selected to provide network structures that permitted identification of subgroups. K-S tests identified parameters that differentiated each cluster from the rest of the structure. Comparisons between multiple clusters used 1-way ANOVA, with post hoc tests with the Bonferroni correction.

Bayesian network analysis

Interconnectivity between clinical and pathobiologic parameters was explored by using Bayesian network analysis (Genie 2.0; Decision Systems Laboratory, University of Pittsburgh, Pittsburgh, Pa). Data were discretized to describe nonlinear correlations into 2 (binary variables) or 4 or 5 (continuous variables) bins. Seventy-four parameters were included in analyses (see Table E2) on the 173 participants (including 17 healthy control subjects) from both cohorts with the highest quality sputum data and without imputation. The strengths of associations found to be significant in this analysis were analyzed by using Spearman correlations.

RESULTS

First, we investigated which of the 103 clinical, physiologic, and pathobiologic parameters measured were associated with severe asthma (Global Initiative for Asthma [GINA] step 4 and 5). Features that differed significantly in K-S tests between patients with severe asthma and healthy subjects in both the derivation data set (n = 145 participants) and the validation data set (n = 70) are presented in Table III. The presence of severe asthma was associated with symptoms of anxiety and depression or nasal dysfunction, decreased quality-of-life scores, obesity, obstructive spirometry, and increased reversibility. Pathobiologic parameters associated with a diagnosis of severe asthma were neutrophilic sputum; an increase in peripheral blood neutrophil counts; serum and sputum chitinase 3-like protein 1 (YKL-40) levels; sputum matrix metalloproteinase (MMP) 1, MMP3, MMP8, and MMP12 levels (P < .0001 each, Fig 1); vascular endothelial growth factor, IL-5, IL-6, IL-8, and IL-6 soluble receptor levels; and a decrease in sputum macrophage counts and levels of tissue inhibitor of metalloproteinases 1, fibroblast growth factor, IL-1 receptor antagonist, and IL-2.

TDA to identify clusters

Next, we applied TDA to the Southampton cohort (derivation) data sets to identify multidimensional clinicopathobiologic clusters. The network was generated by using only 29 clinical, physiologic, and cellular parameters (see Table E1) with the potential to be available to a tertiary care clinician. Subsequent cluster analyses were then performed on data available from all 103 parameters. Eight clusters of asthmatic patients (A-H) were identified, as described in Tables I and IV and Fig 2. Of these, 6 clusters (A-C, E, F, and H) were subsequently replicated when the same analysis was applied to the geographically distinct Portsmouth (validation) cohort (Tables II and IV and see Figs E2-E4 in this article's Online Repository at www.jacionline. org), which also identified a small additional cluster (cluster i) not present in the Southampton cohort. Healthy control subjects formed distinct clusters in both analyses.

Of the 6 clusters replicated in both data sets, cluster A (young, mild, paucigranulocytic) comprises participants with predominantly paucigranulocytic sputum, few symptoms (the lowest ACQ7 scores, 0.8-1.6), and low serum periostin levels who are young (lowest median ages, 34-38 years) and more likely to be at GINA treatment step 2 (low-dose maintenance ICS).

Subjects in cluster B (older, sinonasal disease) have the highest median age, more symptoms of anxiety and depression (highest median HAD score, 12-27), more nasal symptoms (highest Sino-Nasal Outcome Test 20 score), and high levels of serum periostin and sputum MMP3.

Subjects in cluster C (obese, high MMP levels) have the highest body mass index (30.9-36.4 kg/m2); increased sputum MMP1, MMP2, and MMP8 concentrations; and low serum periostin levels.

Subjects in cluster E (steroid-resistant TH2-mediated, eosino-philic) have high serum periostin levels, sputum eosinophilia, sputum IL-5 levels, and fraction of exhaled nitric oxide (FENO) levels despite high-dose ICSs (1600-2000 mg of beclomethasone dipropionate) or oral corticosteroids (40% to 46% of participants).

Subjects in cluster F (mixed granulocytic inflammation with severe obstruction) have both sputum eosinophilia and neutrophilia with lower prebronchodilator FEV1 values and FEV1/forced vital capacity ratios associated with higher sputum periostin and eosinophil cationic protein (ECP) levels and high HAD scores.

Subjects in cluster H (neutrophilic disease with severe obstruction and low periostin levels) have high sputum neutrophil counts with fixed airflow obstruction (low prebronchodilator and postbronchodilator FEV1) associated with very high symptom scores (median ACQ7, 3.3-3.4) and low serum periostin levels.

Of the clusters that were not replicated in both data sets, both clusters D and i were small (n = 4 and 5, respectively) and therefore might represent model overfitting. Lastly, cluster G shared many features with cluster H, comprising a second large cluster of participants with blood and sputum neutrophilia, high symptom scores, and low serum periostin levels. When clusters G and H were compared directly, cluster G had higher prebroncho-dilator and postbronchodilator FEV1, higher sputum macrophage counts, and higher serum periostin levels and were less neutro-philic, with lower sputum neutrophil counts and sputum myeloperoxidase (MPO), MMP8, and MMP9 levels (data not shown). Thus clusters G and H could be considered to represent milder and more severe subgroups, respectively, of neutrophilic asthma with low periostin levels.

Features of specific interest were compared across these TDA clusters. Serum periostin levels were significantly lower in clusters C and H in both the training and validation cohorts (see Fig E4). Although sputum IL-5 concentrations were significantly increased in severe clusters B through H, sputum IL-13 concentrations were significantly decreased in most of the severe clusters B, C, F, and H (see Fig E4). By contrast, no significant differences were observed in sputum IL-17 concentrations between healthy subjects and subjects of any cluster (data not shown).

A qualitative comparison of these clusters and clusters we have previously identified by using similar methodology in a small and distinct cohort, the IL-17 cohort,10 is presented in Table E3 and Fig E5 in this article's Online Repository at www.jacionline. org. Clusters A, E, F, and H showed clear similarities to analogous clusters in the IL-17 cohort, although clusters B and C did not.

Bayesian network analysis of combined data sets

Next, to investigate the interactions between the diverse clinical, physiologic, and pathobiologic parameters in the data sets, we applied Bayesian network analysis to 74 nonredundant parameters in data from 173 participants from both cohorts with the highest quality sputum data (Fig 3). This Bayesian network provides a graphic representation of the probabilistic dependencies among the parameters and arises from the data by using machine learning inferred from the joint probability distributions

TABLE III. Clinical and pathologic features found to be associated with patients with severe asthma compared with healthy subjects

Derivation data set

K-S tests

Feature Healthy subjects Patients with severe asthma K-S score P value

No.* 8 121

Increased in asthmatic patients compared with healthy subjects

Reversibility (%) 0.0 (0.0-1.8) 10 (2.9-21) 0.590 .01

BMIt 23.5 (22.4-25.6) 31.2 (26.7-37.3) 0.566 .02

HAD score 4.0 (1.8-7.8) 15 (10-22) 0.597 .01

HAD Depression score Nasal dysfunction^ 1 (1-3) 0.35 (0.09-0.39) 9 (6-11) 0.48 (0.42-0.72) 0.680 0.558 .01 .02

SN0T-20 12 (1.5-27) 35 (24-48) 0.507 .04

Serum YKL-40 (ng/mL)t 17 (13-22) 83 (55-140) 0.787 <.001

Blood neutrophil count (109/L)" 3.9 (3.2-4.5) 6.1 (4.3-8.4) 0.603 .008

Sputum neutrophil count (%) 12 (7.7-30) 51 (28-68) 0.571 .01

Sputum MMP3§ Sputum MMP12§ 3.7 X 7.0 X 1024 (3.7 X 10241025 (6.3 x 1026- 3.9 X ■1.4 X 1024) 1024) 2.6 X 1.0 X 1022 (1.3 X 10~2 1022 (5.9 X 10~3 5.9 X 1.6 X 1022) 1022) 0.931 0.774 <.001 .002

Sputum MMP8§ Sputum MMP1§ 1.6 X 0.12 (0.049-0.21) 1024 (3.3 x 1025- 3.6 X 1024) 1.1 X 9.6 (2.2-27) 1022 (6.2 X 1023 2.1 X 1022) 0.628 0.627 .02 .02

Sputum VEGFt 230 (220-280) 700 (470-1100) 0.814 .001

Sputum IL-6 soluble receptor (pg/mL)" 41 (8.0-99) 260 (140-430) 0.607 .03

Sputum IL-6 (pg/mL)" 0.0 (0.0-0.0) 50 (17-120) 0.873 <.001

Sputum IL-5 (pg/mL)" 0.0 (0.0-0.0) 0.76 (0.18-6.4) 0.627 .02

Sputum IL-8 (pg/mL)" 190 (100-420) 3300 (1000-8200) 0.676 .01

Sputum YKL-40 (ng/mL)" 3.2 (2.5-8.7) 65 (20-150) 0.647 .02

Decreased in asthmatic patients compared with healthy subjects

FEV1/FVC ratio 80 (77-83) 66 (54-72) 0.752 <.001

FEVt (% predicted, pre-BD) 89 (84-98) 68 (49-84) 0.560 .008

FEV1 (% predicted, post-BD) 92 (84-98) 78 (58-90) 0.608 .02

AQLQ score 7 (7-7) 3.7 (3.0-4.8) 0.936 <.001

SF-36 89 (71-91) 42 (27-61) 0.776 <.001

Sputum macrophage count (%) 70 (58-85) 36 (23-56) 0.646 .001

Sputum TIMP-1 (ng/mL)" 1.7 X 105 (7.2 X 104-2.5 X 106) 1.2 X 104 (5.2 X 103-3.8 X 104) 0.725 .005

Sputum IL-2 (pg/mL)" 1.8 (0.27-2.7) 0.0 (0.0-0.0) 0.627 .02

Sputum IL-1RA (pg/mL)" 2.6 X 104 (2.4 X 104-2.8 X 104) 2.7 X 103 (0.0-1.2 X 104) 0.941 <.001

Sputum FGF (pg/mL) 53 (43-62) 0.0 (0.0-0.60) 0.941 <.001

These features were found to differ significantly between healthy subjects and patients with severe asthma (British Thoracic Society steps 4 and 5) in both training and validation data sets.

AQLQ, Juniper Asthma Quality of Life Questionnaire; BD, bronchodilator; BDP, beclomethasone dipropionate equivalent; BMI, body mass index; FGF, fibroblast growth factor; FVC, forced vital capacity; GINA, Global Initiative for Asthma; IL-1RA, IL-1 receptor antagonist; MPO, myeloperoxidase; SF-36, Short-Form 36 Health Survey; SNOT-20, Sino-Nasal Outcome Test 20; TIMP-1, tissue inhibitor of metalloproteinases 1; VEGF, vascular endothelial growth factor. *Values are medians with interquartile ranges, unless stated otherwise. " Statistical tests were performed on transformed data.

J''Nasal dysfunction'' is a composite average score on a scale of 0 to 1 derived from SN0T-20 scores, hyposmia, and rhinosinusitis. §MMPs are expressed as a ratio to tissue inhibitor of metalloproteinases values; statistical tests were performed on transformed data.

of the data. In the figure the breadth of each line represents the strength of the interaction (Euclidean distance). Forty-one of the parameters were included in the model by the analysis, whereas 33 parameters without strong interactions were excluded from the model, including sex, Feno values, reversibility, peripheral blood counts, and serum periostin levels (see Table E2). Within the network, strong associations were observed between clinical parameters, and separately, strong associations were observed between pathobiologic parameters. However, a prominent feature of the network is a lack of associations between pathobiologic parameters and clinical parameters, with the exceptions of fibroblast growth factor, which is strongly

negatively correlated with GINA treatment step, and atopic status, which is positively associated with sputum IL-2 levels.

Sputum YKL-40 levels are highly connected within the network, particularly with levels of sputum MPO (Spearman rs = 0.884, P < .0001), IL-8 (rs = 0.837, P < .0001), and sputum IL-6 soluble receptor (rs = 0.758, P < .0001; Fig 4, A, B, and E). Sputum YKL-40 levels correlated moderately with sputum neutrophil counts (rs = 0.484, P < .0001; Fig 4, G) and were increased more in patients with neutrophilic versus eosinophilic disease (see Fig E6, A, in this article's Online Repository at www.jacionline.org). Sputum YKL-40 levels negatively correlated with lung function, particularly postbronchodilator

j allergy clin immunol volume ■■■, number ■■ HINKS ETAL 7

TABLE III. (Continued)

Validation data set

K-S tests

Healthy subjects Patients with severe asthma K-S score P value

0.0 (0.0-0.0) 8.8 (3.3-14) 0.841 <.001

24.3 (21.9-28.4) 29.0 (26.0-32.2) 0.452 .03

3.0 (0.0-5.0) 16 (9.0-25) 0.637 <.001

2 (0-2) 8.5 (5.3-12) 0.734 <.001

0.0 (0.0-0.43) 0.39 (0.13-0.53) 0.708 <.001

0.0 (0.0-3.0) 37 (20-53) 0.739 <.001

27 (19-37) 110 (67-160) 0.739 <.001

2.9 (2.7-4.0) 5.6 (4.4-8.5) 0.545 .004

18 (11-33) 45 (29-67) 0.732 <.001

1.6 X 1024 (9.9 X 10 5-2.3 X 1024) 3.2 X 1022 (1.5 X 1022-5.9 X 10~2) 0.902 <.001

3.7 X 10"5 (1.2 X 1025-5.5 X 1025) 1.0 X 1022 (5.1 X 1023-1.9 X 1022) 0.878 <.001

0.04 (0.030-0.077) 21 (5.4-28) 0.854 <.001

5.7 X 10"5 (3.6 X 1025-1.5 X 1024) 1.2 X 1022 (6.7 X 1023-2.0 X 1022) 0.732 <.001

580 (520-710) 1000 (620-1300) 0.519 .02

130 (100-215) 480 (260-840) 0.623 .002

10 (2.3-19) 55 (15-170) 0.567 .007

0.0 (0.0-0.080) 3.1 (0.72-9.8) 0.714 <.001

620 (380-880) 4000 (2100-7100) 0.756 <.001

21 (14-31) 150 (48-270) 0.738 <.001

85 (84-88) 66 (58-74) 0.785 <.001

100 (96-108) 60 (49-75) 0.918 <.001

100 (96-110) 68 (52-81) 0.857 <.001

7.0 (7.0-7.0) 4.1 (3.1-4.9) 0.959 <.001

89 (85-92) 46 (33-65) 0.841 <.001

82 (69-89) 30 (19-52) 0.837 <.001

4.7 X 105 (2.3 X 105-1.5 X 106) 1.7 X 104 (6.3 X 103-3.8 X 104) 0.860 <.001

0.95 (0.0-3.1) 0.0 (0.0-0.0) 0.612 .003

2.6 X 104 (2.2 X 104-3.5 X 104) 1.4 X 104 (1.1 X 104-1.8 X 104) .. - --------■x . 0.745 <.001

48 (44-57) 1.2 (2.5 X 10~3-1.2) 0.860 <.001

FEV1 (rs = -0.270, P = .0004, data not shown). Sputum and serum YKL-40 values are only moderately correlated (rs = 0.434, P < .0001; Fig 4, H), which has implications for their utility as a serum biomarker.

Levels of sputum MPO and sputum elastase, markers of neutrophilic airways inflammation, are highly connected within the network (Fig 3 and see Fig E7 in this article's Online Repository at www.jacionline.org). MMP12 levels are also highly connected, being positively associated with MMP1, MMP3, MMP8, and MMP13 levels (Fig 3 and see Fig E8 in this article's Online Repository at www.jacionline.org). Because MMP8 can induce the decoy receptor IL-13 receptor a2,20 a negative

association between MMP concentrations and free IL-13 might be expected. Indeed, concentrations of these MMPs correlate negatively with sputum IL-13 levels (rs = —0.371 to —0.452, see Fig E9 in this article's Online Repository at www. jacionline.org) but positively with sputum IL-5 levels (rs = 0.481 to 0.559, P <.0001; seeFigE10 in this article's Online Repository at www.jacionline.org). MMP/tissue inhibitor of metalloproteinases 1 ratios are also associated with body mass index, particularly MMP1, MMP3, and MMP12 (rs = 0.373, 0.303, and 0.311, respectively; P < .0001 each; data not shown).

Although sputum IL-5 levels are strongly associated with sputum eosinophilia (rs = 0.572, P < .0001) and sputum ECP

FIG 1. Protease/antiprotease balance in asthmatic patients. MMP/tissue inhibitor of metalloproteinases (TIMP-1) ratios in sputum in asthmatic patients compared with healthy subjects for MMP1 (A), MMP3 (B), MMP8 (C), and MMP12 (D) are shown. Horizontal lines show medians. Statistical comparisons were done with Student ttests on log-transformed data.

levels (rs = 0.604, P < .0001, see Figs E6 and E11 in this article's Online Repository at www.jacionline.org), sputum IL-13 levels are not highly correlated with sputum IL-5 levels (P = .6) and, conversely, are suppressed in both patients with neutrophilic and those with eosinophilic asthma (see Fig E6, D), suggesting that IL-13 production, which is strongly associated with GINA treatment step, is more steroid responsive than IL-5. These differences between associations of sputum ECP, IL-5, and IL-13 levels are further explored in Figs E11-E15 in this article's Online Repository at www.jacionline.org, which reveal that IL-5 is associated with a wider range of inflammatory markers than IL-13, including makers of neutrophilic inflammation (IL-8 and YKL-40). Thus disparities between IL-5 and IL-13 can have additional causes: given the association of IL-5 with ECP, which can be produced by both neutrophils and eosinophils,21 and that the eosinophil activation product major basic protein can increase eosinophil IL-8 production,22 the biology is complex, and IL-5 seems to have a broader effect than IL-13.

Fig E16 in this article's Online Repository at www.jacionline. org presents 12 key sputum parameters stratified according to GINA treatment group. For each of these parameters, no significant differences were observed between distributions for GINA step 4 compared with step 5 (receiving an additional median 10 mg/d oral prednisone), which would argue against our main observations being attributable wholly to therapeutic corticosteroids.

DISCUSSION

We have previously demonstrated the potential utility of topological and Bayesian analytic techniques to analyze high-dimensional flow cytometric data from a bronchoscopy study in a small asthma cohort in which we identified distinct multidimensional clinicopathologic clusters.10 Here we apply the same analytic approach to 2 much larger severe asthma cohorts, clustering patients by using only real-world assays already accessible to clinicians in tertiary referral centers for severe asthma. This provides endotypes that both relate to the underlying biology and are clinically meaningful.

In addition to the large study size and the statistically unbiased approach, a major strength of this study is the use of 2 geographically distinct cohorts. This provides true external validation of the derivation cohort model in contrast to studies that simply use a random split of a single data set, which provides only a weak and inefficient form of internal validation.23 The focus of our analyses is on differences within the spectrum of asthma. Although the number of healthy control subjects included is only modest, this has little effect on the identity of these endotypes, which are defined by comparison with all other subjects in the study. Although a larger sample size would enable a more detailed description of the clusters, the sensitivity of TDA to detect structure in small data sets and the external validation provide statistical confidence in the features described. A further strength of this study is the breadth of additional data on serum and airway inflammatory mediators also available for

TABLE IV. Definitions of clusters in the derivation and validation data sets

Derivation data set Validation data set

Features K-S tests Features K-S tests

Cluster of cluster K-S score P value Comments Cluster of cluster K-S score P value Comments

A Young, mild, a

paucigranulocytic

Lower serum 0.365 .003 More likely GINA Lower serum 0.758 .03 More likely

periostin level step 2 (P < .001) periostin GINA step 2

level (P = .001)

Predominantly Predominantly

paucigranulocytic paucigranu-

sputum (P < .001) locytic

sputum

(P = .04)

Lowest median ACQ7 score (0.8)

Youngest median age

(34 y)

Lowest median ACQ7 score (1.6)

Youngest median

age (38 y)

B Older, sinonasal b

disease

Higher serum 0.746 .001 Oldest median Higher serum 0.709 <.001

periostin level age (60 y) periostin level

Higher sputum 0.610 .03 Highest median Higher sputum 0.562 .04

MMP3 level HAD score (18) MMP3 level

Joint oldest median age

(61 y) Highest median HAD score (27)

Higher SNOT-20 score 0.533 .04 Higher SNOT-20 score 0.803 <.001

C Obese, high MMP level c

Higher sputum MMP1 level 0.498 .02 Highest BMI (36.4) Higher sputum MMP levels 0.735 .006

Higher sputum MMP8 level 0.481 .03

Higher sputum MMP2 level 0.474 .03

Lower serum periostin level 0.802 <.001 Lower serum periostin level 0.746 .002

This group was not replicated in the validation set

(30.9) after group i

Higher serum 0.780 .02

periostin level

Higher HAD Depression

E Steroid resistant TH2 mediated, eosinophilic e

Higher serum 0.811 <.001 More likely Higher serum 0.862 .002 Predominantly

periostin level eosinophilic sputum class (P = .03) periostin level (60%) eosinophilic sputum class

Higher 0.524 .008 Higher 0.981 .05

eosinophilia eosinophilia

Higher sputum 0.503 .01 Highest median Feno Higher sputum 1.00 .04 Highest median

IL-5 level value (33 ppb) IL-5 level Feno value (72 ppb)

TABLE IV. (Continued)

Derivation data set

Cluster

Features of cluster

K-S tests

Validation data set

K-S score P value

Comments

Cluster

Features of cluster

K-S tests

K-S score P value

Comments

Youngest median age of onset

(13 y)

F Mixed granulocytic f

inflammation with

severe obstruction

Higher serum 0.781 <.001 Higher serum 0.569 <.001

periostin level periostin level

Higher sputum 0.503 .005 Higher sputum 0.413 .04

ECP level ECP level

Higher sputum 0.465 .003 Higher sputum 0.373 .04

neutrophil neutrophil

count count

Higher sputum 0.391 .0197 Higher sputum 0.438 .01

eosinophil eosinophil

count count

Higher HAD 0.367 .03 Higher HAD 0.379 .04

Depression Depression

score score

Youngest median age of onset (4 y)

Lower FEV1 0.495 .001 Lower FEV1 0.379 .04

(% predicted, (% predicted,

pre-BD) pre-BD)

Lower FEV1/ 0.479 .002 Lower FEV1/ 0.463 .005

FVC ratio FVC ratio

Lower sputum 0.409 .01 Lower sputum 0.438 .01

macrophage macrophage

counts counts

G This group was not replicated in the validation set

Higher 0.304 .05

neutrophilia

Higher sputum 0.300 .05

osteopontin

Higher blood 0.292 .02

neutrophil

Higher ACQ7 0.285 .02

Lower serum

periostin level Lower sputum MMP9 level Lower sputum

a2M level Lower sputum

FGF level Lower SF-36 score

0.426 <.001 0.378 .006 0.350 .01 0.303 .05 0.344 .003

Lower AQLQ 0.289 .02

H Neutrophilic disease

with severe obstruction, low periostin level

TABLE IV. (Continued)

Derivation data set Validation data set

Features K-S tests Features K-S tests

Cluster of cluster K-S score P value Comments Cluster of cluster K-S score P value Comments

Higher sputum neutrophil count 0.667 <.001 More likely neutrophilic sputum class (P < .0001) Higher sputum neutrophil count 0.639 .003 More likely neutrophilic sputum class (P = .005)

Lower FEV! 0.569 <.001 Lower FEV! 0.490 .04

(% predicted, (% predicted,

pre-BD) pre-BD)

Lower FEV1 0.562 <.001 Very low FEV1 Lower FEV1 0.519 .03 Lowest FEV1

(% predicted, (44% of predicted (% predicted, (48% of

post-BD) value, pre-BD) post-BD) predicted value, pre-BD)

Lower periostin 0.495 <.001 Highest median Lower periostin 0.639 .003 High median

level ACQ score (3.3) level ACQ score (3.4)

i This group was

not replicated

in the validation

Higher SNOT-20 0.677 .03

Higher sputum 0.631 .05

macrophage

counts

Lower AQLQ 0.677 .03

emotional

Lower periostin 0.646 .04

Features of clusters were identified in the training and validation sets by using TDA. Features are listed that differ significantly in the cluster when compared with all other subjects in the same cohort.

a2M, a2-Macroglobulin; AQLQ, Juniper Asthma Quality of Life Questionnaire; BD, bronchodilator; BDP, beclomethasone dipropionate equivalent; BMI, body mass index; FGF, fibroblast growth factor; FVC, forced vital capacity; GINA, Global Initiative for Asthma; MPO, myeloperoxidase; SNOT-20, Sino-Nasal Outcome Test 20.

analysis, providing new insights into the roles of YKL-40 and MMPs in patients with severe asthma.

Several previous studies have investigated the existence of endotypes by using predominantly clinical parameters, which are straightforward to apply.3,7,24 Other studies have incorporated the additional dimension of sputum cell differential.5,6,8 Haldar et al6 observed 2 clusters characterized by discordance between symptoms and eosinophilic inflammation. Likewise, Wu et al5 included peripheral blood counts and bronchoalveolar lavage cell differentials when conducting a machine learning approach to analyze Severe Asthma Research Programme (SARP) data, identifying 6 asthma clusters. However, their analysis did not include induced sputum, and the composition of the clusters was again influenced mostly by clinical parameters because the selected markers of eosinophilic or neutrophilic inflammation provided only modest discriminating power in the model (INFOGAIN value, 0.12-0.16). Our analyses included sputum eosinophil and neutrophil counts; sputum subclass determination; measurement of Feno, periostin, high-sensitivity C-reactive protein, and serum IgE levels; and a restricted number of clinical parameters found to be nonredundant and have strong discriminatory power in our previous study.10 We identified 6 endotypes that could be replicated robustly in the validation

cohort. The 2 dimensions of neutrophilic and eosinophilic inflammation are strongly influential in defining the shape of the data set and the composition of these clusters. As is clear from the Bayesian analysis, airway neutrophilia and eosinophilia represent 2 distinct inflammatory networks that likely contribute separately to asthma symptoms, again underlining the importance of considering these 2 specific dimensions when phenotyping an individual patient.

It is interesting that we identified more than 1 eosinophilic or neutrophilic cluster. Although airway eosinophilia and high periostin levels were common to clusters E and F, cluster F had the additional component of neutrophilic inflammation, leading to higher symptom scores and a mixed granulocytic subtype.25 Sputum ECP levels were significantly increased in group F (Table IV). ECP is a basic protein released during eosinophil degranulation and is highly correlated in our Bayesian analysis with both neutrophilic (sputum myeloperoxidase) and eosinophilic (sputum eosinophils and IL-5) inflammation (Fig 3 and see Fig E11). In the derivation cohort an additional neutrophilic cluster (ie, cluster G) was identified but not replicated, likely because of the small sample size, perhaps representing a milder spectrum of neutrophilic asthma with low periostin levels.

[D] High periostin High depression scores

H The most neutrophilic disease, severe airflow obstruction. High ACQ score, Low periostin.

C High sputum MMPs Low periostin Highest BMI

B Sinonasal disease Oldest Anxiety and depression High periostin ,

High MMP3

E Eosinophilic disease High periostin. High FeNO Early onset. High sputum IL5

A Paucigranulocytic Young \

Few symptoms (lowesV ACQ score)

More likely GINA step 2 Low periostin

F High periostin High sputum eosinophils and neutrophils Severe airflow obstruction Depression. High sputum ECP. Nasal polyposis. Aspirin sensitive disease.

Health

[G] Predominantly neutrophilic disease Severe airflow obstruction Low periostin

FIG 2. Multidimensional clinicopathological clusters in asthmatic patients in the derivation data set (Southampton cohort). A topological network generated by using 22 clinical and pathological features together identifies 1 healthy (in blue) and 8 distinct clinicopathobiologic asthma clusters (A-H). The network is colored according to ACQ7 scores, with the most symptomatic subjects in red. The TDA used 145 subjects with the most complete data: metric, variance-normalized Euclidean; lenses, principal and secondary singular value decomposition (resolution, 30; gain, 3.0/3.0X, equalized) and presence/absence of asthma; node size, proportional to the number of subjects in the node. Color bars: red, highest ACQ7 score; blue, healthy participants. Features in boldface were replicated in the validation data set. GINA, Global Initiative for Asthma.

Similarities highlighted between clusters A, E, F, and H and analogous clusters we previously described in a separate cohort10 suggest our analytic approach produces consistent findings and that these clusters are likely related to fundamental differences in underlying disease mechanisms. The previous study aimed to investigate the full spectrum of clinical asthma. Thus 2 clusters of moderate disease in the IL-17 cohort were not replicated in the current study, which enrolled very few patients with moderate asthma, focusing instead on severe asthma, which is typical of our difficult asthma clinics.

How might our clusters relate to possible treatments? Our description of group E is consistent with descriptions by Haldar et al,6 Newby et al,8 Wu et al,5 Fingleton et al,9 and Hinks et al10 of a cluster of highly atopic, early-onset, eosinophilic asthma with high Feno values. We have shown this group to be characterized by high serum periostin and sputum IL-5 levels (see Fig E4 and Hinks et al10) despite high-dose corticosteroids, suggesting a

likely response to agents, such as mepolizumab, targeting the IL-5 pathway. That the type 2 cytokines IL-5 and IL-13 are not associated in the Bayesian network and were not correlated in sputum suggests IL-13 is more susceptible to steroid therapy.10 Conversely, like others,9 we found mean serum IgE levels to be increased in all clusters, suggesting anti-IgE therapies might benefit several phenotypes. As before,10 we found no evidence of a significant dysregulation of airway IL-17 in any subgroup, implying this cytokine might not be a promising target.

In contrast to recent developments of therapies targeting TH2-mediated eosinophilic inflammation, there has been little progress in therapeutics for neutrophilic disease. Our Bayesian network highlights the prominent role of YKL-40 in neutrophilic inflammation. YKL-40 is a chitinase-like protein expressed by differentiated macrophages and is believed to regulate the magnitude of tissue injury and fibroproliferative repair in neutrophil granules.26 YKL-40 polymorphisms are associated with asthma,

FIG 3. Bayesian belief network showing the strongest interactions between pathobiologic parameters across a range of clinical severities of asthma or health. Nodes without strong interactions are excluded. Line thickness represents the strength of the interaction (Euclidean distance). Line colors: green, positive associations; red, negative associations; black, nonlinear associations. AQLQ, Juniper Asthma Quality of Life Questionnaire; BD, bronchodilator; FGF, fibroblast growth factor; GCSF, Granulocyte-colony stimulating factor; GINA, Global Initiative for Asthma; MPO, myeloperoxidase; SPT, skin prick test; VEGF, vascular endothelial growth factor.

bronchial hyperresponsiveness, and reduced lung function.27,28 Serum and sputum YKL-40 levels are increased in patients with severe asthma and correlate with disease severity, airway obstruction, and basement membrane thickness.29,30 We observed a very strong correlation between sputum YKL-40 and sputum IL-8 levels, in addition to several other markers of neutrophilic inflammation, which is consistent with ex vivo observations that YKL-40 induces IL-8 from bronchial epithelia and stimulates smooth muscle proliferation.31 In addition, airway YKL-40 level increases in human allergen challenge and murine models implicate YKL-40 in airway eosinophilia and IL-5 production.26 In our data a moderate association (rs = 0.477, P < .0001, data not shown) was seen with sputum IL-5 levels. Although genetic and in vitro data are suggestive, it remains to be determined whether YKL-40 plays a causative role in asthma or is simply a marker of extracellular tissue remodeling.

MMPs have also been implicated in tissue remodeling in asthmatic patients by human genetic studies,32,33 murine emphysema models,34-36 and the findings of increased bronchoalveolar lavage MMP3 and MMP9 levels in patients with status asthmati-cus37 and increased sputum MMP12 levels in asthmatic smokers.38 Our study, the largest to date, confirms and extends these findings with robust evidence of an increase in levels of

specific MMPs in patients with severe asthma. These include MMP1, MMP3, and MMP8, the 3 secreted type 3 collagenases able to degrade collagen at neutral pH. We show that these are strongly associated with neutrophilic inflammation, dysregulated in obese asthmatic patients, and correlated positively with sputum IL-5 levels but negatively with sputum IL-13 levels, perhaps because MMP8 cleaves the IL-13 decoy receptor (IL-13 receptor a2).20

In conclusion, we have identified and replicated 6 clinicopa-thobiologic clusters using assays and sputum induction available in clinical practice. Our data underline a disconnect between clinical features and underlying inflammation, suggest IL-5 production is relatively steroid insensitive, and highlight the roles of YKL-40 in neutrophilic inflammation and specific MMPs in severe asthma.

We thank Emma Ray, Lorraine Hewitt, Lisa Aitken, Sumita Kerley, and Kerry Gove and other nursing support from the NIHR Southampton Respiratory Biomedical Research Unit and the NIHR Wellcome Trust Southampton Clinical Research Facility. We thank Dr Borislav Dimitrov, Dr Gunnar Carlsson, and Devi Ramanan for statistical advice and Dr Paul Elkington for discussions and critical review of the manuscript. We extend our gratitude to all the volunteers who gave of their time and enthusiasm to make this research possible.

A Sputum MPO

100000 rs=0 884

Q= toooo

£ looo =J

Q. »• (ß

P<0.0001

Sputum YKL-40

E Sputum IL-6SR

rs=0.758 P<0.0001

(Y 10000

_! 1000 -

_J 100-

<0 100

В Sputum IL-8

100000- , rs=0.837

С Sputum VEGF

f- 1000 3

"S 100

(Л 10

P<0.0001 jÂjfc*"

■Aï?*

pg/ml ц_мооо О

Ш looo >

£ 100

rs=0.661 P<0.0001

Sputum YKL-40

F Sputum IL-6

fs=0.4 P<0.0001

pg/ml , дод 10000, 's-и.чоч

1000 100 10

Sputum YKL-40

G Sputum neutrophils

rs=0.484 P<0.0001

*• * «Ж

x: 100-, Q.

О 00-

Ф "J с

-g го

Q. . (О

D Sputum elastase

E 3 "S

q. СЯ

fs=0.697

P<0.0001 •

Sputum YKL-40 H Serum and sputum YKL40

.л- '

% 'V * * -jfi- •

1Й!1 rs=0.434

Р<0.0001. •

1000 n g/ml

Sputum YKL-40

Sputum YKL-40

Sputum YKL-40

Sputum YKL-40

1000 ng/ml

FIG 4. Inflammatory mediators associated with sputum YKL-40 levels. Spearman correlations between levels of sputum YKL-40 and sputum myeloperoxidase (MPO; A), IL-8 (B), vascular endothelial growth factor (VEGF; C), elastase (D), IL-6 soluble receptor (IL-6SR; E), IL-6 (F), neutrophils (G), and serum YKL-40 (H) are shown.

Key messages

• We have identified and replicated 6 clinicopathobiologic asthma endotypes.

• MMP1, MMP3, MMP8, and MMP12 levels are increased in patients with severe asthma and associated with sputum IL-5 levels.

• YKL-40 is strongly implicated in neutrophilic asthma.

REFERENCES

1. Global Initiative for Asthma (GINA) Global strategy for asthma management and prevention 2015. Available at: http://www.ginasthma.org/. Accessed January 8, 2016.

2. Anderson GP. Endotyping asthma: new insights into key pathogenic mechanisms in a complex, heterogeneous disease. Lancet 2008;372:1107-19.

3. Lotvall J, Akdis CA, Bacharier LB, Bjermer L, Casale TB, Custovic A, et al. Asthma endotypes: a new approach to classification of disease entities within the asthma syndrome. J Allergy Clin Immunol 2011;127:355-60.

4. Moore WC, Bleecker ER, Curran-Everett D, Erzurum SC, Ameredes BT, Bacharier L, et al. Characterization of the severe asthma phenotype by the National Heart, Lung, and Blood Institute's Severe Asthma Research Program. J Allergy Clin Immunol 2007;119:405-13.

5. Wu W, Bleecker E, Moore W, Busse WW, Castro M, Chung KF, et al. Unsupervised phenotyping of Severe Asthma Research Program participants using expanded lung data. J Allergy Clin Immunol 2014;133:1280-8.

6. Haldar P, Pavord ID, Shaw DE, Berry MA, Thomas M, Brightling CE, et al. Cluster analysis and clinical asthma phenotypes. Am J Respir Crit Care Med 2008;178:218-24.

7. Boudier A, Curjuric I, Basagana X, Hazgui H, Anto JM, Bousquet J, et al. Ten-year follow-up of cluster-based asthma phenotypes in adults. A pooled analysis of three cohorts. Am J Respir Crit Care Med 2013;188:550-60.

8. Newby C, Heaney LG, Menzies-Gow A, Niven RM, Mansur A, Bucknall C, et al. Statistical cluster analysis of the British Thoracic Society Severe refractory Asthma Registry: clinical outcomes and phenotype stability. PLoS One 2014;9: e102987.

9. Fingleton J, Travers J, Williams M, Charles T, Bowles D, Strik R, et al. Treatment responsiveness of phenotypes of symptomatic airways obstruction in adults. J Allergy Clin Immunol 2015;136:601-9.

10. Hinks TS, Zhou X, Staples KJ, Dimitrov BD, Manta A, Petrossian T, et al. Innate and adaptive T cells in asthmatic patients: relationship to severity and disease mechanisms. J Allergy Clin Immunol 2015;136:323-33.

11. Juniper EF, O'Byrne PM, Ferrie PJ, King DR, Roberts JN. Measuring asthma control. Clinic questionnaire or daily diary? Am J Respir Crit Care Med 2000;162: 1330-4.

12. Juniper EF, Guyatt GH, Epstein RS, Ferrie PJ, Jaeschke R, Hiller TK. Evaluation of impairment of health related quality of life in asthma: development of a questionnaire for use in clinical trials. Thorax 1992;47:76-83.

13. Zigmond AS, Snaith RP. The hospital anxiety and depression scale. Acta Psychiatr Scand 1983;67:361-70.

14. Piccirillo JF, Merritt MG Jr, Richards ML. Psychometric and clinimetric validity of the 20-Item Sino-Nasal Outcome Test (SN0T-20). Otolaryngol Head Neck Surg 2002;126:41-7.

15. Ware JE Jr, Sherbourne CD. The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Med Care 1992;30:473-83.

16. Doty RL, Shaman P, Kimmelman CP, Dann MS. University of Pennsylvania Smell Identification Test: a rapid quantitative olfactory function test for the clinic. Laryngoscope 1984;94:176-8.

17. Bafadhel M, McCormick M, Saha S, McKenna S, Shelley M, Hargadon B, et al. Profiling of sputum inflammatory mediators in asthma and chronic obstructive pulmonary disease. Respiration 2012;83:36-44.

18. Carlsson G. Topology and data. Bull Am Math Soc 2009;46:255-308.

19. Lum PY, Singh G, Lehman A, Ishkanov T, Vejdemo-Johansson M, Alagappan M, et al. Extracting insights from the shape of complex data using topology. Sci Rep 2013;3:1236.

20. Chen W, Tabata Y, Gibson AM, Daines MO, Warrier MR, Wills-Karp M, et al. Matrix metalloproteinase 8 contributes to solubilization of IL-13 receptor alpha2 in vivo. J Allergy Clin Immunol 2008;122:625-32.

21. Monteseirin J, Vega A, Chacon P, Camacho MJ, El Bekay R, Asturias JA, et al. Neutrophils as a novel source of eosinophil cationic protein in IgE-mediated processes. J Immunol 2007;179:2634-41.

22. Kita H, Abu-Ghazaleh RI, Sur S, Gleich GJ. Eosinophil major basic protein induces degranulation and IL-8 production by human eosinophils. J Immunol 1995;154:4749-58.

23. Moons KG, Altman DG, Reitsma JB, Ioannidis JP, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med 2015;162:W1-73.

24. Moore WC, Meyers DA, Wenzel SE, Teague WG, Li H, Li X, et al. Identification of asthma phenotypes using cluster analysis in the Severe Asthma Research Program. Am J Respir Crit Care Med 2010;181:315-23.

25. Simpson JL, Scott R, Boyle MJ, Gibson PG. Inflammatory subtypes in asthma: assessment and identification using induced sputum. Respirology 2006;11:54-61.

26. Kang MJ, Yoon CM, Nam M, Kim DH, Choi JM, Lee CG, et al. Role of chitinase 3-like-1 in IL-18-induced pulmonary type-1, -2 and -17 inflammation, alveolar destruction and airway fibrosis in the murine lung. Am J Respir Cell Mol Biol 2015;53:863-71.

27. Ober C, Tan Z, Sun Y, Possick JD, Pan L, Nicolae R, et al. Effect of variation in CHI3L1 on serum YKL-40 level, risk of asthma, and lung function. N Engl J Med 2008;358:1682-91.

28. Gomez JL, Crisafi GM, Holm CT, Meyers DA, Hawkins GA, Bleecker ER, et al. Genetic variation in chitinase 3-like 1 (CHI3L1) contributes to asthma severity and airway expression of YKL-40. J Allergy Clin Immunol 2015;136:51-8.e10.

29. Chupp GL, Lee CG, Jarjour N, Shim YM, Holm CT, He S, et al. A chitinase-like protein in the lung and circulation of patients with severe asthma. N Engl J Med 2007;357:2016-27.

30. Otsuka K, Matsumoto H, Niimi A, Muro S, Ito I, Takeda T, et al. Sputum YKL-40 levels and pathophysiology of asthma and chronic obstructive pulmonary disease. Respiration 2012;83:507-19.

31. Tang H, Sun Y, Shi Z, Huang H, Fang Z, Chen J, et al. YKL-40 induces IL-8 expression from bronchial epithelium via MAPK (JNK and ERK) and NF-kappaB pathways, causing bronchial smooth muscle proliferation and migration. J Immunol 2013;190:438-46.

32. Mukhopadhyay S, Sypek J, Tavendale R, Gartner U, Winter J, Li W, et al. Matrix metalloproteinase-12 is a therapeutic target for asthma in children and young adults. J Allergy Clin Immunol 2010;126:70-6.e16.

33. Hunninghake GM, Cho MH, Tesfaigzi Y, Soto-Quiros ME, Avila L, Lasky-Su J, et al. MMP12, lung function, and COPD in high-risk populations. N Engl J Med 2009;361:2599-608.

34. D'Armiento J, Dalal SS, Okada Y, Berg RA, Chada K. Collagenase expression in the lungs of transgenic mice causes pulmonary emphysema. Cell 1992;71: 955-61.

35. Shapiro SD. Animal models for chronic obstructive pulmonary disease: age of klotho and marlboro mice. Am J Respir Cell Mol Biol 2000;22:4-7.

36. Hautamaki RD, Kobayashi DK, Senior RM, Shapiro SD. Requirement for macrophage elastase for cigarette smoke-induced emphysema in mice. Science 1997;277:2002-4.

37. Lemjabbar H, Gosset P, Lamblin C, Tillie I, Hartmann D, Wallaert B, et al. Contribution of 92 kDa gelatinase/type IV collagenase in bronchial inflammation during status asthmaticus. Am J Respir Crit Care Med 1999;159: 1298-307.

38. Chaudhuri R, McSharry C, Brady J, Donnelly I, Grierson C, McGuinness S, et al. Sputum matrix metalloproteinase-12 in patients with chronic obstructive pulmonary disease and asthma: relationship to disease severity. J Allergy Clin Immunol 2012;129:655-63.e8.