- Wang, Hao;
- Shimizu, Chisato;
- Bainto, Emelia;
- Hamilton, Shea;
- Jackson, Heather;
- Estrada-Rivadeneyra, Diego;
- Kaforou, Myrsini;
- Levin, Michael;
- Pancheri, Joan;
- Dummer, Kirsten;
- Tremoulet, Adriana;
- Burns, Jane
BACKGROUND: Although Kawasaki disease is commonly regarded as a single disease entity, variability in clinical manifestations and disease outcome has been recognised. We aimed to use a data-driven approach to identify clinical subgroups. METHODS: We analysed clinical data from patients with Kawasaki disease diagnosed at Rady Childrens Hospital (San Diego, CA, USA) between Jan 1, 2002, and June 30, 2022. Patients were grouped by hierarchical clustering on principal components with k-means parcellation based on 14 variables, including age at onset, ten laboratory test results, day of illness at the first intravenous immunoglobulin infusion, and normalised echocardiographic measures of coronary artery diameters at diagnosis. We also analysed the seasonality and Kawasaki disease incidence from 2002 to 2019 by subgroup. To explore the biological underpinnings of identified subgroups, we did differential abundance analysis on proteomic data of 6481 proteins from 32 patients with Kawasaki disease and 24 healthy children, using linear regression models that controlled for age and sex. FINDINGS: Among 1016 patients with complete data in the final analysis, four subgroups were identified with distinct clinical features: (1) hepatobiliary involvement with elevated alanine transaminase, gamma-glutamyl transferase, and total bilirubin levels, lowest coronary artery aneurysm but highest intravenous immunoglobulin resistance rates (n=157); (2) highest band neutrophil count and Kawasaki disease shock rate (n=231); (3) cervical lymphadenopathy with high markers of inflammation (erythrocyte sedimentation rate, C-reactive protein, white blood cell, and platelet counts) and lowest age-adjusted haemoglobin Z scores (n=315); and (4) young age at onset with highest coronary artery aneurysm but lowest intravenous immunoglobulin resistance rates (n=313). The subgroups had distinct seasonal and incidence trajectories. In addition, the subgroups shared 211 differential abundance proteins while many proteins were unique to a subgroup. INTERPRETATION: Our data-driven analysis provides insight into the heterogeneity of Kawasaki disease, and supports the existence of distinct subgroups with important implications for clinical management and research design and interpretation. FUNDING: US National Institutes of Health and the Irving and Francine Suknow Foundation.