Much research in autism spectrum disorders (ASD) has focused on the development of efficacious interventions to address the core deficits of ASD. However, the heterogeneous nature of ASD complicates the development of such interventions. With great heterogeneity in the expression of ASD’s core deficits, it is unlikely that there is a one size fits all intervention. It is important for researchers to understand for whom an intervention works. Advancements in data analytics, in particular machine learning, provide new methods to identify subgroups among a given population, and can potentially help to identify for whom intervention works best. Of particular interest are minimally verbal individuals. A targeted social communication intervention known as JASPER (Joint Attention, Symbolic Play, Engagement, and Regulation) has shown promise for improving language outcomes among minimally verbal children with ASD and may provide the context to examine the question of for whom an intervention benefits. This study aims to develop a model predicting expressive language gains among minimally verbal, preschool aged children with ASD that received a targeted social communication intervention.
Classification and regression tree (CART) analysis was used to explore the relationship between child characteristics and gains in expressive language. Secondary data analysis was conducted on a sample of 99 minimally verbal, preschool age children with ASD, collected from participants across five previous intervention studies. Expressive language gains (outcome) were calculated using expressive language age equivalents from the Mullen Scales for Early Learning. Predictors for the analyses were taken from child demographics and behavioral assessments completed prior to intervention. The initial list of predictors included race, gender, ASD severity, visual reception age equivalent, fine motor age equivalent, joint attention gestures, requesting gestures, and play skills. Using expressive language age equivalent change scores, 47% (n = 47) of the sample were identified as “super responders,” children that exceeded expressive language gains typically expected through maturation. To predict responder status, all initial predictors were used to generate conditional inference forest, from which the most important variables would be chosen for the final model. Conditional inference results identified three variables to be fitted into the final model; play diversity, requesting gestures, and fine motor age equivalent. A final conditional inference tree was created, with play diversity being the only significant predictor of responder status. Participants with an entry play diversity score above 23 predicted super response while scores of 23 or below predicted slow response. The overall model accuracy was 67%, with a specificity of 55% and sensitivity of 78%. As a comparison, stepwise logistic regression was run, and play diversity was again the only significant predictor of responder status (χ2 (1) = 10.686, p = .001). Receiver operating characteristic curves were generated to compare model performance, and comparison of area under the curves for the two models showed no statistical difference (p = .82).
Overall accuracy of the conditional inference tree was moderate, and performed similarly to the more traditional logistic regression analysis. However, the conditional inference tree provides a cutoff point that may provide clinical utility over the regression results. Both models identify play diversity as in important predictor of expressive language gains from JASPER, which is a play based social communication intervention. Additionally, our model appears to be more sensitive to identifying slow responders. The role of play diversity and expressive language gains in JASPER is discussed.