Search

Article
Peer Reviewed

Asthma clustering methods: a literature-informed application to the children’s health study data

UCLA Previously Published Works (2022)

Objective

The heterogeneity of asthma has inspired widespread application of statistical clustering algorithms to a variety of datasets for identification of potentially clinically meaningful phenotypes. There has not been a standardized data analysis approach for asthma clustering, which can affect reproducibility and clinical translation of results. Our objective was to identify common and effective data analysis practices in the asthma clustering literature and apply them to data from a Southern California population-based cohort of schoolchildren with asthma.

Methods

As of January 1, 2020, we reviewed key statistical elements of 77 asthma clustering studies. Guided by the literature, we used 12 input variables and three clustering methods (hierarchical clustering, k-medoids, and latent class analysis) to identify clusters in 598 schoolchildren with asthma from the Southern California Children's Health Study (CHS).

Results

Clusters of children identified by latent class analysis were characterized by exhaled nitric oxide, FEV₁/FVC, FEV₁ percent predicted, asthma control and allergy score; and were predictive of control at two year follow up. Clusters from the other two methods were less clinically remarkable, primarily differentiated by sex and race/ethnicity and less predictive of asthma control over time.

Conclusion

Upon review of the asthma phenotyping literature, common approaches of data clustering emerged. When applying these elements to the Children's Health Study data, latent class analysis clusters-represented by exhaled nitric oxide and spirometry measures-had clinical relevance over time.

Cover page: Asthma clustering methods: a literature-informed application to the children’s health study data

Article
Peer Reviewed

Constrained Mixed-Effect Models with Ensemble Learning for Prediction of Nitrogen Oxides Concentrations at High Spatiotemporal Resolution

UC Irvine Previously Published Works (2017)

Spatiotemporal models to estimate ambient exposures at high spatiotemporal resolutions are crucial in large-scale air pollution epidemiological studies that follow participants over extended periods. Previous models typically rely on central-site monitoring data and/or covered short periods, limiting their applications to long-term cohort studies. Here we developed a spatiotemporal model that can reliably predict nitrogen oxide concentrations with a high spatiotemporal resolution over a long time span (>20 years). Leveraging the spatially extensive highly clustered exposure data from short-term measurement campaigns across 1-2 years and long-term central site monitoring in 1992-2013, we developed an integrated mixed-effect model with uncertainty estimates. Our statistical model incorporated nonlinear and spatial effects to reduce bias. Identified important predictors included temporal basis predictors, traffic indicators, population density, and subcounty-level mean pollutant concentrations. Substantial spatial autocorrelation (11-13%) was observed between neighboring communities. Ensemble learning and constrained optimization were used to enhance reliability of estimation over a large metropolitan area and a long period. The ensemble predictions of biweekly concentrations resulted in an R² of 0.85 (RMSE: 4.7 ppb) for NO₂ and 0.86 (RMSE: 13.4 ppb) for NO_x. Ensemble learning and constrained optimization generated stable time series, which notably improved the results compared with those from initial mixed-effects models.

Cover page: Constrained Mixed-Effect Models with Ensemble Learning for Prediction of Nitrogen Oxides Concentrations at High Spatiotemporal Resolution

Article
Peer Reviewed

A Longitudinal Cohort Study of Body Mass Index and Childhood Exposure to Secondhand Tobacco Smoke and Air Pollution: The Southern California Children’s Health Study

UCLA Previously Published Works (2015)

Background

Childhood body mass index (BMI) and obesity prevalence have been associated with exposure to secondhand smoke (SHS), maternal smoking during pregnancy, and vehicular air pollution. There has been little previous study of joint BMI effects of air pollution and tobacco smoke exposure.

Methods

Information on exposure to SHS and maternal smoking during pregnancy was collected on 3,318 participants at enrollment into the Southern California Children's Health Study. At study entry at average age of 10 years, residential near-roadway pollution exposure (NRP) was estimated based on a line source dispersion model accounting for traffic volume, proximity, and meteorology. Lifetime exposure to tobacco smoke was assessed by parent questionnaire. Associations with subsequent BMI growth trajectory based on annual measurements and attained BMI at 18 years of age were assessed using a multilevel modeling strategy.

Results

Maternal smoking during pregnancy was associated with estimated BMI growth over 8-year follow-up (0.72 kg/m2 higher; 95% CI: 0.14, 1.31) and attained BMI (1.14 kg/m2 higher; 95% CI: 0.66, 1.62). SHS exposure before enrollment was positively associated with BMI growth (0.81 kg/m2 higher; 95% CI: 0.36, 1.27) and attained BMI (1.23 kg/m2 higher; 95% CI: 0.86, 1.61). Growth and attained BMI increased with more smokers in the home. Compared with children without a history of SHS and NRP below the median, attained BMI was 0.80 kg/m2 higher (95% CI: 0.27, 1.32) with exposure to high NRP without SHS; 0.85 kg/m2 higher (95% CI: 0.43, 1.28) with low NRP and a history of SHS; and 2.15 kg/m2 higher (95% CI: 1.52, 2.77) with high NRP and a history of SHS (interaction p-value 0.007). These results suggest a synergistic effect.

Conclusions

Our findings strengthen emerging evidence that exposure to tobacco smoke and NRP contribute to development of childhood obesity and suggest that combined exposures may have synergistic effects.

Cover page: A Longitudinal Cohort Study of Body Mass Index and Childhood Exposure to Secondhand Tobacco Smoke and Air Pollution: The Southern California Children’s Health Study

Article
Peer Reviewed

Applying Multivariate Segmentation Methods to Human Activity Recognition From Wearable Sensors’ Data

UCLA Previously Published Works (2019)

Background

Time-resolved quantification of physical activity can contribute to both personalized medicine and epidemiological research studies, for example, managing and identifying triggers of asthma exacerbations. A growing number of reportedly accurate machine learning algorithms for human activity recognition (HAR) have been developed using data from wearable devices (eg, smartwatch and smartphone). However, many HAR algorithms depend on fixed-size sampling windows that may poorly adapt to real-world conditions in which activity bouts are of unequal duration. A small sliding window can produce noisy predictions under stable conditions, whereas a large sliding window may miss brief bursts of intense activity.

Objective

We aimed to create an HAR framework adapted to variable duration activity bouts by (1) detecting the change points of activity bouts in a multivariate time series and (2) predicting activity for each homogeneous window defined by these change points.

Methods

We applied standard fixed-width sliding windows (4-6 different sizes) or greedy Gaussian segmentation (GGS) to identify break points in filtered triaxial accelerometer and gyroscope data. After standard feature engineering, we applied an Xgboost model to predict physical activity within each window and then converted windowed predictions to instantaneous predictions to facilitate comparison across segmentation methods. We applied these methods in 2 datasets: the human activity recognition using smartphones (HARuS) dataset where a total of 30 adults performed activities of approximately equal duration (approximately 20 seconds each) while wearing a waist-worn smartphone, and the Biomedical REAl-Time Health Evaluation for Pediatric Asthma (BREATHE) dataset where a total of 14 children performed 6 activities for approximately 10 min each while wearing a smartwatch. To mimic a real-world scenario, we generated artificial unequal activity bout durations in the BREATHE data by randomly subdividing each activity bout into 10 segments and randomly concatenating the 60 activity bouts. Each dataset was divided into ~90% training and ~10% holdout testing.

Results

In the HARuS data, GGS produced the least noisy predictions of 6 physical activities and had the second highest accuracy rate of 91.06% (the highest accuracy rate was 91.79% for the sliding window of size 0.8 second). In the BREATHE data, GGS again produced the least noisy predictions and had the highest accuracy rate of 79.4% of predictions for 6 physical activities.

Conclusions

In a scenario with variable duration activity bouts, GGS multivariate segmentation produced smart-sized windows with more stable predictions and a higher accuracy rate than traditional fixed-size sliding window approaches. Overall, accuracy was good in both datasets but, as expected, it was slightly lower in the more real-world study using wrist-worn smartwatches in children (BREATHE) than in the more tightly controlled study using waist-worn smartphones in adults (HARuS). We implemented GGS in an offline setting, but it could be adapted for real-time prediction with streaming data.

Article
Peer Reviewed

Air pollution exposure is associated with the gut microbiome as revealed by shotgun metagenomic sequencing

UC San Diego Previously Published Works (2020)

Animal work indicates exposure to air pollutants may alter the composition of the gut microbiota. This study examined relationships between air pollutants and the gut microbiome in young adults residing in Southern California. Our results demonstrate significant associations between exposure to air pollutants and the composition of the gut microbiome using whole-genome sequencing. Higher exposure to 24-hour O₃ was associated with lower Shannon diversity index, higher Bacteroides caecimuris, and multiple gene pathways, including L-ornithine de novo biosynthesis as well as pantothenate and coenzyme A biosynthesis I. Among other pollutants, higher NO₂ exposure was associated with fewer taxa, including higher Firmicutes. The percent variation in gut bacterial composition that was explained by air pollution exposure was up to 11.2% for O₃ concentrations, which is large compared to the effect size for many other covariates reported in healthy populations. This study provides the first evidence of significant associations between exposure to air pollutants and the compositional and functional profile of the human gut microbiome. These results identify O₃ as an important pollutant that may alter the human gut microbiome.

Cover page: Air pollution exposure is associated with the gut microbiome as revealed by shotgun metagenomic sequencing

Article
Peer Reviewed

Outdoor Air Pollution and New-Onset Airway Disease. An Official American Thoracic Society Workshop Report

UC San Francisco Previously Published Works (2020)

Although it is well accepted that air pollution exposure exacerbates preexisting airway disease, it has not been firmly established that long-term pollution exposure increases the risk of new-onset asthma or chronic obstruction pulmonary disease (COPD). This Workshop brought together experts on mechanistic, epidemiological, and clinical aspects of airway disease to review current knowledge regarding whether air pollution is a causal factor in the development of asthma and/or COPD. Speakers presented recent evidence in their respective areas of expertise related to air pollution and new airway disease incidence, followed by interactive discussions. A writing committee summarized their collective findings. The Epidemiology Group found that long-term exposure to air pollution, especially metrics of traffic-related air pollution such as nitrogen dioxide and black carbon, is associated with onset of childhood asthma. However, the evidence for a causal role in adult-onset asthma or COPD remains insufficient. The Mechanistic Group concluded that air pollution exposure can cause airway remodeling, which can lead to asthma or COPD, as well as asthma-like phenotypes that worsen with long-term exposure to air pollution, especially fine particulate matter and ozone. The Clinical Group concluded that air pollution is a plausible contributor to the onset of both asthma and COPD. Available evidence indicates that long-term exposure to air pollution is a cause of childhood asthma, but the evidence for a similar determination for adult asthma or COPD remains insufficient. Further research is needed to elucidate the exact biological mechanism underlying incident childhood asthma, and the specific air pollutant that causes it.

Cover page: Outdoor Air Pollution and New-Onset Airway Disease. An Official American Thoracic Society Workshop Report

Article
Peer Reviewed

A meta-analysis of genome-wide association studies for serum total IgE in diverse study populations

UC San Francisco Previously Published Works (2013)

Background

IgE is both a marker and mediator of allergic inflammation. Despite reported differences in serum total IgE levels by race-ethnicity, African American and Latino subjects have not been well represented in genetic studies of total IgE.

Objective

We sought to identify the genetic predictors of serum total IgE levels.

Methods

We used genome-wide association data from 4292 subjects (2469 African Americans, 1564 European Americans, and 259 Latinos) in the EVE Asthma Genetics Consortium. Tests for association were performed within each cohort by race-ethnic group (ie, African American, Latino, and European American) and asthma status. The resulting P values were meta-analyzed, accounting for sample size and direction of effect. Top single nucleotide polymorphism associations from the meta-analysis were reassessed in 6 additional cohorts comprising 5767 subjects.

Results

We identified 10 unique regions in which the combined association statistic was associated with total serum IgE levels (P<5.0×10(-6)) and the minor allele frequency was 5% or greater in 2 or more population groups. Variant rs9469220, corresponding to HLA-DQB1, was the single nucleotide polymorphism most significantly associated with serum total IgE levels when assessed in both the replication cohorts and the discovery and replication sets combined (P=.007 and 2.45×10(-7), respectively). In addition, findings from earlier genome-wide association studies were also validated in the current meta-analysis.

Conclusion

This meta-analysis independently identified a variant near HLA-DQB1 as a predictor of total serum IgE levels in multiple race-ethnic groups. This study also extends and confirms the findings of earlier genome-wide association analyses in African American and Latino subjects.

Cover page: A meta-analysis of genome-wide association studies for serum total IgE in diverse study populations

Article
Peer Reviewed

Air Pollution and Lung Function in Minority Youth with Asthma in the GALA II (Genes–Environments and Admixture in Latino Americans) and SAGE II (Study of African Americans, Asthma, Genes, and Environments) Studies

UC San Francisco Previously Published Works (2016)

Rationale

Adverse effects of exposures to ambient air pollution on lung function are well documented, but evidence in racial/ethnic minority children is lacking.

Objectives

To assess the relationship between air pollution and lung function in minority children with asthma and possible modification by global genetic ancestry.

Methods

The study population consisted of 1,449 Latino and 519 African American children with asthma from five different geographical regions in the mainland United States and Puerto Rico. We examined five pollutants (particulate matter ≤10 μm and ≤2.5 μm in diameter, ozone, nitrogen dioxide, and sulfur dioxide), derived from participant residential history and ambient air monitoring data, and assessed over several time windows. We fit generalized additive models for associations between pollutant exposures and lung function parameters and tested for interaction terms between exposures and genetic ancestry.

Measurements and main results

A 5 μg/m(3) increase in average lifetime particulate matter less than or equal to 2.5 μm in diameter exposure was associated with a 7.7% decrease in FEV1 (95% confidence interval = -11.8 to -3.5%) in the overall study population. Global genetic ancestry did not appear to significantly modify these associations, but percent African ancestry was a significant predictor of lung function.

Conclusions

Early-life particulate exposures were associated with reduced lung function in Latino and African American children with asthma. This is the first study to report an association between exposure to particulates and reduced lung function in minority children in which racial/ethnic status was measured by ancestry-informative markers.

Cover page: Air Pollution and Lung Function in Minority Youth with Asthma in the GALA II (Genes–Environments and Admixture in Latino Americans) and SAGE II (Study of African Americans, Asthma, Genes, and Environments) Studies

Article
Peer Reviewed

Genetic ancestry influences asthma susceptibility and lung function among Latinos

UC San Francisco Previously Published Works (2015)

Background

Childhood asthma prevalence and morbidity varies among Latinos in the United States, with Puerto Ricans having the highest and Mexicans the lowest.

Objective

To determine whether genetic ancestry is associated with the odds of asthma among Latinos, and secondarily whether genetic ancestry is associated with lung function among Latino children.

Methods

We analyzed 5493 Latinos with and without asthma from 3 independent studies. For each participant, we estimated the proportion of African, European, and Native American ancestry using genome-wide data. We tested whether genetic ancestry was associated with the presence of asthma and lung function among subjects with and without asthma. Odds ratios (OR) and effect sizes were assessed for every 20% increase in each ancestry.

Results

Native American ancestry was associated with lower odds of asthma (OR = 0.72, 95% CI: 0.66-0.78, P = 8.0 × 10(-15)), while African ancestry was associated with higher odds of asthma (OR = 1.40, 95% CI: 1.14-1.72, P = .001). These associations were robust to adjustment for covariates related to early life exposures, air pollution, and socioeconomic status. Among children with asthma, African ancestry was associated with lower lung function, including both pre- and post-bronchodilator measures of FEV1 (-77 ± 19 mL; P = 5.8 × 10(-5) and -83 ± 19 mL; P = 1.1 x 10(-5), respectively) and forced vital capacity (-100 ± 21 mL; P = 2.7 × 10(-6) and -107 ± 22 mL; P = 1.0 x 10(-6), respectively).

Conclusion

Differences in the proportions of genetic ancestry can partially explain disparities in asthma susceptibility and lung function among Latinos.

Cover page: Genetic ancestry influences asthma susceptibility and lung function among Latinos

Article
Peer Reviewed

Lung Function in African American Children with Asthma Is Associated with Novel Regulatory Variants of the KIT Ligand KITLG/SCF and Gene-By-Air-Pollution Interaction

UC San Francisco Previously Published Works (2020)

Baseline lung function, quantified as forced expiratory volume in the first second of exhalation (FEV₁), is a standard diagnostic criterion used by clinicians to identify and classify lung diseases. Using whole-genome sequencing data from the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine project, we identified a novel genetic association with FEV₁ on chromosome 12 in 867 African American children with asthma (P = 1.26 × 10^-8, β = 0.302). Conditional analysis within 1 Mb of the tag signal (rs73429450) yielded one major and two other weaker independent signals within this peak. We explored statistical and functional evidence for all variants in linkage disequilibrium with the three independent signals and yielded nine variants as the most likely candidates responsible for the association with FEV₁ Hi-C data and expression QTL analysis demonstrated that these variants physically interacted with KITLG (KIT ligand, also known as SCF), and their minor alleles were associated with increased expression of the KITLG gene in nasal epithelial cells. Gene-by-air-pollution interaction analysis found that the candidate variant rs58475486 interacted with past-year ambient sulfur dioxide exposure (P = 0.003, β = 0.32). This study identified a novel protective genetic association with FEV₁, possibly mediated through KITLG, in African American children with asthma. This is the first study that has identified a genetic association between lung function and KITLG, which has established a role in orchestrating allergic inflammation in asthma.

Cover page: Lung Function in African American Children with Asthma Is Associated with Novel Regulatory Variants of the KIT Ligand KITLG/SCF and Gene-By-Air-Pollution Interaction