- Main
A refined spirometry dataset for comparing segmented (piecewise) linear models to that of GAMLSS.
Published Web Location
https://doi.org/10.1016/j.dib.2024.111062Abstract
Generalized Additive Models for Location, Scale, and Shape (GAMLSS) are widely used for developing spirometric reference equations but are often complex, requiring additional spline tables. This study explores the potential of Segmented (piecewise) Linear Regression as an alternative, comparing its predictive accuracy to GAMLSS and examining the agreement between the two methods. Spirometry data from nearly 16,600 patients, deemed Grade A and B acceptable from the NHANES 2007-2012 dataset, was analyzed. The dataset includes both nominal and scalar variables. Reference equations for forced expiratory volume in 1 s (FEV1), forced vital capacity (FVC), and the ratio (FEV1/FVC) were generated using GAMLSS (FEV1, FVC, FEV1/FVC), Segmented Linear Regression (FEV1, FVC) and multiple linear regression (FEV1/FVC). K-fold cross-validation was employed to compare prediction accuracy, using root-mean-square error (RMSE) and correlation coefficients. Agreement in classifying spirometric patterns (i.e. airway obstruction, restrictive spirometry pattern, mixed obstructive and restrictive disorder) was evaluated with the kappa statistic. This study uniquely compares the models by incorporating the lower limit of normal (LLN) using fitted z-scores of -1.645 or -1.96. The dataset is publicly available in SPSS (.sav) and .csv formats through the Mendeley Data repository.
Many UC-authored scholarly publications are freely available on this site because of the UC's open access policies. Let us know how this access is important for you.
Main Content
Enter the password to open this PDF file:
-
-
-
-
-
-
-
-
-
-
-
-
-
-