Search

Scholarly Works (36 results)

Sort By:

Show:

Thesis
Peer Reviewed

Beyond Standard Assumptions - Semiparametric Models, A Dyadic Item Response Theory Model, and Cluster-Endogenous Random Intercept Models

Sim, Nicholas
Advisor(s): Rabe-Hesketh, Sophia

UC Berkeley Electronic Theses and Dissertations (2019)

In most statistical analyses, quantitative education researchers often make simplifying assumptions regarding the manner in which their data was generated in order to answer some of these questions. These assumptions can help to reduce the complexity of the problem, and allow the researcher to describe their data using a simpler, and often times more interpretable, statistical model. However, making some of these assumptions when they are not true can lead to biased estimates and misleading answers. While the standard sets of assumptions associated with commonly-used statistical models are usually sufficient in a wide range of contexts, it will always be beneficial for education researchers to understand what they are, when they are reasonable, and how to modify them if necessary.

This dissertation focuses on three of the most common models used in quantitative education research (viz. parametric models like Linear Models (LMs), Item Response Theory (IRT) models, and Random-Intercept Models (RIMs)), discusses the standard sets of assumptions that accompany these models, and then describes related models with less stringent sets of assumptions. In each of the following three chapters, we either explicitly unpack existing models that are useful but are currently still uncommon in the field of education research, or propose novel models and/or estimation strategies for these models.

We begin in Chapter 1 with a common parametric model known as the Gaussian LM, and use it as a scaffold to better understand semiparametric models and their estimation. We begin by reviewing how the coefficients of the Gaussian LM are usually estimated using Maximum Likelihood (ML) or Least-Squares (LS). We then introduce the notion of an $m$-estimator as well as that of a Regular Asymptotically Linear estimator, and show how they relate to the ML estimator. In particular, we introduce the notion of influence functions/curves and discuss their geometry together with concepts such as Hilbert spaces and tangent spaces. We then demonstrate, concretely, how to derive the so-called efficient influence function under the Gaussian LM, and show that it is precisely the influence function of the ML and (Ordinary) LS estimators. This shows that the ML estimator (at least under the Gaussian LM) is efficient. Using the foundation built, we move on from the Gaussian LM by relaxing both the assumption that the residuals are normally distributed, as well as the assumption that they have a constant variance, and define this as the Heteroskedastic Linear Model. Unlike the Gaussian LM, this is a semiparametric model. Where possible, we make use of intuition and analogous results from the parametric setting to help describe the workflow for obtaining an efficient estimator for the coefficients of the Heteroskedastic Linear Model. In particular, we derive the nuisance tangent space for this semiparametric model, and use it to obtain the efficient influence function for our model. We then show how to use the efficient influence function to obtain an efficient estimator (which happens to be the Weighted LS estimator) from the (Ordinary) LS estimator via a one-step approach as well as an estimating equations approach. We then conclude by directing readers to more advanced material, including references on more modern approaches to estimating more general semiparametric models such as Targeted Maximum Likelihood Estimation.

In Chapter 2, we focus on a class of measurement models known as Item Response Theory models which are useful for measuring latent traits of a subject based on the subject's response to items. We relax the condition that the responses are only a result of the individual's latent trait (and possibly an external rater), and propose a dyadic Item Response Theory (dIRT) model for measuring interactions of pairs of individuals when the responses to items represent the actions (or behaviors, perceptions, etc.) of each individual (actor) made within the context of a dyad formed with another individual (partner). Examples of its use in education include the assessment of collaborative problem solving among students, or the evaluation of intra-departmental dynamics among teachers. The dIRT model generalizes both Item Response Theory models for measurement and the Social Relations Model for dyadic data. Here, the responses of an actor when paired with a partner are modeled as a function of not only the actor's inclination to act and the partner's tendency to elicit that action, but also the unique relationship of the pair, represented by two directional, possibly correlated, interaction latent variables. We discuss generalizations such as accommodating triads or larger groups, but focus on demonstrating the key idea in the dyadic case. We show that estimation may be performed using Markov-chain Monte Carlo implemented in \texttt{Stan}, making it straightforward to extend the dIRT model in various ways. Specifically, we show how the basic dIRT model can be extended to accommodate latent regressions, random effects, distal outcomes. We perform a simulation study that demonstrates that our estimation approach performs well. In the absence of educational data of this form, we demonstrate the usefulness of our proposed approach using speed-dating data instead, and find new evidence of pairwise interactions between participants, describing a mutual attraction that is inadequately characterized by individual properties alone.

Finally, in Chapter 3, we consider the often implicit assumption made when estimating the coefficients of structural Random Intercept Models (RIMs) that covariates at all levels do not co-vary with the random intercepts. A violation of this assumption (called cluster-level endogeneity) leads to inconsistent estimates when using standard estimation procedures. For two-level RIMs with such endogeneity, Hausman and Taylor (HT) devised a consistent multi-step instrumental variable estimator using only internal instruments. We, instead, approach this problem by explicitly modeling the endogeneity using a Structural Equation Model (SEM). In this chapter, we compare, through simulation, the HT and SEM estimators, and evaluate their asymptotic and finite sample properties. We show that the SEM approach is also flexible enough to deal with different exchangeability assumptions for the covariates (e.g., whether the correlations between pairs of all units in a cluster are the same) and investigate how these exchangeability assumptions affect finite sample properties of the HT estimator. For the simulations, we propose a new procedure for generating cluster- and unit-level covariates and random intercepts with a fully flexible covariance structure. We also compare our approach to another common approach known as Multilevel Matching using data from the High School and Beyond survey.

Cover page: Beyond Standard Assumptions - Semiparametric Models, A Dyadic Item Response Theory Model, and Cluster-Endogenous Random Intercept Models

Article
Peer Reviewed

Latent Variable Modelling: A Survey*

UC Berkeley Previously Published Works (2007)

Latent variable modelling has gradually become an integral part of mainstream statistics and is currently used for a multitude of applications in different subject areas. Examples of 'traditional' latent variable models include latent class models, item-response models, common factor models, structural equation models, mixed or random effects models and covariate measurement error models. Although latent variables have widely different interpretations in different settings, the models have a very similar mathematical structure. This has been the impetus for the formulation of general modelling frameworks which accommodate a wide range of models. Recent developments include multilevel structural equation models with both continuous and discrete latent variables, multiprocess models and nonlinear latent variable models. © 2007 Board of the Foundation of the Scandinavian Journal of Statistics.

Creative Commons 'BY-NC-ND' version 4.0 license

Article
Peer Reviewed

Classical latent variable models for medical research

UC Berkeley Previously Published Works (2008)

Latent variable models are commonly used in medical statistics, although often not referred to under this name. In this paper we describe classical latent variable models such as factor analysis, item response theory, latent class models and structural equation models. Their usefulness in medical research is demonstrated using real data. Examples include measurement of forced expiratory flow, measurement of physical disability, diagnosis of myocardial infarction and modelling the determinants of clients' satisfaction with counsellors' interviews.

Article
Peer Reviewed

Handling initial conditions and endogenous covariates in dynamic/transition models for binary data with unobserved heterogeneity

UC Berkeley Previously Published Works (2014)

Distinguishing between longitudinal dependence due to the effects of previous responses on subsequent responses and dependence due to unobserved heterogeneity is important in many disciplinesFor example, wheezing is an inflammatory reaction that may 'remodel' a child's airway structure and thereby affect the probability of future wheezing (state dependence)Alternatively, children could vary in their susceptibilities because of unobserved covariates such as genes (unobserved heterogeneity)For binary responses, distinguishing between state dependence and unobserved heterogeneity is typically accomplished by using dynamic/transition models that include both a lagged response and a random interceptNaive maximum likelihood estimators can be severely inconsistent because of two kinds of endogeneity problem: lack of independence of the initial response and the random intercept (the initial conditions problem) and lack of independence of the covariates and the random intercept (the endogenous covariates problem)We clarify and unify previous work on handling these problems in the disconnected literatures of statistics and econometrics, suggest improved methods, investigate the asymptotic performance of competing methods and provide practical recommendationsThe recommended methods are applied to longitudinal data on children's wheezing, where we investigate the extent of state dependence and unobserved heterogeneity and whether there is an effect of maternal smoking© 2013 Royal Statistical Society.

Article
Peer Reviewed

Prediction in multilevel generalized linear models

UC Berkeley Previously Published Works (2009)

We discuss prediction of random effects and of expected responses in multilevel generalized linear models. Prediction of random effects is useful for instance in small area estimation and disease mapping, effectiveness studies and model diagnostics. Prediction of expected responses is useful for planning, model interpretation and diagnostics. For prediction of random effects, we concentrate on empirical Bayes prediction and discuss three different kinds of standard errors; the posterior standard deviation and the marginal prediction error standard deviation (comparative standard errors) and the marginal sampling standard deviation (diagnostic standard error). Analytical expressions are available only for linear models and are provided in an appendix. For other multilevel generalized linear models we present approximations and suggest using parametric bootstrapping to obtain standard errors. We also discuss prediction of expectations of responses or probabilities for a new unit in a hypothetical cluster, or in a new (randomly sampled) cluster or in an existing cluster. The methods are implemented in gllamm and illustrated by applying them to survey data on reading proficiency of children nested in schools. Simulations are used to assess the performance of various predictions and associated standard errors for logistic random-intercept models under a range of conditions. © 2009 Royal Statistical Society.

Article
Peer Reviewed

The Role of Conditional Likelihoods in Latent Variable Modeling

UC Berkeley Previously Published Works (2022)

In psychometrics, the canonical use of conditional likelihoods is for the Rasch model in measurement. Whilst not disputing the utility of conditional likelihoods in measurement, we examine a broader class of problems in psychometrics that can be addressed via conditional likelihoods. Specifically, we consider cluster-level endogeneity where the standard assumption that observed explanatory variables are independent from latent variables is violated. Here, "cluster" refers to the entity characterized by latent variables or random effects, such as individuals in measurement models or schools in multilevel models and "unit" refers to the elementary entity such as an item in measurement. Cluster-level endogeneity problems can arise in a number of settings, including unobserved confounding of causal effects, measurement error, retrospective sampling, informative cluster sizes, missing data, and heteroskedasticity. Severely inconsistent estimation can result if these challenges are ignored.

Cover page: The Role of Conditional Likelihoods in Latent Variable Modeling

Thesis
Peer Reviewed

Models for Understanding Student Thinking using Data from Complex Computerized Science Tasks

LaMar, Michelle
Advisor(s): Rabe-Hesketh, Sophia

UC Berkeley Electronic Theses and Dissertations (2014)

The Next Generation Science Standards (NGSS Lead States, 2013) define performance targets which will require assessment tasks that can integrate discipline knowledge and cross-cutting ideas with the practices of science. Complex computerized tasks will likely play a large role in assessing these standards, but many questions remain about how best to make use of such tasks within a psychometric framework (National Research Council, 2014). This dissertation explores the use of a more extensive cognitive modeling approach, driven by the extra information contained in action data collected while students interact with complex computerized tasks. Three separate papers are included. In Chapter 2, a mixture IRT model is presented that simultaneously classifies student understanding of a task while measuring student ability within their class. The model is based on differentially scoring the subtask action data from a complex performance. Simulation studies show that both class membership and class-specific ability can be reasonably estimated given sufficient numbers of items and response alternatives. The model is then applied to empirical data from a food-web task, providing some evidence of feasibility and validity. Chapter 3 explores the potential of using a more complex cognitive model for assessment purposes. Borrowing from the cognitive science domain, student decisions within a strategic task are modeled with a Markov decision process. Psychometric properties of the model are explored and simulation studies report on parameter recovery within the context of a simple strategy game. In Chapter 4 the Markov decision process (MDP) measurement model is then applied to an educational game to explore the practical benefits and difficulties of using such a model with real world data. Estimates from the MDP model are found to correlate more strongly with posttest results than a partial-credit IRT model based on outcome data alone.

Cover page: Models for Understanding Student Thinking using Data from Complex Computerized Science Tasks

Thesis
Peer Reviewed

Estimation of Complex Generalized Linear Mixed Models for Measurement and Growth

Jeon, Minjeong
Advisor(s): Rabe-Hesketh, Sophia

UC Berkeley Electronic Theses and Dissertations (2012)

Maximum likelihood (ML) estimation of generalized linear mixed models (GLMMs) is technically challenging because of the intractable likelihoods that involve high dimensional integrations over random effects. The problem is magnified when the random effects have a crossed design and thus the data cannot be reduced to small independent clusters. A variety of methods have been developed for approximating the intractable likelihood functions, but there seems no method yet that is both computationally efficient and accurate in a wide range of situations. In this dissertation, I consider new estimation methods and applications of complex GLMMs for measurement and growth. The dissertation consists of three papers,

1) Variational maximization-maximization (MM) algorithm,

2) Monte Carlo local likelihood (MCLL) estimation,

and 3) Autoregressive item response theory (IRT) growth model for longitudinal item analysis.

In the first and second papers, I develop two ML methods for estimating GLMMs with crossed random effects. The variational MM algorithm is a modified expectation-maximization (EM) algorithm where a variational density is introduced in the expectation (E) step to approximate the true posterior density of the random effects given the data. The E-step is replaced by another maximization step that minimizes the Kullback-Leibler (KL) divergence between the posterior and the variational density, or equivalently, maximizes the lower bound of the log-likelihood with respect to the variational distribution. The MCLL algorithm uses the posterior samples of model parameters obtained from Markov chain Monte Carlo (MCMC) for likelihood inference. The posterior density is estimated by local likelihood density estimation and the likelihood function is approximated up to a constant by the local likelihood density estimate of the posterior divided by the prior. The performance of these new algorithms is evaluated using simulation and empirical studies and compared with other ML and Bayesian estimators. In the third paper, a new autoregressive IRT growth model is proposed to take into account serial correlations among responses to the same items over time. The consequences of ignoring serial dependence and the initial conditions problem are investigated using simulations. The new model is applied to longitudinal data of Korean students' self-esteem.

Cover page: Estimation of Complex Generalized Linear Mixed Models for Measurement and Growth

Article
Peer Reviewed

Avoiding biased versions of Wooldridge’s simple solution to the initial conditions problem

UC Berkeley Previously Published Works (2013)

Wooldridge (2005) provided a simple and elegant solution to the initial conditions problem for dynamic nonlinear unobserved-effects models. His original auxiliary model includes the time-varying explanatory variables at each period. Unfortunately, a popular constrained version that includes within-means of the explanatory variables can be severely biased. We show that there are several ways to avoid this problem. © 2013 Elsevier B.V.

Article
Peer Reviewed

Maximum likelihood estimation of endogenous switching and sample selection models for binary, ordinal, and count variables

UC Berkeley Previously Published Works (2006)

Studying behavior in economics, sociology, and statistics often involves fitting models in which the response variable depends on a dummy variable- also known as a regime-switch variable- or in which the response variable is observed only if a particular selection condition is met. In either case, standard regression techniques deliver inconsistent estimators if unobserved factors that affect the re- sponse are correlated with unobserved factors that affect the switching or selection variable. Consistent estimators can be obtained by maximum likelihood estimation of a joint model of the outcome and switching or selection variable. This article describes a “wrapper” program, ssm, that calls gllamm (Rabe-Hesketh, Skrondal, and Pickles, GLLAMM Manual [University of California – Berkeley, Division of Bio- statistics, Working Paper Series, Paper No. 160]) to fit such models. The wrapper accepts data in a simple structure, has a straightforward syntax, and reports out- put that is easily interpretable. One important feature of ssm is that the log likelihood can be evaluated using adaptive quadrature (Rabe-Hesketh, Skrondal, and Pickles, Stata Journal 2: 1–21; Journal of Econometrics 128: 301–323). Copyright 2006 by StataCorp LP.