Search

Scholarly Works (13 results)

Sort By:

Show:

Thesis
Peer Reviewed

Anytime Approximate Inference in Graphical Models

Lou, Qi
Advisor(s): Ihler, Alexander

UC Irvine Electronic Theses and Dissertations (2018)

Graphical models are a powerful framework for modeling interactions within complex systems. Reasoning over graphical models typically involves answering inference queries, such as computing the most likely configuration (maximum a posteriori or MAP) or evaluating the marginals or normalizing constant of a distribution (the partition function); a task called marginal MAP generalizes these two by maximizing over a subset of variables while marginalizing over the rest.

Exact computation of these queries is known to be intractable in general, leading to the development of many approximate schemes, the major categories of which are variational methods, search algorithms, and Monte Carlo sampling. Within these, anytime techniques that provide some guarantees on the correct value, and can be improved with more computational effort, are valued for quickly providing users with confidence intervals or certificates of accuracy and allow users to decide the desired balance of quality, time and memory.

In this dissertation, we develop a series of approximate inference algorithms for the partition function and marginal MAP with anytime properties by leveraging ideas and techniques from the three inference paradigms, and integrating them to provide hybrid solutions that inherit the strengths of all three approaches. We propose anytime anyspace best-first search algorithms that provide deterministic bounds on the partition function and marginal MAP. These best-first search schemes take advantage of both AND/OR tree search and optimized variational heuristics. We then extend this approach to give anytime probabilistic confidence bounds via a dynamic importance sampling algorithm, which interleaves importance sampling (using proposal distributions extracted from the variational bound) with our best-first search algorithm to refine the proposal. We also propose a framework for interleaving sampling with the optimization of the initial variational bound, which can automatically balance its computational effort between the two schemes. Overall, we show that our hybrid algorithms perform significantly better than existing methods, giving flexible approaches with excellent anytime confidence bounds.

Thesis
Peer Reviewed

Approximate Inference in Graphical Models

Forouzan, Sholeh
Advisor(s): Ihler, Alexander

UC Irvine Electronic Theses and Dissertations (2015)

Graphical models have become a central paradigm for knowledge representation and rea- soning over models with large numbers of variables. Any useful application of these models involves inference, or reasoning about the state of the underlying variables and quantify- ing the models’ uncertainty about any assignment to them. Unfortunately, exact inference in graphical models is fundamentally intractable, which has led to significant interest in approximate inference algorithms.

In this thesis we address several aspects of approximate inference that affect its quality. First, combining the ideas from variational inference and message passing on graphical models, we study how the regions over which the approximation is formed can be selected more effectively using a content-based scoring function that computes a local measure of the improvement to the upper bound to log partition function. We then extend this framework to use the available memory more efficiently, and show that this leads to better approximations. We propose different memory allocation strategies and empirically show how they can improve the quality of the approximation to the upper bound. Finally, we address the optimization algorithms used in approximate inference tasks. Focusing on maximum a posteriori (MAP) inference and linear programming (LP), we show how the Alternating Direction Method of Multipliers (ADMM) technique can provide an elegant algorithm for finding the saddle point of the augmented Lagrangian of the approximation, and present an ADMM-based algorithm to solve the primal form of the MAP-LP whose closed form updates are based on a linear approximation technique.

Cover page: Approximate Inference in Graphical Models

Thesis
Peer Reviewed

Reasoning and Decisions in Probabilistic Graphical Models - A Unified Framework

Liu, Qiang
Advisor(s): Ihler, Alexander

UC Irvine Electronic Theses and Dissertations (2014)

Probabilistic graphical models such as Markov random fields, Bayesian networks and decision networks (a.k.a. influence diagrams) provide powerful frameworks for representing and exploiting dependence structures in complex systems. However, making predictions or decisions using graphical models involve challenging computational problems of optimization and/or estimation in high dimensional spaces. These include combinatorial optimization tasks such as maximum a posteriori (MAP), which finds the most likely configuration, or marginalization tasks that calculate the normalization constants or marginal probabilities. Even more challenging tasks require a hybrid of both: marginal MAP tasks find the optimal MAP prediction while marginalizing over missing information or latent variables, while decision-making problems search for optimal policies over decisions in single- or multi-agent systems, in order to maximize expected utility in uncertain environments.

All these problems are generally NP-hard, creating a need for efficient approximations. The last two decades have witnessed significant progress on traditional optimization and marginalization problems, especially via the development of variational message passing algorithms. However, there has been less progress on the more challenging marginal MAP and decision-making problems.

This thesis presents a unified variational representation for all these problems. Based on our framework, we derive a class of efficient algorithms that combines the advantages of several existing algorithms, resulting in improved performance on traditional marginalization and optimization tasks. More importantly, our framework allows us to easily extend most or all existing variational algorithms to hybrid inference and decision-making tasks, and significantly improves our ability to solve these difficult problems. In particular, we propose a spectrum of efficient belief propagation style algorithms with "message passing" forms, which are simple, fast and amenable to parallel or distributed computation. We also propose a set of convergent algorithms based on proximal point methods, which have the nice form of transforming the hybrid inference problem into a sequence of standard marginalization problems. We show that our algorithms significantly outperform existing approaches in terms of both empirical performance and theoretical properties.

Cover page: Reasoning and Decisions in Probabilistic Graphical Models - A Unified Framework

Thesis
Peer Reviewed

Learning and Inference in Latent Variable Graphical Models

Ping, Wei
Advisor(s): Ihler, Alexander

UC Irvine Electronic Theses and Dissertations (2016)

Probabilistic graphical models such as Markov random fields provide a powerful framework and tools for machine learning, especially for structured output learning. Latent variables naturally exist in many applications of these models; they may arise from partially labeled data, or be introduced to enrich model flexibility. However, the presence of latent variables presents challenges for learning and inference.

For example, the standard approach of using maximum a posteriori (MAP) prediction is complicated by the fact that, in latent variable models (LVMs), we typically want to first marginalize out the latent variables, leading to an inference task called marginal MAP. Unfortunately, marginal MAP prediction can be NP-hard even on relatively simple models such as trees, and few methods have been developed in the literature. This thesis presents a class of variational bounds for marginal MAP that generalizes the popular dual-decomposition method for MAP inference, and enables an efficient block coordinate descent algorithm to solve the corresponding optimization. Similarly, when learning LVMs for structured prediction, it is critically important to maintain the effect of uncertainty over latent variables by marginalization. We propose the marginal structured SVM, which uses marginal MAP inference to properly handle that uncertainty inside the framework of max-margin learning.

We then turn our attention to an important subclass of latent variable models, restricted Boltzmann machines (RBMs). RBMs are two-layer latent variable models that are widely used to capture complex distributions of observed data, including as building block for deep probabilistic models. One practical problem in RBMs is model selection: we need to determine the hidden (latent) layer size before performing learning. We propose an infinite RBM model and apply the Frank-Wolfe algorithm to solve the resulting learning problem. The resulting algorithm can be interpreted as inserting a hidden variable into a RBM model at each iteration, to easily and efficiently perform model selection during learning. We also study the role of approximate inference in RBMs and conditional RBMs. In particular, there is a common assumption that belief propagation methods do not work well on RBM-based models, especially for learning. In contrast, we demonstrate that for conditional models and structured prediction, learning RBM-based models with belief propagation and its variants can provide much better results than the state-of-the-art contrastive divergence methods.

Cover page: Learning and Inference in Latent Variable Graphical Models

Thesis
Peer Reviewed

Probabilistic Models for Brain Image Collection, Classication, and Functional Connectivity.

Keator, David Bryant
Advisor(s): Ihler, Alexander

UC Irvine Electronic Theses and Dissertations (2015)

The use of functional neuroimaging to evaluate brain disorders has become pervasive in the scientific community. The technique provides researchers with a means to evaluate dynamic in-vivo brain function. Over the last thirty years of using neuroimaging techniques to evaluate brain disorders, there is evidence suggesting some illnesses are characterized by differences in regional brain function whereas others by differences in regional connectivity. Disorders with gross anatomical and functional changes such as Alzheimer's disease and traumatic brain injury are often visually discernible in brain scans and differences quantifiable using typical mass univariate analysis techniques. Conversely, disorders with subtle functional changes (e.g. depression) or subtle changes in how the brain communicates (e.g. schizophrenia) are less amiable to existing analysis techniques. Detecting these subtle differences in molecular imaging data, often plagued by noisy measurements from the imaging system, further impedes our ability to gain valuable insights into brain disorders. In this dissertation we use a variety of tools from machine learning and probabilistic modeling to develop new models for decreasing noise in data captured from our imaging systems, improve feature extraction for detecting differences in regional brain function, and evaluate group-based functional connectivity models and their performance in settings with small sample sizes. Each of these models are presented separately with experiments designed to show improvements over existing methodologies and measures of accuracy in both disease classification and recovering gold-standard functional relationships in the brain.

Cover page: Probabilistic Models for Brain Image Collection, Classication, and Functional Connectivity.

Creative Commons 'BY' version 4.0 license

Thesis
Peer Reviewed

Exploiting Factor and State Space Symmetry for Inference in Graphical Models

Gallo, Nicholas
Advisor(s): Ihler, Alexander T

UC Irvine Electronic Theses and Dissertations (2020)

Probabilistic graphical models provide a powerful framework for representing and reasoning about complex systems composed of many small overlapping subsystems. Key problems in graphical models can be formulated as probabilistic inference queries, such as computing the most likely configuration or the marginal probability of a random variable. Since these problems are intractable in general, a large class of approximate inference algorithms that exploit the factorization structure of graphical models have been developed.

In addition to the classic factorization structure, many graphical models also possess symmetric structure where groups of objects in the model are indistinguishable. Two types of symmetry arise regularly in practice: first, a model with state space symmetry contains factors with groups of states that have identical values; second, a model with factor symmetry contains groups of factors that have identical factor tables.

Although efficient inference algorithms for models with perfect symmetry have been well developed, most real problems do not contain perfect symmetry. Often, for example, a model with a symmetric substructure is perturbed by asymmetric evidence factors. Consequently, there is a great need for methods that accurately approximate the desired inference quantities using symmetric inference terms. While a few methods to address this problem exist, many are heuristic or address only certain aspects of the problem.

The goal of this thesis is to develop more intelligent ways to perform inference in graphical models with approximate symmetry. Our central strategy is to force groups of parameters in a variational inference relaxation to be symmetric. These symmetry groups are iteratively broken in a controlled manner to capture problem asymmetries more accurately; this procedure gives rise to a flexible class of coarse-to-fine inference algorithms. Furthermore, by using intelligently structured parameter symmetries, we are able to construct symmetric high-order inference terms which are often necessary to obtain high accuracy inference estimates. We develop algorithms both for models with state space symmetry and for models with factor symmetry, highlighting the deep similarities between these two classes of problems which has not been widely appreciated before.

Cover page: Exploiting Factor and State Space Symmetry for Inference in Graphical Models

Thesis
Peer Reviewed

Variational Methods for Optimal Experimental Design

Kennamer, Noble William
Advisor(s): Ihler, Alexander

UC Irvine Electronic Theses and Dissertations (2022)

In this work we study variational methods for Bayesian optimal experimental design (BOED). Experimentation is a cornerstone of science and is central to any major engineering effort. Often experiments require the use of substantial resources, from expensive equipment to limited researcher time; in addition, experiments can be dangerous or may be required to be completed in a given period of time. For these reasons, we prefer to conduct our experiments as efficiently as possible, acquiring as much information as we can given the resources available to us. Optimal experimental design (OED) is a sub-field of statistics focused on developing methods for accomplishing this goal. The OED problem is formulated by defining a utility function over designs and optimizing this function over the set of all feasible designs. We focus on the \emph{Expected Information Gain} (EIG), a widely used utility function with sound theoretical support. However, in practice the EIG is intractable to compute, and approximation strategies are required. We investigate the use of variational methods for this purpose and show substantial improvement over competing approximation techniques. A specific form of OED common in the field of machine learning (ML) is \emph{active learning} (AL). In the active learning framework, we would like to obtain a labeled dataset in order to train a supervised model. However, for all the reasons stated, labeling data points can be costly and again we should make efficient use of our labeling resources. We present a novel application of active learning to optimize spectroscopic follow up for large scale astronomical surveys. Finally, much of this work requires learning functions over sets which we know must satisfy certain properties (e.g., permutation invariance). We conclude the thesis by presenting a novel neural network architecture for predicting the astronomical class of individual objects in the same exposure using a neural architecture specifically designed to accommodate known inductive biases present in the data.

Cover page: Variational Methods for Optimal Experimental Design

Thesis
Peer Reviewed

Design of Cross-Layer MAC and Routing Protocols for Autonomous UAV Networks

UC Irvine Electronic Theses and Dissertations (2022)

Autonomous networks of unmanned aerial vehicles (UAVs) have many civilian and military applications. These networks experience a wide variety of network configurations and communication constraints (including node density, speed, and trajectory), resulting in a highly dynamic and unpredictable network topology. In addition, these networks support diverseand time-varying applications that can include different traffic types and priorities, data generation rates, session lengths, and reliability and latency tolerance.

In this dissertation, we develop distributed, cross-layer medium access control (MAC) and routing protocols to provide robust and reliable communication in autonomous and decentralized UAV networks, in which the network topology and traffic conditions change frequently and the future node trajectories are not known.

First, we present a mathematical framework to compute the link lifetime for a realistic node mobility model, followed by the design of a novel, distributed time division multiple access (TDMA) scheme for directional communication in multihop networks. This scheme includes a low-complexity, rank-based scheduling mechanism, which effectively adapts to the changes in the network and quality of service (QoS) demands in real-time with significantly reduced overhead and delay, and improves both channel utilization and fairness in channel access allocation.

In the subsequent chapters, we focus on routing protocols, which discover and select high-quality routes, and switch to alternate routes in response to changes in the available communication resources, observed traffic patterns, and performance demands to make the best use of the network resources.

Traditional topology-based routing schemes are slow to adapt to changes in topology and traffic, and typically select a route without considering the effect of intra-flow interference on the selected route. To address these issues, we present an adaptive, cross-layer, mobility and congestion-aware proactive routing protocol for decentralized UAV networks. Our protocol includes a novel, multi-step and multi-metric, inter- and intra-flow interference-aware route selection mechanism, which selects a stable, longer-lasting and less congested route. It uses a preemptive route switching mechanism to prevent potential packet drops due to congestion and topology changes, and a periodic queue management mechanism to prioritize transmitting packets with a lower survivability score, and discard packets that are likely to expire before reaching their destination.

Proactive routing protocols can incur large control and computational overhead, and may be vulnerable to the security threats. In contrast, reactive routing protocols incur much lower control and computation overhead, but the resulting, on-demand route discovery introduces large routing overhead and delay in settings with frequent topology changes and link breaks, such as UAV networks. We address these issues via a novel, hybrid mobility- and congestion-aware reactive routing protocol, which discovers routes on demand and preemptively switches to another high-quality route within the region around the selected route. This significantly reduces the number of route discoveries and overhead from route control and computation. Despite having limited network topology information, our proposed routing scheme providessuperior flow throughput performance.

We show via network simulation results that our proposed MAC and routing protocols significantly outperform existing schemes across a variety of different network and traffic settings.

Cover page: Design of Cross-Layer MAC and Routing Protocols for Autonomous UAV Networks

Article
Peer Reviewed

Gibbs Sampling for (Coupled) Infinite Mixture Models in the Stick Breaking Representation

UC Irvine Previously Published Works (2012)

Nonparametric Bayesian approaches to clustering, information retrieval, language modeling and object recognition have recently shown great promise as a new paradigm for unsupervised data analysis. Most contributions have focused on the Dirichlet process mixture models or extensions thereof for which efficient Gibbs samplers exist. In this paper we explore Gibbs samplers for infinite complexity mixture models in the stick breaking representation. The advantage of this representation is improved modeling flexibility. For instance, one can design the prior distribution over cluster sizes or couple multiple infinite mixture models (e.g. over time) at the level of their parameters (i.e. the dependent Dirichlet process model). However, Gibbs samplers for infinite mixture models (as recently introduced in the statistics literature) seem to mix poorly over cluster labels. Among others issues, this can have the adverse effect that labels for the same cluster in coupled mixture models are mixed up. We introduce additional moves in these samplers to improve mixing over cluster labels and to bring clusters into correspondence. An application to modeling of storm trajectories is used to illustrate these ideas.

Cover page: Gibbs Sampling for (Coupled) Infinite Mixture Models in the Stick Breaking Representation

Article
Peer Reviewed

Feed-forward hierarchical model of the ventral visual stream applied to functional brain image classification

ICTS Publications (2014)