Skip to main content
eScholarship
Open Access Publications from the University of California

UC San Diego

UC San Diego Electronic Theses and Dissertations bannerUC San Diego

Learning on the Space of Probability Measures

Abstract

This thesis studies computationally feasible machine learning methods, based on optimal transport and neural network theory, applied to measure-valued data. We first analyze linearized optimal transport (LOT), which essentially embeds measure-valued data into an L2 space, where out-of-the-box machine learning techniques are available. We analyze the situations when LOT provides an isometric embedding with respect to the Wasserstein-2 distance and provide necessary bounds when we can achieve a pre-specified linear separation level in the LOT embedding space. Second, we produce a computationally feasible algorithm to recover low-dimensional structures in measure-valued data by using the LOT embedding along with dimensionality reduction techniques. Using computational methods for solving optimal transport problems such as the Sinkhorn algorithm or linear programming, we provide approximation guarantees in terms of the sampling rates. Third, we study structured approximations of measures in Wasserstein space by a scaled Voronoi partition generated from a full rank lattice. We show that these structured approximations match rates of optimal quantizers and empirical measure approximation in most instances. We then extend these results for noncompactly supported measures that decay fast enough. Finally, we study methods for comparing probability measures by analyzing a neural network two-sample test. In particular, we perform time-analysis on a related neural tangent kernel (NTK) two-sample test and extend the analysis to the neural network two-sample test with a small-time training regime. We also show the amount of time needed before the two-sample test detects a deviation in the case the probability measures considered are different versus when they are the same.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View