This study explores the implications of different modeling choices when predicting mortality during intensive care visits using recurrent neural networks. Using the MIMIC-III database, models were trained and tested with varying memory cells, architectures, and other hyperparameters. Performance gains from incorporating information from unstructured clinical notes were tested as well. The study finds that a range of relatively shallow networks with varying memory cells and architectures can perform well and produce similar results, all of which outperform traditional mortality risk scores such as SAPS II. Adding information from clinical notes boosts model performance even with a simple natural language processing algorithm. Although methodological differences make direct comparisons complicated, the most accurate model presented here achieves an AUROC score of 0.943, which represents a slight improvement over similar prior work.
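The AUROC metric the abstract reports can be computed directly from a model's predicted risks via the Mann-Whitney rank-sum formulation. The sketch below is illustrative only: the labels and scores are toy data, not MIMIC-III outputs, and ties between scores are not handled.

```python
import numpy as np

def auroc(y_true, scores):
    """AUROC via the rank-sum (Mann-Whitney U) formulation.

    Equals the probability that a randomly chosen positive example
    receives a higher score than a randomly chosen negative one.
    Assumes no tied scores (toy sketch, not a production metric).
    """
    y_true = np.asarray(y_true)
    scores = np.asarray(scores)
    order = scores.argsort()
    ranks = np.empty(len(scores), dtype=float)
    ranks[order] = np.arange(1, len(scores) + 1)
    n_pos = int(y_true.sum())
    n_neg = len(y_true) - n_pos
    # Rank sum of the positives, shifted by the minimum possible rank sum
    u = ranks[y_true == 1].sum() - n_pos * (n_pos + 1) / 2
    return u / (n_pos * n_neg)

# Toy example: two positives, two negatives
y = [0, 0, 1, 1]
s = [0.1, 0.4, 0.35, 0.8]
print(auroc(y, s))  # 0.75
```

A score of 0.5 corresponds to random ranking and 1.0 to perfect separation, which is why the 0.943 reported above indicates strong discrimination between surviving and non-surviving patients.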
This thesis explores the role of inductive biases in multi-stage machine learning problems. Modern machine learning often involves multiple steps of preprocessing, training, and adaptation, and models may be deployed to make many decisions over time. These complex pipelines can obscure the impact of specific biases on the final model's performance. In chapter 2, we investigate the role of batch active learning in graph-based semi-supervised learning, demonstrating improved accuracy and efficiency through theoretical motivation and empirical validation. In chapters 3 and 4, we investigate the role of stratification in non-negative matrix factorization and tensor factorization, developing efficient multiplicative-update algorithms and demonstrating their effectiveness on synthetic and real-world datasets. In chapter 5, we investigate the role of topological message-passing in relational structures. We propose a unifying framework for topological message-passing networks and demonstrate its effectiveness in mitigating oversquashing. This framework unifies many topological deep learning (TDL) methods under a common axiomatic framework, allowing for consistent theoretical analysis and greater understanding of the algebraic and topological tools employed in TDL. In chapter 6, we investigate the role of zero-shot context generalization in reinforcement learning, proposing a novel method and demonstrating its effectiveness in improving model performance. This provides a straightforward extension of many off-policy reinforcement learning methods, which improves generalization to unseen contexts. Through these investigations, we provide a comprehensive theoretical and empirical analysis of the aforementioned inductive biases in multi-stage machine learning problems. Our findings highlight the critical role of these biases in enhancing model performance and their broad applicability across diverse domains.
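The multiplicative-update algorithms mentioned for chapters 3 and 4 build on the classical Lee-Seung updates for non-negative matrix factorization. The sketch below shows only that standard baseline, factorizing X ≈ WH under nonnegativity; the thesis's stratified variants are not reproduced here, and the dimensions and iteration count are arbitrary choices for illustration.

```python
import numpy as np

def nmf_multiplicative(X, rank, n_iter=200, eps=1e-10, seed=0):
    """Classical Lee-Seung multiplicative updates for NMF.

    Each update rescales W and H by ratios of nonnegative terms,
    so factors stay nonnegative and the Frobenius reconstruction
    error is non-increasing.
    """
    rng = np.random.default_rng(seed)
    m, n = X.shape
    W = rng.random((m, rank)) + eps
    H = rng.random((rank, n)) + eps
    for _ in range(n_iter):
        # H <- H * (W^T X) / (W^T W H)
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        # W <- W * (X H^T) / (W H H^T)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Toy usage on a random nonnegative matrix
X = np.random.default_rng(1).random((20, 15))
W, H = nmf_multiplicative(X, rank=4)
err = np.linalg.norm(X - W @ H)
```

Because every factor entry is updated by multiplying with a nonnegative ratio, no projection step is needed to maintain the nonnegativity constraint, which is what makes this family of algorithms attractive to extend.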