Ulutan, Oytun; Riggan, Benjamin S; Nasrabadi, Nasser M; Manjunath, BS; Ulutan, Oytun; Riggan, Benjamin S; Nasrabadi, Nasser M; Manjunath, BS

doi:10.1109/wacv.2018.00132

Download PDF

An Order Preserving Bilinear Model for Person Detection in Multi-Modal Data

2018

Published Web Location

https://doi.org/10.1109/wacv.2018.00132

Abstract

We propose a new order preserving bilinear framework that exploits low-resolution video for person detection in a multi-modal setting using deep neural networks. In this setting cameras are strategically placed such that less robust sensors, e.g. geophones that monitor seismic activity, are located within the field of views (FOVs) of cameras. The primary challenge is being able to leverage sufficient information from videos where there are less than 40 pixels on targets, while also taking advantage of less discriminative information from other modalities, e.g. seismic. Unlike state-of-the-art methods, our bilinear framework retains spatio-temporal order when computing the vector outer products between pairs of features. Despite the high dimensionality of these outer products, we demonstrate that our order preserving bilinear framework yields better performance than recent orderless bilinear models and alternative fusion methods.

Many UC-authored scholarly publications are freely available on this site because of the UC's open access policies. Let us know how this access is important for you.

Main Content

For improved accessibility of PDF content, download the file to your device.

UC Santa Barbara

An Order Preserving Bilinear Model for Person Detection in Multi-Modal Data

Published Web Location