Lawrence Berkeley National Laboratory
Statistical Projections for Multi-dimensional Visual Data Exploration and Analysis
- Author(s): Bethel, EW
- Stone, D
- Nguyen, H
- et al.
Published Web Locationhttps://doi.org/10.1109/LDAV.2016.7874338
When working with large, multidimensional and multivariate data, science users are frequently interested in understanding variation in data, as opposed to the actual data values. Our work focuses on exploring how a simple statistical metric, the Coefficient of Variation (or Cv), can be used in several different ways to facilitate understanding variation in data. As a statistical measure, it offers a key advantage over more widely accepted measures like standard deviation, namely to its ability to capture local variation properties. As a multidimensional projection operator, Cv is an effective way of reducing data size while preserving the key variational signal. Visualizations produced from Cv that target conveying variation in data are highly informative, especially compared to those produced with more widely known methods. We demonstrate these ideas within the context of a two-part application case study focusing on understanding long-term trends in the the changes in precipitation and winds in large-scale climate model ensemble output.