Open Access Publications from the University of California

Workflows and extensions to the Kepler scientific workflow system to support environmental sensor data access and analysis

Author(s):
  • Barseghian, Derik
  • Altintas, Ilkay
  • Jones, Matthew B
  • Crawl, Daniel
  • Potter, Nathan
  • Gallagher, James
  • Cornillon, Peter
  • Schildhauer, Mark
  • Borer, Elizabeth T
  • Seabloom, Eric W
  • Hosseini, Parviez R
  • et al.

Environmental sensor networks are now commonly deployed within environmental observatories and as components of smaller-scale ecological and environmental experiments. Using data from these networks effectively presents technical challenges that are difficult for scientists to overcome, which severely limits the adoption of automated sensing technologies in environmental science. The Realtime Environment for Analytical Processing (REAP) is an NSF-funded project that addresses the technical challenges of accessing and using heterogeneous sensor data from within the Kepler scientific workflow system. Using distinct use cases in terrestrial ecology and oceanography as motivating examples, we describe workflows and extensions to Kepler that stream and analyze data from observatory networks and archives. We focus on two data sources newly integrated into Kepler: DataTurbine and OPeNDAP. Integrated access to near real-time data streams and data archives from within Kepler facilitates both simple data exploration and sophisticated analysis and modeling with these data sources.

