TempEst 2 Development Data: Observed Stream Temperature, Covariates, Performance Data, and Analysis Notebooks


Authors:
Owners: Daniel Philippus
Type: Resource
Storage: The size of this resource is 1.7 GB
Created: Sep 16, 2024 at 7:46 p.m.
Last updated: Feb 04, 2025 at 3:44 p.m.
Published date: Feb 04, 2025 at 3:44 p.m.
DOI: 10.4211/hs.a8b243957f7946e388d10ab206990675
Citation: See how to cite this resource
Sharing Status: Published
Views: 529
Downloads: 35
+1 Votes: Be the first one to 
 this.
Comments: No comments (yet)

Abstract

This resource contains code and data related to the development of the TempEst 2 stream temperature remote sensing model (manuscript in review with Journal of Hydrology). The code includes the model implementation (model.R), some utility functions (valfn.R), data retrieval scripts for Google Earth Engine (eeretrieval.py and datapts.py), and a reproducible validation notebook (validation.Rmd), along with the knitted PDF of the latter (validation.pdf). The main data include stream temperature daily mean/max observations retrieved from the USGS NWIS as well as remotely-sensed and gridded observations retrieved using Google Earth Engine from NLDAS, ESA WorldCover, MODIS, ERA5, and EPA Ecoregions (using eeretrieval.py). These are contained in three files. AllData.csv includes all observations for mean temperature. ExtData.csv ("extended data") adds maximum temperature, at the expense of fewer total observations being included. Ecoregions.csv is not central to the analysis, but includes EPA Level I ecoregion classifications for convenience.

Model performance tests can be reproduced using validation.Rmd. To run validation.Rmd in full, there must be a Data directory with subdirectories Density and TSLen, a Figures directory, at least one of the main data files (AllData.csv, ExtData.csv) or equivalent, and Ecoregions.csv. A knitted version of the Notebook is included in this resource. The error map plots also use an EPA Level I Ecoregions (https://gaftp.epa.gov/EPADataCommons/ORD/Ecoregions/cec_na/na_cec_eco_l1.zip) shapefile, which is assumed to be in an Ecoregions subdirectory of the *parent* directory. This dependency can be removed by replacing the `plot.eco` function with ordinary ggplot plotting.

The two rda (RData) files contain different versions of a pre-trained model. model.rda contains a regular, pre-trained model function that can be used directly to generate predictions. krigs.rda contains a list of the actual fitted kriging models, which can be used for investigating model components (see demo.pdf).

These data and related items of information have not been formally disseminated by NOAA, and do not represent any agency determination, view, or policy.

Coverage

Spatial

Coordinate System/Geographic Projection:
WGS 84 EPSG:4326
Coordinate Units:
Decimal degrees
Place/Area Name:
Contiguous United States
North Latitude
49.0000°
East Longitude
-67.3000°
South Latitude
25.1400°
West Longitude
-124.0600°

Temporal

Start Date: 01/02/2001
End Date: 12/31/2022
Leaflet Map data © OpenStreetMap contributors

Content

    No files to display.

Credits

Funding Agencies

This resource was created using funding from the following sources:
Agency Name Award Title Award Number
National Oceanic and Atmospheric Administration Cooperative Institute for Research to Operations in Hydrology NA22NWS4320003

How to Cite

Philippus, D., C. R. Corona, K. Schneider, A. Rust, T. S. Hogue (2025). TempEst 2 Development Data: Observed Stream Temperature, Covariates, Performance Data, and Analysis Notebooks, HydroShare, https://doi.org/10.4211/hs.a8b243957f7946e388d10ab206990675

This resource is shared under the Creative Commons Attribution CC BY.

http://creativecommons.org/licenses/by/4.0/
CC-BY

Comments

There are currently no comments

New Comment

required