Tandem EVolutionary Algorithm (TEVA) of Hanley et al (2020)


Authors:
Owners: Kristen L UnderwoodAbner Bogan
Type: Resource
Storage: The size of this resource is 10.8 MB
Created: Dec 20, 2024 at 1:46 p.m.
Last updated: Jan 14, 2025 at 5:39 p.m.
Citation: See how to cite this resource
Sharing Status: Public
Views: 351
Downloads: 376
+1 Votes: 1 other +1 this
Comments: No comments (yet)

Abstract

Underwood et al. (2023) have recently introduced the tandem evolutionary algorithm (TEVA) of Hanley et al. (2020) to the water resources and ecology domains, and applied it to identify features (catchment-scale attributes) and feature interactions important in determining patterns in Dissolved Organic Carbon across the continental US. TEVA has particular advantages for feature selection in large, multivariate observational data sets of complex systems like riverscapes or ecosystems, and has been shown to outperform logistic regression or Random Forest for identifying feature interactions and equifinality (Hanley et al., 2020; Anderson et al., 2020). TEVA finds interactions between multiple variables that may result from either additive processes or feature interactions, and not only extracts features significantly associated with a given outcome class(es), but also identifies the specific value ranges associated with those features (Underwood et al., 2023; Hanley, et al., 2020). This algorithm is also robust to issues of mixed data types (continuous, categorical), missing data, censored data, skewed distributions, and unbalanced target classes or clusters (Hanley et al., 2020).

When presented with n observations of p features across a study domain and a target of one or more classes or outcomes, the algorithm identifies and archives two types of clauses below a given fitness threshold. In the first pass, TEVA identifies Conjunctive Clauses (CCs) - a combination of variables that may or may not be correlated and somehow interact to produce an outcome. For example, an Extreme Flood may result from steep slopes + shallow soils + intense rainfall. A second pass of TEVA identifies Disjunctive Clauses (DCs) - a sequence of CCs that are linked with a logical “OR” statement. For example, an Extreme Flood may results from (steep slopes + shallow soils + intense rainfall) OR (high antecedent soil moisture + rainfall) OR (thick snow pack + high temperatures). DCs are multi-order, while the CCs comprising a DC can themselves range from first-order to multi-order (Underwood et al., 2023).

In this workshop, we illustrate the functionality of TEVA using a dataset of 91 observations from forested catchments across the CONUS of 54 catchment attributes inferred to have importance to DOC dynamics. Combinations of these catchment attributes were identified in CCs and DCs with high probability to be linked to an outcome class of High or Low mean DOC concentration. Target classes were assigned using Jenks natural breaks for 91 catchments with sufficient (≥3) observations of DOC in stream water to calculate a mean value. Originally, computation of TEVA was performed in the MATLAB programming language; the codebase has now been transferred to the open-source coding language Python, and is accessed through CUAHSI JupyterHub.

Content

    No files to display.

Credits

Funding Agencies

This resource was created using funding from the following sources:
Agency Name Award Title Award Number
U.S. National Science Foundation Collaborative Research: Network Cluster: Using Big Data approaches to assess ecohydrological resilience across scales EAR 2012123

Contributors

People or Organizations that contributed technically, materially, financially, or provided general support for the creation of the resource's content but are not considered authors.

Name Organization Address Phone Author Identifiers
Abner Bogan CUAHSI NY, US
Clara Cogswell CUAHSI MA, US
Ali Dadkhah University of Vermont VT, US
Ryan van der Heijden University of Vermont VT, US
Cailin Gramling University of Vermont VT, US
Shaurya Swami University of Vermont VT, US

How to Cite

Underwood, K. L., D. M. Rizzo, J. P. Hanley (2025). Tandem EVolutionary Algorithm (TEVA) of Hanley et al (2020), HydroShare, http://www.hydroshare.org/resource/6c3b9d9735994976a12cf5b7620f9ace

This resource is shared under the Creative Commons Attribution CC BY.

http://creativecommons.org/licenses/by/4.0/
CC-BY

Comments

There are currently no comments

New Comment

required