Tanu Malik
Recent Activity
ABSTRACT:
This HydroShare resource provides the Jupyter Notebooks created for the study "An Approach for Creating Immutable and Interoperable End-to-End Hydrological Modeling Computational Workflows" led by researcher Young-Don Choi submitted to the 2021 EarthCube Annual meeting, Notebook Sessions.
To find out the instructions on how to run Jupyter Notebooks, please refer to the README file provided in this resource.
For the sake of completeness, the abstract for the study submitted to the EarthCube session is mentioned below:
"Reproducibility is a fundamental requirement to advance science. Creating reproducible hydrological models that include all required data, software, and workflows, however, is often burdensome and requires significant work. Computational hydrology is a rapidly advancing field with fast-evolving technologies to support increasingly complex computational hydrologic modeling. The growing model complexity in terms of variety of software and cyberinfrastructure capabilities makes achieving computational reproducibility extremely challenging. Through recent reproducibility research, there have been efforts to integrate three components: 1) (meta)data, 2) computational environments, and 3) workflows. However, each component is still separate, and researchers must interoperate between these three components. These separations make verifying end-to-end reproducibility challenging. Sciunit was developed to assist scientists, who are not programming experts, with encapsulating these three components into a container to enable reproducibility in an immutable form. However, there were still limitations to support interoperable computational environments and apply end-to-end solutions, which are an ultimate goal of reproducible hydrological modeling. Therefore, the objective of this research is to advance the existing Sciunit capabilities to not only support immutable, but also interoperable computational environments and apply an end-to-end modeling workflow using the Regional Hydro-Ecologic Simulation System (RHESSys) hydrologic model as an example. First, we create an end-to-end workflow for RHESSys using pyRHESSys on the CyberGIS-Jupyter for Water platform. Second, we encapsulate the aforementioned three components and create configurations that include lists of encapsulated dependencies using Sciunit. Third, we create two HydroShare resources, one for immutable reproducibility evaluation using Sciunit and the other for interoperable reproducibility evaluation using library configurations created by Sciunit. Finally, we evaluate the reproducibility of Sciunit in MyBinder, which is a different computational environment, using these two resources. This research presents a detailed example of a user-centric case study demonstrating the application of an open and interoperable containerization approach from a hydrologic modeler’s perspective."
ABSTRACT:
This HydroShare resource is an example to demonstrate the vPICO presentations in EGU General Assembly 2021 (https://meetingorganizer.copernicus.org/EGU21/session/40092#vPICO_presentations).
- Session: EOS5.3 session - The evolving open-science landscape in geosciences: open data, software, publications, and community initiatives
- Title: An Approach for Open and Reproducible Hydrological Modeling using Sciunit and HydroShare
Using this notebook, you can test how to create an immutable and interoperable Sciunit Container for open and reproducible hydrological modeling.
You can start using "NB_01_An_Approach_for_Open_and_Reproducible_Hydrological_Modeling_using_Sciunit_and_HydroShare.ipynb" notebook in "CyberGIS-Jupyter for water" after clicking "Open with...". in Right-Above.
ABSTRACT:
This is an example of Geoscience Use Case 4: Height Above the Nearest Drainage (HAND) of "Improving Reproducibility of Geoscience Models with Sciunit" in the Geological Society of America publication. In this resource, there are two notebooks: 1) HANDWorkFlow.ipynb and 2) HAND_Sciunit.ipynb.
Using these two notebooks, we demonstrate the capabilities of Sciunit to encapsulate the HAND TauDEM workflow and create a Sciunit Container, and evaluate differences in HAND due to changing the contributing area threshold used to map the drainage network. During computation of the drainage network, a minimum contributing area threshold is used to identify the channel beginning. With a lower threshold value, the density of the resulting drainage network increases. Scientists running this experiment might be interested in finding out how the threshold makes a difference in the execution and result of the HAND model.
The first notebook demonstrates the general procedure to calculate HAND (Height above the Nearest Drainage) using TauDEM (https://hydrology.usu.edu/taudem/taudem5/).
Then using the second notebook we demonstrate how to create a Sciunit container for HAND Workflow and compare two Sciunit containers (5000 vs 50000 thresholds) using `diff` command.
ABSTRACT:
This is shared.
Contact
(Log in to send email) |
All | 0 |
Collection | 0 |
Resource | 0 |
App Connector | 0 |
ABSTRACT:
This is shared.
Created: Dec. 21, 2020, 8:45 a.m.
Authors: Tarboton, David
ABSTRACT:
This is an example of Geoscience Use Case 4: Height Above the Nearest Drainage (HAND) of "Improving Reproducibility of Geoscience Models with Sciunit" in the Geological Society of America publication. In this resource, there are two notebooks: 1) HANDWorkFlow.ipynb and 2) HAND_Sciunit.ipynb.
Using these two notebooks, we demonstrate the capabilities of Sciunit to encapsulate the HAND TauDEM workflow and create a Sciunit Container, and evaluate differences in HAND due to changing the contributing area threshold used to map the drainage network. During computation of the drainage network, a minimum contributing area threshold is used to identify the channel beginning. With a lower threshold value, the density of the resulting drainage network increases. Scientists running this experiment might be interested in finding out how the threshold makes a difference in the execution and result of the HAND model.
The first notebook demonstrates the general procedure to calculate HAND (Height above the Nearest Drainage) using TauDEM (https://hydrology.usu.edu/taudem/taudem5/).
Then using the second notebook we demonstrate how to create a Sciunit container for HAND Workflow and compare two Sciunit containers (5000 vs 50000 thresholds) using `diff` command.
Created: April 7, 2021, 4:54 a.m.
Authors: Choi, Young-Don
ABSTRACT:
This HydroShare resource is an example to demonstrate the vPICO presentations in EGU General Assembly 2021 (https://meetingorganizer.copernicus.org/EGU21/session/40092#vPICO_presentations).
- Session: EOS5.3 session - The evolving open-science landscape in geosciences: open data, software, publications, and community initiatives
- Title: An Approach for Open and Reproducible Hydrological Modeling using Sciunit and HydroShare
Using this notebook, you can test how to create an immutable and interoperable Sciunit Container for open and reproducible hydrological modeling.
You can start using "NB_01_An_Approach_for_Open_and_Reproducible_Hydrological_Modeling_using_Sciunit_and_HydroShare.ipynb" notebook in "CyberGIS-Jupyter for water" after clicking "Open with...". in Right-Above.
Created: April 29, 2021, 5:10 p.m.
Authors: Choi, Young-Don · Goodall, Jonathan · Maghami, Iman · Ahmad, Raza · Malik, Tanu · Band, Lawrence · Li, Zhiyu/Drew · Wang, Shaowen · Tarboton, David
ABSTRACT:
This HydroShare resource provides the Jupyter Notebooks created for the study "An Approach for Creating Immutable and Interoperable End-to-End Hydrological Modeling Computational Workflows" led by researcher Young-Don Choi submitted to the 2021 EarthCube Annual meeting, Notebook Sessions.
To find out the instructions on how to run Jupyter Notebooks, please refer to the README file provided in this resource.
For the sake of completeness, the abstract for the study submitted to the EarthCube session is mentioned below:
"Reproducibility is a fundamental requirement to advance science. Creating reproducible hydrological models that include all required data, software, and workflows, however, is often burdensome and requires significant work. Computational hydrology is a rapidly advancing field with fast-evolving technologies to support increasingly complex computational hydrologic modeling. The growing model complexity in terms of variety of software and cyberinfrastructure capabilities makes achieving computational reproducibility extremely challenging. Through recent reproducibility research, there have been efforts to integrate three components: 1) (meta)data, 2) computational environments, and 3) workflows. However, each component is still separate, and researchers must interoperate between these three components. These separations make verifying end-to-end reproducibility challenging. Sciunit was developed to assist scientists, who are not programming experts, with encapsulating these three components into a container to enable reproducibility in an immutable form. However, there were still limitations to support interoperable computational environments and apply end-to-end solutions, which are an ultimate goal of reproducible hydrological modeling. Therefore, the objective of this research is to advance the existing Sciunit capabilities to not only support immutable, but also interoperable computational environments and apply an end-to-end modeling workflow using the Regional Hydro-Ecologic Simulation System (RHESSys) hydrologic model as an example. First, we create an end-to-end workflow for RHESSys using pyRHESSys on the CyberGIS-Jupyter for Water platform. Second, we encapsulate the aforementioned three components and create configurations that include lists of encapsulated dependencies using Sciunit. Third, we create two HydroShare resources, one for immutable reproducibility evaluation using Sciunit and the other for interoperable reproducibility evaluation using library configurations created by Sciunit. Finally, we evaluate the reproducibility of Sciunit in MyBinder, which is a different computational environment, using these two resources. This research presents a detailed example of a user-centric case study demonstrating the application of an open and interoperable containerization approach from a hydrologic modeler’s perspective."