Workflow Type: Common Workflow Language
Work-in-progress

This repository contains a Common Workflow Language (CWL) and Ro-Crate metadata definition for DTC-T1 workflow 6101, which is designed for providing tsunami impact forecasting following a tsunamigenic earthquake event, based on a probabilistic approach. The workflow integrates real-time earthquake data, runs HPC simulations, and generates tsunami hazard maps.

The main CWL implementation is found in WF6101.cwl, together with ST610106 and ST610109 describing complex steps. The file mini-ST610106.cwl is refereed to a specific realization of the ST610106, where different alternative options are included.

To see a preview of the RO-Crate metadata the ro-crate-preview.html file can be opeens in the browser.

Overview

The DTC-T1 Digital Twin component operates a single workflow (WF6101) that runs 11 main steps. DTC-T1 is also referred to as PTF, Probabilistic Tsunami Forecasting.

The workflow is initialized by a potentially tsunamigenic earthquake event. Based on the real-time event parameters together with long-term information retrieved from a regional-scale tsunami hazard model, an ensemble of earthquake scenarios compatible with the occurring event is built, where each scenario is weighted by the probability of being a good representation of the actual event. The tsunami impact computed by numerical modeling and the scenario probabilities are aggregated into exceedance probabilities for given tsunami intensities at given coastal points to calculate the hazard curves and subsequently the hazard maps.

The workflow is designed to operate in (near-)real-time and to be employed at Tsunami Warning Centres for operational tsunami early warning and forecasting, as well as for rapid post-event assessment.

Workflow structure

The workflow consists of multiple steps (ST), datasets (DT), and software services (SS). Below is a simplified breakdown:

  • The Scenario Player (ST610101) acts like a data archive, storing observations (synthetic or measured) (DT6101) recorded by external data providers (i.e. seismic data, labeled as DT6103, sea level data, labeled as DT6104, and GNSS data, labeled as DT6105) and sending them to specific data listeners.

  • The Listener EQ (ST610102) evaluates the earthquake information that initialise the workflow execution, while Listeners SL (ST610103) and GNSS (ST610104) elaborate observational sea level and ground deformation data for further comparison with simulated data.

  • The Ensemble Manager (ST610105) combines the information from the Listener EQ with the long-term information from a regional hazard model (DT6102) and defines a list of scenarios (DT6106) compatible with the triggering event, with their associated probabilities (DT6107).

  • The ensemble of earthquake scenarios is provided as input to the Scenario Modelling (ST610106), which makes use of pre-defined topo-bathymetric grids (DT6109) and computes tsunami intensities (DT6110) and ground deformation (DT6111). In the modeling step different alternative and/or combined models can be used in each single workflow realization. More specifically, different source and tsunami models can be combined through the following software: Tsunami-HySEA (SS6107), SeisSol (SS6106), BingClaw (SS6109), Shaltop (SS6110), Landslide-HySEA (SS6108), Source-to-wave filter (SS6112), and the Inundation AI module (SS6111). User settings will dictate which model combinations are invoked. As an alternative, precomputed scenarios can be simply retrieved (SS6119).

  • The Misfit Evaluator (ST610107) combines simulation outputs with eventual SL and GNSS data to evaluate the degree of confidence between the simulated scenarios and observations. According to the computed misfit, the initial scenario ensemble and related probabilities can be updated.

  • The Hazard Aggregator (ST610108) aggregates scenario probabilities and impact metrics to calculate hazard curves at multiple forecast points (DT6112).

  • The AL (Alert level) and Visualization step (ST610109) makes the final processing of the results, converting hazard into Alert Levels and producing probabilistic visual maps (DT6113). It is worth noting that only the Tsunami Service Providers (TSP) are allowed in producing and issuing the AL for operational tsunami early warning and forecasting.

  • In case of landslide-triggered tsunamis, input in the form of shake maps generated for the occurring event through the Earthquake Modeling (ST610111) is forwarded to the Landslide Scenario Manager (ST610110), which in turn feeds landslide scenarios (DT6108) to the Scenario Modelling.

Workflow diagram

Here is a visual overview of the described workflow, and the corresponding CWL graph automatically generated from the metadata information stored in the CWL workflow file.

wf_diag

cwl_wf

The mini-ST610106 for operationalization

We focused on one realization of the Scenario Modeling step (hereinafter called mini-ST610106), where the T-HySEA code (SS6107) is used to model both the earthquake source and the following tsunami, and only tsunami intensities (DT6110) are saved. In this experiment, the singularity image of the T-HySEA code created with the eFlows4HPC image service creation is used. Moreover, PyCOMPSs functionalities have been implemented in order to execute the mini-ST610106 as a COMPSs job on the Booster partition of the Leonardo cluster at the HPC CINECA infrastructure.

cwl_mini

To enable execution, an executable CWL file containing only the information required to run the process has been produced. In this case, the processing step is implemented by the bash script run_gpu_compss.sh that uses PyCOMPSs to submit a job to the Leonardo cluster queue through the Slurm scheduler. The executable CWL file does not explicitly specify input and output data because the underlying processing step manages these details internally. An input configuration file input.yaml provides the necessary global input data.

How to execute the mini-ST610106

The mini-workflow depicted is executed using the following command:

cwltool --cachedir cache --preserve-entire-environment mini-ST610106.cwl input.yaml

In this command, the --cachedir flag is employed to save all files generated during execution for debugging purposes, while the --preserve-entire-environment flag ensures that the current environment variables are transferred into the custom execution environment established by cwltool. Once the command is launched, cwltool creates an isolated execution environment where it runs the bash script, which, in turn, submits the PyCOMPSs job to the computational node via Slurm, with the standard output and error captured in the cwl.out and cwl.err files, respectively.

Click and drag the diagram to pan, double click or use the controls to zoom.

Inputs

ID Name Description Type
DT6102 n/a NEAMTHM18
  • Directory
DT6103 n/a EIDA seismic data archive
  • Directory
DT6104 n/a SLSMF sea level data
  • Directory
DT6105 n/a GNSS displacements
  • Directory
DT6109 n/a topo-bathymetric grids
  • Directory

Steps

ID Name Description
ST610101 n/a SS6101
ST610102 n/a SS6102
ST610103 n/a SS6103
ST610104 n/a SS6104
ST610105 n/a SS6105
ST610106 n/a n/a
ST610107 n/a SS6113
ST610108 n/a SS6114
ST610109 n/a n/a
ST610110 n/a SS6117
ST610111 n/a SS6118

Outputs

ID Name Description Type
DT6101 n/a Scenario Library
  • Directory
DT6106 n/a list of earthquake scenarios
  • Directory
DT6107 n/a list of scenario probabilities
  • Directory
DT6108 n/a list of landslide scenarios
  • Directory
DT6110 n/a Tsunami intensities
  • Directory
DT6111 n/a Ground deformation
  • Directory
DT6112 n/a Tsunami hazard curves
  • Directory
DT6113 n/a Hazard visual products
  • Directory

Version History

main @ b8047c3 (latest) Created 30th Jun 2025 at 10:42 by Pablo Orviz

Edit README.md - AL


Frozen main b8047c3

main @ c324ab2 Created 21st Feb 2025 at 13:26 by Raül Sirvent

update preview


Frozen main c324ab2

main @ 3923678 (earliest) Created 20th Dec 2024 at 14:54 by Raül Sirvent

add ro-crate-preview


Frozen main 3923678
help Creators and Submitter
Creators
Not specified
Additional credit

Marco Salvi

Submitter
Activity

Views: 1018   Downloads: 304

Created: 20th Dec 2024 at 14:54

Last updated: 30th Jun 2025 at 10:47

help Attributions

None

Total size: 3.1 MB
Powered by
(v.1.17.0-main)
Copyright © 2008 - 2025 The University of Manchester and HITS gGmbH