[DTC-E1] WF7101: Data Input Workflow
Version 1

Workflow Type: Common Workflow Language
Work-in-progress

Overview

This repository contains the Common Workflow Language (CWL) and RO-Crate metadata definition for WF7101, the Data Input workflow of DTC-E1, developed under the DT-GEO project.

The Data Input workflow (WF7101) serves as the entry point for the DTC-E1 pipeline. It collects, validates, and prepares essential datasets for downstream workflows, including Seismogenic Source Models (WF7102), Computational Hazard (WF7103), and Computational Risk (WF7104).

It integrates multiple data sources (DT7101–DT7103) representing crustal faults, subduction zones, and active fault datasets, and ensures standardized outputs ready for subsequent modeling workflows.

Workflow Structure

The workflow is composed of steps (ST) and datasets (DT), following DT-GEO’s standardized schema. The general data flow is:

  1. Workflow Inputs

    • DT7101 – Active fault data repository.
    • DT7102 – Regional crustal fault dataset.
    • DT7103 – Subduction zone dataset.
  2. Preprocessing Steps

    • ST710101–ST710105: Validate, harmonize, and convert data into uniform formats compatible with downstream workflows.
  3. Workflow Outputs

    • SUBDUCTION_ZONES – Standardized subduction zone model dataset.
    • CRUSTAL_FAULT_MODEL – Cleaned and validated crustal fault data.
    • ACTIVE_FAULT_DATASET – Finalized active fault model used by WF7102.

Inputs

ID Name Description Type
DT7101 n/a European instrumental earthquake catalogue (EMEC).
  • File
DT7102 n/a European PreInstrumental Earthquake Catalogue (EPICA)
  • File
DT7103 n/a European Fault-Source Model.
  • File

Steps

ID Name Description
ST710101 n/a Harmonization of Earthquake Catalogs.
ST710103 n/a Statistical Analysis of the Earthquake Catalog.
ST710105 n/a Update Crustal Faults Model and Subduction Zones.
ST710102 n/a Data Cleaning and Feature Selection.
ST710104 n/a Harmonize Active Fault Dataset.

Outputs

ID Name Description Type
CRUSTAL_FAULT_MODEL n/a Output dataset for ST710105.
  • File
SUBDUCTION_ZONES n/a Output dataset for ST710105.
  • File
ACTIVE_FAULT_DATASET n/a Output dataset of ST710104.
  • File

Version History

Version 1 (earliest) Created 14th Oct 2025 at 13:31 by Johannes Kemper

Initial commit


Open master b81fc15
help Creators and Submitter
Creators
  • Laurentiu Danciu
  • Nicolas Schmid
  • Johannes Kemper
Submitter
Activity

Views: 39   Downloads: 1

Created: 14th Oct 2025 at 13:31

help Attributions

None

Total size: 45.4 KB
Powered by
(v.1.17.0-main)
Copyright © 2008 - 2025 The University of Manchester and HITS gGmbH