21 items tagged with 'workflow'.

Creating workflows with Common Workflow Language

(Show All)

This BioExcel best practice guide outlines the development process for writing a workflow using the Common Workflow Language (CWL), from creating and selecting tools like BioBB, through early experimentation, reuse and testing, to optimization and ensuring reproducibility before publication in workflow repositories.

Creators: Stian Soiland-Reyes, Douglas Lowe, Robin Long

Submitter: Stian Soiland-Reyes

External Link

Created: 5th Jul 2021 at 12:37

Methods Included: Standardizing Computational Reuse and Portability with the Common Workflow Language

Common Workflow Language (CWL) community

(Show All)

Abstract (Expand)

A widely used standard for portable multilingual data analysis pipelines would enable considerable benefits to scholarly publication reuse, research/industry collaboration, regulatory cost control, and …

Authors: Michael R. Crusoe, Sanne Abeln, Alexandru Iosup, Peter Amstutz, John Chilton, Nebojša Tijanić, Hervé Ménager, Stian Soiland-Reyes, Carole Goble

Date Published: 14th May 2021

Publication Type: Preprint

Citation: arXiv 2105.07028 [cs.DC]

Created: 5th Jul 2021 at 12:58, Last updated: 16th Jan 2023 at 13:34

FAIR Computational Workflows - Keynote GCB2021

FAIR Computational Workflows

Keynote at German Conference on Bioinformatics 2021 https://gcb2021.de/ FAIR Computational Workflows Computational workflows capture precise descriptions of the steps and data dependencies needed to carry out computational data pipelines, analysis and simulations in many areas of Science, including the Life Sciences. The use of computational workflows to manage these multi-step computational processes has accelerated in the past few years driven by the need for scalable data processing, the ...

Creator: Carole Goble

Submitter: Carole Goble

External Link

Created: 1st Dec 2021 at 20:49, Last updated: 1st Dec 2021 at 21:24

Human–AI Ledger (HAIL): A Structured Workflow for Traceable, Reproducible Human–AI Collaboration

BioFAIR

Stable

The Human–AI Ledger (HAIL) defines a structured, repeatable workflow for human–AI collaboration. Through standardized checkpoints and a session ledger, HAIL documents ethical, creative, and procedural context across both human and AI contributions. While AI-generated outputs are inherently non-deterministic, HAIL supports process reproducibility by providing a consistent framework for recording collaboration, facilitating auditability, transparency, and ethical accountability in co-creative AI ...

Type: Unrecognized workflow type

Creators: Evan P. Troendle, BioFAIR Fellowship Programme

Submitter: Evan P. Troendle

Created: 25th Feb 2026 at 12:55

Galaxy Workflow for CNV Detection with CNVkit and Conversion to Beacon JSON Using cnv-vcf2json

Galaxy Training Network

Stable

This Galaxy workflow streamlines comprehensive copy number variation (CNV) analysis by integrating CNVkit’s robust detection capabilities with an efficient conversion step using cnv-vcf2json to format results into Beacon JSON. Designed for computational biologists and bioinformaticians, the workflow standardizes CNV identification and output formatting to enhance interoperability with Beacon networks. It is specifically optimized for use with mapped BAM files from the EGAD00001008392 synthetic ...

Type: Galaxy

Creators: Khaled Jum'ah, Krzysztof Poterlowicz

Submitter: Khaled Jum'ah

DOI: 10.48546/workflowhub.workflow.1314.1

Created: 3rd Mar 2025 at 09:03, Last updated: 3rd Mar 2025 at 09:53

Somatic-ShortV-nf

Sydney Informatics Hub, Australian BioCommons

(Show All)

Work-in-progress

This is a Nextflow implementaion of the GATK Somatic Short Variant Calling workflow. This workflow can be used to discover somatic short variants (SNVs and indels) from tumour and matched normal BAM files following GATK's Best Practices Workflow. The workflowis currently optimised to run efficiently and at scale on the National Compute Infrastructure, Gadi.

Type: Nextflow

Creators: Nandan Deshpande, Tracy Chew, Cali Willet, Georgina Samaha

Submitter: Georgina Samaha

DOI: 10.48546/workflowhub.workflow.691.1

Created: 20th Dec 2023 at 01:12, Last updated: 20th Dec 2023 at 01:16

VVV2_align_PE

ANSES-Ploufragan

Deprecated

PAIRED-END workflow. Align reads on fasta reference/assembly using bwa mem, get a consensus, variants, mutation explanations.

IMPORTANT:

For "bcftools call" consensus step, the --ploidy file is in "Données partagées" (Shared Data) and must be imported in your history to use the worflow by providing this file (tells bcftools to consider haploid variant calling).
SELECT THE MOST ADAPTED VADR MODEL for annotation (see vadr parameters).

Type: Galaxy

Creator: Fabrice Touzain

Submitter: Fabrice Touzain

Created: 28th Jun 2023 at 10:52, Last updated: 19th Jun 2025 at 11:23

CWL-based (single-sample) workflow for germline variant calling

Biodata Analysis Group

Stable

A CWL-based pipeline for calling small germline variants, namely SNPs and small INDELs, by processing data from Whole-genome Sequencing (WGS) or Targeted Sequencing (e.g., Whole-exome sequencing; WES) experiments.

On the respective GitHub folder are available:

The CWL wrappers and subworkflows for the workflow
A pre-configured YAML template, based on validation analysis of publicly available HTS data

Briefly, the workflow performs the following steps:

Quality control of Illumina reads ...

Type: Common Workflow Language

Creators: Konstantinos Kyritsis, Nikolaos Pechlivanis, Fotis Psomopoulos

Submitter: Konstantinos Kyritsis

DOI: 10.48546/workflowhub.workflow.527.1

Created: 5th Jul 2023 at 10:48

CWL-based (multi-sample) workflow for germline variant calling

Biodata Analysis Group

Stable

On the respective GitHub folder are available:

The CWL wrappers and subworkflows for the workflow
A pre-configured YAML template, based on validation analysis of publicly available HTS data

Briefly, the workflow performs the following steps:

Quality control of Illumina reads ...

Type: Common Workflow Language

Creators: Konstantinos Kyritsis, Nikolaos Pechlivanis, Fotis Psomopoulos

Submitter: Konstantinos Kyritsis

DOI: 10.48546/workflowhub.workflow.526.1

Created: 5th Jul 2023 at 10:44

CWL-based ChIP-Seq workflow

Biodata Analysis Group

Stable

A CWL-based pipeline for processing ChIP-Seq data (FASTQ format) and performing:

Peak calling
Consensus peak count table generation
Detection of super-enhancer regions
Differential binding analysis

On the respective GitHub folder are available:

The CWL wrappers for the workflow
A pre-configured YAML template, based on validation analysis of publicly available HTS data
Tables of metadata (EZH2_metadata_CLL.csv and H3K27me3_metadata_CLL.csv), based on the same validation ...

Type: Common Workflow Language

Creators: Konstantinos Kyritsis, Nikolaos Pechlivanis, Fotis Psomopoulos

Submitter: Konstantinos Kyritsis

DOI: 10.48546/workflowhub.workflow.525.1

Created: 5th Jul 2023 at 10:39

CWL-based RNA-Seq workflow

Biodata Analysis Group

Stable

A CWL-based pipeline for processing RNA-Seq data (FASTQ format) and performing differential gene/transcript expression analysis.

On the respective GitHub folder are available:

The CWL wrappers for the workflow
A pre-configured YAML template, based on validation analysis of publicly available HTS data
A table of metadata (mrna_cll_subsets_phenotypes.csv), based on the same validation analysis, to serve as an input example for the design of comparisons during differential expression ...

Type: Common Workflow Language

Creators: Konstantinos Kyritsis, Nikolaos Pechlivanis, Fotis Psomopoulos

Submitter: Konstantinos Kyritsis

DOI: 10.48546/workflowhub.workflow.524.1

Created: 5th Jul 2023 at 09:44, Last updated: 5th Jul 2023 at 10:15

Metabolome Annotation Workflow (MAW)

Metabolomics-Reproducibility

(Show All)

Stable

This repository hosts Metabolome Annotation Workflow (MAW). The workflow takes MS2 .mzML format data files as an input in R. It performs spectral database dereplication using R Package Spectra and compound database dereplication using SIRIUS OR MetFrag . Final candidate selection is done in Python using RDKit and PubChemPy.

Type: Common Workflow Language

Creators: Mahnoor Zulfiqar, Michael R. Crusoe, Luiz Gadelha, Christoph Steinbeck, Maria Sorokina, Kristian Peters

Submitter: Mahnoor Zulfiqar

DOI: 10.48546/workflowhub.workflow.510.2

Created: 19th Jun 2023 at 21:09, Last updated: 1st Aug 2023 at 15:21

EJP-RD WP13 case-study CAKUT momix analysis

EJPRD WP13 case-studies workflows

(Show All)

Stable

Joint multi-omics dimensionality reduction approaches for CAKUT data using peptidome and proteome data

Brief description In (Cantini et al. 2020), Cantini et al. evaluated 9 representative joint dimensionality reduction (jDR) methods for multi-omics integration and analysis and . The methods are Regularized Generalized Canonical Correlation Analysis (RGCCA), Multiple co-inertia analysis (MCIA), Multi-Omics Factor Analysis (MOFA), Multi-Study Factor Analysis (MSFA), iCluster, Integrative NMF ...

Type: Snakemake

Creators: Ozan Ozisik, Juma Bayjan, Cenna Doornbos, Friederike Ehrhart, Matthias Haimel, Laura Rodriguez-Navas, José Mª Fernández, Eleni Mina, Daniël Wijnbergen

Submitter: Juma Bayjan

Download

Created: 23rd Jun 2021 at 11:42, Last updated: 27th Oct 2022 at 17:39

EJP-RD WP13 case-study: CAKUT proteome, peptidome and miRNome data analysis using WikiPathways

EJPRD WP13 case-studies workflows

(Show All)

Stable

In this analysis, we created an extended pathway, using the WikiPathways repository (Version 20210110) and the three -omics datasets. For this, each of the three -omics datasets was first analyzed to identify differentially expressed elements, and pathways associated with the significant miRNA-protein links were detected. A miRNA-protein link is deemed significant, and may possibly be implying causality, if both a miRNA and its target are significantly differentially expressed.

The peptidome and ...

Type: Snakemake

Creators: Woosub Shin, Friederike Ehrhart, Juma Bayjan, Cenna Doornbos, Ozan Ozisik

Submitter: Juma Bayjan

Created: 20th Apr 2022 at 17:59, Last updated: 27th Oct 2022 at 16:16

MGnify - amplicon analysis pipeline

MGnify

Stable

MGnify (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the assembly, analysis and archiving of microbiome data derived from sequencing microbial populations that are present in particular environments. Over the past 2 years, MGnify (formerly EBI Metagenomics) has more than doubled the number of publicly available analysed datasets held within the resource. Recently, an updated approach to data analysis has been unveiled (version 5.0), replacing the previous single pipeline ...

Type: Common Workflow Language

Creator: Alex L Mitchell, Alexandre Almeida, Martin Beracochea, Miguel Boland, Josephine Burgin, Guy Cochrane, Michael R Crusoe, Varsha Kale, Simon C Potter, Lorna J Richardson, Ekaterina Sakharova, Maxim Scheremetjew, Anton Korobeynikov, Alex Shlemov, Olga Kunyavskaya, Alla Lapidus, Robert D Finn

Submitter: Martin Beracochea

Created: 7th Jun 2022 at 09:28

MGnify - assembly analysis pipeline

MGnify, HoloFood at MGnify

Stable

Type: Common Workflow Language

Submitter: Martin Beracochea

Created: 7th Jun 2022 at 08:41, Last updated: 7th Jun 2022 at 09:04

V-pipe (main multi-virus version)

V-Pipe

Stable

...

Type: Snakemake

Creators: Ivan Topolsky, Kim Philipp Jablonski

Submitter: Ivan Topolsky

DOI: 10.48546/workflowhub.workflow.301.5

Created: 30th Mar 2022 at 17:50, Last updated: 10th Jun 2024 at 19:38

RNASeq-DE @ NCI-Gadi

Sydney Informatics Hub

Stable

RNASeq-DE @ NCI-Gadi processes RNA sequencing data (single, paired and/or multiplexed) for differential expression (raw FASTQ to counts). This pipeline consists of multiple stages and is designed for the National Computational Infrastructure's (NCI) Gadi supercompter, leveraging multiple nodes to run each stage in parallel.

Infrastructure_deployment_metadata: Gadi (NCI)

Type: Shell Script

Creators: Tracy Chew, Rosemarie Sadsad, Cali Willet

Submitter: Tracy Chew

DOI: 10.48546/workflowhub.workflow.152.1

Download

Created: 19th Aug 2021 at 00:24, Last updated: 23rd Aug 2022 at 07:09

GATK4 Fastq to joint-called cohort VCF with Cromwell on local cluster (no job scheduler)

Australian BioCommons, Pawsey Supercomputing Research Centre

Work-in-progress

Local Cromwell implementation of GATK4 germline variant calling pipeline

See the GATK website for more information on this toolset

Assumptions

Using hg38 human reference genome build
Running 'locally' i.e. not using HPC/SLURM scheduling, or containers. This repo was specifically tested on Pawsey Nimbus 16 CPU, 64GB RAM virtual machine, primarily running in the /data volume storage partition.
Starting from short-read Illumina paired-end fastq ...

Type: Workflow Description Language

Creators: None

Submitter: Sarah Beecroft

Download

Created: 17th Aug 2021 at 05:47

microPIPE: a pipeline for high-quality bacterial genome construction using ONT and Illumina sequencing

QCIF Bioinformatics

Stable

microPIPE was developed to automate high-quality complete bacterial genome assembly using Oxford Nanopore Sequencing in combination with Illumina sequencing.

To build microPIPE we evaluated the performance of several tools at each step of bacterial genome assembly, including basecalling, assembly, and polishing. Results at each step were validated using the high-quality ST131 Escherichia coli strain EC958 (GenBank: HG941718.1). After appraisal of each step, we selected the best combination of ...

Type: Nextflow

Creators: Valentine Murigneux, Leah W Roberts, Brian M Forde, Minh-Duy Phan, Nguyen Thi Khanh Nhu, Adam D Irwin, Patrick N A Harris, David L Paterson, Mark A Schembri, David M Whiley, Scott A Beatson

Submitter: Valentine Murigneux

DOI: 10.48546/workflowhub.workflow.140.1

Download

Created: 9th Aug 2021 at 01:17, Last updated: 6th Sep 2021 at 23:50

Common Workflow Language Engines

BioExcel Best Practice Guides

(Show All)

This BioExcel best practice guide discusses the workflow engines available for the Common Workflow Language (CWL).

Creators: Robin Long, Douglas Lowe, Stian Soiland-Reyes

Submitter: Stian Soiland-Reyes

External Link

Created: 5th Jul 2021 at 12:40