Workflows

What is a Workflow?

23 Workflows visible to you, out of a total of 23

Swedish Earth Biogenome Project Genome Assembly Workflow

NBIS, ERGA Assembly

Work-in-progress

Swedish Earth Biogenome Project - Genome Assembly Workflow

The primary genome assembly workflow for the Earth Biogenome Project at NBIS.

Workflow overview

General aim:

flowchart LR 
hifi[/ HiFi reads /] --> data_inspection 
ont[/ ONT reads /] --> data_inspection 
hic[/ Hi-C reads /] --> data_inspection 
data_inspection[[ Data inspection ]] --> preprocessing 
preprocessing[[ Preprocessing ]] --> assemble 
assemble[[ Assemble ]] --> validation 
validation[[ Assembly
...

Type: Nextflow

Creators: Mahesh Binzer-Panchal, Martin Pippel

Submitter: Mahesh Binzer-Panchal

Created: 23rd Aug 2024 at 14:16

HiC scaffolding pipeline

ERGA Assembly, Biodiversity Genomics Europe (general)

Stable

HiC scaffolding pipeline

Snakemake pipeline for scaffolding of a genome using HiC reads using yahs.

Prerequisites

This pipeine has been tested using Snakemake v7.32.4 and requires conda for installation of required tools. To run the pipline use the command:

snakemake --use-conda --cores N

where N is number of cores to use. There are provided a set of configuration and running scripts for exectution on a slurm queueing system. After configuring the cluster.json file run:

./run_cluster ...

Type: Snakemake

Creator: Tom Brown

Submitter: Tom Brown

DOI: 10.48546/workflowhub.workflow.796.2

Created: 16th Mar 2024 at 09:01

Purge retained haplotypes using Purge-Dups

ERGA Assembly, Biodiversity Genomics Europe (general)

Purge dups

This snakemake pipeline is designed to be run using as input a contig-level genome and pacbio reads. This pipeline has been tested with snakemake v7.32.4. Raw long-read sequencing files and the input contig genome assembly must be given in the config.yaml file. To execute the workflow run:

snakemake --use-conda --cores N

Or configure the cluster.json and run using the ./run_cluster command

Type: Snakemake

Creator: Tom Brown

Submitter: Tom Brown

DOI: 10.48546/workflowhub.workflow.506.2

Created: 16th Jun 2023 at 14:56, Last updated: 16th Mar 2024 at 07:49

HiC contact map generation

ERGA Assembly, Biodiversity Genomics Europe (general)

Stable

HiC contact map generation

Snakemake pipeline for the generation of .pretext and .mcool files for visualisation of HiC contact maps with the softwares PretextView and HiGlass, respectively.

Prerequisites

This pipeine has been tested using Snakemake v7.32.4 and requires conda for installation of required tools. To run the pipline use the command:

snakemake --use-conda

There are provided a set of configuration and running scripts for exectution on a slurm queueing system. After configuring ...

Type: Snakemake

Creator: Tom Brown

Submitter: Tom Brown

DOI: 10.48546/workflowhub.workflow.795.2

Created: 14th Mar 2024 at 09:50, Last updated: 14th Mar 2024 at 09:52

ERGA ONT+Illumina Assembly+QC NextDenovo+HyPo v2403 (WF2)

ERGA Assembly

Work-in-progress

The workflow takes raw ONT reads and trimmed Illumina WGS paired reads collections, the ONT raw stats table (calculated from WF1) and the estimated genome size (calculated from WF1) to run NextDenovo and subsequently polish the assembly with HyPo. It produces collapsed assemblies (unpolished and polished) and runs all the QC analyses (gfastats, BUSCO, and Merqury).

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

Created: 11th Mar 2024 at 14:45

ERGA ONT+Illumina Assembly+QC Flye+HyPo v2403 (WF2)

ERGA Assembly

Stable

The workflow takes raw ONT reads and trimmed Illumina WGS paired reads collections, and the estimated genome size and Max depth (both calculated from WF1) to run Flye and subsequently polish the assembly with HyPo. It produces collapsed assemblies (unpolished and polished) and runs all the QC analyses (gfastats, BUSCO, and Merqury).

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

Created: 11th Mar 2024 at 12:41

CLAWS (CNAG's long-read assembly workflow in Snakemake)

ERGA Assembly

Stable

CLAWS (CNAG's Long-read Assembly Workflow in Snakemake)

Snakemake Pipeline used for de novo genome assembly @CNAG. It has been developed for Snakemake v6.0.5.

It accepts Oxford Nanopore Technologies (ONT) reads, PacBio HFi reads, illumina paired-end data, illumina 10X data and Hi-C reads. It does the preprocessing of the reads, assembly, polishing, purge_dups, scaffodling and different evaluation steps. By default it will preprocess the reads, run Flye + Hypo + purge_dups + yahs and evaluate ...

Type: Snakemake

Creators: Jessica Gomez-Garrido, Fernando Cruz (CNAG), Francisco Camara (CNAG), Tyler Alioto (CNAG)

Submitter: Jessica Gomez-Garrido

DOI: 10.48546/workflowhub.workflow.567.2

Created: 12th Sep 2023 at 14:23, Last updated: 2nd Feb 2024 at 12:24

ERGA HiC Collapsed Scaffolding+QC YaHS v2311 (WF4)

ERGA Assembly

Work-in-progress

The workflow takes trimmed HiC forward and reverse reads, and one assembly (e.g.: Hap1 or Pri or Collapsed) to produce a scaffolded assembly using YaHS. It also runs all the QC analyses (gfastats, BUSCO, and Merqury).

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

Created: 9th Jan 2024 at 11:00

ERGA ONT+Illumina Collapsed Purge+QC v2311 (WF3)

ERGA Assembly

Work-in-progress

The workflow takes a trimmed Illumina WGS paired-end reads collection, Collapsed contigs, and the values for transition parameter and max coverage depth (calculated from WF1) to run Purge_Dups. It produces purged Collapsed contigs assemblies, and runs all the QC analysis (gfastats, BUSCO, and Merqury).

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

Created: 9th Jan 2024 at 10:40, Last updated: 9th Jan 2024 at 10:44

ERGA Profiling Illumina v2311 (WF1)

ERGA Assembly

Stable

The workflow takes a trimmed Illumina paired-end reads collection, runs Meryl to create a K-mer database, Genomescope2 to estimate genome properties and Smudgeplot to estimate ploidy. The main results are K-mer ddatabase and genome profiling plots, tables, and values useful for downstream analysis. Default K-mer length and ploidy for Genomescope are 21 and 2, respectively.

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

Created: 8th Jan 2024 at 15:55, Last updated: 8th Jan 2024 at 15:57

Workflows

Filters ×

Swedish Earth Biogenome Project - Genome Assembly Workflow

Workflow overview

HiC scaffolding pipeline

Prerequisites

Purge dups

HiC contact map generation

Prerequisites

CLAWS (CNAG's Long-read Assembly Workflow in Snakemake)

Filters