Workflows

Stable

The workflow takes trimmed HiC forward and reverse reads and Pri/Alt assemblies, and produces a scaffolded primary assembly (and alternate contigs) using YaHS. It also runs all the QC analyses (gfastats, BUSCO, and Merqury).

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

Stable

The workflow takes a trimmed HiFi reads collection, Pri/Alt contigs, and the values for the transition parameter and max coverage depth (calculated from WF1) to run Purge_Dups. It produces purged Pri and Alt contig assemblies and runs all the QC analyses (gfastats, BUSCO, and Merqury).

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.1163.1

Stable

The workflow takes a trimmed HiFi reads collection and the max coverage depth (calculated from WF1) to run Hifiasm in HiFi solo mode. It produces a Pri/Alt assembly and runs all the QC analyses (gfastats, BUSCO, and Merqury).

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.1162.1

Stable

The workflow takes a trimmed HiFi reads collection, runs Meryl to create a k-mer database, Genomescope2 to estimate genome properties, and Smudgeplot to estimate ploidy. The main results are the k-mer database and genome profiling plots, tables, and values useful for downstream analysis. The default k-mer length and ploidy for Genomescope are 21 and 2, respectively.
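To make the idea of a k-mer database concrete, the following is a minimal Python sketch of k-mer counting with the default length of 21. It is not Meryl's implementation: the function names are hypothetical, and counting canonical k-mers (a k-mer and its reverse complement treated as one) is an assumption of this sketch. The multiplicity histogram it builds is the kind of summary that genome profiling tools typically work from.

```python
from collections import Counter

# Translation table for complementing DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical(kmer):
    """Return the lexicographically smaller of a k-mer and its reverse complement."""
    revcomp = kmer.translate(_COMPLEMENT)[::-1]
    return min(kmer, revcomp)


def count_kmers(reads, k=21):
    """Count canonical k-mers across an iterable of read sequences (hypothetical helper)."""
    counts = Counter()
    for read in reads:
        seq = read.upper()
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            # Skip windows containing N or any other non-ACGT symbol.
            if set(kmer) <= set("ACGT"):
                counts[canonical(kmer)] += 1
    return counts


def kmer_histogram(counts):
    """Map multiplicity -> number of distinct k-mers seen that many times."""
    return Counter(counts.values())
```

For example, `kmer_histogram(count_kmers(reads))` gives the number of distinct 21-mers at each multiplicity for a list of read strings. A real k-mer counter uses compact encodings and on-disk storage, which this in-memory sketch does not attempt.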

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.603.1

Stable

The workflow takes a HiFi reads collection, runs FastQC and SeqKit, filters with Cutadapt, and creates a MultiQC report. The main outputs are a collection of filtered reads, a report with raw and filtered reads stats, and a table with raw reads stats.

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.602.1

Stable

The workflow takes a paired-reads collection (like Illumina WGS or HiC), runs FastQC and SeqKit, trims with Fastp, and creates a MultiQC report. The main outputs are a paired collection of trimmed reads, a report with raw and trimmed reads stats, and a table with raw reads stats.

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.601.1

Work-in-progress

Swedish Earth Biogenome Project - Genome Assembly Workflow

The primary genome assembly workflow for the Earth Biogenome Project at NBIS.

Workflow overview

General aim:

flowchart LR 
hifi[/ HiFi reads /] --> data_inspection 
ont[/ ONT reads /] --> data_inspection 
hic[/ Hi-C reads /] --> data_inspection 
data_inspection[[ Data inspection ]] --> preprocessing 
preprocessing[[ Preprocessing ]] --> assemble 
assemble[[ Assemble ]] --> validation 
validation[[ Assembly
...

Type: Nextflow

Creators: Mahesh Binzer-Panchal, Martin Pippel

Submitter: Mahesh Binzer-Panchal

Stable

Assembly Evaluation for ERGA-BGE Reports

One Assembly, HiFi WGS reads + HiC reads

The workflow requires the following:

  • Species Taxonomy ID number
  • NCBI Genome assembly accession code
  • BUSCO Lineage
  • WGS accurate reads accession code
  • NCBI HiC reads accession code

The workflow will get the data and process it to generate genome profiling (genomescope, smudgeplot -optional-), assembly stats (gfastats), merqury stats (QV, completeness), BUSCO, snailplot, contamination blobplot, and HiC ...
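Among the assembly statistics that gfastats reports is N50. As a rough illustration of what that number means, here is a minimal Python sketch; the function name is hypothetical and this is not how gfastats itself is implemented.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half the total is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
```

For scaffold lengths `[2, 3, 4, 5, 6]` the total is 20, and the two longest sequences (6 + 5 = 11) already cover half of it, so the N50 is 5.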

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.1104.1

Stable

Assembly Evaluation for ERGA-BGE Reports

One Assembly, Illumina WGS reads + HiC reads

The workflow requires the following:

  • Species Taxonomy ID number
  • NCBI Genome assembly accession code
  • BUSCO Lineage
  • WGS accurate reads accession code
  • NCBI HiC reads accession code

The workflow will get the data and process it to generate genome profiling (genomescope, smudgeplot -optional-), assembly stats (gfastats), merqury stats (QV, completeness), BUSCO, snailplot, contamination blobplot, and ...

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.1103.2

Stable

HiC scaffolding pipeline

Snakemake pipeline for scaffolding a genome with HiC reads using YaHS.

Prerequisites

This pipeline has been tested using Snakemake v7.32.4 and requires conda for installation of the required tools. To run the pipeline, use the command:

snakemake --use-conda --cores N

where N is the number of cores to use. A set of configuration and run scripts is provided for execution on a Slurm queueing system. After configuring the cluster.json file, run:

./run_cluster ...
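For the local (non-cluster) invocation given above, a small Python helper can assemble the command with a concrete core count. This is only a sketch: the function name is hypothetical, defaulting N to the machine's CPU count is an assumption, and the optional `--dry-run` flag is Snakemake's standard way to preview jobs without running them, not something the pipeline description mentions.

```python
import os
import shlex


def build_snakemake_command(cores=None, dry_run=False):
    """Build the argument list for `snakemake --use-conda --cores N` (hypothetical helper)."""
    if cores is None:
        # Assumption: fall back to all CPUs reported by the OS.
        cores = os.cpu_count() or 1
    if cores < 1:
        raise ValueError("cores must be a positive integer")
    cmd = ["snakemake", "--use-conda", "--cores", str(cores)]
    if dry_run:
        # Ask Snakemake to list the planned jobs without executing them.
        cmd.append("--dry-run")
    return cmd


def as_shell_string(cmd):
    """Render an argument list as a shell-safe command string."""
    return shlex.join(cmd)
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` from the pipeline directory, or printed with `as_shell_string` to copy into a terminal.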

Type: Snakemake

Creator: Tom Brown

Submitter: Tom Brown

DOI: 10.48546/workflowhub.workflow.796.2

Copyright © 2008 - 2024 The University of Manchester and HITS gGmbH