Workflows

What is a Workflow?

26 Workflows visible to you, out of a total of 26

Assembly with Flye

Galaxy Australia, Australian BioCommons

Assembly with Flye; can run alone or as part of a combined workflow for large genome assembly.

What it does: Assembles long reads with the tool Flye
Inputs: long reads (may be raw, or filtered, and/or corrected); fastq.gz format
Outputs: Flye assembly fasta; Fasta stats on assembly.fasta; Assembly graph image from Bandage; Bar chart of contig sizes; Quast reports of genome assembly
Tools used: Flye, Fasta statistics, Bandage, Bar chart, Quast
Input parameters: None required, but recommend ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.225.1

Download

Created: 8th Nov 2021 at 05:07, Last updated: 9th Nov 2021 at 01:11

Trim and filter reads - fastp

Galaxy Australia, Australian BioCommons

Trim and filter reads; can run alone or as part of a combined workflow for large genome assembly.

What it does: Trims and filters raw sequence reads according to specified settings.
Inputs: Long reads (format fastq); Short reads R1 and R2 (format fastq)
Outputs: Trimmed and filtered reads: fastp_filtered_long_reads.fastq.gz (But note: no trimming or filtering is on by default), fastp_filtered_R1.fastq.gz, fastp_filtered_R2.fastq.gz
Reports: fastp report on long reads, html; fastp report ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.224.1

Download

Created: 8th Nov 2021 at 04:56, Last updated: 9th Nov 2021 at 01:11

kmer counting - meryl

Galaxy Australia, Australian BioCommons

Kmer counting step, can run alone or as part of a combined workflow for large genome assembly.

What it does: Estimates genome size and heterozygosity based on counts of kmers
Inputs: One set of short reads: e.g. R1.fq.gz
Outputs: GenomeScope graphs
Tools used: Meryl, GenomeScope
Input parameters: None required
Workflow steps: The tool meryl counts kmers in the input reads (k=21), then converts this into a histogram. GenomeScope: runs a model on the histogram; reports estimates. k-mer ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.223.1

Download

Created: 8th Nov 2021 at 04:47, Last updated: 9th Nov 2021 at 01:10

Data QC

Galaxy Australia, Australian BioCommons

Data QC step, can run alone or as part of a combined workflow for large genome assembly.

What it does: Reports statistics from sequencing reads.
Inputs: long reads (fastq.gz format), short reads (R1 and R2) (fastq.gz format).
Outputs: For long reads: a nanoplot report (the HTML report summarizes all the information). For short reads: a MultiQC report.
Tools used: Nanoplot, FastQC, MultiQC.
Input parameters: None required.
Workflow steps: Long reads are analysed by Nanoplot; Short reads ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.222.1

Download

Created: 8th Nov 2021 at 04:34, Last updated: 9th Nov 2021 at 01:09

Racon polish with Illumina reads, x2

Galaxy Australia, Australian BioCommons

Assembly polishing subworkflow: Racon polishing with short reads

Inputs: short reads and assembly (usually pre-polished with other tools first, e.g. Racon + long reads; Medaka)

Workflow steps:

minimap2: short reads (R1 only) are mapped to the assembly => overlaps.paf. Minimap2 setting is for short reads.
overlaps + short reads + assembly => Racon => polished assembly 1
using polished assembly 1 as input; repeat minimap2 + racon => polished assembly 2
Racon short-read polished ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.228.1

Download

Created: 8th Nov 2021 at 05:50, Last updated: 9th Nov 2021 at 01:09

Assembly polishing

Galaxy Australia, Australian BioCommons

Assembly polishing; can run alone or as part of a combined workflow for large genome assembly.

What it does: Polishes (corrects) an assembly, using long reads (with the tools Racon and Medaka) and short reads (with the tool Racon). (Note: medaka is only for nanopore reads, not PacBio reads).
Inputs: assembly to be polished: assembly.fasta; long reads - the same set used in the assembly (e.g. may be raw or filtered) fastq.gz format; short reads, R1 only, in fastq.gz format
Outputs: ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.226.1

Download

Created: 8th Nov 2021 at 05:32, Last updated: 9th Nov 2021 at 01:08

Workflows

Filters ×

Filters