Workflows

What is a Workflow?
31 Workflows visible to you, out of a total of 31

Note: Deprecated as of May 2025. The mRNA preprocessing previously performed by this workflow is now built into the Fgenesh annotation workflow (881) Version 4. This workflow is no longer needed in the TSI annotation pipeline. Please use workflow 881 Version 4 directly with TransDecoder CDS output from workflow 879 (Extract transcripts).


This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.880.2

Fgenesh Annotation - TSI Workflow Description

Overview

One of a series of workflows to annotate a genome, tagged TSI-annotation. Based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

Workflow Sequence

Run in this order:

  • Repeat masking
  • RNAseq QC and read trimming
  • Find transcripts
  • Combine transcripts
  • Extract transcripts
  • Fgenesh annotation (this workflow)

Inputs Required

Files uploaded by the user:

  • assembled_genome.fasta — the ...

Type: Galaxy

Creator: Luke Silver

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.881.8

Assembly polishing; can run alone or as part of a combined workflow for large genome assembly.

  • What it does: Polishes (corrects) an assembly, using long reads (with the tools Racon and Medaka) and short reads (with the tool Racon). (Note: medaka is only for nanopore reads, not PacBio reads).
  • Inputs: assembly to be polished: assembly.fasta; long reads - the same set used in the assembly (e.g. may be raw or filtered) fastq.gz format; short reads, R1 only, in fastq.gz format
  • Outputs: ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.226.2

Stable

Post-genome assembly quality control workflow using Quast, BUSCO, Meryl, Merqury and Fasta Statistics, with updates November 2024.

Workflow inputs: reads as fastqsanger.gz (not fastq.gz), and primary assembly.fasta. (To change reads format: click on the pencil icon next to the file in the Galaxy history, then "Datatypes", then set "New type" as fastqsanger.gz). Note: the reads should be those that were used for the assembly (i.e., the filtered/cleaned reads), not the raw reads.

What it does: ...

Type: Galaxy

Creators: Kate Farquharson, Gareth Price, Simon Tang, Anna Syme, Gareth Price, Anna Syme

Submitters: Johan Gustafsson, Anna Syme

DOI: 10.48546/workflowhub.workflow.403.11

Stable

Taxonomy Assignment with QIIME2

This workflow performs taxonomic assignment for an input OTU/ASV (Operating Taxonomic Unit/Amplicon Sequence Variant) Feature Table and set of amplicon Representative Sequences using QIIME2. The workflow creates and trains a QIIME2 classifier for an input reference taxonomic database (e.g. HOMD, SILVA, PR2, UNITE) and uses the trained classifier to assign taxonomy ...

Genome assembly workflow for nanopore reads, for TSI

Input:

  • Nanopore reads (can be in format: fastq, fastq.gz, fastqsanger, or fastqsanger.gz)

Optional settings to specify when the workflow is run:

  • [1] how many input files to split the original input into (to speed up the workflow). default = 0. example: set to 2000 to split a 60 GB read file into 2000 files of ~ 30 MB.
  • [2] filtering: min average read quality score. default = 10
  • [3] filtering: min read length. default = 200
  • [4] ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.1114.1

Scaffolding using HiC data with YAHS

This workflow has been created from a Vertebrate Genomes Project (VGP) scaffolding workflow.

Some minor changes have been made to better fit with TSI project data:

  • optional inputs of SAK info ...

Type: Galaxy

Creators: VGP Project, VGP, Galaxy

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.1054.1

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

  • Repeat masking
  • RNAseq QC and read trimming
  • Find transcripts
  • Combine transcripts
  • Extract transcripts
  • Convert formats
  • Fgenesh annotation

Workflow information:

  • Input = genome.fasta.
  • Outputs = soft_masked_genome.fasta, hard_masked_genome.fasta, ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.875.3

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

  • Repeat masking
  • RNAseq QC and read trimming
  • Find transcripts
  • Combine transcripts
  • Extract transcripts
  • Convert formats
  • Fgenesh annotation

About this workflow:

  • Input: merged_transcriptomes.fasta.
  • Runs TransDecoder to produce longest_transcripts.fasta ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.879.1

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

  • Repeat masking
  • RNAseq QC and read trimming
  • Find transcripts
  • Combine transcripts
  • Extract transcripts
  • Convert formats
  • Fgenesh annotation

About this workflow:

  • Inputs: multiple transcriptome.gtfs from different tissues, genome.fasta, coding_seqs.fasta, ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.878.3

Powered by
(v.1.17.3)
Copyright © 2008 - 2026 The University of Manchester and HITS gGmbH