This is part of a series of workflows to annotate a genome, tagged with TSI-annotation.
These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.
The workflows can be run in this order:
- Repeat masking
- RNAseq QC and read trimming
- Find transcripts
- Combine transcripts
- Extract transcripts
- Convert formats
- Fgenesh annotation
Steps
| ID | Name | Description |
|---|---|---|
| 0 | FGENESH split | fgenesh_split |
| 1 | FGENESH split | fgenesh_split |
| 2 | FGENESH annotate | fgenesh_annotate |
| 3 | FGENESH merge | fgenesh_merge |
| 4 | FGENESH merge | fgenesh_merge |
| 5 | FGENESH get mRNA or GC | fgenesh_get_mrnas_gc |
| 6 | FGENESH get mRNA or GC | fgenesh_get_mrnas_gc |
| 7 | FGENESH get protein | fgenesh_get_proteins |
| 8 | Busco | toolshed.g2.bx.psu.edu/repos/iuc/busco/busco/5.5.0+galaxy0 |
Version History
Version 4 (latest) Created 5th May 2026 at 03:18 by Anna Syme
Version 4 changes: The mRNA input preprocessing (previously handled by workflow 880 – Convert formats) is now built into this workflow. TransDecoder CDS output from workflow 879 is automatically linearised, filtered to complete ORFs, and reformatted for Fgenesh before annotation. Workflow 880 is no longer needed. Also fixes a Galaxy Australia compatibility issue: fgenesh_get_mrnas_gc (which fails on Galaxy AU) is replaced with fgenesh_get_mrnas v2024.2+galaxy1.
Frozen
Version-4
00b9a00
Version 3.2 Created 18th Apr 2026 at 11:50 by Anna Syme
Shortened the licence-agreement parameter input label from the full BioCommons acknowledgement text to "Licence agreement (tick to accept Fgenesh terms)" to tidy the workflow diagram. Updated the workflow description to accurately list inputs (assembled_genome.fasta, hard_masked_genome.fasta, mRNA_sequences.fasta) and clarify that species matrix, NR database and BUSCO lineage are dropdown selections rather than uploaded files. No functional changes; .ga steps and tool versions unchanged from Version 3.1.
Frozen
Version-3.2
af61f4b
Version 3.1 Created 18th Apr 2026 at 11:21 by Anna Syme
Fixed: added license_agreements input, mRNA_sequences.fasta input, updated fgenesh_annotate/fgenesh_get_proteins to v2024.2+galaxy1. Tested on S. cerevisiae (BUSCO C:84.0%).
Frozen
Version-3.1
fe9c08f
Version 3 Created 17th Nov 2024 at 23:41 by Anna Syme
Changes made 13 Nov 2024: Added correct input files and connected them to the split steps. Added inputs for db selections in the annotation step. Added lineage input to Busco. Added genome annotation stats derived from gff3 output. Connected in assembly.fasta sequences to "get mRNA/cDNA" tools. Expanded this information text and clarified the need for .cdna .pro and .dat files as input.
Frozen
Version-3
fe9c08f
Version 2.1 Created 18th Jun 2024 at 10:08 by Anna Syme
add updated workflow image
Frozen
Version-2.1
3cebff9
Version 1 (earliest) Created 8th May 2024 at 08:28 by Anna Syme
Initial commit
Frozen
Version-1
1b30a7e
Creators and SubmitterCreator
Submitter
Views: 10247 Downloads: 2753 Runs: 16
Created: 8th May 2024 at 08:28
Last updated: 5th May 2026 at 03:21
Tags
AttributionsNone
Collections
Run on Galaxy
https://orcid.org/0000-0002-9906-0673