Anna Syme

About Anna Syme:

https://www.annasyme.com/

SEEK ID: https://workflowhub.eu/people/200

Location: Australia

ORCID: https://orcid.org/0000-0002-9906-0673

Joined: 8th Nov 2021

Expertise: Not specified

Tools: Not specified

Their tags

genome_assembly, Large-genome-assembly, nanopore, TSI, TSI-annotation

Related items

Spaces

Australian BioCommons

The Australian BioCommons enhances digital life science research through world-class, collaborative, distributed infrastructure. It aims to keep Australian life science research globally competitive through sustained strategic leadership, research community engagement, digital service provision, training and support.

Teams: Australian BioCommons, QCIF Bioinformatics, Pawsey Supercomputing Research Centre, Sydney Informatics Hub, Janis, Melbourne Data Analytics Platform (MDAP), Galaxy Australia, National Computational Infrastructure (NCI) WorkflowHub team

Web page: https://www.biocommons.org.au/

Teams

Australian BioCommons

Space: Australian BioCommons

Public web page: https://www.biocommons.org.au/

Organisms: Not specified

Galaxy Australia

Galaxy is an open, web-based platform for accessible, reproducible, and transparent computational biological research.

Accessible: Users can easily run tools without writing code or using the CLI; all via a user-friendly web interface.
Reproducible: Galaxy captures all the metadata from an analysis, making it completely reproducible.
Transparent: Users share and publish analyses via interactive pages that can enhance analyses with user annotations.
Scalable: Galaxy ...

Space: Australian BioCommons

Public web page: https://usegalaxy.org.au/

Organisms: Not specified

Organizations

Australian BioCommons

ROR ID: Not specified

Department: Not specified

Country: Australia

City: Not specified

Web page: Not specified

Workflows (showing 20 of 26)

Genome-assessment-post-assembly

Galaxy Australia

Stable

Post-genome assembly quality control workflow using Quast, BUSCO, Meryl, Merqury and Fasta Statistics, with updates November 2024.

Workflow inputs: reads as fastqsanger.gz (not fastq.gz), and primary assembly.fasta. (To change reads format: click on the pencil icon next to the file in the Galaxy history, then "Datatypes", then set "New type" as fastqsanger.gz). Note: the reads should be those that were used for the assembly (i.e., the filtered/cleaned reads), not the raw reads.

What it does: ...

Type: Galaxy

Creators: Kate Farquharson, Gareth Price, Simon Tang, Anna Syme

Submitters: Johan Gustafsson, Anna Syme

DOI: 10.48546/workflowhub.workflow.403.7

Created: 7th Nov 2022 at 07:10, Last updated: 4th Dec 2024 at 02:15

Fgenesh annotation -TSI

Australian BioCommons, Galaxy Australia

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

Inputs required: assembled-genome.fasta, hard-repeat-masked-genome.fasta, and (because this workflow maps known mRNA ...

Type: Galaxy

Creator: Luke Silver

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.881.5

Created: 8th May 2024 at 08:28, Last updated: 17th Nov 2024 at 23:51

Genome assembly workflow for nanopore reads, for TSI

Australian BioCommons, Galaxy Australia

Input:

Nanopore reads (can be in format: fastq, fastq.gz, fastqsanger, or fastqsanger.gz)

Optional settings to specify when the workflow is run:

[1] number of files to split the original input into (to speed up the workflow). Default = 0. Example: set to 2000 to split a 60 GB read file into 2000 files of ~30 MB each.
[2] filtering: minimum average read quality score. Default = 10
[3] filtering: minimum read length. Default = 200
[4] ...
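
As a rough illustration of options [1] to [3] above, the sketch below filters FASTQ records by minimum length and minimum average quality, and reproduces the chunk-size arithmetic from the example. It is not the workflow's own code: the function names are hypothetical, Phred+33 encoding is assumed, and the average is a plain arithmetic mean of Phred scores, which may differ from how the tools inside the Galaxy workflow compute it.

```python
from typing import Iterable, Iterator, Tuple

Record = Tuple[str, str, str]  # (header, sequence, quality string)


def mean_phred(quality: str, offset: int = 33) -> float:
    """Arithmetic mean of Phred scores (assumes Phred+33 encoding)."""
    if not quality:
        return 0.0
    return sum(ord(ch) - offset for ch in quality) / len(quality)


def filter_reads(records: Iterable[Record],
                 min_quality: float = 10.0,
                 min_length: int = 200) -> Iterator[Record]:
    """Yield reads that pass both thresholds (defaults mirror the listed ones)."""
    for header, seq, qual in records:
        if len(seq) >= min_length and mean_phred(qual) >= min_quality:
            yield header, seq, qual


def approx_chunk_size_mb(total_gb: float, n_chunks: int) -> float:
    """Approximate size of each chunk in MB, taking 1 GB as 1000 MB.

    n_chunks must be at least 1; the workflow's behaviour at its
    default of 0 is not modelled here.
    """
    if n_chunks < 1:
        raise ValueError("n_chunks must be at least 1")
    return total_gb * 1000 / n_chunks


# Made-up reads for illustration only.
reads = [
    ("@long_good", "A" * 250, "5" * 250),  # '5' encodes Phred 20
    ("@short", "A" * 100, "5" * 100),      # fails the length filter
    ("@low_qual", "A" * 250, "&" * 250),   # '&' encodes Phred 5
]
kept = [header for header, _, _ in filter_reads(reads)]
print(kept)
print(approx_chunk_size_mb(60, 2000))
```

With the thresholds at their listed defaults, only the first made-up read is kept, and splitting 60 GB into 2000 files gives chunks of about 30 MB.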

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.1114.1

Created: 3rd Sep 2024 at 02:07, Last updated: 3rd Sep 2024 at 02:13

TSI-Scaffolding-with-HiC (based on VGP-HiC-scaffolding)

Australian BioCommons, Galaxy Australia

Scaffolding using HiC data with YaHS

This workflow has been created from a Vertebrate Genomes Project (VGP) scaffolding workflow.

For more information about the VGP project see https://galaxyproject.org/projects/vgp/.
The scaffolding workflow is at https://dockstore.org/workflows/github.com/iwc-workflows/Scaffolding-HiC-VGP8/main:main?tab=info
Please see that link for the workflow diagram.

Some minor changes have been made to better fit with TSI project data:

optional inputs of SAK info ...

Type: Galaxy

Creators: VGP Project, VGP, Galaxy

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.1054.1

Created: 21st Jun 2024 at 01:48, Last updated: 21st Jun 2024 at 01:56

Repeat masking - TSI

Australian BioCommons, Galaxy Australia

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

Workflow information:

Input = genome.fasta.
Outputs = soft_masked_genome.fasta, hard_masked_genome.fasta, ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.875.3

Created: 8th May 2024 at 04:59, Last updated: 21st Jun 2024 at 00:45

Convert formats - TSI

Australian BioCommons, Galaxy Australia

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

About this workflow:

Inputs: transdecoder-peptides.fasta, transdecoder-nucleotides.fasta
Runs many steps ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.880.1

Created: 8th May 2024 at 08:23, Last updated: 9th May 2024 at 05:09

Extract transcripts - TSI

Australian BioCommons, Galaxy Australia

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

About this workflow:

Input: merged_transcriptomes.fasta.
Runs TransDecoder to produce longest_transcripts.fasta ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.879.1

Created: 8th May 2024 at 08:15, Last updated: 9th May 2024 at 05:08

Combine transcripts - TSI

Australian BioCommons, Galaxy Australia

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

About this workflow:

Inputs: multiple transcriptome.gtfs from different tissues, genome.fasta, coding_seqs.fasta, ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.878.1

Created: 8th May 2024 at 08:07, Last updated: 9th May 2024 at 05:06

Find transcripts - TSI

Australian BioCommons, Galaxy Australia

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

About this workflow:

Run this workflow per tissue.
Inputs: masked_genome.fasta and the trimmed RNAseq reads ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.877.1

Created: 8th May 2024 at 07:51, Last updated: 9th May 2024 at 05:04

QC and trimming of RNAseq reads - TSI

Australian BioCommons, Galaxy Australia

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

About this workflow:

Repeat this workflow separately for datasets from different tissues.
Inputs = collections ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.876.1

Created: 8th May 2024 at 07:39, Last updated: 9th May 2024 at 05:02

Partial ref-guided workflow - gstacks and pops

Galaxy Australia, Australian BioCommons

workflow-partial-gstacks-populations

These workflows are part of a set designed to work for RAD-seq data on the Galaxy platform, using the tools from the Stacks program.

Galaxy Australia: https://usegalaxy.org.au/

Stacks: http://catchenlab.life.illinois.edu/stacks/

This workflow is part of the reference-guided stacks workflow, https://workflowhub.eu/workflows/347

This workflow takes in bam files and a population map.

To generate bam files see: https://workflowhub.eu/workflows/351

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

Created: 31st May 2022 at 09:12

Partial ref-guided workflow - bwa mem only

Galaxy Australia, Australian BioCommons

workflow-partial-bwa-mem

These workflows are part of a set designed to work for RAD-seq data on the Galaxy platform, using the tools from the Stacks program.

Galaxy Australia: https://usegalaxy.org.au/

Stacks: http://catchenlab.life.illinois.edu/stacks/

This workflow is part of the reference-guided stacks workflow, https://workflowhub.eu/workflows/347

Inputs

demultiplexed reads in fastq format, may be output from the QC workflow. Files are in a collection.
reference genome in fasta format ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

Created: 31st May 2022 at 09:05

Partial de novo workflow: c-s-g-pops only

Galaxy Australia, Australian BioCommons

workflow-partial-cstacks-sstacks-gstacks

These workflows are part of a set designed to work for RAD-seq data on the Galaxy platform, using the tools from the Stacks program.

Galaxy Australia: https://usegalaxy.org.au/

Stacks: http://catchenlab.life.illinois.edu/stacks/

This workflow takes in ustacks output, and runs cstacks, sstacks and gstacks.

To generate ustacks output see https://workflowhub.eu/workflows/349

For the full de novo workflow see https://workflowhub.eu/workflows/348

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

Created: 31st May 2022 at 08:56

Partial de novo workflow: ustacks only

Galaxy Australia, Australian BioCommons

workflow-partial-ustacks-only

These workflows are part of a set designed to work for RAD-seq data on the Galaxy platform, using the tools from the Stacks program.

Galaxy Australia: https://usegalaxy.org.au/

Stacks: http://catchenlab.life.illinois.edu/stacks/

For the full de novo workflow see https://workflowhub.eu/workflows/348

You may want to run ustacks with different batches of samples.

To be able to combine these later, there are some necessary steps - we need to keep track of how many ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

Created: 31st May 2022 at 08:50

Stacks RAD-seq de novo workflow

Galaxy Australia, Australian BioCommons

workflow-denovo-stacks

These workflows are part of a set designed to work for RAD-seq data on the Galaxy platform, using the tools from the Stacks program.

Galaxy Australia: https://usegalaxy.org.au/

Stacks: http://catchenlab.life.illinois.edu/stacks/

Inputs

demultiplexed reads in fastq format, may be output from the QC workflow. Files are in a collection.
population map in text format
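
For orientation, a Stacks population map is a plain-text, tab-separated file with a sample name in the first column and a population name in the second. The sketch below reads such a file and groups samples by population. The function name and the sample and population names are hypothetical, and skipping lines that start with `#` is a convenience of this sketch rather than something stated by the workflow.

```python
import csv
import io
from collections import defaultdict
from typing import Dict, List, TextIO


def read_population_map(handle: TextIO) -> Dict[str, List[str]]:
    """Group sample names by population from a two-column, tab-separated map."""
    populations: Dict[str, List[str]] = defaultdict(list)
    for row in csv.reader(handle, delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        if len(row) < 2:
            raise ValueError(f"expected sample<TAB>population, got: {row!r}")
        sample, population = row[0].strip(), row[1].strip()
        populations[population].append(sample)
    return dict(populations)


# Hypothetical sample and population names, for illustration only.
example = "sample_01\tnorth\nsample_02\tnorth\nsample_03\tsouth\n"
popmap = read_population_map(io.StringIO(example))
print(popmap)
```

A check like this can help confirm that every sample in the read collection appears in the map before the workflow is run.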

Steps and outputs

ustacks:

input reads go to ustacks.
ustacks assembles the reads into matching ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

Created: 31st May 2022 at 08:39

Stacks RAD-seq reference-guided workflow

Galaxy Australia, Australian BioCommons

workflow-ref-guided-stacks

These workflows are part of a set designed to work for RAD-seq data on the Galaxy platform, using the tools from the Stacks program.

Galaxy Australia: https://usegalaxy.org.au/

Stacks: http://catchenlab.life.illinois.edu/stacks/

Inputs

demultiplexed reads in fastq format, may be output from the QC workflow. Files are in a collection.
population map in text format
reference genome in fasta format

Steps and outputs

BWA MEM 2:

The reads are mapped to the ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

Created: 31st May 2022 at 08:20

QC of RADseq reads

Galaxy Australia, Australian BioCommons

workflow-qc-of-radseq-reads

These workflows are part of a set designed to work for RAD-seq data on the Galaxy platform, using the tools from the Stacks program.

Galaxy Australia: https://usegalaxy.org.au/

Stacks: http://catchenlab.life.illinois.edu/stacks/

Inputs

demultiplexed reads in fastq format, in a collection
two adapter sequences in fasta format, for input into cutadapt

Steps and outputs

The workflow can be modified to suit your own parameters.

The workflow steps are:

Run ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

Created: 31st May 2022 at 07:55

Combined workflows for large genome assembly

Galaxy Australia, Australian BioCommons

Combined workflow for large genome assembly

The tutorial document for this workflow is here: https://doi.org/10.5281/zenodo.5655813

What it does: A workflow for genome assembly, containing subworkflows:

Data QC
Kmer counting
Trim and filter reads
Assembly with Flye
Assembly polishing
Assess genome quality

Inputs:

long reads and short reads in fastq format
reference genome for Quast

Outputs:

Data information - QC, kmers
Filtered, trimmed reads
Genome assembly, assembly graph, ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.230.1

Created: 8th Nov 2021 at 06:08, Last updated: 11th Nov 2021 at 03:37

Assess genome quality

Galaxy Australia, Australian BioCommons

Assess genome quality; can run alone or as part of a combined workflow for large genome assembly.

What it does: Assesses the quality of the genome assembly: generates statistics, determines whether expected genes are present, and aligns contigs to a reference genome.
Inputs: polished assembly; reference_genome.fasta (e.g. of a closely related species, if available).
Outputs: BUSCO table of genes found; Quast HTML report, and link to Icarus contigs browser, showing contigs aligned to a reference ...
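
One statistic that assembly reports such as Quast's include is N50: the contig length at which contigs of that length or longer cover at least half of the total assembly. The sketch below is a minimal illustration of that definition with made-up contig lengths; it is not Quast's implementation, and the function name is hypothetical.

```python
from typing import List


def n50(contig_lengths: List[int]) -> int:
    """Return the length L such that contigs >= L cover at least half the assembly."""
    if not contig_lengths:
        raise ValueError("no contigs given")
    half_total = sum(contig_lengths) / 2
    running = 0
    # Walk from the longest contig down, accumulating length.
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if running >= half_total:
            return length
    raise AssertionError("unreachable for non-empty input")


# Made-up contig lengths: total 290, so half is 145.
lengths = [100, 80, 50, 30, 20, 10]
print(n50(lengths))  # 100 + 80 = 180 >= 145, so this prints 80
```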

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.229.1

Created: 8th Nov 2021 at 06:03, Last updated: 9th Nov 2021 at 01:12

Racon polish with long reads, x4

Galaxy Australia, Australian BioCommons

Assembly polishing subworkflow: Racon polishing with long reads

Inputs: long reads and assembly contigs

Workflow steps:

minimap2: long reads are mapped to the assembly => overlaps.paf.
overlaps, long reads, assembly => Racon => polished assembly 1
using polished assembly 1 as input, repeat minimap2 + Racon => polished assembly 2
using polished assembly 2 as input, repeat minimap2 + Racon => polished assembly 3
using polished assembly 3 as input, repeat minimap2 + Racon => ...
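
The repeated map-then-polish pattern in the steps above can be sketched as a loop that builds the equivalent command lines for each round. This is only an illustration of the iteration, not the Galaxy workflow's actual configuration: the file names and function name are hypothetical, the commands are built as strings rather than executed, and the `map-ont` preset assumes nanopore reads.

```python
from typing import List


def racon_rounds(reads: str, assembly: str, rounds: int = 4,
                 preset: str = "map-ont") -> List[str]:
    """Return shell command strings for repeated minimap2 + Racon polishing."""
    commands: List[str] = []
    current = assembly
    for i in range(1, rounds + 1):
        overlaps = f"overlaps_{i}.paf"
        polished = f"polished_{i}.fasta"
        # Map the long reads to the current assembly; output is PAF by default.
        commands.append(f"minimap2 -x {preset} {current} {reads} > {overlaps}")
        # Racon takes the reads, the overlaps, then the target assembly.
        commands.append(f"racon {reads} {overlaps} {current} > {polished}")
        # The polished output becomes the input to the next round.
        current = polished
    return commands


for command in racon_rounds("long_reads.fastq", "assembly.fasta"):
    print(command)
```

Each round maps the reads against the output of the previous round, so four rounds produce eight commands and a final `polished_4.fasta`.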

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.227.1

Created: 8th Nov 2021 at 05:45, Last updated: 9th Nov 2021 at 01:12

Collections

TSI annotation workflows

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

Maintainers: Anna Syme

Number of items: 7

Tags: TSI-annotation

Created: 8th May 2024 at 08:30, Last updated: 8th May 2024 at 08:36