Australian BioCommons

Overview
Related items

The Australian BioCommons enhances digital life science research through world class collaborative distributed infrastructure. It aims to ensure that Australian life science research remains globally competitive, through sustained strategic leadership, research community engagement, digital service provision, training and support.

Space: Australian BioCommons

SEEK ID: https://workflowhub.eu/projects/30

Public web page: https://www.biocommons.org.au/

Organisms: No Organisms specified

WorkflowHub PALs: No PALs for this Team

Team created: 16th Feb 2021

Annotated Properties

Scientific disciplines

Biochemistry, Genetics and Molecular Biology

Related items

Advanced People list for this Team with search and filtering

Ziad Al-Bkhetan

Teams: Australian BioCommons

Organizations: Australian BioCommons

https://orcid.org/0000-0002-4032-5331

Patrick Capon

Teams: Australian BioCommons

Organizations: Australian BioCommons

https://orcid.org/0000-0002-7396-5757

Johan Gustafsson

Teams: Australian BioCommons, Galaxy Australia, ELIXIR Training, ELIXIR Tools platform, National Computational Infrastructure (NCI) WorkflowHub team

Organizations: University of Melbourne, Australian BioCommons

https://orcid.org/0000-0002-2977-5032

Expertise: Biochemistry, Proteomics, Mass Spectrometry Imaging

Tools: Mass spectrometry, Proteomics

Thomas Litfin

Teams: UNSW MWAC Structural Biology Facility, Australian BioCommons

Organizations: UNSW Sydney

https://orcid.org/0000-0002-4863-3865

Expertise: Structural Bioinformatics, Algorithm Development, Deep learning, Structure Prediction

Andrew Lonie

Teams: GalaxyProject SARS-CoV-2, CWL workflow SARS-CoV-2, Australian BioCommons

Organizations: Australian BioCommons

https://orcid.org/0000-0002-2006-3856

Tiff Nelson

Teams: Australian BioCommons

Organizations: The University of Melbourne

https://orcid.org/0000-0002-5341-312X

Lisa Phippard

Teams: Australian BioCommons

Organizations: University of Melbourne, Australian BioCommons

https://orcid.org/0000-0001-8198-9735

Anna Syme

Teams: Galaxy Australia, Australian BioCommons

Organizations: Australian BioCommons

https://orcid.org/0000-0002-9906-0673

https://www.annasyme.com/

Advanced Spaces list for this Team with search and filtering

Australian BioCommons

Teams: Australian BioCommons, QCIF Bioinformatics, Pawsey Supercomputing Research Centre, Sydney Informatics Hub, Janis, Melbourne Data Analytics Platform (MDAP), Galaxy Australia, National Computational Infrastructure (NCI) WorkflowHub team

Web page: https://www.biocommons.org.au/

Advanced Organizations list for this Team with search and filtering

Australian BioCommons

ROR ID: Not specified

Department: Not specified

Country: Australia

City: Not specified

Web page: Not specified

The University of Melbourne

ROR ID: Not specified

Department: Not specified

Country: Australia

City: Not specified

Web page: Not specified

University of Melbourne

ROR ID: Not specified

Department: Not specified

Country: Australia

City: Melbourne

Web page: https://www.unimelb.edu.au/

UNSW Sydney

ROR ID: https://ror.org/03r8z3t63

Department: Not specified

Country: Australia

City: Sydney

Web page: https://www.unsw.edu.au/

Showing 20 out of a possible 44 Workflows Advanced Workflows list for this Team with search and filtering

Convert formats - TSI

Australian BioCommons, Galaxy Australia

Note: Deprecated as of May 2025. The mRNA preprocessing previously performed by this workflow is now built into the Fgenesh annotation workflow (881) Version 4. This workflow is no longer needed in the TSI annotation pipeline. Please use workflow 881 Version 4 directly with TransDecoder CDS output from workflow 879 (Extract transcripts).

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.880.2

Created: 8th May 2024 at 08:23, Last updated: 5th May 2026 at 03:28

Fgenesh annotation -TSI

Australian BioCommons, Galaxy Australia

Fgenesh Annotation - TSI Workflow Description

Overview

One of a series of workflows to annotate a genome, tagged TSI-annotation. Based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

Workflow Sequence

Run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Fgenesh annotation (this workflow)

Inputs Required

Files uploaded by the user:

assembled_genome.fasta — the ...

Type: Galaxy

Creator: Luke Silver

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.881.8

Created: 8th May 2024 at 08:28, Last updated: 5th May 2026 at 03:21

Assembly polishing

Galaxy Australia, Australian BioCommons

Assembly polishing; can run alone or as part of a combined workflow for large genome assembly.

What it does: Polishes (corrects) an assembly, using long reads (with the tools Racon and Medaka) and short reads (with the tool Racon). (Note: medaka is only for nanopore reads, not PacBio reads).
Inputs: assembly to be polished: assembly.fasta; long reads - the same set used in the assembly (e.g. may be raw or filtered) fastq.gz format; short reads, R1 only, in fastq.gz format
Outputs: ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.226.2

Created: 8th Nov 2021 at 05:32, Last updated: 19th Apr 2026 at 02:44

Germline-ShortV @ NCI-Gadi

Australian BioCommons, Sydney Informatics Hub

(Show All)

Work-in-progress

Germline-ShortV @ NCI-Gadi is an implementation of the BROAD Institute's best practice workflow for germline short variant discovery. This implementation is optimised for the National Compute Infrastucture's Gadi HPC, utilising scatter-gather parallelism to enable use of multiple nodes with high CPU or memory efficiency. This workflow requires sample BAM files, which can be generated using the Fastq-to-bam @ NCI-Gadi pipeline. Germline-ShortV can be applied ...

Type: Shell Script

Creators: Tracy Chew, Cali Willet, Georgina Samaha, Rosemarie Sadsad

Submitter: Tracy Chew

DOI: 10.48546/workflowhub.workflow.143.1

Download

Created: 17th Aug 2021 at 05:35, Last updated: 25th Jul 2025 at 03:04

Genome assembly workflow for nanopore reads, for TSI

Australian BioCommons, Galaxy Australia

Genome assembly workflow for nanopore reads, for TSI

Input:

Nanopore reads (can be in format: fastq, fastq.gz, fastqsanger, or fastqsanger.gz)

Optional settings to specify when the workflow is run:

[1] how many input files to split the original input into (to speed up the workflow). default = 0. example: set to 2000 to split a 60 GB read file into 2000 files of ~ 30 MB.
[2] filtering: min average read quality score. default = 10
[3] filtering: min read length. default = 200
[4] ...

Type: Galaxy

Creator: Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.1114.1

Created: 3rd Sep 2024 at 02:07, Last updated: 3rd Sep 2024 at 02:13

TSI-Scaffolding-with-HiC (based on VGP-HiC-scaffolding)

Australian BioCommons, Galaxy Australia

Scaffolding using HiC data with YAHS

This workflow has been created from a Vertebrate Genomes Project (VGP) scaffolding workflow.

For more information about the VGP project see https://galaxyproject.org/projects/vgp/.
The scaffolding workflow is at https://dockstore.org/workflows/github.com/iwc-workflows/Scaffolding-HiC-VGP8/main:main?tab=info
Please see that link for the workflow diagram.

Some minor changes have been made to better fit with TSI project data:

optional inputs of SAK info ...

Type: Galaxy

Creators: VGP Project, VGP, Galaxy

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.1054.1

Created: 21st Jun 2024 at 01:48, Last updated: 21st Jun 2024 at 01:56

Repeat masking - TSI

Australian BioCommons, Galaxy Australia

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

Workflow information:

Input = genome.fasta.
Outputs = soft_masked_genome.fasta, hard_masked_genome.fasta, ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.875.3

Created: 8th May 2024 at 04:59, Last updated: 21st Jun 2024 at 00:45

Extract transcripts - TSI

Australian BioCommons, Galaxy Australia

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

About this workflow:

Input: merged_transcriptomes.fasta.
Runs TransDecoder to produce longest_transcripts.fasta ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.879.1

Created: 8th May 2024 at 08:15, Last updated: 9th May 2024 at 05:08

Combine transcripts - TSI

Australian BioCommons, Galaxy Australia

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

About this workflow:

Inputs: multiple transcriptome.gtfs from different tissues, genome.fasta, coding_seqs.fasta, ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.878.3

Created: 8th May 2024 at 08:07, Last updated: 9th May 2024 at 05:06

Find transcripts - TSI

Australian BioCommons, Galaxy Australia

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

About this workflow:

Run this workflow per tissue.
Inputs: masked_genome.fasta and the trimmed RNAseq reads ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.877.1

Created: 8th May 2024 at 07:51, Last updated: 9th May 2024 at 05:04

QC and trimming of RNAseq reads - TSI

Australian BioCommons, Galaxy Australia

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Convert formats
Fgenesh annotation

About this workflow:

Repeat this workflow separately for datasets from different tissues.
Inputs = collections ...

Type: Galaxy

Creators: Luke Silver, Anna Syme

Submitter: Anna Syme

DOI: 10.48546/workflowhub.workflow.876.1

Created: 8th May 2024 at 07:39, Last updated: 9th May 2024 at 05:02

Genome-assessment-post-assembly

Australian BioCommons

Post-genome assembly quality control workflow using Quast, BUSCO, Meryl, Merqury and Fasta Statistics. Updates November 2023. Inputs: reads as fastqsanger.gz (not fastq.gz), and assembly.fasta. New default settings for BUSCO: lineage = eukaryota; for Quast: lineage = eukaryotes, genome = large. Reports assembly stats into a table called metrics.tsv, including selected metrics from Fasta Stats, and read coverage; reports BUSCO versions and dependencies; and displays these tables in the workflow ...

Type: Galaxy

Creators: Gareth Price, Anna Syme

Submitter: Johan Gustafsson

Created: 13th Mar 2024 at 23:32

pipesnake

Australian BioCommons

Stable

Welcome to the pipesnake. Let's get started.

Introduction

pipesnake is a bioinformatics best-practice analysis pipeline for phylogenomic reconstruction starting from short-read 'second-generation' sequencing data.

The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It uses Docker/Singularity ...

Type: Nextflow

Creators: Ziad Al-Bkhetan, Ian Brennan

Submitter: Ziad Al-Bkhetan

Created: 2nd Feb 2024 at 05:15, Last updated: 2nd Feb 2024 at 05:17

Somatic-ShortV-nf

Sydney Informatics Hub, Australian BioCommons

(Show All)

Work-in-progress

This is a Nextflow implementaion of the GATK Somatic Short Variant Calling workflow. This workflow can be used to discover somatic short variants (SNVs and indels) from tumour and matched normal BAM files following GATK's Best Practices Workflow. The workflowis currently optimised to run efficiently and at scale on the National Compute Infrastructure, Gadi.

Type: Nextflow

Creators: Nandan Deshpande, Tracy Chew, Cali Willet, Georgina Samaha

Submitter: Georgina Samaha

DOI: 10.48546/workflowhub.workflow.691.1

Created: 20th Dec 2023 at 01:12, Last updated: 20th Dec 2023 at 01:16

GermlineStructuralV-nf

Sydney Informatics Hub, Australian BioCommons

(Show All)

GermlineStructuralV-nf is a pipeline for identifying structural variant events in human Illumina short read whole genome sequence data. GermlineStructuralV-nf identifies structural variant and copy number events from BAM files using Manta, Smoove, and TIDDIT. Variants are then merged using SURVIVOR, ...

Type: Nextflow

Creators: Georgina Samaha, Marina Kennerson, Tracy Chew, Sarah Beecroft

Submitter: Georgina Samaha

DOI: 10.48546/workflowhub.workflow.431.1

Created: 31st Jan 2023 at 23:40, Last updated: 18th Dec 2023 at 05:36

HiFi de novo genome assembly workflow

Australian BioCommons

Stable

HiFi de novo genome assembly workflow

HiFi-assembly-workflow is a bioinformatics pipeline that can be used to analyse Pacbio CCS reads for de novo genome assembly using PacBio Circular Consensus Sequencing (CCS) reads. This workflow is implemented in Nextflow and has 3 major sections.

Please refer to the following documentation for detailed description of each workflow section:

[Adapter filtration and pre-assembly quality control ...

Type: Nextflow

Creators: Naga Kasinadhuni, Ziad Al-Bkhetan, Martha Zakrzewski, Kenneth Chan, Uwe Winter, Johan Gustafsson

Submitter: Johan Gustafsson

Created: 31st Aug 2023 at 08:41, Last updated: 31st Aug 2023 at 08:52

IGVreport-nf

Sydney Informatics Hub, Australian BioCommons

Work-in-progress

IGVreport-nf

Description
Diagram
User guide
Workflow summaries
Metadata
Component tools
Required (minimum) inputs/parameters
Additional notes
Help/FAQ/Troubleshooting
Acknowledgements/citations/credits

Description

Quickly generate [IGV .html ...

Type: Nextflow

Creators: Georgina Samaha, Tracy Chew

Submitter: Georgina Samaha

Created: 21st Mar 2023 at 05:17

PacBio HiFi genome assembly using hifiasm v2.1

Galaxy Australia, Australian BioCommons

Stable

PacBio HiFi genome assembly using hifiasm v2.1

General usage recommendations

Please see the Genome assembly with hifiasm on Galaxy Australia guide.

See change log

Acknowledgements

The workflow & the doc_guidelines template used are supported by the Australian BioCommons via Bioplatforms Australia funding, the Australian ...

Type: Galaxy

Creators: Gareth Price, Katherine Farquharson

Submitter: Johan Gustafsson

DOI: 10.48546/workflowhub.workflow.221.3

Created: 26th Oct 2021 at 01:25, Last updated: 24th Oct 2022 at 02:55

Purge duplicates from hifiasm assembly v1.0

Galaxy Australia, Australian BioCommons

Stable

Purge-duplicates-from-hifiasm-assembly

General recommendations for using `Purge-duplicates-from-hifiasm-assembly`

Please see the Genome assembly with hifiasm on Galaxy Australia guide.

Acknowledgements

The workflow & the doc_guidelines template used are supported by the Australian BioCommons via Bioplatforms Australia funding, the Australian ...

Type: Galaxy

Creators: Gareth Price, Gareth Price

Submitter: Johan Gustafsson

DOI: 10.48546/workflowhub.workflow.237.2

Created: 15th Nov 2021 at 01:39, Last updated: 17th Oct 2022 at 03:53

BAM to FASTQ + QC v1.0

Australian BioCommons, Galaxy Australia

Stable

BAM-to-FASTQ-QC

General recommendations for using BAM-to-FASTQ-QC

Please see the Genome assembly with hifiasm on Galaxy Australia guide.

Acknowledgements

The workflow & the doc_guidelines template used are supported by the Australian BioCommons via Bioplatforms Australia funding, the Australian Research Data Commons (https://doi.org/10.47486/PL105) ...

Type: Galaxy

Creator: Gareth Price

Submitter: Johan Gustafsson

DOI: 10.48546/workflowhub.workflow.220.2

Created: 21st Oct 2021 at 06:52, Last updated: 17th Oct 2022 at 03:51

View all 44 Workflows

Advanced Collections list for this Team with search and filtering

Australian Structural Biology Computing (ASBC) community collection

This collection brings together a diverse set of workflows developed by members of the Australian Structural Biology Computing (ASBC) community. These workflows provide reusable solutions to common challenges in structural biology research.

Created and maintained by researchers, software developers, and facility staff from institutions across Australia, the workflows reflect community best practices and real-world experience. By sharing ...

Maintainers: Johan Gustafsson, Thomas Litfin

Number of items: 1

Tags: Not specified

Created: 28th Apr 2026 at 02:37, Last updated: 12th Jun 2026 at 07:46

TSI annotation workflows

This is part of a series of workflows to annotate a genome, tagged with TSI-annotation. These workflows are based on command-line code by Luke Silver, converted into Galaxy Australia workflows.

The workflows can be run in this order:

Repeat masking
RNAseq QC and read trimming
Find transcripts
Combine transcripts
Extract transcripts
Fgenesh annotation

Update May 2026: Note: The "convert formats" workflow (880) is now no longer needed.

Maintainers: Anna Syme

Number of items: 7

Tags: TSI-annotation

Created: 8th May 2024 at 08:30, Last updated: 5th May 2026 at 03:29

BioCommons ‘Bring Your Own Data’ Expansion Project

This ARDC and BioCommons sponsored project delivers a key component of BioCommon’s vision for an ecosystem of platforms providing researchers with sophisticated data analysis and digital asset stewardship capabilities. The Bring Your Own Data (BYOD) Platform (https://www.biocommons.org.au/byod-expansion) has enabled highly accessible, highly available, highly scalable analysis and data sharing capabilities for the benefit of life science researchers nationally.

**This WorkflowHub collection ...

Maintainers: Johan Gustafsson, Lisa Phippard

Number of items: 39

Tags: Australian BioCommons

Created: 1st Dec 2022 at 01:26, Last updated: 14th Feb 2023 at 23:14

HiFi genome assembly on Galaxy

No description specified

Maintainers: Johan Gustafsson

Number of items: 4

Tags: Genome assembly, HiFi, Galaxy

Created: 8th Sep 2022 at 01:22, Last updated: 14th Nov 2022 at 04:56

Australian BioCommons

Related items

Fgenesh Annotation - TSI Workflow Description

Overview

Workflow Sequence

Inputs Required

Scaffolding using HiC data with YAHS

Introduction

HiFi de novo genome assembly workflow

IGVreport-nf

Description

PacBio HiFi genome assembly using hifiasm v2.1

General usage recommendations

See change log

Acknowledgements

Purge-duplicates-from-hifiasm-assembly

General recommendations for using Purge-duplicates-from-hifiasm-assembly

Acknowledgements

BAM-to-FASTQ-QC

General recommendations for using BAM-to-FASTQ-QC

Acknowledgements

General recommendations for using `Purge-duplicates-from-hifiasm-assembly`