Viral Amplicon Analysis Pipeline for ONT Data
Description
Galaxy workflow for processing viral amplicon datasets, such as MPXV (Mpox), sequenced using the Oxford Nanopore Technologies (ONT) platform. The workflow uses the latest Fieldbioinformatics Artic Minion pipeline that uses the clair3 variant caller.
The ARTIC minion is a pipeline for working with viral nanopore sequencing data, generated from tiling amplicon schemes. It is designed to help run the artic bioinformatics protocols; for example the MPXV protocol. The ARTIC minion pipeline performs genome mapping, variant calling and consensus assembly.
Inputs
- ONT Reads: Oxford Nanopore Technologies (ONT) sequenced reads from ARTIC assay with fastqsanger encoding. [
fastqsanger,fastqsanger.gz] - Claire3 Model: Basecalling model to be used for processing ONT data. The basecalling model must be uploaded to your Galaxy History. Available models can be downloaded from the Rerio repository: https://github.com/nanoporetech/rerio. Please note: By using these models, you agree to the Oxford Nanopore Technologies, Ltd. Public License agreement [
tar.gz] - Normalise Reads: Sample at most this number of reads per amplicon and strand. This parameter sets the value for the ARTIC minion input
Coverage normalisation depth(Default value400). [integer] - Oxford Nanopore Technologies, Ltd. Public License agreement: Users must agree to the Oxford Nanopore Technologies, Ltd. Public License agreement in order to run this workflow. [
boolean] - Maximum Read Length: Optional input specifying the maximum permitted read length. Reads longer than this value will be discarded during filtering (Default value
2600). [integer] - Minimum Read Length: Optional input specifying the minimum permitted read length. Reads shorter than this value will be discarded during filtering (Default value
200). [integer] - Amplicon scheme name: Name of the amplicon scheme on labs.primalscheme.com to fetch primer BED and reference (Default value
artic-inrb-mpox). - Amplicon scheme version: Version of the amplicon scheme on labs.primalscheme.com in format
vX.X.X(Default valuev1.0.0). - Amplicon scheme amplicon length: Length of the amplicons in the scheme. Only required if the scheme named has multiple lengths (Default value
2500).
NOTE: Alternatively, the primer bed file (primer.bed) and reference genome fasta file (reference.fasta) can be provided from the user's History if previously imported into Galaxy from labs.primalscheme.com.
Outputs
- Collated QC Results:
MultiQCcombined QC reports output byFastplong. [html,tabular] - Collated BAM Files: Collection of the
bamfiles output byArtic minion. [bam] - Collated Alignment Reports: Collection of the alignment reports output by
Artic minion. [tabular] - Collated VCF Outputs: Collection of
vcffiles output byArtic minion. [vcf] - Combined FASTA Outputs: Single
fastafile of all consensus sequences output byArtic minion. [fasta] - Plot Depth: Plot of the mapped read depth calculated by
Mosdepth. [pdf]
Workflow Steps
- Filter ONT reads by length using
Fastplong. - Run the
ARTIC minionpipeline tool to perform the following processing steps.- Align reads to reference genome using
minimap2 - Trim primers
- Call variants (mutations)
- Create consensus genome sequence
- Align reads to reference genome using
- Calculate the mapped read depth using
Mosdepthand convertbedgraphoutput totabularusingConvert bedgraph to tsv. - Rename consensus sequence headers produced by
Artic minionfrom extended path/file names to the sample barcode (e.g.>barcode09/KJ642613.1_masked/ARTIC/clair3 KJ642613.1_masked->>barcode09) usingrename headerandclean header. - Combine
FastplongQC reports usingMultiQC. - Plot mapped read depth using
Plotting tool. - Create a collection of the
Artic minionoutputbamfiles usingBuild list. - Create a collection of the
Artic minionoutput alignment reports usingBuild list. - Create a collection of the
Artic minionoutputvcffiles usingBuild list. - Merge all
Artic minionoutput consensus sequences into a singlefastafile usingConcatenate datasets.
Inputs
| ID | Name | Description | Type |
|---|---|---|---|
| Amplicon scheme amplicon length | Amplicon scheme amplicon length | Length of the amplicons in the scheme. Only required if the scheme named has multiple lengths |
|
| Amplicon scheme name | Amplicon scheme name | Name of the amplicon scheme on labs.primalscheme.com to fetch primer BED and reference for |
|
| Amplicon scheme version | Amplicon scheme version | Version of the amplicon scheme on labs.primalscheme.com in format vX.X.X |
|
| Clair3 Model | Clair3 Model | Model must be tar.gz |
|
| I agree to the terms of the Oxford Nanopore Technologies, Ltd. Public License agreement | I agree to the terms of the Oxford Nanopore Technologies, Ltd. Public License agreement | To use Rerio models (e.g. for ONT R10 chemistry) you must agree to the terms of the Oxford Nanopore Technologies, Ltd. Public License agreement, which can be found in the nanoporetech/rerio Github repository. |
|
| Maximum read length | Maximum read length | Reads longer than this value will be discarded. |
|
| Minimum read length | Minimum read length | Reads shorter than this value will be discarded. |
|
| Normalise Reads | Normalise Reads | Sample at most this number of reads per amplicon and strand. |
|
| ONT Reads | ONT Reads | ONT reads from ARTIC assay with fastqsanger encoding |
|
Steps
| ID | Name | Description |
|---|---|---|
| 9 | Fastplong | toolshed.g2.bx.psu.edu/repos/iuc/fastplong/fastplong/0.3.0+galaxy1 |
| 10 | Artic minion | toolshed.g2.bx.psu.edu/repos/iuc/artic_minion/artic_minion/1.7.3+galaxy0 |
| 11 | Collated - QC Results | toolshed.g2.bx.psu.edu/repos/iuc/multiqc/multiqc/1.27+galaxy3 |
| 12 | Mosdepth: calc depth (bedgraph) | toolshed.g2.bx.psu.edu/repos/iuc/mosdepth/mosdepth/0.3.8+galaxy0 |
| 13 | rename header | Expected header: barcode09/KJ642613.1_masked/ARTIC/clair3 KJ642613.1_masked renamed to: barcode09 toolshed.g2.bx.psu.edu/repos/galaxyp/regex_find_replace/regex1/1.0.3 |
| 14 | Collated - BAM Files | __BUILD_LIST__ |
| 15 | Collate Alignment Reports | __BUILD_LIST__ |
| 16 | Collated - VCF Outputs | __BUILD_LIST__ |
| 17 | Convert bedgraph to tsv | Converts bedgraph to per base tsv for plotting toolshed.g2.bx.psu.edu/repos/iuc/bedtools/bedtools_makewindowsbed/2.31.1+galaxy0 |
| 18 | clean header | Rename from: >{barcode}.fastq.gz to >{barcode} toolshed.g2.bx.psu.edu/repos/galaxyp/regex_find_replace/regex1/1.0.3 |
| 19 | Plot Depth | toolshed.g2.bx.psu.edu/repos/devteam/xy_plot/XY_Plot_1/1.0.2 |
| 20 | Combine fasta outputs | Combine the output of renamed artic consensus. toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_cat/9.5+galaxy0 |
Outputs
| ID | Name | Description | Type |
|---|---|---|---|
| alignment_report | alignment_report | n/a |
|
| html_report | html_report | n/a |
|
| stats_table | stats_table | n/a |
|
| Collated - BAM Files | Collated - BAM Files | n/a |
|
| Collated - VCF Outputs | Collated - VCF Outputs | n/a |
|
| Depth Plots | Depth Plots | n/a |
|
| All Consensus Sequences | All Consensus Sequences | n/a |
|
Version History
Version 1 (earliest) Created 18th Jun 2026 at 01:10 by Tristan Reynolds
Initial commit
Open
master
9341a2e
Creators and SubmitterCreators
Submitter
Views: 0 Downloads: 0 Runs: 0
Created: 18th Jun 2026 at 01:10
AttributionsNone
Run on Galaxy
https://orcid.org/0000-0002-6606-5953