Workflow Type: Galaxy
Open
MPXV (Mpox) Phylogenetic Analysis with Squirrel
Description
Galaxy workflow to perform MPXV phylogenetic reconstruction using the Squirrel (Some QUIck Reconstruction to Resolve Evolutionary Links) Galaxy tools, Squirrel QC and Squirrel Phylo. The workflow also performs masking (using the squirrel tool) of SNPs.
Galaxy Squirrel tools:
Squirrel QC: The quality control (QC) mode of Squirrel that can run QC on the alignment and flag certain sites to the user that may need to be masked. Squirrel can flag potential issues in the MPXV sequences that have been provided for alignment (e.g. SNPS near tracts of N, clusters of unique SNPs, reversions to reference alleles and convergent mutations) and outputs these in a mask file for investigation.Squirrel Phylo: The Squirrel mode that performs alignment and creation of maximum-likelihood phylogenetic trees. Squirrel allows for rapidly producing reliable alignments for MPXV and also enable maximum-likelihood phylogenetics pipeline tree estimation. Squirrel maps each query genome in the input file against a reference genome specific to each clade usingminimap2. Usinggofasta, the mapping file is then converted into a multiple sequence alignment. Squirrel can also run an APOBEC3-reconstruction and map these mutations onto the phylogeny.
Inputs
- Consensus Sequences: Input consensus sequences for phylogenetic reconstruction. Must be a single clade (Clade I or Clade II) as clades cannot be mixed. [
fasta] - MPXV Clade: MPXV Clade selected from
cladei,cladeia,cladeib,cladeii,cladeiib. [text] - Run APOBEC Analysis: Optionally perform APOBEC3-reconstruction and map mutations onto the phylogeny. [
boolean] - Plot Height: Optional input to set the height of the phylogenetic tree plots in inches. [
integer] - Plot Width: Optional input to set the width of the phylogenetic tree plots in inches. [
integer]
Outputs
- Tree (Newick): The output maximum likelihood tree file from
iqtree2with Node labels that correspond to the reconstruction Node labels. This tree can be viewed in various tree viewers, for example FigTree. [newick] - MSA: The alignment file, with alignment scaffolded against a clade-specific reference. By default one of the ITR regions and a curated set of problematic regions is masked as Ns. [
fasta] - Tree (PNG): Visualisation of reconstructed tree showing whether mutations are consistent with APOBEC3 editing or not. [
png] - Tree (SVG): Visualisation of reconstructed tree showing whether mutations are consistent with APOBEC3 editing or not. [
svg] - Branch AA Reconstruction: A report of each mutation that occurs across the phylogeny, their location, dinucleotide context, APOBEC3 status, which gene they're present in, codon position, amino acid change and a prediction of how extreme that amino acid change is with Grantham score. [
csv] - Branch SNPS Reconstruction: A report of individual site changes mapped to specific branches and their dinucleotide context. [
csv]
Workflow Steps
- Run
Squirrel QCto perform QC on the alignment and flag sites that may need to be masked.Squirrel QCproduces an outputcsvfile of flagged sites that is used to mask these sites bySquirrel Phylo. - Run
Squirrel Phyloto perform the following steps for phylogenetic analysis.- Alignment of consensus sequences to clade specific reference genome using
minimap2. - Masking (replacement with N) of low-complexity or repetitive regions that have been characterised for Clade I and II and sites flagged by
Squirrel QC. - Generate a maximum liklihood phylogenetic tree using
iqtree2. - Optionally, run APOBEC3 ancestral state reconstruction and map these mutations onto the phylogeny using
iqtree2.
- Alignment of consensus sequences to clade specific reference genome using
- Convert the
nexusformat phylogenetic tree output bySquirrel Phylotonewickformat usingRegex Find And Replace.
Inputs
| ID | Name | Description | Type |
|---|---|---|---|
| Consensus Sequences | #main/Consensus Sequences | Input sequences for phylogenetic reconstruction. Must be a single clade (CladeI or CladeII). Do not mix clades. |
|
| MPXV Clade | #main/MPXV Clade | Select MPXV Clade. |
|
| Plot Height (inches) | #main/Plot Height (inches) | Overwrite phylogenetic tree figure default height |
|
| Plot Width (inches) | #main/Plot Width (inches) | Overwrite phylogenetic tree figure default width |
|
| Run APOBEC Analysis | #main/Run APOBEC Analysis | Perform APOBEC3-reconstruction and map mutations onto the phylogeny. |
|
Steps
| ID | Name | Description |
|---|---|---|
| 5 | Squirrel QC | toolshed.g2.bx.psu.edu/repos/iuc/squirrel_qc/squirrel_qc/1.0.13+galaxy0 |
| 6 | Squirrel Phylo | toolshed.g2.bx.psu.edu/repos/iuc/squirrel_phylo/squirrel_phylo/1.0.13+galaxy0 |
| 7 | convert to newick | Uses regex to convert this nhx to newick toolshed.g2.bx.psu.edu/repos/galaxyp/regex_find_replace/regex1/1.0.3 |
Outputs
| ID | Name | Description | Type |
|---|---|---|---|
| MSA | #main/MSA | n/a |
|
| Tree | #main/Tree | n/a |
|
| Tree (PNG) | #main/Tree (PNG) | n/a |
|
| Tree (SVG) | #main/Tree (SVG) | n/a |
|
| aa_recon | #main/aa_recon | n/a |
|
| branch_snps | #main/branch_snps | n/a |
|
| tree | #main/tree | n/a |
|
Version History
Version 1 (earliest) Created 18th Jun 2026 at 01:15 by Tristan Reynolds
Initial commit
Open
master
74718a8
Creators and SubmitterCreators
Submitter
License
Activity
Views: 0 Downloads: 0 Runs: 0
Created: 18th Jun 2026 at 01:15
Annotated Properties
AttributionsNone
Visit source
Run on Galaxy
https://orcid.org/0000-0002-6606-5953