MPXV (Mpox) Phylogenetic Analysis with Squirrel
Version 1

Workflow Type: Galaxy

MPXV (Mpox) Phylogenetic Analysis with Squirrel

Description

Galaxy workflow to perform MPXV phylogenetic reconstruction using the Squirrel (Some QUIck Reconstruction to Resolve Evolutionary Links) Galaxy tools, Squirrel QC and Squirrel Phylo. The workflow also performs masking (using the squirrel tool) of SNPs.

Galaxy Squirrel tools:

  • Squirrel QC: The quality control (QC) mode of Squirrel that can run QC on the alignment and flag certain sites to the user that may need to be masked. Squirrel can flag potential issues in the MPXV sequences that have been provided for alignment (e.g. SNPS near tracts of N, clusters of unique SNPs, reversions to reference alleles and convergent mutations) and outputs these in a mask file for investigation.
  • Squirrel Phylo: The Squirrel mode that performs alignment and creation of maximum-likelihood phylogenetic trees. Squirrel allows for rapidly producing reliable alignments for MPXV and also enable maximum-likelihood phylogenetics pipeline tree estimation. Squirrel maps each query genome in the input file against a reference genome specific to each clade using minimap2. Using gofasta, the mapping file is then converted into a multiple sequence alignment. Squirrel can also run an APOBEC3-reconstruction and map these mutations onto the phylogeny.

Inputs

  • Consensus Sequences: Input consensus sequences for phylogenetic reconstruction. Must be a single clade (Clade I or Clade II) as clades cannot be mixed. [fasta]
  • MPXV Clade: MPXV Clade selected from cladei,cladeia,cladeib,cladeii,cladeiib. [text]
  • Run APOBEC Analysis: Optionally perform APOBEC3-reconstruction and map mutations onto the phylogeny. [boolean]
  • Plot Height: Optional input to set the height of the phylogenetic tree plots in inches. [integer]
  • Plot Width: Optional input to set the width of the phylogenetic tree plots in inches. [integer]

Outputs

  • Tree (Newick): The output maximum likelihood tree file from iqtree2 with Node labels that correspond to the reconstruction Node labels. This tree can be viewed in various tree viewers, for example FigTree. [newick]
  • MSA: The alignment file, with alignment scaffolded against a clade-specific reference. By default one of the ITR regions and a curated set of problematic regions is masked as Ns. [fasta]
  • Tree (PNG): Visualisation of reconstructed tree showing whether mutations are consistent with APOBEC3 editing or not. [png]
  • Tree (SVG): Visualisation of reconstructed tree showing whether mutations are consistent with APOBEC3 editing or not. [svg]
  • Branch AA Reconstruction: A report of each mutation that occurs across the phylogeny, their location, dinucleotide context, APOBEC3 status, which gene they're present in, codon position, amino acid change and a prediction of how extreme that amino acid change is with Grantham score. [csv]
  • Branch SNPS Reconstruction: A report of individual site changes mapped to specific branches and their dinucleotide context. [csv]

Workflow Steps

  1. Run Squirrel QC to perform QC on the alignment and flag sites that may need to be masked. Squirrel QC produces an output csv file of flagged sites that is used to mask these sites by Squirrel Phylo.
  2. Run Squirrel Phylo to perform the following steps for phylogenetic analysis.
    • Alignment of consensus sequences to clade specific reference genome using minimap2.
    • Masking (replacement with N) of low-complexity or repetitive regions that have been characterised for Clade I and II and sites flagged by Squirrel QC.
    • Generate a maximum liklihood phylogenetic tree using iqtree2.
    • Optionally, run APOBEC3 ancestral state reconstruction and map these mutations onto the phylogeny using iqtree2.
  3. Convert the nexus format phylogenetic tree output by Squirrel Phylo to newick format using Regex Find And Replace.

Inputs

ID Name Description Type
Consensus Sequences #main/Consensus Sequences Input sequences for phylogenetic reconstruction. Must be a single clade (CladeI or CladeII). Do not mix clades.
  • File
MPXV Clade #main/MPXV Clade Select MPXV Clade.
  • string
Plot Height (inches) #main/Plot Height (inches) Overwrite phylogenetic tree figure default height
  • int
Plot Width (inches) #main/Plot Width (inches) Overwrite phylogenetic tree figure default width
  • int
Run APOBEC Analysis #main/Run APOBEC Analysis Perform APOBEC3-reconstruction and map mutations onto the phylogeny.
  • boolean

Steps

ID Name Description
5 Squirrel QC toolshed.g2.bx.psu.edu/repos/iuc/squirrel_qc/squirrel_qc/1.0.13+galaxy0
6 Squirrel Phylo toolshed.g2.bx.psu.edu/repos/iuc/squirrel_phylo/squirrel_phylo/1.0.13+galaxy0
7 convert to newick Uses regex to convert this nhx to newick toolshed.g2.bx.psu.edu/repos/galaxyp/regex_find_replace/regex1/1.0.3

Outputs

ID Name Description Type
MSA #main/MSA n/a
  • File
Tree #main/Tree n/a
  • File
Tree (PNG) #main/Tree (PNG) n/a
  • File
Tree (SVG) #main/Tree (SVG) n/a
  • File
aa_recon #main/aa_recon n/a
  • File
branch_snps #main/branch_snps n/a
  • File
tree #main/tree n/a
  • File

Version History

Version 1 (earliest) Created 18th Jun 2026 at 01:15 by Tristan Reynolds

Initial commit


Open master 74718a8
help Creators and Submitter
Creators
Submitter
Activity

Views: 0   Downloads: 0   Runs: 0

Created: 18th Jun 2026 at 01:15

Annotated Properties
Topic annotations
Scientific disciplines
Immunology and Microbiology
help Attributions

None

Total size: 682 KB
Powered by
(v.1.17.3)
Copyright © 2008 - 2026 The University of Manchester and HITS gGmbH