Workflow Type:  Galaxy
        
        
        
  
        
          
            
              
    
      
        
        
    
    
      
        
        
    
    
      
        
        
    
            
          
        
        
      
  
    
      
        
      
Frozen
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
    
    
  
      
  
    
      
        
      
Frozen
    
    
  
      
      Purge contigs marked as duplicates by purge_dups in a single haplotype (could be haplotypic duplication or overlap duplication). If you think the purged contigs might belong to the other haplotype, use the workflow VGP6 instead. This workflow is the 6th workflow of the VGP pipeline. It is meant to be run after one of the contigging steps (Workflow 3, 4, or 5).
Inputs
| ID | Name | Description | Type | 
|---|---|---|---|
| Assembly Name | Assembly Name | For workflow report. | 
 | 
| Assembly to leave alone (For Merqury comparison) | Assembly to leave alone (For Merqury comparison) | Assembly that does not need purging. | 
 | 
| Assembly to purge | Assembly to purge | Assembly containing duplications to be purged. | 
 | 
| Database for Busco Lineage | Database for Busco Lineage | Database to use for Busco lineages. | 
 | 
| Estimated genome size - Parameter File | Estimated genome size - Parameter File | Estimated genome file obtained in the contiging workflow. | 
 | 
| Genomescope model parameters | Genomescope model parameters | Model parameters obtained in the k-mer profiling workflow. | 
 | 
| Haplotype | Haplotype | For workflow report. | 
 | 
| Lineage | Lineage | Taxonomic lineage for the organism being assembled for Busco analysis | 
 | 
| Meryl Database | Meryl Database | Meryl database obtained in the k-mer profiling workflow. | 
 | 
| Name of purged assembly | Name of purged assembly | n/a | 
 | 
| Name of un-altered assembly | Name of un-altered assembly | n/a | 
 | 
| Pacbio Reads Collection - Trimmed | Pacbio Reads Collection - Trimmed | Trimmed PacBio HiFi reads—outputs of cutadapt in the contiging workflow. | 
 | 
| Species Name | Species Name | For workflow report. | 
 | 
Steps
| ID | Name | Description | 
|---|---|---|
| 13 | Compose text parameter value | toolshed.g2.bx.psu.edu/repos/iuc/compose_text_param/compose_text_param/0.1.1 | 
| 14 | Compose text parameter value | toolshed.g2.bx.psu.edu/repos/iuc/compose_text_param/compose_text_param/0.1.1 | 
| 15 | Compose text parameter value | toolshed.g2.bx.psu.edu/repos/iuc/compose_text_param/compose_text_param/0.1.1 | 
| 16 | Compute | toolshed.g2.bx.psu.edu/repos/devteam/column_maker/Add_a_column1/2.1 | 
| 17 | Map with minimap2 | toolshed.g2.bx.psu.edu/repos/iuc/minimap2/minimap2/2.28+galaxy2 | 
| 18 | Purge overlaps | toolshed.g2.bx.psu.edu/repos/iuc/purge_dups/purge_dups/1.2.6+galaxy0 | 
| 19 | gfastats | toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.11+galaxy0 | 
| 20 | Estimated genome size | param_value_from_file | 
| 21 | Compose text parameter value | toolshed.g2.bx.psu.edu/repos/iuc/compose_text_param/compose_text_param/0.1.1 | 
| 22 | Cut | Cut1 | 
| 23 | Cut | Cut1 | 
| 24 | Map with minimap2 | toolshed.g2.bx.psu.edu/repos/iuc/minimap2/minimap2/2.28+galaxy2 | 
| 25 | gfastats_data_prep | n/a | 
| 26 | gfastats | toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.11+galaxy0 | 
| 27 | Parse parameter value | param_value_from_file | 
| 28 | Parse parameter value | param_value_from_file | 
| 29 | Text reformatting | toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.5+galaxy2 | 
| 30 | Purge overlaps | toolshed.g2.bx.psu.edu/repos/iuc/purge_dups/purge_dups/1.2.6+galaxy0 | 
| 31 | Purge overlaps | toolshed.g2.bx.psu.edu/repos/iuc/purge_dups/purge_dups/1.2.6+galaxy0 | 
| 32 | Remove REPEATs from BED | toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_grep_tool/9.5+galaxy2 | 
| 33 | Purge overlaps | toolshed.g2.bx.psu.edu/repos/iuc/purge_dups/purge_dups/1.2.6+galaxy0 | 
| 34 | Merqury | toolshed.g2.bx.psu.edu/repos/iuc/merqury/merqury/1.3+galaxy4 | 
| 35 | gfastats | toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.11+galaxy0 | 
| 36 | Busco | toolshed.g2.bx.psu.edu/repos/iuc/busco/busco/5.8.0+galaxy1 | 
| 37 | Convert purged fasta to gfa | toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.11+galaxy0 | 
| 38 | gfastats | toolshed.g2.bx.psu.edu/repos/bgruening/gfastats/gfastats/1.3.11+galaxy0 | 
| 39 | compleasm | toolshed.g2.bx.psu.edu/repos/iuc/compleasm/compleasm/0.2.6+galaxy3 | 
| 40 | merqury_QV | __EXTRACT_DATASET__ | 
| 41 | output_merqury.spectra-cn.fl | __EXTRACT_DATASET__ | 
| 42 | output_merqury.spectra-asm.fl | __EXTRACT_DATASET__ | 
| 43 | output_merqury.assembly_01.spectra-cn.fl | __EXTRACT_DATASET__ | 
| 44 | merqury_stats | __EXTRACT_DATASET__ | 
| 45 | output_merqury.assembly_02.spectra-cn.fl | __EXTRACT_DATASET__ | 
| 46 | gfastats_data_prep | n/a | 
| 47 | Text reformatting | toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_awk_tool/9.5+galaxy2 | 
| 48 | gfastats_plot | n/a | 
| 49 | Join two Datasets | join1 | 
| 50 | Advanced Cut | toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_cut_tool/9.5+galaxy2 | 
| 51 | Replace | toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_find_and_replace/9.5+galaxy2 | 
Outputs
| ID | Name | Description | Type | 
|---|---|---|---|
| Species for report | Species for report | n/a | 
 | 
| Assembly for report | Assembly for report | n/a | 
 | 
| Haplotype for report | Haplotype for report | n/a | 
 | 
| Lineage for report | Lineage for report | n/a | 
 | 
| Cutoffs | Cutoffs | n/a | 
 | 
| Read Coverage and cutoffs calculation: Histogram plot | Read Coverage and cutoffs calculation: Histogram plot | n/a | 
 | 
| Purged assembly | Purged assembly | n/a | 
 | 
| Removed haplotigs | Removed haplotigs | n/a | 
 | 
| Merqury on Phased assemblies: Images | Merqury on Phased assemblies: Images | n/a | 
 | 
| Merqury on Phased assemblies: stats | Merqury on Phased assemblies: stats | n/a | 
 | 
| qv_files | qv_files | n/a | 
 | 
| Busco on Purged Primary assembly: summary image | Busco on Purged Primary assembly: summary image | n/a | 
 | 
| Busco on Purged Primary assembly: short summary | Busco on Purged Primary assembly: short summary | n/a | 
 | 
| Purged assembly (GFA) | Purged assembly (GFA) | n/a | 
 | 
| Purged assembly statistics | Purged assembly statistics | n/a | 
 | 
| Compleasm on purged Assembly: Full Table | Compleasm on purged Assembly: Full Table | n/a | 
 | 
| Compleasm on purged Assembly: Miniprot | Compleasm on purged Assembly: Miniprot | n/a | 
 | 
| Compleasm on purged Assembly: Translated Proteins | Compleasm on purged Assembly: Translated Proteins | n/a | 
 | 
| Compleasm on purged Assembly: Full Table Busco | Compleasm on purged Assembly: Full Table Busco | n/a | 
 | 
| merqury_QV | merqury_QV | n/a | 
 | 
| output_merqury.spectra-cn.fl | output_merqury.spectra-cn.fl | n/a | 
 | 
| output_merqury.spectra-asm.fl | output_merqury.spectra-asm.fl | n/a | 
 | 
| output_merqury.assembly_01.spectra-cn.fl | output_merqury.assembly_01.spectra-cn.fl | n/a | 
 | 
| merqury_stats | merqury_stats | n/a | 
 | 
| output_merqury.assembly_02.spectra-cn.fl | output_merqury.assembly_02.spectra-cn.fl | n/a | 
 | 
| Nx Plot | Nx Plot | n/a | 
 | 
| Size Plot | Size Plot | n/a | 
 | 
| Assembly statistics for both assemblies | Assembly statistics for both assemblies | n/a | 
 | 
| clean_stats | clean_stats | n/a | 
 | 
Version History
v0.8.4 (latest) Created 1st Oct 2025 at 03:01 by WorkflowHub Bot
Updated to v0.8.4
Frozen
 v0.8.4
v0.8.434af902
    v0.1 (earliest) Created 15th Feb 2024 at 03:01 by WorkflowHub Bot
Updated to v0.1
Frozen
 v0.1
v0.149773bd
     Creators and Submitter
 Creators and SubmitterCreators
Not specifiedAdditional credit
Galaxy, VGP
Submitter
Activity
Views: 12695 Downloads: 21697 Runs: 3
Created: 15th Feb 2024 at 03:01
Last updated: 1st Oct 2025 at 03:01
 Tags
 TagsThis item has not yet been tagged.
 Attributions
 AttributionsNone

 View on GitHub
View on GitHub Download RO-Crate
Download RO-Crate Run on Galaxy
Run on Galaxy


