Workflow Type:  Galaxy
        
        
        
  
            
              
                
                     
                
              
            
        
          
            
              
    
      
        
        
    
    
      
        
        
    
    
      
        
        
    
            
          
        
        
      
  
    
      
        
      
Open
    
    
  
      
      Kmer counting step, can run alone or as part of a combined workflow for large genome assembly.
- What it does: Estimates genome size and heterozygosity based on counts of kmers
- Inputs: One set of short reads: e.g. R1.fq.gz
- Outputs: GenomeScope graphs
- Tools used: Meryl, GenomeScope
- Input parameters: None required
- Workflow steps: The tool meryl counts kmers in the input reads (k=21), then converts this into a histogram. GenomeScope: runs a model on the histogram; reports estimates. k-mer size set to 21.
- Options: Use a different kmer counting tool. e.g. khmer.
Infrastructure_deployment_metadata: Galaxy Australia (Galaxy)
Inputs
| ID | Name | Description | Type | 
|---|---|---|---|
| Illumina reads R1 | Illumina reads R1 | n/a | 
 | 
Steps
| ID | Name | Description | 
|---|---|---|
| 1 | Meryl - count kmers | toolshed.g2.bx.psu.edu/repos/iuc/meryl/meryl/1.3+galaxy4 | 
| 2 | Meryl - generate histogram | toolshed.g2.bx.psu.edu/repos/iuc/meryl/meryl/1.3+galaxy4 | 
| 3 | Genomescope | toolshed.g2.bx.psu.edu/repos/iuc/genomescope/genomescope/2.0+galaxy1 | 
Outputs
| ID | Name | Description | Type | 
|---|---|---|---|
| Meryl on input dataset(s): read-db.meryldb | Meryl on input dataset(s): read-db.meryldb | n/a | 
 | 
| _anonymous_output_1 | _anonymous_output_1 | n/a | 
 | 
| _anonymous_output_2 | _anonymous_output_2 | n/a | 
 | 
| _anonymous_output_3 | _anonymous_output_3 | n/a | 
 | 
| _anonymous_output_4 | _anonymous_output_4 | n/a | 
 | 
| GenomeScope on input dataset(s) Transformed log plot | GenomeScope on input dataset(s) Transformed log plot | n/a | 
 | 
| GenomeScope on input dataset(s) Log plot | GenomeScope on input dataset(s) Log plot | n/a | 
 | 
| GenomeScope on input dataset(s) Linear plot | GenomeScope on input dataset(s) Linear plot | n/a | 
 | 
| GenomeScope on input dataset(s) Transformed linear plot | GenomeScope on input dataset(s) Transformed linear plot | n/a | 
 | 
Version History
Version 1 (earliest) Created 8th Nov 2021 at 04:47 by Anna Syme
Added/updated 2 files
Open
 master
masterabe51c7
     Creators and Submitter
 Creators and SubmitterCreator
Submitter
Tool
    
    Citation
  
  
  Syme, A. (2021). kmer counting - meryl. WorkflowHub. https://doi.org/10.48546/WORKFLOWHUB.WORKFLOW.223.1
    License
Activity
Views: 8297 Downloads: 681 Runs: 3
Created: 8th Nov 2021 at 04:47
Last updated: 9th Nov 2021 at 01:10
Annotated Properties
 Attributions
 AttributionsNone
 Collections
 Collections
 Download RO-Crate
Download RO-Crate Run on Galaxy
Run on Galaxy
 BioCommons ‘Bring Y...
        BioCommons ‘Bring Y...

 https://orcid.org/0000-0002-9906-0673
 https://orcid.org/0000-0002-9906-0673

