Workflow Type: Galaxy
Frozen
This workflow starts from a set of genome assemblies from different samples, strains, or species. Each genome is first annotated with Funannotate. Predicted proteins are further annotated with BUSCO. Next, Proteinortho finds orthologs across the samples and groups them into orthogroups. Orthogroups in which all samples are represented are extracted. The orthologs in each extracted orthogroup are then aligned with ClustalW. Test dataset: https://zenodo.org/record/6610704#.Ypn3FzlBw5k
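The step "orthogroups where all samples are represented are extracted" can be sketched in Python as follows. This is a minimal illustration, not the workflow's actual Filter configuration, which is not given here. It assumes the usual Proteinortho tabular layout (a header line starting with `# Species`, three summary columns, then one column per input proteome, with `*` marking a sample that has no member in the group). The function name `complete_orthogroups` and the example data are hypothetical.

```python
import csv
import io


def complete_orthogroups(tsv_text):
    """Return (sample_names, rows) for orthogroups that contain every sample.

    Assumes a Proteinortho-style table: columns 1-3 are summary values and
    each remaining column lists the proteins of one sample ('*' if none).
    """
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t")
    header = next(reader)          # "# Species", "Genes", "Alg.-Conn.", samples...
    samples = header[3:]
    kept = []
    for row in reader:
        if not row or row[0].startswith("#"):
            continue               # skip blank lines and extra comment lines
        members = row[3:]
        # Keep the group only if no sample column is the '*' placeholder.
        if len(members) == len(samples) and all(m != "*" for m in members):
            kept.append(row)
    return samples, kept


# Hypothetical three-sample table; the second group lacks a protein from B.
example = (
    "# Species\tGenes\tAlg.-Conn.\tA.faa\tB.faa\tC.faa\n"
    "3\t3\t1\ta1\tb1\tc1\n"
    "2\t2\t1\ta2\t*\tc2\n"
    "3\t4\t0.8\ta3,a4\tb3\tc3\n"
)
samples, kept = complete_orthogroups(example)
for row in kept:
    print(dict(zip(samples, row[3:])))
```

Note that this sketch keeps groups with paralogs (such as `a3,a4`). If strictly single-copy orthogroups are wanted, an additional check that the gene count equals the species count would be needed.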
Inputs
| ID | Name | Description | Type |
|---|---|---|---|
| Input genomes as collection | Input genomes as collection | n/a | |
Steps
| ID | Name | Description |
|---|---|---|
| 1 | Replace Text | toolshed.g2.bx.psu.edu/repos/bgruening/text_processing/tp_replace_in_line/1.1.2 |
| 2 | RepeatMasker | toolshed.g2.bx.psu.edu/repos/bgruening/repeat_masker/repeatmasker_wrapper/4.1.2-p1+galaxy0 |
| 3 | Funannotate predict annotation | toolshed.g2.bx.psu.edu/repos/iuc/funannotate_predict/funannotate_predict/1.8.9+galaxy2 |
| 4 | Extract ORF | toolshed.g2.bx.psu.edu/repos/bgruening/glimmer_gbk_to_orf/glimmer_gbk_to_orf/3.02 |
| 5 | Regex Find And Replace | toolshed.g2.bx.psu.edu/repos/galaxyp/regex_find_replace/regex1/1.0.1 |
| 6 | Collapse Collection | toolshed.g2.bx.psu.edu/repos/nml/collapse_collections/collapse_dataset/4.2 |
| 7 | Proteinortho | toolshed.g2.bx.psu.edu/repos/iuc/proteinortho/proteinortho/6.0.14+galaxy2.9.1 |
| 8 | Busco | toolshed.g2.bx.psu.edu/repos/iuc/busco/busco/4.1.4 |
| 9 | Filter | Filter1 |
| 10 | Proteinortho grab proteins | toolshed.g2.bx.psu.edu/repos/iuc/proteinortho_grab_proteins/proteinortho_grab_proteins/6.0.14+galaxy2.9.1 |
| 11 | Regex Find And Replace | toolshed.g2.bx.psu.edu/repos/galaxyp/regex_find_replace/regex1/1.0.1 |
| 12 | ClustalW | toolshed.g2.bx.psu.edu/repos/devteam/clustalw/clustalw/2.1 |
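Several of the steps above rewrite FASTA headers before the proteomes are pooled into one file. The exact patterns used by the Replace Text and Regex Find And Replace steps are not given here, so the following is only a sketch of the general idea, assuming the goal is to prefix each header with its sample name so that proteins stay traceable after pooling. The function name `prefix_fasta_headers` and the `|` separator are hypothetical choices.

```python
def prefix_fasta_headers(fasta_text, sample, sep="|"):
    """Prepend a sample name to every FASTA header line.

    Sequence lines are passed through unchanged. The separator is an
    illustrative choice, not taken from the workflow.
    """
    lines = fasta_text.splitlines()
    if not lines:
        return ""
    out = []
    for line in lines:
        if line.startswith(">"):
            out.append(f">{sample}{sep}{line[1:]}")
        else:
            out.append(line)
    return "\n".join(out) + "\n"


# Hypothetical two-protein proteome from a sample called "strainA".
fasta = ">p1 hypothetical protein\nMKV\n>p2\nMAA\n"
print(prefix_fasta_headers(fasta, "strainA"), end="")
```

Tagging headers this way before collapsing the collection means each member of an orthogroup can still be matched to its source genome when reading the alignments.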
Outputs
| ID | Name | Description | Type |
|---|---|---|---|
| headers_shortened | headers_shortened | n/a | |
| repeat_masked | repeat_masked | n/a | |
| funannotate_predicted_proteins | funannotate_predicted_proteins | n/a | |
| extracted_ORFs | extracted_ORFs | n/a | |
| _anonymous_output_1 | _anonymous_output_1 | n/a | |
| sample_names_to_headers | sample_names_to_headers | n/a | |
| proteomes_to_one_file | proteomes_to_one_file | n/a | |
| _anonymous_output_2 | _anonymous_output_2 | n/a | |
| Proteinortho on input dataset(s): orthology-groups | Proteinortho on input dataset(s): orthology-groups | n/a | |
| _anonymous_output_3 | _anonymous_output_3 | n/a | |
| _anonymous_output_4 | _anonymous_output_4 | n/a | |
| _anonymous_output_5 | _anonymous_output_5 | n/a | |
| _anonymous_output_6 | _anonymous_output_6 | n/a | |
| _anonymous_output_7 | _anonymous_output_7 | n/a | |
| Proteinortho_extract_by_orthogroup | Proteinortho_extract_by_orthogroup | n/a | |
| fasta_header_cleaned | fasta_header_cleaned | n/a | |
| ClustalW on input dataset(s): clustal | ClustalW on input dataset(s): clustal | n/a | |
Version History
Version 1 (earliest) Created 6th Jun 2022 at 15:05 by Miguel Roncoroni
Initial commit
Frozen
Version-1
a3e26fb
Creators and Submitter
Creators: Not specified
Additional credit: Miguel Roncoroni
Submitter
Created: 6th Jun 2022 at 15:05
https://orcid.org/0000-0001-7461-1427