Workflows
A workflow showing how material from DaSch can be processed in Galaxy. The example applies optical character recognition to a German newspaper from DaSch; the resulting text is made machine-readable, cleaned, stripped of punctuation and visualised in a word cloud.
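The cleaning and counting steps that feed a word cloud can be sketched in plain Python. This is only an illustration of the idea, not the Galaxy tools the workflow actually uses; the function names `clean_ocr_text` and `word_frequencies` and the sample sentence are hypothetical.

```python
import string
from collections import Counter

# ASCII punctuation plus a few quotation marks common in German print.
# The extra characters are an assumption; extend the set for your own material.
PUNCTUATION = string.punctuation + "„“”‚‘’»«"


def clean_ocr_text(text: str) -> list[str]:
    """Lowercase the text, replace punctuation with spaces, and split into words."""
    table = str.maketrans({ch: " " for ch in PUNCTUATION})
    return text.lower().translate(table).split()


def word_frequencies(text: str, top_n: int = 10) -> list[tuple[str, int]]:
    """Return the top_n most frequent words, the input a word cloud is drawn from."""
    return Counter(clean_ocr_text(text)).most_common(top_n)


if __name__ == "__main__":
    sample = "Die Zeitung, die Zeitung! Die „Neue Zeitung“ erscheint täglich."
    print(word_frequencies(sample, top_n=2))
```

A word cloud renderer then scales each word by its count. Stop-word removal (for example dropping "die") would normally be added before counting.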
Workflow for quality control, trimming and filtering of short paired-end reads. Multiple paired datasets are merged into a single paired dataset. Summary:
- Sequali QC on raw data files
- fastp for read quality trimming
- BBduk for phiX and rRNA filtering (optional)
- Filter human reads using Hostile (optional)
- Custom read filtering using Hostile (optional)
- Sequali QC on filtered (merged) data
Other UNLOCK workflows on WorkflowHub: https://workflowhub.eu/projects/16/workflows?view=default ...
Type: Common Workflow Language
Creators: Bart Nijsse, Jasper Koehorst, Changlin Ke
Submitter: Bart Nijsse
Runs MetaPhlAn 4 and HUMAnN 3
- Optional short read quality control workflow (https://workflowhub.eu/workflows/336)
- Includes renormalizing and all regroupings to other functional categories (EC, KO, etc.)
Required inputs are paired-end reads and databases.
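The arithmetic behind renormalizing and regrouping can be sketched as follows. In the workflow itself these steps are carried out by the HUMAnN tooling; the functions `renormalize` and `regroup`, the feature names, and the mapping below are hypothetical and only show the underlying calculation.

```python
from collections import defaultdict


def renormalize(abundances: dict[str, float], scale: float = 1_000_000.0) -> dict[str, float]:
    """Rescale abundances so they sum to `scale` (1e6 gives copies per million)."""
    total = sum(abundances.values())
    if total == 0:
        return {feature: 0.0 for feature in abundances}
    return {feature: value / total * scale for feature, value in abundances.items()}


def regroup(abundances: dict[str, float], mapping: dict[str, list[str]]) -> dict[str, float]:
    """Sum feature abundances into functional categories (for example EC or KO).

    A feature mapped to several categories contributes to each of them;
    unmapped features are collected under "UNGROUPED".
    """
    grouped: defaultdict[str, float] = defaultdict(float)
    for feature, value in abundances.items():
        for group in mapping.get(feature, ["UNGROUPED"]):
            grouped[group] += value
    return dict(grouped)


if __name__ == "__main__":
    # Hypothetical gene family abundances and a hypothetical gene-to-KO mapping.
    families = {"geneA": 30.0, "geneB": 10.0}
    print(renormalize(families))
    print(regroup(families, {"geneA": ["K00001"]}))
```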
Workflow for the Galaxy Training Network tutorial "Hybrid genome assembly - Nanopore and Illumina"
Associated Tutorial
This workflow is part of the tutorial "Hybrid genome assembly - Nanopore and Illumina", available in the GTN.
Features
- Includes Galaxy Workflow Tests ...
Evaluation of PacBio HiFi reads and genome profiling. Creates the Meryl database used for estimating assembly parameters and for quality control with Merqury. Part of the VGP pipeline.
Summary
This notebook shows how to integrate genomic and image data resources. It addresses the question: which diabetes-related genes are expressed in the pancreas?
Steps:
- Query humanmine.org, an integrated database of Homo sapiens genomic data, using the InterMine API to find the genes.
- Using the list of found genes, search in the Image Data Resource (IDR) for images linked to the genes, tissue and disease.
We use the InterMine API and the IDR API.
The notebook can be opened ...
3D genome builder (3DGB)
3D genome builder (3DGB) is a workflow that builds 3D models of genomes from raw Hi-C data and integrates omics data onto the resulting models for further visual exploration. 3DGB bundles HiC-Pro, PASTIS and custom Python scripts into a unified Snakemake workflow with limited inputs (see Preparing Required Files). 3DGB produces ...
This workflow encodes the top-ranking predicted pathways from the previous workflow into plasmids intended to be expressed in the specified organism. BASIC is used as assembly method.
ENA Reads & Assembly Submission Workflow
Originally developed within the EVORA project, this two-step Galaxy workflow streamlines submissions to the European Nucleotide Archive (ENA). The workflow first submits raw sequencing reads via the Galaxy ENA upload tool, then submits assembled sequences using the Galaxy ENA Webin CLI tool. The process is fully interactive and GUI-driven while retaining ENA’s required validations ...
This workflow encodes the top-ranking predicted pathways from the previous workflow into plasmids intended to be expressed in the specified organism. Assembly methods are Gibson, Golden or Ligation Chain Reaction.