Workflows
What is a Workflow?Filters
Summary
This notebook shows how to integrate genomic and image data resources. This notebook looks at the question Which diabetes related genes are expressed in the pancreas?
Steps:
- Query humanmine.org, an integrated database of Homo sapiens genomic data using the intermine API to find the genes.
- Using the list of found genes, search in the Image Data Resource (IDR) for images linked to the genes, tissue and disease.
We use the intermine API and the IDR API
The notebook can be opened ...
3D genome builder (3DGB)
3D genome builder (3DGB) is a workflow to build 3D models of genomes from HiC raw data and to integrate omics data on the produced models for further visual exploration. 3DGB bundles HiC-Pro, PASTIS and custom Python scripts into a unified Snakemake workflow with limited inputs (see Preparing Required Files). 3DGB produces ...
ENA Reads & Assembly Submission Workflow
Originally developed within the EVORA project, this two-step Galaxy workflow streamlines submissions to the European Nucleotide Archive (ENA). The workflow first submits raw sequencing reads via the Galaxy ENA upload tool, then submits assembled sequences using the Galaxy ENA Webin CLI tool. The process is fully interactive and GUI-driven while retaining ENA’s required validations ...
📄 Generalizable machine learning models for rapid antimicrobial resistance prediction in unseen healthcare settings
This repository contains the code used for the experiments in the paper:
Generalizable machine learning models for rapid antimicrobial resistance prediction in unseen healthcare settings by Diane Duroux, Paul P. Meyer, Giovanni Visonà, and Niko Beerenwinkel.
⚙️ Install the dependencies
You can set up the project with either pip or uv.
Option A - pip:
Install the ...
Type: Nextflow
Creators: Alem Gusinac, Thomas Ederveen, Jos Boekhorst, Annemarie Boleij
Submitter: Alem Gusinac
The bioinformatic workflow presented here enables the analysis of RNA sequencing data obtained from human reproductive tissues in unexplained recurrent pregnancy loss (uRPL) research. This pipeline requires a sample sheet containing the sample information (example_input_data.csv) and gene expression matrices generated using the Salmon tool in the nf-core/rnaseq bioinformatics pipeline (example_count_data.csv). For more information on how to use the nf-core/rnaseq pipeline including the required ...
Workflow for Metagenomics binning from assembly.
Minimal inputs are: Identifier, assembly (fasta) and an associated sorted BAM file
Summary
- MetaBAT2 (binning)
- MaxBin2 (binning)
- SemiBin2 (binning)
- Binette (bin merging)
- EukRep (eukaryotic classification)
- CheckM2 (bin completeness and contamination)
- BUSCO (bin completeness)
- GTDB-Tk (bin taxonomic classification)
- CoverM (bin abundances)
Including: Bin annotation (workflow: https://workflowhub.eu/workflows/1170):
- Bakta
...
Type: Common Workflow Language
Creators: Jasper Koehorst, Bart Nijsse
Submitters: Jasper Koehorst, Bart Nijsse
MAGNETO is an automated snakemake workflow dedicated to MAG (Metagenome-Assembled Genomes) reconstruction from metagenomic data.
It includes a fully-automated coassembly step informed by optimal clustering of metagenomic distances, and implements complementary genome binning strategies, for improving MAG recovery.
Key Features
- Quality Control (QC): Automatically assesses the quality and the contamination of input reads, ensuring that low-quality data are filtered out to improve downstream ...
Type: Snakemake
Creators: Samuel Chaffron, Audrey Bihouee, Benjamin Churcheward, Maxime Millet, Guillaume Fertin, Hugo Lefeuvre
Submitter: Hugo Lefeuvre
This project is an analysis pipeline using Snakemake for RNAseq analysis in order to find differentially expressed genes. It has been widely tested on human RNA sequencing from an Illumina HiSeq but should work on most systems and many other species, provided the necessary resource files can be downloaded.
Description
This pipeline is set for paired-end data only from Illumina HiSeq output files.
The main steps of the pipeline are:
- optionnal cleaning data with ...
Soil Metagenome Pipeline
Soil Metagenome Pipeline is a modular, Nextflow DSL2 workflow for assembling, polishing, binning, annotating, and functionally characterizing complex soil metagenomes. It orchestrates state-of-the-art tools for long- and short-read metagenomics, generates high-quality MAGs, assigns taxonomy, and screens for biosynthetic gene clusters (BGCs).
What it does
- Assembles long-read metagenomes (e.g., ONT) with Flye and optionally polishes with Medaka and/or NextPolish using ...



 Download
Download








