Workflow for converting (genome) annotation tool output into a GBOL RDF file (TTL/HDT) using SAPP
Current formats / tools:
- EMBL format
- InterProScan (JSON/TSV)
- eggNOG-mapper (TSV)
- KoFamScan (TSV)
git: https://gitlab.com/m-unlock/cwl
SAPP (Semantic Annotation Platform with Provenance):
https://gitlab.com/sapp
https://academic.oup.com/bioinformatics/article/34/8/1401/4653704
Workflow for microbial (meta-)genome annotation
Input is a (meta)genome sequence in fasta format.
-
bakta
-
KoFamScan (optional)
-
InterProScan (optional)
-
eggNOG mapper (optional)
-
To RDF conversion with SAPP (optional, default on) --> SAPP conversion Workflow in WorkflowHub
The workflow requires the user to provide:
- ENSEMBL link address of the annotation GFF3 file
- ENSEMBL link address of the assembly FASTA file
- NCBI taxonomy ID
- BUSCO lineage
- OMArk database
Thw workflow will produce statistics of the annotation based on AGAT, BUSCO and OMArk.
Annotation of an assembled bacterial genomes to detect genes, potential plasmids, integrons and Insertion sequence (IS) elements.
Type: Galaxy
Creators: ABRomics , Pierre Marin, Clea Siguret, abromics-consortium
Submitter: WorkflowHub Bot
An open-source analysis pipeline to detect germline or somatic variants from whole genome or targeted sequencing
Type: Nextflow
Creators: No author provided, Hadrien Gourlé, Daniel Straub, Sabrina Krakau
Submitter: WorkflowHub Bot
Parabricks-Genomics-nf is a GPU-enabled pipeline for alignment and germline short variant calling for short read sequencing data. The pipeline utilises NVIDIA's Clara Parabricks toolkit to dramatically speed up the execution of best practice bioinformatics tools. Currently, this pipeline is configured specifically for NCI's Gadi HPC.
NVIDIA's Clara Parabricks can deliver a significant ...
ANNOTATO - Annotation workflow To Annotate Them Oll
ERGA Protein-coding gene annotation workflow.
Adapted from the work of Sagane Joye:
https://github.com/sdind/genome_annotation_workflow
Prerequisites
The following programs are required to run the workflow and the listed version were tested. It should be noted that older versions of snakemake are not compatible with newer versions of singularity as is noted here: https://github.com/nextflow-io/nextflow/issues/1659.
conda v 23.7.3
...
PAIRED-END workflow. Align reads on fasta reference/assembly using bwa mem, get a consensus, variants, mutation explanations.
IMPORTANT:
- For "bcftools call" consensus step, the --ploidy file is in "Données partagées" (Shared Data) and must be imported in your history to use the worflow by providing this file (tells bcftools to consider haploid variant calling).
- SELECT THE MOST ADAPTED VADR MODEL for annotation (see vadr parameters).
SINGLE-END workflow. Align reads on fasta reference/assembly using bwa mem, get a consensus, variants, mutation explanations.
IMPORTANT:
- For "bcftools call" consensus step, the --ploidy file is in "Données partagées" (Shared Data) and must be imported in your history to use the worflow by providing this file (tells bcftools to consider haploid variant calling).
- SELECT the mot ADAPTED VADR MODEL for annotation (see vadr parameters).
This repository hosts Metabolome Annotation Workflow (MAW). The workflow takes MS2 .mzML format data files as an input in R. It performs spectral database dereplication using R Package Spectra and compound database dereplication using SIRIUS OR MetFrag . Final candidate selection is done in Python using RDKit and PubChemPy.
Type: Common Workflow Language
Creators: Mahnoor Zulfiqar, Michael R. Crusoe, Luiz Gadelha, Christoph Steinbeck, Maria Sorokina, Kristian Peters
Submitter: Mahnoor Zulfiqar
GermlineStructuralV-nf is a pipeline for identifying structural variant events in human Illumina short read whole genome sequence data. GermlineStructuralV-nf identifies structural variant and copy number events from BAM files using Manta, Smoove, and TIDDIT. Variants are then merged using SURVIVOR, ...
Type: Nextflow
Creators: Georgina Samaha, Marina Kennerson, Tracy Chew, Sarah Beecroft
Submitter: Georgina Samaha
MGnify (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the assembly, analysis and archiving of microbiome data derived from sequencing microbial populations that are present in particular environments. Over the past 2 years, MGnify (formerly EBI Metagenomics) has more than doubled the number of publicly available analysed datasets held within the resource. Recently, an updated approach to data analysis has been unveiled (version 5.0), replacing the previous single pipeline ...
Type: Common Workflow Language
Creator: Alex L Mitchell, Alexandre Almeida, Martin Beracochea, Miguel Boland, Josephine Burgin, Guy Cochrane, Michael R Crusoe, Varsha Kale, Simon C Potter, Lorna J Richardson, Ekaterina Sakharova, Maxim Scheremetjew, Anton Korobeynikov, Alex Shlemov, Olga Kunyavskaya, Alla Lapidus, Robert D Finn
Submitter: Martin Beracochea
This workflow begins from a set of genome assemblies of different samples, strains, species. The genome is first annotated with Funnanotate. Predicted proteins are furtner annotated with Busco. Next, 'ProteinOrtho' finds orthologs across the samples and makes orthogroups. Orthogroups where all samples are represented are extracted. Orthologs in each orthogroup are aligned with ClustalW. Test dataset: https://zenodo.org/record/6610704#.Ypn3FzlBw5k
ORSON combine state-of-the-art tools for annotation processes within a Nextflow pipeline: sequence similarity search (PLAST, BLAST or Diamond), functional annotation retrieval (BeeDeeM) and functional prediction (InterProScan). When required, BUSCO completness evaluation and eggNOG Orthogroup annotation can be activated. While ORSON results can be analyzed through the command-line, it also offers the possibility to be compatible with BlastViewer or Blast2GO graphical tools.
Type: Nextflow
Creators: Cyril Noel, Alexandre Cormier, Patrick Durand, Laura Leroi, Pierre Cuzin
Submitter: Patrick Durand
Variant Interpretation Pipeline (VIP) that annotates, filters and reports prioritized causal variants in humans, see https://github.com/molgenis/vip for more information.
Alignment, assembly and annotation of generated transcripts from RNASEQ reads.
Alignment, assembly and annotation of RNASEQ reads as well as annotation of generated transcripts.
Alignment, assembly RNASEQ reads and annotation of generated transcripts.
TronFlow is an open source collection of computational workflows originally conceived for tumor-normal somatic variant calling over whole exome data and the manipulation of BAM and VCF files with the aim of having comparable and analysis-ready data. Over time, we have extended it to germline variant calling, copy numbers and other related technologies and analyses.
Its modular architecture covers different analytical and methodological use cases that allow analysing FASTQ files into analysis-ready ...
Maintainers: Pablo Riesgo Ferreiro
Number of items: 2
Tags: Nextflow, variant calling, VCF, Mutect2, HaplotyeCaller, Strelka2, Alignment, Annotation