Workflows
What is a Workflow?Filters
Build a consensus sequence from FILTER PASS variants with intrasample allele-frequency above a configurable consensus threshold. Hard-mask regions with low coverage (but not consensus variants within them) and ambiguous sites.
This workflow takes a VCF dataset of variants produced by any of the *-variant-calling workflows in https://github.com/galaxyproject/iwc/tree/main/workflows/sars-cov-2-variant-calling and generates tabular lists of variants by Samples and by Variant, and an overview plot of variants and their allele-frequencies.
Overview
Developmental version of MSC: This github page contains developmental version of R package for Multi-scale clustering (MSC) to perform single-cell transcriptome clustering. The manuscript is currently under review.
Installation:
MEGENA needs to be installed, prior to MSC installation: library(devtools); install_github("songw01/MEGENA");
For installation for developmental github version: library(devtools); install_github("songlabcodes/MSC");
Vignettes [PBMC 8k ...
RepeatMasking Workflow
This workflow uses RepeatModeler and RepeatMasker for genome analysis.
-
RepeatModeler is a software package for identifying and modeling de novo families of transposable elements (TEs). At the heart of RepeatModeler are three de novo repeat search programs (RECON, RepeatScout and LtrHarvest/Ltr_retriever) which use complementary computational methods to identify repeat element boundaries and family relationships from sequence data.
-
RepeatMasker is a program that analyzes ...
This workflow allows you to annotate a genome with Helixer and evaluate the quality of the annotation using BUSCO and Genome Annotation statistics. GFFRead is also used to predict protein sequences derived from this annotation, and BUSCO and OMArk are used to assess proteome quality.
This workflow uses eggNOG mapper and InterProScan for functional annotation of protein sequences.
This workflow allows for genome annotation using Maker and evaluates the quality of the annotation.
This workflow runs the FEELnc tool to annotate long non-coding RNAs. Before annotating these long non-coding RNAs, StringTie will be used to assemble the RNA-seq alignments into potential trancriptions. The gffread tool provides a genome annotation file in GTF format.
This workflow shows Flexynesis usage for predicting survival markers from a multi-omics dataset
Associated Tutorial
This workflows is part of the tutorial Identifing Survival Markers of Brain tumor with Flexynesis, available in the GTN
Features
- Includes Galaxy Workflow Tests ...
This WF applies Flexynesis on BRCA data from Metabric for a classification task
Associated Tutorial
This workflows is part of the tutorial Modeling Breast Cancer Subtypes with Flexynesis, available in the GTN
Features
- Includes Galaxy Workflow Tests ...