14 items tagged with 'Genome assembly'.

This is an inclusive collection of workflows related to biodiversity and ecology (especially non-microbial). A big portion covers genome assembly of newly-sequenced species, using long reads (ONT or PacBio HiFi), possibly complemented by chromosome capture (typically HiC) for scaffolding, or/and by short reads (typically Illumina). It also aims at collating workflows related to ecology, biodiversity, biogeography, natural history, and related scientific areas, across the whole WorkflowHub and ...

Maintainers: Matúš Kalaš, Keiler Collier

Number of items: 28

Tags: Biodiversity, Ecology, Genome assembly, dna barcoding, natural history collections

Created: 30th Jun 2025 at 11:16, Last updated: 10th Jul 2025 at 12:38

ERGA Assembly Galaxy Long Reads & Hi-C Pipelines (Hifiasm-solo + Purge_Dups + YaHS)

Collection of de-novo genome assembly workflows written for implementation in Galaxy

Input data should be PacBio HiFi or ONT reads and Illumina 3-dimensional Chromatin Confirmation Capture (Hi-C) reads

Executing the workflows collection will output a scaffolded primary assembly and alternate contigs, with the complete QC analyses

Please run the workflows in order: WF0', WF1, WF2, WF3, WF4

'Notice there is one for HiFi, one for ONT, and one for Illumina (WGS or Hi-C). Run according to your data. ...

Maintainers: Diego De Panis

Number of items: 7

Tags: Genome assembly, Biodiversity

Created: 24th Sep 2024 at 22:32, Last updated: 1st Jun 2025 at 13:17

Genome Evaluation for ERGA-BGE Reports

Collection of Galaxy workflows for generating results used for creating ERGA-BGE Reports

For a given genome, two workflows should be run: the assembly evaluation (ASM analyses), and the annotation evaluation (ANNOT analyses)

Depending on the kind of data used for the genome assembly, you should choose HiFi or ONT (Illumina) workflows for ASM analyses

Maintainers: Diego De Panis

Number of items: 3

Tags: Genomics, QC, Genome assembly

Created: 20th Aug 2024 at 14:44, Last updated: 26th Aug 2024 at 13:03

ERGA Assembly Snakemake HiFi & HiC Pipelines

Collection of workflows designed to assembled a set of PacBio HiFi and Illumina HiC reads into a chromosome-scale de-novo assembly.

Development versions of these pipelines can be found in the ERGA github and any questions or queries can be raised on the ERGA Discussions Channel

Want to find out more about the work done by ERGA? Become a member ...

Maintainers: Tom Brown, Diego De Panis, ERGA

Number of items: 3

Tags: Genome assembly

Created: 16th Mar 2024 at 09:08, Last updated: 16th Mar 2024 at 09:10

ERGA Assembly Galaxy ONT+Illumina & HiC Pipelines (Flye-HyPo + Purge_Dups + YaHS)

Collection of de-novo genome assembly workflows written for implementation in Galaxy

Input data should be Oxford Nanopore raw reads plus Illumina WGS reads and Illumina 3-dimensional Chromatin Confirmation Capture (HiC) reads

Executing all workflows will output one scaffolded collapsed assembly and the complete QC analyses

Please run the workflows in order: WF0 (there are two, one for ONT, and another one for Illumina that can be used independently for the WGS and HiC reads), WF1, WF2, WF3, WF4

Maintainers: Diego De Panis

Number of items: 6

Tags: Assembly, Bioinformatics, Galaxy, Genomics, Genome assembly, ONT, illumina, Hi-C

Created: 8th Jan 2024 at 09:54, Last updated: 11th Mar 2024 at 12:42

ERGA Assembly Galaxy ONT+Illumina & HiC Pipelines (NextDenovo-HyPo + Purge_Dups + YaHS)

Collection of de-novo genome assembly workflows written for implementation in Galaxy

Input data should be Oxford Nanopore raw reads plus Illumina WGS reads and Illumina 3-dimensional Chromatin Confirmation Capture (HiC) reads

Executing all workflows will output one scaffolded collapsed assembly and the complete QC analyses

Please run the workflows in order: WF0 (there are two, one for ONT, and another one for Illumina that can be used independently for the WGS and HiC reads), WF1, WF2, WF3, WF4

Maintainers: Diego De Panis

Number of items: 6

Tags: Assembly, Bioinformatics, Galaxy, Genomics, Genome assembly, ONT, illumina, Hi-C

Created: 8th Jan 2024 at 09:51, Last updated: 11th Mar 2024 at 14:45

ERGA Assembly Galaxy HiFi & HiC Pipelines (Hifiasm-HiC + Purge_Dups + YaHS)

Collection of de-novo genome assembly workflows written for implementation in Galaxy

Input data should be PacBio HiFi reads and Illumina 3-dimensional Chromatin Confirmation Capture (HiC) reads

Executing all workflows will output two scaffolded haplotype assemblies and the complete QC analyses

Please run the workflows in order: WF0 (there are two, one for HiFi and one for Illumina HiC), WF1, WF2, WF3, WF4

Maintainers: Tom Brown, Diego De Panis

Number of items: 6

Tags: Assembly, Bioinformatics, Galaxy, Genomics, Genome assembly, HiFi, Hi-C

Created: 16th Jun 2023 at 15:07, Last updated: 20th Nov 2023 at 16:20

HiFi genome assembly on Galaxy

No description specified

Maintainers: Johan Gustafsson

Number of items: 4

Tags: Genome assembly, HiFi, Galaxy

Created: 8th Sep 2022 at 01:22, Last updated: 14th Nov 2022 at 04:56

GALOP - Genome Assembly using Long reads Pipeline

Bioinformatics Laboratory for Genomics and Biodiversity (LBGB), ERGA Assembly

(Show All)

Work-in-progress

GALOP - Genome Assembly using Long reads Pipeline

This repository contains an exact copy of the standard Genoscope long reads assembly pipeline.

At the moment, this is not intended for users to download as it uses grid submission commands that will only work at Genoscope. As time goes on, we intend to make this pipeline available to a broader audience. However, genome assembly and polishing commands are accessible in the lib/assembly.py and lib/polishing.py files.

galop.py -h 
Mandatory
...

Type: Python

Creators: Benjamin Istace, Jean-Marc Aury, Caroline Belser

Submitter: Benjamin Istace

DOI: 10.48546/workflowhub.workflow.1200.2

Created: 12th Nov 2024 at 07:37, Last updated: 14th Nov 2024 at 06:55

skim2mito

NHM Clark group

Stable

skim2mito

skim2mito is a snakemake pipeline for the batch assembly, annotation, and phylogenetic analysis of mitochondrial genomes from low coverage genome skims. The pipeline was designed to work with sequence data from museum collections. However, it should also work with genome skims from recently collected samples.

Setup
Example data
Input
Output
Filtering contaminants
[Assembly and ...

Type: Snakemake

Creators: None

Submitter: Oliver White

Created: 12th Mar 2024 at 15:03, Last updated: 7th Oct 2024 at 13:24

ERGA-BGE Genome Report ASM analyses (one-asm HiFi + HiC)

ERGA Assembly

Stable

Assembly Evaluation for ERGA-BGE Reports

One Assembly, HiFi WGS reads + HiC reads

The workflow requires the following:

Species Taxonomy ID number
NCBI Genome assembly accession code
BUSCO Lineage
WGS accurate reads accession code
NCBI HiC reads accession code

The workflow will get the data and process it to generate genome profiling (genomescope, smudgeplot -optional-), assembly stats (gfastats), merqury stats (QV, completeness), BUSCO, snailplot, contamination blobplot, and HiC ...

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.1104.1

Created: 20th Aug 2024 at 14:19, Last updated: 5th Dec 2024 at 16:47

ERGA-BGE Genome Report ASM analyses (one-asm WGS Illumina PE + HiC)

ERGA Assembly

Stable

Assembly Evaluation for ERGA-BGE Reports

One Assembly, Illumina WGS reads + HiC reads

The workflow requires the following:

Species Taxonomy ID number
NCBI Genome assembly accession code
BUSCO Lineage
WGS accurate reads accession code
NCBI HiC reads accession code

Type: Galaxy

Creators: Diego De Panis, ERGA

Submitter: Diego De Panis

DOI: 10.48546/workflowhub.workflow.1103.2

Created: 19th Aug 2024 at 10:38, Last updated: 5th Dec 2024 at 16:48

HiC contact map generation

ERGA Assembly, Biodiversity Genomics Europe (general)

Stable

HiC contact map generation

Snakemake pipeline for the generation of .pretext and .mcool files for visualisation of HiC contact maps with the softwares PretextView and HiGlass, respectively.

Prerequisites

This pipeine has been tested using Snakemake v7.32.4 and requires conda for installation of required tools. To run the pipline use the command:

snakemake --use-conda

There are provided a set of configuration and running scripts for exectution on a slurm queueing system. After configuring ...

Type: Snakemake

Creator: Tom Brown

Submitter: Tom Brown

DOI: 10.48546/workflowhub.workflow.795.2

Created: 14th Mar 2024 at 09:50, Last updated: 14th Mar 2024 at 09:52

Purge retained haplotypes using Purge-Dups

ERGA Assembly, Biodiversity Genomics Europe (general)

Purge dups

This snakemake pipeline is designed to be run using as input a contig-level genome and pacbio reads. This pipeline has been tested with snakemake v7.32.4. Raw long-read sequencing files and the input contig genome assembly must be given in the config.yaml file. To execute the workflow run:

snakemake --use-conda --cores N

Or configure the cluster.json and run using the ./run_cluster command

Type: Snakemake

Creator: Tom Brown

Submitter: Tom Brown

DOI: 10.48546/workflowhub.workflow.506.2

Created: 16th Jun 2023 at 14:56, Last updated: 16th Mar 2024 at 07:49

GALOP - Genome Assembly using Long reads Pipeline

skim2mito

Contents

HiC contact map generation

Prerequisites

Purge dups