SAMBA is a FAIR scalable workflow integrating, into a unique tool, state-of-the-art bioinformatics and statistical methods to conduct reproducible eDNA analyses using Nextflow. SAMBA starts processing by verifying integrity of raw reads and metadata. Then all bioinformatics processing is done using commonly used procedure (QIIME 2 and DADA2) but adds new steps relying on dbOTU3 and microDecon to build high quality ASV count tables. Extended statistical analyses are also performed. Finally, SAMBA produces a full dynamic HTML report including resources used, commands executed, intermediate results, statistical analyses and figures.
The SAMBA pipeline can run tasks across multiple compute infrastructures in a very portable manner. It comes with singularity containers making installation trivial and results highly reproducible.
Version History
Version 1 (earliest) Created 10th Sep 2021 at 14:40 by Cyril Noel
Added/updated 2 files
Open
master
e560923
Creators
Additional credit
Laure Quintric
Submitter
Views: 3093 Downloads: 308
Created: 10th Sep 2021 at 14:40
Last updated: 10th Sep 2021 at 14:43
None