16s analysis

Denoising pipeline using DADA2 algorithm to process raw .gz sequencing files from paired-end MiSeq Illumina sequencing.

The pipeline currently takes care of trimming standard Illumina adapters from reads, filter and merge the reads (denoising), and determine the taxonomy of the different identified ASVs, as well as aligning them.

Moreover, the pipeline results in the creation of a Phyloseq object, containing the processed samples and their metadata, for further downstream analysis in R.

The intended contents of each directory is explained in separate README.md files.

How to run

After cloning the repository, put your raw data, database and metadata in:

data/raw_internal
data/raw_internal/db
data/meta

Metadata file should be in .tsv format, the names of the raw files should follow the convention "{your_sample}.R1.fastq.gz" to work.

Have snakemake installed and working on your machine.
Run the pipeline with:

snakemake --use-conda --cores all all

There are some parameters that you might want to use in the command line, one of them is:

--config preprocess="value"

Where value can be either "yes" or "no", the choice indicates if you want snakemake to use preprocessing steps that include:

Generating fastqc files
Summarizing fastqc files with MultiQC
Trimming sequences with Trim_galore

Moreover an option for performing optionally the phylogenetic tree was added, as

--config phylogeny="value"

Just note that trimming doesn't happen in the pipeline when using DADA2, so if no pre-processing takes place, the sequences will not be trimmed at all.

the --cores flag just specify the amount of cores to use, you can select what you think works best.

Name	Name	Last commit message	Last commit date
Latest commit Headonpillow format Mar 27, 2025 7206985 · Mar 27, 2025 History 70 Commits
code	code	Removed phylogenetic distances from analysis	Dec 12, 2023
data	data	Added db folder again.	May 30, 2023
doc	doc	removed the 'paper' folder from documentation	May 30, 2023
intermediate	intermediate	added some readme files and updated the gitignore to include them	May 10, 2023
logs	logs	README.md added.	May 10, 2023
results	results	added some readme files and updated the gitignore to include them	May 10, 2023
scratch	scratch	added some readme files and updated the gitignore to include them	May 10, 2023
.gitignore	.gitignore	added svg to .gitignore	Dec 12, 2023
16s_analysis.yml	16s_analysis.yml	Updated MAFFT version package	May 29, 2024
README.md	README.md	format	Mar 27, 2025
Snakefile	Snakefile	Added a config option for the production of a tree	May 30, 2024
config.yml	config.yml	initializing the repository	Dec 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

16s analysis

How to run

About

Releases 1

Packages

Languages

Headonpillow/16S_analysis

Folders and files

Latest commit

History

Repository files navigation

16s analysis

How to run

About

Topics

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages