dadaist2

Dadaist2-tree

A phylogenetic tree generation tool optimized for metabarcoding data (16S rRNA and ITS sequences), that takes representative-sequences.fasta and generates a tree.nwk file (Newick format).

usage: dadaist2-tree [-h] -i INPUT -o OUTPUT [-t THREADS] [--marker {16S,ITS}] [--fast]

Generate phylogenetic tree from metabarcoding (16S/ITS) sequences

options:
  -h, --help            show this help message and exit
  -i INPUT, --input INPUT
                        Input FASTA file containing ASVs/OTUs (default: None)
  -o OUTPUT, --output OUTPUT
                        Output tree file (Newick format) (default: None)
  -t THREADS, --threads THREADS
                        Number of threads to use (default: 1)
  --marker {16S,ITS}    Marker gene type (affects alignment parameters) (default: 16S)
  --fast                Use fast mode (Clustal Omega + FastTree) instead of MAFFT + IQ-TREE (default: False)

Features

Installation

Dependencies

The program requires the following external tools to be installed and available in your PATH:

Standard mode:

Fast mode:

Python Dependencies

pip install rich

Usage

Basic usage:

python dadaist2-tree.py -i input.fasta -o output.tree -t 4

All options:

python dadaist2-tree.py [-h] -i INPUT -o OUTPUT [-t THREADS] [--marker {16S,ITS}] [--fast]

Arguments

Marker Type Selection (–marker)

The --marker option optimizes the alignment parameters based on the type of sequences being analyzed:

16S rRNA (–marker 16S)

For 16S rRNA sequences, the program uses:

This configuration is optimized for:

ITS (–marker ITS)

For ITS (Internal Transcribed Spacer) sequences, the program uses:

This configuration is optimized for:

Modes of Operation

Standard Mode (Default)

Uses MAFFT for alignment and IQ-TREE for phylogenetic inference:

# Standard mode example
python dadaist2-tree.py -i input.fasta -o output.tree -t 4 --marker 16S

Fast Mode

Uses Clustal Omega for alignment and FastTree for phylogenetic inference:

# Fast mode example
python dadaist2-tree.py -i input.fasta -o output.tree -t 4 --fast

Output

The program generates a phylogenetic tree in Newick format. In standard mode, the tree includes:

Error Handling

The program includes:

Contributing

Feel free to submit issues and enhancement requests!