A phylogenetic tree generation tool optimized for metabarcoding data (16S rRNA and ITS sequences), that takes representative-sequences.fasta and generates a tree.nwk file (Newick format).
usage: dadaist2-tree [-h] -i INPUT -o OUTPUT [-t THREADS] [--marker {16S,ITS}] [--fast]
Generate phylogenetic tree from metabarcoding (16S/ITS) sequences
options:
-h, --help show this help message and exit
-i INPUT, --input INPUT
Input FASTA file containing ASVs/OTUs (default: None)
-o OUTPUT, --output OUTPUT
Output tree file (Newick format) (default: None)
-t THREADS, --threads THREADS
Number of threads to use (default: 1)
--marker {16S,ITS} Marker gene type (affects alignment parameters) (default: 16S)
--fast Use fast mode (Clustal Omega + FastTree) instead of MAFFT + IQ-TREE (default: False)
The program requires the following external tools to be installed and available in your PATH:
Standard mode:
Fast mode:
pip install rich
Basic usage:
python dadaist2-tree.py -i input.fasta -o output.tree -t 4
All options:
python dadaist2-tree.py [-h] -i INPUT -o OUTPUT [-t THREADS] [--marker {16S,ITS}] [--fast]
-i, --input
: Input FASTA file containing ASVs/OTUs (required)-o, --output
: Output tree file in Newick format (required)-t, --threads
: Number of threads to use (default: 1)--marker
: Marker gene type, either ‘16S’ or ‘ITS’ (default: 16S)--fast
: Use fast mode (Clustal Omega + FastTree) instead of MAFFT + IQ-TREEThe --marker
option optimizes the alignment parameters based on the type of sequences being analyzed:
For 16S rRNA sequences, the program uses:
This configuration is optimized for:
For ITS (Internal Transcribed Spacer) sequences, the program uses:
This configuration is optimized for:
Uses MAFFT for alignment and IQ-TREE for phylogenetic inference:
# Standard mode example
python dadaist2-tree.py -i input.fasta -o output.tree -t 4 --marker 16S
Uses Clustal Omega for alignment and FastTree for phylogenetic inference:
# Fast mode example
python dadaist2-tree.py -i input.fasta -o output.tree -t 4 --fast
The program generates a phylogenetic tree in Newick format. In standard mode, the tree includes:
The program includes:
Feel free to submit issues and enhancement requests!