Subcommand information
download
- download taxonomy metadata
sylph-tax download --download-to /my/folder/sylph_taxonomy_files/
- Downloads taxonomic annotation files (~50 MB; see here) to
--download-to
. - The
--download-to
folder must exist. The location can be wherever. Its location is written to~/.config/sylph-tax/config.json
. - If you don't have access to
$HOME
, you can specify a custom location in theSYLPH_TAXONOMY_CONFIG
environment variable. E.g.export SYLPH_TAXONOMY_CONFIG=/write_access_folder/sylph-tax-config.json
.
taxprof
- taxonomic profiles from sylph's output
sylph-tax taxprof sylph_results/*.tsv -o prefix_or_folder/ -t {sylph-tax identifier}
sylph_results/*.tsv
: outputs from sylph. The databases used for sylph must be the same as the-t
option.-t/--taxonomy-metadata
: A list ofsylph-tax identifier
s specified in this table (e.g.GTDB_r220
orIMGVR_4.1
). Multiple taxonomy metadata files can be input. Custom taxonomy files are also possible.-o
: prepends this prefix to all of the output files. One file is output per sample insylph_output.tsv
-a/--annotate-virus-hosts
: annotates found viral genomes with host information metadata (only available forIMGVR_4.1
right now)- Output suffix is
.sylphmpa
.
Tip
In python/pandas, pd.read_csv('output.sylphmpa',sep='\t', comment='#')
works.
merge
- merge multiple taxonomic profiles
sylph-tax merge *.sylphmpa --column {ANI, relative_abundance, sequence_abundance} -o output_table.tsv
*.sylphmpa
files are outputs fromsylph-tax taxprof
.--column
can be ANI, relative abundance, or sequence abundance (see paper for difference between abundances)-o
output file in TSV format.
Output format for merge
(TSV)
clade_name sample1.fastq.gz sample2.fastq.gz
d__Archaea 0.0 1.1
d__Archaea|p__Methanobacteriota 0.0 0.0965
...