Map metagenomic reads onto representative databases¶
Template of how to map metagenomic reads onto a representative database:
sparse predict --dbname </path/to/SPARSE/database> --mapDB <comma delimited MapDB's> --r1 <read_1> --r2 <read_2> --workspace <workspace_name>
Example (single end):
sparse predict --dbname refseq --mapDB representative,subpopulation,Virus --r1 read1.fq.gz --workspace read1
The outputs consist of two files, with detailed information in the [“output” section](output.md).
Extract reference specific reads¶
You first need to find out the indices of the interesting references in the [output files](output.md), and use the indexes to extract related reads.
sparse extract --dbname refseq --workspace read1 --ref_id <comma delimited indices>
For example, we extract all reads specific to reference id 16, which is a Vibrio cholerae genome.
sparse extract --dbname refseq --workspace read1 --ref_id 16