A poor man’s BLASTX—high-throughput metagenomic protein database search using PAUDA

by cupton · June 10, 2013

See on Scoop.it – Virology and Bioinformatics from Virology.ca

In the context of metagenomics, we introduce a new approach to protein database search called PAUDA, which runs ∼10 000 times faster than BLASTX, while achieving about one-third of the assignment rate of reads to KEGG orthology groups, and producing gene and taxon abundance profiles that are highly correlated to those obtained with BLASTX. PAUDA requires <80 CPU hours to analyze a dataset of 246 million Illumina DNA reads from permafrost soil for which a previous BLASTX analysis (on a subset of 176 million reads) reportedly required 800 000 CPU hours, leading to the same clustering of samples by functional profiles.
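As a rough sanity check on the quoted speedup, the CPU-hour figures above can be normalized per read, since the two analyses covered different numbers of reads. The sketch below uses only the numbers from the abstract; the function name `per_read_speedup` is made up for illustration, and because the PAUDA figure is an upper bound (<80 CPU hours), the result should be read as an approximate lower bound on the speedup.

```python
def per_read_speedup(slow_hours, slow_reads, fast_hours, fast_reads):
    """Ratio of CPU hours per read between a slow and a fast method."""
    slow_per_read = slow_hours / slow_reads
    fast_per_read = fast_hours / fast_reads
    return slow_per_read / fast_per_read


# Figures quoted in the abstract (the PAUDA value is an upper bound).
BLASTX_CPU_HOURS = 800_000
BLASTX_READS = 176_000_000
PAUDA_CPU_HOURS = 80
PAUDA_READS = 246_000_000

speedup = per_read_speedup(
    BLASTX_CPU_HOURS, BLASTX_READS, PAUDA_CPU_HOURS, PAUDA_READS
)
print(f"Per-read speedup: at least ~{speedup:,.0f}x")
```

This comes out at roughly 14 000-fold per read, which agrees in order of magnitude with the ∼10 000-fold figure stated in the abstract.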

See on bioinformatics.oxfordjournals.org
