Introduction

UNLABELLED: Pyrosequencing technologies are frequently used for sequencing the 16S ribosomal RNA marker gene for profiling microbial communities. Clustering of the produced reads is an important but time-consuming task. We present Dynamic Seed-based Clustering (DySC), a new tool based on the greedy clustering approach that uses a dynamic seeding strategy. Evaluations based on the normalized mutual information (NMI) criterion show that DySC produces higher quality clusters than UCLUST and CD-HIT at a comparable runtime. AVAILABILITY AND IMPLEMENTATION: DySC, implemented in C, is available at http://code.google.com/p/dysc/ under GNU GPL license.

Publications

  1. DySC: software for greedy clustering of 16S rRNA reads.
    Cite this
    Zheng Z, Kramer S, Schmidt B, 2012-08-01 - Bioinformatics (Oxford, England)

Credits

  1. Zejun Zheng
    Developer

  2. Stefan Kramer
    Developer

  3. Bertil Schmidt
    Investigator

Community Ratings

UsabilityEfficiencyReliabilityRated By
0 user
Sign in to rate
Summary
AccessionBT000497
Tool TypeApplication
Category
PlatformsLinux/Unix
TechnologiesC
User InterfaceTerminal Command Line
Download Count0
Submitted ByBertil Schmidt