已收录 268921 条政策
 政策提纲
  • 暂无提纲
Cnidaria: fast, reference-free clustering of raw and assembled genome and transcriptome NGS data
[摘要] BackgroundIdentification of biological specimens is a requirement for a range of applications. Reference-free methods analyse unprocessed sequencing data without relying on prior knowledge, but generally do not scale to arbitrarily large genomes and arbitrarily large phylogenetic distances.ResultsWe present Cnidaria, a practical tool for clustering genomic and transcriptomic data with no limitation on genome size or phylogenetic distances. We successfully simultaneously clustered 169 genomic and transcriptomic datasets from 4 kingdoms, achieving 100 % identification accuracy at supra-species level and 78 % accuracy at the species level.ConclusionCNIDARIA allows for fast, resource-efficient comparison and identification of both raw and assembled genome and transcriptome data. This can help answer both fundamental (e.g. in phylogeny, ecological diversity analysis) and practical questions (e.g. sequencing quality control, primer design).
[发布日期] 2015-11-02 [发布机构] 
[效力级别]  [学科分类] 
[关键词] Clustering;k-mer;NGS;RNA-seq;Phylogeny;Species identification [时效性] 
   浏览次数:4      统一登录查看全文      激活码登录查看全文