TY - JOUR T1 - A robust statistical framework for reconstructing genomes from metagenomic data JF - bioRxiv DO - 10.1101/011460 SP - 011460 AU - Dongwan D. Kang AU - Jeff Froula AU - Rob Egan AU - Zhong Wang Y1 - 2014/01/01 UR - http://biorxiv.org/content/early/2014/11/15/011460.abstract N2 - We present software that reconstructs genomes from shotgun metagenomic sequences using a reference-independent approach. This method permits the identification of OTUs in large complex communities where many species are unknown. Binning reduces the complexity of a metagenomic dataset enabling many downstream analyses previously unavailable. In this study we developed MetaBAT, a robust statistical framework that integrates probabilistic distances of genome abundance with sequence composition for automatic binning. Applying MetaBAT to a human gut microbiome dataset identified 173 highly specific genomes bins including many representing previously unidentified species. ER -