Abstract
Evaluating the quality of metagenomic assemblies is important for constructing reliable metagenome-assembled genomes and downstream analyses. Here, we present metaMIC (https://github.com/ZhaoXM-Lab/metaMIC), a machine-learning based tool for identifying and correcting misassemblies in metagenomic assemblies. Benchmarking results on both simulated and real datasets demonstrate that metaMIC outperforms existing tools when identifying misassembled contigs. Furthermore, metaMIC is able to localize the misassembly breakpoints, and the correction of misassemblies by splitting at misassembly breakpoints can improve downstream scaffolding and binning results.
Competing Interest Statement
The authors have declared no competing interest.
Footnotes
Emails, Senying Lai: 19110850024{at}fudan.edu.cn, Shaojun Pan: 19110850020{at}fudan.edu.cn
Abbreviations
- MAG
- metagenome-assembled genomes
- bp
- base pair
- AUPRC
- area under the precision-recall curve
- KAD
- k-mer abundance difference