BuddySuite: Command-Line Toolkits for Manipulating Sequences, Alignments, and Phylogenetic Trees

Mol Biol Evol. 2017 Jun 1;34(6):1543-1546. doi: 10.1093/molbev/msx089.

Abstract

The ability to manipulate sequence, alignment, and phylogenetic tree files has become an increasingly important skill in the life sciences, whether to generate summary information or to prepare data for further downstream analysis. The command line can be an extremely powerful environment for interacting with these resources, but only if the user has the appropriate general-purpose tools on hand. BuddySuite is a collection of four independent yet interrelated command-line toolkits that facilitate each step in the workflow of sequence discovery, curation, alignment, and phylogenetic reconstruction. Most common sequence, alignment, and tree file formats are automatically detected and parsed, and over 100 tools have been implemented for manipulating these data. The project has been engineered to easily accommodate the addition of new tools, is written in the popular programming language Python, and is hosted on the Python Package Index and GitHub to maximize accessibility. Documentation for each BuddySuite tool, including usage examples, is available at http://tiny.cc/buddysuite_wiki. All software is open source and freely available through http://research.nhgri.nih.gov/software/BuddySuite.

Keywords: Python; alignment; command line; phylogenetic tree; sequence; software.

Publication types

  • Research Support, N.I.H., Intramural

MeSH terms

  • Computational Biology
  • Phylogeny
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Software