TY - JOUR T1 - Cross-species genome-wide identification of evolutionary conserved microProteins JF - bioRxiv DO - 10.1101/061655 SP - 061655 AU - Daniel Straub AU - Stephan Wenkel Y1 - 2016/01/01 UR - http://biorxiv.org/content/early/2016/07/01/061655.abstract N2 - MicroProteins are small single domain proteins that act by engaging their targets into non-productive protein complexes. In order to identify novel microProteins in any sequenced genome of interest, we have developed miPFinder, a program that identifies and classifies potential microProteins. In the past years, several microProteins have been discovered in plants where they are mainly involved in the regulation of development. The miPFinder algorithm identifies all up to date known plant microProteins and extends the microProtein concept to other protein families. Here, we reveal potential microProtein candidates in several plant and animal reference genomes. A large number of these microProteins are species-specific while others evolved early and are evolutionary highly conserved. Most known microProtein genes originated from large ancestral genes by gene duplication, mutation and subsequent degradation. Gene ontology analysis shows that putative microProtein ancestors are often located in the nucleus, and involved in DNA binding and formation of protein complexes. Additionally, microProtein candidates act in plant transcriptional regulation, signal transduction and anatomical structure development. MiPFinder is freely available to find microProteins in any genome and will aid in the identification of novel microProteins in plants and animals ER -