TY  - JOUR
T1  - The large majority of intergenic sites in bacteria are selectively constrained, even when known regulatory elements are excluded
JF  - bioRxiv
DO  - 10.1101/069708
SP  - 069708
AU  - Harry A. Thorpe
AU  - Sion Bayliss
AU  - Laurence D. Hurst
AU  - Edward J. Feil
Y1  - 2016/01/01
UR  - http://biorxiv.org/content/early/2016/08/29/069708.abstract
N2  - There are currently no broad estimates of the overall strength and direction of selection operating on intergenic variation in bacteria. Here we address this using large whole genome sequence datasets representing six diverse bacterial species: Escherichia coli, Staphylococcus aureus, Salmonella enterica, Streptococcus pneumoniae, Klebsiella pneumoniae, and Mycobacterium tuberculosis. Excluding M. tuberculosis, we find that a high proportion (62%-79%; mean 70%) of intergenic sites are selectively constrained, relative to synonymous sites. Non-coding RNAs tend to be under stronger selective constraint than promoters, which in turn are typically more constrained than rho-independent terminators. Even when these regulatory elements are excluded, the mean proportion of constrained intergenic sites only falls to 69%; thus our current understanding of the functionality of intergenic regions (IGRs) in bacteria is severely limited. Consistent with a role for positive as well as negative selection on intergenic sites, we present evidence for strong positive selection in Mycobacterium tuberculosis promoters, underlining the key role of regulatory changes as an adaptive mechanism in this highly monomorphic pathogen.
ER  - 