Abstract
There is a molecular basis for many sleep patterns and disorders involving circadian clock genes. In humans, “short-sleeper” behavior has been linked to specific amino acid substitutions in BHLHE41 (DEC2), yet little is known about variation at these sites and across this gene in mammals. We compare BHLHE41 coding sequences for 27 mammals. The coding sequence alignment length was 1794bp, of which 55.0% of base pairs were invariant among the sampled mammals. The mean pairwise nucleotide identity was 92.2%. Of the 598 residue amino acid alignment for mammals, 71.7% of amino acids were identical. The pairwise percent identity for amino acids was 94.8%. No other mammals had the same “short-sleeper” amino acid substitutions previously described from humans. Phylogenetic analyses based on the nucleotides of the coding sequence alignment are consistent with established mammalian relationships. Significant purifying selection was detected in 66.2% of variable codons. No codons exhibited significant signs of positive selection. Unexpectedly, the gorilla BHLHE41 sequence has a 318 bp insertion at the 5’ end of the coding sequence and a deletion of 195 bp near the 3’ end of the coding sequence (including the two short sleeper variable sites). Given the strong signal of purifying selection across this gene, phylogenetic congruence with expected relationships and generally conserved function among mammals investigated thus far, we suggest the unexpected indels predicted in the gorilla BHLHE41 may represent an annotation error and warrant experimental validation.