TY - JOUR T1 - The evolutionary fates of a large segmental duplication in mouse JF - bioRxiv DO - 10.1101/043687 SP - 043687 AU - Andrew P Morgan AU - J Matthew Holt AU - Rachel C McMullan AU - Timothy A Bell AU - Amelia M-F Clayshulte AU - John P Didion AU - Liran Yadgary AU - David Thybert AU - Duncan T Odom AU - Paul Flicek AU - Leonard McMillan AU - Fernando Pardo-Manuel de Villena Y1 - 2016/01/01 UR - http://biorxiv.org/content/early/2016/04/29/043687.abstract N2 - Gene duplication and loss are major sources of genetic polymorphism in populations, and are important forces shaping the evolution of genome content and organization. We have reconstructed the origin and history of a 127 kbp segmental duplication, R2d, in the house mouse (Mus musculus). R2d contains a single protein-coding gene, Cwc22. De novo assembly of both the ancestral (R2d1) and the derived (R2d2) copies reveals that they have been subject to non-allelic gene conversion events spanning tens of kilobases. R2d2 is also a hotspot for structural variation: its diploid copy number ranges from zero in the mouse reference genome to more than 80 in wild mice sampled from around the globe. Hemizgyosity for high-copy-number alleles of R2d2 is associated in cis with meiotic drive, suppression of meiotic crossovers, and copy-number instability, with a mutation rate in excess of 1 per 100 transmissions in laboratory populations. We identify an additional 57 loci covering 0.8% of the mouse genome with patterns of sequence variation similar to those at R2d1 and R2d2. Our results provide a striking example of allelic diversity generated by duplication and demonstrate the value of de novo assembly in a phylogenetic context for understanding the mutational processes affecting duplicate genes. ER -