RT Journal Article SR Electronic T1 Comparative analysis of variation and selection in the HCV genome JF bioRxiv FD Cold Spring Harbor Laboratory SP 078584 DO 10.1101/078584 A1 Juan Ángel Patiño-Galindo A1 Fernando González-Candelas YR 2016 UL http://biorxiv.org/content/early/2016/09/30/078584.abstract AB Genotype 1 of the hepatitis C virus (HCV) is the most prevalent of the variants of this virus. Its two main subtypes, HCV-1a and HCV-1b, are associated to differences in epidemic features and risk groups, despite sharing similar features in most biological properties. We have analyzed the impact of positive selection on the evolution of these variants using complete genome coding regions, and compared the levels of genetic variability and the distribution of positively selected sites. We have also compared the distributions of positively selected and conserved sites considering different factors such as RNA secondary structure, the presence of different epitopes (antibody, CD4 and CD8), and secondary protein structure. Less than 10% of the genome was found to be under positive selection, and purifying selection was the main evolutionary force in both subtypes. We found differences in the number of positively selected sites between subtypes in several genes (Core, HVR2 in E2, P7, helicase in NS3 and NS4a). Heterozygosity values in positively selected sites and the rate of non-synonymous substitutions were significantly higher in subtype HCV-1b. Logistic regression analyses revealed that similar selective forces act at the genome level in both subtypes: RNA secondary structure and CD4 T-cell epitopes are associated with conservation, while CD8 T-cell epitopes are associated with positive selection in both subtypes. These results indicate that similar selective constraints are acting along HCV-1a and HCV-1b genomes, despite some differences in the distribution of positively selected sites at independent genes.