Experimental estimation of the effects of all amino-acid mutations to HIV Env

Hugh K. Haddox; Adam S. Dingens; Jesse D. Bloom

doi:10.1101/067470

Abstract

HIV is notorious for its capacity to evade immunity and anti-viral drugs through rapid sequence evolution. Knowledge of the functional effects of mutations to HIV is critical for understanding this evolution. HIV’s most rapidly evolving protein is its envelope (Env). Here we use deep mutational scanning to experimentally estimate the effects of all amino-acid mutations to Env on viral replication in cell culture. Most mutations are under purifying selection in our experiments, although a few sites experience strong selection for mutations that enhance HIV’s growth in cell culture. We compare our experimental measurements of each site’s preference for each amino acid to the actual frequencies of these amino acids in naturally occurring HIV sequences. Our measured amino-acid preferences correlate with amino-acid frequencies in natural sequences for most sites. However, our measured preferences are less concordant with natural amino-acid frequencies at surface-exposed sites that are subject to pressures absent from our experiments such as antibody selection. We show that some regions of Env have a high inherent tolerance to mutation, whereas other regions (such as epitopes of broadly neutralizing antibodies) have a significantly reduced capacity to tolerate mutations. Overall, our results help disentangle the role of inherent functional constraints and external selection pressures in shaping Env’s evolution.

Introduction

HIV evolves rapidly: the envelope (Env) proteins of two viral strains within a single infected host diverge as much in a year as the typical human and chimpanzee ortholog has diverged over ~5-million years [1–4]. This rapid evolution is essential to HIV’s biology. Most humans infected with HIV generate antibodies against Env that effectively neutralize viruses from early in the infection [5–7]. However, Env evolves so rapidly that HIV is able to stay ahead of this antibody response, with new viral variants escaping from antibodies that neutralized their predecessors just months before [5–7]. Env’s exceptional evolutionary capacity is therefore central to HIV’s lifestyle.

A protein’s evolutionary capacity depends on its ability to tolerate point mutations. Detailed knowledge of how mutations affect Env is therefore key to understanding its evolution. Many studies have estimated the effects of mutations to Env. One strategy is experimental: numerous studies have used site-directed mutagenesis or alanine scanning to measure how specific mutations affect various aspects of Env’s function [8–17]. However, these experiments have examined only a small fraction of the many possible mutations to Env. Another strategy is computational: under certain assumptions, the fitness effects of mutations can be estimated from their frequencies in global or intra-patient HIV sequences [18–22]. However, these computational strategies are of uncertain accuracy and cannot separate the contributions of inherent functional constraints from those of external selection pressures such as antibodies. Therefore, a more complete and direct delineation of how every mutation affects Env’s function would be of great value.

Recently it has become possible to make massively parallel experimental measurements of the effects of protein mutations using deep mutational scanning [23–25]. These experiments involve creating large libraries of mutants of a gene, subjecting them to bulk functional selections, and quantifying the effect of each mutation by using deep sequencing to assess its frequency pre- and post-selection. Over the last few years, deep mutational scanning has been used to estimate the effects of all single amino-acid mutations to a variety of proteins [26–38]. However, deep mutational scanning has not yet been comprehensively applied to any HIV protein, although it has been used to measure the effects of several thousand single-nucleotide mutations spread across the viral genome [39].

Here we use deep mutational scanning to experimentally estimate how all amino-acid mutations to the ectodomain and transmembrane domain of Env affect viral replication in cell culture. At most sites, our measurements correlate with the frequencies of amino acids in natural HIV sequences. However, there are large deviations at sites where natural evolution is strongly shaped by factors (e.g., antibodies) that are absent from our experiments. Our results show that site-to-site variation in Env’s inherent capacity to tolerate mutations helps explain why some portions of Env evolve so rapidly while other regions are much more conserved. Overall, our work helps elucidate how inherent functional constraints shape Env’s evolution, and demonstrates a powerful experimental approach for comprehensively mapping how mutations affect HIV phenotypes that can be selected for in the lab.

Results

Deep mutational scanning of Env

We used the deep mutational scanning approach in Fig 1A estimate the effects of all single amino-acid mutations to Env. We applied this approach to Env from the LAI strain of HIV [40]. LAI is a CXCR4-tropic subtype B virus isolated from a chronically infected individual and then passaged in human T-lymphocytes. We chose this strain because LAI and the closely related HXB2 strain have been widely used to study Env’s structure and function [8–11,41–43], providing extensive biochemical data with which to benchmark our results. LAI’s Env is 861 amino acids in length. We mutagenized amino acids 31-702 (throughout this paper, we use the HXB2 numbering scheme [44]). We excluded the N-terminal signal peptide and the C-terminal cytoplasmic tail, since mutations in these regions can alter Env expression in ways that affect viral infectivity in cell culture [45–47]. The region of Env that we mutagenized spanned 677 residues, meaning that there are 677 × 63 = 42, 651 possible codon mutations, corresponding to 677 × 19 = 12, 863 possible amino-acid mutations.

Fig 1. Deep mutational scanning workflow.

(A) We created libraries of HIV proviral plasmids with random codon mutations in env, and generated mutant viruses by transfecting these plasmid libraries into 293T cells. Since cells receive multiple plasmids, there may not be a link between viral genotype and phenotype at this stage. To establish this link and select for functional variants, we passaged the viruses twice at low multiplicity of infection (MOI) in SupT1 cells. We deep sequenced env before and after selection to quantify the enrichment or depletion of each mutation, and used these data to estimate the preference of each site for each amino acid. Each mutant library was paired with a control in which cells were transfected with a wildtype HIV proviral plasmid to generate initially wildtype viruses that were passaged in parallel with the mutant viruses. Deep sequencing of these wildtype controls enabled estimation of the rates of apparent mutations arising from deep sequencing and viral replication. (B) We performed the entire experiment in triplicate. Additionally, we passaged the replicate-3 transfection supernatant in duplicate (replicate 3b). We also performed the second passage of replicate 3b in duplicate (replicates 3b-1 and 3b-2).

To create plasmid libraries containing all these mutations, we used a previously described PCR mutagenesis technique [31] that creates multi-nucleotide (e.g, gca→CAT) as well as single-nucleotide (e.g, gca→gAa) codon mutations. We created three independent plasmid libraries, and carried each library through all subsequent steps independently, meaning that all our measurements were made in true biological triplicate (Fig 1B). We Sanger sequenced 26 clones to estimate the frequency of mutations in the plasmid mutant libraries (S1 Fig). There were an average of 1.4 codon mutations per clone, with the number of mutations per clone roughly following a Poisson distribution. The deep sequencing described in the next section found that at least 79% of the ≈ 10⁴ possible amino-acid mutations were observed at least three times in each of the triplicate libraries, and that 98% of mutations were observed at least three times across all three libraries combined. The plasmid libraries therefore sampled most amino-acid mutations to Env.

We produced virus libraries by transfecting each plasmid library into 293T cells. The viruses in the resulting transfection supernatant lack a genotype-phenotype link, since each cell is transfected by many plasmids. We therefore passaged the transfection supernatants twice in SupT1 cells at an MOI of 0.005 to create a genotype-phenotype link and select for functional variants. Importantly, neither 293T nor SupT1 cells express detectable levels of APOBEC3G [48,49], which can hypermutate HIV genomes [50,51]. This is a crucial point: although HIV encodes a protein that counteracts APOBEC3G, a fraction of viruses will lack a functional version of this protein and so have their genomes hypermutated in APOBEC3G-expressing cells. For each library, we passaged 5 × 10⁵ infectious particles in order to maintain library diversity. We used Illumina deep sequencing to quantify the frequency of each mutation before and after passaging, using molecular barcodes to increase sequencing accuracy as in [37]. We sequenced the plasmids to assess the initial mutation frequencies, and sequenced non-integrated viral DNA [52] from infected SupT1 cells to assess the mutation frequencies in the viruses. Each mutant library was paired with a control in which we generated wildtype virus from unmutated plasmid, enabling us to estimate the rates of sequencing and viral replication errors. Overall, these procedures allowed us to implement the deep mutational scanning workflow in Fig 1.

Most mutations are under purifying selection, but a few sites experience selection for cell-culture adaptation mutations

Our deep mutational scanning experiments require that selection purge the virus libraries of nonfunctional variants. As an initial gene-wide measure of selection, we analyzed how different types of codon mutations (nonsynonymous, synonymous, and stop-codon mutations) changed in frequency after selection. In these analyses, we corrected for background errors from PCR, sequencing, and viral replication by subtracting the mutation frequencies measured in our wildtype controls from those measured in the mutant libraries (S2 Fig).

Stop-codon mutations are expected to be uniformly deleterious. Indeed, after correcting for background errors, stop codons were purged to <1% of their initial frequency in the twice-passaged viruses for each replicate, indicating strong purifying selection (see the data for “all sites” in Fig 2A). The second viral passage is important for complete selection, as stop codons remain at about ≈16% of their initial frequency in viruses that were only been passaged once (S3 Fig).

Fig 2. Selection purged mutations in most of env, but favored mutations at a few sites.

(A) For each replicate, we deep sequenced the initial plasmids (DNA) and the viruses after two rounds of passaging (P2). Bars show the per-codon mutation frequency averaged across sites after subtracting error rates determined from the wildtype controls (S2 Fig). When mutation frequencies are averaged across all sites, selection purged stop codons to <1% of their frequency in the initial DNA. Selection only slightly reduced the average frequency of nonsynonymous mutations; however, this average results from two distinct trends. For ≈4% of sites, the frequency of nonsynonymous mutations in the twice-passaged viruses (f^P2) increased >3-fold relative to the frequency in the initial plasmid DNA (f^DNA). For all other sites, the frequency of nonsynonymous mutations decreased substantially after selection. (B) The sites at which the error-corrected mutation frequency increased >3-fold are similar between replicates, indicating consistent selection for tissue-culture adaptation at a few positions. The left Venn diagram shows the overlap among replicates in the sites with a >3-fold increase. The right Venn diagram shows the expected overlap if the same number of sites per replicate are randomly drawn from Env’s primary sequence. This difference is statistically significant, with P < 10⁻⁴ when comparing the actual overlap among all three replicates to the random expectation. Another summary view of selection on env is provided by S4 Fig and S5 Fig.

Interpreting the frequencies of nonsynonymous mutations is more nuanced, as different amino-acid mutations have different functional effects. However, a large fraction of amino-acid mutations are deleterious to any protein [53–55]. Therefore, one might expect that the frequency of nonsynonymous mutations would decrease substantially in the twice-passaged mutant viruses. But surprisingly, even after correcting for background errors, the average frequency of nonsynonymous mutations in the passaged viruses is ≈90% of its value in the mutant plasmids (see the data for “all sites” in Fig 2A). However, the average masks two disparate trends. In each library, a few sites exhibit large increases in the frequency of nonsynonymous mutations, whereas this frequency decreases by nearly two-fold for all other sites (see the data for the subgroups of sites in Fig 2A).

An obvious hypothesis is that at a few sites, amino-acid mutations are favored because they are adaptive for viral replication in cell culture. Consistent with this hypothesis, the sites that experienced large increases in mutation frequencies are similar among the three replicates (Fig 2B), suggestive of reproducible selection for mutations at these sites. Moreover, these sites are spatially clustered in Env’s crystal structure in regions where mutations are likely to enhance viral growth in cell culture (Fig 3 and S1 Table). One cluster of mutations disrupts potential glycosylation sites at the trimer apex (Fig 3A), and could be beneficial in cell culture since glycans that shield Env from antibodies in nature [6,56] are often expendable or even deleterious for viral growth in the lab [57–59]. A second cluster overlaps sites where mutations influence Env’s conformational dynamics, which are commonly altered by cell-culture passage [60,61]. A third cluster is at the co-receptor binding interface (Fig 3B), where mutations may enhance viral entry in cell culture. Therefore, while most of Env is under purifying selection against changes to the protein sequence, a few sites are under selection for cell-culture adapting amino-acid mutations.

Fig 3. Sites of recurrent cell-culture mutations mapped on Env’s structure.

The 25 sites from Fig 2B where the mutation frequency increased >3-fold in at least two replicates after cell-culture passage. (A) Trimeric Env (PDB 5FYK [94]) with one monomer in grey and the others in white, oriented so the membrane-proximal region is at the bottom. Sites of cell-culture mutations are shown as spheres, colored red-to-blue according to primary sequence. Most of these sites fall in one of three clusters. Mutations in the first cluster disrupt potential glycosylation sites at Env’s apex. The second cluster includes or is adjacent to sites where mutations are known to affect Env’s conformational dynamics [95,96]. (B) The third cluster is near the co-receptor binding surface. This panel shows an apex-down view of monomeric gp120 (grey) in complex with CD4 (green) (PDB 3JWD [42]). Sites of recurrent cell-culture mutations are shown as spheres colored according to primary sequence as in panel A. The black bar indicates cropping of CD4. (C) The same view as panel B, but the spheres now show sites known to affect binding to CXCR4 [10] or CCR5 [97]. Note the extensive overlap between the spheres in this panel and panel B.

The average error-corrected frequency of synonymous mutations changes little after selection (an average decrease to 96% of the original frequency; see the data for “all sites” in Fig 2A). This overall trend is consistent with the fact that synonymous mutations usually have smaller functional effects than nonsynonymous mutations. However, synonymous mutations can sometimes have substantial effects [21,62–64], particularly in viruses like HIV that are under strong selection for RNA secondary structure and codon usage [65,66]. To assess selection on synonymous mutations on a more site-specific level, we examined the change in frequency of multi-nucleotide codon mutations across env’s primary sequence (Fig 4). The rationale behind examining only multi-nucleotide codon mutations is that they are not appreciably confounded by errors from PCR, deep sequencing, or de novo mutations from viral replication (S2 Fig, S4 Fig). In a region roughly spanning codons 500 to 600, selection strongly purged both synonymous and nonsynonymous multi-nucleotide codon mutations (Fig 4). This region contains env’s Rev-response element (RRE) [67], a highly structured region of RNA that is bound by the Rev protein to control the temporal export of unspliced HIV transcripts from the nucleus [68, 69]. The finding of strong selection on the nucleotide as well as the amino-acid sequence of the RRE region of Env therefore agrees with our biological expectations.

Fig 4. Selection depleted multi-nucleotide codon mutations in the Rev-response element (RRE).

This plot shows a 51-codon sliding-window average of the fold change in per-codon multi-nucleotide mutation frequency after two rounds of viral passage, with data plotted for the center point in each window. The strongest depletion of both synonymous and nonsynonymous mutations occurred in the RRE, which is an RNA secondary structure important for viral replication.

The preference for each amino acid at each site in Env

The previous section examined broad trends in selection averaged across many sites. But our data also enable much more fine-grained estimates of the preference for every amino-acid at every position in Env. We define a site’s preference for an amino acid to be proportional to the enrichment or depletion of that amino acid after selection (correcting for the error rates determined using the wildtype controls), normalizing the preferences for each site so that they sum to one. We denote the preference of site r for amino acid a as π_r,a, and compute the preferences from the deep-sequencing data as described in [70]. Since we mutagenized 677 residues in Env, there are 677 × 20 = 13, 540 preferences. If selection in our experiments exactly parallels selection in nature and there are no shifts in mutational effects as Env evolves, then these preferences are the expected frequencies of each amino acid at each site in an alignment of Env sequences that have reached evolutionary equilibrium under a mutation process that introduces each amino acid with equal probability [31, 71].

Fig 5 shows Env’s site-specific amino-acid preferences after averaging across replicates and rescaling to account for the stringency of selection in our experiments (details of this re-scaling are in the next section). As is immediately obvious from Fig 5, sites vary dramatically in their tolerance for mutations. Some sites strongly prefer a single amino acid, while other sites can tolerate many amino acids. For instance, site 457, an important receptor-binding residue [8], has a strong preference for aspartic acid. However, this site is adjacent to a variable loop (sites 460-469) where most sites tolerate many amino acids. Another general observation is that when sites tolerate multiple amino acids, they often prefer ones with similar chemical properties. For instance, sites 225 and 226 prefer hydrophobic amino acids, while sites 162 to 164 prefer positively charged amino acids.

Fig 5. Env’s site-specific amino-acid preferences.

The amino-acid preferences averaged across replicates and re-scaled to account for differences in the stringency of selection between our experiments and natural selection. The height of each letter is proportional to the the preference for that amino acid at that site, and letters are colored according to hydrophobicity. The overlay bar indicates the gp120 variable loops, other regions of gp120, and gp41. Black dots indicate sites where mutations are known to disrupt CD4 binding (Table 1). Sites are numbered using the HXB2 scheme [44]. Numerical values of the preferences before and after re-scaling are in S1 File and S2 File, respectively.

To confirm that our experiments captured known constraints on Env’s function, we examined mutations that have been characterized to affect key functions of Env. Table 1 lists mutations known to disrupt an essential disulfide bond, binding to receptor or co-receptor, or protease cleavage. In almost all cases, the deleterious mutation introduces an amino-acid that our experiments report as having a markedly lower preference than the wildtype amino acid. Therefore, our measurements largely concord with existing knowledge about mutations that affect key aspects of Env’s function.

View this table:

Table 1.

Our experimental estimates are mostly concordant with existing knowledge about the effects of mutations to functionally or structurally important parts of Env.

A crucial aspect of any high-throughput experiment is assessing the reproducibility of independent replicates. Fig 5 shows the average of the preferences measured in each replicate. Fig 6A shows the correlations among the 13,540 site-specific amino-acid preferences estimated from each of the three replicates. The correlations are modest, indicating substantial replicate-to-replicate noise. In principle, this noise could arise from differences in the initial plasmid mutant libraries, bottlenecks during the generation of viruses by transfection, bottlenecks during viral passage, or bottlenecks during the sequencing of proviral DNA from infected cells. Analysis of technical replicates of the first or second round of viral passaging indicates that most of the noise arises from bottlenecks during the viral passaging or sequencing steps. Specifically, measurements from replicate 3 are no more correlated to those from replicates 3b-1 or 3b-2 (which are repeated passages of the same transfection supernatant, Fig 1B) than they are to those from totally independent replicates (compare Fig 6 and S6 Fig). However, replicates 3b-1 and 3b-2 (which shared the first of the two viral passages, Fig 1) do yield more correlated measurements than independent replicates (S6 Fig). The existence of bottlenecks during viral passage is also suggested by the data in S4 Fig and S5 Fig. Therefore, the experimental reproducibility could probably be increased by passaging more infectious viruses at each step.

Fig 6. The amino-acid preferences are modestly correlated among experimental replicates, but the preferred amino acids have highly similar hydrophobicities.

(A) Correlations between the site-specific amino-acid preferences from each replicate. (B) Correlations between the preference-weighted hydrophobicities. For each site r, the preference-weighted hydrophobicity is ∑_a π_r,a × X_a where π_r,a is the preference of r for amino acid a, and X_a is the Kyte-Doolittle hydropathy [98] of a. The fact that the hydrophobicities are more correlated than the amino-acid preferences means that when different amino acids are preferred at a site in different experimental replicates, the chemical properties of the preferred amino acids are similar. Each plot shows the Pearson correlation coefficient and associated P-value. Similar data for replicates 3b-1 and 3b-2 are in S6 Fig.

If bottlenecks cause each replicate to sample slightly different mutations, then perhaps in each replicate there will be selection for similar amino acids even if the exact mutations differ. To test this hypothesis, we quantified the extent that each site preferred hydrophobic or hydrophilic amino acids. We did this by computing a site-specific hydrophobicity score from the amino-acid preferences. Fig 6B shows that these preference-weighted hydrophobicities are more correlated between replicates than the preferences themselves. Therefore, even though there is replicate-to-replicate noise in the exact amino acids preferred at a site, the preferred amino acids have similar chemical properties among replicates.

The amino-acid preferences correlate with amino-acid frequencies in HIV sequence alignments at most sites, but deviate at positions subject to selection pressures absent from our experiments

In the previous section, we showed that our experimentally measured amino-acid preferences captured the constraints on Env’s biological function for sites with known mutational effects (Table 1). If this is true across the entire protein, then our measurements should correlate with the frequencies of amino acids in natural HIV sequences. Table 2 shows that there is a modest correlation (Pearson’s R ranging from 0.29 to 0.36) between the preferences from each experimental replicate and the frequencies in an alignment of HIV-1 group-M sequences (a phylogenetic tree of these sequences is in Fig 7A). Since each replicate suffers from noise due to partial bottlenecking of the viral diversity, we hypothesized that averaging the preferences across replicates should make them more accurate. Indeed, averaging the replicates increased the correlation to R = 0.4 (Table 2).

Fig 7. Correlations between amino-acid preferences and frequencies in natural HIV sequences.

(A) Phylogenetic tree of the HIV-1 group-M sequences in the alignment. (B) Correlation between alignment frequencies and preferences. The preferences are the replicate averages re-scaled by the stringency parameter in Table 2. (C) The correlation if evolution is simulated along the phylogenetic tree assuming that the preferences correctly describe the actual selection. (D) There is no correlation in a control simulation in which preferences are randomized among sites. (E), (F) Correlation between preferences and alignment frequencies as a function of relative solvent accessibility (RSA). Red lines show the actual correlation. Dark and light gray show the range of correlations in the middle 80% and 100% of 100 simulations. For both plots, data are shown until the subset of sites that meets the RSA cutoff becomes less than 10% of all sites in Env; this is why neither x-axis extends all the way from 0 to 1. Correlation coefficients are Pearson’s R.

View this table:

Table 2.

Correlation of amino-acid preferences with amino-acid frequencies in nature.

The concordance between deep mutational scanning measurements and natural sequence variation is improved by accounting for differences in the stringency of selection in the experiments compared to natural selection [71,72]. Specifically, if the measured preference is π_r,a and the stringency parameter is β, then the re-scaled preference is . A stringency parameter of β > 1 means that natural evolution favors the same amino acids as the experiments, but with greater stringency. Table 2 shows that for all replicates, the stringency parameter that maximizes the correlation is > 1. Therefore, natural selection prefers the same amino acids as our experiments, but with greater stringency.

After averaging across replicates and re-scaling by the optimal stringency parameter, the Pearson correlation is 0.44 between our experimentally measured preferences and amino-acid frequencies in the alignment of naturally occurring HIV sequences (Fig 7B). Is this a good correlation? At first glance, a correlation of 0.44 seems unimpressive. But we do not expect a perfect correlation even if the experiments perfectly concord with selection on Env in nature. The reason is that natural HIV sequences are drawn from a phylogeny (Fig 7A), not an ideal ensemble of all possible Env sequences. The frequencies of amino acids in this phylogeny reflect evolutionary history as well as natural selection. For instance, if several amino acids are equally preferred at a site, one is likely to be more frequent in the alignment due to historical contingency. Additionally, natural evolution is influenced by the genetic code and mutation biases: a mutation from the tryptophan codon TGG to the valine codon GTT is extremely unlikely even if valine is more preferred than tryptophan. Therefore, the correlation will be imperfect even if the preferences completely concord with natural selection –the question is how the actual correlation compares to what is expected given the phylogenetic history and mutation biases.

To determine the expected correlation, we simulated evolution along the phylogenetic tree in Fig 7A under the assumption that the experimentally measured preferences exactly match natural selection. Specifically, we used pyvolve [73] to simulate evolution using the experimentally informed site-specific codon substitution models described in [72], which define mutation-fixation probabilities in terms of the amino-acid preferences. In addition to the preferences and the stringency parameter P = 2.1 from Table 2, the substitution models in [72] require specification of parameters reflecting biases in the mutation process. We estimated nucleotide mutation bias parameters of φ_A = 0.55, φ_C = 0.15, φ_G = 0.11, and φ_T = 0.18 from the frequencies at the third-nucleotide codon position in sequences in the group-M alignment for sites where the most common amino acid had 4-fold codon degeneracy. We used the transition-transversion ratio of k = 4.4 estimated in [74].

The correlation between the preferences and amino-acid frequencies in a representative simulated alignment is shown in Fig 7C. As this plot illustrates, the expected correlation is only about 0.46 if the experimentally measured preferences exactly describe natural selection on Env under our model. As a control, we also simulated evolution using substitution models in which the preferences have been randomized among sites (Fig 7D); as should be the case, there is no correlation in these control simulations. So the actual correlation is nearly as high as expected if natural selection concords with the preferences measured in our experiment.

We next investigated if there are parts of Env for which there is an especially low correlation between our experimentally measured preferences and natural amino-acid frequencies. For instance, antibodies exert selection on the surface of Env in nature [6,7,75,76]. We therefore examined the actual and simulated correlations between the preferences and frequencies as a function of solvent accessibility (Fig 7E,F). For all sites (right side of Fig 7E, left side of Fig 7F), the actual correlation is only slightly lower than the range of correlations in 100 simulations. For more buried sites, both the simulated and actual correlations increase (Fig 7E), presumably because sites in the core of Env tend to have stronger preferences for specific amino acids. But as sites become more surface-exposed, the actual correlation drops below the value expected from the simulations (Fig 7F). Therefore, our experiments provide a relatively worse description of natural selection on Env’s surface than its core – probably because the evolution of the protein’s core is shaped mostly by inherent functional constraints that are effectively captured by our experiments, whereas the surface is subject to selection pressures (e.g., antibodies) that are not modeled in our experiments.

Comparing disulfide-bonded cysteines and glycosylation sites vividly illustrates this dichotomy between inherent functional constraints and external selection pressures. Env has 10 highly conserved disulfide bonds, most of which are essential for the protein’s inherent function [43]. Env also has numerous N-linked glycosylation sites, many of which are also highly conserved in nature, where they help shield the protein from antibodies [6,56]. Fig 8 shows that our experimentally measured preferences are highly correlated with natural amino-acid frequencies at the sites of the disulfides, but not at the glycosylation sites. This result can easily be rationalized: the disulfides are inherently necessary for Env’s function, whereas the glycosylation sites are important largely because of the external selection imposed by antibodies. Our experiments therefore accurately reflect the natural constraints on the former but not the latter.

Fig 8. The correlation between the experimentally measured preferences and amino-acid frequencies in natural sequences is low at glycosylation sites, but high at disulfide-bonded cysteines.

(A) The logo plots show the frequencies of amino acids in the group-M alignment or the amino-acid preferences from our experiments at a subset of potential N-linked glycosylation sites (see S7 Fig for all 30 sites). The glycosylation sites are conserved in nature, but tolerant of mutations in our experiment. The scatter plot shows that there is a poor correlation between the preferences and natural amino-acid frequencies at all 22 alignable glycosylation sites: red triangles represent the first position in each glycosylation site, whereas gray circles represent all other sites. (B) There is much better concordance between the preferences and natural amino-acid frequencies for Env’s disulfide-bonded cysteines. The logo plots show each pair of cysteines for a subset of disulfides (see S7 Fig for all 10 disulfides). The scatter plot shows that there is a strong correlation between the preferences and natural amino-acid frequencies at all disulfide-bonded cysteines.

Env has a high inherent mutational tolerance in its variable loops, and a low mutational tolerance in broadly neutralizing antibody epitopes

Different sites in Env evolve at different rates in natural HIV sequences. These differences arise from two factors. First, some sites are inherently better at tolerating mutations without disrupting Env’s essential functions. Second, some sites are under immune selection that favors rapid change. However, since Env is under selection both to maintain its function and escape immunity, it is difficult to deconvolve these factors.

Our experiments estimate each site’s inherent tolerance for mutations under selection purely for Env’s function in cell culture, without the confounding effects of immune selection (for the remainder of this section, we define a site’s mutational tolerance as the Shannon entropy of its amino-acid preferences shown in Fig 5). We can therefore assess whether regions of Env that evolve rapidly or slowly in nature also have unusually high or low inherent tolerance to mutations. We focused on two regions of Env: the portions of the protein classified as “variable loops” due to extensive variation in nature [77,78], and the epitopes of antibodies that broadly neutralize many HIV strains. We hypothesized that the variable loops would have a high inherent mutational tolerance, whereas broadly neutralizing antibody epitopes would have a low mutational tolerance.

In testing this hypothesis, it is important to control for other properties known to affect mutational tolerance. This can be done by using multiple linear regression to simultaneously analyze how several independent variables affect the dependent variable of mutational tolerance. Relative solvent accessibility (RSA) is the strongest determinant of mutational tolerance in proteins [79], so we included RSA as a variable in the regression. The region of env that contains the RRE is under strong nucleotide-level constraint [67–69, Fig 4], so we also included being in the RRE as a binary variable in the regression. We defined the variable loops as indicated in Fig 5, and included being in one of these loops as a binary variable in the regression. Finally, we used crystal structures to delineate broadly neutralizing antibody epitopes. We focused on broadly neutralizing antibodies targeting the CD4 binding site, since most other broadly neutralizing antibodies target either glycans (which are subject to pressures that are not well-modeled in our experiments; Fig 8A) or a membrane-proximal region of gp41 that is not fully resolved in crystal structures of trimeric Env making it impossible to correct for RSA. Specifically, we analyzed the three antibodies with the greatest breadth from [80]: VRC01 (PDB 3NGB [81]), 12A21 (PDB 4JPW [82]), and 3BNC117 (PDB 4JPV [82]). We defined a site as part of an epitope if it was within a 4Å inter-atomic distance of the antibody, and included the number of epitopes in which a site is found as a discrete variable in the regression.

The results of the multiple linear regression are in Table 3. As expected, increased solvent accessibility is strongly associated with increased mutational tolerance, whereas presence in the RRE is strongly associated with decreased mutational tolerance. After correcting for these effects, sites in broadly neutralizing epitopes have significantly reduced mutational tolerance. Sites in the variable loops have higher mutational tolerance, although this effect is not statistically significant after controlling for solvent accessibility (mutational tolerance is significantly elevated in the variable loops if solvent accessibility is not corrected for; S2 Table). Overall, this analysis provides statistical confirmation of something that is widely assumed in the study of HIV: broadly neutralizing antibodies are unique because they target regions of Env that are inherently intolerant of mutations.

View this table:

Table 3.

Broadly neutralizing antibody epitopes have significantly lower mutational tolerance than other sites in Env

Discussion

We have used deep mutational scanning to experimentally estimate the effects of all amino-acid mutations to most of HIV Env. Our experiments select for Env variants that enable HIV to undergo multi-cycle replication in a T-cell line. The broad trends in our data are consistent with what is expected from general considerations of how gene sequence maps to protein function: stop codons are efficiently purged by selection, many but not all nonsynonymous mutations are selected against, and synonymous mutations are less affected by selection except at regions where the nucleotide sequence itself is known to be biologically important. We also find a few sites where nonsynonymous mutations are strongly favored by selection in our experiments, probably because they adapt the virus to cell culture by affecting Env’s conformational dynamics, co-receptor binding, and glycosylation.

We use our experimental data to estimate the preference of each site in Env for each amino acid. We show that these preferences correlate with amino-acid frequencies in natural HIV sequences nearly as well as would be expected if the experimentally measured preferences capture the true selection on Env in nature. The strongest deviations between our measurements and amino-acid frequencies in HIV sequences occur at sites on the surface of the virus that in nature are targeted by pressures (such as antibodies) that are not present in our experiments.

The ability to identify deviations between our measurements and amino-acid frequencies in nature points to a powerful aspect of our approach: it can de-convolve the role of inherent functional constraints and external selection pressures in shaping Env’s evolution. For instance, it is known that some regions of Env regions are conserved in nature, and so can be targeted by broadly neutralizing antibodies. To what extent are these patterns of conservation shaped by Env’s inherent capacity to evolve versus the fact that immune selection targets some parts of the protein more strongly than others? By measuring inherent mutational tolerance at each site under functional selection alone, we show that the epitopes of broadly neutralizing antibodies indeed have a reduced capacity to tolerate mutations irrespective of the action of immune selection.

More generally, our experiments provide high-throughput experimental data that can augment computational efforts to infer features of HIV’s fitness landscape [18–20,22,83]. Such data will aid in efforts to understand viral evolutionary dynamics both within and between patients. Additionally, our experimental approach can be extended to comprehensively examine the effects of mutations on other HIV phenotypes that can be selected in the lab. We anticipate that future experiments could build on the work here to map how mutations affect phenotypes such as cell tropism or sensitivity to antibodies.

Materials and Methods

Data and computer code

The computer code to analyze the sequencing data and generate the figures is provided in a series of IPython notebooks in S3 File. Illumina sequencing data are available from the Sequence Read Archive (http://www.ncbi.nlm.nih.gov/sra) under the accession numbers in S10 File.

Sequence numbering

We use the HXB2 numbering system [44] unless otherwise noted. The “variable loop” definitions were taken from http://www.hiv.lanl.gov/, not including the flanking disulfide-bonded cysteines as part of the loops.

Codon mutant libraries

We created the codon mutant libraries in the context of the pro-viral genomic plasmid pLAI, which encodes the LAI strain of HIV [40]. This plasmid was obtained from the lab of Michael Emerman. The plasmid sequence is in S4 File.

We created codon mutant libraries of env using the PCR mutagenesis technique described in [31] (see also [32, 37]) except that we performed two total rounds of mutagenesis rather than the three rounds in [31]. The codon tiling mutagenic primers are in S5 File. The end primers were: 5’-ttggaatttctggcccagaccgtctcatgagagtgaaggagaaatatcagcacttg-3’ and 5’-catctgctgctggctcagc-3’. We created three replicate libraries by performing all the steps independently for each replicate starting with independent plasmid preps.

We cloned the PCR mutagenized env amplicons into the LAI plasmid with high efficiency to create plasmid mutant libraries. To seamlessly clone the PCR products into the proviral plasmid, we created a recipient version of the plasmid that had env replaced by GFP flanked by restriction sites for BsmBI, which cleaves outside its recognition sequence. We named this recipient plasmid pLAI-δenv-BsmBI; its sequence is in S6 File. We digested both this recipient plasmid and the gel-purified PCR amplicons with BsmBI (there are BsmBI sites at either end of the PCR amplicon), gel purified the digested PCR products, and ligated them into the plasmid using a T4 DNA ligase. We column purified the ligation products, electroporated them into competent cells (Invitrogen,12033-015), and plated the transformed cells on LB plates supplemented with 100 μg/mL ampicillin. For each of the three replicate libraries, we performed enough transformations to yield >1.4 million unique colonies as estimated by plating dilutions of each transformation on separate plates. Control ligations lacking an insert yielded at least 10-fold fewer colonies. The transformed cells were scraped from the plates, grown in liquid LB-ampicillin at 37°C for ~4 hours, and mini-prepped to obtain the plasmid mutant libraries. For the wildtype controls, we prepped three independent cultures of the wildtype LAI proviral plasmid.

Generation and passaging of viruses

We generated the mutant virus libraries by transfecting the mutant plasmid libraries into 293T cells. For each replicate, we transfected two 12-well tissue-culture plates to increase the diversity of the generated viruses. Specifically, we plated 293T cells at 2.4×10⁵ cells/well in D10 media (DMEM supplemented with 10% FBS, 1% 200 mM L-glutamine, and 1% of a solution of 10,000 units/mL penicillin and 10,000 μg/mL streptomycin). The next day, we transfected each well with 1 μg plasmid using BioT (Bioland Scientific LLC, B01-01). For the three wildtype controls we used the same process but with only a single 12-well plate per replicate. At one day post-transfection, we aspirated the old media, replacing it with fresh D10. At ~60 hours post-transfection, we filtered the transfection supernatants through 0.4 μm filters. To remove residual plasmid DNA from the transfection, we then treated the filtrate with DNase-I (Roche, 4716728001) at a final concentration of 100 U/mL in the presence of 10 mM magnesium chloride (Sigma, M8266) at 37° C for 20-30 minutes. We froze aliquots of the DNase-treated supernatant at −80°C. Aliquots were thawed and titered by TZM-bl and TCID-50 assays as described below.

We passaged the transfection supernatants in SupT1 cells obtained from the NIH AIDS Reagent Program [84]. SupT1 cells were maintained in a media identical to the D10 described above except that the DMEM was replaced with RPMI-1640 (GE Healthcare Life Sciences, SH30255.01). Before infecting cells, for replicates 1, 2, and 3 (but not replicate 3b), we first filtered thawed transfection supernatants through a 0.2 μm filter in an effort to remove any large viral aggregates. We then infected 10⁸ SupT1 cells with 5 × 10⁵ TZM-bl units of the mutant library transfection supernatant in a final volume of 100 mL SupT1 culture medium in a vented tissue-culture flask (Fisher Scientific, 14-826-80). In parallel, we passaged 10⁵ TZM-bl units of transfection supernatant for each wildtype control in 20 million SupT1 cells in a final volume of 20 mL. At one day post-infection, we pelleted cells at 300× g for 4 minutes and resuspended in fresh media to the same volume as before. At two days post-infection, we added fresh media equal to the volume already in the flask to dilute the cells and provide fresh media. We harvested virus at three days post-infection (for replicates 1, 2, and 3) or four days post-infection (for replicate 3b) by pelleting cell debri at 300×g for 4 minutes and then collecting the viral supernatant for storage at −80°C. To remove residual culture media and plasmid DNA from the cell pellets, we washed pellets two times in PBS. The washed cells were resuspended in PBS to a final concentration of 10⁷ cells/mL, and aliquots were frozen at −80°C for DNA purification.

We conducted a second passage by infecting new cells with the passage-1 viral supernatants. The second passage differed from the first passage in the following ways: Before infecting cells, we filtered passage-1 supernatant of replicate 3b-2 through a 0.2 μm filter but did not filter any of the other replicates. We also had to modify the passaging conditions for some replicates due to low titers of the passage-1 supernatants. For viruses in which the passage-1 supernatant was at too low a concentration to infect at an MOI of 0.005 in the volumes indicated above, we added additional passage-1 supernatant, and then reduced the volume to that indicated above during the day-one media change.

Virus titering by TCID₅₀ and TZM-bl assays

We measured viral titers using TZM-bl reporter cells [85]. Specifically, we added 2× 10⁴ cells in 0.5 mL D10 to each well of a 12-well plate. We made dilutions of viral inoculum and infected cells with 100 uL of each dilution. At 2 days post-infection, we fixed cells in a solution of 1% formaldehyde and 0.2% glutaraldehyde in PBS for 5 minutes at room temperature, washed with PBS to remove the fixing solution, and stained for beta-galactosidase activity with a solution of 4 mM potassium ferrocyanide, 4 mM potassium ferricyanide, and 0.4 mg/mL X-gal in PBS at 37° C for 50 minutes. After washing cells with PBS to remove the staining solution, we used a microscope to count the number of blue cells per well, computing the viral titer as the number of blue cells per mL of viral inoculum.

We were concerned that the infectious titer in SupT1 cells might differ from the TZM-bl titers. We therefore also performed TCID₅₀ assay to directly measure infectious titers in SupT1 cells. To do this, we made dilutions of viral transfection supernatant in a 96-well tissue-culture plate and added SupT1 cells at a final concentration of 2.5×10⁵ cells/mL in a final volume of 180 μL/well. At 4 and 8 days post-infection, we passaged supernatant 1:10 into fresh media to prevent cells from becoming over confluent. At 12 days post-infection, we measured the titer of culture supernatants using the TZM-bl assay to determine which SupT1 infections had led to the production of virus. Based on binary scoring from these TZM-bl assays, we calculated titers using the Reed-Muench formula [86] as implemented at https://github.com/jbloomlab/reedmuenchcalculator. At least for the LAI strain used in our experiments, the SupT1 TCID₅₀ titers were approximately equal to the TZM-bl titers. Therefore, we used only the less time-consuming TZM-bl assay for all subsequent titering.

Generation of samples for Illumina sequencing

We purified non-integrated viral DNA from aliquots of frozen cells using a mini-prep kit (Qiagen, 27104) with ~ 10⁷ cells per prep. In some cases, we then concentrated the purified DNA using Agencourt AMPure XP beads (Beckman Coulter, A63880) using a bead-to-sample ratio of 1.0 and eluting with half of the starting sample volume.

We next generated PCR amplicons of env to use as templates for Illumina sequencing. We created these amplicons from plasmid or mini-prepped non-integrated viral DNA by PCR using the primers 5’-agcgacgaagacctcctcaag-3’ and 5’-acagcactattctttagttcctgactcc-3’. PCRs were performed in 20 μl or 50 μl volumes using KOD Hot Start Master Mix (71842, EMD Millipore) with 0.3 μM of each primer and 3 ng/μl of mini-prepped DNA or 0.3 ng/μl of plasmid as template. The PCR program was:

95 ° C, 2 minutes
95 °C, 20 seconds
70 ° C, 1 second
64.3 °C, 10 seconds (cooling to this temperature at 0.5 °C/second)
70 ° C, 1 minute 48 seconds
Go to 2, 27 times
hold at 4 ° C

For replicate 3b, there were a few modifications: the annealing temperature was 64.9 ° C, the extension time was 54 seconds, and we performed only 25 cycles. To quantify the number of unique template molecules amplified in each PCR, we performed standard curves using known amounts of template env in pro-viral plasmid, and ran the the bands on an agarose gel alongside our amplicons for visual quantification. We performed a sufficient number of PCR reactions to ensure that amplicons from plasmid were coming from > 10⁶ unique template molecules, and amplicons from viral DNA were coming from ~ 2 × 10⁵ template molecules. All PCR products were purified with Agencourt beads (using a sample-to-bead ratio of 1.0) and quantified by Quant-iT PicoGreen dsDNA Assay Kit (Life Technologies, P7589).

We deep sequenced these amplicons using the strategy for barcoded-subamplicon sequencing in [37], dividing env into six subamplicons (this is a variation of the strategy originally described in [87–89]). The sequences of the primers used in the two rounds of PCR are in S9 File. Our first-round PCR conditions slightly differed from [37]: our 25 μL PCRs contained 12.5 μL KOD Hot Start Master Mix, 0.3 μM of each primer, and 5 ng of purified amplicon. For replicates 1, 2, and 3, the first-round PCR program was:

95 ° C, 2 minutes
95 °C, 20 seconds
70 ° C, 1 seconds
60 °C, 10 seconds (cooling to this temperature at 0.5 °C/second)
70 °C, 10 seconds
Go to 2, 10 times
95 °C, 1 min
hold 4 °C

For replicate 3b, we used the same program, but with 9 PCR cycles instead of 11. Prior to the second round PCR, we bottlenecked each subamplicon by diluting it to a concentration that should have yielded between 3 and 5 × 10⁵ unique single-stranded molecules per subamplicon per sample. We purified the second-round PCR products using Agencourt beads, quantified with PicoGreen, pooled in equimolar amounts, and purified by agarose gel electrophoresis, excising DNA corresponding to the expected ~500 base pairs in length. We sequenced the purified DNA using multiple runs of an Illumina MiSeq with 2×275 bp paired-end reads.

Analysis of deep-sequencing data

We used dms_tools (http://jbloomlab.github.io/dms_tools/), version 1.1.dev13, to filter and align the deep-sequencing reads, count the number of times each codon mutation was observed both before and after selection, and infer Env’s site-specific amino-acid preferences using the algorithm described in [70]. The code that performs this analysis is in S3 File. Figures summarizing the results of the deep sequencing are also in this supplementary file.

Alignment of group-M env sequences

We downloaded the 2014 filtered web alignment of env from http://www.hiv.lanl.gov/, including all subtypes for HIV-1/SIVcpz. We then curated this alignment in the following ways. First, we removed sequences differed in length from HXB2 (including gap characters) or contained a premature stop codon, ambiguous residue, or frame-shift mutation. Next, we removed columns in the alignment for which we lacked deep mutational scanning data, columns that had >5% gap characters, or columns in variable loops that appeared poorly aligned by eye. Finally, we randomly selected 30 sequences per subtype for group-M subtypes A, B, C, D, F, and G, for a total of 180 sequences. The resulting alignment is in S7 File. The phylogenetic tree in Fig 7 was inferred using RAxML [90] with the GTRCAT substitution model.

Computing relative solvent accessibilities

We computed absolute solvent accessibilities based on the PDB structure 4TVP (including all three Env monomers after removing antibody chains) using DSSP [91,92]. We normalized absolute solvent accessibilities to relative ones using the maximum accessibilities provided in the first table of [93]. The relative solvent accessibilities are listed in S8 File.

Acknowledgments

Thanks to Michael Emerman for providing the LAI plasmid and numerous helpful suggestions, to Julie Overbaugh for numerous helpful suggestions, and to Lily Wu for HIV training.

Footnotes

↵* jbloom{at}fredhutch.org

References

1.↵
Zanini F, Brodin J, Thebo L, Lanz C, Bratt G, Albert J, et al. Population genomics of intrapatient HIV-1 evolution. eLife. 2015;4:e11282.
OpenUrl PubMed
2.
Chen FC, Vallender E, Wang H, Tzeng CS, Li WH. Genomic divergence between human and chimpanzee estimated from large-scale alignments of genomic sequences. Journal of Heredity. 2001;92(6):481–489.
OpenUrl CrossRef GeoRef PubMed Web of Science
3.
Mikkelsen T, Hillier L, Eichler E, Zody M, Jaffe D, Yang SP, et al. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature. 2005;437(7055):69–87.
OpenUrl CrossRef PubMed Web of Science
4.↵
Langergraber KE, Prüfer K, Rowney C, Boesch C, Crockford C, Fawcett K, et al. Generation times in wild chimpanzees and gorillas suggest earlier divergence times in great ape and human evolution. Proceedings of the National Academy of Sciences. 2012;109(39):15716–15721.
OpenUrl Abstract/FREE Full Text
5.↵
Albert J, Abrahamsson B, Nagy K, Aurelius E, Gaines H, Nyström G, et al. Rapid development of isolate-specific neutralizing antibodies after primary HIV-1 infection and consequent emergence of virus variants which resist neutralization by autologous sera. Aids. 1990;4(2):107–112.
OpenUrl CrossRef PubMed Web of Science
6.↵
Wei X, Decker JM, Wang S, Hui H, Kappes JC, Wu X, et al. Antibody neutralization and escape by HIV-1. Nature. 2003;422(6929):307–312.
OpenUrl CrossRef PubMed Web of Science
7.↵
Richman DD, Wrin T, Little SJ, Petropoulos CJ. Rapid evolution of the neutralizing antibody response to HIV type 1 infection. Proceedings of the National Academy of Sciences. 2003;100(7):4144–4149.
OpenUrl Abstract/FREE Full Text
8.↵
Olshevsky U, Helseth E, Furman C, Li J, Haseltine W, Sodroski J. Identification of individual human immunodeficiency virus type 1 gp120 amino acids important for CD4 receptor binding. Journal of virology. 1990;64(12):5701–5707.
OpenUrl Abstract/FREE Full Text
9.
Cordonnier A, Montagnier L, Emerman M. Single amino-acid changes in HIV envelope affect viral tropism and receptor binding. Nature. 1989;340(6234):571.
OpenUrl CrossRef PubMed
10.↵
Basmaciogullari S, Babcock GJ, Van Ryk D, Wojtowicz W, Sodroski J. Identification of conserved and variable structures in the human immunodeficiency virus gp120 glycoprotein of importance for CXCR4 binding. Journal of virology. 2002;76(21):10791–10800.
OpenUrl Abstract/FREE Full Text
11.↵
Freed E, Myers D, Risser R. Mutational analysis of the cleavage sequence of the human immunodeficiency virus type 1 envelope glycoprotein precursor gp160. Journal of virology. 1989;63(11):4670–4675.
OpenUrl Abstract/FREE Full Text
12.
Lu M, Stoller MO, Wang S, Liu J, Fagan MB, Nunberg JH. Structural and functional analysis of interhelical interactions in the human immunodeficiency virus type 1 gp41 envelope glycoprotein by alanine-scanning mutagenesis. Journal of virology. 2001;75(22):11146–11156.
OpenUrl Abstract/FREE Full Text
13.
Jacobs A, Sen J, Rong L, Caffrey M. Alanine scanning mutants of the HIV gp41 loop. Journal of Biological Chemistry. 2005;280(29):27284–27288.
OpenUrl Abstract/FREE Full Text
14.
Zwick MB, Jensen R, Church S, Wang M, Stiegler G, Kunert R, et al. Anti-human immunodeficiency virus type 1 (HIV-1) antibodies 2F5 and 4E10 require surprisingly few crucial residues in the membrane-proximal external region of glycoprotein gp41 to neutralize HIV-1. Journal of virology. 2005;79(2):1252–1261.
OpenUrl Abstract/FREE Full Text
15.
Pantophlet R, Saphire EO, Poignard P, Parren PW, Wilson IA, Burton DR. Fine mapping of the interaction of neutralizing and nonneutralizing monoclonal antibodies with the CD4 binding site of human immunodeficiency virus type 1 gp120. Journal of virology. 2003;77(1):642–658.
OpenUrl Abstract/FREE Full Text
16.
Li Y, O’Dell S, Walker LM, Wu X, Guenaga J, Feng Y, et al. Mechanism of neutralization by the broadly neutralizing HIV-1 monoclonal antibody VRC01. Journal of virology. 2011;85(17):8954–8967.
OpenUrl Abstract/FREE Full Text
17.↵
Lynch RM, Wong P, Tran L, O’Dell S, Nason MC, Li Y, et al. HIV-1 fitness cost associated with escape from the VRC01 class of CD4 binding site neutralizing antibodies. Journal of virology. 2015;89(8):4201–4213.
OpenUrl Abstract/FREE Full Text
18.↵
Dahirel V, Shekhar K, Pereyra F, Miura T, Artyomov M, Talsania S, et al. Coordinate linkage of HIV evolution reveals regions of immunological vulnerability. Proceedings of the National Academy of Sciences. 2011;108(28):11530–11535.
OpenUrl Abstract/FREE Full Text
19.
Ferguson AL, Mann JK, Omarjee S, Ndung’u T, Walker BD, Chakraborty AK. Translating HIV sequences into quantitative fitness landscapes predicts viral vulnerabilities for rational immunogen design. Immunity. 2013;38(3):606–617.
OpenUrl CrossRef PubMed Web of Science
20.↵
Zanini F, Puller V, Brodin J, Albert J, Neher R. In-vivo mutation rates and fitness landscape of HIV-1. arXiv preprint arXiv:160306634. 2016;.
21.↵
Zanini F, Neher RA. Quantifying selection against synonymous mutations in HIV-1 env evolution. Journal of virology. 2013;87(21):11843–11850.
OpenUrl Abstract/FREE Full Text
22.↵
Hartl M, Theys K, Feder A, Gelbart M, Stern A, Pennings PS. Within-patient HIV mutation frequencies reveal fitness costs of CpG dinucleotides, drastic amino acid changes and G→A mutations. bioRxiv. 2016; p. 057026.
23.↵
Fowler DM, Araya CL, Fleishman SJ, Kellogg EH, Stephany JJ, Baker D, et al. High-resolution mapping of protein sequence-function relationships. Nature methods. 2010;7(9):741–746.
OpenUrl
24.
Fowler DM, Fields S. Deep mutational scanning: a new style of protein science. Nature methods. 2014;11(8):801–807.
OpenUrl
25.↵
Boucher JI, Cote P, Flynn J, Jiang L, Laban A, Mishra P, et al. Viewing protein fitness landscapes through a next-gen lens. Genetics. 2014;198(2):461–471.
OpenUrl
26.↵
McLaughlin Jr RN, Poelwijk FJ, Raman A, Gosal WS, Ranganathan R. The spatial architecture of protein function and adaptation. Nature. 2012;491(7422):138–142.
OpenUrl CrossRef PubMed Web of Science
27.
Roscoe BP, Thayer KM, Zeldovich KB, Fushman D, Bolon DN. Analyses of the effects of all ubiquitin point mutants on yeast growth rate. Journal of molecular biology. 2013;425(8):1363–1377.
OpenUrl CrossRef PubMed
28.
Firnberg E, Labonte JW, Gray JJ, Ostermeier M. A comprehensive, high-resolution map of a gene’s fitness landscape. Molecular biology and evolution. 2014;31(6):1581–1592.
OpenUrl CrossRef PubMed Web of Science
29.
Olson CA, Wu NC, Sun R. A comprehensive biophysical description of pairwise epistasis throughout an entire protein domain. Current Biology. 2014;24(22):2643–2651.
OpenUrl CrossRef PubMed
30.
Melnikov A, Rogov P, Wang L, Gnirke A, Mikkelsen TS. Comprehensive mutational scanning of a kinase in vivo reveals substrate-dependent fitness landscapes. Nucleic Acids Research. 2014;42(14):e112.
OpenUrl CrossRef PubMed
31.↵
Bloom JD. An Experimentally Determined Evolutionary Model Dramatically Improves Phylogenetic Fit. Mol Biol Evol. 2014;31(8):195–619.
OpenUrl
32.↵
Thyagarajan B, Bloom JD. The inherent mutational tolerance and antigenic evolvability of influenza hemagglutinin. eLife. 2014;3:e03300.
OpenUrl CrossRef PubMed
33.
Stiffler MA, Hekstra DR, Ranganathan R. Evolvability as a function of purifying selection in TEM-1 β-lactamase. Cell. 2015;160(5):882–892.
OpenUrl CrossRef PubMed
34.
Doud M, Ashenberg O, Bloom J. Site-Specific Amino Acid Preferences Are Mostly Conserved in Two Closely Related Protein Homologs. Molecular biology and evolution. 2015;32(11):2944–2960.
OpenUrl CrossRef PubMed
35.
Kitzman JO, Starita LM, Lo RS, Fields S, Shendure J. Massively parallel single-amino-acid mutagenesis. Nature methods. 2015;12(3):203–206.
OpenUrl
36.
Mishra P, Flynn JM, Starr TN, Bolon DN. Systematic Mutant Analyses Elucidate General and Client-Specific Aspects of Hsp90 Function. Cell reports. 2016;15(3):588–598.
OpenUrl
37.↵
Doud MB, Bloom JD. Accurate measurement of the effects of all amino-acid mutations to influenza hemagglutinin. Viruses. 2016;8:155.
OpenUrl CrossRef PubMed
38.↵
Mavor D, Fraser J, et al. Determination of Ubiquitin Fitness Landscapes Under Different Chemical Stresses in a Classroom Setting. eLife. 2016;5:e15802.
OpenUrl CrossRef
39.↵
Al-Mawsawi LQ, Wu NC, Olson C, Shi VC, Qi H, Zheng X, et al. High-throughput profiling of point mutations across the HIV-1 genome. Retrovirology. 2014;11(1):124.
OpenUrl CrossRef PubMed
40.↵
Peden K, Emerman M, Montagnier L. Changes in growth properties on passage in tissue culture of viruses derived from infectious molecular clones of HIV-1 LAI, HIV-1 MAL, and HIV-1 ELI. Virology. 1991;185(2):661–672.
OpenUrl CrossRef PubMed Web of Science
41.↵
Kwong PD, Wyatt R, Robinson J, Sweet RW, Sodroski J, Hendrickson WA. Structure of an HIV gp120 envelope glycoprotein in complex with the CD4 receptor and a neutralizing human antibody. Nature. 1998;393(6686):648–659.
OpenUrl CrossRef PubMed Web of Science
42.↵
Pancera M, Majeed S, Ban YEA, Chen L, Huang Cc, Kong L, et al. Structure of HIV-1 gp120 with gp41-interactive region reveals layered envelope architecture and basis of conformational mobility. Proceedings of the National Academy of Sciences. 2010;107(3):116–611.
OpenUrl
43.↵
van Anken E, Sanders RW, Liscaljet IM, Land A, Bontjer I, Tillemans S, et al. Only five of 10 strictly conserved disulfide bonds are essential for folding and eight for function of the HIV-1 envelope glycoprotein. Molecular biology of the cell. 2008;19(10):429–843.
OpenUrl
44.↵
Korber B, Foley BT, Kuiken C, Pillai SK, Sodroski JG, et al. Numbering positions in HIV relative to HXB2CG. Human retroviruses and AIDS. 1998;3:102–111.
OpenUrl
45.↵
Chakrabarti L, Emerman M, Tiollais P, Sonigo P. The cytoplasmic domain of simian immunodeficiency virus transmembrane protein modulates infectivity. Journal of virology. 1989;63(10):439–544.
OpenUrl
46.
Yuste E, Reeves JD, Doms RW, Desrosiers RC. Modulation of Env content in virions of simian immunodeficiency virus: correlation with cell surface expression and virion infectivity. Journal of virology. 2004;78(13):677–567.
OpenUrl
47.↵
Li Y, Luo L, Thomas DY, Kang OY. Control of expression, glycosylation, and secretion of HIV-1 gp120 by homologous and heterologous signal sequences. Virology. 1994;204(1):266–278.
OpenUrl CrossRef PubMed Web of Science
48.↵
Sheehy AM, Gaddis NC, Choi JD, Malim MH. Isolation of a human gene that inhibits HIV-1 infection and is suppressed by the viral Vif protein. Nature. 2002;418(6898):646–650.
OpenUrl CrossRef PubMed Web of Science
49.↵
Refsland EW, Stenglein MD, Shindo K, Albin JS, Brown WL, Harris RS. Quantitative profiling of the full APOBEC3 mRNA repertoire in lymphocytes and tissues: implications for HIV-1 restriction. Nucleic acids research. 2010;38(13):427–442.
OpenUrl
50.↵
Ho YC, Shan L, Hosmane NN, Wang J, Laskey SB, Rosenbloom DI, et al. Replication-competent noninduced proviruses in the latent reservoir increase barrier to HIV-1 cure. Cell. 2013;155(3):540–551.
OpenUrl CrossRef PubMed Web of Science
51.↵
Cuevas JM, Geller R, Garijo R, Lopez-Aldeguer J, Sanjuan R. Extremely High Mutation Rate of HIV-1 In Vivo. PLoS Biol. 2015;13(9):e1002251.
OpenUrl CrossRef PubMed
52.↵
Sloan RD, Wainberg MA. The role of unintegrated DNA in HIV infection. Retrovirology. 2011;8(1):1.
OpenUrl PubMed
53.↵
Guo HH, Choe J, Loeb LA. Protein tolerance to random amino acid change. Proceedings of the National Academy of Sciences of the United States of America. 2004;101(25):920–592.
OpenUrl
54.
Shafikhani S, Siegel R, Ferrari E, Schellenberger V. Generation of large libraries of random mutants in Bacillus subtilis by PCR-based plasmid multimerization. Biotechniques. 1997;23(2):304–311.
OpenUrl PubMed Web of Science
55.↵
Bloom JD, Silberg JJ, Wilke CO, Drummond DA, Adami C, Arnold FH. Thermodynamic prediction of protein neutrality. Proceedings of the National Academy of Sciences of the United States of America. 2005;102(3):606–611.
OpenUrl Abstract/FREE Full Text
56.↵
Johnson WE, Desrosiers RC. Viral persistence: HIV’s strategies of immune system evasion. Annual review of medicine. 2002;53(1):499–518.
OpenUrl CrossRef PubMed Web of Science
57.↵
Ohgimoto S, Shioda T, Mori K, Nakayama EE, Hu H, Nagai Y. Location-specific, unequal contribution of the N glycans in simian immunodeficiency virus gp120 to viral infectivity and removal of multiple glycans without disturbing infectivity. Journal of virology. 1998;72(10):836–583.
OpenUrl
58.
Pugach P, Kuhmann SE, Taylor J, Marozsan AJ, Snyder A, Ketas T, et al. The prolonged culture of human immunodeficiency virus type 1 in primary lymphocytes increases its sensitivity to neutralization by soluble CD4. Virology. 2004;321(1):8–22.
OpenUrl CrossRef PubMed
59.↵
Wang W, Nie J, Prochnow C, Truong C, Jia Z, Wang S, et al. A systematic study of the N-glycosylation sites of HIV-1 envelope protein on infectivity and antibody-mediated neutralization. Retrovirology. 2013;10(1):1.
OpenUrl CrossRef PubMed
60.↵
Moore JP, Cao Y, Qing L, Sattentau QJ, Pyati J, Koduri R, et al. Primary isolates of human immunodeficiency virus type 1 are relatively resistant to neutralization by monoclonal antibodies to gp120, and their neutralization is not predicted by studies with monomeric gp120. Journal of virology. 1995;69(1):101–109.
OpenUrl Abstract/FREE Full Text
61.↵
Sullivan N, Sun Y, Li J, Hofmann W, Sodroski J. Replicative function and neutralization sensitivity of envelope glycoproteins from primary and T-cell line-passaged human immunodeficiency virus type 1 isolates. Journal of virology. 1995;69(7):441–344.
OpenUrl
62.↵
Parmley JL, Chamary J, Hurst LD. Evidence for purifying selection against synonymous mutations in mammalian exonic splicing enhancers. Molecular biology and evolution. 2006;23(2):301–l309.
OpenUrl CrossRef PubMed Web of Science
63.
Cuevas JM, Domingo-Calap P, Sanjuán R. The Fitness Effects of Synonymous Mutations in DNA and RNA Viruses. Molecular Biology and Evolution. 2012;29(1):17–20.
OpenUrl CrossRef PubMed Web of Science
64.↵
Subramaniam AR, DeLoughery A, Bradshaw N, Chen Y, O’Shea E, Losick R, et al. A serine sensor for multicellularity in a bacterium. Elife. 2013;2:e01501.
OpenUrl CrossRef PubMed
65.↵
Haas J, Park EC, Seed B. Codon usage limitation in the expression of HIV-1 envelope glycoprotein. Current Biology. 1996;6(3):315–324.
OpenUrl CrossRef PubMed Web of Science
66.↵
Watts JM, Dang KK, Gorelick RJ, Leonard CW, Bess , Jr JW, Swanstrom R, et al. Architecture and secondary structure of an entire HIV-1 RNA genome. Nature. 2009;460(7256):711–716.
OpenUrl CrossRef PubMed Web of Science
67.↵
Fernandes J, Jayaraman B, Frankel A. The HIV-1 Rev response element: an RNA scaffold that directs the cooperative assembly of a homo-oligomeric ribonucleoprotein complex. RNA biology. 2012;9(1):6–11.
OpenUrl
68.↵
Malim MH, Hauber J, Le SY, Maizel JV, Cullen BR. The HIV-1 rev trans-activator acts through a structured targetsequence to activate nuclear export of unspliced viral mRNA. Nature. 1989;338(6212):254–257.
OpenUrl CrossRef PubMed Web of Science
69.↵
Emerman M, Vazeux R, Peden K. The rev gene product of the human immunodeficiency virus affects envelope-specific RNA localization. Cell. 1989;57(7):115–511.
OpenUrl CrossRef PubMed Web of Science
70.↵
Bloom JD. Software for the analysis and visualization of deep mutational scanning data. BMC bioinformatics. 2015;16:168.
71.↵
Bloom JD. An experimentally informed evolutionary model improves phylogenetic fit to divergent lactamase homologs. Molecular biology and evolution. 2014;31(10):275–327.
OpenUrl
72.↵
Bloom JD. Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models. bioRxiv. 2016; p.037689.
73.↵
Spielman SJ, Wilke CO. Pyvolve: a flexible Python module for simulating sequences along phylogenies. PloS one. 2015;10(9):e0139047.
OpenUrl CrossRef PubMed
74.↵
Nielsen R, Yang Z. Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics. 1998;148(3):929–936.
OpenUrl Abstract/FREE Full Text
75.↵
Shankarappa R, Margolick JB, Gange SJ, Rodrigo AG, Upchurch D, Farzadegan H, et al. Consistent viral evolutionary changes associated with the progression of human immunodeficiency virus type 1 infection. Journal of virology. 1999;73(12):10489–10502.
OpenUrl Abstract/FREE Full Text
76.↵
Overbaugh J, Morris L. The antibody response against HIV-1. Cold Spring Harbor perspectives in medicine. 2012;2(1):a007039.
OpenUrl Abstract/FREE Full Text
77.↵
Starcich BR, Hahn BH, Shaw GM, McNeely PD, Modrow S, Wolf H, et al. Identification and characterization of conserved and variable regions in the envelope gene of HTLV-III/LAV, the retrovirus of AIDS. Cell. 1986;45(5):637–648.
OpenUrl CrossRef PubMed Web of Science
78.↵
Modrow S, Hahn BH, Shaw GM, Gallo RC, Wong-Staal F, Wolf H. Computer-assisted analysis of envelope protein sequences of seven human immunodeficiency virus isolates: prediction of antigenic epitopes in conserved and variable regions. Journal of virology. 1987;61(2):570–578.
OpenUrl Abstract/FREE Full Text
79.↵
Ramsey DC, Scherrer MP, Zhou T, Wilke CO. The relationship between relative solvent accessibility and evolutionary rate in protein evolution. Genetics. 2011;188:479–488.
OpenUrl Abstract/FREE Full Text
80.↵
Zhou T, Lynch RM, Chen L, Acharya P, Wu X, Doria-Rose NA, et al. Structural repertoire of HIV-1-neutralizing antibodies targeting the CD4 supersite in 14 donors. Cell. 2015;161(6):1280–1292.
OpenUrl CrossRef PubMed
81.↵
Zhou T, Georgiev I, Wu X, Yang ZY, Dai K, Finzi A, et al. Structural basis for broad and potent neutralization of HIV-1 by antibody VRC01. Science. 2010;329(5993):811–817.
OpenUrl Abstract/FREE Full Text
82.↵
Klein F, Diskin R, Scheid JF, Gaebler C, Mouquet H, Georgiev IS, et al. Somatic mutations of the immunoglobulin framework are generally required for broad and potent HIV-1 neutralization. Cell. 2013;153(1):126–138.
OpenUrl CrossRef PubMed Web of Science
83.↵
Kouyos RD, Leventhal GE, Hinkley T, Haddad M, Whitcomb JM, Petropoulos CJ, et al. Exploring the complexity of the HIV-1 fitness landscape. PLoS Genet. 2012;8(3):e1002551.
OpenUrl CrossRef PubMed
84.↵
Ablashi D, Berneman Z, Kramarsky B, Whitman J, Asano Y, Pearson G. Human herpesvirus-7 (HHV-7): current status. Clinical and diagnostic virology. 1995;4(1):1–13.
OpenUrl CrossRef PubMed
85.↵
Wei X, Decker JM, Liu H, Zhang Z, Arani RB, Kilby JM, et al. Emergence of resistant human immunodeficiency virus type 1 in patients receiving fusion inhibitor (T-20) monotherapy. Antimicrobial agents and chemotherapy. 2002;46(6):189–619.
OpenUrl
86.↵
Reed LJ, Muench H. A simple method of estimating fifty per cent endpoints. American journal of epidemiology. 1938;27(3):493–497.
OpenUrl CrossRef
87.↵
Hiatt JB, Patwardhan RP, Turner EH, Lee C, Shendure J. Parallel, tag-directed assembly of locally derived short sequence reads. Nat Methods. 2010;7(2):119–122.
OpenUrl CrossRef PubMed Web of Science
88.
Jabara CB, Jones CD, Roach J, Anderson JA, Swanstrom R. Accurate sampling and deep sequencing of the HIV-1 protease gene using a Primer ID. Proceedings of the National Academy of Sciences. 2011;108(50):20166–20171.
OpenUrl Abstract/FREE Full Text
89.↵
Kinde I, Wu J, Papadopoulos N, Kinzler KW, Vogelstein B. Detection and quantification of rare mutations with massively parallel sequencing. Proceedings of the National Academy of Sciences. 2011;108(23):953–095.
OpenUrl
90.↵
Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014; p. btu033.
91.↵
Touw WG, Baakman C, Black J, te Beek TA, Krieger E, Joosten RP, et al. A series of PDB-related databanks for everyday needs. Nucleic acids research. 2014; p. gku1028.
92.↵
Kabsch W, Sander C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers. 1983;22(12):257–726.
OpenUrl
93.↵
Tien MZ, Meyer AG, Sydykova DK, Spielman SJ, Wilke CO. Maximum allowed solvent accessibilites of residues in proteins. PloS one. 2013;8(11):e80635.
OpenUrl CrossRef PubMed
94.↵
Stewart-Jones GB, Soto C, Lemmin T, Chuang GY, Druz A, Kong R, et al. Trimeric HIV-1-Env Structures Define Glycan Shields from Clades A,B, and G. Cell. 2016;165(4):813–826.
OpenUrl CrossRef PubMed
95.↵
Sanders RW, Vesanen M, Schuelke N, Master A, Schiffner L, Kalyanaraman R, et al. Stabilization of the soluble, cleaved, trimeric form of the envelope glycoprotein complex of human immunodeficiency virus type 1. Journal of virology. 2002;76(17):887–588.
OpenUrl
96.↵
de Taeye SW, Ozorowski G, de la Peña AT, Guttman M, Julien JP, van den Kerkhof TL, et al. Immunogenicity of Stabilized HIV-1 Envelope Trimers with Reduced Exposure of Non-neutralizing Epitopes. Cell. 2015;163(7):170–217.
OpenUrl
97.↵
Rizzuto CD, Wyatt R, Hernández-Ramos N, Sun Y, Kwong PD, Hendrickson WA, et al. A conserved HIV gp120 glycoprotein structure involved in chemokine receptor binding. Science. 1998;280(5371):194–919.
OpenUrl Abstract/FREE Full Text
98.↵
Kyte J, Doolittle RF. A simple method for displaying the hydropathic character of a protein. Journal of molecular biology. 1982;157(1):105–132.
OpenUrl CrossRef PubMed Web of Science
99.
Pancera M, Zhou T, Druz A, Georgiev IS, Soto C, Gorman J, et al. Structure and immune recognition of trimeric pre-fusion HIV-1 Env. Nature. 2014;514(7523):455–461.
OpenUrl CrossRef PubMed Web of Science
100.↵
Zhang M, Gaschen B, Blay W, Foley B, Haigwood N, Kuiken C, et al. Tracking global patterns of N-linked glycosylation site variation in highly variable viral glycoproteins: HIV, SIV, and HCV envelopes and influenza hemagglutinin. Glycobiology. 2004;14(12):122–912.
OpenUrl
101.↵
Leonard CK, Spellman MW, Riddle L, Harris RJ, Thomas JN, Gregory T. Assignment of intrachain disulfide bonds and characterization of potential glycosylation sites of the type 1 recombinant human immunodeficiency virus envelope glycoprotein (gp120) expressed in Chinese hamster ovary cells. Journal of Biological Chemistry. 1990;265(18):10373–10382.
OpenUrl Abstract/FREE Full Text

View the discussion thread.

Posted August 02, 2016.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Microbiology

Subject Areas

All Articles

Animal Behavior and Cognition (5201)
Biochemistry (11718)
Bioengineering (8724)
Bioinformatics (29132)
Biophysics (14936)
Cancer Biology (12051)
Cell Biology (17360)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14146)
Epidemiology (2067)
Evolutionary Biology (18269)
Genetics (12223)
Genomics (16768)
Immunology (11844)
Microbiology (28016)
Molecular Biology (11560)
Neuroscience (60822)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4940)
Plant Biology (10401)
Scientific Communication and Education (1680)
Synthetic Biology (2878)
Systems Biology (7333)
Zoology (1642)

[1] 1.↵
Zanini F, Brodin J, Thebo L, Lanz C, Bratt G, Albert J, et al. Population genomics of intrapatient HIV-1 evolution. eLife. 2015;4:e11282.
OpenUrl PubMed

[2] 2.
Chen FC, Vallender E, Wang H, Tzeng CS, Li WH. Genomic divergence between human and chimpanzee estimated from large-scale alignments of genomic sequences. Journal of Heredity. 2001;92(6):481–489.
OpenUrl CrossRef GeoRef PubMed Web of Science

[3] 3.
Mikkelsen T, Hillier L, Eichler E, Zody M, Jaffe D, Yang SP, et al. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature. 2005;437(7055):69–87.
OpenUrl CrossRef PubMed Web of Science

[4] 4.↵
Langergraber KE, Prüfer K, Rowney C, Boesch C, Crockford C, Fawcett K, et al. Generation times in wild chimpanzees and gorillas suggest earlier divergence times in great ape and human evolution. Proceedings of the National Academy of Sciences. 2012;109(39):15716–15721.
OpenUrl Abstract/FREE Full Text

[5] 5.↵
Albert J, Abrahamsson B, Nagy K, Aurelius E, Gaines H, Nyström G, et al. Rapid development of isolate-specific neutralizing antibodies after primary HIV-1 infection and consequent emergence of virus variants which resist neutralization by autologous sera. Aids. 1990;4(2):107–112.
OpenUrl CrossRef PubMed Web of Science

[6] 6.↵
Wei X, Decker JM, Wang S, Hui H, Kappes JC, Wu X, et al. Antibody neutralization and escape by HIV-1. Nature. 2003;422(6929):307–312.
OpenUrl CrossRef PubMed Web of Science

[7] 7.↵
Richman DD, Wrin T, Little SJ, Petropoulos CJ. Rapid evolution of the neutralizing antibody response to HIV type 1 infection. Proceedings of the National Academy of Sciences. 2003;100(7):4144–4149.
OpenUrl Abstract/FREE Full Text

[8] 8.↵
Olshevsky U, Helseth E, Furman C, Li J, Haseltine W, Sodroski J. Identification of individual human immunodeficiency virus type 1 gp120 amino acids important for CD4 receptor binding. Journal of virology. 1990;64(12):5701–5707.
OpenUrl Abstract/FREE Full Text

[9] 9.
Cordonnier A, Montagnier L, Emerman M. Single amino-acid changes in HIV envelope affect viral tropism and receptor binding. Nature. 1989;340(6234):571.
OpenUrl CrossRef PubMed

[10] 10.↵
Basmaciogullari S, Babcock GJ, Van Ryk D, Wojtowicz W, Sodroski J. Identification of conserved and variable structures in the human immunodeficiency virus gp120 glycoprotein of importance for CXCR4 binding. Journal of virology. 2002;76(21):10791–10800.
OpenUrl Abstract/FREE Full Text

[11] 11.↵
Freed E, Myers D, Risser R. Mutational analysis of the cleavage sequence of the human immunodeficiency virus type 1 envelope glycoprotein precursor gp160. Journal of virology. 1989;63(11):4670–4675.
OpenUrl Abstract/FREE Full Text

[12] 12.
Lu M, Stoller MO, Wang S, Liu J, Fagan MB, Nunberg JH. Structural and functional analysis of interhelical interactions in the human immunodeficiency virus type 1 gp41 envelope glycoprotein by alanine-scanning mutagenesis. Journal of virology. 2001;75(22):11146–11156.
OpenUrl Abstract/FREE Full Text

[13] 13.
Jacobs A, Sen J, Rong L, Caffrey M. Alanine scanning mutants of the HIV gp41 loop. Journal of Biological Chemistry. 2005;280(29):27284–27288.
OpenUrl Abstract/FREE Full Text

[14] 14.
Zwick MB, Jensen R, Church S, Wang M, Stiegler G, Kunert R, et al. Anti-human immunodeficiency virus type 1 (HIV-1) antibodies 2F5 and 4E10 require surprisingly few crucial residues in the membrane-proximal external region of glycoprotein gp41 to neutralize HIV-1. Journal of virology. 2005;79(2):1252–1261.
OpenUrl Abstract/FREE Full Text

[15] 15.
Pantophlet R, Saphire EO, Poignard P, Parren PW, Wilson IA, Burton DR. Fine mapping of the interaction of neutralizing and nonneutralizing monoclonal antibodies with the CD4 binding site of human immunodeficiency virus type 1 gp120. Journal of virology. 2003;77(1):642–658.
OpenUrl Abstract/FREE Full Text

[16] 16.
Li Y, O’Dell S, Walker LM, Wu X, Guenaga J, Feng Y, et al. Mechanism of neutralization by the broadly neutralizing HIV-1 monoclonal antibody VRC01. Journal of virology. 2011;85(17):8954–8967.
OpenUrl Abstract/FREE Full Text

[17] 17.↵
Lynch RM, Wong P, Tran L, O’Dell S, Nason MC, Li Y, et al. HIV-1 fitness cost associated with escape from the VRC01 class of CD4 binding site neutralizing antibodies. Journal of virology. 2015;89(8):4201–4213.
OpenUrl Abstract/FREE Full Text

[18] 18.↵
Dahirel V, Shekhar K, Pereyra F, Miura T, Artyomov M, Talsania S, et al. Coordinate linkage of HIV evolution reveals regions of immunological vulnerability. Proceedings of the National Academy of Sciences. 2011;108(28):11530–11535.
OpenUrl Abstract/FREE Full Text

[19] 19.
Ferguson AL, Mann JK, Omarjee S, Ndung’u T, Walker BD, Chakraborty AK. Translating HIV sequences into quantitative fitness landscapes predicts viral vulnerabilities for rational immunogen design. Immunity. 2013;38(3):606–617.
OpenUrl CrossRef PubMed Web of Science

[20] 20.↵
Zanini F, Puller V, Brodin J, Albert J, Neher R. In-vivo mutation rates and fitness landscape of HIV-1. arXiv preprint arXiv:160306634. 2016;.

[21] 21.↵
Zanini F, Neher RA. Quantifying selection against synonymous mutations in HIV-1 env evolution. Journal of virology. 2013;87(21):11843–11850.
OpenUrl Abstract/FREE Full Text

[22] 22.↵
Hartl M, Theys K, Feder A, Gelbart M, Stern A, Pennings PS. Within-patient HIV mutation frequencies reveal fitness costs of CpG dinucleotides, drastic amino acid changes and G→A mutations. bioRxiv. 2016; p. 057026.

[23] 23.↵
Fowler DM, Araya CL, Fleishman SJ, Kellogg EH, Stephany JJ, Baker D, et al. High-resolution mapping of protein sequence-function relationships. Nature methods. 2010;7(9):741–746.
OpenUrl

[24] 24.
Fowler DM, Fields S. Deep mutational scanning: a new style of protein science. Nature methods. 2014;11(8):801–807.
OpenUrl

[25] 25.↵
Boucher JI, Cote P, Flynn J, Jiang L, Laban A, Mishra P, et al. Viewing protein fitness landscapes through a next-gen lens. Genetics. 2014;198(2):461–471.
OpenUrl

[26] 26.↵
McLaughlin Jr RN, Poelwijk FJ, Raman A, Gosal WS, Ranganathan R. The spatial architecture of protein function and adaptation. Nature. 2012;491(7422):138–142.
OpenUrl CrossRef PubMed Web of Science

[27] 27.
Roscoe BP, Thayer KM, Zeldovich KB, Fushman D, Bolon DN. Analyses of the effects of all ubiquitin point mutants on yeast growth rate. Journal of molecular biology. 2013;425(8):1363–1377.
OpenUrl CrossRef PubMed

[28] 28.
Firnberg E, Labonte JW, Gray JJ, Ostermeier M. A comprehensive, high-resolution map of a gene’s fitness landscape. Molecular biology and evolution. 2014;31(6):1581–1592.
OpenUrl CrossRef PubMed Web of Science

[29] 29.
Olson CA, Wu NC, Sun R. A comprehensive biophysical description of pairwise epistasis throughout an entire protein domain. Current Biology. 2014;24(22):2643–2651.
OpenUrl CrossRef PubMed

[30] 30.
Melnikov A, Rogov P, Wang L, Gnirke A, Mikkelsen TS. Comprehensive mutational scanning of a kinase in vivo reveals substrate-dependent fitness landscapes. Nucleic Acids Research. 2014;42(14):e112.
OpenUrl CrossRef PubMed

[31] 31.↵
Bloom JD. An Experimentally Determined Evolutionary Model Dramatically Improves Phylogenetic Fit. Mol Biol Evol. 2014;31(8):195–619.
OpenUrl

[32] 32.↵
Thyagarajan B, Bloom JD. The inherent mutational tolerance and antigenic evolvability of influenza hemagglutinin. eLife. 2014;3:e03300.
OpenUrl CrossRef PubMed

[33] 33.
Stiffler MA, Hekstra DR, Ranganathan R. Evolvability as a function of purifying selection in TEM-1 β-lactamase. Cell. 2015;160(5):882–892.
OpenUrl CrossRef PubMed

[34] 34.
Doud M, Ashenberg O, Bloom J. Site-Specific Amino Acid Preferences Are Mostly Conserved in Two Closely Related Protein Homologs. Molecular biology and evolution. 2015;32(11):2944–2960.
OpenUrl CrossRef PubMed

[35] 35.
Kitzman JO, Starita LM, Lo RS, Fields S, Shendure J. Massively parallel single-amino-acid mutagenesis. Nature methods. 2015;12(3):203–206.
OpenUrl

[36] 36.
Mishra P, Flynn JM, Starr TN, Bolon DN. Systematic Mutant Analyses Elucidate General and Client-Specific Aspects of Hsp90 Function. Cell reports. 2016;15(3):588–598.
OpenUrl

[37] 37.↵
Doud MB, Bloom JD. Accurate measurement of the effects of all amino-acid mutations to influenza hemagglutinin. Viruses. 2016;8:155.
OpenUrl CrossRef PubMed

[38] 38.↵
Mavor D, Fraser J, et al. Determination of Ubiquitin Fitness Landscapes Under Different Chemical Stresses in a Classroom Setting. eLife. 2016;5:e15802.
OpenUrl CrossRef

[39] 39.↵
Al-Mawsawi LQ, Wu NC, Olson C, Shi VC, Qi H, Zheng X, et al. High-throughput profiling of point mutations across the HIV-1 genome. Retrovirology. 2014;11(1):124.
OpenUrl CrossRef PubMed

[40] 40.↵
Peden K, Emerman M, Montagnier L. Changes in growth properties on passage in tissue culture of viruses derived from infectious molecular clones of HIV-1 LAI, HIV-1 MAL, and HIV-1 ELI. Virology. 1991;185(2):661–672.
OpenUrl CrossRef PubMed Web of Science

[41] 41.↵
Kwong PD, Wyatt R, Robinson J, Sweet RW, Sodroski J, Hendrickson WA. Structure of an HIV gp120 envelope glycoprotein in complex with the CD4 receptor and a neutralizing human antibody. Nature. 1998;393(6686):648–659.
OpenUrl CrossRef PubMed Web of Science

[42] 42.↵
Pancera M, Majeed S, Ban YEA, Chen L, Huang Cc, Kong L, et al. Structure of HIV-1 gp120 with gp41-interactive region reveals layered envelope architecture and basis of conformational mobility. Proceedings of the National Academy of Sciences. 2010;107(3):116–611.
OpenUrl

[43] 43.↵
van Anken E, Sanders RW, Liscaljet IM, Land A, Bontjer I, Tillemans S, et al. Only five of 10 strictly conserved disulfide bonds are essential for folding and eight for function of the HIV-1 envelope glycoprotein. Molecular biology of the cell. 2008;19(10):429–843.
OpenUrl

[44] 44.↵
Korber B, Foley BT, Kuiken C, Pillai SK, Sodroski JG, et al. Numbering positions in HIV relative to HXB2CG. Human retroviruses and AIDS. 1998;3:102–111.
OpenUrl

[45] 45.↵
Chakrabarti L, Emerman M, Tiollais P, Sonigo P. The cytoplasmic domain of simian immunodeficiency virus transmembrane protein modulates infectivity. Journal of virology. 1989;63(10):439–544.
OpenUrl

[46] 46.
Yuste E, Reeves JD, Doms RW, Desrosiers RC. Modulation of Env content in virions of simian immunodeficiency virus: correlation with cell surface expression and virion infectivity. Journal of virology. 2004;78(13):677–567.
OpenUrl

[47] 47.↵
Li Y, Luo L, Thomas DY, Kang OY. Control of expression, glycosylation, and secretion of HIV-1 gp120 by homologous and heterologous signal sequences. Virology. 1994;204(1):266–278.
OpenUrl CrossRef PubMed Web of Science

[48] 48.↵
Sheehy AM, Gaddis NC, Choi JD, Malim MH. Isolation of a human gene that inhibits HIV-1 infection and is suppressed by the viral Vif protein. Nature. 2002;418(6898):646–650.
OpenUrl CrossRef PubMed Web of Science

[49] 49.↵
Refsland EW, Stenglein MD, Shindo K, Albin JS, Brown WL, Harris RS. Quantitative profiling of the full APOBEC3 mRNA repertoire in lymphocytes and tissues: implications for HIV-1 restriction. Nucleic acids research. 2010;38(13):427–442.
OpenUrl

[50] 50.↵
Ho YC, Shan L, Hosmane NN, Wang J, Laskey SB, Rosenbloom DI, et al. Replication-competent noninduced proviruses in the latent reservoir increase barrier to HIV-1 cure. Cell. 2013;155(3):540–551.
OpenUrl CrossRef PubMed Web of Science

[51] 51.↵
Cuevas JM, Geller R, Garijo R, Lopez-Aldeguer J, Sanjuan R. Extremely High Mutation Rate of HIV-1 In Vivo. PLoS Biol. 2015;13(9):e1002251.
OpenUrl CrossRef PubMed

[52] 52.↵
Sloan RD, Wainberg MA. The role of unintegrated DNA in HIV infection. Retrovirology. 2011;8(1):1.
OpenUrl PubMed

[53] 53.↵
Guo HH, Choe J, Loeb LA. Protein tolerance to random amino acid change. Proceedings of the National Academy of Sciences of the United States of America. 2004;101(25):920–592.
OpenUrl

[54] 54.
Shafikhani S, Siegel R, Ferrari E, Schellenberger V. Generation of large libraries of random mutants in Bacillus subtilis by PCR-based plasmid multimerization. Biotechniques. 1997;23(2):304–311.
OpenUrl PubMed Web of Science

[55] 55.↵
Bloom JD, Silberg JJ, Wilke CO, Drummond DA, Adami C, Arnold FH. Thermodynamic prediction of protein neutrality. Proceedings of the National Academy of Sciences of the United States of America. 2005;102(3):606–611.
OpenUrl Abstract/FREE Full Text

[56] 56.↵
Johnson WE, Desrosiers RC. Viral persistence: HIV’s strategies of immune system evasion. Annual review of medicine. 2002;53(1):499–518.
OpenUrl CrossRef PubMed Web of Science

[57] 57.↵
Ohgimoto S, Shioda T, Mori K, Nakayama EE, Hu H, Nagai Y. Location-specific, unequal contribution of the N glycans in simian immunodeficiency virus gp120 to viral infectivity and removal of multiple glycans without disturbing infectivity. Journal of virology. 1998;72(10):836–583.
OpenUrl

[58] 58.
Pugach P, Kuhmann SE, Taylor J, Marozsan AJ, Snyder A, Ketas T, et al. The prolonged culture of human immunodeficiency virus type 1 in primary lymphocytes increases its sensitivity to neutralization by soluble CD4. Virology. 2004;321(1):8–22.
OpenUrl CrossRef PubMed

[59] 59.↵
Wang W, Nie J, Prochnow C, Truong C, Jia Z, Wang S, et al. A systematic study of the N-glycosylation sites of HIV-1 envelope protein on infectivity and antibody-mediated neutralization. Retrovirology. 2013;10(1):1.
OpenUrl CrossRef PubMed

[60] 60.↵
Moore JP, Cao Y, Qing L, Sattentau QJ, Pyati J, Koduri R, et al. Primary isolates of human immunodeficiency virus type 1 are relatively resistant to neutralization by monoclonal antibodies to gp120, and their neutralization is not predicted by studies with monomeric gp120. Journal of virology. 1995;69(1):101–109.
OpenUrl Abstract/FREE Full Text

[61] 61.↵
Sullivan N, Sun Y, Li J, Hofmann W, Sodroski J. Replicative function and neutralization sensitivity of envelope glycoproteins from primary and T-cell line-passaged human immunodeficiency virus type 1 isolates. Journal of virology. 1995;69(7):441–344.
OpenUrl

[62] 62.↵
Parmley JL, Chamary J, Hurst LD. Evidence for purifying selection against synonymous mutations in mammalian exonic splicing enhancers. Molecular biology and evolution. 2006;23(2):301–l309.
OpenUrl CrossRef PubMed Web of Science

[63] 63.
Cuevas JM, Domingo-Calap P, Sanjuán R. The Fitness Effects of Synonymous Mutations in DNA and RNA Viruses. Molecular Biology and Evolution. 2012;29(1):17–20.
OpenUrl CrossRef PubMed Web of Science

[64] 64.↵
Subramaniam AR, DeLoughery A, Bradshaw N, Chen Y, O’Shea E, Losick R, et al. A serine sensor for multicellularity in a bacterium. Elife. 2013;2:e01501.
OpenUrl CrossRef PubMed

[65] 65.↵
Haas J, Park EC, Seed B. Codon usage limitation in the expression of HIV-1 envelope glycoprotein. Current Biology. 1996;6(3):315–324.
OpenUrl CrossRef PubMed Web of Science

[66] 66.↵
Watts JM, Dang KK, Gorelick RJ, Leonard CW, Bess , Jr JW, Swanstrom R, et al. Architecture and secondary structure of an entire HIV-1 RNA genome. Nature. 2009;460(7256):711–716.
OpenUrl CrossRef PubMed Web of Science

[67] 67.↵
Fernandes J, Jayaraman B, Frankel A. The HIV-1 Rev response element: an RNA scaffold that directs the cooperative assembly of a homo-oligomeric ribonucleoprotein complex. RNA biology. 2012;9(1):6–11.
OpenUrl

[68] 68.↵
Malim MH, Hauber J, Le SY, Maizel JV, Cullen BR. The HIV-1 rev trans-activator acts through a structured targetsequence to activate nuclear export of unspliced viral mRNA. Nature. 1989;338(6212):254–257.
OpenUrl CrossRef PubMed Web of Science

[69] 69.↵
Emerman M, Vazeux R, Peden K. The rev gene product of the human immunodeficiency virus affects envelope-specific RNA localization. Cell. 1989;57(7):115–511.
OpenUrl CrossRef PubMed Web of Science

[70] 70.↵
Bloom JD. Software for the analysis and visualization of deep mutational scanning data. BMC bioinformatics. 2015;16:168.

[71] 71.↵
Bloom JD. An experimentally informed evolutionary model improves phylogenetic fit to divergent lactamase homologs. Molecular biology and evolution. 2014;31(10):275–327.
OpenUrl

[72] 72.↵
Bloom JD. Identification of positive selection in genes is greatly improved by using experimentally informed site-specific models. bioRxiv. 2016; p.037689.

[73] 73.↵
Spielman SJ, Wilke CO. Pyvolve: a flexible Python module for simulating sequences along phylogenies. PloS one. 2015;10(9):e0139047.
OpenUrl CrossRef PubMed

[74] 74.↵
Nielsen R, Yang Z. Likelihood models for detecting positively selected amino acid sites and applications to the HIV-1 envelope gene. Genetics. 1998;148(3):929–936.
OpenUrl Abstract/FREE Full Text

[75] 75.↵
Shankarappa R, Margolick JB, Gange SJ, Rodrigo AG, Upchurch D, Farzadegan H, et al. Consistent viral evolutionary changes associated with the progression of human immunodeficiency virus type 1 infection. Journal of virology. 1999;73(12):10489–10502.
OpenUrl Abstract/FREE Full Text

[76] 76.↵
Overbaugh J, Morris L. The antibody response against HIV-1. Cold Spring Harbor perspectives in medicine. 2012;2(1):a007039.
OpenUrl Abstract/FREE Full Text

[77] 77.↵
Starcich BR, Hahn BH, Shaw GM, McNeely PD, Modrow S, Wolf H, et al. Identification and characterization of conserved and variable regions in the envelope gene of HTLV-III/LAV, the retrovirus of AIDS. Cell. 1986;45(5):637–648.
OpenUrl CrossRef PubMed Web of Science

[78] 78.↵
Modrow S, Hahn BH, Shaw GM, Gallo RC, Wong-Staal F, Wolf H. Computer-assisted analysis of envelope protein sequences of seven human immunodeficiency virus isolates: prediction of antigenic epitopes in conserved and variable regions. Journal of virology. 1987;61(2):570–578.
OpenUrl Abstract/FREE Full Text

[79] 79.↵
Ramsey DC, Scherrer MP, Zhou T, Wilke CO. The relationship between relative solvent accessibility and evolutionary rate in protein evolution. Genetics. 2011;188:479–488.
OpenUrl Abstract/FREE Full Text

[80] 80.↵
Zhou T, Lynch RM, Chen L, Acharya P, Wu X, Doria-Rose NA, et al. Structural repertoire of HIV-1-neutralizing antibodies targeting the CD4 supersite in 14 donors. Cell. 2015;161(6):1280–1292.
OpenUrl CrossRef PubMed

[81] 81.↵
Zhou T, Georgiev I, Wu X, Yang ZY, Dai K, Finzi A, et al. Structural basis for broad and potent neutralization of HIV-1 by antibody VRC01. Science. 2010;329(5993):811–817.
OpenUrl Abstract/FREE Full Text

[82] 82.↵
Klein F, Diskin R, Scheid JF, Gaebler C, Mouquet H, Georgiev IS, et al. Somatic mutations of the immunoglobulin framework are generally required for broad and potent HIV-1 neutralization. Cell. 2013;153(1):126–138.
OpenUrl CrossRef PubMed Web of Science

[83] 83.↵
Kouyos RD, Leventhal GE, Hinkley T, Haddad M, Whitcomb JM, Petropoulos CJ, et al. Exploring the complexity of the HIV-1 fitness landscape. PLoS Genet. 2012;8(3):e1002551.
OpenUrl CrossRef PubMed

[84] 84.↵
Ablashi D, Berneman Z, Kramarsky B, Whitman J, Asano Y, Pearson G. Human herpesvirus-7 (HHV-7): current status. Clinical and diagnostic virology. 1995;4(1):1–13.
OpenUrl CrossRef PubMed

[85] 85.↵
Wei X, Decker JM, Liu H, Zhang Z, Arani RB, Kilby JM, et al. Emergence of resistant human immunodeficiency virus type 1 in patients receiving fusion inhibitor (T-20) monotherapy. Antimicrobial agents and chemotherapy. 2002;46(6):189–619.
OpenUrl

[86] 86.↵
Reed LJ, Muench H. A simple method of estimating fifty per cent endpoints. American journal of epidemiology. 1938;27(3):493–497.
OpenUrl CrossRef

[87] 87.↵
Hiatt JB, Patwardhan RP, Turner EH, Lee C, Shendure J. Parallel, tag-directed assembly of locally derived short sequence reads. Nat Methods. 2010;7(2):119–122.
OpenUrl CrossRef PubMed Web of Science

[88] 88.
Jabara CB, Jones CD, Roach J, Anderson JA, Swanstrom R. Accurate sampling and deep sequencing of the HIV-1 protease gene using a Primer ID. Proceedings of the National Academy of Sciences. 2011;108(50):20166–20171.
OpenUrl Abstract/FREE Full Text

[89] 89.↵
Kinde I, Wu J, Papadopoulos N, Kinzler KW, Vogelstein B. Detection and quantification of rare mutations with massively parallel sequencing. Proceedings of the National Academy of Sciences. 2011;108(23):953–095.
OpenUrl

[90] 90.↵
Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014; p. btu033.

[91] 91.↵
Touw WG, Baakman C, Black J, te Beek TA, Krieger E, Joosten RP, et al. A series of PDB-related databanks for everyday needs. Nucleic acids research. 2014; p. gku1028.

[92] 92.↵
Kabsch W, Sander C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers. 1983;22(12):257–726.
OpenUrl

[93] 93.↵
Tien MZ, Meyer AG, Sydykova DK, Spielman SJ, Wilke CO. Maximum allowed solvent accessibilites of residues in proteins. PloS one. 2013;8(11):e80635.
OpenUrl CrossRef PubMed

[94] 94.↵
Stewart-Jones GB, Soto C, Lemmin T, Chuang GY, Druz A, Kong R, et al. Trimeric HIV-1-Env Structures Define Glycan Shields from Clades A,B, and G. Cell. 2016;165(4):813–826.
OpenUrl CrossRef PubMed

[95] 95.↵
Sanders RW, Vesanen M, Schuelke N, Master A, Schiffner L, Kalyanaraman R, et al. Stabilization of the soluble, cleaved, trimeric form of the envelope glycoprotein complex of human immunodeficiency virus type 1. Journal of virology. 2002;76(17):887–588.
OpenUrl

[96] 96.↵
de Taeye SW, Ozorowski G, de la Peña AT, Guttman M, Julien JP, van den Kerkhof TL, et al. Immunogenicity of Stabilized HIV-1 Envelope Trimers with Reduced Exposure of Non-neutralizing Epitopes. Cell. 2015;163(7):170–217.
OpenUrl

[97] 97.↵
Rizzuto CD, Wyatt R, Hernández-Ramos N, Sun Y, Kwong PD, Hendrickson WA, et al. A conserved HIV gp120 glycoprotein structure involved in chemokine receptor binding. Science. 1998;280(5371):194–919.
OpenUrl Abstract/FREE Full Text

[98] 98.↵
Kyte J, Doolittle RF. A simple method for displaying the hydropathic character of a protein. Journal of molecular biology. 1982;157(1):105–132.
OpenUrl CrossRef PubMed Web of Science

[99] 99.
Pancera M, Zhou T, Druz A, Georgiev IS, Soto C, Gorman J, et al. Structure and immune recognition of trimeric pre-fusion HIV-1 Env. Nature. 2014;514(7523):455–461.
OpenUrl CrossRef PubMed Web of Science

[100] 100.↵
Zhang M, Gaschen B, Blay W, Foley B, Haigwood N, Kuiken C, et al. Tracking global patterns of N-linked glycosylation site variation in highly variable viral glycoproteins: HIV, SIV, and HCV envelopes and influenza hemagglutinin. Glycobiology. 2004;14(12):122–912.
OpenUrl

[101] 101.↵
Leonard CK, Spellman MW, Riddle L, Harris RJ, Thomas JN, Gregory T. Assignment of intrachain disulfide bonds and characterization of potential glycosylation sites of the type 1 recombinant human immunodeficiency virus envelope glycoprotein (gp120) expressed in Chinese hamster ovary cells. Journal of Biological Chemistry. 1990;265(18):10373–10382.
OpenUrl Abstract/FREE Full Text

Experimental estimation of the effects of all amino-acid mutations to HIV Env

Abstract

Introduction

Results

Deep mutational scanning of Env

Most mutations are under purifying selection, but a few sites experience selection for cell-culture adaptation mutations

The preference for each amino acid at each site in Env

The amino-acid preferences correlate with amino-acid frequencies in HIV sequence alignments at most sites, but deviate at positions subject to selection pressures absent from our experiments

Env has a high inherent mutational tolerance in its variable loops, and a low mutational tolerance in broadly neutralizing antibody epitopes

Discussion

Materials and Methods

Data and computer code

Sequence numbering

Codon mutant libraries

Generation and passaging of viruses

Virus titering by TCID50 and TZM-bl assays

Generation of samples for Illumina sequencing

Analysis of deep-sequencing data

Alignment of group-M env sequences

Computing relative solvent accessibilities

Acknowledgments

Footnotes

References

Citation Manager Formats

Subject Area

Virus titering by TCID₅₀ and TZM-bl assays