Introduction

In 1960 and 1961, Cain & Currey (1963a) surveyed morph frequencies in populations of the visually polymorphic land snails Cepaea nemoralis and C. hortensis on the Marlborough Downs in south-central England, including both woods and open downland sites. The locality of this survey is famous for having generated the concept of ‘area effect’, a term used by Cain and Currey to describe places where ‘the predominance of a few morphs irrespective of habitat and background characterizes areas vastly larger than that of panmictic populations’ (Cain & Currey, 1963a). The area examined falls within a square approximately 8 km from N to S and 7 km from W to E. The brown morph, which is dominant over pink and yellow at the colour locus, was uncommon over most of the area, but brown snails occurred at a frequency of over 25% to the north (Barbury Castle) and up to 90% in samples in the southern third of the study area.

In 1985 the area was resurveyed by Cowie & Jones (1998). Some sites no longer had Cepaea present and at others the species composition had changed. The majority, however, permitted comparison of morph frequencies between the two surveys. The intention was to look for general trends in the data, which may be correlated with climatic change or with large-scale ecological changes such as the vegetational modifications that have followed the decline in rabbit populations in recent years (Murray & Clarke, 1978; Cameron et al., 1980). Overall patterns of change were therefore examined, without special attention being paid to possible between-colony differences. In C. nemoralis there was no apparent change in frequency of the banded morphs over the intervening period, but a significant overall decrease in the frequency of browns. In C. hortensis the frequency of unbanded shells has increased and there has probably been a decline in the frequency of fusions within the banded class (Cowie & Jones, 1998).

In making these comparisons, it was necessary to choose a method for measuring change in frequency and estimating selection. In the course of discussing the options, the local morph frequency differences were also analysed in more detail. The results of this finer-scale examination are considered here.

Materials and methods

The survey and scoring methods are fully outlined by Cowie & Jones (1998). All but four of the 157 Marlborough Downs sites of Cain & Currey (1963a), including both C. nemoralis and C. hortensis sites, were identified. Snails were collected from as small an area as possible, usually less than 100 m2, although if they were sparse or if the original site was a larger area, collecting sites were sometimes expanded. Samples were scored as in the original survey, and only scores for adult individuals were analysed. Whenever samples were large enough to allow valid statistical testing to be carried out, the morph frequencies in samples from 1960–61 were compared with those from 1985.

The ‘brown’ area effect to the south of the Marlborough Downs involves a high frequency of browns compared to the adjacent areas. It was possible to compare frequencies of browns and non-browns in a total of 43 possible comparisons. Twenty-three sites showed a decrease in browns. Pooling all data from these 43 sites, a general decline in frequency of browns over the 25 intervening years was detected.

The generation time of C. nemoralis is between 3 years (Clarke & Murray, 1962; Wolda, 1970; Murray & Clarke, 1978) and 5 years (Wall et al., 1980). The interval between the two surveys therefore spans approximately eight to five generations. The method of calculating selective coefficients over several generations by treating the difference equation as an approximation to a differential equation and finding the integral goes back at least to Haldane (1924); summarized in Haldane, (1932). The problem is described by Clarke & Murray (1962) as follows, when discussing change in frequency of the brown allele at the colour locus of C. nemoralis at another locality . After a round of random mating the population is represented by:

C represents the ground colour locus, B and X the brown and the non-brown alleles. Over one generation the change in allele frequency is:

Comparison was made of three methods of estimating s, given an estimate of the initial frequency q and the change in frequency.

Method A, by integration

If q0 is the frequency of recessive alleles at the start, and qn is the frequency after n generations and s refers to the dominant morph, as above, then integrating Δq provides (Clarke & Murray, 1962):

The selective coefficient s may therefore be calculated assuming a given number (n) of generations to have elapsed. Clarke & Murray (1962) also provide the variance, which takes account of sample size. The estimate is an approximation which will be more accurate for frequencies near the centre of the range than for extreme frequencies, and for small selective coefficients. Manly (1985) provides a comprehensive account of the properties of this and related methods.

Method B, from morph ratios

An alternative method of calculating s is described by Cain et al. (1990). The value of 1 – s is estimated for an arbitrary time interval (1 year in their study) by saying that the ratio of morphs of the dominant allele to morphs of the recessive allele in 1 year is 1 – s times the ratio in the previous time interval. This is true during the course of selection within one generation, and is a reasonable way of representing change when there are overlapping generations, but the pattern is modified by segregation of gametes. The method works over a range of frequencies and times for which the logarithm of frequency on time is more or less linear. If Pn is the frequency of the dominant morph at time n and Qn is the frequency of the recessive morph:

Then for a pair of ratios with a gap of n time units, fitness per unit time may be obtained from:

The transition from discrete to continuous change is discussed by Cavalli (1950), and graphical treatment of allele frequency change using log transformation is employed extensively by Cavalli-Sforza & Bodmer (1971).

Method C, by iteration

These methods were developed some years ago when personal computers were not available. If fitness differences per generation are desired, the natural approach now would be to choose a value of s which provides qn, given q0, by trial and error. This may be carried out simply by incrementing s by small amounts until a suitable degree of agreement is reached or by use of a convergent algorithm. We have used the method of false positions (e.g. Fröberg, 1966). Let y = qnq′, where q′ is an estimate of qn found using a trial value of s. For two such values yk–1 and yk, computed using two values sk–1 and sk, then so long as s is reasonably close to the required value, a better value of s is:

The s-value calculated by the integration method was used as the starting point for the convergence.

Results

Table 1 shows the three methods of estimating s applied to the data on frequency of browns. It is assumed that eight generations have elapsed, because most ecological studies have indicated a generation time of about 3 years. There can be seen to be general agreement between the estimates, but in order to examine the relationship more closely, methods A and B have been plotted on method C in Fig. 1. Good linear relationships exist between A and C and between B and C, except that samples 3 and 7, which are among the samples with extreme frequencies, produce aberrant estimates of s. The diagonal shown is for unit slope, and the graphs both indicate steeper trends. Excluding samples 3 and 7 the reduced major axis estimate of slope is 1.30 for A on C and 1.05 for B on C. The integration method therefore provides estimates some 30% larger than the iteration, whereas estimates by the ratio method are 5% larger. Further experimentation with a range of values of q and s chosen at random and different time intervals indicates that integration progressively overestimates s as the absolute value of s increases. The ratio method underestimates s when s is negative and works best for positive s and frequencies of the dominant morph between about 10 and 70%.

Table 1 Brown morph frequency in Cepaea nemoralis and selective coefficient of the brown morph for 43 sites on the Marlborough Downs sampled twice, with a 25-year interval between samplings. It is assumed that eight generations have elapsed. Br1 and Br2, morph frequencies in 1961 and 1985; A, estimate of s and its standard error by integration; B, estimate from morph ratios; C, estimate by iteration
Fig. 1
figure 1

Estimates of selection in Cepaea nemoralis using data on change in morph frequency in Table 1. Left, estimates by method A plotted on method C; right, method B plotted on method C. See text for description of methods. Diagonal line shows unit slope. Steeper lines are slopes of the reduced major axis when the exceptional values marked with boxes (samples 3 and 7) are excluded.

Using the estimates of standard error from method A in Table 1, we can test for significance of deviation from s = 0 by calculating χ2 as s2/Var(s). The sum of these values minus the χ2-value calculated from the sum is a measure of heterogeneity between samples. It is 113.5, which is highly significant (d.f. = 42, P < 0.001). This is a conservative estimate of heterogeneity; the equivalent calculation based on the 2 × 2 contingency tables for each sample gives a value of 187.2. It is therefore legitimate to look for patterns of variation in the data. Figure 2 shows a diagram of the location of the sites. Those in which s has a positive sign (brown at a disadvantage) are indicated as squares and those with a negative sign (brown at an advantage) as crosses. The grid shown is coded as in the grid of 1 km squares given in fig. 3 of Cain & Currey (1963a). Over most of the area, sites where brown has increased are mixed with those in which it has decreased. There is, however, a band of 10 contiguous sites extending over 2 km in row D, located on Rockley and Clatford Down, in which brown has decreased. It is possible that these indicate an area where the disadvantage of brown is particularly marked.

Fig. 2
figure 2

Sites for which change in brown morph frequency in Cepaea nemoralis is recorded, arranged according to the 1 km grid given by Cain & Currey (1963a). Squares, no change or decline of brown between surveys; crosses, increase in brown between surveys.

Fig. 3
figure 3

Sites for which change in unbanded frequency in Cepaea nemoralis is recorded, arranged on the 1 km grid. Four regions labelled A–D were distinguished by Cain & Currey (1963a). In area A samples contain unbanded and mid-banded, which is a phenotype homozygous for five-banded. Area C is almost entirely five-banded, and B and D are intermediate in frequency. Squares, no change or decline in unbanded; crosses, increase in unbanded between surveys.

Spatial pattern is not quite independent of frequency, however, because brown is at a low frequency at Barbury Castle in square I4 and increases towards the highest values at Parson’s Penning in C2 and B3. In order to find out whether frequency change is related to initial frequency, the difference between the successive brown morph frequencies was plotted on the initial value. The regression coefficient obtained was −0.171 ± 0.089 (for comparison with zero slope, t41 = 1.93, P = 0.06). For small changes entirely independent of frequency, the expected slope would be zero, whereas the observed value is negative and nearly significant. In any sampling from starting values within a limited range where the succeeding samples are random variables within the same range a negative slope is to be expected; if both sets of values were random variables the expected slope would be −1. A negative gradient may therefore suggest either random or selective convergence of extreme frequencies.

Brown is the morph that provides most data for comparison. Five-banded remains present at a very high frequency to the north, and there is a large area to the south that is almost entirely unbanded and mid-banded. Between the two is a stretch of land showing intermediate frequencies (Fig. 3). Cain & Currey (1963a) divide it into two areas referred to as ‘B, the Barbury area (transitional)’ and ‘D, the southern intermediate area’. This zone contains 16 sites for which both early and late samples are large enough for comparisons to be made. The average frequency of unbanded over the whole stretch did not change (31.9% in the first survey, 32.2% in the second). Using contingency tables, however, the χ2 testing heterogeneity is 41.1, which is significant (d.f. = 15, P < 0.001). This is largely because three samples on Maizey Down in square G6 have shown a decline in unbanded, whereas in samples to the north-west (squares I3, 4 and 5 and H6), unbanded has tended to increase. The cline between the two frequency patches may be relocating.

To the south of the transitional areas is one where samples are almost entirely unbanded or mid-banded. Unbanded is dominant over five-banded at the banding locus, and an allele at an unlinked locus reduces the five-banded phenotype to a single central band. This area is therefore almost fixed for mid-banded, and the frequency of unbanded may be examined as in the transitional area for the possibility of temporal change. Here, 42 samples are available for comparison. The heterogeneity of response is significant (χ241 = 65.0, P ≈ 0.01). If the four areas are taken together (63 samples), then the regression of change in frequency of unbanded on initial frequency is significantly negative (b = –0.163, t61 = 2.07, P < 0.05).

Discussion

Three methods have been used to calculate the selective coefficient of the brown morph in samples of C. nemoralis from the Marlborough Downs. Although one is a mathematical and another a pragmatic approximation, they perform so as to give estimates which are usually broadly comparable to those obtained by iteration over the interval concerned. The ratio method is set up to give a direct estimate over any preferred time interval, not necessarily a generation. Assuming eight generations to have elapsed over the 25-year time interval, the average decline in frequency of browns is such as could be brought about by a selective disadvantage of 3–6% per generation.

Unfortunately, it is still not known what kind of selection may be operating. Difference in consistency and magnitude of change between the Rockley/Clatford Down region and elsewhere may indicate local variation in selection, but if so, it does not follow obvious differences in habitat. Overall, there is overwhelming evidence that selection influences morph frequencies in C. nemoralis (Clarke et al., 1978; Jones et al., 1978; Cook, 1998). Frequencies probably track habitat change brought about by climatic fluctuation and human activity. The response is slow, however, and with respect to the southern English downland populations, Cameron and colleagues have argued that introductions over the past few centuries have played an important part in establishing the present area effects (Cameron et al., 1977, 1980; Cameron & Dillon, 1984; Cameron & Pannett, 1985). Distinct patches of different frequencies could therefore be a result of relatively recent meeting of two or more spreading populations and may be disappearing as introgression takes place and as all the patches respond to similar large-scale selection pressures. The tendency to homogenization suggested by the present analysis may be evidence of this process. Over a longer term, if land-use patterns change again, Cepaea populations may vanish from the Downs altogether, or be subject to a new set of disturbances which change the morph frequencies before they have had a chance to become homogeneous or matched to selection pressures. New area effects could appear, with frequencies different from those we see now, and with different boundaries.

After intensive study for over half a century the extraordinary phenotypic variety in C. nemoralis is still not understood. The challenge should not be abandoned because the problem is difficult to unravel; it relates to a common class of polymorphisms. In field surveys, however, there is a risk of overexplaining data and finding support for hypotheses where none exists. To avoid this, Cowie & Jones (1998) tested only hypotheses already set up, and with respect to browns concluded that a small general decline in frequency has taken place. In this paper we suggest that there may also be spatial variation in response and a homogenizing process of some kind. Support is not conclusive, but it is sometimes rewarding to follow suggestive leads. Instead of simply reporting that morph frequency did not relate to habitat on the Marlborough Downs, Cain & Currey (1963a) used their data to develop the new concept of area effects. The result was a large body of work and discussion (e.g. Cain & Currey, 1963b; Goodhart, 1963; Wright, 1965; Carter, 1968; Clarke, 1968; Gould & Woodruff, 1990), which has stimulated new approaches to understanding the polymorphism.

Any abrupt transition may signal an area effect. Cain (1968) examined another on a sand dune in north Wales, at the same time wondering whether the term should be applied to so small an area as it occupied. This Welsh dune has recently been resurveyed (Cook & Pettitt, 1998). In view of the disturbances known to have occurred in the interim, the present pattern suggests that whatever caused the discontinuity observed by Cain (1968), movement of individuals has since contributed to homogenization. The results for brown from the Marlborough Downs may be another case of migration leading to decay of accidentally generated discontinuities. If movement is responsible for breaking down pattern, it should act equally on all morphs. It is therefore noteworthy that similar patterns were observed in both brown and unbanded/five-banded frequencies. Repeat surveys, studies of allozyme variation and rates of colonization of new regions all suggest that passive movement may have a more important influence on morph frequency patterns than has previously been assumed (Cook, 1998; Cook & Pettitt, 1998). If so, reassessment of the effects of migration may lead to a fresh view of the causes of polymorphism in Cepaea.