The global diversity of the major parasitic nematode Haemonchus contortus is shaped by human intervention and climate

G. Sallé; S.R. Doyle; J. Cortet; J. Cabaret; M. Berriman; N. Holroyd; J.A Cotton

doi:10.1101/450692

Abstract

The gastrointestinal parasite Haemonchus contortus is an haematophagous parasitic nematode of veterinary interest and a model for the study of drug resistance mechanisms or host-parasite interactions. To understand its evolutionary history, and its ability to adapt in the face of climatic and drug pressure, we have performed an extensive survey of genome-wide diversity using single-worm whole genome sequencing of 223 individuals sampled from 19 isolates spanning five continents. The pattern of global diversity is driven by an African origin for the species, together with contemporary dispersal that is consistent with modern human movement, with evidence for parasites spreading during the transatlantic slave trade and colonisation of Australia presented. Strong selective sweeps were identified in independent populations each surrounding the β-tubulin locus, a target of benzimidazole anthelmintic drug treatment used widely to control H. contortus infections. These signatures of selection were further supported by signals of diversifying selection enriched in genes involved in response to drugs, as well as other anthelmintic-associated biological functions including pharyngeal pumping and oviposition. From these analyses, we identify some known, and previously undescribed, candidate genes that may play a role in ivermectin resistance. Finally, we describe genetic signatures of climate-driven adaptation, revealing a gene acting as an epigenetic regulator and components of the dauer pathway may play a role in adaptation in the face of climatic fluctuations. These results begin to define genetic adaptation to climate for the first time in a parasitic nematode, and provides insight into the ongoing expansion in the range of Haemonchus contortus, which may have consequences for the management of this parasite.

Introduction

Nematodes have evolved to exploit a wide diversity of ecological niches. Although many sustain a free-living lifestyle, parasitic nematodes rely on one or more hosts to complete their life cycle. Many parasitic nematodes undergo a complex series of morphological changes, linked to migration through their hosts to establish a mature infection¹. Their complex life cycles may involve both intermediate hosts, vectors or time spent in the environment, where they face harsh and variable conditions such as frost or drought that they must withstand between infection of their hosts². Parasitic nematodes have adapted to a wide range of threats, including predation, climate and the immune responses of a great diversity of both plant and animal host species³⁴.

The evolutionary success of parasitic nematodes comes at a cost to humans, either directly as they significantly impact human health (amounting to a loss of 10 million disability-adjusted life-years)³, or indirectly via major economic losses in plant⁴ and livestock production⁵, and parasite control. The control of animal parasitic nematodes relies almost exclusively on anthelmintic drugs, administered on a recurrent basis in livestock⁶ and through mass-drug administration program in humans⁷. Although the success of such strategies was originally undeniable, the emergence of drug resistant veterinary parasites ⁵, or the reported lack of efficacy in human-infective species⁸, threatens ongoing control efforts for many parasitic infections. Vaccines offer an attractive alternate control strategy against these parasites: the extensive genetic diversity and immune-regulatory properties of parasites has, however, greatly hampered vaccine development⁹, and although two licensed vaccines are currently available for veterinary purposes¹⁰, transcriptomic plasticity of the parasite following vaccine challenge may contribute to circumvent the vaccinal response of their host¹⁰. It is therefore clear that novel, sustainable control strategies are required. The potential of helminths to adapt to – and thus escape – control measures lies in their underlying genetic diversity. A greater understanding of the extent of this diversity and the processes that shape it throughout their range should provide insight into the mechanisms by which they adapt, and may identify new targets which may be exploited for control.

The trichostrongylid Haemonchus contortus is a gastrointestinal parasite of ruminants in tropical and temperate regions throughout the world, and causes significant economic and animal health burden particularly on sheep husbandry. It is also emerging as a model parasitic nematode system for functional and comparative genomics, largely due to its rapid ability to acquire drug resistance, the relative tractability of its life-cycle under laboratory conditions¹¹, the development of extensive genomic resources^12,13, and its relatively close relationship with other clade V parasitic nematodes of both veterinary and medical importance (i.e, other gastro-intestinal nematodes of livestock and human hookworms)¹⁴. We have used whole genome sequencing of 223 individual H. contortus sampled from 19 isolates spanning five continents to characterise genome- and population-wide genetic diversity throughout its range. This survey of genome-wide diversity has revealed old and new genetic connectivity influenced by human history, and signatures of selection in response to anthelmintic exposure and local climatic variation.

Results

Haemonchus contortus isolates are genetically diverse, with large effective population sizes

Whole genome sequencing of 223 individuals from 19 isolates (Fig. 1a; Supplementary Table 1) revealed 23,868,644 SNPs with a genome-wide distribution of 1 SNP per 9.94 bp on average.

Figure 1a.

Global distribution of Haemonchus contortus isolates. Isolates sampled are coloured by geographical region (sand: South-America, brown: Western-Africa, dark green: Mediterranean area, black: Subtropical Africa, red: Australia). Shape indicates the anthelmintic resistance status of each isolate to fenbendazole (resistant =squares; susceptible =circles) or resistant to both ivermectin and fenbendazole (triangles).

Only a proportion of the filtered SNPs were called in more than half the individuals (n = 3,338,155 SNPs), with only 411,574 SNPs segregating with a MAF > 5% in those individuals. Estimates of nucleotide diversity (π) in isolates with at least five individuals ranged from 0.44% (STA.2) to 1.3% (NAM; Supplementary Table 2). Variance in n across autosomes among isolates was partly explained by isolate mean coverage (F_(1,84) = 9.49, P = 0.003; Supplementary note). To account for this bias, estimates were obtained from three subsets of isolates with the highest coverage (greater than 8× on average) from France (FRA.1 and FRA.2), Guadeloupe (GUA) and Namibia (NAM) that yielded slightly higher values that ranged between 0.65 and 1.14%. Overall, these data show equivalent diversity levels to Drosophila melanogaster (ranging between 0.53% and 1.71%)¹⁶ but represented approximately 1.6 to 45-fold greater diversity than two filarial nematode species for which similar statistics are available (0.02% for Wuchereria bancrofti larvae¹⁷; 0.01% and 0.4% for π_S and π_N in Onchocerca volvulus¹⁸ Supplementary Fig. 1).

To begin to explore the global diversity of H. contortus, we performed a principal component analysis (PCA) of genetic variation, which revealed three broad genetic clusters of isolates (Fig. 1b) that largely coincided with the geographic region from which they were sampled, including: (i) Sub-tropical African isolates (NAM, STA and ZAI), (ii) Atlantic isolates including Morocco (MOR), São Tomé (STO), Benin (BEN), Brazil (BRA) and Guadeloupe (GUA), and (iii) the remainder from the Mediterranean area (FRA, ACO) and Oceania (AUS.1, AUS.2 and IND). The greatest diversity among samples was identified in the African and South American isolates, with East African samples spread along PC1 and West African and South American samples along PC2.

Figure 1b.

Principal component analysis based on genotype likelihood inferred from whole genome sequences of 223 individual males (243,012 variants considered). Samples are coloured by geographic region described in panel 1a.

Using estimates of nucleotide diversity estimates, together with the the C. elegans²⁰ mutation rate and assuming a balanced sex ratio²¹, we inferred the current effective population size (N_e) of H. contortus to be between 0.60 and 1.05 million. MSMC analysis, which models past recombination events based on heterozygosity patterns along the genome, revealed the historical N_e has remained within a slightly lower range of values for most of the sampled time interval, i.e. from 2.5 to 500 thousand years ago (kya), with extreme estimates falling between 1.5 × 10⁵ and 6.1 × 10⁵ individuals for GUA (633 years ago) and NAM (415 years ago) isolates, respectively (Fig. 1c). N_e estimates remained relatively constant in most populations until 2.5 Kya; since this time, GUA and FRA.1 isolates suffered a more drastic reduction in N_e, which ranged for each isolate between 0.859 × 10⁵ and 1.2 × 10⁵ individuals approximately 633 and 452 years ago, respectively (Fig. 1c). (Fig. 1c).

Figure 1c.

MSMC effective population size across time for 6 isolates. Coloured shaded area represents range of values estimated from a cross-validation procedure with five replicates, computed by omitting one chromosome out at a time.

Global population connectivity of Haemonchus contortus is characterised by old and new migration

To explore the global connectivity between isolates, we used a number of complementary approaches. Phylogenetic relationships determined by nuclear (Supplementary Fig. 2) and mitochondrial (Fig. 2a; PCA of mtDNA genetic diversity is presented in Supplementary Fig. 3) diversity broadly supported the initial PCA analysis (Fig. 1b), each revealing three main groups of samples partitioned by broad geographic regions, i.e. Africa, Oceania and Mediterranean groups. However, the mitochondrial data revealed further subdivision of the Oceanian and Mediterranean clades than nuclear data alone.

Figure 2a.

Unrooted maximum likelihood phylogenetic tree constructed constructed using 3,052 SNPs from 223 individual mitochondrial genomes. Circles indicate bootstrap support for each branch, blue if support was higher than 70%, red elsewhere. Branches leading to a sample identifier are coloured by geographic region described in Figure 1a. Mitogroups are annotated with constitutive sample populations.

The presence of close genetic relationships between geographically distant isolates, resulting in a weak phylogeographic signal (Mantel’s test r = 0.10, P = 0.001), was inconsistent with a simple isolation-by-distance scenario. This observation was supported by, for example, little genetic differentiation (measured by F_ST) between geographically distant French (FRA) and Oceanian (AUS.1, AUS.2, IND) isolates (Fig. 2b).

Figure 2b.

Matrix showing pairwise dissimilarity between individuals (upper right) and pairwise F_ST between isolates (lower left).

The greatest genetic dissimilarity was found between subtropical African isolates and the French and Australian isolates, with mean F_ST across comparisons of 0.21 (F_ST range according to isolate pairs: 0.06 – 0.42) and 0.24 (F_ST range: 0.17 – 0.33) respectively (Fig. 2b, Supplementary Fig. 4). The divergence of African isolates was likely due to the higher within-isolate diversity (+14% higher mitochondrial nucleotide diversity) relative to others (P = 0.004; mitochondrial nucleotide diversity of 0.63% ± 37%, 0.58% ± 0.31%, 0.66 ± 0.32%, 0.6% ± 0.35%, for Mediterranean, Oceanian, American and south-African isolates respectively).

The higher genetic differentiation (8.9% difference, F_(1,134) = 11.36, P = 0.0009) between STO and other isolates relative to other pairwise comparisons, almost certainly reflects the isolation of an island population relative to continental populations (Fig. 2b). STO samples also displayed higher genetic dissimilarity (5.7% difference, F_(1,7567) = 581, P < 10⁻⁴) to other other samples. Inference of the joint history of STO and other African populations supported this view. The best-fitting model (supplementary Table 3) was consistent with either ancient symmetrical gene flow followed by isolation or an early split followed by secondary contact before isolation with MOR population (supplementary Table 3). This latter demography would underpin the pattern of admixture detected between STO and MOR (Fig. 2c).

Figure 2c.

Admixture analysis of 223 individuals. A cluster size of K = 3 is presented, determined from sites with minor allele frequency above 5% and call rate higher than 50%. Admixture pattern for other values of K is provided in supplementary Figure 5. Isolates are presented sorted along their longitudinal range, and samples sorted by assignment to the 3 clusters.

A closer inspection of GUA samples revealed a mixed ancestry, likely derived from West African and Mediterraneanheritage. A subset of GUA samples showed limited genetic differentiation to FRA isolates (F_ST range = 0.07 – 0.10; Fig. 2b), whereas the remaining GUA samples were genetically similar to isolates from the West African coast, i.e. ACO, BEN, MOR, STO (Fig. 2b & c), as indicated by lower F_ST estimates with these isolates (F_ST range = 0.07 – 0.13; Wilcoxon’s test, P = 0.07; Fig. 2b) and evidence of shared ancestry from the admixture analysis (Fig. 2c, Supplementary Fig. 5 and 6). A particularly close relationship was identified between STO and GUA (F_ST = 0.10; Fig. 2b). This conflicting genetic origin among GUA sub-isolates is responsible for the higher nucleotide diversity observed in GUA as a whole (Supplementary Table 2).

The patterns of genetic connectivity support at least three distinct migration events in time and space that we investigated using forward genetic simulations (supplementary Table 3). First, we detect an out-of-Africa scenario, whereby the greatest diversity was sampled within Africa, and that isolates outside of Africa represent a subset of this diversity. Consistent with this hypothesis, nuclear genome variation of non-African isolates experienced a genetic bottleneck that occurred between 2.5 and 10 kya that is not present in the African isolates sampled (Fig. 1c). Bayesian coalescent estimate of this initial divergence from mitochondrial genome data yielded an overlapping time range of between 3.6 kya and 4.1 kya (Supplementary Fig. 7). Genetic simulations of the joint demography between FRA.1 and African populations also favoured complex scenarios involving early split (maximum likelihood estimates ranging between 13 and 25 Kya) followed by ongoing gene flow with STA populations or more recent isolation with Namibia (supplementary Table 3). Secondly, the genetic connectivity of West African and American isolates is consistent with parasites spreading during the trans-Atlantic slave trade movement. The scenario linking GUA and STO was compatible with initial population division occurring around 1640 Common Era (CE) ± 167 years (supplementary table 3), consistent with migration associated with colonization and slave trade that occurred in the West Indies under French influence during that time²². The third pattern of connectivity likely reflects British colonisation of Australia in late 1700’s: the interweaving of Australian and South-African worms into the Mediterranean phylogenetic haplogroup (Fig. 2b) may mirror the foundation of Australian Merino sheep, which were first introduced into Australia from South Africa, before additional contributions from Europe and America were made^23,24. The admixture pattern observed for worms from the two countries matched their shared ancestry (Fig. 2c), as well as a genetic connection between America and Australia (seen for the AUS.1 isolate; Fig. 2c). Although maximum likelihood estimates supported these scenarios with the isolation of European and South-African isolates occurring between 1794 and 1871 CE (supplementary Table 3), broad uncertainty limited our ability to specifically define the timings of these events (Supplementary Table 3). However, the split between Australian isolates occurred in 1895 CE ± 132 years, consistent with the initial foundation of the sheep industry in this country.These complex patterns reiterate that human movement has played an important role in shaping the diversity of this livestock parasite throughout the world.

Anthelmintic resistance has left distinct patterns of diversity in the Haemonchus contortus genome

The extensive use of anthelmintics has been and remains the primary means of H. contortus and other gastrointestinal control worldwide. This strategy has resulted in the independent emergence of drug-resistant isolates throughout the world, which now limits farming in some areas. Strong selection should impact the distribution of genetic variation within isolates; signatures of selection may reveal genes associated with drug resistance, knowledge of which may contribute to monitoring the emergence and spread of drug resistant isolates, and the design of new control strategies.

The genetic determinants of benzimidazole resistance is perhaps the best characterised of all anthelmintics, with either one of three amino acid residue changes – F167Y, E198A, and F200Y – in the beta tubulin isotype 1 (Hco-tbb-iso-1) protein capable of mediating phenotypic resistance. We identified indications of a selective sweep in the region surrounding the Hco-tbb-iso-1 locus in the resistant isolates analysed. Focusing on three isolates with the highest coverage, we found an average 2.31-fold reduction of Tajima’s D coefficient within 1 Mbp (Fig. 3a, Supplementary Fig. 8) and an average 33% reduction in nucleotide diversity (Supplementary Fig. 9) within this region relative to the rest of chromosome I.

Figure 3a.

Analysis of Tajima’s D surrounding the Hco-tbb-iso-1 locus on chromosome I. A total of 10-Mbp surrounding was considered (pink), of which 1 Mbp nearest to Hco-tbb-iso-1 is highlighted (blue). Isolates compared included benzimidazole-resistant French (FRA.1), Guadeloupian (GUA), Namibian (NAM) or benzimidazole susceptible Australian (AUS.2) and Zaire (ZAI) isolates. Mean expected Tajima’s D (solid grey line) and 99% confidence interval (dotted line) were estimated from a 1,000 simulated 10-Kbp wide sequences following MSMC-inferred demography.

This signature was most evident in the GUA and NAM isolates, but was weaker in the French isolate (FRA.1) due to both phenotypically susceptible and resistant individuals being present (Supplementary Table 4). Phased genotypic information over the whole Hco-tbb-iso-1 (Supplementary Fig. 10) revealed that French and Guadeloupian individuals had little divergence in their haplotypes, suggesting that gene flow of resistant haplotypes between mainland France and the West Indies had occurred (Supplementary Fig. 10). A topology analysis of a 100 Kbp region spanning the Hco-tbb-iso-1 locus supported the shared phylogenetic origin between FRA.1 and FRG (topology 3), whereas the surrounding region was in favour of topologies congruent with overall population structure (Fig. 3b). This finding is consistent when either Moroccan (Fig. 3b) or São Tomé populations are used as a population with close ancestry with FRG (Supplementary Fig. 11).

Figure 3b.

Topology weighing analysis of a 100-Kbp window centred on Hco-tbb-iso-1. At each position, the weight of each of the three possible topologies inferred from 50 Kbp-windows are overlaid. Topology 2 (blue) corresponds to an isolation-by-distance history, while topology 3 (brown) would agree with shared introgressed material between worm populations from French mainland into Guadeloupe. The Hco-tbb-iso-1 locus is indicated by vertical dashed lines.

We analysed the frequency of the three well-characterised resistance-associated mutations affecting codon positions 167 (T to A), 198 (A to T) and 200 (T to A) (Supplementary Table 4; Supplementary Table 5; Supplementary Fig. 12 & 13). The F200Y homozygous genotype was the most common and widespread resistant genotype (n = 39), and accounted for all samples from the Guadeloupe (n = 14) and the White-River South-African (STA.3; n = 6) isolates. Variants at codons 167 and 198 were much less common, i.e. 13 mutant allelic carriers were observed at position 198 and only one F167Y mutant was identified in a French isolate. No double homozygous mutants were found, but three individuals from France, Australia and South-Africa were heterozygous at both positions 198 and 200. In each case, inspection of the sequencing reads revealed that the two mutations never appeared together in the same sequencing read, suggesting they cannot co-occur in cis (on the same chromosome copy). Genotype frequencies were consistent with isolate-level benzimidazole efficacies as measured by the percentage of egg excretion after treatment (Supplementary Table 5). Samples analysed from suspected susceptible isolates always presented with the susceptible genotype for each of the three positions considered (n = 20).

The genetic determinants of ivermectin resistance are largely unknown, but many genes have been proposed to be associated with resistance; although one interpretation of this observation is that ivermectin resistance is a multigenic trait, a major locus associated with resistance in Australian and South African H. contortus has recently been mapped to a region approximately 37-40 Mbp along chromosome V ¹². To examine for the presence of ivermectin-mediated selection in our data, a pairwise differentiation scan was performed between known ivermectin-resistant isolates and other isolates sharing same genetic ancestry, i.e. STA.1 and STA.3 versus NAM and ZAI in Africa, AUS.1 against AUS.2 in Australia (Fig. 3c).

Figure 3c.

Genome-wide differentiation scan between pairs of ivermectin-resistant (AUS.1, STA.1, STA.3) and populations with no evidence of ivermectin resistance. Chromosomes are coloured and ordered by name, i.e. from I (pink) to V (forest green). Horizontal line represents the 0.5% F_ST quantile cut-off. Vertical line on chromosome I points at Hco-tbb-iso-1 locus.

The previously described region on chromosome V¹² appeared differentiated, particularly between pairwise comparisons involving the South-African resistant isolate (Fig. 3c). A second major differentiation hotspot spanned a 3 Mbp region of chromosome I and encompassed the Hco-tbb-iso-1 locus (Fig. 3c). Although non-synonymous mutations at codon positions 167, 198 and 200 of the β-tubulin isotype 1 are associated with resistance to benzimidazoles, it has been proposed that there may also be an association between this gene and ivermectin resistance^25,26. Although our data seem to support this association, an attempt to narrow-down the differentiation signal in this region found significant differentiation in only two 10-kbp windows in two comparisons (NAM vs STA.1 and NAM vs. STA.3, and these two windows did not overlap the Hco-tbb-iso-1 locus. Furthermore, the fact that all of our ivermectin-resistant isolates were also benzimidazole-resistant suggests that this signal is confounded; pairwise F_ST analyses between ivermectin-resistant and – susceptible isolates from the field will always be biased toward strong differentiation around the Hco-tbb-iso-1 locus due to loss of diversity in benzimidazole-resistant isolates. As such, it was not possible to confirm the putative association between the Hco-tbb-iso-1 locus and ivermectin resistance. We note that the lack of QTL evidence in this region from controlled genetic crosses using ivermectin selection¹² supports the conclusion that standing genetic variation at the Hco-tbb-iso-1locus is unlikely to be directly influenced by ivermectin.

To further investigate more subtle signatures of selection, we computed the XP-CLR coefficient, which simultaneously exploits within-isolate departure from neutrality and between-isolate allele-frequency differences ²⁷. This analysis relies on called genotypes rather than genotype likelihoods, but is robust to SNP uncertainty²⁷. Within continent pairwise comparisons of African (NAM, MOR, STA.1, STA.3, ZAI) and Australian isolates (AUS.1, AUS.2) yielded 1,740 hotspots of diversification, 48% of which were contained within a gene locus (Supplementary Fig. 14, Supplementary Table 6). Among these hotspots, two known candidate genes associated with ivermectin resistance were identified, namely an ivermectin sensitive glutamate-gated chloride channel²⁸ (HCOI00617300, glc-4 ortholog) on chromosome and a P-glycoprotein coding gene already involved in ivermectin susceptibility in the equine ascarid Parascaris sp ²⁹, (HCOI00233200, pgp-11 ortholog) on chromosome V. While the former showed indication of reduced genetic diversity in its vicinity in resistant isolates relative to others (0.48% ± 0.09% difference in nucleotide diversity between the two groups, t = 5.25, P < 10⁻⁴; Fig. 3d), the genetic diversity pattern in the 100 Kbp window surrounding the latter was similar across isolates (t = 0.004, P = 0.99; Fig. 3d), suggesting its role may not be specifically related to ivermectin resistance.

Figure 3d.

Nucleotide diversity over 100 Kbp windows surrounding positional candidate genes for ivermectin resistance. Candidate gene position is highlighted by vertical dashed lines. Ivermectin-resistant isolates (IVM-R) appear as triangles and squares indicate susceptible isolates (IVM-S).

GO term enrichment analysis of all XP-CLR significant regions identified 26 significant terms (Supplementary Table 7). The top ten most significant GO terms encompassed enzymatic-related activity, neurotransmitter transporter activity (GO:0005326, P = 5.8 × 10⁻⁴) and response to drug (GO:0042493, P = 4.3 − 10⁻³). Additional significant biological process associated terms were related to phenotypes tightly linked to anthelmintic effects. For example, pharyngeal pumping³⁰ (GO:0043051, P = 5.8 × 10⁻³) and oviposition (GO:0046662, P = 8.6 × 10⁻³) genes were enriched, both of which are phenotypes linked to ivermectin and its effect on parasite fecundity³¹. Neurotransmission is the primary target of anthelmintics such as macrocyclic lactones³² and levamisole³³; genes with GO terms related to neurotransmission (GO:0001505, P = 9.1 × 10⁻³) were significantly enriched across comparisons. Anthelmintic-associated GO terms including “response to drug”, “regulation of neurotransmitter” and “neurotransmitter transporter activity” were significantly enriched in every comparison involving a South-African multi-resistant field isolates (STA.1, Supplementary Table 8). Four candidate genes (HCOI00389600, HCOI00032800, HCOI00243900, HCOI00489500) were found overlapping XP-CLR signals of selection and thus seem candidates for drug resistance loci in H. contortus, as supported by the functions of their C. elegans orthologs (snf-9, unc-24, B0361.4, aex-3 respectively). The snf-9 gene encodes a neurotransmitter:sodium symporter belonging to a family of proteins involved in neurotransmitter reuptake, and the latter three genes are expressed in neurons. Unc-24 and B0361.4 are involved in response to lipophilic compounds and aex-3 is critical in synaptic vesicle release³⁴. A reduction in nucleotide diversity surrounding unc-24, B0361.4, aex-3 in the STA.1 isolate in comparison to others (Fig. 3d) may reflect evidence of drug selection.

Climatic adaptation has shaped genomic variation between isolates

In addition to putative anthelmintic-related GO terms, response to stress (GO:0006950, P = 5.9 × 10⁻³) was among the top ten significant biological process ontologies associated with genes under diversifying selection. Although anthelmintic exposure would be associated with significant stress on susceptible (and perhaps resistant or tolerant) parasites, all free-living stages of H. contortus will be exposed to and must tolerate abiotic factors such as temperature or humidity prior to infection of a new host. To evaluate the impact of such climatic stressors, a genome scan for genetic differentiation between isolates categorised by by the climactic conditions prevailing at their sampling locations, i.e. arid (Namibia), temperate (France mainland) and tropical (Guadeloupe), was performed (Fig. 4a).

Figure 4a. Genomic signal of climate adaptation

Top three panels represent F_ST values plotted against genomic position (Mbp), colored and ordered by chromosome name (from I to V). Horizontal dashed line represents the 0.5% quantile. Bottom panel shows genomic positions of significant associations between annual precipitation (circle) and temperature annual range (triangle).

A major signal of differentiation formed of two windows (6.925 – 6.995 Mbp and 7.165 – 7.205 Mbp on chromosome I) was shared by all pairwise comparisons. Six genes were found within these two regions (Supplementary Table 9), among which orthologs of C. elegans genes hprt-1, cpb-1, B0205.4 were identified. A highly differentiated 260 Kbp region of chromosome III also repeatedly occurred across comparisons. A window between 31.155 Mbp and 31.185 Mbp was common to comparisons involving isolates from arid areas (NAM vs. FRA.1 and NAM vs. GUA), and a second window (31.345 Mbp to 31.415 Mbp) was common to comparisons between tropical isolates and others (GUA vs. FRA.1 and GUA vs. NAM). While the two annotated genes in the latter region were not associated with any biological description, the first window overlapped a chromo-domain containing protein (HCOI01540500; Supplementary Table 89). This gene is an ortholog of Pc (FBgn0003042) in D. melanogaster, a chromo-domain subunit of the Polycomb PRC1 -complex that specifically recognizes trimethylated lysine7 of histone 3 (H3K27me3)³⁵. Nucleotide diversity over these genomic windows revealed reduced genetic diversity in FRA.1 relative to isolates from hotter climates (Fig. 4b). The contrasted diversity downstream from HCOI01540500 (31.225 Kbp to 31.350 Kbp, Fig. 4b) found between GUA and NAM certainly contributed to higher F_ST in these windows. Additional differentiation analyses performed at the gene level with base-pair resolution highlighted a few discrete locations of elevated F_ST common to every comparison and overlapping intron 4 and exon 5 of this gene (Fig. 4b). Translated consensus exon 5 sequences revealed the highest divergence in the GUA isolate (92.1% identity with reference sequence; supplementary Fig. 15), characterized by multiple amino acid changes whose putative functional consequences remain unknown.

Figure 4b. A Polycomb group protein coding gene underpins major differentiation signal between populations from arid, temperate and tropical areas.

Top panel shows nucleotide diversity estimates for 10 Kbp genomic windows spanning windows of chromosome III with high genetic differentiation for climate. Dashed lines indicate the Pc ortholog boundaries. Within these boundaries, F_ST estimates along every base-pair of the gene sequence are represented below (circle size proportional to F_ST coefficient), with predicted exon model shown as grey rectangles. The bottom panel shows translated consensus sequences of exon 5 for populations of interest with asterisks marking mutations.

To further explore the impact of climatic conditions of genetic diversity, we used a random forest based statistical approach to quantify the relationship between genetic information along environmental gradients, and relative impact of specific bioclimatic variables on genetic diversity. Bioclimatic variables were derived from monthly temperature and precipitation records, with the aim to represent annual trends or seasonality over the 1970 to 2000 time period (Supplementary Fig. 16; Supplementary Table 10). This analysis randomly samples subsets of sites (encoded as SNP frequencies) that can be partitioned into groups based on differences in climactic variables to estimate the predictive ability of climatic variables. Annual precipitation (BIO12), and temperature annual range (BIO7) were revealed to be the most important bioclimatic variables impacting genetic variation (Fig. 4c).

Figure 4c.

Proportion of genetic variance explained by precipitation- (blue) and temperature-associated (orange) climatic variables. Variants from eight isolates with no record of ivermectin resistance (AUS.2, FRA.1, GUA, IND, MOR, NAM, STO, ZAI) were analysed against a set of nine variables (from a total of 19), selected to minimize correlation between variables. Annual precipitation (BIO12) and temperature annual range (BIO7) are main contributors of genetic variance.

To identify genes that might be impacted by these variables, we performed a genome-wide test for association between SNP variants, and each of these two environmental variables, accounting for genetic structure between isolates. In total, 17 and 25 significant associations (5% FDR; 49,370 SNPs tested) were found with BIO7 and BIO12 respectively (Fig. 4a bottom panel; Supplementary Table 11). Consistent with the initial differentiation scan, chromosomes I and III harboured most of the associations (n = 13 and n = 11, respectively). On chromosome I, eight (of the 13) associations with BIO12 fell within a 6 Kbp window (7,177,538 bp and 7,183,617 bp), overlapping the region universally differentiated between climatic areas. Chromosome III contained 11 significant associations for both BIO7 (n = 6) and BIO12 (n = 5). The two most significant associations were also found on this chromosome for BIO7 (positions 26,824,240 and 26,824,197 bp; P = 6.34 × 10⁻⁹ and P = 1.39 × 10⁻⁷, respectively). These two SNPs fell within the HCOI00198200 locus, which codes for a metallopeptidase M1, an ortholog of the C. elegans gene anp-1. Additional BIO7-associated SNPs were found within (16,544,889 bp) and in the vicinity (15,905,141 & 15,905,191 bp) of a solute carrier coding gene (HCOI00312500) on chromosome IV, whose orthologs in C. elegans are under the control of daf-12, a key player in dauer formation. Of note, two H. contortus orthologs of components of the dauer pathway in C. elegans, namely tax-4 (HCOI00661100) and daf-36 (HCOI00015800), were among genes under diversifying selection (Supplementary Fig. 14), but neither of these genes contained SNPs identified as being associated with the two most significant bioclimatic variables.

Discussion

The ecology and epidemiology of gastrointestinal nematodes have been well characterized and exploited to build mathematical models to guide treatment decision in the field³⁶. On the contrary, knowledge about their genetic diversity remains limited; a better understanding of population structure and selective pressure applied by environmental factors would yield better predictive power of their range dispersion³⁷. By exploiting a broad collection of samples from globally-distributed isolates, together with chromosome-scale assembly and extensive individual resequencing, we have performed an in-depth characterization of H. contortus genetic diversity, explored historical contributions to its current population structure and identified important drivers shaping the genome of this parasite.

H. contortus populations displayed high levels of nucleotide diversity, consistent with early estimates based on mitochondrial data^39,40, and recent re-sequencing experiments of inbred isolates¹². These extreme levels of genetic diversity are thought to arise from both a large census population⁴² and the high fecundity of H. contortus females²¹.

An early attempt based on a set of genome-wide AFLP markers obtained for 150 individual worms from 14 countries supported the first exploration of H. contortus population structure at the continent level ⁴⁰. Analyses of these data identified three to four (Africa, South-East Asia, America and Europe) main phylogenetic clusters as well as evidence for the strong genetic connectivity between Australian, South-African and European isolates⁴⁰. Our genome wide data corroborated these early results. However, the use of a chromosome-scale assembly and individual resequencing contributed to identify genome-wide patterns of genetic diversity in its chromosomal context, and in turn, provided sufficient resolution to identify genes likely associated with selective advantage against drug or climate selection pressures. This had not been possible in previous attempts with AFLP markers ⁴⁰.

Major past human migrations and associated sheep movements have contributed to the mixing of parasite populations, which partly accounts for the limited genetic structure and extensive admixture between some of our globally distributed isolates. Our data supported an out of Africa expansion derived from ancestral populations from Western Africa, with a bottleneck dated back between 2.5 to 10 kya. Sheep domestication originally took place in the Middle East around 10 kya⁴³, before introduction into Eastern Africa and subsequent spread, likely through cultural diffusion, toward southern Africa approximately 2 to 2.5 kya ⁴⁴. The timing of the bottleneck identified in our data and occurring between 2.5 and 10 kya is compatible with major migrations of pastoralist populations that ultimately resulted in the import of small ruminants into Southern Africa ⁴⁴. The simultaneous increase in rainfall in central Africa around 10.5 kya⁴⁵ would have supported the population expansion and dispersal of H. contortus. In addition, early radiation towards Asia observed in our data was congruent with evidence obtained from the study of retroviral insertions within the sheep genome that suggested direct migration between Africa and southwest Asia⁴⁶, and with the timing of Asian sheep expansion between 1.8 and 14 kya⁴⁷.

The genetic congruence of parasites in Guadeloupe with those from both West Africa and the Mediterranean region is consistent with co-transportation of livestock, including sheep, during the discovery and subsequent colonization of America.. Woolly Churra sheep were originally brought to the Caribbean by Spanish conquistadors ^49,50, before West African breeds more suited to tropical conditions were transported, resulting in an admixed Carribean sheep population ^49,50. The timing of genetic admixture here overlaps with centuries of human and presumably livestock movement during the transport of slaves, most of whom originated in West Africa and were transported to colonies that included the French West Indies²². The Mediterranean ancestry of H. contortus isolates in Guadeloupe and the close relationship with French worm populations suggest additional sheep transport occurred between French mainland and its Guadeloupian overseas territory. Although these movements are difficult to track precisely, live sheep were shipped aboard slave trade vessels departing from French harbours⁵¹, and may have introduced European H. contortus to the island. Based on historical data^49,50, a Spanish lineage of H. contortus on Guadeloupe would also be expected to exist, though data are missing to confirm this. It can be speculated that the shared ancestry between Guadeloupian and Moroccan worm isolates might result from the introduction of Spanish Churra sheep, as this breed emerged in Spain while the region was under Arab influence between the 8^th and 13^th century^49,52. Introduction of contortus-infected sheep from the Maghreb may have been associated with the spreading of African worms in Spain, and ultimately, Guadeloupe.

The widespread use of anthelmintic drugs has been a major selective force shaping standing genetic variation in the isolates analysed. Adaptation to drug exposure was clearly illustrated in the genetic diversity surrounding the β-tubulin coding gene, a target of benzimidazole drugs; strong loss of diversity was observed at this locus in many distinct isolates phenotypically characterised to be resistant to this class of drug. The co-occurrence of the same resistant haplotypes in geographically disconnected isolates is almost certainly due to the independent evolution of benzimidazole resistance as has been described previously⁵³; however, some evidence of shared ancestry between French and Guadeloupe resistant individuals emphasises the risk of spreading resistance without careful monitoring of parasite populations during livestock trade. Furthermore, this highlights the risk of evolution of resistance in other parasitic nematodes, for example, parasites of medical interest that are treated with benzimidazoles in mass-drug administration programs⁵⁴.

Ivermectin is an important anthelmintic for parasite control in both veterinary and medical settings⁵⁵. Worldwide emergence of ivermectin-resistant veterinary parasitic nematodes⁵ and evidence of reduced efficacy in human filarial nematodes⁵⁶ underline the urgency and importance of a better understanding of the mechanisms involved⁵⁵. Our data identified genes previously associated with ivermectin resistance in parasitic nematodes of either veterinary (pgp-11 in Parascaris sp.²⁹) or medical (aex-3 in Onchocerca volvulus⁵⁷) interest and uncovered new candidates. Additional validation study will determine whether these could serve as that may be useful markers in the field to monitor drug efficacy.. Of note, support for a major QTL for ivermectin resistance on chromosome V recently identified in an introgression experiment in two resistant isolates³⁸ was also present in our genome-wide data and warrants further investigation.

Our broad sampling throughout the global range of H. contortus has enabled the first analyses into climate-driven adaptation in a parasitic nematode. Understanding how climate shapes parasite genetic variation is of primary importance to foresee consequences of climate change on parasite phenology and range dispersion. Parasite dispersal is largely driven by their hosts but H. contortus free-living stages experience climatic conditions that affect their development² and constrain their spatio-temporal dispersal⁵⁸. Observations in Northern Europe suggest that climate change has already altered H. contortus winter phenology ⁵⁹. Our results suggest that adaptation toward annual precipitation was mostly under the control of variation on chromosome I, however, no obvious candidate genes could be identified. In addition, the Hco-tbb-iso-1 locus was close to the region of interest, and linked variation in allelic frequency at this locus as a result of benzimidazole selection cannot be ruled out. A second region of chromosome III was associated with both temperature- and precipitation-related variables, within which biologically relevant genes could be identified; first, the strongest genetic associations with annual temperature range were found in a metallopeptidase with zinc ion binding function (anp-1 ortholog), an enzyme linked to drought stress tolerance in Drosophila ⁶⁰, and second, an ortholog of Pc displayed strong genetic differentiation between arid- and wetter temperate or tropical environmental conditions. This gene has been linked to putative epigenetic regulation of xeric adaptation in Drosophila melanogaster, where Pc mutants display lower resistance to desiccation stress⁶¹. This finding is also corroborated by observations in plants showing Polycomb-mediated regulation of climate-induced phenotypes⁶². Further links to climatic adaptation include identification members of the dauer pathway⁶³; tax-4 and daf-36 orthologs were under diversifying selection and a daf-12-respondent solute carrier was associated with annual temperature range. Dauer is a developmentally arrested stage in C. elegans that is triggered by environmental stress and mediates tolerance to unfavourable conditions until better conditions are met⁶³, and can occur in parasitic nematodes like H. contortus under semi-arid conditions⁶⁴. Evidence for climatic adaptation suggests that adaptation in the face of climate change will be constrained by available genetic variability at temperature-selected loci, that may both limit or enable range expansion depending on the region. By a better understanding of the interaction between climatic conditions and phenotypes such as hypobiosis (a temporary developmental arrest during unfavorable conditions), optimisation of treatment timing may be possible to maximise control efficacy.

In summary, our data describes the extensive global and genome-wide diversity of the blood-feeding parasitic nematode H. contortus, and how this diversity has been shaped by adaptation to its environment and to drug exposure. Understanding the mechanism(s) by which parasites adapt to fluctuating environmental conditions both within and outside their hosts will have important implications for field management of parasitic nematodes in both veterinary and medical settings. Further characterisation of these putative strategies, together with genetic covariation of drug resistance genes, should contribute to refining epidemiological models and guide treatment decision-trees for a more sustainable management of worm populations in the face of a changing climate.

Legends to Figures

Figure 1. Global diversity of Haemonchus contortus.

(a) Global distribution of Haemonchus contortus isolates. Isolates sampled are coloured by geographical region (sand: South-America, brown: Western-Africa, dark green: Mediterranean area, black: Subtropical Africa, red: Australia). Shape indicates the anthelmintic resistance status of each isolate to fenbendazole (resistant =squares; susceptible =circles) or resistant to both ivermectin and fenbendazole (triangles).

(b) Principal component analysis based on genotype likelihood inferred from whole genome sequences of 223 individual males (243,012 variants considered). Samples are coloured by geographic region described in panel 1a.

(c) MSMC effective population size across time for 6 isolates. Coloured shaded area represents range of values estimated from a cross-validation procedure with five replicates, computed by omitting one chromosome at a time.

Figure 2. Global connectivity of Haemonchus contortus isolates.

(a) Unrooted maximum likelihood phylogenetic tree constructed using 3,052 SNPs from 223 individual mitochondrial genomes. Circles indicate bootstrap support for each branch, blue if support was higher than 70%, red elsewhere. Branches leading to a sample identifier are coloured by geographic region described in Figure 1a. Mitogroups are annotated with constitutive sample populations.

(b) Matrix showing pairwise dissimilarity between individuals (upper right) and pairwise F_ST between isolates (lower left).

(c) Admixture analysis of 223 individuals. A cluster size of K = 3 is presented, determined from sites with minor allele frequency above 5% and call rate higher than 50%. Admixture pattern for other values of K is provided in supplementary Figure 5. Isolates are presented sorted along their longitudinal range, and samples sorted by assignment to the 3 clusters.

Figure 3. Anthelmintic-mediated selection is a major driver of genetic variation

(a) Analysis of Tajima’s D surrounding the Hco-tbb-iso-1 locus on chromosome I. A total of 10-Mbp surrounding was considered (pink), of which 1 Mbp nearest to Hco-tbb-iso-1 is highlighted (blue). Isolates compared included benzimidazole-resistant French (FRA.1), Guadeloupian (GUA), Namibian (NAM) or benzimidazole susceptible Australian (AUS.2) and Zaire (ZAI) isolates. Mean expected Tajima’s D (solid grey line) and 99% confidence interval (dotted line) were estimated from a 1,000 simulated 10-Kbp wide sequences following MSMC-inferred demography.

(b) Topology weighing analysis of a 100-Kbp window centred on Hco-tbb-iso-1. At each position, the weight of each of the three possible topologies inferred from 50 Kbp-windows are overlaid. Topology 2 (blue) corresponds to an isolation-by-distance history, while topology 3 (brown) would agree with shared introgressed material between worm isolates from French mainland into Guadeloupe. The Hco-tbb-iso-1 locus is indicated by vertical dashed lines.

(c) Genome-wide differentiation scan between pairs of ivermectin-resistant (AUS.1, STA.1, STA.3) and isolates with no evidence for ivermectin resistance. Chromosomes are coloured and ordered by name, i.e. from I (pink) to V (forest green). Horizontal line represents the 0.5% F_ST quantile cut-off. Vertical line on chromosome I points at Hco-tbb-iso-1 locus.

(d) Nucleotide diversity over 100 Kbp windows surrounding positional candidate genes for ivermectin resistance. Candidate gene position is highlighted by vertical dashed lines. Ivermectin-resistant isolates (IVM-R) appear as triangles and squares indicate susceptible isolates (IVM-S).

Figure 4. Genomic signal of climate adaptation

(a) Top three panels represent F_ST values plotted against genomic position (Mbp), colored and ordered by chromosome name (from I to V). Horizontal dashed line represents the 0.5% quantile. Bottom panel shows genomic positions of significant associations between annual precipitation (circle) and temperature annual range (triangle).

(b) A Polycomb group protein coding gene underpins major differentiation signal between populations from arid, temperate and tropical areas. Top panel shows nucleotide diversity estimates for 10 Kbp genomic windows spanning windows of chromosome III with high genetic differentiation for climate. Dashed lines indicate the Pc ortholog boundaries. Within these boundaries, F_ST estimates along every base-pair of the gene sequence are represented below (circle size proportional to F_ST coefficient), with predicted exon model shown as grey rectangles. The bottom panel shows translated consensus sequences of exon 5 for populations of interest with asterisks marking mutations.

(c) Proportion of genetic variance explained by precipitation- (blue) and temperature-associated (orange) climatic variables. Variants from eight isolates with no record of ivermectin resistance (AUS.2, FRA.1, GUA, IND, MOR, NAM, STO, ZAI) were analysed against a set of nine variables (from a total of 19), selected to minimize correlation between variables. Annual precipitation (BIO12) and temperature annual range (BIO7) are main contributors of genetic variance.

Materials and methods

Sample DNA extraction and sequencing

A total of 267 individual male H. contortus were obtained from a collection held at INRA¹⁶ (metadata for all samples is presented in detail in Supplementary Table 1). The sampling regime was motivated to delineate the contribution of major evolutionary forces, i.e. migration and selection (drug and climate) but also constrained by the material available in the collection. Because migration was likely to match human history, isolates from western African countries and southern America were selected to address the contribution of slave trade history to the structuring of H. contortus populations; isolates from former colonies of the British Empire (South-Africa, Australia) were sampled to establish the connectivity between worm populations from Europe and these countries. Ivermectin-resistant isolates (AUS.1, STA.1, STA.3) were also retained to evaluate how anthelmintics had shaped H. contortus genomic variability (drug efficacy data have been provided in supplementary Table 4). Isolates were selected based on available material to ensure minimal sample size (n = 9) per isolate and proper allele frequency estimation. Following these criteria, 19 isolates from 12 countries were available (Supplementary Table 1). Note that the second Australian population (AUS.2) was obtained from an Italian laboratory (labelled ITA_NAP, supplementary Table 1). Samples were gathered between 1995 and 2011 (Supplementary Table 1), and stored in liquid nitrogen upon collection. Four isolates were fenbendazole susceptible (Fig. 1 and supplementary table 4; triangles; FRA.2, FRA.4, STO, ZAI).

DNA was extracted with the NucleoSpin Tissue XS kit (Macherey-Nagel GmbH&Co, France) following the manufacturer’s instruction. Sequencing libraries were prepared as previously described¹². DNA libraries were sequenced with 125 bp paired-end reads on an Illumina Hiseq2500 platform using V4 chemistry (Supplementary Table 1). A second round of sequencing was performed with 75 bp paired-end reads to increase the coverage of 43 samples (Supporting Fig. 15). After sequencing, two samples were identified to be heavily contaminated by kraken-0.10.6-a2d113dc8f⁶⁵ and were discarded. In total, 18 sequencing lanes consisting of 4,152,170,256 reads were sequenced, the raw data of which are archived under the ENA study accession PRJEB9837.

Sequencing data processing

Read mapping to both the mitochondrial and nuclear genomes (v3.0, available at ftp://ngs.sanger.ac.uk/production/pathogens/Haemonchus_contortus) was performed using SMALT (http://www.sanger.ac.uk/science/tools/smalt-0) with a median insert size of 500 bp, k-mer length of 13 bp, and a stringency of 90%. For samples that had two or more BAM files (when split across multiple sequencing lanes), the BAM files were merged using samtools v.0.1.19-44428⁶⁶, and duplicated reads removed using Picard v.2_14_0 (https://github.com/broadinstitute/picard) before performing realignment around indels using Genome Analysis Toolkit (GATK v3.6)⁶⁷ RealignerTargetCreator. Mean coverage of the mitochondrial and genomic genomes were estimated using GATK DepthOfCoverage, revealing coverage lower than the estimated target coverage (original 8x) for most samples. Individuals with more than 80% of their mitochondrial sequence with at least 15 reads and a mean mitochondrial genome coverage of at least 20x were retained for population genetic inferences (n = 223 individuals).

Nuclear genome SNP calling

To call SNPs, we used GATK HaplotypeCaller in GVCF mode, followed by joint genotyping across samples (GenotypeGVCFs) and extraction of variants (SelectVariants), resulting in a total of 30,040,159 unfiltered SNPs across the five autosomes. Sex determination in H. contortus is based on an XX/XO system; as only male worms were sequenced, their hemizygous X chromosome would have revealed limited phylogenetic information relative to the autosomes, and was henceforth excluded from further analysis.

Low coverage sequencing will inadvertently bias allele sampling at heterozygous SNPs, resulting in excess homozygous genotypes particularly if stringent filtering is applied during SNP calling. To circumvent this issue, we applied the GATK Variant Quality Score Recalibration (https://gatkforums.broadinstitute.org/gatk/discussion/39/variant-quality-score-recalibration-vqsr), which first uses a reference (truth) SNP set to estimate the covariance between called SNP quality score annotations and SNP probabilities, followed by application of these probabilities to the raw SNPs of interest.

The reference “truth” SNP database was generated from the intersection of variants called from samples with at least a mean of 10x coverage (n = 13) using three independent SNP callers: (i) samtools mpileup (-q20 −Q20 −C50 −uD), (ii) Freebayes⁶⁸ v.9.9.2-13-gad718f9-dirty (--min-mapping-quality 20 --min-alternate-count 5 --no-indels --min-alternate-qsum 40 --pvar 0.0001 --use-mapping-quality --posterior-integration-limits 1,3 --genotype-variant-threshold 4 --use-mapping-quality --site-selection-max-iterations 3 --genotyping-max-iterations 25 --max-complex-gap 3), and (iii) GATK HaplotypeCaller followed by hard filtering (--QD<2, --DP>10000, --FS>60, --MQ<40, --MQRS <-12.5, --RPRS<-8). The three variant call sets were merged (GATK CombineVariants with --genotypeMergeOptions UNIQUIFY), resulting in an intersecting set of 794,606 SNPs (extracted with GATK SelectVariants). The GATK VariantRecalibrator model was trained with this reference SNP database with a 90% prior likelihood, before being subsequently applied to the raw set of SNPs (n = 30,040,159). The estimation was run for several truth sensitivity threshold values ranging from 90 to 99.9%. After visual inspection of the additional number of SNPs by using sensitivity tranche curves (Supplementary Fig. 16a), a 97% sensitivity threshold was applied to the raw SNP set with GATK ApplyRecalibration, resulting in a total set of 23,868,644 SNPs spanning the five autosomes. Variant depth of coverage (DP) and strand bias (FS) were the main drivers of SNP removal (Supplementary Fig. 16b).

Called SNPs were used for particular analyses (N_e trajectory through time, cross-population composite likelihood-ratio) that could not be performed under the probabilistic framework that relied on genotype likelihoods as implemented in ANGSD⁶⁹ v. 0.919-20-gb988fab (Supplementary note). These data were also used to estimate average differentiation between isolates. However, within isolate diversity and admixture were analysed using ANGSD⁶⁹ (Supplementary note).

Mitochondrial DNA data processing

The mitochondrial genome exhibited an average coverage depth of 322× (ranging from 24× to 5,868×) per sample (Supplementary Table 1). Mitochondrial reads were extracted and filtered from poorly mapped reads using samtools view (-q 20 -f 0×0002 -F 0×0004 -F 0×0008) and from duplicated reads using Picard v.2_14_0 MarkDuplicates (https://github.com/broadinstitute/picard). Realignment around indels was applied with GATK⁷⁰ and SNP were subsequently called using samtools⁷¹ mpileup using only reads that achieved a mapping quality of 30 and base quality of 30. The occurrence of heterozygous sites in mtDNA, known as heteroplasmy, has been described across vertebrate species⁷² and in other nematode species, including C. briggsae⁷³ and C. elegans⁷⁴. However, heterozygous sites may also occur as technical artefacts as a result of genetically similar sequences shared between the nuclear and mitochondrial genomes, i.e., numts⁷⁵. To exclude sites prone to heterozygous signals from further phylogenetic inference analysis, a SNP calling procedure was implemented with the HaplotypeCaller tool to apply hard filtering parameters on the raw SNP sets (QD>=10, FS<=35 MQ>=30 MQRankSum>=−12.5 and ReadPosRankSum >=−8) with a minimum depth of 20 reads. This procedure excluded 1,354 putative heterozygous sites, and retained 72% of the putative SNP sites (3,052 out of 4,234 SNPs). Nucleotide diversity and Tajima’s D were computed by sliding-windows of 100 bp using vcftools v.0.1.15⁷⁶. A principal component analysis (PCA) was performed on genotypes using the SNPrelate package⁷⁷ in R version 3.5⁷⁸. A consensus fasta sequence was subsequently generated with GATK FastaAlternateReferenceMaker for each sample using the filtered variant set, which was used for the phylogenetic analyses.

Diversity and divergence analysis

Genome-wide nucleotide diversity (π) was computed for each isolate with at least five individuals using ANGSD⁶⁹. Using genotype likelihoods (GLs) from samtools⁷¹ (option GL=1) as an input, variants were included that had a minimal supporting evidence of 5 reads, and base and mapping quality phred scores of at least 20. As π values were biased by population mean coverage (Supplementary Fig. 17), π was also calculated from a subset of isolates containing individuals with a minimum mean coverage of 5×: this was limited to France (FRA.1, n = 5, mean coverage of 7.66×), Guadeloupe (n = 5, mean coverage of 12.75×) and Namibia (n = 6, mean coverage of 9.85×).

F_ST was estimated from the VQSR-called genotypes between isolates with at least five individuals using the Weir-Cockerham estimator⁷⁹ in vcftools v0.1.15⁷⁶. To prevent artifactual signal linked to variation in coverage between isolates, F_ST was calculated on subsets of SNPs, binned based on their minor allele frequency (MAF) in 10% increments. The maximum F_ST value calculated was retained for comparison (Supplementary Fig. 4) and the maximal value was considered as reported elsewhere⁸⁰. The resulting F_ST estimates were not biased by coverage, as measured by negligible correlation between pairwise F_ST coefficients and associated population cross-coverage (Pearson’s r₍₁₃₆₎ = −0.05, P = 0.55; Supplementary Fig. 18).

The pairwise sequence divergence between individual samples was calculated using the Hamming distance, i.e. 1-IBS, using PLINK⁸¹, considering the only individuals with a mean coverage above 2.5× as failure to do so yielded biased estimates (Supplementary note, Supplementary Fig. 19). A neighbour-joining tree of Hamming distances calculated from the nuclear DNA genotypes was built using the R package ape⁸².

PCA on genotypes inferred from genotype likelihoods of the 223 samples was generated using ANGSD ngsCovar, filtering for sites with base and mapping quality phred scores less than 30, minimum depth of 5 reads, and a SNP p-value (as computed by ANGSD) below 10⁻³. Clustering was robust to coverage variation and closely matched the PCA from the VQSR-called SNP genotypes (Supplementary Fig. 20).

Phylogenetic inference

To determine the phylogenetic structure of the cohort, the 223 consensus mitochondrial fasta sequences were first aligned using Muscle v3.8.31⁸³, followed by stringent trimming of sequence alignments using Gblocks⁸⁴. The most likely evolutionary model, GTR substitution model with rate heterogeneity modelled by a gamma distribution with invariable sites, was determined using modelgenerator v.0.851⁸⁵. A maximum-likelihood tree was subsequently generated using PhyML⁸⁶ v.20120412, with branch supports computed using 100 bootstraps.

Admixture analysis

Admixture was determined using NGSAdmix⁸⁷. This tool relies on genotype likelihoods to account for data uncertainty, and has been shown to produce robust inferences about population ancestry from low coverage samples alone, or a mixture of low and higher coverage samples⁸⁷. This analysis was performed on 223 samples, for K ranging from 2 to 10 clusters (Supplementary Fig. 5 and 6), retaining sites with less than 50% missing data across individuals and minor allele frequency (MAF) above 5%. Five iterations were run omitting one autosome out at a time, and the best K was chosen as the first value that would minimize the median absolute deviation across runs (Supplementary Fig. 6). Sample coverage did not affect the results (Supplementary Fig. 21).

Effective population size and population divergence dating

The effective population size (N_e) trajectory through time, and the cross-divergence time between populations, were estimated using MSMC2^88,89. This approach uses patterns of heterozygosity along the genome to identify past recombination events modelled as Markov processes⁹⁰. Mutation density along the sequence mirrors either recent (long tract of limited diversity) or older (enrichment in heterozygosity over short distances) events. According to coalescent theory⁹¹, at any given time, the amount of recombination is proportional to N_e.

MSMC2 was applied to individuals with a mean coverage above 10x, limiting the analysis to six isolates (FRA.1, GUA, IND, MOR, NAM, STA.1), and considering four haplotypes per isolate. Beagle v4.1 was used to impute missing genotypes and to establish phase in VQSR SNP calls. Imputation accuracy analysis revealed a 7.2% and 9.0% discordance rates at the individual and site levels respectively (Supplementary Fig. 22). Input files were created following MSMC recommendations and available msmc-tools (https://github.com/stschiff/msmc-tools). Briefly, the reference fasta sequence was masked with SNPable (http://lh3lh3.users.sourceforge.net/snpable.shtml) to extract regions of unambiguous read mapping in chromosome-specific bed files (using the available msmc_create_map_mask.py python script). Negative bed files indicating regions with sufficient coverage at the individual level were created from samples bam files with the bamCaller.py script and filter out sites with coverage below genome-wide average depth. Finally, MSMC2 input files were created for each chromosome with the generate_multihetsep.py script and concatenated into a single input file. For each isolate, estimates were averaged across five runs, leaving one chromosome out at a time for cross-validation, using rho/mu parameter value of 6.22 (average recombination rate of 1.68 cM/Mbp¹² and considering a mutation rate similar to that of C. elegans mutation rate, 2.7 × 10⁻⁹ per site per generation²⁰). MSMC2 times and coalescent rates were scaled to real time and population sizes by assuming the same mutation rate²⁰, a balanced sex-ratio²¹ and an inferred generation interval of 40 days (the sum of 10 days to reach mature free-living infective larvae from the egg stage, and a 30-day prepatent period for fully mature egg-laying females)⁹².

Migratory scenarios between populations were determined using the forward simulation framework implemented in δaδi⁹³. Under a given evolutionary scenario, this software models the expected joint site frequency spectrum between multiple populations using a diffusion equation. These expected values are then used to compute the most likely demographic parameters knowing the observed site frequency spectrum. For each model, four rounds of forward simulations were run with 10, 20, 30 and 40 replicates respectively using previously published python scripts⁹⁴ (https://github.com/dportik/dadi_pipeline). Model Akaike Information Criterion (AIC) were compared for ranking scenarios, from which the lower the score, the more likely the outcome.

We first compared a divergence scenario without migration against models including symmetrical and asymmetrical gene flow before isolation. In case migration was the most likely, more complex models (involving split with ancestral (a)symmetrical gene flow, with or without population size change, or models involving secondary contact with/without gene flow and population size change) were tested. However, initial exploration indicated a likely lack of power in our design to accurately estimate parameters of more complex demographic models than the split and isolation model. Nevertheless, these models still provide the most likely scenario and their output have been listed in supplementary Table 3.

Parameters were scaled to real time using same parameters as for MSMC2 inference. Standard deviations of timing estimates for the simple split and isolation models were obtained using the Godambe Information Matrix⁹⁵ applied to 100 simulated site frequency spectra produced with the ms software⁹⁶ under the most likely demographic model.

Additional support to the estimates from the nuclear genome were obtained from a phylogenetic analysis of coding sequences using BEASTv1.10⁹⁷. Mitochondrial coding sequences were extracted from the consensus sequence of every individual, concatenated per individual, and aligned using Musclev3.8.31⁸³. A Bayesian skyline model⁹⁸ was used, with a HKY substitution model and a strict clock model, as other modalities yielded weak effective sample size (ESS) and unstable parameter values. Clock rate was set to C. elegans mitochondrial mutation rate, i.e. 1 × 10⁻⁷ per site per generation⁷⁴, as variation in sampling date was not sufficient to estimate molecular rate. Parameters showed sufficient sampling (effective size above 200) after 50,000,000 iterations and a burn-in of the first 20 million steps. Node ages were scaled to years assuming a generation interval of 40 days. A maximum clade credibility tree was generated with TreeAnnotator v1.10.1 (http://beast.community/treeannotator).

Diversifying selection scan

To further characterize the genetic diversity in H. contortus populations, we identified genomic regions under diversifying selection using XP-CLR²⁷. This approach takes advantage of both within-population distortion of the allele frequency spectrum, and between-population differences in allele frequencies in the vicinity of selective sweeps²⁷. Although XP-CLR is robust to SNP ascertainment bias²⁷, the analysis was restricted to isolates with at least five individuals with a minimum mean coverage of 2× per individual. Unphased VQSR-derived genotypes were filtered to retain SNPs with a within-isolate call rate >80% and MAF >5%. The analysis was run on every one of the 22 possible within-continent pairwise comparisons of the retained isolates (Australia and Africa) with the following options: -w1 0.0001 500 2000 -p0 0. This fit a grid of putatively selected points every 2 Kbp along the genome, with a sliding window size around grid points of 0.01 cM, interpolating SNP position in Morgans from the average recombination rate for each chromosome¹². Down-sampling was applied to windows where more than 500 SNPs were found to keep SNP numbers comparable between regions, and no LD-based down-weighing of the CLR scores was applied. A selection score was subsequently computed at every position as the root mean square of XP-CLR coefficients, and the highest 0.1% of selection scores were deemed significant, as reported elsewhere⁸⁰.

Analysis of β-tubulin isotype 1 (Hco-btub-iso-1) and the genetic architecture of benzimidazole- and ivermectin-resistance

The genomic coordinates of Hco-btub-iso-1 were determined by blasting the gene coding sequence from WormBase Parasite⁹⁹ against current genome assembly (BLASTN, e-value < 10⁻⁵⁰). The SNP positions associated with codons 167, 198 and 200 was determined to be at 7027535, 7027755, and 7027758 bp, respectively, along chromosome I after aligning the Hco-btub-iso-1 consensus sequence and published sequence¹⁰⁰ (GenBank accession FJ981629.1) using muscle v3.8.31⁸³. Genotype and GLs at these positions were determined with ANGSD⁶⁹. Genotypes were only considered for GL >60% (n = 74) as coverage bias occurred otherwise (Supplementary Table 6 and Supplementary Fig. 12). For these samples, phased and imputed genotypes from VQSR SNPs spanning the Hco-btub-iso-1 locus were used to compute pairwise number of allele differences. Selection in the vicinity of the Hco-btub-iso-1 locus was assessed by computing Tajima’s D with ANGSD¹⁰¹. For benzimidazole-resistant isolates, neutral state was built with ms¹⁰² by simulating 10-Kbp wide isolate-specific sequences (n=1,000) following the same coalescent scenario as predicted by MSMC2 for chromosome I (considering a recombination rate of 1.83 cM/Mbp¹²). Suitable ms input parameters were derived from MSMC2 output files using the msmc2ms.py script from msmc-tools. This approach was implemented for isolates with sufficient mean depth of coverage, i.e. three benzimidazole-resistant isolates. In case of susceptible isolates, the lack of significant departure in the Hco-btub-iso-1 vicinity relative to the rest of chromosome I was tested.

The introgression of resistant haplotypes from France mainland into Guadeloupian worm isolates was tested by a topology weighting analysis implemented with TWISST¹⁰³. This method computes phylogenetic trees between a set of isolates, using genetic information from short sliding windows (50 Kbp) and by sampling with replacement individuals from each isolate. At each window, a weight is subsequently computed for each of the possible tree topology, ultimately providing inference of introgression events in discrete locations where tree topology connects phylogenetically distant isolates. Analyses were run using individuals with sufficient mean depth of coverage (5× and more) from French (FRA.1; n = 5; mean depth = 7.62×) and Guadeloupian (GUA; n = 5; mean depth = 12.75×) isolates, adding Namibian isolate (n = 2; mean depth = 10.9×) as an outgroup. A fourth isolate was chosen for its common ancestry with Guadeloupian isolates: first, an analysis was run with worms from São Tomé (n = 2; mean depth = 7.01×), followed by a second analysis using Moroccan samples (n = 2; mean depth = 10.6×), to ensure that evidence of introgression was consistent in both cases.

To investigate the genetic architecture of ivermectin resistance, a differentiation scan was run between ivermectin-resistant isolates and isolates of unknown status sharing same ancestry, i.e. STA.1 and STA.3 versus NAM and ZAI in Africa, AUS.1 against AUS.2 in Australia. Windowed F_ST estimates⁷⁹ were calculated every 10 Kbp with a 1 Kbp overlap along the genome with ANGSD⁶⁹, retaining sites with minimal depth of 5 reads, mapping and base quality phred scores of at least 30, missing rate below 50%, and windows with at least 1000 sites. Genomic coordinates of the top 0.5% most differentiated windows were extracted and analysed by BLASTN (minimum e-value = 10⁻⁵⁰) against the published V1 H. contortus assembly¹³, and annotated gene identifiers were inferred from the corresponding GFF file from WormBase Parasite⁹⁹.

Identification of SNPs under environmental selection

To identify SNPs putatively influenced by environmental selection, we first defined the climatic conditions of each isolate using the Köppen-Geiger classification¹⁰⁴, and inferred from isolate geographical coordinates¹⁰⁵. Three isolates with the best coverage, i.e. Namibia, France, and Guadeloupe, were used to contrast dry, temperate and tropical conditions, respectively. We performed a genome-wide scan using pairwise comparisons of F_ST using ANGSD⁶⁹, of which test values greater than the 0.5% quantile in at least two comparisons were analysed further.

To further explore the contribution of environmental climatic variables on standing genetic variation, we applied a machine-learning gradient forest algorithm¹⁰⁶ to quantify changes in genetic variation (fit as population SNP MAF) along environmental gradients. The gradients consisted of 19 bioclimatic variables¹⁰⁷ summarizing rainfall and temperature information recorded between 1970 and 2000(supplementary table 9). For each SNP, a random forest of 500 trees was grown. For each tree, bootstrapped SNP MAFs were regressed against a random subset of bioclimatic variables to determine the variable that best partitioned the data, thereby building the first node that partitions the data into two sets of homogeneous observations. Iterations follow to determine subsequent nodes by resampling a random subset of bioclimatic variables until no observations are left. The proportion of variance explained by each bioclimatic variable is then averaged across SNPs, and the function of SNP frequency modification along bioclimatic variable is built.

The analysis was performed on isolates with at least 5 individuals, a mean coverage of 2×, and no record of ivermectin resistance (AUS.2, FRA.1, GUA, IND, MOR, NAM, STO, ZAI). SNP MAFs were estimated from genotype likelihoods with ANGSD, and subsequently filtered to ensure a within-isolate MAF >10%, at least 90% within-isolate call-rate, and shared across the eight considered isolates, resulting in 3,758 SNPs retained for further analysis.

Environmental variables were highly correlated (Supplementary Fig. 15a), resulting in instability in the predictor’s importance. To minimise this effect, the gradient forest analysis was restricted to the 11 environmental variables showing least redundancy as assessed by a PCA (Supplementary Fig. 15b). Pair-wise Euclidean distances between variables was computed from their respective coordinates on the first two PCA axes (supplementary Fig. 15a). We selected variables with higher distances (mean distance > 0.85) with any others (BIO4, 7, 2, 15, 5, 10). Other variables defined three clusters (Supplementary Fig. 15b). Within each cluster, we picked the variable with closest distance from every other in the cluster, i.e. the variable summarizing others’ contributions (BIO9, BIO13, BIO19). For the BIO12, 13, 16, 18 cluster, we chose to pick BIO12 (annual precipitation) which is more relevant toward parasite life-cycle across climatic areas. Under temperate areas, quarter-based (BIO16, BIO18) or wettest month (BIO13) statistics would match seasons where hosts are housed. SNP-environment associations were further investigated by focusing on the top two environmental predictors of genomic variations and using a Latent Factor Mixed Model analysis. This analysis was implemented with the lfmm R package¹⁰⁸ on genotype calls from the VQSR pipeline, considering SNPs with call rate of at least 70% across isolates and minor genotype frequency above 5%, using K = 3 for the latent factor accounting for underlying population structure. Analyses were run 5 times and P values were combined and adjusted as recommended¹⁰⁸.

Gene identification and Gene Ontology enrichment analysis

Annotation of the revised H. contortus genome is ongoing. Therefore, genes underlying major differentiation signals were inferred by blasting the region of interest (10 Kbp window for F_ST analyses, 2000 bp region around XP-CLR hits) against the published V1 H. contortus genome assembly¹³. Any genes falling within most probable blast hit coordinates were retrieved from the H. contortus published assembly available from WormBase Parasite⁹⁹. This database was also used to retrieve C. elegans orthologs of positional candidate genes and H. contortus Gene Ontology (GO) terms. GO term enrichment analysis was run with the R topGO¹⁰⁹ package, considering nodes with at least 10 annotated genes. The “weight01” algorithm was used to account for existing topology between GO terms. This framework makes P-value of one GO term conditioned on its neighbours, thereby correcting for multiple testing. Enrichment was tested by the Kolmogorov-Smirnov statistic applied to gene selection or F_ST score accordingly. GO terms with P values below 1% were deemed significant.

Data availability

Raw sequencing data are archived under the ENA study accession PRJEB9837. Sequencing data were analysed with publicly available script and software as mentioned in main text. Outputs were analysed using an R script available at: https://github.com/guiSalle/Haemonchus_diversity. Reference assembly used in this project is available at: ftp://ngs.sanger.ac.uk/production/pathogens/Haemonchus_contortus.

Author contributions

GS, JAC designed the experiment. GS, JAC, SD drafted the manuscript. JCa and JCo sampled and prepared parasite materials. GS performed DNA extraction and data analyses. SD built the reference genome. MB and NH managed and supervised parasite sequencing and the project.

Competing interests

Authors declare they have no competing interests.

Acknowledgements

JAC, MB, NH, and SD are supported by the Wellcome Trust via their core funding of the Wellcome Trust Sanger Institute (grant 206194). GS has received the support of the EU in the framework of the Marie-Curie FP7 COFUND People Programme, through the award of an AgreenSkills (grant agreement n° 267196) and AgreenSkills+ fellowships (grant agreement n°609398). Authors are grateful to Pr. Beech and Gilleard for insightful discussions.

References

1.↵
Blaxter, M. & Koutsovoulos, G. The evolution of parasitism in Nematoda. Parasitology 142 Suppl 1, S26–39 (2015).
OpenUrl CrossRef PubMed
2.↵
O'Connor, L.J., Walkden-Brown, S.W. & Kahn, L.P. Ecology of the free-living stages of major trichostrongylid parasites of sheep. Vet Parasitol 142, 1–15 (2006).
OpenUrl CrossRef PubMed
3.↵
Jones, J.T. et al. Top 10 plant-parasitic nematodes in molecular plant pathology. Mol Plant Pathol 14, 946–61 (2013).
OpenUrl CrossRef PubMed Web of Science
4.↵
Anderson, R.C. The origins of zooparasitic nematodes. Can J Zool 62, 317–328 (1984).
OpenUrl CrossRef Web of Science
5.↵
Kaplan, R.M. & Vidyashankar, A.N. An inconvenient truth: Global worming and anthelmintic resistance. Veterinary Parasitology 186, 70–78 (2012).
OpenUrl CrossRef PubMed
6.↵
McKellar, Q.A. & Jackson, F. Veterinary anthelmintics: old and new. Trends Parasitol 20, 456–61 (2004).
OpenUrl CrossRef PubMed Web of Science
7.↵
Bundy, D.A.P. et al. 100 Years of Mass Deworming Programmes: A Policy Perspective From the World Bank's Disease Control Priorities Analyses. Adv Parasitol 100, 127–154 (2018).
OpenUrl
8.↵
Schulz, J.D., Moser, W., Hurlimann, E. & Keiser, J. Preventive Chemotherapy in the Fight against Soil-Transmitted Helminthiasis: Achievements and Limitations. Trends Parasitol (2018).
9.↵
Hewitson, J.P. & Maizels, R.M. Vaccination against helminth parasite infections. Expert Rev Vaccines 13, 473–87 (2014).
OpenUrl CrossRef PubMed
10.↵
Sallé, G. et al. Transcriptomic profiling of nematode parasites surviving vaccine exposure. Int J Parasitol (2018).
11.↵
Gilleard, J.S. Haemonchus contortus as a paradigm and model to study anthelmintic drug resistance. Parasitology 140, 1506–22 (2013).
OpenUrl CrossRef PubMed
12.↵
Doyle, S.R. et al. A Genome Resequencing-Based Genetic Map Reveals the Recombination Landscape of an Outbred Parasitic Nematode in the Presence of Polyploidy and Polyandry. Genome Biol Evol 10, 396–409 (2018).
OpenUrl
13.↵
Laing, R. et al. The genome and transcriptome of Haemonchus contortus, a key model parasite for drug and vaccine discovery. Genome Biol 14, R88 (2013).
OpenUrl CrossRef PubMed
14.↵
International Helminth Genomes, C. Comparative genomics of the major parasitic worms. Nat Genet 51, 163–174 (2019).
OpenUrl
15.
Mougin, C. et al. BRC4Env, a network of Biological Resource Centres for research in environmental and agricultural sciences. Environ Sci Pollut Res Int (2018).
16.↵
Langley, C.H. et al. Genomic variation in natural populations of Drosophila melanogaster. Genetics 192, 533–98 (2012).
OpenUrl Abstract/FREE Full Text
17.↵
Small, S.T. et al. Population genomics of the filarial nematode parasite Wuchereria bancrofti from mosquitoes. Mol Ecol 25, 1465–77 (2016).
OpenUrl
18.↵
Choi, Y.J. et al. Genomic diversity in Onchocerca volvulus and its Wolbachia endosymbiont. Nat Microbiol 2, 16207 (2016).
OpenUrl
19.
Perry, G.H. et al. Comparative RNA sequencing reveals substantial genetic variation in endangered primates. Genome Res 22, 602–10 (2012).
OpenUrl Abstract/FREE Full Text
20.↵
Denver, D.R. et al. A genome-wide view of Caenorhabditis elegans base-substitution mutation processes. Proc Natl Acad Sci U S A 106, 16310–4 (2009).
OpenUrl Abstract/FREE Full Text
21.↵
Saccareau, M. et al. Meta-analysis of the parasitic phase traits of Haemonchus contortus infection in sheep. Parasit Vectors 10, 201 (2017).
OpenUrl
22.↵
Geggus, D. The French Slave Trade: An Overview. The William and Mary Quaterly 58, 119–138 (2001).
OpenUrl
23.↵
Anonymous. The wool industry – looking back and forward. (Australian Bureau of Statistics, 2003).
24.↵
Evesson, B. & Moor, R. The foundation of Australia’s fine wool industry. in Journal of the Royal Australian Historical Society (2000).
25.↵
Eng, J.K. et al. Ivermectin selection on beta-tubulin: evidence in Onchocerca volvulus and Haemonchus contortus. Mol Biochem Parasitol 150, 229–35 (2006).
OpenUrl CrossRef PubMed
26.↵
de Lourdes Mottier, M. & Prichard, R.K. Genetic analysis of a relationship between macrocyclic lactone and benzimidazole anthelmintic selection on Haemonchus contortus. Pharmacogenet Genomics 18, 129–40 (2008).
OpenUrl CrossRef PubMed Web of Science
27.↵
Chen, H., Patterson, N. & Reich, D. Population differentiation as a test for selective sweeps. Genome Res 20, 393–402 (2010).
OpenUrl Abstract/FREE Full Text
28.↵
Ardelli, B.F., Stitt, L.E., Tompkins, J.B. & Prichard, R.K. A comparison of the effects of ivermectin and moxidectin on the nematode Caenorhabditis elegans. Vet Parasitol 165, 96–108 (2009).
OpenUrl CrossRef PubMed
29.↵
Janssen, I.J., Krucken, J., Demeler, J. & von Samson-Himmelstjerna, G. Transgenically expressed Parascaris P-glycoprotein-11 can modulate ivermectin susceptibility in Caenorhabditis elegans. Int J Parasitol Drugs Drug Resist 5, 44–7 (2015).
OpenUrl CrossRef PubMed
30.↵
Avery, L. & Horvitz, H.R. Effects of starvation and neuroactive drugs on feeding in Caenorhabditis elegans. J Exp Zool 253, 263–70 (1990).
OpenUrl CrossRef PubMed Web of Science
31.↵
Scott, E.W., Baxter, P. & Armour, J. Fecundity of anthelmintic resistant adult Haemonchus contortus after exposure to ivermectin or benzimidazoles in vivo. Res Vet Sci 50, 247–9 (1991).
OpenUrl PubMed
32.↵
Kotze, A.C. & Prichard, R.K. Anthelmintic Resistance in Haemonchus contortus: History, Mechanisms and Diagnosis. Adv Parasitol 93, 397–428 (2016).
OpenUrl
33.↵
Martin, R.J. et al. Levamisole receptors: a second awakening. Trends Parasitol 28, 289–96 (2012).
OpenUrl CrossRef PubMed
34.↵
Wada, M. et al. Isolation and characterization of a GDP/GTP exchange protein specific for the Rab3 subfamily small G proteins. J Biol Chem 272, 3875–8 (1997).
OpenUrl Abstract/FREE Full Text
35.↵
Cao, R. & Zhang, Y. The functions of E(Z)/EZH2-mediated methylation of lysine 27 in histone H3. Curr Opin Genet Dev 14, 155–64 (2004).
OpenUrl CrossRef PubMed Web of Science
36.↵
Bolajoko, M.B. et al. The basic reproduction quotient (Q0) as a potential spatial predictor of the seasonality of ovine haemonchosis. Geospat Health 9, 333–50 (2015).
OpenUrl
37.↵
Fitzpatrick, M.C. & Keller, S.R. Ecological genomics meets community-level modelling of biodiversity: mapping the genomic landscape of current and future environmental adaptation. Ecol Lett 18, 1–16 (2015).
OpenUrl CrossRef PubMed
38.↵
Doyle, S.R. et al. Population genomic and evolutionary modelling analyses reveal a single major QTL for ivermectin drug resistance in the pathogenic nematode, Haemonchus contortus. BMC Genomics 20, 218 (2019).
OpenUrl
39.↵
Blouin, M.S., Yowell, C.A., Courtney, C.H. & Dame, J.B. Host movement and the genetic structure of populations of parasitic nematodes. Genetics 141, 1007–14 (1995).
OpenUrl Abstract/FREE Full Text
40.↵
Troell, K., Engstrom, A., Morrison, D.A., Mattsson, J.G. & Hoglund, J. Global patterns reveal strong population structure in Haemonchus contortus, a nematode parasite of domesticated ruminants. Int J Parasitol 36, 1305–16 (2006).
OpenUrl CrossRef PubMed Web of Science
41.
Dey, A., Chan, C.K., Thomas, C.G. & Cutter, A.D. Molecular hyperdiversity defines populations of the nematode Caenorhabditis brenneri. Proc Natl Acad Sci U S A 110, 11056–60 (2013).
OpenUrl Abstract/FREE Full Text
42.↵
Gilleard, J.S. & Redman, E. Genetic Diversity and Population Structure of Haemonchus contortus. Adv Parasitol 93, 31–68 (2016).
OpenUrl CrossRef PubMed
43.↵
Larson, G. et al. Current perspectives and the future of domestication studies. Proc Natl Acad Sci U S A 111, 6139–46 (2014).
OpenUrl Abstract/FREE Full Text
44.↵
Pleurdeau, D. et al. "Of sheep and men": earliest direct evidence of caprine domestication in southern Africa at Leopard Cave (Erongo, Namibia). PLoS One 7, e40340 (2012).
OpenUrl CrossRef PubMed
45.↵
Campbell, M.C. & Tishkoff, S.A. The evolution of human genetic and phenotypic variation in Africa. Curr Biol 20, R166–73 (2010).
OpenUrl CrossRef PubMed Web of Science
46.↵
Chessa, B. et al. Revealing the history of sheep domestication using retrovirus integrations. Science 324, 532–6 (2009).
OpenUrl Abstract/FREE Full Text
47.↵
Tapio, M. et al. Sheep mitochondrial DNA variation in European, Caucasian, and Central Asian areas. Mol Biol Evol 23, 1776–83 (2006).
OpenUrl CrossRef PubMed Web of Science
48.
Kijas, J.W. et al. Genome-wide analysis of the world's sheep breeds reveals high levels of historic mixture and strong recent selection. PLoS Biol 10, e1001258 (2012).
OpenUrl CrossRef PubMed
49.↵
Spangler, G.L. et al. Whole genome structural analysis of Caribbean hair sheep reveals quantitative link to West African ancestry. PLoS One 12, e0179021 (2017).
OpenUrl
50.↵
Naves, M., Alexandre, G., Leimbacher, F., Mandonnet, N. & Menendez-buxadera, A. Les ruminants domestiques de la Caraïbe: le point sur les ressources génétiques et leur exploitation. INRA Prod. Anim. 14, 181–192 (2001).
OpenUrl
51.↵
Roman, A. Saint Malo au temps des négriers, 357 (Karthala, 2003).
52.↵
Pereira, F. et al. Genetic signatures of a Mediterranean influence in Iberian Peninsula sheep husbandry. Mol Biol Evol 23, 1420–6 (2006).
OpenUrl CrossRef PubMed Web of Science
53.↵
Redman, E. et al. The emergence of resistance to the benzimidazole anthlemintics in parasitic nematodes of livestock is characterised by multiple independent hard and soft selective sweeps. PLoS Negl Trop Dis 9, e0003494 (2015).
OpenUrl
54.↵
Vercruysse, J. et al. Is anthelmintic resistance a concern for the control of human soil-transmitted helminths? Int J Parasitol Drugs Drug Resist 1, 14–27 (2011).
OpenUrl
55.↵
Laing, R., Gillan, V. & Devaney, E. Ivermectin – Old Drug, New Tricks? Trends Parasitol 33, 463–472 (2017).
OpenUrl CrossRef
56.↵
Osei-Atweneboana, M.Y. et al. Phenotypic evidence of emerging ivermectin resistance in Onchocerca volvulus. PLoS Negl Trop Dis 5, e998 (2011).
OpenUrl CrossRef PubMed
57.↵
Doyle, S.R. et al. Genome-wide analysis of ivermectin response by Onchocerca volvulus reveals that genetic drift and soft selective sweeps contribute to loss of drug sensitivity. PLoS Negl Trop Dis 11, e0005816 (2017).
OpenUrl
58.↵
Rose, H. et al. Climate-driven changes to the spatio-temporal distribution of the parasitic nematode, Haemonchus contortus, in sheep in Europe. Glob Chang Biol 22, 1271–85 (2016).
OpenUrl
59.↵
van Dijk, J., David, G.P., Baird, G. & Morgan, E.R. Back to the future: developing hypotheses on the effects of climate change on ovine parasitic gastroenteritis from historical data. Vet Parasitol 158, 73–84 (2008).
OpenUrl PubMed
60.↵
Rajpurohit, S., Oliveira, C.C., Etges, W.J. & Gibbs, A.G. Functional genomic and phenotypic responses to desiccation in natural populations of a desert drosophilid. Mol Ecol 22, 2698–715 (2013).
OpenUrl CrossRef Web of Science
61.↵
Sharma, V., Kohli, S. & Brahmachari, V. Correlation between desiccation stress response and epigenetic modifications of genes in Drosophila melanogaster: An example of environment-epigenome interaction. Biochim Biophys Acta 1860, 1058–1068 (2017).
OpenUrl
62.↵
Coustham, V. et al. Quantitative modulation of polycomb silencing underlies natural variation in vernalization. Science 337, 584–7 (2012).
OpenUrl Abstract/FREE Full Text
63.↵
Cassada, R.C. & Russell, R.L. The dauerlarva, a post-embryonic developmental variant of the nematode Caenorhabditis elegans. Dev Biol 46, 326–42 (1975).
OpenUrl CrossRef PubMed Web of Science
64.↵
Blitz, N.M. & Gibbs, H.C. Studies on the arrested development of Haemonchus contortus in sheep. I. The induction of arrested development. Int J Parasitol 2, 5–12 (1972).
OpenUrl CrossRef PubMed
65.↵
Wood, D.E. & Salzberg, S.L. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol 15, R46 (2014).
OpenUrl CrossRef PubMed
66.↵
Li, B.W. et al. Transcriptomes and pathways associated with infectivity, survival and immunogenicity in Brugia malayi L3. BMC Genomics 10, 267 (2009).
OpenUrl CrossRef PubMed
67.↵
DePristo, M.A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet 43, 491–8 (2011).
OpenUrl CrossRef PubMed Web of Science
68.↵
Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. arXiv preprint arXiv:1207.3907 [q-bio.GN](2012).
69.↵
Korneliussen, T.S., Albrechtsen, A. & Nielsen, R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinformatics 15, 356 (2014).
OpenUrl CrossRef PubMed
70.↵
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20, 1297–303 (2010).
OpenUrl Abstract/FREE Full Text
71.↵
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–9 (2009).
OpenUrl CrossRef PubMed Web of Science
72.↵
Rensch, T., Villar, D., Horvath, J., Odom, D.T. & Flicek, P. Mitochondrial heteroplasmy in vertebrates using ChIP-sequencing data. Genome Biol 17, 139 (2016).
OpenUrl
73.↵
Howe, D.K., Baer, C.F. & Denver, D.R. High rate of large deletions in Caenorhabditis briggsae mitochondrial genome mutation processes. Genome Biol Evol 2, 29–38 (2009).
OpenUrl
74.↵
Konrad, A. et al. Mitochondrial Mutation Rate, Spectrum and Heteroplasmy in Caenorhabditis elegans Spontaneous Mutation Accumulation Lines of Differing Population Size. Mol Biol Evol 34, 1319–1334 (2017).
OpenUrl
75.↵
Hazkani-Covo, E., Zeller, R.M. & Martin, W. Molecular poltergeists: mitochondrial DNA copies (numts) in sequenced nuclear genomes. PLoS Genet 6, e1000834 (2010).
OpenUrl CrossRef PubMed
76.↵
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–8 (2011).
OpenUrl CrossRef PubMed Web of Science
77.↵
Zheng, X. et al. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28, 3326–8 (2012).
OpenUrl CrossRef PubMed Web of Science
78.↵
R Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, Vienna, 2016).
79.↵
Weir, B.S. & Cockerham, C.C. Estimating F-Statistics for the Analysis of Population Structure. Evolution 38, 1358–1370 (1984).
OpenUrl CrossRef PubMed Web of Science
80.↵
Svardal, H. et al. Ancient hybridization and strong adaptation to viruses across African vervet monkey populations. Nat Genet 49, 1705–1713 (2017).
OpenUrl CrossRef
81.↵
Chang, C.C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
OpenUrl CrossRef PubMed
82.↵
Paradis, E., Claude, J. & Strimmer, K. APE: Analyses of Phylogenetics and Evolution in R language. Bioinformatics 20, 289–90 (2004).
OpenUrl CrossRef PubMed Web of Science
83.↵
Edgar, R.C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32, 1792–7 (2004).
OpenUrl CrossRef PubMed Web of Science
84.↵
Talavera, G. & Castresana, J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol 56, 564–77 (2007).
OpenUrl CrossRef PubMed Web of Science
85.↵
Keane, T.M., Creevey, C.J., Pentony, M.M., Naughton, T.J. & McLnerney, J.O. Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified. BMC Evol Biol 6, 29 (2006).
OpenUrl CrossRef PubMed
86.↵
Guindon, S. & Gascuel, O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol 52, 696–704 (2003).
OpenUrl CrossRef PubMed Web of Science
87.↵
Skotte, L., Korneliussen, T.S. & Albrechtsen, A. Estimating individual admixture proportions from next generation sequencing data. Genetics 195, 693–702 (2013).
OpenUrl Abstract/FREE Full Text
88.↵
Schiffels, S. & Durbin, R. Inferring human population size and separation history from multiple genome sequences. Nat Genet 46, 919–25 (2014).
OpenUrl CrossRef PubMed
89.↵
Malaspinas, A.S. et al. A genomic history of Aboriginal Australia. Nature 538, 207–214 (2016).
OpenUrl CrossRef PubMed
90.↵
McVean, G.A. & Cardin, N.J. Approximating the coalescent with recombination. Philos Trans R Soc Lond B Biol Sci 360, 1387–93 (2005).
OpenUrl CrossRef PubMed
91.↵
Kingman, J.F.C. The coalescent. Stochastic Processes and their Applications 13, 235–248 (1982).
OpenUrl CrossRef
92.↵
Emery, D.L., Hunt, P.W. & Le Jambre, L.F. Haemonchus contortus: the then and now, and where to from here? Int J Parasitol 46, 755–769 (2016).
OpenUrl CrossRef PubMed
93.↵
Gutenkunst, R.N., Hernandez, R.D., Williamson, S.H. & Bustamante, C.D. Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS Genet 5, e1000695 (2009).
OpenUrl CrossRef PubMed
94.↵
Portik, D.M. et al. Evaluating mechanisms of diversification in a Guineo-Congolian tropical forest frog using demographic model selection. Mol Ecol 26, 5245–5263 (2017).
OpenUrl
95.↵
Coffman, A.J., Hsieh, P.H., Gravel, S. & Gutenkunst, R.N. Computationally Efficient Composite Likelihood Statistics for Demographic Inference. Mol Biol Evol 33, 591–3 (2016).
OpenUrl CrossRef PubMed
96.↵
Hudson, R.R. Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics 18, 337–8 (2002).
OpenUrl CrossRef PubMed Web of Science
97.↵
Suchard, M.A. et al. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol 4, vey016 (2018).
OpenUrl CrossRef PubMed
98.↵
Drummond, A.J., Rambaut, A., Shapiro, B. & Pybus, O.G. Bayesian coalescent inference of past population dynamics from molecular sequences. Mol Biol Evol 22, 1185–92 (2005).
OpenUrl CrossRef PubMed Web of Science
99.↵
Howe, K.L., Bolt, B.J., Shafie, M., Kersey, P. & Berriman, M. WormBase ParaSite – a comprehensive resource for helminth genomics. Mol Biochem Parasitol 215, 2–10 (2017).
OpenUrl CrossRef PubMed
100.↵
Rufener, L., Kaminsky, R. & Maser, P. In vitro selection of Haemonchus contortus for benzimidazole resistance reveals a mutation at amino acid 198 of beta-tubulin. Mol Biochem Parasitol 168, 120–2 (2009).
OpenUrl PubMed
101.↵
Korneliussen, T.S., Moltke, I., Albrechtsen, A. & Nielsen, R. Calculation of Tajima's D and other neutrality test statistics from low depth next-generation sequencing data. BMC Bioinformatics 14, 289 (2013).
OpenUrl CrossRef PubMed
102.↵
Hudson, P.J. et al. Trophic interactions and population growth rates: describing patterns and identifying mechanisms. Philos Trans R Soc Lond B Biol Sci 357, 1259–71 (2002).
OpenUrl CrossRef PubMed
103.↵
Martin, S.H. & Van Belleghem, S.M. Exploring Evolutionary Relationships Across the Genome Using Topology Weighting. Genetics 206, 429–438 (2017).
OpenUrl Abstract/FREE Full Text
104.↵
Köppen, W. Die Wärmezonen der Erde, nach der Dauer der heissen, gemässigten und kalten Zeit und nach der Wirkung der Wärme auf die organische Welt betrachtet Meteorol. Z. 1, 215–226 (1884).
OpenUrl
105.↵
Chen, D. & Chen, H.W. Using the Köppen classification to quantify climate variation and change: An example for 1901–2010. Environmental Development 6, 69–79 (2013).
OpenUrl
106.↵
Ellis, N., Smith, S.J. & Pitcher, C.R. Gradient forests: calculating importance gradients on physical predictors. Ecology 93, 156–68 (2012).
OpenUrl CrossRef PubMed
107.↵
Fick, S. E. & Hijmans, R.J. WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. International Journal of Climatology 37, 4302–4315 (2017).
OpenUrl
108.↵
Frichot, E., Schoville, S.D., Bouchard, G. & Francois, O. Testing for associations between loci and environmental gradients using latent factor mixed models. Mol Biol Evol 30, 1687–99 (2013).
OpenUrl CrossRef PubMed Web of Science
109.↵
Alexa, A., Rahnenfuhrer, J. topGO: Enrichment Analysis for Gene Ontology. (2016).

View the discussion thread.

Posted April 13, 2019.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Genomics

Subject Areas

All Articles

Animal Behavior and Cognition (5204)
Biochemistry (11725)
Bioengineering (8728)
Bioinformatics (29135)
Biophysics (14940)
Cancer Biology (12052)
Cell Biology (17363)
Clinical Trials (138)
Developmental Biology (9408)
Ecology (14147)
Epidemiology (2067)
Evolutionary Biology (18272)
Genetics (12223)
Genomics (16773)
Immunology (11844)
Microbiology (28027)
Molecular Biology (11564)
Neuroscience (60841)
Paleontology (451)
Pathology (1864)
Pharmacology and Toxicology (3232)
Physiology (4940)
Plant Biology (10405)
Scientific Communication and Education (1681)
Synthetic Biology (2878)
Systems Biology (7335)
Zoology (1642)

[1] 1.↵
Blaxter, M. & Koutsovoulos, G. The evolution of parasitism in Nematoda. Parasitology 142 Suppl 1, S26–39 (2015).
OpenUrl CrossRef PubMed

[2] 2.↵
O'Connor, L.J., Walkden-Brown, S.W. & Kahn, L.P. Ecology of the free-living stages of major trichostrongylid parasites of sheep. Vet Parasitol 142, 1–15 (2006).
OpenUrl CrossRef PubMed

[3] 3.↵
Jones, J.T. et al. Top 10 plant-parasitic nematodes in molecular plant pathology. Mol Plant Pathol 14, 946–61 (2013).
OpenUrl CrossRef PubMed Web of Science

[4] 4.↵
Anderson, R.C. The origins of zooparasitic nematodes. Can J Zool 62, 317–328 (1984).
OpenUrl CrossRef Web of Science

[5] 5.↵
Kaplan, R.M. & Vidyashankar, A.N. An inconvenient truth: Global worming and anthelmintic resistance. Veterinary Parasitology 186, 70–78 (2012).
OpenUrl CrossRef PubMed

[6] 6.↵
McKellar, Q.A. & Jackson, F. Veterinary anthelmintics: old and new. Trends Parasitol 20, 456–61 (2004).
OpenUrl CrossRef PubMed Web of Science

[7] 7.↵
Bundy, D.A.P. et al. 100 Years of Mass Deworming Programmes: A Policy Perspective From the World Bank's Disease Control Priorities Analyses. Adv Parasitol 100, 127–154 (2018).
OpenUrl

[8] 8.↵
Schulz, J.D., Moser, W., Hurlimann, E. & Keiser, J. Preventive Chemotherapy in the Fight against Soil-Transmitted Helminthiasis: Achievements and Limitations. Trends Parasitol (2018).

[9] 9.↵
Hewitson, J.P. & Maizels, R.M. Vaccination against helminth parasite infections. Expert Rev Vaccines 13, 473–87 (2014).
OpenUrl CrossRef PubMed

[10] 10.↵
Sallé, G. et al. Transcriptomic profiling of nematode parasites surviving vaccine exposure. Int J Parasitol (2018).

[11] 11.↵
Gilleard, J.S. Haemonchus contortus as a paradigm and model to study anthelmintic drug resistance. Parasitology 140, 1506–22 (2013).
OpenUrl CrossRef PubMed

[12] 12.↵
Doyle, S.R. et al. A Genome Resequencing-Based Genetic Map Reveals the Recombination Landscape of an Outbred Parasitic Nematode in the Presence of Polyploidy and Polyandry. Genome Biol Evol 10, 396–409 (2018).
OpenUrl

[13] 13.↵
Laing, R. et al. The genome and transcriptome of Haemonchus contortus, a key model parasite for drug and vaccine discovery. Genome Biol 14, R88 (2013).
OpenUrl CrossRef PubMed

[14] 14.↵
International Helminth Genomes, C. Comparative genomics of the major parasitic worms. Nat Genet 51, 163–174 (2019).
OpenUrl

[15] 15.
Mougin, C. et al. BRC4Env, a network of Biological Resource Centres for research in environmental and agricultural sciences. Environ Sci Pollut Res Int (2018).

[16] 16.↵
Langley, C.H. et al. Genomic variation in natural populations of Drosophila melanogaster. Genetics 192, 533–98 (2012).
OpenUrl Abstract/FREE Full Text

[17] 17.↵
Small, S.T. et al. Population genomics of the filarial nematode parasite Wuchereria bancrofti from mosquitoes. Mol Ecol 25, 1465–77 (2016).
OpenUrl

[18] 18.↵
Choi, Y.J. et al. Genomic diversity in Onchocerca volvulus and its Wolbachia endosymbiont. Nat Microbiol 2, 16207 (2016).
OpenUrl

[19] 19.
Perry, G.H. et al. Comparative RNA sequencing reveals substantial genetic variation in endangered primates. Genome Res 22, 602–10 (2012).
OpenUrl Abstract/FREE Full Text

[20] 20.↵
Denver, D.R. et al. A genome-wide view of Caenorhabditis elegans base-substitution mutation processes. Proc Natl Acad Sci U S A 106, 16310–4 (2009).
OpenUrl Abstract/FREE Full Text

[21] 21.↵
Saccareau, M. et al. Meta-analysis of the parasitic phase traits of Haemonchus contortus infection in sheep. Parasit Vectors 10, 201 (2017).
OpenUrl

[22] 22.↵
Geggus, D. The French Slave Trade: An Overview. The William and Mary Quaterly 58, 119–138 (2001).
OpenUrl

[23] 23.↵
Anonymous. The wool industry – looking back and forward. (Australian Bureau of Statistics, 2003).

[24] 24.↵
Evesson, B. & Moor, R. The foundation of Australia’s fine wool industry. in Journal of the Royal Australian Historical Society (2000).

[25] 25.↵
Eng, J.K. et al. Ivermectin selection on beta-tubulin: evidence in Onchocerca volvulus and Haemonchus contortus. Mol Biochem Parasitol 150, 229–35 (2006).
OpenUrl CrossRef PubMed

[26] 26.↵
de Lourdes Mottier, M. & Prichard, R.K. Genetic analysis of a relationship between macrocyclic lactone and benzimidazole anthelmintic selection on Haemonchus contortus. Pharmacogenet Genomics 18, 129–40 (2008).
OpenUrl CrossRef PubMed Web of Science

[27] 27.↵
Chen, H., Patterson, N. & Reich, D. Population differentiation as a test for selective sweeps. Genome Res 20, 393–402 (2010).
OpenUrl Abstract/FREE Full Text

[28] 28.↵
Ardelli, B.F., Stitt, L.E., Tompkins, J.B. & Prichard, R.K. A comparison of the effects of ivermectin and moxidectin on the nematode Caenorhabditis elegans. Vet Parasitol 165, 96–108 (2009).
OpenUrl CrossRef PubMed

[29] 29.↵
Janssen, I.J., Krucken, J., Demeler, J. & von Samson-Himmelstjerna, G. Transgenically expressed Parascaris P-glycoprotein-11 can modulate ivermectin susceptibility in Caenorhabditis elegans. Int J Parasitol Drugs Drug Resist 5, 44–7 (2015).
OpenUrl CrossRef PubMed

[30] 30.↵
Avery, L. & Horvitz, H.R. Effects of starvation and neuroactive drugs on feeding in Caenorhabditis elegans. J Exp Zool 253, 263–70 (1990).
OpenUrl CrossRef PubMed Web of Science

[31] 31.↵
Scott, E.W., Baxter, P. & Armour, J. Fecundity of anthelmintic resistant adult Haemonchus contortus after exposure to ivermectin or benzimidazoles in vivo. Res Vet Sci 50, 247–9 (1991).
OpenUrl PubMed

[32] 32.↵
Kotze, A.C. & Prichard, R.K. Anthelmintic Resistance in Haemonchus contortus: History, Mechanisms and Diagnosis. Adv Parasitol 93, 397–428 (2016).
OpenUrl

[33] 33.↵
Martin, R.J. et al. Levamisole receptors: a second awakening. Trends Parasitol 28, 289–96 (2012).
OpenUrl CrossRef PubMed

[34] 34.↵
Wada, M. et al. Isolation and characterization of a GDP/GTP exchange protein specific for the Rab3 subfamily small G proteins. J Biol Chem 272, 3875–8 (1997).
OpenUrl Abstract/FREE Full Text

[35] 35.↵
Cao, R. & Zhang, Y. The functions of E(Z)/EZH2-mediated methylation of lysine 27 in histone H3. Curr Opin Genet Dev 14, 155–64 (2004).
OpenUrl CrossRef PubMed Web of Science

[36] 36.↵
Bolajoko, M.B. et al. The basic reproduction quotient (Q0) as a potential spatial predictor of the seasonality of ovine haemonchosis. Geospat Health 9, 333–50 (2015).
OpenUrl

[37] 37.↵
Fitzpatrick, M.C. & Keller, S.R. Ecological genomics meets community-level modelling of biodiversity: mapping the genomic landscape of current and future environmental adaptation. Ecol Lett 18, 1–16 (2015).
OpenUrl CrossRef PubMed

[38] 38.↵
Doyle, S.R. et al. Population genomic and evolutionary modelling analyses reveal a single major QTL for ivermectin drug resistance in the pathogenic nematode, Haemonchus contortus. BMC Genomics 20, 218 (2019).
OpenUrl

[39] 39.↵
Blouin, M.S., Yowell, C.A., Courtney, C.H. & Dame, J.B. Host movement and the genetic structure of populations of parasitic nematodes. Genetics 141, 1007–14 (1995).
OpenUrl Abstract/FREE Full Text

[40] 40.↵
Troell, K., Engstrom, A., Morrison, D.A., Mattsson, J.G. & Hoglund, J. Global patterns reveal strong population structure in Haemonchus contortus, a nematode parasite of domesticated ruminants. Int J Parasitol 36, 1305–16 (2006).
OpenUrl CrossRef PubMed Web of Science

[41] 41.
Dey, A., Chan, C.K., Thomas, C.G. & Cutter, A.D. Molecular hyperdiversity defines populations of the nematode Caenorhabditis brenneri. Proc Natl Acad Sci U S A 110, 11056–60 (2013).
OpenUrl Abstract/FREE Full Text

[42] 42.↵
Gilleard, J.S. & Redman, E. Genetic Diversity and Population Structure of Haemonchus contortus. Adv Parasitol 93, 31–68 (2016).
OpenUrl CrossRef PubMed

[43] 43.↵
Larson, G. et al. Current perspectives and the future of domestication studies. Proc Natl Acad Sci U S A 111, 6139–46 (2014).
OpenUrl Abstract/FREE Full Text

[44] 44.↵
Pleurdeau, D. et al. "Of sheep and men": earliest direct evidence of caprine domestication in southern Africa at Leopard Cave (Erongo, Namibia). PLoS One 7, e40340 (2012).
OpenUrl CrossRef PubMed

[45] 45.↵
Campbell, M.C. & Tishkoff, S.A. The evolution of human genetic and phenotypic variation in Africa. Curr Biol 20, R166–73 (2010).
OpenUrl CrossRef PubMed Web of Science

[46] 46.↵
Chessa, B. et al. Revealing the history of sheep domestication using retrovirus integrations. Science 324, 532–6 (2009).
OpenUrl Abstract/FREE Full Text

[47] 47.↵
Tapio, M. et al. Sheep mitochondrial DNA variation in European, Caucasian, and Central Asian areas. Mol Biol Evol 23, 1776–83 (2006).
OpenUrl CrossRef PubMed Web of Science

[48] 48.
Kijas, J.W. et al. Genome-wide analysis of the world's sheep breeds reveals high levels of historic mixture and strong recent selection. PLoS Biol 10, e1001258 (2012).
OpenUrl CrossRef PubMed

[49] 49.↵
Spangler, G.L. et al. Whole genome structural analysis of Caribbean hair sheep reveals quantitative link to West African ancestry. PLoS One 12, e0179021 (2017).
OpenUrl

[50] 50.↵
Naves, M., Alexandre, G., Leimbacher, F., Mandonnet, N. & Menendez-buxadera, A. Les ruminants domestiques de la Caraïbe: le point sur les ressources génétiques et leur exploitation. INRA Prod. Anim. 14, 181–192 (2001).
OpenUrl

[51] 51.↵
Roman, A. Saint Malo au temps des négriers, 357 (Karthala, 2003).

[52] 52.↵
Pereira, F. et al. Genetic signatures of a Mediterranean influence in Iberian Peninsula sheep husbandry. Mol Biol Evol 23, 1420–6 (2006).
OpenUrl CrossRef PubMed Web of Science

[53] 53.↵
Redman, E. et al. The emergence of resistance to the benzimidazole anthlemintics in parasitic nematodes of livestock is characterised by multiple independent hard and soft selective sweeps. PLoS Negl Trop Dis 9, e0003494 (2015).
OpenUrl

[54] 54.↵
Vercruysse, J. et al. Is anthelmintic resistance a concern for the control of human soil-transmitted helminths? Int J Parasitol Drugs Drug Resist 1, 14–27 (2011).
OpenUrl

[55] 55.↵
Laing, R., Gillan, V. & Devaney, E. Ivermectin – Old Drug, New Tricks? Trends Parasitol 33, 463–472 (2017).
OpenUrl CrossRef

[56] 56.↵
Osei-Atweneboana, M.Y. et al. Phenotypic evidence of emerging ivermectin resistance in Onchocerca volvulus. PLoS Negl Trop Dis 5, e998 (2011).
OpenUrl CrossRef PubMed

[57] 57.↵
Doyle, S.R. et al. Genome-wide analysis of ivermectin response by Onchocerca volvulus reveals that genetic drift and soft selective sweeps contribute to loss of drug sensitivity. PLoS Negl Trop Dis 11, e0005816 (2017).
OpenUrl

[58] 58.↵
Rose, H. et al. Climate-driven changes to the spatio-temporal distribution of the parasitic nematode, Haemonchus contortus, in sheep in Europe. Glob Chang Biol 22, 1271–85 (2016).
OpenUrl

[59] 59.↵
van Dijk, J., David, G.P., Baird, G. & Morgan, E.R. Back to the future: developing hypotheses on the effects of climate change on ovine parasitic gastroenteritis from historical data. Vet Parasitol 158, 73–84 (2008).
OpenUrl PubMed

[60] 60.↵
Rajpurohit, S., Oliveira, C.C., Etges, W.J. & Gibbs, A.G. Functional genomic and phenotypic responses to desiccation in natural populations of a desert drosophilid. Mol Ecol 22, 2698–715 (2013).
OpenUrl CrossRef Web of Science

[61] 61.↵
Sharma, V., Kohli, S. & Brahmachari, V. Correlation between desiccation stress response and epigenetic modifications of genes in Drosophila melanogaster: An example of environment-epigenome interaction. Biochim Biophys Acta 1860, 1058–1068 (2017).
OpenUrl

[62] 62.↵
Coustham, V. et al. Quantitative modulation of polycomb silencing underlies natural variation in vernalization. Science 337, 584–7 (2012).
OpenUrl Abstract/FREE Full Text

[63] 63.↵
Cassada, R.C. & Russell, R.L. The dauerlarva, a post-embryonic developmental variant of the nematode Caenorhabditis elegans. Dev Biol 46, 326–42 (1975).
OpenUrl CrossRef PubMed Web of Science

[64] 64.↵
Blitz, N.M. & Gibbs, H.C. Studies on the arrested development of Haemonchus contortus in sheep. I. The induction of arrested development. Int J Parasitol 2, 5–12 (1972).
OpenUrl CrossRef PubMed

[65] 65.↵
Wood, D.E. & Salzberg, S.L. Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol 15, R46 (2014).
OpenUrl CrossRef PubMed

[66] 66.↵
Li, B.W. et al. Transcriptomes and pathways associated with infectivity, survival and immunogenicity in Brugia malayi L3. BMC Genomics 10, 267 (2009).
OpenUrl CrossRef PubMed

[67] 67.↵
DePristo, M.A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet 43, 491–8 (2011).
OpenUrl CrossRef PubMed Web of Science

[68] 68.↵
Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. arXiv preprint arXiv:1207.3907 [q-bio.GN](2012).

[69] 69.↵
Korneliussen, T.S., Albrechtsen, A. & Nielsen, R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinformatics 15, 356 (2014).
OpenUrl CrossRef PubMed

[70] 70.↵
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res 20, 1297–303 (2010).
OpenUrl Abstract/FREE Full Text

[71] 71.↵
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–9 (2009).
OpenUrl CrossRef PubMed Web of Science

[72] 72.↵
Rensch, T., Villar, D., Horvath, J., Odom, D.T. & Flicek, P. Mitochondrial heteroplasmy in vertebrates using ChIP-sequencing data. Genome Biol 17, 139 (2016).
OpenUrl

[73] 73.↵
Howe, D.K., Baer, C.F. & Denver, D.R. High rate of large deletions in Caenorhabditis briggsae mitochondrial genome mutation processes. Genome Biol Evol 2, 29–38 (2009).
OpenUrl

[74] 74.↵
Konrad, A. et al. Mitochondrial Mutation Rate, Spectrum and Heteroplasmy in Caenorhabditis elegans Spontaneous Mutation Accumulation Lines of Differing Population Size. Mol Biol Evol 34, 1319–1334 (2017).
OpenUrl

[75] 75.↵
Hazkani-Covo, E., Zeller, R.M. & Martin, W. Molecular poltergeists: mitochondrial DNA copies (numts) in sequenced nuclear genomes. PLoS Genet 6, e1000834 (2010).
OpenUrl CrossRef PubMed

[76] 76.↵
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–8 (2011).
OpenUrl CrossRef PubMed Web of Science

[77] 77.↵
Zheng, X. et al. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28, 3326–8 (2012).
OpenUrl CrossRef PubMed Web of Science

[78] 78.↵
R Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, Vienna, 2016).

[79] 79.↵
Weir, B.S. & Cockerham, C.C. Estimating F-Statistics for the Analysis of Population Structure. Evolution 38, 1358–1370 (1984).
OpenUrl CrossRef PubMed Web of Science

[80] 80.↵
Svardal, H. et al. Ancient hybridization and strong adaptation to viruses across African vervet monkey populations. Nat Genet 49, 1705–1713 (2017).
OpenUrl CrossRef

[81] 81.↵
Chang, C.C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
OpenUrl CrossRef PubMed

[82] 82.↵
Paradis, E., Claude, J. & Strimmer, K. APE: Analyses of Phylogenetics and Evolution in R language. Bioinformatics 20, 289–90 (2004).
OpenUrl CrossRef PubMed Web of Science

[83] 83.↵
Edgar, R.C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32, 1792–7 (2004).
OpenUrl CrossRef PubMed Web of Science

[84] 84.↵
Talavera, G. & Castresana, J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol 56, 564–77 (2007).
OpenUrl CrossRef PubMed Web of Science

[85] 85.↵
Keane, T.M., Creevey, C.J., Pentony, M.M., Naughton, T.J. & McLnerney, J.O. Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified. BMC Evol Biol 6, 29 (2006).
OpenUrl CrossRef PubMed

[86] 86.↵
Guindon, S. & Gascuel, O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol 52, 696–704 (2003).
OpenUrl CrossRef PubMed Web of Science

[87] 87.↵
Skotte, L., Korneliussen, T.S. & Albrechtsen, A. Estimating individual admixture proportions from next generation sequencing data. Genetics 195, 693–702 (2013).
OpenUrl Abstract/FREE Full Text

[88] 88.↵
Schiffels, S. & Durbin, R. Inferring human population size and separation history from multiple genome sequences. Nat Genet 46, 919–25 (2014).
OpenUrl CrossRef PubMed

[89] 89.↵
Malaspinas, A.S. et al. A genomic history of Aboriginal Australia. Nature 538, 207–214 (2016).
OpenUrl CrossRef PubMed

[90] 90.↵
McVean, G.A. & Cardin, N.J. Approximating the coalescent with recombination. Philos Trans R Soc Lond B Biol Sci 360, 1387–93 (2005).
OpenUrl CrossRef PubMed

[91] 91.↵
Kingman, J.F.C. The coalescent. Stochastic Processes and their Applications 13, 235–248 (1982).
OpenUrl CrossRef

[92] 92.↵
Emery, D.L., Hunt, P.W. & Le Jambre, L.F. Haemonchus contortus: the then and now, and where to from here? Int J Parasitol 46, 755–769 (2016).
OpenUrl CrossRef PubMed

[93] 93.↵
Gutenkunst, R.N., Hernandez, R.D., Williamson, S.H. & Bustamante, C.D. Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS Genet 5, e1000695 (2009).
OpenUrl CrossRef PubMed

[94] 94.↵
Portik, D.M. et al. Evaluating mechanisms of diversification in a Guineo-Congolian tropical forest frog using demographic model selection. Mol Ecol 26, 5245–5263 (2017).
OpenUrl

[95] 95.↵
Coffman, A.J., Hsieh, P.H., Gravel, S. & Gutenkunst, R.N. Computationally Efficient Composite Likelihood Statistics for Demographic Inference. Mol Biol Evol 33, 591–3 (2016).
OpenUrl CrossRef PubMed

[96] 96.↵
Hudson, R.R. Generating samples under a Wright-Fisher neutral model of genetic variation. Bioinformatics 18, 337–8 (2002).
OpenUrl CrossRef PubMed Web of Science

[97] 97.↵
Suchard, M.A. et al. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol 4, vey016 (2018).
OpenUrl CrossRef PubMed

[98] 98.↵
Drummond, A.J., Rambaut, A., Shapiro, B. & Pybus, O.G. Bayesian coalescent inference of past population dynamics from molecular sequences. Mol Biol Evol 22, 1185–92 (2005).
OpenUrl CrossRef PubMed Web of Science

[99] 99.↵
Howe, K.L., Bolt, B.J., Shafie, M., Kersey, P. & Berriman, M. WormBase ParaSite – a comprehensive resource for helminth genomics. Mol Biochem Parasitol 215, 2–10 (2017).
OpenUrl CrossRef PubMed

[100] 100.↵
Rufener, L., Kaminsky, R. & Maser, P. In vitro selection of Haemonchus contortus for benzimidazole resistance reveals a mutation at amino acid 198 of beta-tubulin. Mol Biochem Parasitol 168, 120–2 (2009).
OpenUrl PubMed

[101] 101.↵
Korneliussen, T.S., Moltke, I., Albrechtsen, A. & Nielsen, R. Calculation of Tajima's D and other neutrality test statistics from low depth next-generation sequencing data. BMC Bioinformatics 14, 289 (2013).
OpenUrl CrossRef PubMed

[102] 102.↵
Hudson, P.J. et al. Trophic interactions and population growth rates: describing patterns and identifying mechanisms. Philos Trans R Soc Lond B Biol Sci 357, 1259–71 (2002).
OpenUrl CrossRef PubMed

[103] 103.↵
Martin, S.H. & Van Belleghem, S.M. Exploring Evolutionary Relationships Across the Genome Using Topology Weighting. Genetics 206, 429–438 (2017).
OpenUrl Abstract/FREE Full Text

[104] 104.↵
Köppen, W. Die Wärmezonen der Erde, nach der Dauer der heissen, gemässigten und kalten Zeit und nach der Wirkung der Wärme auf die organische Welt betrachtet Meteorol. Z. 1, 215–226 (1884).
OpenUrl

[105] 105.↵
Chen, D. & Chen, H.W. Using the Köppen classification to quantify climate variation and change: An example for 1901–2010. Environmental Development 6, 69–79 (2013).
OpenUrl

[106] 106.↵
Ellis, N., Smith, S.J. & Pitcher, C.R. Gradient forests: calculating importance gradients on physical predictors. Ecology 93, 156–68 (2012).
OpenUrl CrossRef PubMed

[107] 107.↵
Fick, S. E. & Hijmans, R.J. WorldClim 2: new 1-km spatial resolution climate surfaces for global land areas. International Journal of Climatology 37, 4302–4315 (2017).
OpenUrl

[108] 108.↵
Frichot, E., Schoville, S.D., Bouchard, G. & Francois, O. Testing for associations between loci and environmental gradients using latent factor mixed models. Mol Biol Evol 30, 1687–99 (2013).
OpenUrl CrossRef PubMed Web of Science

[109] 109.↵
Alexa, A., Rahnenfuhrer, J. topGO: Enrichment Analysis for Gene Ontology. (2016).