Experimental Swap of 's Assortative Mating Preferences Demonstrates Key Role of X-Chromosome Divergence Island in Incipient Sympatric Speciation
Anopheles gambiae is the most important vector of malaria in Africa. This species is undergoing speciation and a number of subpopulations have been identified which can produce viable hybrid offspring but are reproductively isolated through assortative mating and ecological adaptation. This complex structure provides an ideal system for studying the unique genetic and behavioural processes required for speciation. Anopheles gambiae’s subpopulations differ genetically in limited regions of their genomes called islands of speciation. Theoretical studies predict that these islands, characterized by restricted genetic rearrangements, may protect genes of assortative mating between emerging species, and are fundamental to the speciation process. We set out to test this prediction by performing complex genetic crosses between the sister species Anopheles coluzzii and Anopheles gambiae s.s. and creating recombinant strains differing only at their X-chromosome island of speciation. We show through behavioural studies that recombinant females consistently mated with matching island-type males thereby associating assortative mating genes with the X-island of divergence. By sequencing the genetic code of the recombinant strains and natural populations, we could confirm these findings and identify candidate assortative mating genes. These findings suggest an important role of divergence islands for the genetic and behavioural processes associated with speciation.
Published in the journal:
. PLoS Genet 11(4): e32767. doi:10.1371/journal.pgen.1005141
Category:
Research Article
doi:
https://doi.org/10.1371/journal.pgen.1005141
Summary
Anopheles gambiae is the most important vector of malaria in Africa. This species is undergoing speciation and a number of subpopulations have been identified which can produce viable hybrid offspring but are reproductively isolated through assortative mating and ecological adaptation. This complex structure provides an ideal system for studying the unique genetic and behavioural processes required for speciation. Anopheles gambiae’s subpopulations differ genetically in limited regions of their genomes called islands of speciation. Theoretical studies predict that these islands, characterized by restricted genetic rearrangements, may protect genes of assortative mating between emerging species, and are fundamental to the speciation process. We set out to test this prediction by performing complex genetic crosses between the sister species Anopheles coluzzii and Anopheles gambiae s.s. and creating recombinant strains differing only at their X-chromosome island of speciation. We show through behavioural studies that recombinant females consistently mated with matching island-type males thereby associating assortative mating genes with the X-island of divergence. By sequencing the genetic code of the recombinant strains and natural populations, we could confirm these findings and identify candidate assortative mating genes. These findings suggest an important role of divergence islands for the genetic and behavioural processes associated with speciation.
Introduction
Unravelling the genomic processes underlying sympatric speciation, the evolution of new species from a single ancestral species within the same geographical region, is fundamental to our understanding of biodiversity. At the core of this interest is the search for distinct genomic signatures that can help us understand what is thought to be a relatively narrow and unlikely set of genetic and ecological conditions facilitating the emergence and divergence of two gene pools from an originally panmictic population. For sympatric speciation with gene flow to occur, divergent selection acting on locally adapted genes has to overcome the homogenizing effects of migration and recombination [1,2]. Theoretical models of sympatric speciation have long recognised that this can only occur under a restricted set of conditions in which the genomic architecture often plays a major role [1,3,4]. Features of the genome such as chromosomal inversions and peri-centromeric regions that suppress recombination and link together genes of pre-mating isolation and ecological adaptation genes are predicted to facilitate sympatric speciation [1,5,6,7]. In addition, hemizygosity and lower recombination rates are thought to predispose sex chromosomes to the more rapid accumulation of genes of pre and post-mating isolation [8].
These general predictions have found empirical support from a limited number of studies designed to map genes involved in speciation [9,10] and/or to detect loci under divergent selection across genomes [11,12,13,14]. Currently this evidence concerns almost exclusively species already separated by both pre-zygotic and intrinsic post-zygotic reproductive isolation in which teasing out the genomic signature of the onset of speciation from the genomic processes that follow post-mating reproductive isolation constitutes a major challenge [1,3,4]. The Anopheles gambiae complex comprises the main vector species responsible for malaria transmission. It is also a species complex rich in cryptic taxa separated by various degrees of reproductive isolation [15,16] and could provide the ideal system for studying the genomic signature of pre-mating isolation independent of intrinsic isolation processes. The sibling species An. coluzzii and An. gambiae s.s. were until recently known as the 'M and S' molecular forms of An. gambiae in reference to diagnostic genetic differences in the ribosomal DNA regions [17]. These incipient species co-occur over large areas of West Africa and do not exhibit intrinsic post-mating barriers to reproduction [18,19]. Across much of their sympatric range their integrity is maintained by strong assortative mating [20,21] resulting in rare hybrids and low levels of genetic introgression between the two taxa [22,23]. However, in the Western coastal countries of Guinea Bissau and Senegal hybrids can occur locally at much higher frequencies [24], resulting in a large hybrid zone between the incipient sibling species and high levels of genetic introgression [22,23].
Divergence between these species is thought to be driven by larval adaptation to different types of bodies of water [25,26]. Transplant experiments have shown that An. coluzzii larvae survive better than those of An. gambiae s.s. in habitats that are more permanent and rich in aquatic predators while the reverse is true in predator-free temporary water bodies [25,27]. Adults of both species have similar feeding and resting habits and mate in swarms at dusk in villages. Swarm site segregation is thought to contribute to assortative mating [28], but the occurrence of mixed swarms at various frequencies [21,28] points towards additional conspecific recognition mechanisms, possibly based on flight tones [29,30].
Several groundbreaking studies have shown that sympatric speciation in these two incipient species probably involved the divergence of a few ‘islands of divergence' that possibly contain clusters of speciation genes and located in areas of low recombination [31,32]. These putative 'islands of speciation' include 3 pericentromeric islands of divergence located on the X, 2L and 3L chromosomes as well as smaller islands located in the vicinity of inversion breakpoints [31,32]. Perfect linkage disequilibrium between the X, 2L and 3L islands was found in samples from sympatric populations of An. coluzzii and An. gambiae s.s. from central West Africa [32]. This pattern suggested very low gene flow between the sibling species and the possibility that pericentromeric islands of divergence were merely 'incidental rather than instrumental' to the speciation process [32]. Subsequent genomic studies reinforced this view, suggesting divergence at many other loci across the genome and a more advanced stage of sympatric speciation [33]. However, recent studies have shown that the linkage disequilibrium between the pericentromeric islands breaks down to various degrees in areas with higher introgression between the sibling species [23,24]. Taken together the comparative genomics data would therefore support a model of genomic divergence in which pericentromeric divergence islands could play a major role in speciation in the face of varying levels of gene flow [22,23].
Since this view is currently subject to debate and because there are limits to the inferences that can be drawn from comparative genomics studies, we set out to demonstrate the role of divergence islands in the sympatric speciation process using an experimental functional genomics approach. We hypothesized that the largest putative 'speciation island' of the X chromosome would be a prime candidate for protecting assortative mating and ecological adaptation genes in the face of ongoing gene flow because it combines pericentromeric recombination suppression with the hemizygosity and decreased recombination typically associated with the X chromosome. Next, we selectively introgressed the S-form X-linked island of divergence of An. gambiae s.s. into an. coluzzii genetic background to create recombinant strains that shared an An. coluzzii genetic background but differed at their X-chromosome islands of speciation. Standardised assortative mating experiments were then used to test the potential association of the X-island molecular type with the mating preferences of recombinant and parental strains. These were combined with full-genome strain comparisons as well as targeted genome sequencing from sympatric populations. This strategy enabled us to identify the size of the pericentromeric region introgressed from An. gambiae s.s. into the An. coluzzii background, fixed protein-coding differences distinguishing recombinant strains, and conserved differences putatively relevant to speciation.
The results demonstrate the close association of assortative-mating genes with the X-island of speciation and thus lend support to models of speciation involving pericentric recombination suppression in these sympatric incipient species. In addition, the development of a laboratory-based model-system for studying assortative mating is an important step towards the description of key reproductive isolation genes and mechanisms responsible for pre-mating isolation among these cryptic taxa.
Results
Recombinant strains with M or S-form X-island molecular type were produced by selectively introgressing the X-island from the S form Kisumu strain of An. gambiae s.s. into the M form Mopti strain of An. coluzzii for 4 generations, followed by two generations of crosses within the introgressed strain that resulted in An. coluzzii-like recombinant strains, one fixed for the M-type X-island (RbMM strain), and one with the S-type X-island (RbSS strain). The design ensured that, outside of the X-island region, the recombinant strains with an M-type X-island and that with an S-type one shared a large genetic similarity with the Mopti An. coluzzii strain. Additionally, recombinants strains shared the maternal mitochondrial genome from An. gambiae s.s. and the Y chromosome from An. coluzzii.
Genetic characterisation of recombinant strains
Following their creation, the RbMM and RbSS recombinant strains and Mopti and Kisumu parental strains were further characterized by genotyping of their 2L and 3L pericentromeric divergence islands [32] and karyotyping of inversion polymorphisms on their 2L and 2R chromosome (Table 1). Predictably, the M-form Mopti strain exhibited M-type 2L and 3L islands in addition to its M-type X-island. It was also fixed for inversion a on chromosome 2L and polymorphic for u on 2R. Unexpectedly, the long-established S-form Kisumu strain was polymorphic at the 2L and 3L islands suggesting historical contamination with an M-form strain. The Kisumu was also polymorphic for a on 2L and standard on 2R. Amongst the recombinant strains, RbMM was M-like across all 3 islands. The RbSS was polymorphic at the 2L locus but fixed for the M-type allele at the 3L. Both recombinant strains were also polymorphic for a on 2L and u on the 2R chromosome (Table 1). Thus their genotypes and karyotypes were those expected from successful backcrossing and crossin steps.
Behavioural assays of assortative mating in recombinants strains
Next, the mating choice preferences of females and males were tested using a standardized assortative mating assay (see methods). Reciprocal experiments were conducted in which virgin females or males from the same RbMM and RbSS cohorts were given a choice between mates with matching or non-matching X-island type (Table 2). Females from the RbMM were found to mate almost exclusively assortatively (Fig 1A)(P< 0.001). Females RbSS had their mating preferences effectively swapped and mated entirely with recombinant males with matching S-type islands (Fig 1A)(P< 0.001). In contrast, males from the RbMM and RbSS recombinant strains did not significantly prefer females with matching X-islands (P = 0.284 and 0.611 respectively). Since assortative mating amongst laboratory strains from An. coluzzii and An. gambiae s.s has never been reported, female and male choosiness were also assessed in the Mopti and Kisumu parental strains. Here again, females mated significantly assortatively (P< 0.001 in both strains) but not males (P = 0.073 and 0.163)(Fig 1B and S1 Table).
Full-genome sequencing of the RbMM, RbSS and Mopti strains
The size of the X-island of divergence and flanking regions differing between the RbMM and RbSS strains was determined through genome-wide genetic differentiation (FST) scans and the occurrence of fixed coding differences between the recombinant strains. Estimates of genetic differentiation between the RbMM, RbSS and Mopti strains were calculated for 3,743,318 SNPs across the X, 2nd and 3rd chromosomes. The genome-wide FST scans showed that the RbSS differs from the RbMM and Mopti strains on chromosome X from the centromere to the reference position ~14.8Mb (Fig 2). This region covered the entire island of speciation plus a large flanking region. In addition, the RbSS and RbMM strain were genetically differentiated at a ~2Mb S-form fragment extending roughly from positions 11.5–13.5Mb (Fig 2). There were no other sizeable S-like regions detected through comparisons of the RbMM, RbSS and Mopti genomes, indicating that the selective introgression design worked as hoped for. Importantly, amongst all fixed differences observed between the RbMM and RbSS strains, non-synonymous differences inducing protein-coding changes (n = 160) were found only in the selectively-introgressed pericentromeric region (Fig 2). Fixed differences located elsewhere in the genome were either coding synonymous changes or non-coding.
Comparison with X-island from sympatric field populations
Given that the Kisumu and Mopti strains were colonized from allopatric populations over 25 and 7 years ago, some of the differences observed between the RbMM and RbSS strains could be due to genetic divergence of the original populations or result from genetic drift and inbreeding [34]. Consequently, we compared the protein-coding differences identified between RbMM and RbSS with those observed between 2 sympatric An. gambiae s.s. and An. coluzzii populations from Southern Ghana. Deep-pooled-targeted exon re-sequencing of the region extending from 17Mb to the centromere showed that, in 114 of the 117 coding differences distinguishing RbSS from RbMM, the M-form allele was fixed or nearly fixed (freq >0.95) in the field population of An. coluzzii. In the sympatric An. gambiae s.s., the alternate S-type allele was found at frequencies >0.8 in 61 of the 114 differences; and 20 of those were fixed or nearly fixed (freq >0.95) and thus conserved differences between sibling species. Conserved differences started from position ~18.1Mb (Fig 2), increased in frequency with proximity to the centromere and affected a total of 12 genes (Fig 3, Table 3).
Discussion
This is the first functional genomic study aiming to test the role of the putative X-chromosome island of speciation in pre-mating reproductive isolation between An. coluzzii and gambiae s.s. and demonstrating its potential importance in their process of incipient sympatric speciation. Albeit demonstrated here using laboratory recombinant strains, the close association observed between pre-mating isolation genes and the X-island supports the hypothesis that pericentric regions can create linkage disequilibrium (LD) and thus protect associations between genes of pre-mating isolation and ecological adaptation [7] and facilitate the onset of sympatric speciation [1,6]. The results are consistent with the notion that hemizygosity and the lower recombination rates of sex chromosomes predispose them to accumulating genes of pre and post-mating isolation [8]. In the case of the X-island of speciation, the pericentric and X-chromosome recombination suppression effects synergise, thereby further reducing recombination and promoting high LD.
Although previous studies have linked loci responsible for reproductive isolation to inversions [12,13], pericentromeric regions [35,36] and sex chromosomes [9,10,35], this is one of the few studies focusing on incipient sympatric species in which intrinsic post-reproductive barriers to reproduction are not yet established and therefore investigating the genomic architecture of pre-mating isolation independent of intrinsic post-mating processes. In sulfur butterflies Colias eurytheme and C. philodice, pre-mating isolation was associated with an inversion located on the X-chromosome [37], but these species co-occur only over a limited hybridization zone. In the hawthorn and apple-infesting races of Rhagoletis pomonella flies genomic divergence was found accentuated around 'continents of differentiation' containing inversions on autosomes [38]. However, pre-mating isolation in this model system is ecological and incidental to specialization to different host plants, hence patterns of genetic divergence, migration and gene flow are more akin to those of micro-allopatric speciation.
Paradoxically, elucidating the processes underpinning the genomic structure of speciation of An. coluzzii and An. gambiae s.s. whose sympatric range spans large areas of Western Africa has proved challenging because of the contrasted patterns of introgression and selection observed within that range [17,22,23]. The experimental functional genomics approach taken here clarifies the role of so-called speciation islands in the speciation process. The results suggest a simple genetic mechanism whereby the low-recombining pericentromeric X-island enables these incipient species to maintain their genetic integrity in Central and Eastern areas of Africa where introgression is uncommon or temporally limited [20,23] and in the hybrid zones of coastal Western Africa where gene flow is extensive [23,24]. The results are compatible with model of sympatric speciation in which the X-chromosome island played an active role in speciation that involved 'divergence hitchhiking' around key speciation loci [1,3,4]. Whether the 2L and 3L islands have played similar important roles remains to be demonstrated. In this study we found no evidence of a direct association of the 2L and 3L islands with asssortative mating since the recombinant where either homozygous or nearly homozygous M-type for these regions. Therefore it is unlikely that these islands of differentiation play a major role in conspecific mate recognition.
In terms of physical size and the number of genes it contain, the X-island of speciation was the largest of the three pericentromeric islands described between An. coluzzii and An. gambiae s.s. through genome-wide studies [31,39]. Depending on the markers and populations considered, previous studies reported it as spanning 3–5Mb and as many as 75–200 genes [39,40]. Two studies showed that recombination was reduced by as much as 16 and 35-fold near the centromere compared to elsewhere on the X chromosome [40,41]. Based on our comparisons of sympatric populations from Ghana and the recombinant strains, we estimated that the island is over 6Mb-long and extends from positions ~18.1 to 24.2Mb. Within the island, 20 unique protein-coding changes affecting 12 genes were identified between the two sibling species. Amongst these genes, 6 have putative biological functions: AGAP001002 (Toll protein) is involved in development and immunity and AGAP001033 (mab-21 like protein) in neural and sensory organ development; AGAP001050 (chondroitin polymerizing factor) and AGAP001052 (ubiquitin carboxyl-terminal hydrolase) are involved in protein secretion and proteolysis; AGAP001022 (gastrin/cholecystokinin receptor) is a receptor for peptides in the brain and gastrointestinal tract; and AGAP001025 (protein msta) is involved in negative regulation of gene expression. Some of these genes might directly be involved with mating or interact with mating genes located elsewhere in the genome. However, others could also affect genes contributing to the ecological speciation of the sibling species, such as those responsible for form-specific larval habitat use [26,42] and larval predator avoidance behaviour [25,43].
Although we did not target intergenic regions when re-sequencing the X-island, these would warrant further investigation as they might contain regulatory elements with cis and trans effects on genes within the X, 2L and 3L island and possibly elsewhere in the genome. Trans-acting effects could explain the generally poor correspondence between differentially-expressed genes and islands of divergence observed in some An. coluzzii and An. gambiae s.s. populations [44,45]. In this study, the parental Mopti and Kisumu strains did not quite mate as perfectly assortatively as the RbMM and RbSS strains with homogenized genome. This would suggest that, in the parental strains, genetic interactions between the X-island and other parts of the parental genomes might have been responsible for a more variable mating phenotype. It should be note that although fixed non-synonymous coding differences were only identified within the selectively-introgressed X-island region, a number of non-coding differences were identified elsewhere in the genome. These might either be chance fixations due to the introgression process or be genuine differences between the parental Kisumu and Mopti strains. Therefore, and albeit we consider it highly unlikely, we cannot strictly rule out the possibility that a non-coding difference may have accidentally resulted in an assortative mating mechanism unique to the Kisumu and Mopti laboratory pairing and the resulting recombinant strains model system.
Despite the methodological complexities inherent in the creation of assortatively-mating recombinant strains and their behavioural and genomic characterization, the resulting laboratory-based model holds much promise for further characterization of assortative mating in An. gambiae. The occurrence of strict female mate-choice in behavioural assays implies the perfect phenotypic expression of species-specific recognition mechanisms in females as well as perfect cues in males. Although no male-driven mate choice was detected in the laboratory, this does not mean that males do not contribute to assortative mating in nature. In the field, males are known to contribute to assortative mating via swarm spatial segregation [21,28]. Furthermore, in large outdoor enclosure mate-choice experiments in which either virgin females or males of An. coluzzii were presented with an equal number of conspecific and interspecific mates, both females and males were found to mate significantly assortatively [46].
We are currently conducting finer phenotypic characterization of the RbMM and RbSS strains in an attempt to identify con-specific mate recognition mechanisms that result in assortative mating. Importantly, the availability of the RbMM and RbSS strains and the development of a standardized laboratory-based mating assay offers the possibility of direct validation of candidate reproductive isolation genes via knockdown and/or knockout experiments combined with accurate phenotyping. Unravelling the genetic basis of mate choice and assortative mating is not only relevant to our understanding of processes of speciation, it can also play a crucial role in improving the mating behaviour of anopheline strains that are mass-reared for sterile-male release programmes and the control of malaria.
Materials and Methods
Mosquito strains and rearing
The Kisumu strain of an An. gambiae s.s. (S molecular form) was used for selective introgression of its X-island of speciation into the Mopti strain of An. coluzzii (M molecular form) genetic background. The Kisumu strain was colonised over 25-years ago from the area of Kisumu, Kenya where An. gambiae s.s. populations are from the Savanna chromosomal form characterized by the presence of the b inversion on chromosome 2R [15,47,48]. The Mopti strain was colonized in 2003 by the Lanzaro lab (UC Davis) from the village of N’Gabacoro droit near Bamako, Mali, where An. coluzzii populations are characterized by the bc and u inversion polymorphisms on 2R typical of the Mopti chromosomal form [15,47,48]. Both strains are well adapted to the laboratory and lay eggs reliably, which was the most important consideration given the complexity of the envisaged genetic crosses.
The two strains were kept at 25°C±1°C and 70–80% relative humidity and reared under standard conditions in order to achieve homogeneity in phenotypic quality [49]. Adult females were fed on horse blood using an artificial feeder (Hemotek membrane feeding system, Discovery workshops, UK) and newly emerged first instars were reared in plastic trays (34x24cm) at a density of 200 larvae per tray in 1L of water. They were fed daily on ground fish food (Tetra werk, Melle, Germany). Pupae were placed in standard 5L rearing cages for emergence and newly emerged male and female mosquitoes were kept together and with access to 5% glucose solution at all time.
Selective introgression of speciation islands
The genetic differences distinguishing the ribosomal DNA from the M molecular form An. coluzzii and S molecular form An. gambiae s.s. were originally described within the large X-chromosome island at a locus very near the centromere [50,51]. Using this marker, the island of speciation from the S molecular form Kisumu strain was introgressed into the M molecular Mopti strain background through 4 generations of selective introgression. The parental strains were first checked for possible contaminations by genotyping of the diagnostic rDNA IGS locus located in the X-chromosome island of speciation using the PCR-RFLP method developed by Fanello et al.[52]. Here and elsewhere 'M' and 'S' refer to the male genotype at the marker rDNA locus (males have one copy of the X-chromosome) and 'MM', 'SS' and 'MS' to possible homozygous and heterozygous female genotypes (females have two copies of the X-chromosome) at the same locus.
Hybrids between the two strains were created by crossing 100 M Mopti males with 100 SS Kisumu females (Fig 4). In order to obtain the 1st backcross progeny, 100 virgin MS hybrid females were mated with 100 virgin M Mopti males, resulting in male progeny of genotype M or S at the r-DNA locus and female of genotypes MM or MS (Fig 4). From generation 2 to 4, MS progeny females were backcrossed with M Mopti virgin males resulting in 4 generations of backcrossing (Fig 4). At each generation MS hybrid and MM families were obtained by bloodfeeding and setting up 80 females for individual oviposition. MS hybrid families were then distinguished from MM families by genotyping ten 2nd instar larvae reared in individual trays. The progenies of trays identified as containing MS hybrid larvae was pooled and reared under standard rearing conditions to obtain the next generation MS backcross females. Families identified as MM were discarded.
Creation of M and S X-chromosome islands recombinant strains
In order to obtain two strains with M and S X-island types but with high genetic similarity elsewhere in the genome, 2 generations of crosses within the introgressed strains were conducted. MS and MM females and M and S males from the 4th backcross were randomly mated with one another in a mixed cage (100 males and 100 females) resulting in MM or MS 6th generation families (Fig 4). Those progenies of families identified as being mixed M and S by larval genotyping which featured all possible male and female genotypes were randomly mated to one another in a mixed cage (100 males and 100 females) for a 7th cross in order to generate MM, MS and SS families. The resulting MM and SS families were pooled together in order to obtain a pair of MM and SS strains, sharing a very large proportion of the Mopti genetic background but differing at their rDNA locus and linked X-chromosome island of speciation. The whole process was done twice simultaneously to generate two MM and two SS recombinant strains. Of those four strains, two were found to be heterozygous at the rDNA locus during their behavioural characterization suggesting possible contaminations. Therefore all analyses focus on the remaining MM and SS recombinant strains referred to as RbMM and RbSS throughout the text.
Fitness of hybrid MS versus homozygous MM backcrosses
Following the first backcross generation, the ratio of MS to MM females (36/3 = 92.8%) significantly deviated from the expected 1:1 ratio (Chi-square Likelihood ratio: χ2 = 39.2, df = 1, P< 0.001) suggesting a strong hybrid advantage (S1 Fig). Thereafter no differences in the ratio of MS and MM females were found, with the percentage of MS females fluctuating from 46.2 in the 2nd (P = 0.579), to 64.8 in the 3rd (P = 0.0683), and 51.9 (P = 0.782) in the 4th backcross generations (S1 Fig). The proportion of MS to MM females in the fifth generation (1st cross within introgressed strain) did not significantly differ from the expected 3:1 MS/MM ratio (Chi-square Likelihood ratio: χ 2 = 0.19, df = 1, P = 0.664).
Genotyping of the 2L and 3L speciation islands and identification of karyotypes
Following the publication of evidence showing linkage disequilibrium between the X, 2L and 3L islands in field M and S molecular form populations from West-Africa [32], the parental strains and recombinant strains were also characterized as ‘M-like or S-like’ at the 2L and 3L islands of speciation using archived DNA and the PCR-RFLP diagnostics developed by the same authors [32]. Primers and PCR conditions were as described by the authors except for the use of more sensitive AmpliTaq Gold DNA Polymerase (Life Technologies). Polytene chromosome preparations were made from the ovaries of 10–20 semi-gravid females per strain using established protocols [53,54]. Inversions present on chromosomes 2L and 2R were scored from chromosome spread under the light microscope.
Behavioral assays of assortative mating
An assortative mating assay was developed in order to create conditions in which the mating choice preferences of females could be tested within standard laboratory breeding cages. Virgin males and females from each recombinant strain (RbMM and RbSS) were produced using our standardized rearing procedure (see first section) and kept in separate cages with access to 5% glucose solution at all times. In preliminary assortative mating experiments, 2-5-day-old virgin females from a given recombinant strain and X-chromosome island type were given a choice of 2-5-day-old recombinant males matching and non-matching their own X-island type. For each of 4 replicates, 30 females, 30 M-type and 30 S-type island males were placed in a standard 5L rearing cages with access to 5% glucose solution and given 24 hours to mate (6pm start). Infrared video recordings show that under these conditions, males initiated swarms soon after the insectary lights went off. At the end of these preliminary experiments all females were collected and stored in 70% ethanol until dissected for sperm genotyping. Across both mating combinations, the genetic analyses of the sperm extracted from the spermathecae of mated females showed that females mated preferentially with recombinant males with an X-island matching their own (S2 Table). Recombinant females with M-type X-islands mated with matching M-type X-island males on average 77% of the time (Chi-square = 15.9, n = 52, P< 0.001)(S2 Fig). Females with S-type X-island mated with recombinant males with matching S-type X-island 81% of the times (Chi-square = 19.2, n = 47, P< 0.001)(S2 Fig).
However, the mating assays were further improved in terms of percentage of assortative mating by using exactly 5-day-old virgin female and male mosquitoes (see results). The RbMM and RbSS strains and the parental Mopti and Kisumu strains were then characterized in assortative mating experiments designed to test both female and male mate preferences. For recombinants strains, recombinant females from the RbMM or RbSS strains were given a choice of virgin males with X-islands matching their own type or not. Male choice experiments were the exact reciprocal of female choice experiments. Experiments were repeated twice and the female and the male choice for both strains (4 mating combinations) tested each time with mosquitoes of the exact same cohort and age from both strains so as to avoid potential confounding factors due to variation in body size or phenotypic quality. The mating preferences of females and males from the parental Mopti and Kisumu strains were tested using the exact same methodology and experimental design with females and males choosing between individuals of the opposite sex from their own strain or not.
Dissection of mated females and genetic analysis of sperm
Females were dissected in order to determine their mating status and to determine the rDNA type of the male they mated with. Their spermatheca was isolated, broken open, and the sperm bundle transferred to a 1.5ml centrifuge tube as described in previous studies [18,20]. DNA extractions were done using the ChargeSwitch gDNA Micro Tissue Kit (Life Technologies, USA) following the manufacturer's instructions. The sperm DNA was genotyped using the PCR-RFLP diagnostic as described elsewhere [52].
Genome sequencing of recombinant and parental strains
Archived DNA from 13 Mopti, 24 RbMM and 23 RbSS individuals was amplified by multiple-displacement amplification using the Illustra GenomiPhiV2 DNA Amplification kit (GE Healthcare Bio-sciences, Piscataway, NJ), purified using a MinElute Reaction Cleanup Kit (Qiagen, Hilden, Germany) and DNA pools were sent to the Liverpool Centre for Genomic Research (CGR) for sequencing. DNA libraries were prepared according to the Illumina TruSeq DNA protocol (Illumina, San Diego, CA), multiplexed and sequenced on two lanes of an Illumina HiSeq 2000 sequencer.
Base-calling of indexed reads was performed with the program CASAVA 1.8.2 (Illumina). The reads were trimmed using the software Cutadapt 1.2.1 [55] and Sickle 1.200 [56] and mapped to the An. gambiae (PEST) reference sequence (assembly AgamP3) using Bowtie 2.1.0 [57]. Alignments were filtered to remove low mapping quality reads and redundant duplicate reads were filtered out using the Picard MarkDuplicates Tool 1.85 (http://picard.sourceforge.net). Mapped reads were locally re‐aligned around indels using the Genome Analysis Tool Kit (GATK) version 2.1.13 [58,59]. The mean coverage depth after local re‐alignment and duplicate removal of reads was equal to 25.3x for RbSS, 30.9x for RbMM and 34.1x for the Mopti parental strain. Variant detection was performed using the GATK 'UnifiedGenotyper' package [58,59] with an expected SNP heterozygosity of 0.01. An expected ploidy of 20 was used (i.e. allele frequencies calculated in increments of 5%) in order to best balance accurate sample representation and computational efficiency. Variants were further filtered using the GATK 'VariantFiltration' package [58,59]. This resulted in the characterization of ~6 million SNPs (~4.8 million passing all filters) and 900,000 indels in each of the sequenced strains. All variants were annotated using snpEff 3.1 [60]. Visual alignment inspections were performed using the Integrative Genomics Viewer (IGV)[61].
Estimates of genetic differentiation FST between two populations a and b were calculated based on SNPs satisfying GATK's most stringent 'pass' criteria and using the formula:
FST = 1- Hs/Ht where Hs is the mean heterozygosity across populations a and b and Ht, the total heterozygosity across all populations [62].
Hs = 1- Σpi2 where pi are the mean frequencies of the major and minor alleles calculated from the a or b populations and:
Ht = 1- Σpi2 with pi being SNP frequencies calculated across all five populations.
Pair-wise FST estimates of genetic differentiation were used for generating scans of genetic differentiation across chromosomes with the software JMP 10 (SAS Institute, Inc). In order to best outline the genomic region(s) introgressed from An. gambiae s.s. into An. coluzzii in the RbSS and RbMM strains, the 'spline' function was fitted over every high-confidence SNPs with δ = 2.7216.
Separate data analyses identified all unique protein-coding differences between the RbSS, RbMM and Mopti strains. These differences were checked by visual inspection and comparison of their genomes using the software IGV.
Deep targeted sequencing of field sympatric populations
Anopheles gambiae s.l. larvae were collected in Akoti-Chirano (Lat. 6° 6’ 17.08” N; Long. 2° 19’ 0.45” W) in the Bibiani-Anhwiaso-Bekwai district of the Western Region of Ghana, West Africa. Populations of An. coluzzii and An. gambiae s.s. co-occur in this deciduous forested area and are both of the 'Forest chromosomal form' characterized by standard karyotypic arrangements (no paracentric inversions) [63]. Larvae were reared to adulthood at the Department of Animal Biology and Conservation Science, University of Ghana, Legon, West Africa, stored in ethanol, and shipped to Keele University. The samples were then individually characterized as An. coluzzii (M molecular form) and An. gambiae s.s. (S molecular form) as described above. The DNA from 30 individuals of each sibling species was pooled and purified using a MinElute Reaction Cleanup Kit (Qiagen, Hilden, Germany) and DNA pools were sent to the Liverpool Centre for Genomic Research (CGR) for sequence capture and sequencing.
SureSelect RNA oligomer baits (Agilent, Santa Clara, CA) were designed to cover coding regions from position 17Mb to the centromere of the X-chromosome based on the (PEST) reference sequence (assembly AgamP3). Prior to the amplification of pre-capture libraries, DNA fragments larger than ∼700bp were removed from DNA pools using Agencourt AMPure XP beads (Beckman Coulter, Brea, CA). Following amplification and adapter-ligation, 750ng of pre‐capture libraries were hybridised to 2μl of RNA oligomer baits for ∼24 hours at 65°C. Captured libraries were amplified, indexed, pooled and sequenced on 1 lane of an Illumina HiSeq 2000. All other procedures were as described above. Within the targeted pericentric region of the X chromosome, the mean coverage depth after local re‐alignment and duplicate removal of low mapping quality and redundant reads was 200x for An. gambiae s.s., leading to the identification of 26,974 SNPS and 2,593 indels. In An. coluzzii, coverage depth was 255x and 18,772 SNPS and 1,738 indels were identified.
Statistical analyses
All statistical analyses were performed using the software JMP 10 (SAS Institute, Inc). Pearson Chi-square tests of randomness and goodness of fit (likelihood ratios) were used for detecting deviations from random mating in assortative mating experiments and to compare observed M and S X-chromosome frequencies to expected Mendelian and Hardy-Weinberg Equilibrium (HWE) proportions at different generations of the genetic crosses.
Supporting Information
Zdroje
1. Feder JL, Egan SP, Nosil P (2012) The genomics of speciation-with-gene-flow. Trends in Genetics 28: 342–350. doi: 10.1016/j.tig.2012.03.009 22520730
2. Butlin R, Debelle A, Kerth C, Snook RR, Beukeboom LW, et al. (2012) What do we need to know about speciation? Trends in Ecology & Evolution 27: 27–39.
3. Via S (2012) Divergence hitchhiking and the spread of genomic isolation during ecological speciation-with-gene-flow. Philosophical Transactions of the Royal Society B-Biological Sciences 367: 451–460.
4. Seehausen O, Butlin RK, Keller I, Wagner CE, Boughman JW, et al. (2014) Genomics and the origin of species. Nature Reviews Genetics 15: 176–192. doi: 10.1038/nrg3644 24535286
5. Rieseberg LH (2001) Chromosomal rearrangements and speciation. Trends in Ecology & Evolution 16: 351–358.
6. Coyne JA, Orr HA (2004) Speciation. Sunderland, Massachusets, USA: Sinauer Associates, Inc.
7. Servedio MR (2009) The role of linkage disequilibrium in the evolution of premating isolation. Heredity 102: 51–56. doi: 10.1038/hdy.2008.98 18813328
8. Qvarnstrom A, Bailey RI (2009) Speciation through evolution of sex-linked genes. Heredity 102: 4–15. doi: 10.1038/hdy.2008.93 18781167
9. Saether SA, Saetre G-P, Borge T, Wiley C, Svedin N, et al. (2007) Sex chromosome-linked species recognition and evolution of reproductive isolation in flycatchers. Science 318: 95–97. 17916732
10. Bailey RI, Innocenti P, Morrow EH, Friberg U, Qvarnstrom A (2011) Female Drosophila melanogaster Gene Expression and Mate Choice: The X Chromosome Harbours Candidate Genes Underlying Sexual Isolation. Plos One 6.
11. Kulathinal RJ, Stevison LS, Noor MAF (2009) The Genomics of Speciation in Drosophila: Diversity, Divergence, and Introgression Estimated Using Low-Coverage Genome Sequencing. PLOS Genetics 5.
12. Corbett-Detig RB, Hartl DL (2012) Population Genomics of Inversion Polymorphisms in Drosophila melanogaster. PLOS Genetics 8.
13. McGaugh SE, Noor MAF (2012) Genomic impacts of chromosomal inversions in parapatric Drosophila species. Philosophical Transactions of the Royal Society B-Biological Sciences 367: 422–429. doi: 10.1098/rstb.2011.0250 22201171
14. Ellegren H, Smeds L, Burri R, Olason PI, Backstrom N, et al. (2012) The genomic landscape of species divergence in Ficedula flycatchers. Nature 491: 756–760. doi: 10.1038/nature11584 23103876
15. della Torre A, Costantini C, Besansky NJ, Caccone A, Petrarca V, et al. (2002) Speciation within Anopheles gambiae—the glass is half full. Science 298: 115–117. 12364784
16. Deitz KC, Athrey G, Reddy MR, Overgaard HJ, Matias A, et al. (2012) Genetic isolation within the malaria mosquito Anopheles melas. Molecular Ecology 21: 4498–4513. doi: 10.1111/j.1365-294X.2012.05724.x 22882458
17. Coetzee M, Hunt RH, Wilkerson R, Della Torre A, Coulibaly MB, et al. (2013) Anopheles coluzzii and Anopheles amharicus, new members of the Anopheles gambiae complex. Zootaxa 3619: 246–274.
18. Tripet F, Thiemann T, Lanzaro GC (2005) Effect of seminal fluids in mating between M and S forms of Anopheles gambiae. Journal of Medical Entomology 42: 596–603. 16119548
19. Diabate A, Dabire RK, Millogo N, Lehmann T (2007) Evaluating the effect of postmating isolation between molecular forms of Anopheles gambiae (Diptera: Culicidae). Journal of Medical Entomology 44: 60–64. 17294921
20. Tripet F, Toure YT, Taylor CE, Norris DE, Dolo G, et al. (2001) DNA analysis of transferred sperm reveals significant levels of gene flow between molecular forms of Anopheles gambiae. Molecular Ecology 10: 1725–1732. 11472539
21. Dabire KR, Sawadodgo S, Diabate A, Toe KH, Kengne P, et al. (2013) Assortative mating in mixed swarms of the mosquito Anopheles gambiae s.s. M and S molecular forms, in Burkina Faso, West Africa. Medical and Veterinary Entomology 27: 298–312. doi: 10.1111/j.1365-2915.2012.01049.x 23360106
22. Weetman D, Wilding CS, Steen K, Pinto J, Donnelly MJ (2012) Gene Flow-Dependent Genomic Divergence between Anopheles gambiae M and S Forms. Molecular Biology and Evolution 29: 279–291. doi: 10.1093/molbev/msr199 21836185
23. Lee Y, Marsden CD, Norris LC, Collier TC, Main BJ, et al. (2013) Spatiotemporal dynamics of gene flow and hybrid fitness between the M and S forms of the malaria mosquito, Anopheles gambiae. Proceedings of the National Academy of Sciences of the United States of America 110: 19854–19859. doi: 10.1073/pnas.1316851110 24248386
24. Caputo B, Santolamazza F, Vicente JL, Nwakanma DC, Jawara M, et al. (2011) The "Far-West'' of Anopheles gambiae Molecular Forms. Plos One 6.
25. Diabate A, Dabire R, Heidenberger K, Crawford J, Lamp W, et al. (2008) Evidence for divergent selection between the molecular forms of Anopheles gambiae: role of predation. BMC Evolutionary Biology 8: 5. doi: 10.1186/1471-2148-8-5 18190719
26. Kamdem C, Fossog BT, Simard F, Etouna J, Ndo C, et al. (2012) Anthropogenic Habitat Disturbance and Ecological Divergence between Incipient Species of the Malaria Mosquito Anopheles gambiae. Plos One 7.
27. Gimonneau G, Bouyer J, Morand S, Besansky NJ, Diabate A, et al. (2010) A behavioral mechanism underlying ecological divergence in the malaria mosquito Anopheles gambiae. Behavioral Ecology 21: 1087–1092. 22476108
28. Diabate A, Dabire RK, Kengne P, Brengues C, Baldet T, et al. (2006) Mixed swarms of the molecular M and S forms of Anopheles gambiae (Diptera: Culicidae) in sympatric area from Burkina Faso. Journal of Medical Entomology 43: 480–483. 16739404
29. Pennetier C, Warren B, Dabire KR, Russell IJ, Gibson G (2010) "Singing on the Wing" as a mechanism for species recognition in the malarial mosquito Anopheles gambiae. Current Biology 20: 131–136. doi: 10.1016/j.cub.2009.11.040 20045329
30. Gibson G, Warren B, Russell IJ (2010) Humming in Tune: Sex and Species Recognition by Mosquitoes on the Wing. Jaro-Journal of the Association for Research in Otolaryngology 11: 527–540.
31. Turner TL, Hahn MW, Nuzhdin SV (2005) Genomic islands of speciation in Anopheles gambiae. Plos Biology 3: 1572–1578.
32. White BJ, Cheng CD, Simard F, Costantini C, Besansky NJ (2010) Genetic association of physically unlinked islands of genomic divergence in incipient species of Anopheles gambiae. Molecular Ecology 19: 925–939. doi: 10.1111/j.1365-294X.2010.04531.x 20149091
33. Reidenbach KR, Neafsey DE, Costantini C, Sagnon N, Simard F, et al. (2012) Patterns of Genomic Differentiation between Ecologically Differentiated M and S Forms of Anopheles gambiae in West and Central Africa. Genome Biology and Evolution 4: 1202–1212. doi: 10.1093/gbe/evs095 23132896
34. Baeshen R, Ekechukwu NE, Toure M, Paton D, Coulibaly M, et al. (2014) Differential effects of inbreeding and selection on male reproductive phenotype associated with the colonization and laboratory maintenance of Anopheles gambiae. Malaria Journal 13.
35. Carneiro M, Blanco-Aguiar JA, Villafuerte R, Ferrand N, Nachman MW (2010) Speciation in the european rabbit (Oryctolagus cuniculus): islands of differentiation on the x chromosome and autosomes. Evolution 64: 3443–3460. doi: 10.1111/j.1558-5646.2010.01092.x 20666840
36. Gimenez MD, White TA, Hauffe HC, Panithanarak T, Searle JB (2013) Understanding the basis of diminished gene flow between hybridizing chromosome races of the house mouse. Evolution 67: 1446–1462. doi: 10.1111/evo.12054 23617920
37. Grula JW, Taylor OR (1980) The effect of x-chromosome inheritance on mate-selection behavior in the sulfur butterflies, Colias eurytheme and Colias philodice. Evolution 34: 688–695.
38. Michel AP, Sim S, Powell THQ, Taylor MS, Nosil P, et al. (2010) Widespread genomic divergence during sympatric speciation. Proceedings of the National Academy of Sciences of the United States of America 107: 9724–9729. doi: 10.1073/pnas.1000939107 20457907
39. Stump AD, Fitzpatrick MC, Lobo NF, Traore S, Sagnon NF, et al. (2005) Centromere-proximal differentiation and speciation in Anopheles gambiae. Proceedings of the National Academy of Sciences of the United States of America 102: 15930–15935. 16247019
40. Slotman MA, Reimer LJ, Thiemann T, Dolo G, Fondjo E, et al. (2006) Reduced recombination rate and genetic differentiation between the M and S forms of Anopheles gambiae s.s. Genetics 174: 2081–2093. 17057242
41. Pombi M, Stump AD, Della Torre A, Besansky NJ (2006) Variation in recombination rate across the X chromosome of Anopheles gambiae. American Journal of Tropical Medicine and Hygiene 75: 901–903. 17123984
42. Gimonneau G, Pombi M, Choisy M, Morand S, Dabire RK, et al. (2012) Larval habitat segregation between the molecular forms of the mosquito Anopheles gambiae in a rice field area of Burkina Faso, West Africa. Medical and Veterinary Entomology 26: 9–17. doi: 10.1111/j.1365-2915.2011.00957.x 21501199
43. Gimonneau G, Pombi M, Dabire RK, Diabate A, Morand S, et al. (2012) Behavioural responses of Anopheles gambiae sensu stricto M and S molecular form larvae to an aquatic predator in Burkina Faso. Parasites & Vectors 5.
44. Cassone BJ, Mouline K, Hahn MW, White BJ, Pombi M, et al. (2008) Differential gene expression in incipient species of Anopheles gambiae. Molecular Ecology 17: 2491–2504. doi: 10.1111/j.1365-294X.2008.03774.x 18430144
45. Cassone BJ, Kamdem C, Cheng CD, Tan JC, Hahn MW, et al. (2014) Gene expression divergence between malaria vector sibling species Anopheles gambiae and An. coluzzii from rural and urban Yaounde Cameroon. Molecular Ecology 23: 2242–2259. doi: 10.1111/mec.12733 24673723
46. Paton D, Toure M, Sacko A, Coulibaly MB, Traore SF, et al. (2013) Genetic and environmental factors associated with laboratory rearing affect survival and assortative mating but not overall mating success in Anopheles gambiae sensu stricto. Plos One 8.
47. Toure YT, Petraca V, Traore SF, Coulibaly A, Maiga HM, et al. (1998) The distribution and inversion polymorphism of chromosomally recognized taxa of the Anopheles gambiae complex in Mali, West Africa. Parrasitologia 40: 477–511.
48. della Torre A, Fanello C, Akogbeto M, Dossou-yovo J, Favia G, et al. (2001) Molecular evidence of incipient speciation within Anopheles gambiae s.s. in West Africa. Insect Molecular Biology 10: 9–18. 11240632
49. Aboagye-Antwi F, Tripet F (2010) Effects of larval growth condition and water availability on desiccation resistance and its physiological basis in adult Anopheles gambiae sensu stricto. Malaria Journal 9.
50. Gentile G, Slotman M, Ketmaier V, Powell JR, Caccone A (2001) Attempts to molecularly distinguish cryptic taxa in Anopheles gambiae s.s. Insect Molecular Biology 10: 25–32. 11240634
51. Favia G, Lanfrancotti A, Spanos L, Siden-Kiamos I, Louis C (2001) Molecular Characterization of ribosomal DNA polymorphisms discriminating among chromosomal forms of Anopheles gambiae. Insect Molecular Biology 10: 19–23. 11240633
52. Fanello C, Santolamazza F, della Torre A (2002) Simultaneous identification of species and molecular forms of the Anopheles gambiae complex by PCR-RFLP. Med Vet Entomol 16: 461–464. 12510902
53. Coluzzi M (1968) Cromosomi politenici delle cellule nutrici ovariche nel complesso gambiae del genere Anopheles. Parassitologia 10 179–183.
54. Hunt RH (1973) A cytological technique for the study of Anopheles gambiae complex. Parassitologia 15: 137–139. 4788355
55. Martin M (2011) Cutadapt removes adapter sequences from high‐throughput sequencing reads. EMBnet Journal 17: 10–12.
56. Joshi NA, Fass JN (2011) Sickle: A sliding-window, adaptive, quality-based trimming tool for FastQ files (Version 1.33)[Software]. Available at https://github.com/najoshi/sickle
57. Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nature Methods 9: 357–U354. doi: 10.1038/nmeth.1923 22388286
58. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, et al. (2010) The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Research 20: 1297–1303. doi: 10.1101/gr.107524.110 20644199
59. DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, et al. (2011) A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nature Genetics 43: 491–498. doi: 10.1038/ng.806 21478889
60. Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, et al. (2012) A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w(1118); iso-2; iso-3. Fly 6: 80–92. doi: 10.4161/fly.19695 22728672
61. Thorvaldsdottir H, Robinson JT, Mesirov JP (2013) Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Briefings in Bioinformatics 14: 178–192. doi: 10.1093/bib/bbs017 22517427
62. Nei M (1987) Molecular evolutionary genetics: Columbia University Press, NY, USA.
63. Appawu MA, Baffoewilmot A, Afari EA, Nkrumah FK, Petrarca V (1994) Species composition and inversion polymorphism of the Anopheles gambiae complex in some sites of Ghana, West Africa. Acta Tropica 56: 15–23. 8203292
Štítky
Genetika Reprodukční medicínaČlánek vyšel v časopise
PLOS Genetics
2015 Číslo 4
Nejčtenější v tomto čísle
- Lack of GDAP1 Induces Neuronal Calcium and Mitochondrial Defects in a Knockout Mouse Model of Charcot-Marie-Tooth Neuropathy
- Proteolysis of Virulence Regulator ToxR Is Associated with Entry of into a Dormant State
- Frameshift Variant Associated with Novel Hoof Specific Phenotype in Connemara Ponies
- Ataxin-2 Regulates Translation in a New BAC-SCA2 Transgenic Mouse Model