Abstract
In this study, we investigated how IBD patterns shared between individuals of the same breed could be informative of its admixture level, with the underlying assumption that the most admixed breeds, i.e. the least genetically isolated, should have a much more fragmented genome. We considered 111 goat breeds (i.e. 2501 individuals) and 156 sheep breeds (i.e. 3304 individuals) from Europe, Africa and Asia, for which beadchip SNP genotypes had been performed. We inferred the breed’s level of admixture from: (i) the proportion of the genome shared by breed’s members (i.e. “genetic integrity level” assessed from ADMIXTURE software analyses), and (ii) the “AV index” (calculated from Reynolds’ genetic distances), used as a proxy for the “genetic distinctiveness”. In both goat and sheep datasets, the statistical analyses (comparison of means, Spearman correlations, LM and GAM models) revealed that the most genetically isolated breeds, also showed IBD profiles made up of more shared IBD segments, which were also longer. These results pave the way for further research that could lead to the development of admixture indicators, based on the characterization of intra-breed shared IBD segments, particularly effective as they would be independent of the knowledge of the whole genetic landscape in which the breeds evolve. Finally, by highlighting the fragmentation experienced by the genomes subjected to crossbreeding carried out over the last few generations, the study reminds us of the need to preserve local breeds and the integrity of their adaptive architectures that have been shaped over the centuries.
Similar content being viewed by others
Introduction
Whereas livestock diversity is essential for food security, the Food and Agriculture Organization of the United Nations’s (FAO) predicts the loss of one breed per month (FAO 2015). In ruminants in particular, local breeds are largely abandoned in favor of more productive ones, or are subject to unsupervised crossbreeding in order to up-grade them (Shrestha 2005). This crossing with more profitable, often exotic breeds, represents one of the major threats to the diversity of the resource, and is largely found in developing countries where state structures are less able to monitor the practices of breeders, under increasing economic pressure (Scherf et al. 2006; Boutrais 2007). Indiscriminate crossbreeding has led to a widespread loss of the original/parent indigenous breeds and to the formation of nondescript crossbreds (Bett et al. 2013), mostly incompatible with local production systems (Okeno et al. 2013), and characterized by a genetic composition probably unstable over time (Paim et al. 2020). These crossbreeding practices favor genetic homogenization (Ouchene-Khelifi et al. 2018; Belabdi et al. 2019) via the loss of rare or specific variants, and induce disruption of co-adapted gene complexes, to the detriment of the integrity and the viability of locally adapted populations (Todesco et al. 2016; Zhang et al. 2019; Ågren et al. 2019).
Several software programs, assessing the genetic structure of populations, can be used to identify crossbreeding practices, by estimating individual ancestry from independent multilocus SNP genotype datasets, i.e. STRUCTURE (Lawson et al. 2012), ADMIXTURE (Alexander and Novembre 2009), PCAdapt (Luu et al. 2017), etc. Whatever the underlying algorithms, the different programs are designed to analyze the differences in the distribution of genetic variants between populations. Hence, these methods imply that if a population (A) is intensively crossed with a population (B) that does not appear in the dataset, and if (A) and (B) have no genetic interaction with the other populations of the dataset, then (A) will appear as not admixed. In developing countries, it can be very challenging to build datasets that are representative of the complexity of field practices. Indeed, the ideal working conditions would imply significant financial means, allowing to palliate the difficulties linked to the sometimes extreme climatic conditions, to the complex geography, to the political tensions, etc., and especially the ability to collaborate with local specialists of the breeding issue. As it is precisely in developing countries that it is essential to detect crossbreeding practices affecting the indigenous breeds (FAO 2007, 2015) particularly adapted to local conditions (environment and management) (Hoffmann 2013), it seems of interest to find an admixture indicator that could remain effective even in the absence of the breeds, involved in the crosses, within the dataset considered.
A haplotype, as a particular combination of alleles on the same chromosome inherited together, captures Linkage Disequilibrium (LD) among genotypes at linked loci. It constitutes a rich but largely underexploited information in the field of conservation genomics, especially since haplotype data could enhance our understanding of genetic admixture (Leitwein et al. 2020). From Fisher (1949, 1954), several scientists including Barton and Bengtsson (1986), Baird et al. (2003), Janzen et al. (2018), have been interested in partitioning admixed individual’s genomes into blocks originating from different ancestral populations. Indeed, crossbreeding and more generally “hybridization” leads to introgression of migrant chromosomes into a receiving genetic background. The lengths of the “migrant tracts” or admixture “chunks” (Falush et al. 2003) will become progressively shorter over generations (Pool and Nielsen 2009). The admixed genomes can be depicted as a mosaic of local ancestry tracts from differentiated populations (Liang and Nielsen 2014). Hybridization induces the fragmentation of the genomic blocks of contiguous ancestry through genetic recombination, which leads to an exchange of genetic material between homologous chromosomes. The breakdown of ancestral haploblocks (i.e. contiguous ancestry blocks), via the introduction of distinct haplotypes will increase the occurrence of mosaics in the genome (Freedman et al. 2016; Aliloo et al. 2020) and reduce the length of Identity-By-Decent (IBD) tracts (Leutenegger et al. 2003).
IBD refers to identical DNA segments inherited from a recent common ancestor without recombination, i.e. IBD is the shared inheritance of an identical portion of the genome between two individuals, providing direct evidence of genetic relatedness. This differs from identity by state (IBS), in which part of the genomes of two individuals may appear to be identical, but not necessarily as a result of recent common inheritance (Browning and Browning 2012). The length and proportion of shared IBD segments serve as an indicator of the age of the most recent common ancestor. The principle underlying is that long haplotypes shared between individuals are more likely, statistically speaking, to be the consequence of a kinship due to a common, deep-rooted history of the population rather than to random recombination or mutation (Browning 2008). The issue of detecting identical segments by descent (IBD) has aroused renewed interest, as it offers unprecedented possibilities for the study of population history and genealogy (Tang et al. 2022). Indeed, patterns of IBD segment sharing between groups of individuals in the same population reveal the population demographic history, recent effective population size and rates of migration. In particular, IBD segments have been used to detect introgression in both animals (Bosse et al. 2014) and plants (Ferdy and Austerlitz 2002).
In this study, we exploit the availability of large SNP genotyped datasets in goats and sheep to investigate the relationship between admixture, i.e. gene flow between populations, and patterns of IBD-sharing, in breeds of small ruminants. We inferred admixture level from (i) the proportion of the genome shared by breed’s members (“genetic integrity level”), obtained with the ADMIXTURE software, and (ii) the “AV index”, calculated from Reynold’s genetic distances (i.e. distances derived from Wright fixation index, Wright 1965) to identify the level of “genetic originality” of each breed in a dataset. The hypothesis was that admixed breeds, i.e. breeds that are the least isolated from a genetic point of view, should have a much more fragmented genome and thus exhibit shorter and fewer shared IBD segments. In this case, the characterization of intra-breed shared IBD segments could be used as an indicator of admixture level, which would not imply the knowledge of the whole genetic landscape in which the breed evolves. It would be extremely valuable in detecting efficiently and quickly the breeds in danger due to crossbreeding.
Material and methods
Datasets
We used the AdaptMap dataset, including goat breeds from Europe (40), Asia (19) and Africa (52) (see details in Table 1 and Supplementary Table 1), genotyped with the Caprine SNP50 BeadChip (Bertolini et al. 2018, available via Dryad: https://doi.org/10.5061/dryad.v8g21pt), for 53,547 SNPs. For sheep, we analyzed European (73), Asian (50) and African (33) breeds (see details in Table 2 and Supplementary Table 2). We used the following datasets : (i) French breeds obtained with the Illumina Ovine HD SNP Beadchip (Rochus et al. 2018, Zenodo repository https://doi.org/10.5281/zenodo.237116); (ii) Italian and Spanish breeds (Ciani et al. 2020, available at https://doi.org/10.23644/uu.8947346); (iii) Asian and North/Central European breeds (Kijas et al. 2012, available at http://www.sheephapmap.org); (iv) Chinese breeds (Zhao et al. 2017); (v) Ethiopian breeds (Edea et al. 2017, available at www.animalgenome.org/repository/pub/KORE2017.1122; Ahbara et al. 2019, available at www.animalgenome.org/repository/pub/NOTT2018.0423; Amane et al. 2020) and (vi) South African sheep breeds (Dzomba et al. 2020, available at osf.io/ceup6/?view_only=a1959659de5f4d5d9bbb1c607b2d83b6; Molotsi et al. 2017). SNP data from French breeds were extracted from the 600 K variation using the Ovine SNP50 BeadChip coordinates of SNPs on the OAR v3.1 reference genome assembly using Vcftools (Danecek et al. 2011). The other sheep datasets were obtained with the Illumina Ovine SNP50 BeadChip. SNPs and animals were pruned with PLINK v1.07 (Purcell et al. 2007) using the following filtering thresholds: (i) SNP call rate ≤ 97%; (ii) SNP minor allele frequency (MAF) ≤ 1%; (iii) animals displaying ≥ 10% of missing genotypes. After filtration of the merged datasets, we retained 50,220 genotypes for 2501 goats and 39,893 genotypes for 3304 sheep. The R-package Hierfstat (Goudet 2005) was used to estimate FIS (mean and 95% confidence interval) by population.
IBD sharing pattern
BEAGLE 4.1 (Browning and Browning 2007) was used to detect IBD segments that define haplotypes originating from a recent and single common ancestor. When detecting an IBD segment, it must be sufficiently long to ensure that it is not an aggregate of several short IBD segments from different ancient common ancestors. The first part of the algorithm, based on probabilistic methods, is focused on data phasing. In a second step, the phased haplotypes are used to build a haplotype frequency model and, for each shared candidate IBD segment, the LOD score, which is the log (base 10) of the likelihood ratio, is calculated. Candidate segments whose LOD score is above a specified threshold are flagged as IBD segments. Segments with a LOD score <4 and a length shorter than 0.5 cM were excluded (knowing that Beagle assumes a constant recombination rate of 1 cM per Mb), and the ibdtrim parameter was set to 40, according to recommendation of Browning and Browning (2007). We used in-house python 3 (Van Rossum and Drake 1995) script (available on request) to obtain the mean length and number of IBD segments shared by individuals of the same breed.
ADMIXTURE analyses
The approaches most frequently used to describe population structure are principal component analysis (Patterson et al. 2006) and admixture proportion inference (Lawson et al. 2012; Alexander and Novembre 2009). Whereas principal component analysis reduces a multidimensional dataset to a much more restricted number of dimensions, with admixture proportion inference, individuals in a sample are modelized as having a fraction of their genome deriving from each of several source populations. The basic assumption of this model is that individuals are members of a set of K discrete groups, each with a specific allele frequency, knowing that individuals can belong fractionally to each group. The aim is to infer the proportions of ancestry in each source population. This model is particularly well suited to the analysis of ruminant breeds, which have developed over time to form populations with specific evolutionary histories linked to a given territory and according to the rules of a pastoral social group (Serranito et al. 2021a). These patterns of genetic uniqueness have been disrupted over the last few decades by agricultural intensification, sometimes leading to recent admixtures, which are detected with great relevance by these approaches, making them widely used (Decker et al. 2014; Upadhyay et al. 2019). ADMIXTURE software is a clustering-based approach for maximum likelihood estimation of individual ancestry, which analyses a dataset of independent multilocus SNP genotypes (Alexander et al. 2009). The first step before using the software is, hence, to filter SNPs on the basis of pairwise LD to produce a reduced set of more independent markers. SNP pruning was carried out using the –indep option of PLINK with the following parameters: 50 SNPs per window, a shift of five SNPs between windows, and a variation inflation factor’s threshold of two (corresponding to R2 > 0.5). ADMIXTURE was run with K = 2 to x, with x corresponding to the total number of breeds considered in the dataset. For each value of K, 10 independent runs were performed. The program CLUMPAK (Kopelman et al. 2015) was used to analyze the multiple independent runs at a single K and visualize the results. For K = x (with x corresponding to the total number of breeds considered in the dataset), the proportion of membership for each predefined breed in each cluster was annotated. These results allowed defining two groups: (i) “admixed” breeds were defined as breeds sharing less than 85% of their genome with all the individuals of the breeds in the majority cluster (Q ≤ 0.85), and breeds were classified as (ii) “slightly admixed” if more than 85% of their genome was shared by all members of the breed for the main cluster (Q > 0.85). The value of 0.85 corresponds approximately to the third quartile of the sheep and goat distributions for the “genetic integrity level” variable (defined below); it was thus chosen as a threshold favoring statistically comparable datasets between the “admixed” and “slightly admixed” groups. We also characterized each breed by a “genetic integrity level” defined as the membership proportion for the main cluster (i.e. Q-value of the main cluster). The PCAdapt R package (Privé et al. 2020) was used to perform principal component analysis (PCA) to visualize the variability available within the different datasets (Supplementary Figures 1-2).
Breed “originality” index
Pavoine et al. (2005) have adapted methods initially developed from phylogenetic trees to measure the “originality” of species and populations. From a biodiversity perspective, “originality” (i.e. distinctiveness or isolation) represents the total contribution of the species or population to the biodiversity of the whole (Pavoine et al. 2005). We used the R package ADIV, to quantify the originality of the breeds in the different datasets, or in other words the strength of the genetic flow linking them to other breeds, via dissimilarity indices. Phylogenetic dissimilarities can be calculated, for example as the sum of the branch lengths of the shortest path linking the two groups considered on the tree (Pavoine et al. 2017). The AV index, for “average”, obtained by the “distinctDis” function, was retained, representing the average dissimilarity between a focal breed and all the others (Eiswerth and Haney 1992; Ricotta 2004). This index was calculated from a pairwise matrix based on Reynolds’ genetic distances (Reynolds et al. 1983), which was used as the dissimilarity matrix. Higher values of AV (towards 1) indicate greater originality, i.e. greater genetic isolation from other breeds, or genetic specificity, while values close to zero indicate strong genetic flow between breeds. NeighborNet graphs based on Reynolds’ genetic distances were constructed using SplitsTree (Huson and Bryant 2006), to visualize the “originality” of breeds within the different datasets, through branch length and arrangement (see Supplementary Figures 3-4).
Statistical analyses
We used Linear Model (LM) and Generalized Additive Model (GAM) to study the weight of IBD variables (number and length) in the prediction of “AV index” and “genetic integrity level”. The different models were compared in terms of performance with chi-square tests. GAM is a nonparametric extension of the generalized linear model, which can deal directly with nonlinear relationships between response variables and multiple explanatory variables. We use the MGCV R-package to construct and test the models (Pedersen et al. 2018; Wood 2011). The adjusted R2, defined as the proportion of variance explained by the model, was used as an indicator of the model’s relevance.
Student’s t-tests were used to compare mean IBD segment lengths and numbers in the “admixed” and “slightly admixed” groups, i.e. according to the “admixture status”. Spearman’s correlation coefficient was used to assess the relationship between the number or length of IBD segments and the “genetic integrity level” or the “AV index”. To compare characteristics (mean “genetic integrity levels”, mean “AV index”, etc.) of the different datasets (African, Asian, and European) ANOVA tests were performed followed by post-hoc Tukey’s tests (alpha = 0.05).
We used chromoMap (Anand and Rodriguez Lopez 2022), an R package, to visualize and map chromosomal feature with known coordinates. The “aggregate_func” argument was parameterized with the “sum” function to represent the set of haplotype blocks shared between pairs of individuals. The software R 4.0.5 (R Core Team 2022) was used for the analyses, with the ggplot2 (Wickham 2016) package, for graphic representations.
Results
Datasets characteristics
For goats, we analyzed 111 breeds via 2501 individuals, including 40 breeds from Europe, 52 from Africa and 19 from Asia. For sheep, we considered 156 breeds, via 3304 individuals, including 74 breeds from Europe, 33 breeds from Africa and 49 from Asia. We randomly kept only 30 individuals, for breeds with a large number of individuals, and we removed breeds represented by less than eight individuals (to ensure accuracy of the results, Browning and Browning 2007). On average we considered 22.53 individuals per breed (s.d. = 6.42) for goats and 21.18 individuals per breed (s.d. = 5.35) for sheep (see details in Supplementary Tables 1-2). For goats, the mean FIS was 0.016 (s.d. = 0.036) and FIS per population ranged from −0.076 to 0.173. The Irish breed, Old Irish goat cross (OIGX) and the Thyolo breed from Malawi (THY), were the only breeds showing FIS values above 0.10, respectively at 0.173 and 0.112. For sheep, the mean FIS was 0.015 (s.d. = 0.039) and FIS by population ranged from −0.092 to 0.198. The South African Dorper breed (DOP, FIS = 0.126), the Asian Bengladeshi breed (BGE, FIS = 0.147), and the French Ouessant breed (OUE, FIS = 0.177) showed FIS values above 0.1 (Supplementary Table 3-4).
Links between admixture levels and IBD sharing patterns
Taking all datasets together, 72.30 and 62.18% of goat and sheep breeds, respectively, were classified as “admixed” (Tables 1, 2, details in Supplementary Tables 1-2 and Supplementary Figures 1-2). It is noteworthy that all the Spanish sheep breeds and 95% of the North and West African goat breeds were classified as “admixed”.
For goats (Table 1), the statistical tests revealed no significant difference between the average “genetic integrity level” of European and Asian breeds, but a trend towards higher genetic integrity level for European breeds (mean = 0.75, s.d. = 0.19) than for African breeds (mean = 0.66, s.d. = 0.21), was detected (p value = 0.07). For sheep datasets (Table 2), the average integrity level was higher for European breeds (0.81, s.d. = 0.20) than for Asian or African breeds (mean around 0.65, s.d. around 0.20), i.e. p values < 0.01 for ANOVA and Tukey post hoc tests.
For goats the mean “AV index” was 0.092 (s.d. = 0.05) and ranged from 0.028 to 0.353. European breeds showed significantly higher mean AV value (mean = 0.11, s.d = 0.06) than African breeds (mean = 0.07, s.d. = 0.04), i.e. p values < 0.001 for ANOVA and Tukey post hoc tests. For sheep the mean “AV index” was 0.097 (s.d. = 0.06) and ranged from 0.017 to 0.293. European breeds showed significantly higher mean AV value (mean = 0.11, s.d. = 0.07) than Asian breeds (mean = 0.07, s.d. = 0.03), i.e. p values < 0.01 for ANOVA and Tukey post hoc tests (details in Supplementary Tables 1-2 and Supplementary Figures 3-4).
The average number of IBD segments per goat breed was 13.30 (s.d. = 19.91), and mean length was 5.4 Mb (s.d. = 4.91). The average number of IBD segments per sheep breed was 8.57 (s.d. = 12.84), and mean length was 7.41 Mb (s.d. = 5.67). For goats, African breeds showed significantly fewer IBD segments (mean = 5.62, s.d. = 10.64) than Asian (mean = 19.00, s.d. = 20.45) and European (mean = 20.59, s.d. = 25.66) breeds, i.e. p values < 0.01 for ANOVA and Tukey post hoc tests. The same observation applied to the length of the IBD segments (mean = 2.94, s.d. = 10.64 for African breeds, against mean = 8.27, s.d. = 20.45 for Asian breeds and mean = 7.64, s.d. = 25.66 for European breeds). For sheep, there was no significant difference in the average number of IBD segments between African, Asian and European breeds. On the other hand, European breeds showed significantly longer IBD segments (mean = 9.77, s.d. = 5.23) than Asian (mean = 5.00, s.d. = 4.02) and African breeds (mean = 5.85, s.d. = 6.71), i.e. p values < 0.001 for ANOVA and Tukey post hoc tests.
In goats, considering all the datasets together, the “admixed” breeds were characterized by fewer IBD segments (on average 6.82 versus 30.82 for the “slightly admixed” breeds, p value < 2.2×10−16), that were also shorter (on average 3.98 Mb versus 9.78 Mb for the “slightly admixed” breeds, p value = 3.66×10−12). In sheep, considering all the datasets together, the “admixed” breeds were characterized by fewer IBD segments (on average 4.03 versus 16.03 for the “slightly admixed” breeds, p value = 9.514×10−5), that were also shorter (on average 5.28 Mb versus 10.92 Mb for the “slightly admixed” breeds, p value = 9.514×10−5). These conclusions were drawn for all datasets tested individually, except for the number of IBD segments in African sheep (mean = 4.58, s.d. = 5.42 for “admixed breeds” versus, mean = 10.56 and s.d. = 9.20, for “slightly admixed” breeds, p value > 0.05) (Tables 1, 2 and Supplementary Figure 5).
Considering the goats and sheep datasets together (Tables 1, 2), the Spearman coefficient showed high correlations between the “AV index” and number or length of IBD segments (ρ around 0.75 in each case) ; correlation coefficient values were lower between “genetic integrity levels” and shared IBD segments number or length, ρ around 0.5 in each case (sheep and goats considered), but still highly significantly different from zero (Tables 1, 2). Considering the datasets independently, correlations between “genetic integrity levels” and number or length of IBD segments were weakest for African goat datasets (ρ around 0.37, p value < 0.001), and not significantly different from zero only for the African sheep datasets (Tables 1, 2).
Modeling admixture from IBD sharing patterns
Considering the “genetic integrity level” as the response (y) and the shared IBD segments “number” or “length” as predictors (x): (i) for goats, it appeared that LM models were more suitable than GAM models to fit the data. In addition, the models including the origin of the dataset (i.e. Africa, Asia or Europe) did not perform better than the basic LM models (p values > 0.05). The relationship (Fig. 1) between genetic integrity and the “number” (adjusted R2 = 0.236, p value < 0.0001) or “length” (adjusted R2 = 0.307, p value < 0.0001) of IBD segments was significant, but at low predictive values, there was a notable failure to adjust to the response, whereas at higher values, the correlation between predictor and response was clearer. (ii) For sheep, GAM models were better than LM models (p values < 0.0001). Table 3 gives details of the various GAM models tested, and Fig. 2 shows the relationships between “genetic integrity level” and shared IBD segments “number” and “length”, taking into account the origin of the datasets as it improved the models (adjusted R2 with “IBD segment number” as predictor = 0.43, adjusted R2 with “IBD segment length” as predictor = 0.45, p values < 0.001). The effective degrees of freedom (edf) estimated from GAM models can be used as an approximation of the degree of nonlinearity in predictor-response relationships. Indeed, an edf of 1 is equivalent to a linear relationship, an edf > 1 and ≤ 2 corresponds to a weakly non-linear relationship, while an edf > 2 indicates a strongly non-linear relationship (Zuur et al. 2009). Highly non-linear relationships are most likely to have inflection points and threshold responses. According to the edf values, the relationships were non-linear for all datasets. It appeared that as the value of the predictor increased, the value of “genetic integrity” increased, with a threshold zone beyond which there appeared a plateau or even a slight decrease. The African dataset showed the weakest relationship between “genetic integrity level” and “number” of IBD segments, with a convoluted curve and again for this dataset the relationship was not significant when considering “genetic integrity level” and IBD segment “length” as predictor (Table 3, Fig. 2).
Considering the “AV index” as the response (y) and shared IBD segments “number” and “length” as predictors (x) (Figs. 3, 4): robust models, indicating an overall increase in genetic originality as the “number” and “length” of IBD shared segments increased, were found. Strong adjusted R2 values were obtained, around 0.6, which even approached 0.8 in the goat dataset considering the “number” of IBD segment as the predictor (Table 4). It can be noted that the Asian dataset displayed linear or quasi-linear relationships.
Sharing IBD patterns and crossbreeding: an example with Irish goat breeds
We exploited the opportunity to have genotypes of Old Irish goat breed (OIG) and also of crossbred individuals (OIGx). The Old Irish Goat is the native Irish breed, now critically endangered and found only in remote mountain ranges, living mostly in wild herds (www.oldirishgoat.ie). OIG are very hardy and adapted to their territory, even though they are supposed to have arrived in Ireland 5000 years ago in Neolithic times (Yalden 1999). In the latter half of the twentieth century, they were widely crossbred with improved-type goats (Swiss and Anglo-Nubian breeds), precipitating their decline (Porter and Tebbit 1996).
For OIG, Admixture analysis revealed that 94% of the genome was shared by all individuals of the breed, compared to 45% for OIGx. Furthermore, OIG shared a very high number of IBD segments, 46.8 versus 9.8 for OIGx, and these shared segments were very long, averaging 12.3 Mb versus 2.1 Mb for OIGx. The Fig. 5 shows, by way of example, the number of shared segments for chromosomes 1 to 6, according to chromosomal position, and considering the same number of individuals for each breed (i.e. ten individuals). Unshared portions of the genome are symbolized in black, while the blue gradient indicates the maximum number of shares towards deep blue, and a low number of shares towards white. The number of shared segments is represented by the histogram above the chromosome. For OIGx, chromosome “mosaicism” was pronounced, with large proportions of genomes with little or no sharing, whereas for OIG, almost all parts of the chromosomes were shared, with frequency values at least 4 to 5 times higher than for crossbred individuals. Taking chromosome 6 as an example, 5 signatures associated with adaptation to temperature and altitude gradients, have been identified in Mediterranean goats by Serranito et al. (2021a), including signatures targeting: (i) the gene GSTCD (Glutathione S-Transferase C-Terminal Domain Containing), also identified by Wang et al. (2019) as being linked to hypoxia adaptation in yaks and by Mdladla (2016) as favoring adaptation to climate variables in South African goats. The gene (ii) HERC6 (HECT and RLD Domain Containing E3 Ubiquitin Protein Ligase Family Member 6), was also found linked to climate adaptation in South African goats (Mdladla 2016), associated with stress grazing tolerance in sheep (Mwacharo et al. 2017), and associated with arid adaptation in Chinese sheep (Yang et al. 2016). In contrast to OIG, the GSTCD gene zone (chromosomal location 19,640,852–19,780,384 bp) was no longer shared by OIGx, and sharing of the HERC6 gene was extremely low between OIGx individuals (chromosomal location 36,819,275–36,872,479 bp) (Fig. 5).
Discussion
In this study, we assessed how the “length” and the “number” of IBD segments shared between individuals of the same breed could be informative of its admixture level, or, in other words, its isolation in terms of gene flow. To do this, we considered the available SNP genotype datasets for goats and sheep, and we retained datasets that allowed the most relevant screening of some countries or world regions.
We found that breeds classified as “admixed” showed fewer and smaller IBD segments in both goat and sheep datasets. Moreover, Spearman correlations were highly significant between the “genetic integrity level” and the “number” or “length” of IBD tracts, with values around 0.5, except for African goat breeds showing lower values and for African sheep breeds for which no correlation was shown. Spearman correlations between the “AV index” and the number or length of IBD segments, on the other hand, were all significant, with higher values around 0.7–0.8. In line with these results, statistical modeling showed particularly pronounced links between the “AV index” and the variables “number” or “length of shared IBD segments”. Different patterns appeared depending on the geographical origin of the samples. For example, GAM models showed more convoluted curves for African breeds, involving relationships between IBD patterns and levels of genetic originality, more complex to explain. Refining the analyses by considering more restricted and coherent geographical zones, for which a substantial number of genotyped breeds would be available, as well as precise information concerning management methods, appears essential in order to understand which parameters influence IBD-sharing patterns according to the regions of the world considered.
Taken together, the results showed a clear link between the genome fragmentation, inferred by the characteristics of the IBD segments, and the various indicators considered to assess the level of admixture of the different small ruminant breeds. It is interesting to note that the IBD pattern predicts the “AV index”, focusing on the strength of the population’s isolation rather than on the genetic similarity between individuals, more accurately than the “admixture level”, which does not integrate these notions of genetic proximity or originality.
All these results imply that the intra-breed shared segments of IBD patterns could be used to develop relevant indicators of admixture, or genetic isolation of breeds. More comprehensive theoretical and experimental studies, designed to take into account the evolutionary trajectories of the breeds (e.g. bottleneck experience, Marsden et al. 2016), and to precisely control the levels of admixture are needed to further our understanding of the phenomena. In particular, it is necessary to elucidate (i) which parameters induce a large “number” and/or “length” of IBD segments within breeds that are however admixed. Moreover (ii), it is important to disentangle the factors other than the admixture level (artificial selection pressure, inbreeding patterns, demographic histories, genetic drift, domestication, etc., see Xiang et al. 2022; Kristensen and Sørensen 2005; Orozco-TerWengel et al. 2015) that influence, differentially or not, the “length” and “number “of IBD shared segments. If we consider the “inbreeding” parameter, much work has been done on runs of homozygous genotypes (ROH) segments (i.e. long stretches of homozygous genotypes that commonly arise when individuals inherit haplotypes identical by descent) as indicators to estimate inbreeding levels and associated depression (Zhao et al. 2017; Peripolli et al. 2017). Indeed, the more recent the inbreeding, the longer these segments will tend to be, as few opportunities will have arisen for recombination to break them. For the “artificial selection pressure” parameter, methods also based on haplotype characteristics have been developed (Sabeti et al. 2002). The main property of positive selection is that it causes an unusually rapid increase in allele frequency, over a sufficiently short period to ensure that recombination does not lead to substantial breakdown of the haplotype on which the selected mutation occurs (Liu et al. 2013; Vanvanhossou et al. 2021). More generally, linkage disequilibrium in livestock has been largely influenced (Amaral et al. 2008), since domestication, by human decisions, which have governed livestock demography and the intensity and direction of artificial selection, itself closely linked to the level of inbreeding. Taking account of the different evolutionary trajectories is therefore essential in understanding haplotype patterns (Mészáros et al. 2021). Finally, (iii) the notion of timing needs to be further explored. The issue of crossbreeding in local breeds arose in the last few decades, so we are looking for an estimator that is particularly sensitive to recent events that have shaped genome profiles. Browning and Browning (2015) postulated that IBD segments could approximate the past 100 generations of demographic history. This rationale has been used to estimate the timing of recent mixing events (e.g., Patterson et al. 2004; Hoggart et al. 2004; Koopman et al. 2007).
To illustrate our point, we can apply our results to the conclusions of Belabdi et al. (2019), for North African sheep. The study revealed that, while the majority of Algerian and Moroccan breeds were highly admixed, the Sidaoun and Hamra breed (from pilot farms) were clearly differentiated (according to FST values, Weir and Cockerham 1984; ADMIXTURE; NetView, Steinig et al. 2016; fineSTRUCTURE, Lawson et al. 2012, analyses), appearing as not admixed. Analysis of IBD segments revealed that the ten highly admixed breeds shared an average number of intra-breed IBD segments close to zero, while for breeds appearing to be preserved from admixture, the mean number of segments was substantial at around 45.6 for Hamra but only at 0.11 for Sidaoun. The authors noted the discrepancy in the IBD sharing patterns, relative to other analyses for the Sidaoun breed. Combining these results with the present study, we can hypothesize that Sidaoun, would in fact probably be subject to outcrossing but with breeds not analyzed in the dataset. Indeed, the Sidaoun, bred by the Bedouins of the desert, is found in southern Algeria. For sanitary reasons, this breed cannot travel to the north, where all the other breeds analyzed by Belabdi et al., are found. It thus remains to study the links that Sidaoun breeders have with the southern border countries, particularly Mali and Niger, in order to identify whether the breed is indeed subject to crossbreeding via these as yet ungenotyped populations.
This study highlights the fact that European breeds (with the exceptions discussed below) appear to be globally better “preserved” than Asian and African breeds, probably due to early breed standardization as dictated by breed societies and stringent selection schemes. However, we must not neglect the fact that a breed may be in a good state of preservation in terms of admixture levels, but nevertheless suffer from genomic alterations induced by different factors, leading in particular to the erosion of genetic diversity. In fact, while selection schemes, with their strict control of sires, maintain the integrity of the breed, their application also leads to a reduction in the pool of genetic diversity, and taken to the extreme, intensive selection leads to severe loss of resilience (Doublet et al. 2020, Stachowicz et al. 2011, Taberlet et al. 2008, Rauw et al. 1998). For developing countries, the problem is quite different. If we consider the goat dataset, from East, North and West Africa, i.e. 17 countries represented through 55 breeds, we note that for 35 of them, less than one IBD segment shared, was recorded on average. These results probably reflect crossbreeding practices that weaken the resource by damaging its genetic integrity and the adaptive architecture established over time. In North Africa, Ouchene-Khelifi et al. (2018), showed a genetic homogenization of the goat stock, probably due to anarchic crossbreeding of breeders subjected to increasing economic pressure. In East Africa, Serranito et al. (2021b) highlighted very pronounced levels of admixture for goat breeds of Uganda, Tanzania and Kenya. The authors drew attention to the potentially decisive causal role of various structures involving religious organizations, government institutions, non-governmental organization, such as Heifer International, British Farm-Africa, National Livestock Production Development Programs, German-GTZ, etc. (Wilson et al. 1990; Mruttu et al. 2016). Indeed, these programs have been implemented since the 1980s to improve goats, essentially represented by small indigenous breeds highly adapted to the regions environment, such as the Small East African (SEA) goat, via crossbreeding with imported exotic breeds, notably Boer, Kamorai, Toggenburg, Saanen, Norwegian, Alpine and Anglo-Nubian.
The situation of the sheep resource in Spain spurred our attention. The ten breeds considered were all classified as “admixed” with an average number of IBD segments of 1.13. These values are very close to those found in Africa for goats, indicating highly fragmented genomes, potentially resulting from crossbreeding in the recent past and/or present. Careful analysis of the ADMIXTURE results (Supplementary Figure 2) reveals genetic proximity between certain Spanish breeds, that can be explained by a common origin (e.g. Latxa and Sasi Ardi belonging to the Churro branch, Rendo et al. 2004). However, for many breeds, this argument is hard to sustain. If we consider the Segurena breed, originating from the Entrefino branch, analyses showed that it shares significant proportions of its genome with breeds derived from the Iberico and Merino branches. This ancient breed, whose cradle is the Sierra de Segura, is an important part of the local heritage, with transhumance practices that have been known since the Middle Ages. Like a large number of local Spanish breeds, it is registered in the Official Catalog of Spanish Livestock, supervised by the Ministry of Agriculture (MAPA). However, it turns out that “industrial crossbreeding”, i.e. mating of males from meat breeds with females from local breeds, was a technique widely used in Spain between the 1970s and 1990s. In this respect, many crossbreeding studies have been carried out, notably with meat breeds known as Ovinos Precoces (Espejo et al. 1977; Esteban 2003). In particular, the Segurena breed has been tested for crosses with the Texel, Merino Landschaf and Ile de France breeds (Baro Shakery 1975). The interest in this issue remains valid today, as evidenced by work published in 2016, assessing the value of crosses between Texel and Segurena breeds (Blasco et al. 2016). As pointed out by Sierra Alfranca (1984), the use of these crossbreeds is not risk-free, since without strict F1 slaughtering, the spread of crossbred individuals in the populations can lead to the rapid breed’s loss. An in-depth study of the history of local Spanish breeds, with a particular focus on the agricultural policies pursued over the last few decades, seem essential to the understanding of the genetic results obtained in our study. Perea and Arias (2022) note that Spain is one of the countries with the greatest diversity of local breeds, thanks to an extensive and heterogeneous cultural and agro-ecological heritage. However, in the context of intensification of agriculture, there is a loss of economic competitiveness of traditional agricultural systems, and globally, an abandonment of local breeds. Our results suggest the urgent need to deepen genetic analyses and couple them with anthropological and historical studies in order to understand the changing dynamics in this country.
Conclusion
This study highlighted the potential of IBD sharing patterns as indicators of admixture. Such indicators would optimize the protection of local breeds, enabling the detection of endangered breeds due to crossbreeding without the need for exhaustive knowledge of management practices and/or genotyping of breeds likely to be crossed with the breed in question, which is currently the case. The results also emphasize the fragmentation of genomes and the disruption of unique adaptation patterns, induced by crossbreeding practices driven by short-term productivity objectives.
Data availability
The goat AdaptMap dataset, is available via Dryad: https://doi.org/10.5061/dryad.v8g21pt). For sheep datasets : French breeds are available via Zenodo repository https://doi.org/10.5281/zenodo.237116; Italian and Spanish breeds via https://doi.org/10.23644/uu.8947346; Asian and North/Central European breeds via http://www.sheephapmap.org; Ethiopian breeds via www.animalgenome.org/repository/pub/KORE2017.1122 and also www.animalgenome.org/repository/pub/NOTT2018.0423; South African sheep breeds via osf.io/ceup6/?view_only=a1959659de5f4d5d9bbb1c607b2d83b6.
References
Ågren A, Vainikka A, Janhunen M, Hyvärinen P, Piironen J, Kortet R (2019) Experimental crossbreeding reveals strain-specific variation in mortality, growth and personality in the brown trout (Salmo trutta). Sci Rep. 9:2771. https://doi.org/10.1038/s41598-018-35794-6
Ahbara A, Bahbahani H, Almathen F, Al Abri M, Agoub MO, Abeba A, Kebede A, Musa HH, Mastrangelo S, Pilla F, Ciani E, Hanotte O, Mwacharo JM (2019) Genome-wide variation, candidate regions and genes associated with fat deposition and tail morphology in Ethiopian indigenous sheep. Front Genet 9:699. https://doi.org/10.3389/fgene.2018.00699
Alexander DH, Novembre J, Lange K (2009) Fast model-based estimation of ancestry in unrelated individuals. Genome Res 19:1655–1664
Aliloo H, Mrode R, Okeyo AM, Gibson JP (2020) Ancestral Haplotype Mapping for GWAS and Detection of Signatures of Selection in Admixed Dairy Cattle of Kenya. Front Genet 11:544. https://doi.org/10.3389/fgene.2020.00544
Amane A, Belay G, Nasser Y, Kyalo M, Dessie T, Kebede A et al. (2020) Genome-wide insights of Ethiopian indigenous sheep populations reveal the population structure related to tail morphology and phylogeography. Genes Genom 42:1169–1178. https://doi.org/10.1007/s13258-020-00984-y
Amaral AJ, Megens H-J, Crooijmans RPMA, Heuven HCM, Groenen MAM (2008) Linkage Disequilibrium Decay and Haplotype Block Structure in the Pig. Genetics 179:569–579. https://doi.org/10.1534/genetics.107.084277
Anand L, Rodriguez Lopez CM (2022) ChromoMap: an R package for interactive visualization of multi-omics data and annotation of chromosomes. BMC Bioinforma 23:33. https://doi.org/10.1186/s12859-021-04556-z
Baird S, Barton NH, Etheridge AM (2003) The distribution of surviving blocks of an ancestral genome. Theor Popul Biol 64(4):451–471
Barton NH, Bengtsson BO (1986) The barrier to genetic exchange between hybridising populations. Heredity 57:357
Baro Shakery E (1975) Razas ovinas especializadas en la produccion de carne. Hojas Divulgadoras, Ministerio de Agricultura.
Belabdi I, Ouhrouch A, Lafri M, Gaouar SBS, Ciani E, Benali AR et al. (2019) Genetic homogenization of indigenous sheep breeds in Northwest Africa. Sci Rep. 9:7920. https://doi.org/10.1038/s41598-019-44137-y
Bertolini F. et al. (2018) Signatures of selection and environmental adaptation across the goat genome post-domestication. Genetics, selection, evolution: GSE. 50. https://doi.org/10.1186/s12711-018-0421-y
Bett RC, Okeyo A, Malmfors B, Johansson K, Agaba M, Kugonza D et al. (2013) Cattle Breeds: Extinction or Quasi-Extant? Resources 2:335–357. https://doi.org/10.3390/resources2030335
Blasco M, Campo MM, Balado J, Sañudo C (2016) Influencia del cruce industrial en los rendimientos productivos y la calidad de la canal de corderos de la raza ovina Segureña. Archivos de Zootec 65:421–424. https://doi.org/10.21071/az.v65i251.707
Bosse M, Megens HJ, Frantz LA, Madsen O, Larson G, Paudel Y et al. (2014) Genomic analysis reveals selection for Asian genes in European pigs following human‐mediated introgression. Nature. Communications 5:4392–5000
Boutrais J (2007) The Fulani and Cattle Breeds: Crossbreeding and Heritage Strategies. Africa 77:18–36. https://doi.org/10.3366/afr.2007.77.1.18
Browning SR, Browning BL (2007) Rapid and Accurate Haplotype Phasing and Missing-Data Inference for Whole-Genome Association Studies By Use of Localized Haplotype Clustering. Am J Hum Genet 81:1084–1097
Browning S (2008) Estimation of pairwise identity by descent from dense genetic marker data in a population sample of haplotypes. Genetics 178:2123–2132
Browning SR, Browning BL (2012) Identity by descent between distant relatives: detection and applications. Annu Rev Genet 46:617–633. https://doi.org/10.1146/annurev-genet-110711-155534
Browning SR, Browning BL (2015) Accurate Non-parametric Estimation of Recent Effective 563 Population Size from Segments of Identity by Descent. Am J Hum Genet 97:404–418
Ciani E, Mastrangelo S, Da Silva A, Marroni F, Ferenčaković M, Ajmone-Marsan P et al. (2020) On the origin of European sheep as revealed by the diversity of the Balkan breeds and by optimizing population-genetic analysis tools. Genet Selection Evolution 52:25. https://doi.org/10.1186/s12711-020-00545-7
Danecek P et al. (2011) The variant call format and VCFtools. Bioinformatics 27:2156–2158
Decker JE, McKay SD, Rolf MM, Kim J, Alcalá AM, Sonstegard TS et al. (2014) Worldwide Patterns of Ancestry, Divergence, and Admixture in Domesticated Cattle. PLOS Genet 10:e1004254. https://doi.org/10.1371/journal.pgen.1004254
Doublet A-C, Restoux G, Fritz S, Balberini L, Fayolle G, Hozé C et al. (2020) Intensified Use of Reproductive Technologies and Reduced Dimensions of Breeding Schemes Put Genetic Diversity at Risk in Dairy Cattle Breeds. Anim (Basel) 10:1903. https://doi.org/10.3390/ani10101903
Dzomba EF, Chimonyo M, Snyman MA, Muchadeyi FC (2020) The genomic architecture of South African mutton, pelt, dual-purpose and nondescript sheep breeds relative to global sheep populations. Anim Genet 51:910–923. https://doi.org/10.1111/age.12991
Edea Z, Dessie T, Dadi H, Do K-T, Kim K-S (2017) Genetic Diversity and Population Structure of Ethiopian Sheep Populations Revealed by High-Density SNP Markers. Frontiers in Genetics. 8. Available: https://www.frontiersin.org/articles/10.3389/fgene.2017.00218
Eiswerth ME, Haney JC (1992) Allocating conservation expenditures: accounting for inter-species genetic distinctiveness. Ecol Econ 5:235–249. https://doi.org/10.1016/0921-8009(92)90003-B
Espejo M, Mora M, García Barreto L (1977) Crecimiento y calidad carnicera de los productos obtenidos del cruce entre moruecos de razas cárnicas y ovejas de razas locales españolas. An Inia Ser Prod Anim 8:55–67
Esteban C (2003) Razas ganaderas españolas. II. Ovinas. Ed. MAPA, Madrid. España
Falush D, Stephens M, Pritchard JK (2003) Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164:1567–1587. https://doi.org/10.1093/genetics/164.4.1567
FAO, 2007. The State of the World’s Animal Genetic Resources for Food and Agriculture. B. Richkowsky & D. Pilling, eds. Rome.
FAO, 2015. The Second Report on the State of the World’s Animal Genetic Resources for Food and Agriculture, Ed. by Scherf, B. D & Pilling, D., FAO Commission on Genetic Resources for Food and Agriculture Assessments, Rome.
Ferdy JB, Austerlitz F (2002) Extinction and introgression in a community of partially cross‐fertile plant species. Am Naturalist 160(1):74–86
Fisher RA (1949) The theory of inbreeding. Oliver and Boyd.
Fisher RA (1954) A fuller theory of “junctions” in inbreeding. Heredity 8:187–197
Freedman AH, Schweizer RM, Ortega-Del Vecchyo D, Han E, Davis BW, Gronau I et al. (2016) Demographically-Based Evaluation of Genomic Regions under Selection in Domestic Dogs. Plos Genet 12:e1005851. https://doi.org/10.1371/journal.pgen.1005851
Goudet J (2005) Hierfstat, a package for R to compute and test variance components and F-statistics. Mol Ecol Notes 5:184–186
Hoffmann I (2013) Adaptation to climate change – exploring the potential of locally adapted breeds. Animal 7:346–362. https://doi.org/10.1017/S1751731113000815
Hoggart CJ, Shriver MD, Kittles RA, Clayton DG, McKeigue PM (2004) Design and analysis of admixture mapping studies. Am J Hum Genet 74:965–978
Huson DH, Bryant D (2006) Application of Phylogenetic Networks in Evolutionary Studies. Mol Biol Evol 23:254–627
Janzen T, Nolte AW, Traulsen A (2018) The breakdown of genomic ancestry blocks in hybrid lineages given a finite number of recombination sites. Evolution 72:735–750. https://doi.org/10.1111/evo.13436
Kijas JW, Lenstra JA, Hayes B, Boitard S, Neto LRP, Cristobal MS et al. (2012) Genome-Wide Analysis of the World’s Sheep Breeds Reveals High Levels of Historic Mixture and Strong Recent Selection. PLOS Biol 10:e1001258. https://doi.org/10.1371/journal.pbio.1001258
Koopman WJ, Li Y, Coart E, Van De Weg WE, Vosman B et al. (2007) Linked vs. unlinked markers: multilocus microsatellite haplotype-sharing as a tool to estimate gene flow and introgression. Mol Ecol 16:243–256
Kopelman NM, Mayzel J, Jakobsson M, Rosenberg NA, Mayrose I (2015) Clumpak: a program for identifying clustering modes and packaging population structure inferences across K. Mol Ecol Resour 15:1179–1191. https://doi.org/10.1111/1755-0998.12387
Kristensen TN, Sørensen AC (2005) Inbreeding – lessons from animal breeding, evolutionary biology and conservation genetics. Anim Sci 80:121–133. https://doi.org/10.1079/ASC41960121
Lawson DJ, Hellenthal G, Myers S, Falush D (2012) Inference of Population Structure using Dense Haplotype Data. PLoS Genet 8:e1002453
Leitwein M, Duranton M, Rougemont Q, Gagnaire P-A, Bernatchez L (2020) Using Haplotype Information for Conservation Genomics. Trends Ecol Evolution 35:245–258. https://doi.org/10.1016/j.tree.2019.10.012
Leutenegger A-L, Prum B, Génin E, Verny C, Lemainque A, Clerget-Darpoux F et al. (2003) Estimation of the Inbreeding Coefficient through Use of Genomic Data. Am J Hum Genet 73:516–523. https://doi.org/10.1086/378207
Liang M, Nielsen R (2014) The lengths of admixture tracts. Genetics 197:953–967
Liu X, Ong RT-H, Pillai EN, Elzein AM, Small KS, Clark TG et al. (2013) Detecting and Characterizing Genomic Signatures of Positive Selection in Global Populations. Am J Hum Genet 92:866–881. https://doi.org/10.1016/j.ajhg.2013.04.021
Luu K, Bazin E, Blum MGB (2017) pcadapt: An R package to perform genome scans for selection based on principal component analysis. Mol Ecol Resour 17(1):67–77. https://doi.org/10.1111/1755-0998.12592
Marsden CD, Ortega-Del Vecchyo D, O’Brien DP, Taylor JF, Ramirez O, Vilà C et al. (2016) Bottlenecks and selective sweeps during domestication have increased deleterious genetic variation in dogs. Proc Natl Acad Sci USA 113:152–157. https://doi.org/10.1073/pnas.1512501113
Mdladla K (2016) Landscape genomic approach to investigate genetic adaptation in South African indigenous goat populations (PhD thesis). University of KwaZulu-Natal Pietermaritzburg, South Africa
Mészáros G, Milanesi M, Ajmone-Marsan P, Utsunomiya YT (2021) Editorial: Haplotype Analysis Applied to Livestock Genomics. Frontiers in Genetics. 12. Available: https://doi.org/10.3389/fgene.2021.660478
Molotsi AH, Taylor JF, Cloete SWP, Muchadeyi F, Decker JE, Whitacre LK et al. (2017) Genetic diversity and population structure of South African smallholder farmer sheep breeds determined using the OvineSNP50 beadchip. Trop Anim Health Prod 49:1771–1777. https://doi.org/10.1007/s11250-017-1392-7
Mruttu H, Ndomba C, Nandonde S, Nigussie Brook K (2016) Animal Genetics Strategy And Vision For Tanzania. Tanzania Ministry of Agriculture, Nairobi
Mwacharo JM, Kim ES, Elbeltagy AR et al. (2017) Genomic footprints of dryland stress adaptation in Egyptian fat-tail sheep and their divergence from East African and western Asia cohorts. Sci Rep. 7:17647. https://doi.org/10.1038/s41598-017-17775-3
Okeno TO, Kahi AK, Peters KJ (2013) Evaluation of breeding objectives for purebred and crossbred selection schemes for adoption in indigenous chicken breeding programmes. Br Poult Sci 54:62–75. https://doi.org/10.1080/00071668.2013.764492
Orozco-TerWengel P, Barbato M, Nicolazzi E, Biscarini F, Milanesi M, Davies W, et al. (2015) Revisiting demographic processes in cattle with genome-wide population genetic analysis. Frontiers in Genetics. 6.
Ouchene-Khelifi N-A et al. (2018) Genetic homogeneity of North-African goats. PLoS ONE 13:e0202196
Paim T, Hay EHA, Wilson C, Thomas MG, Kuehn LA, Paiva SR et al. (2020) Dynamics of genomic architecture during composite breed development in cattle. Anim Genet 51:224–234. https://doi.org/10.1111/age.12907
Patterson N, Hattangadi N, Lane B, Lohmueller KE, Hafler DA et al. (2004) Methods for high-density admixture mapping of disease genes. Am J Hum Genet 74:979–1000
Patterson N, Price AL, Reich D (2006) Population Structure and Eigen analysis. PLOS Genet 2:e190. https://doi.org/10.1371/journal.pgen.0020190
Pavoine S, Ollier S, Dufour AB (2005) Is the originality of a species measurable? Ecol Lett 8:579–586
Pavoine S, Bonsall MB, Dupaix A, Jacob U, Ricotta C (2017) From phylogenetic to functional originality: Guide through indices and new developments. Ecol Indic 82:196–205. https://doi.org/10.1016/j.ecolind.2017.06.056
Pedersen EJ, Miller DL, Simpson GL, Ross N (2018) Hierarchical generalized additive models: an introduction with mgcv. PeerJ 7:e6876. https://doi.org/10.7287/peerj.preprints.27320
Perea J, Arias R (2022) Competitiveness of Spanish Local Breeds. Animals 12:2060. https://doi.org/10.3390/ani12162060
Peripolli E, Munari DP, Silva MVGB, Lima ALF, Irgang R, Baldi F (2017) Runs of homozygosity: current knowledge and applications in livestock. Anim Genet 48:255–271. https://doi.org/10.1111/age.12526
Pool JE, Nielsen R (2009) Inference of Historical Changes in Migration Rate From the Lengths of Migrant Tracts. Genetics 181:711–719. https://doi.org/10.1534/genetics.108.098095
Porter V, Tebbit J (1996) Goats of the world. Farming Press Ltd, London
Privé F, Luu K, Vilhjálmsson BJ, Blum MGB (2020) Performing highly efficient genome scans for local adaptation with R Package pcadapt Version 4. Mol Biol Evol 37:2153–2154. https://doi.org/10.1093/molbev/msaa053
Purcell S et al. (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81:559–575
R Core Team (2022) R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing). https://www.R-project.org.
Rauw WM, Kanis E, Noordhuizen-Stassen EN, Grommers FJ (1998) Undesirable side effects of selection for high production efficiency in farm animals: a review. Livest Prod Sci 56:15–33. https://doi.org/10.1016/S0301-6226(98)00147-X
Rendo F, Iriondo M, Jugo BM, Mazón LI, Aguirre A, Vicario A et al. (2004) Tracking diversity and differentiation in six sheep breeds from the North Iberian Peninsula through DNA variation. Small Rumin Res 52:195–202. https://doi.org/10.1016/j.smallrumres.2003.07.004
Reynolds J, Weir BS, Cockerham C (1983) Estimation of the coancestry coefficient: Basis for a short-term genetic distance. Genetics 105:767–779
Ricotta C (2004) A parametric diversity measure combining the relative abundances and taxonomic distinctiveness of species. Diversity Distrib 10:143–146. https://doi.org/10.1111/j.1366-9516.2004.00069.x
Rochus C, Tortereau F, Plisson-Petit F, Restoux G, Moreno-Romieux C, Tosser-Klopp G et al. (2018) Revealing the selection history of adaptive loci using genome-wide scans for selection: An example from domestic sheep. BMC Genomics 19:71. https://doi.org/10.1186/s12864-018-4447-x
Sabeti PC, Reich DE, Higgins JM, Levine HZP, Richter DJ, Schaffner SF et al. (2002) Detecting recent positive selection in the human genome from haplotype structure. Nature 419:832–837. https://doi.org/10.1038/nature01140
Scherf B, Rischkowsky B, Pilling D and Hoffmann I (2006) The state of the world’s animal genetic resources. 8th Wrld Congr. Gen. Appl. to Livest. Prod., Belo Horizonte, M. G., Brazil, p. 13–18.
Serranito B, Cavalazzi M, Vidal P, Taurisson-Mouret D, Ciani E, Bal M et al. (2021a) Local adaptations of Mediterranean sheep and goats through an integrative approach. Sci Rep. 11:21363. https://doi.org/10.1038/s41598-021-00682-z
Serranito B, Taurisson-Mouret D, Harkat S, Laoun A, Ouchene-Khelifi N-A, Pompanon F et al. (2021b) Search for Selection Signatures Related to Trypanosomosis Tolerance in African Goats. Front Genet 12:715732. https://doi.org/10.3389/fgene.2021.715732
Shrestha JNB (2005) Conserving domestic animal diversity among composite populations. Small Rumin Res 56:3–20
Sierra Alfranca I (1984) El cruce industrial en la producción de carne ovina. Ministerio de Agricultura, Hojas Divulgadoras
Stachowicz K, Sargolzaei M, Miglior F, Schenkel FS (2011) Rates of inbreeding and genetic diversity in Canadian Holstein and Jersey cattle. J Dairy Sci 94:5160–5175. https://doi.org/10.3168/jds.2010-3308
Steinig EJ, Neuditschko M, Khatkar MS, Raadsma HW, Zenger KR (2016) Netview p: a network visualization tool to unravel complex population structure using genome-wide SNPs. Mol Ecol Resour 16:216–227
Taberlet P, Valentini A, Rezaei HR, Naderi S, Pompanon F, Negrini R et al. (2008) Are cattle, sheep, and goats endangered species. Mol Ecol 17:275–284. https://doi.org/10.1111/j.1365-294X.2007.03475.x
Tang K, Naseri A, Wei Y, Zhang S, Zhi D (2022) Open-source benchmarking of IBD segment detection methods for biobank-scale cohorts. GigaScience 11:giac111. https://doi.org/10.1093/gigascience/giac111
Todesco M, Pascual MA, Owens GL, Ostevik KL, Moyers BT, Hübner S et al. (2016) Hybridization and extinction. Evolut Appl 9:892–908. https://doi.org/10.1111/eva.12367
Upadhyay M, Bortoluzzi C, Barbato M, Ajmone‐Marsan P, Colli L, Ginja C et al. (2019) Deciphering the patterns of genetic admixture and diversity in southern European cattle using genome‐wide SNPs. Evol Appl 12:951–963. https://doi.org/10.1111/eva.12770
Van Rossum G, Drake Jr FL (1995) Python reference manual. Centrum voor Wiskunde en Informatica, Amsterdam
Vanvanhossou SFU, Yin T, Scheper C, Fries R, Dossa LH, König S (2021) Unraveling Admixture, Inbreeding, and Recent Selection Signatures in West African Indigenous Cattle Populations in Benin. Front Genet 12:657282. https://doi.org/10.3389/fgene.2021.657282
Wang H, Chai Z, Hu D, Ji Q, Xin J, Zhang C, Zhong J (2019) A global analysis of CNVs in diverse yak populations using whole-genome resequencing. BMC Genomics 20(1):61. https://doi.org/10.1186/s12864-019-5451-5
Weir BS, Cockerham CC (1984) Estimating F-Statistics for the Analysis of Population-Structure. Evolution 38:1358–1370
Wickham H (2016) ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag, New York. https://ggplot2.tidyverse.org
Wilson RT, Araya A, Dominique S (1990) Répertoire des projets de recherche et de développement caprins et ovins en Afrique. Draft. CIPEA, Edinburgh
Wood SN (2011) Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. J R Stat Soc: Ser B (Stat Methodol) 73:1
Wright S (1965) The Interpretation of Population Structure by F-Statistics with Special Regard to Systems of Mating. Evolution 19:395–420. https://doi.org/10.2307/2406450
Xiang H, Derks MFL, Yi G, Zhao X (2022) Editorial: Early Domestication and Artificial Selection of Animals. Frontiers in Genetics. 13. Available: https://doi.org/10.3389/fgene.2022.841252
Yalden D (1999) The history of British mammals. T. & A. D. Poyser Ltd, London
Yang J, Li W-R, Lv F-H, He S-G, Tian S-L, Peng W-F et al. (2016) Whole-Genome Sequencing Of Native Sheep Provides Insights Into Rapid Adaptations To Extreme Environments. Mol Biol Evolution 33(10):2576–2592. https://doi.org/10.1093/molbev/msw129
Zhang C, Lin D, Wang Y, Peng D, Li H, Fei J et al. (2019) Widespread introgression in Chinese indigenous chicken breeds from commercial broiler. Evol Appl 12:610–621. https://doi.org/10.1111/eva.12742
Zhao Y-X, Yang J, Lv F-H, Hu X-J, Xie X-L, Zhang M et al. (2017) Genomic Reconstruction Of The History Of Native Sheep Reveals The Peopling Patterns Of Nomads And The Expansion Of Early Pastoralism in East Asia. Mol Biol Evol 34:2380–2395. https://doi.org/10.1093/molbev/msx181
Zuur A, Ieno E, Walker N, Saveliev A, Smith G (2009) Mixed effects models and extensions in ecology with r. Springer, New York
Acknowledgements
This work was supported by the AAP CNRS 2020: “Adaptation du Vivant à son Environnement” (EMPREINTES project, 2020–2021). JMM was partially supported by the SAPLING Initiative of the One CGIAR whose global funders are acknowledged.
Author information
Authors and Affiliations
Contributions
SBDS was responsible for Python analyses and statistical processing, he took part in writing the article; JMM, ML, AA, FCM, and EFD have collaborated by sharing their datasets and breed knowledge, JAL has provided the expertize to build robust datasets and performed preliminary genetic processes and ADS designed the study, carried out the analyses, wrote the article. All authors have reviewed the article.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Associate editor: Sara Knott.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Blondeau Da Silva, S., Mwacharo, J.M., Li, M. et al. IBD sharing patterns as intra-breed admixture indicators in small ruminants. Heredity 132, 30–42 (2024). https://doi.org/10.1038/s41437-023-00658-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41437-023-00658-x