Composite core set construction and diversity

Full text

Turn on search term navigation

About the Authors:

Razieh Mahmoodi

Roles Data curation, Formal analysis, Investigation, Software, Writing – original draft

Affiliations Department of Horticulture Sciences, Faculty of Agriculture, University of Tabriz, Tabriz, Iran, Temperate Fruits Research Center, Horticultural Science Research Institute, Agricultural Research, Education and Extension Organization (AREEO), Karaj, Iran

Mohammad Reza Dadpour

Roles Conceptualization, Data curation, Formal analysis, Methodology, Supervision, Writing – original draft

* E-mail: [email protected] (DH); [email protected] (MRD)

Affiliation: Department of Horticulture Sciences, Faculty of Agriculture, University of Tabriz, Tabriz, Iran

Darab Hassani

Roles Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft

* E-mail: [email protected] (DH); [email protected] (MRD)

Affiliation: Temperate Fruits Research Center, Horticultural Science Research Institute, Agricultural Research, Education and Extension Organization (AREEO), Karaj, Iran

ORCID logo https://orcid.org/0000-0002-3569-8889

Mehrshad Zeinalabedini

Roles Data curation, Formal analysis

Affiliation: Systems Biology Department, Agricultural Biotechnology Research Institute of Iran, Agricultural Research, Education and Extension Organization (AREEO), Karaj, Iran

Elisa Vendramin

Roles Data curation

Affiliation: Centro di Ricerca per l’Olivicoltura, Frutticoltura e Agrumicoltura, Consiglio per la Ricerca in Agricoltura e l’Analisi dell’Economia Agraria, Roma, Italy

Charles A. Leslie

Roles Validation, Visualization, Writing – original draft

Affiliation: Department of Plant Sciences, University of California, Davis, CA, United States of America

Introduction

Persian walnut (Juglans regia L.), is the most important species of the Juglandaceae family for its valuable nuts. Iranian plateau has been considered as a center of origin and domestication of this species, where it still exhibits a great diversity. In overall, sexual propagation of the walnut might be the main reason for high genetically variation which still exists among natural populations. Thoroughly, economic and nutritional value of the Persian walnut has been lead to world-wide distribution of the species especially in temperate regions. As it is known, the walnut is a monoicous species. The existence of protandry which usually cause the outcrossing, increase the variability and affects the population structure. This phenomenon together with the sexual propagation, created a huge segregated walnut population in Iran.

During the recent decades, there has been remarkable progress in evaluation of walnut germplasm and establishment of a collection in Iran. Basically, germplasm collections are considered as a valuable sources of conserving genetic diversity and providing plant material for breeders, but in collections with very high number of accessions, it can be more expensive to identify the appropriate stock. On the other hands, management of a large number of individuals could be effectively difficult [1]. To overcome this issue, Brown [1] proposed the concept of core collection. Several methods have been exploited to develop core collections. Initially, most researchers performed random sampling [2]. More recently, progresses in molecular biology have facilitated establishment of core collections using molecular markers, either alone [3] or with phenotypic traits [4–7]. The maximization strategy has been developed using PowerCore v. 1.0 software upon which, selection of accessions with the highest diversity could be possible [8]. Up to now, core collections have been established for a number of fruit tree species [4, 9, 10].

Regarding to the Persian walnut, the first core collection was established using phenological traits [11]. Since phenotypic traits are substantially influenced by environmental factors, this research has been used as complementary to create a more robust core collection. The main reason for pooling the phenotypic data and molecular procedures together is that, the molecular markers can only reflect variation at DNA level that is not necessarily expressed in the phenotype [12].

During recent decades, many studies focused on the genetic diversity and population structure of Persian walnut based on SSR [13], RAPD [14], AFLP [15]and SNP [16].

This study tried to describe the genetic diversity and population structure of a collection of 104 walnut accessions in order to assume variation and to identify a robust core collection. To the best of our knowledge, this is the first report for utilizing molecular variation and phenotypic data in walnut.

Materials and methods

The main collection used in this study has been established at 35.754888° N and 50.952986° E in Karaj in 2006. For the establishment of the main collection, in the first step, pre-selections were done among more than 10000 walnut genotypes from the walnut producing areas including Karaj, Qazvin, Tabriz, Urmia, Kerman, Tuyserkan, and Shahroud from 2003 to 2004. Subsequently, the 104 walnut genotypes selected based on phenotypic traits, were propagated by grafting and were planted together with nine foreign walnut cultivars: Chandler, Pedro, Hartley, Serr, Howard, Ronde de Montignac, Alsozentivani 117, Frjean and Roxana, in the main walnut collection. The 104 walnut genotypes were belonging to seven autochthonous origin (Alborz, Kerman, Qazvin, Shahroud, Tabriz, Touyserkan and Urmia), and two foreign groups (USA and Europe). Details, including accession ID, name, and origin, are described in Mahmoodi et al [11].

Measurement of phenotypic data

The accessions were evaluated for 17 qualitative traits (nut size, nut shape in longitudinal section through suture, nut shape in longitudinal section perpendicular to suture, nut shape in cross section, nut shape of base perpendicular to suture, shape of apex perpendicular to suture, prominence of apical tip, position of pad on suture, prominence of pad on suture, width of pad on suture, depth of grove along pad on suture, structure of surface of shell, adherence of two halves of shell, thickness of dividing membranes, ease of removal, intensity of ground color and kernel size), 18 quantitative traits (bud break; start, end and duration of pollen shedding and pistillate flowers receptivity; nut length, width, thickness and roundness; nut and kernel weight; kernel percentage; shell and membrane thickness; and number of nuts to scaffold and tranck cross area) [11] and AFLP markers. The measurements of nut and kernel traits were based on 20 nut samples. The descriptor for qualitative traits, were reported in S1 Table. The Shannon diversity index (H), a parameter commonly used to characterize diversity in populations, was calculated based on Shannon [17].

Genomic DNA extraction and AFLP analysis

For determining the genetic variability among the walnut genotypes, 13 AFLP primers were evaluated. Genomic DNA was extracted from 200 mg of leaf tissue of each accession using the CTAB method of Doyle and Doyle with minor modifications [18]. Based on the results, concentrations of DNA extracts were standardized for AFLP analysis. The AFLP was performed using the method of Vos et al. [19] with modifications, using enzyme combination EcoRI/MseI. The AFLP primer combinations (MseI: 3 selective nucleotides, EcoRI:2 selective nucleotides) were labeled with infrared dyes IRD-700 and IRD-800 at the 5´ end, accompanied by three and two selective nucleotides at the 3´end. Briefly, 5 μl of extracted DNA at a concentration of approximately 50 ng/ μl genomic DNA was digested with EcoRI/MseI (1 U) and incubated at 37°C for 3 h. The fragments were ligated with T4 DNA ligase to EcoRI and MseI adapters at 37°C for 3 h followed by 4°C overnight. Ligated DNA was diluted 1:5 with water and used for pre-amplification. The pre-amplification reactions were performed using non-selective primers (E000 and M000) in a 25 μl reaction volume (containing: 3.75 μl of (1:3) diluted ligation product, 1 unit of Taq polymerase, 1X Taq polymerase buffer, 0.4 μM of each of the two primers, 150 μM of each of dATP, dCTP, dGTP and dTTP, and 2 mM MgCl2). This amplification was performed using the following cycling parameters: 25 cycles, each consisting of 1 min at 94°C, 1 min at 60°C, 2 min at 72°C and final extension was done at 72°C for 7 min. The preamplification products were diluted in the ratio 1:9 by sterile distilled water.

Selective amplification was performed using reaction amplification performed in a 25 μl reaction mixture volume containing: 3.75 μl of diluted pre-amplification product, 1x Taq polymerase buffer, 2 mM MgCl2, 1 Unit of Taq polymerase, 150 μM of dNTPs, and 0.4 μM of each of the two primers with two or three additional nucleotides at the 3´end.

PCR program was continued by 10 cycle of 94°C for 3 min for pre per denaturation and followed by 30 s at 95°C, 30 s at 63°C as touchdown with 1°C lowering for each cycle, 2 min at 72°C. The amplified products were run on a 6.5% polyacrylamide gel using DNA analyzer (LI-COR 4300, USA). The Sequences of all adaptors and primers are listed in S2 Table.

Data analysis

Amplified AFLP products were assembled into a binary matrix by scoring each fragment manually as presence (1) or absence (0) of a band across all 104 walnut cultivars for each primer combination. The variability parameters were assessed using POPGENE version 1.32 [20]. To determine which of the AFLP primer combinations has most effectively differentiated the genotypes, polymorphism information content (PIC), marker index (MI) and resolving power (RP) were calculated [21].

To investigate genetic differentiation among the walnut populations, analysis of molecular variance (AMOVA) was performed using Arlequin 3.11 software [22]. The analysis could be performed using intra-population and inter-population methods. In the first option statistical information would be extracted independently from each population, whereas in the second method, samples would be compared to each other.

To determine phylogenetic relationships among accessions, clustering analysis based on the repeated bisection (RB) method was performed using gCLUTO software (version 1.0, University of Minnesota, Twin Cities, MI, USA). This is a graphical application for clustering low- and high-dimensional datasets and analyzing the characteristics of the clusters. Principal coordinate analyses were also performed using GenAlEx ver 6.502 [23].

STRUCTURE software ver. 2.3.4 was used to analyze the population structure of the full germplasm collection. The number of clusters was selected after 10 independent runs of a burn-in period of 100,000 iterations and 100,000 MCMC repetitions for each value of K (k = 1–10). The optimum value of K was obtained by calculating the ΔK and highest LnP(D) value to determine the most likely number of groups [24]. The results from STRUCTURE were processed with the online software STRUCTURE HARVESTER v.0.6.1 to obtain the most acceptable K value.

Development of core collection

A core collection was developed using PowerOWERCore program [8], genotypic data and 17 qualitative traits (S1 Table). Four methods were used to determine core collection options; 1) genetic analysis of the entire collection, 2) phenotypic analysis based on qualitative traits of the entire collection, 3) phenotypic analysis based on quantitative traits of the entire collection and, 4) a combination of phenotypic and molecular variability by merging core collections. Categorical variables (genetic and phenotypic) were applied based on distinct characters. Continuous variables, i.e. quantitative traits, were classified into different categories by the software, based on Sturges’ rule [25].

Evaluation of the core collections

To evaluate the ability of each proposed core set to represent the full collection, the Mean Difference, Coincidence Rate, Variance Difference, Variable rate and Coverage (%) were calculated [16].

In addition, Shannon’s diversity index (I) and Nei’s gene diversity (H) values were calculated using POPGENE version 1.32 [20].

For a core collection to be considered representative of its primary collection, MD% should be less than 20% and CR% more than 80%. Lower VD values and higher VR values indicate a more effective core collection [8]. The core coverage CR% should exceed 80% of the full collection [26].

Principal coordinate analysis was used to assess segregation patterns in the full and core collections.

Results

Among the 13 evaluated primers, five primer combinations showed polymorphism with a total of 499 total and 197 polymorphic fragments (Table 1). The primer pair E-TG/M-CAG was the most efficient in discriminating the individuals with a polymorphism rate of 52.08%. The least discriminatory primer was the pair of E- CT/M-GAG with a polymorphism rate of 29.29%. For dominant markers such as AFLP, estimation of marker index, together with PIC value, has been used to assess the degree of informativeness of markers [21]. In this study, the range of PIC was varied from 0.106 to 0.169 with a general mean of 0.139. The marker index value (MI) for each primer pair was computed too. The mean value for MI in this study was 2.35 with a range from 0.90 (E- CT/M-GAG) to 4.37 (E-TG/M- CAG). In addition, the Shannon’s information index (I) was in concordance with PIC and MI. The highest and lowest Shannon index were belong to primer E-TG/M- CAG (I = 0.254) and CT/M-GAG (I = 0.157) respectively (Table 1). The Resolving power (RP) varied from 12.13 to 17.31. In summary, the primer pair E-TG/M-CAG was found to be the most effective in detecting genetic variation among walnut germplasm. Genetic diversity parameters for 9 populations are shown in Table 2. According to the results, the highest number of polymorphic loci (NP) was obtained in the Qazvin population (37.05% polymorphic loci) while the lowest NP was 63 in the Shahroud population (12.55% polymorphic loci). The observed number of alleles (Na) ranged from 1.342 to 1.125, and the effective number of alleles (Ne) varied from 1.228 in Qazvin to 1.091 in Shahroud population. The mean values for Na and Ne were 1.248 and 1.165, respectively. Likewise, the values for both Shannon’s information index and Nei’s gene diversity were highest in the Qazvin accessions (0.197 and 0.132, respectively). The Shannon’s information index and Nei’s gene diversity were lowest in the Shahroud accessions (0.075 and 0.051, respectively).

[Figure omitted. See PDF.]

Table 1. Variability parameters for five AFLP primer combinations.

https://doi.org/10.1371/journal.pone.0248623.t001

[Figure omitted. See PDF.]

Table 2. Estimated genetic diversity of nine walnut germplasm populations.

https://doi.org/10.1371/journal.pone.0248623.t002

Population structure analysis

To determine the structure of walnut populations and the genetic relationship among samples, analyses of- population structure, cluster analysis, and principal coordinate analysis, (PCoA) were performed. Cluster analysis provides an easy and effective way to evaluate genetic diversity [27]. AFLP data using the RB algorithm, grouped the accessions into six clusters (Fig 1). The highest and lowest number of accessions were belong to cluster 1 (40 accessions) and cluster 3 (10 accessions). Internal similarity (ISim) and external similarity (Esim) for K = 6, along with the sample size, are presented in S3 Table. Cluster 4 had the highest value for internal similarities (0.580), while Cluster 3 had the lowest (0.389) amount. Fig 1 shows a mountain visualization of the results for the six clusters.

[Figure omitted. See PDF.]

Fig 1. Mountain visualization of k-means clustering analysis combined with multidimensional scaling.

https://doi.org/10.1371/journal.pone.0248623.g001

As indicated by the distances between peaks, Cluster 6 had the lowest value for external similarities and was the farthest group from the other clusters. Cluster 4 had the lowest value for internal similarity.

To characterize collection structure, principal coordinate analysis (PCoA) was performed on the dataset of 104 genotypes/cultivars (S1 Fig). PCoA based on a similarity matrix explained 19.96% and 27.74% of variance on the first and second axis respectively (S1 Fig).

The genetic structure of walnut germplasm was analyzed by STRUCTURE software. Then the STRUCTURE output was submitted to STRUCTURE HARVESTER software to obtain the most likely K value. A clear pinpointed peak at K = 3 was observed, which classified the 104 accessions into three main groups (Fig 3). Fig 3 illustrates the level of admixture of each individual in the population. The first genetic group (in red) contained 21 individuals from Qazvin and Alborz. The second group (in green) consisted of 43 individuals, mainly from Touyserkan, Europe, Tabriz and a few from Qazvin. The third (in blue) included 50 individuals from Kerman, Shahroud, Urmia and some of the genotypes from Alborz.

The separation of populations by origination has not been seen typically in Persian walnut [23, 28]. For classification using multivariate methods, the cluster analysis (Fig 1) displays more complexity than STRUCTURE analysis. In general, both STRUCTURE and cluster analyses showed the same strong genetic division among walnut germplasm (Fig 1). Our results agreed with Ebrahimi et al [13], compared the genetic diversity of Juglans regia L. growing in the cold temperate region of the eastern U.S.A. with J. regia growing in the cold Mediterranean regions of Europe. Their results indicated that ‘‘Early Mature” walnuts were exhibiting relatively high levels of genetic diversity and accessions were genetically different from ‘‘Normal Growth” group.

Partitioning the variation within and between populations using the analysis of molecular variance (AMOVA) showed that 93.98% of the genetic variability was existed within while 6.32% of variation was between populations (S4 Table). Similarly, Wang et al [29] found that 81.4% of genetic diversity was within and 18.6% was between the walnut populations from Central and South western China. Nei’s genetic distances for these nine populations were shown in Fig 2.

[Figure omitted. See PDF.]

Fig 2. Nei’s genetic distance coefficient for the nine walnut populations.

https://doi.org/10.1371/journal.pone.0248623.g002

Qualitative phenotypic traits

S5 Table displays the range, median and mean values for the evaluated nut traits, their coefficient of variation and the Shannon index values. Among the analyzed characters, the shape in longitudinal section through the suture showed the highest coefficient of variation (CV = 57.17%). Light kernel color has high economic value and is therefore very important in selection of new cultivars [25]. The median value for kernel color was 5 (medium color) with CV = 34.76%. Other important kernel traits are kernel size and kernel removal. The median for kernel removal was 3, indicating the kernels of most genotypes easily separable from the shell. Adherence of the two halves of the shell is another important trait [25]. Nuts with poor shell seal are more easily damaged by pests during storage [30]. The median for this trait was 5 (medium) with CV = 33.02%. All these traits have been considered in selecting promising genotypes for walnut breeding programs [31].

Among the analyzed traits, the shape in longitudinal section through suture showed the highest Shannon diversity index (H´ = 0.934) (S5 Table).

For explanation of the measured character symbols, see S1 Table

The dendrogram for nut traits, based on neighbor-joining method, classified the accessions into three major clusters (Fig 3).

[Figure omitted. See PDF.]

Fig 3. Genetic classification of J. regia accessions using the Neighbor-Joining method.

The genotypes selected for a core collection are illustrated in red.

https://doi.org/10.1371/journal.pone.0248623.g003

Development and evaluation of core collections

To determine a core collection, initial selections were made based on use of AFLP (CC1), quantitative phenotypic traits (CC2), and qualitative phenotypic data (CC3). Then, these three core collections (CC1–CC3) were merged to generate a composite core collection (CC4) (Fig 4). Kumar et al. [5] suggested that in order to capture the maximum range of allelic diversity/traits in a core set, and to prevent trade-off between two data types when used together, it is better to combine phenotypic and molecular variability. Therefore, three core collections, CC1 (27 accessions), CC2 (13 accessions), and CC3 (18 accessions) were combined to form a non-redundant composite core collection referred to as CC4. CC4 was comprised of 46 accessions from Alborz, Kerman, Qazvin, Shahroud, Tabriz, Touyserkan and Urmia, USA and Europe (except shahroud walnut populations (Table 3). The CC4 showed a 100% coverage value for the different phenotypic and genetic variables under consideration.

[Figure omitted. See PDF.]

Fig 4. Flowchart describing the steps for development of a core collection for walnut.

Numerical values indicate the number of accessions in respective cores.

https://doi.org/10.1371/journal.pone.0248623.g004

[Figure omitted. See PDF.]

Table 3. Representation from different regions in the developed walnut Core Collections (CC).

https://doi.org/10.1371/journal.pone.0248623.t003

For a core collection to be considered representative, the MD% must be less than 20% and CR% must be > 80% [5]. In addition, more effective core collection must have a lower VD and higher VR (more than 100%) [8]. The composite core collection (CC4) was built using a combination of the genotypic and phenotypic data (Table 3).

The Shannon-Weaver diversity index range (I) for all core collections varied from 0.338 to 0.497. The Nei’s genetic diversity (H) ranged from 0.203 to 0.328 (Table 4).

[Figure omitted. See PDF.]

Table 4. Evaluation indices for the developed core collections.

https://doi.org/10.1371/journal.pone.0248623.t004

The composite collection, CC4, provided a more logical and exhaustive representation of all the phenotypic and genetic variability of the independent core collections (CC1–CC3).

PCA was performed to validate and confirm the distribution of the four core collections. The distribution of the individuals in these collections was explained by the first two principal components in Fig 5.

[Figure omitted. See PDF.]

Fig 5. PCA graphs depicting spread of members of the composite and the three independent core collections.

https://doi.org/10.1371/journal.pone.0248623.g005

Discussion

Previous studies have shown that AFLP markers could be a tool for characterization of genetic diversity and population structure in walnut [15, 32]. In this study, the phenotypic traits and AFLP molecular markers were combined to characterize the genetic diversity of a walnut collection and to suggest a core collection. The polymorphism detected by the five AFLP primer combinations used in this study is higher than was reported by some researchers [32, 33] while less than another [34]. Nicese et al [35], using 18 RAPD primers and 19 walnut genotypes, observed 23 polymorphic fragments corresponding to about 25% of the polymorphism. Dadras et al [14], using 20 RAPD primers, scored 3.1 polymorphic bands per primer in characterizing of 82 walnut accessions. Pop et al [36] obtained 76.3% polymorphism using 25 RAPD primers in 20 walnut accessions.

The three nucleotide extensions (M-GAG and M-CAG) can also be used to develop Sequence Tagged Site (STS) markers for the identification and tagging of the germplasm. In order to determine the utility of these markers, Polymorphic Information Content (PIC), Resolving Power (RP) and Marker Index (MI) were calculated [37]. The primer combinations used in this study exhibited RP values in the range of 12.13–17.31 (Table 1). The observed range of RP values for the AFLP primer combinations was greater than the result obtained by other researches [32, 38]. Additionally there was a strong linear relationship between the ability of a primer combination to distinguish genotypes and RP values [23]. The primer combination of E-TG × M-CAG, with the highest RP value and polymorphism, was determined to be the most informative combination for estimating the genetic-diversity. This primer combination also had the highest RP value and polymorphism in apricot and peach [39].

In this study the mean higher PIC value than previously observed [32], indicated higher variation among these walnut genotypes. For dominant markers, such as AFLP, estimated marker index in combination with PIC value has been used to assess the informativeness of markers. Because AFLP markers provide a large number of polymorphic fragments, they can assist efficient evaluation of genetic diversity and provide a valuable tool for breeding programs [40].

Population differentiation and structure

Based on the qualitative phenotypic traits, the 104 walnut genotypes classified into three major clusters using the Neighbor-Joining method, while the mountain visualization of k-means clustering method with AFLP data grouped them into six clusters. Both AFLP and the phenotypic method, with classifying the genotypes of different origins together, showed no clear relationship with geographic origin. Principal coordinate and Structure analysis based on AFLP data also produced three groups.

Structure analysis is a widely used method for inference of hidden population structure in plant species [41]. In this study, three major subpopulations were identified which were not corresponding with geographical origin.

PCoA clustered these genotypes into three main groups and confirmed the K value of the structure analysis.

Structure analysis and PCoA showed similar genetic divisions among the sampled sites that is similar to others reports [13, 42]. The poor association between the molecular marker data and the geographic origin of the genotypes, has also been reported in previous studies [34, 43].

There were few differences in genetic diversity parameters between the nine studied walnut populations. The AMOVA attributed more than 93.98% of the diversity to individuals, a level similar to that found by Aradhya et al [42] (86%), Ebrahimi et al [13] (85%), and Christopoulos et al [44] (89%).

Development and size of core collections

In recent years, considerable advances in molecular markers have enabled their utilization for development of core collections [4, 5, 45]. It is clear that, the molecular markers are useful more, when would be used together with the morphological markers. Molecular markers just reflect the DNA attributes, while the morphological markers can be affected by genetic and by epigenetic variation which is not considered if DNA will be used alone.

There are a lot of methods for producing of core collections, including the random method, principal component scoring, the distance-based methods such as Core Hunter, the Maximum Length Sub Tree method (MLST), maximization of the allelic diversity using MSTRAT and PowerCore [46]. Some studies have emphasized use of a maximization strategy for development of robust core collections [47]. Maximization with heuristic searching is considered to be a powerful approach for maintaining diverse and maximum number of alleles at each locus [2]. PowerCore programs have been used successfully to construct core collections with high genetic diversity for various plant species [4, 6, 48].

Various additional information has been used to form core collections, including phenotypic and ecogeographical traits and molecular markers, either alone or in combination [3, 5]. Use of genotypic or phenotypic information alone for the establishment of core collections may not efficiently capture the entire genetic diversity of a species. Therefore, a combination of them was used in our study for the construction of a walnut core collection.

In our study, the four generated core collections efficiently captured the entire range of trait variability. Many accessions were common between different core collections. For instance, 12 accessions were common between CC1, CC2 and CC3. Only 9 accessions were unique to CC1, 7 to CC2, and 18 to CC3. The presence of common accessions between core collections using different types of data indicates an overlap in genetic and phenotypic components of accessions [5]. These constitute a subset of genotypes/cultivars that are extremely diverse at both the molecular and phenotypic level.

Kumar et al [5], argued that in order to capture the maximum range of allelic diversity/traits in a core collection and to prevent trade-off between two data types, it is better to combine phenotypic and molecular variability by merging core collections derived from each type of data separately. For this reason, the core collections were merged to derive a more robust and non-redundant composite core collection (CC4) (Fig 4). The indices (MD%, VD%, VR%, CR%, I, H) for CC4 reflect the composite core collection effectiveness in capturing the diversity of the full walnut collection (Table 4).

Conclusions

This study demonstrates the usefulness of AFLP markers in characterizing the genetic variation and population structure of the walnut collection and use of this information for creating a core collection. This study is the first attempt in walnut, in which the molecular diversity has been used in conjunction with phenotypic data to develop a core collections. The walnut core collection will provide access to a genetically diverse and important germplasm that can facilitate characterization of the genetic determinants of trait variability. This information can be used to design more effective breeding programs.

Supporting information

S1 Data.

https://doi.org/10.1371/journal.pone.0248623.s001

(XLSX)

S1 Fig. Principal coordinate analysis of the accessions based on AFLP markers.

https://doi.org/10.1371/journal.pone.0248623.s002

(DOCX)

S2 Fig. Pattern of individual assignments into three subsets (K = 3) using the STRUCTURE software.

Each individual is shown by a vertical line with one to three colored segments, according to its estimated membership probabilities (Q).

https://doi.org/10.1371/journal.pone.0248623.s003

(DOCX)

S1 Table. Descriptors for the qualitative traits utilized.

https://doi.org/10.1371/journal.pone.0248623.s004

(DOCX)

S2 Table. Sequences of oligonucleotide adaptors and primers used for AFLP.

https://doi.org/10.1371/journal.pone.0248623.s005

(DOCX)

S3 Table. Internal and external similarity measures of groups and membership of walnut in each cluster corresponding to Fig 1.

https://doi.org/10.1371/journal.pone.0248623.s006

(DOCX)

S4 Table. Analysis of molecular variance (AMOVA) on based on AFLP markers of 104 accessions.

https://doi.org/10.1371/journal.pone.0248623.s007

(DOCX)

S5 Table. Range median, mean, coefficient of variation and Shannon Diversity Index for the traits evaluated.

https://doi.org/10.1371/journal.pone.0248623.s008

(DOCX)

Acknowledgments

We gratefully acknowledge Dr. R. Ghaffari, Mis. Farsi and Dr. A. Soleimani for their cooperation.

Citation: Mahmoodi R, Dadpour MR, Hassani D, Zeinalabedini M, Vendramin E, Leslie CA (2021) Composite core set construction and diversity analysis of Iranian walnut germplasm using molecular markers and phenotypic traits. PLoS ONE 16(3): e0248623. https://doi.org/10.1371/journal.pone.0248623

References

1. Brown A. Core collections: a practical approach to genetic resources management. Genome. 1989;31(2):818–24.

2. Thies JA, Fery RL. Evaluation of a core of the US Capsicum germplasm collection for reaction to the Northern root-knot nematode. HortScience. 2002;37(5):805–10.

3. Zhang C, Chen X, Zhang Y, Yuan Z, Liu Z, Wang Y, et al. A method for constructing core collection of Malus sieversii using molecular markers. Scientia Agricultura Sinica. 2009;42(2):597–604.

4. Belaj A, del Carmen Dominguez-García M, Atienza SG, Urdíroz NM, De la Rosa R, Satovic Z, et al. Developing a core collection of olive (Olea europaea L.) based on molecular markers (DArTs, SSRs, SNPs) and agronomic traits. Tree Genetics & Genomes. 2012;8(2):365–78.

5. Kumar S, Ambreen H, Variath MT, Rao AR, Agarwal M, Kumar A, et al. Utilization of molecular, phenotypic, and geographical diversity to develop compact composite core collection in the oilseed crop, safflower (Carthamus tinctorius L.) through maximization strategy. Frontiers in plant science. 2016;7:1554. pmid:27807441

6. Lee H-Y, Ro N-Y, Jeong H-J, Kwon J-K, Jo J, Ha Y, et al. Genetic diversity and population structure analysis to construct a core collection from a large Capsicum germplasm. BMC genetics. 2016;17(1):142. pmid:27842492

7. Yun W, Ban S, Kim G, Kim J, Kwon S, Choi C. Assessment of apple core collections constructed using phenotypic and genotypic data. Genetics and Molecular Research. 2015;14(2):6453–64. pmid:26125850

8. Kim K-W, Chung H-K, Cho G-T, Ma K-H, Chandrabalan D, Gwag J-G, et al. PowerCore: a program applying the advanced M strategy with a heuristic search for establishing core sets. Bioinformatics. 2007;23(16):2155–62. pmid:17586551

9. Dhanaraj AL, Rao EB, Swamy K, Bhat M, Prasad DT, Sondur SN. Using RAPDs to assess the diversity in Indian cashew (Anacardium occidentale L.) germplasm. The Journal of Horticultural Science and Biotechnology. 2002;77(1):41–7.

10. Garcia‐Lor A, Luro F, Ollitrault P, Navarro L. Comparative analysis of core collection sampling methods for mandarin germplasm based on molecular and phenotypic data. Annals of Applied Biology. 2017;171(3):327–39.

11. Mahmoodi R, Dadpour MR, Hassani D, Zeinalabedini M, Vendramin E, Micali S, et al. Development of a core collection in Iranian walnut (Juglans regia L.) germplasm using the phenotypic diversity. Scientia Horticulturae. 2019;249:439–48.

12. LI Y-x, GAO Q-j, LI T-h. Sampling strategy based on fruit characteristics for a primary core collection of peach cultivars [J]. Journal of Fruit Science. 2006;3:359–64.

13. Ebrahimi A, Zarei A, Fardadonbeh MZ, Lawson S. Evaluation of genetic variability among “Early Mature” Juglans regia using microsatellite markers and morphological traits. PeerJ. 2017;5:e3834. pmid:29085742

14. Dadras AR, Sabouri H, Nejad GM, Sabouri A, Shoai-Deylami M. Association analysis, genetic diversity and structure analysis of tobacco based on AFLP markers. Molecular Biology Reports. 2014;41(5):3317–29. pmid:24488320

15. Xu Z, Hu T, Zhang F. Genetic diversity of walnut revealed by AFLP and RAPD markers. J Agr Sci. 2012;4:271–6.

16. Zhu T, Wang L, You FM, Rodriguez JC, Deal KR, Chen L, et al. Sequencing a Juglans regia× J. microcarpa hybrid yields high-quality genome assemblies of parental species. Horticulture research. 2019;6(1):1–16.

17. Shannon CE. A mathematical theory of communication. ACM SIGMOBILE mobile computing and communications review. 2001;5(1):3–55.

18. Doyle JJ, Doyle JL. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. 1987.

19. Vos P, Hogers R, Bleeker M, Reijans M, Lee Tvd, Hornes M, et al. AFLP: a new technique for DNA fingerprinting. Nucleic acids research. 1995;23(21):4407–14. pmid:7501463

20. Yeh FC, Yang R, Boyle T, Ye Z, Mao JX. POPGENE, version 1.32: the user friendly software for population genetic analysis. Molecular Biology and Biotechnology Centre, University of Alberta, Edmonton, AB, Canada. 1999.

21. Mohammadi SA, Shokrpour M, Moghaddam M, Javanshir A. AFLP-based molecular characterization and population structure analysis of Silybum marianum L. Plant Genetic Resources. 2011;9(3):445.

22. Excoffier L, Lischer HE. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Molecular ecology resources. 2010;10(3):564–7. pmid:21565059

23. Shamlu F, Rezaei M, Lawson S, Ebrahimi A, Biabani A, Khan-Ahmadi A. Genetic diversity of superior Persian walnut genotypes in Azadshahr, Iran. Physiology and Molecular Biology of Plants. 2018;24(5):939–49. pmid:30150868

24. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Molecular ecology. 2005;14(8):2611–20. pmid:15969739

25. McGranahan G, Leslie C. Breeding walnuts (Juglans regia). Breeding plantation tree crops: Temperate species: Springer; 2009. p. 249–73.

26. Hu J, Zhu J, Xu H. Methods of constructing core collections by stepwise clustering with three sampling strategies based on the genotypic values of crops. Theoretical and Applied Genetics. 2000;101(1–2):264–8.

27. Belamkar V, Selvaraj MG, Ayers JL, Payton PR, Puppala N, Burow MD. A first insight into population structure and linkage disequilibrium in the US peanut minicore collection. Genetica. 2011;139(4):411. pmid:21442404

28. Foroni I, Woeste K, Monti L, Rao R. Identification of ‘Sorrento’walnut using simple sequence repeats (SSRs). Genetic Resources and Crop Evolution. 2007;54(5):1081–94.

29. Wang Y, Zhang J, Sun H, Ning N, Yang L. Construction and evaluation of a primary core collection of apricot germplasm in China. Scientia Horticulturae. 2011;128(3):311–9.

30. Hassan D, Mozaffari M, Souraki Y, Soleimani A, Loni A. Vegetative and reproductive traits of some Iranian local and foreign cultivars and genotypes of walnut (Juglans regia L.). Seed and Plant Improvement Journal. 2013;29(4).

31. Eskandari S, Hassani D, Abdi A, editors. Investigation on genetic diversity of Persian walnut and evaluation of promising genotypes. V International Walnut Symposium 705; 2004.

32. Kafkas S, Ozkan H, Sutyemez M. DNA polymorphism and assessment of genetic relationships in walnut genotypes based on AFLP and SAMPL markers. Journal of the American Society for Horticultural Science. 2005;130(4):585–90.

33. Shrestha MK, Volkaert H, Straeten DVD. Assessment of genetic diversity in Tectona grandis using amplified fragment length polymorphism markers. Canadian Journal of Forest Research. 2005;35(4):1017–22.

34. Sreekanth P, Balasundaran M, Nazeem P, Suma T. Genetic diversity of nine natural Tectona grandis Lf populations of the Western Ghats in Southern India. Conservation genetics. 2012;13(5):1409–19.

35. Nicese F, Hormaza J, McGranahan G. Molecular characterization and genetic relatedness among walnut (Juglans regia L.) genotypes based on RAPD markers. Euphytica. 1998;101(2):199–206.

36. Pop IF, Vicol AC, Botu M, Raica PA, Vahdati K, Pamfil D. Relationships of walnut cultivars in a germplasm collection: comparative analysis of phenotypic and molecular data. Scientia Horticulturae. 2013;153:124–35.

37. Vaishnaw V, Mohammad N, Wali SA, Kumar R, Tripathi SB, Negi MS, et al. AFLP markers for analysis of genetic diversity and structure of teak (Tectona grandis) in India. Canadian Journal of Forest Research. 2015;45(3):297–306.

38. Fatahi R, Ebrahimi A, Zamani Z. Characterization of some Iranians and foreign walnut genotypes using morphological traits and RAPD markers. Hortic Environ Biotechnol. 2010;51(1):51–60.

39. Gharaghani A, Solhjoo S, Oraguzie N. A review of genetic resources of almonds and stone fruits (Prunus spp.) in Iran. Springer; 2017.

40. Zhang H-Y, Liu X-Z, Li T-S, Yang Y-M. Genetic diversity among flue-cured tobacco (Nicotiana tabacum L.) revealed by amplified fragment length polymorphism. Bot Stud. 2006;47(3):223–9.

41. Xiao Y, Cai D, Yang W, Ye W, Younas M, Wu J, et al. Genetic structure and linkage disequilibrium pattern of a rapeseed (Brassica napus L.) association mapping panel revealed by microsatellites. Theoretical and Applied Genetics. 2012;125(3):437–47. pmid:22437490

42. Aradhya M, Woeste K, Velasco D, editors. Genetic diversity, structure and differentiation in cultivated walnut (Juglans regia L.). VI International Walnut Symposium 861; 2009.

43. Dangl GS, Woeste K, Aradhya MK, Koehmstedt A, Simon C, Potter D, et al. Characterization of 14 microsatellite markers for genetic analysis and cultivar identification of walnut. Journal of the American Society for Horticultural Science. 2005;130(3):348–54.

44. Christopoulos MV, Rouskas D, Tsantili E, Bebeli PJ. Germplasm diversity and genetic relationships among walnut (Juglans regia L.) cultivars and Greek local selections revealed by Inter-Simple Sequence Repeat (ISSR) markers. Scientia Horticulturae. 2010;125(4):584–92.

45. El Bakkali A, Haouane H, Moukhli A, Costes E, Van Damme P, Khadari B. Construction of core collections suitable for association mapping to optimize use of Mediterranean olive (Olea europaea L.) genetic resources. PLoS One. 2013;8(5):e61265. pmid:23667437

46. Odong T, Jansen J, Van Eeuwijk F, van Hintum TJ. Quality of core collections for effective utilisation of genetic resources review, discussion and interpretation. Theoretical and Applied Genetics. 2013;126(2):289–305. pmid:22983567

47. McKhann HI, Camilleri C, Bérard A, Bataillon T, David JL, Reboud X, et al. Nested core collections maximizing genetic diversity in Arabidopsis thaliana. The Plant Journal. 2004;38(1):193–202. pmid:15053772

48. Zhang Y, Zhang X, Che Z, Wang L, Wei W, Li D. Genetic diversity assessment of sesame core collection in China by phenotype and molecular markers and extraction of a mini-core collection. BMC genetics. 2012;13(1):102. pmid:23153260

Word count: 6056

Show less

© 2021 Mahmoodi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

Iran is a center of origin and diversity for walnuts (Juglans regia L.) with very good potential for breeding purposes. The rich germplasm available, creates an opportunity for study and selection of the diverse walnut genotypes. In this study, the population structure of 104 Persian walnut accessions was assessed using AFLP markers in combination with phenotypic variability of 17 and 18 qualitative and quantitative traits respetively. The primers E-TG/M-CAG, with high values of number of polymorphic bands, polymorphic information content, marker index and Shannon’s diversity index, were the most effective in detecting genetic variation within the walnut germplasm. Multivariate analysis of variance indicated 93.98% of the genetic variability was between individuals, while 6.32% of variation was among populations. A relatively new technique, an advanced maximization strategy with a heuristic approach, was deployed to develop the core collection. Initially, three independent core collections (CC1–CC3) were created using phenotypic data and molecular markers. The three core collections (CC1–CC3) were then merged to generate a composite core collection (CC4). The mean difference percentage, variance difference percentage, variable rate of coefficient of variance percentage, coincidence rate of range percentage, Shannon’s diversity index, and Nei’s gene diversity were employed for comparative analysis. The CC4 with 46 accessions represented the complete range of phenotypic and genetic variability. This study is the first report describing development of a core collection in walnut using molecular marker data in combination with phenotypic values. The construction of core collection could facilitate the work for identification of genetic determinants of trait variability and aid effective utilization of diversity caused by outcrossing, in walnut breeding programs.

Details

Title

Composite core set construction and diversity analysis of Iranian walnut germplasm using molecular markers and phenotypic traits

Author

Mahmoodi, Razieh; Dadpour, Mohammad Reza; Hassani, Darab; Zeinalabedini, Mehrshad; Vendramin, Elisa; Leslie, Charles A

First page

e0248623

Section

Research Article

Publication year

2021

Publication date

Mar 2021

Publisher

Public Library of Science

e-ISSN

19326203

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1371/journal.pone.0248623

ProQuest document ID

2501837841

Composite core set construction and diversity analysis of Iranian walnut germplasm using molecular markers and phenotypic traits

Jump to:

Full text

Abstract

Details

Suggested sources