ABSTRACT
The aim of this study was to investigate the genetic variability among early-maturing and extremely early-maturing corn cultivars in relation to grain productivity, crude protein content and amino acid profile to obtain future crosses of cultivars with improved nutritional characteristics. Data from two experiments, which were both conducted in randomized block design and three replicates. Thirty-six early-maturing and 22 extremely early-maturing cultivars were evaluated of the experiments. Grain productivity, crude protein, lysine, methionine, cysteine, threonine, tryptophan, valine, isoleucine, leucine, phenylalanine, histidine and arginine were evaluated. Analysis of variance was performed for each variable and means were compared by Scott-Knott test. Mahalanobis generalized distance matrix was calculated, and the cultivars were grouped using the unweighted pair group method with arithmetic mean (UPGMA). A dendrogram was then constructed, and the cophenetic correlation coefficient was calculated. The group means were compared using the t-test for independent samples. Genetic variability was detected in 13 variables for both maturity groups. For the early-maturing and extremely early-maturing cultivars, five and four groupings were obtained, respectively, based on grain productivity, crude protein content and amino acid profile, indicating that the crossbreeding of these cultivars may be used to improve the grain protein content and amino acid profile.
Key words: Zea mays L., genetic variability, nutritional variability, clustering analysis, near-infrared reflectance spectroscopy.
Studies on the variability of the nutritional composition of corn are important because they contribute to increase nutrient use efficiency and minimize environmental damage. Although corn is a cereal used for energy purposes, studies on its protein contribution, particularly those related to its profile of essential amino acids are needed. Studies in this area are relevant because corn is the most expensive nutrient in the feed formulation (Jompuk et al., 2011), resulting in significant effects on production costs. In this direction, Pollak and Scott (2005) emphasized that breeding for grain quality provides end users with grain better suited to their needs and the improved amino acid balance lowers the cost of animal feed.
In recent decades, genetic improvement in corn has mainly focused on increasing grain productivity (PROD), which may have diminished the nutritional value of grain. For instance, Idikut et al. (2009) and Aliu et al. (2012) demonstrated that hybrids and corn populations show an inverse relationship between PROD and the crude proteins (CP) content of grain, and from 1930 to 1991, the level of this nutrient decreased 0.3% per decade (Duvick, 2005). Consequently, studies have been conducted to improve the protein quality and amino acid profile of corn grains. A review performed by Prasanna et al. (2001) shows that the QPM (Quality Protein Maize) cultivars with high quality protein provide benefits for human nutrition due to a significant increase on lysine and tryptophan levels compared to conventional corn cultivars. In order to improve the amino acid composition, Shewry (2007) demonstrates the possibility of using genetic engineering strategies to increase both the total protein content and composition of essential amino acids or even for exploitation of natural high lysine mutants.
Study done by Mittelmann et al. (2011), evaluating ten Brazilian corn populations under three environments, showed that these populations differ in relation to most of the nutritional traits evaluated, indicating the possibility of selecting genotypes with improved nutritional quality. In the study developed by Kil et al. (2014), with yellow corn from the United States and with yellow and white corn from South Africa, was observed that the concentrations of CP and AA (amino acids) varied among corn from different origins. Also, in a study of genetic divergence among modern inbred lines, teosinte accessions, and landraces, Flint-Garcia et al. (2009) verified that teosinte has very small seeds, but twice the protein content of landraces and inbred lines and a lower carbohydrate content, being considered a source of genetic variability for maize breeding programs.
Another important characteristic to be considered in a breeding program for corn is the texture of the grain, which is characteristic of fundamental importance to the industry, producers and processors of grain. Once the proteins, grain texture and hardness grains are inter-related, it is worth considering these features in a breeding program in order to address the storage pest resistance in maize (Mwololo et al., 2013). From the viewpoint of processing to feed grains, grains hard texture consume more energy in milling process and the difficulties in obtaining uniform grain size can affect the digestion of swine (Lawrence, 1972). In developing countries, is commonly used hard and dent grain, but the emergence of mutants with alterations in protein synthesis, thus causing changes in the texture of the grains, which is not always desirable (Sofi et al., 2009).
The study of genetic diversity is the basis of genetic improvement programs for corn plants (Subramanian and Subbaraman, 2010; Cruz and Carneiro, 2006). Clustering analysis using hierarchical methods is among the most commonly applied techniques for the description of genetic diversity patterns. By this technique, it is possible to classify individuals into distinct groups according to the variables of interest (Cruz and Carneiro, 2006) and to identify individuals that can be crossbred to maximize the gain in heterosis or hybrid vigor in the F1 generation.
To estimate the distance between cultivars in replicated experiments, the Mahalanobis generalized distance (D2), which takes into account the correlations between characters, is the most appropriate (Singh, 1981). These distances can be represented later in a dendrogram generated by different methods, such as the average linkage between groups clustering method [unweighted pair group method with arithmetic mean (UPGMA)], which is considered appropriate for the grouping of corn cultivars (Cargnelutti Filho and Guadagnin, 2011). This clustering method was used by Alves et al. (2014) to assess the variability of grain productivity and energy profile of maize cultivars. They indicated that it is possible to plan crossings between groups of genotypes in terms of productivity, ethereal extract and amylose, with the goal of maximizing heterosis. The UPGMA clustering method was also used by Osorno and Carena (2008) to determine the genetic diversity in early maturity maize, based on quality (percentage of protein, oil, and starch) and agronomic traits.
Despite the large number of genotypes, hybrids, cultivars and strains of corn grown, there is still missing data concerning their genetic variability in relation to PROD and nutritional characteristics. The aim of the present study was to investigate the genetic variability among early-maturing and extremely early-maturing cultivars of corn in relation to PROD and the CP content and the amino acid profiles of grain, thereby providing a basis for obtaining cultivars with improved nutritional characteristics in future crossbreeding studies.
Field experiments
Two experiments were conducted with corn (Zea mays L.) cultivated during the 2009/2010 agricultural year in the experimental area of the Department of Plant Science, in the Federal University of Santa Maria, Santa Maria, Rio Grande do Sul State, at 29°42'S, 53°49'W and an altitude of 95 m. The experimental area is included in the physiographic region of the Central Depression of the state of Rio Grande do Sul, Brazil. Fundamentally, the region is characterized by a humid subtropical climate, type Cfa according to the Köppen classification, and the soil is classified as Paleudalf (Embrapa, 2006), with a sandy loam surface texture.
The first experiment included 36 early-maturing corn cultivars, and the second included 22 extremely early-maturing corn cultivars. The cultivars used in each of the experiments (early-maturing and extremely early-maturing) belonged to the maize cultivar competition testing of the State of Rio Grande do Sul, Brazil. The relation of the cultivars included in each of the experiments was determined by the Agriculture Research State Foundation (Fundação Estadual de Pesquisa Agropecuária - FEPAGRO), which coordinates the maize cultivars evaluation testing in Rio Grande do Sul, based on information provided by the companies responsible for these cultivars, with respect to the cycle (early-maturing and extremely early-maturing). Further details of each cultivar, as commercial name, genetic basis, company and grain texture are presented in Table 1.
In both experiments, a randomized block design with three replications was used. The experimental plots consisted of two rows each 5 m long and spaced 0.8 m apart. Sowing was performed manually on October 26, 2009, with basic fertilization at a rate of 37.5 kg?ha−1 of nitrogen (N), 150 kg?ha−1 of phosphorus (P2O5) and 150 kg?ha−1 of potassium (K2O). At the time of sowing, two seeds were placed every 0.20 m in each row, and after emergence, the plants were thinned to adjust the population to 62,500 plants ha−1. The plants emerged between days 01 and 03 of November 2009 and were harvested on March 15, 2010. N fertilizer was applied as a top-dressing when plants were at the 3, 5 and 10 leaves, totaling 200 kg?ha−1 of N (Fancelli and Dourado Neto, 2004). All other cultivation practices were performed as locally recommended for the corn crop. During the experimental period, rainfall totaled 1,374 mm, and the average daily air temperature was 23.9°C [data collected at a conventional weather station belonging to the 8th District of Meteorology of the National Meteorological Institute (8° DISME/INMET), located 50 m from the experimental area] (Figure 1).
Measurement of variables and laboratory procedures
After the corn cob harvest, PROD was expressed in t?ha−1 for each repetition of each cultivar at 13% humidity. A sample of 500 g was collected, wrapped in a paper bag and maintained in a forced-air incubator (50°C) until 10% moisture was attained. After drying, the grain samples were ground in a micromill (model MA-630, Marconi, Piracicaba, São Paulo, Brazil) to obtain a particle size between 0.3 and 0.5 mm. Based on the percentage of raw material (% RM) in each milled sample, crude protein (CP) and the amino acids lysine (Lys), methionine (Met), cysteine (Cys), threonine (Thr), tryptophan (Trp), valine (Val), isoleucine (Ile), leucine (Leu), phenylalanine (Phe), histidine (His) and arginine (Arg) were quantified using near infrared reflectance spectroscopy (NIR - Near Infrared reflectance), with specific methodology in equipment manual. The analyses were performed by the Laboratory CEAN - ADISSEO using the XDS EQUIPMENT FOSS® 2.5002, with wavelength range from 400 nm to 2500 nm, with calibration adjusted by analytical procedure CEAN 010 Adisseo Brazil SA.
Verification of assumptions, analysis of variance and comparison of means
For each variable measured in each experiment, analysis of variance was performed according to the mathematical model of a randomized complete block design given by: Yij = μ + τi + βj + εij, where Yij is the observed value of the response variable in the plot ij, μ is the overall mean, τi is the effect of cultivar, βj is the effect of block and εij is the effect of experimental error (Storck et al., 2011). To ensure the reliability of the analysis of variance results, the assumptions of the mathematical model were verified. For this, the Kolmogorov-Smirnov test for verifying the normality of the errors (Campos, 1983) and Levene's test for checking the homogeneity of residual variances among the treatments (Steel et al., 1997) were performed. In addition, the selective accuracy (SA) was estimated by the equation SA = (1 (1 / Fc))0.5 and used to evaluate the experimental accuracy, in accordance to the limits postulated by Cargnelutti Filho and Storck (2009). SA can alternatively be obtained by square root of the heritability and accuracy is considered very high when the SA≥0.90, high when 0.70≤SA<0.90, moderate when 0.50≤SA<0.70 and low when the SA<0.50 (Cargnelutti Filho and Storck, 2009). After, the means of the cultivars were compared using the Scott-Knott test (1974) at the 5% probability level.
Genotypic correlation matrix and diagnosis of multicollinearity
In each experiment (early and extremely early maturing cultivars), a genotypic correlation matrix between the 13 variables was calculated, and the multicollinearity was diagnosed to identify and eliminate highly correlated variables. The degree of multicollinearity of the genotypic correlation matrix was evaluated based on the condition number (CN). CN an efficiente procedure for the diagnosis of multicollinearity should directly reflect the intensity of its effects and allow the identification of the independent variables involved whit this problem (Montgomery and Peck, 1982). The decision criterion used was based on the classification proposed by Montgomery and Peck (1982) and described by Cruz and Carneiro (2006).
Analysis and validation of clusters
For the cluster analysis, a Mahalanobis generalized distance matrix (D2) between the cultivars of each experiment was determined first. Then, the D2 matrix was used on a relative scale as the dissimilarity measure for cluster analysis of the cultivars by the average linkage between groups hierarchical method (UPGMA) (Cruz and Carneiro, 2006). Next, a dendrogram was constructed, and the cophenetic correlation coefficient (CCC) was calculated. Thereafter, using 50% dissimilarity (Cruz, 1990) as the cutoff criterion for the dendrogram, the cultivars were identified for each group. This percentage was set at 50%, because groups formed with more than 50% of genetic variability would present little practicality, due to the large genetic differentiation between genotypes considered in homogeneous groups (Barroso and Arts, 2003). The mean of each variable was then estimated within each group, and subsequently for grouping validation, the group means were compared two by two with Student's t test for independent samples. Next, these means were transformed into vectors, and the highest mean was considered equal to one. For each experiment, the group means were represented on a graph of means profiles (Barroso and Artes, 2003). In addition, multivariate analysis of variance (MANOVA) at a 5% probability level was used to test for differences between the profiles of group means using Wilk's criterion. The statistical analysis was performed using Genes (Cruz, 2013) and Bioestat 5.0 (Ayres et al., 2007) software and the Microsoft Office Excel application.
Assumptions verification and analysis of variance
In the experiment with the early maturing cultivars, the errors in nine of the 13 variables (69.23%) conformed to a normal distribution based on the Kolmogorov-Smirnov test. In the experiment with the extremely early-maturing cultivars, the errors in 12 variables (92.31%) conformed to the normal distribution (Table 2). In both experiments, the residual variances of the variables were homogeneous by Levene's test. Therefore, in general, the assumptions of normality of errors and homogeneity of variances were met, which offered credibility to the results of the analysis of variance.
The high values for selective accuracy (SA) indicated that the experimental precision was high at 7.69% (one variable) and 15.38% (two variables) for the 13 variables measured in the experiments with the early and extremely early-maturing cultivars, respectively (Table 2). Likewise, the precision was quite high at 92.31% and 84.62% for the variables measured in the experiments with the early and extremely early-maturing cultivars, respectively (Resende and Duarte, 2007; Cargnelutti Filho and Storck, 2009). Furthermore, the coefficient of variation (CV) was low for all the variables measured in the two experiments (CV ≤ 13.32%).
The effect of cultivar was significant for all the variables measured in the two experiments (Table 2), which indicated that there was genetic variability among the early and extremely early-maturing cultivars for PROD and for the grain CP and amino acid levels. Considering that the assumptions of the model were met, the experimental precisions were high, and there was variability among the cultivars for all the variables, it can be inferred that this database is suitable for genetic diversity studies among cultivars by clustering analysis.
For the experiments with the early and extremely early-maturing cultivars, the PROD averaged 5.00 and 5.92 t?ha−1, respectively, and the CV (experimental coefficient of variation) was 13.32% and 10.64% (Table 2). Despite the higher CV for the PROD in relation to the other variables, the SA was 0.94 for the two experiments, which indicated quite high experimental precision in the measurement of PROD (Resende and Duarte, 2007; Cargnelutti Filho and Storck, 2009). The PROD ranged between 2.7581 and 7.1936 t?ha−1 for the early-maturing cultivars (Table 3) and between 3.8391 and 7.8526 t?ha−1 for the extremely early-maturing cultivars (Table 4). These results indicated wide variability in the PROD values of the cultivars evaluated in both experiments.
The mean CP content for the early and extremely early-maturing cultivars was 7.54 and 7.31%, respectively (Table 2). The values ranged between 7.0467% and 8.5667% for the early-maturing cultivars (Table 3) and between 6.7600% and 7.9933% for the extremely early-maturing cultivars (Table 4). In both experiments, the experimental precision was high (SA = 0.78 and 0.87) according to the criteria of Resende and Duarte (2007) and Cargnelutti Filho and Storck (2009) (Table 2).
In the experiment with the early-maturing cultivars, the mean % RM value for each amino acid was 0.21 for Lys, 0.14 for Met, 0.16 for Cys, 0.21 for Thr, 0.05 for Trp, 0.30 for Val, 0.18 for Ile, 0.79 for Leu, 0.27 for Phe, 0.19 for His and 0.33 for Arg. In the experiment with the extremely early-maturing cultivars, the mean % RM value for each amino acid was 0.22 for Lys, 0.15 for Met, 0.16 for Cys, 0.22 for Thr, 0.05 for Trp, 0.31 for Val, 0.19 for Ile, 0.84 for Leu, 0.29 for Phe, 0.19 for His and 0.33 for Arg (Table 2). Thus, the mean percentage for each amino acid was found to be similar between the early and extremely early-maturing cultivars. However, these values fluctuated widely (variability) among the early-maturing cultivars (Table 3) and among the extremely early-maturing cultivars (Table 4).
The CV values associated with the amino acid measurements ranged from 2.37 to 6.49% in the early-maturing cultivars, resulting from the Met and Trp variables, respectively, and from 2.67 to 8.40% in the extremely early-maturing cultivars, resulting from the Arg and Trp variables, respectively (Table 2). The SA in the experiment with the early-maturing cultivars ranged between 0.90 and 0.96, indicating that the experimental precision was quite high for all the variables. In the experiment with the extremely early-maturing cultivars, the SA varied between 0.86 for the Met variable and 0.97 for the His variable, indicating that for most of the amino acids (90.9%), the experimental precision was quite high according to the criteria established by Resende and Duarte (2007) and Cargnelutti Filho and Storck (2009).
From the Scott-Knott means comparison test, two or more groupings of the cultivars were obtained for all the variables in both experiments (Tables 3 and 4). These results confirmed the above inference that there was genetic variability among the cultivars of the same plant maturity class for PROD and for grain CP and amino acid levels. Thus, it is possible to investigate the genetic variability among cultivars through cluster analysis.
Diagnosis of multicollinearity and elimination of variables
The diagnosis of multicollinearity based on the matrices of genetic correlations between the 13 variables resulted in a CN of 40,261 and 2,220, respectively, for the early and extremely early-maturing cultivars. Therefore, in both experiments, multicollinearity was severe according to the criteria of Montgomery and Peck (1982), and clustering analysis is not recommended under this condition because the multicollinear variables contribute greater weight in the clustering process (Cruz and Carneiro, 2006). With the elimination of the highly correlated variables (Cys, Thr, Trp, Ile, Phe, His and Arg for the early-maturing cultivars and Thr, Trp, Val, Ile, Phe, His and Arg for the extremely early-maturing cultivars), the CN was reduced to 86 and 93, respectively, which indicated weak collinearity and therefore enabled the clustering analysis to proceed satisfactorily.
Analysis and validation of clusters
In both experiments, the dendrogram was generated by the average linkage between groups method of grouping (UPGMA) using D2 as a measure of dissimilarity. For the 36 early-maturing cultivars (Figure 3A), five groupings were obtained based on the PROD, CP, Lys, Met, Val and Leu variables, using 50% as the criterion of dissimilarity. The number of cultivars in Groups 1 through 5 was 21, 10, 1, 3 and 1, respectively. The 22 extremely early-maturing cultivars were divided into four groups (Figure 2B) based on the PROD, CP, Lys, Met, Cys and Leu variables and using 50% dissimilarity as the criterion for defining the groups. The number of cultivars in Groups 1 through 4 was 13, 2, 6 and 1, respectively. The CCCs of 0.67 for the early-maturing cultivar groups (Figure 2A) and of 0.58 for the extremely early-maturing cultivar groups (Figures 2B), were statistically significant, which indicated good relationships between the D2 matrix and the graphical distance matrix and, consequently, good group consistency.
The means of the six variables used in the clustering of the early-maturing cultivars and of the six variables used in the clustering of the extremely early-maturing cultivars differed between the groups, which revealed contrasting groups (Table 5). For the early-maturing cultivars, Groups 2 and 3 had the highest mean PROD, followed by Groups 1, 4 and 5. Group 5 had the highest mean CP. The highest mean Lys values were observed in Groups 3, 4 and 5, followed by Groups 1 and 2. Met showed the highest means in groups 3 and 4, followed by Group 1 and then Groups 2 and 5. Group 4 showed the highest mean Val value and Group 2, the lowest. For Leu, Groups 4 and 5 showed the highest mean values, followed by Group 1 and then Groups 2 and 3.
For the extremely early-maturing cultivars, group 2 had the highest mean PROD, followed by Group 1 and then Groups 3 and 4 (Table 5). The mean CP value was highest in Group 4, followed by Groups 1 and 3, and was lowest in Group 2. The mean Lys value was highest in Groups 4 and 5, followed by Group 1, and was lowest in Group 2. For the Met, Cys and Leu variables, the highest means were in Group 4, followed by Group 3, then Group 1 and finally Group 2, with the lowest mean.
In the early-maturing (PROD, CP, Lys, Met, Val and Leu variables) and extremely early-maturing (PROD, CP, Lys, Met, Leu and Cys variables) cultivars after the completion of the MANOVA using Wilk's criterion, the group mean vectors were found to differ. In the experiment with the early-maturing cultivars, considering the results from the comparison tests of the group means (Table 5) and the vectors of the means (Figure 3A), Group 2 was found to have higher PROD values and, in general, the smallest values of CP, Lys, Met, Val and Leu compared with the other groups. In addition, Groups 4 and 5 showed lower mean PROD and, generally, higher mean CP, Lys, Met, Val and Leu values.
For the experiment with the extremely early-maturing cultivars, Group 2, formed by the cultivars with higher PROD values, was found to have the lowest CP and amino acid levels (Figure 3B). By contrast, Group 4, formed by the cultivars with lower PROD values, showed higher CP, Lys, Met, Leu and Cys levels. Behavior similar to that of Group 4 was also observed for Group 3.
However, the cultivars of Group 1 showed intermediate values for all the variables measured. Thus, based on the results of the experiment with the early-maturing cultivars (Table 4 and Figure 3A) and, especially, the results of the experiment with the extremely early-maturing cultivars (Table 4 and Figure 3B), as the PROD of a cultivar group increased, the group protein and essential amino acid levels declined.
Assumptions and analysis of variance
In both experiments, the high proportion of variables with normally distributed errors and the homogeneity observed among the residual variances for all the variables indicated that the database was suitable for performing the analysis of variance and complementary studies. The analysis of variance based on the 13 measured variables showed that the early and extremely early-maturing cultivars were genetically variable. The existence of genetic variability is essential in a breeding program, as highlighted by Vilela et al. (2008), and is a requirement for performing clustering analysis. Furthermore, the experimental precision based on the SA values was found to be high or extremely high in the present study. Accordingly, Cargnelutti Filho and Storck (2009) noted that SA is more suitable than the coefficient of variation (CV) for the verification of experimental precision in genetic studies. In addition, according to Resende and Duarte (2007), an SA value exceeding 0.90 (exceptionally high experimental precision) would be ideal for safer statistical inference.
The Scott-Knott means comparison test confirmed the existence of genetic variability, forming two or more distinct groupings of cultivars for all the variables measured (Tables 3 and 4). The mean PROD of the early and extremely early-maturing cultivars was 5.00 and 5.92 t?ha−1, respectively, which surpassed the 4.46 t?ha−1 mean productivity of the state of Rio Grande do Sul, Brazil for the 2009/2010 crop (CONAB, 2010). However, some of the 36 early maturing and 22 extremely early-maturing cultivars showed PROD values that were below the state average. The variations occurred for each variable in both early and extremely early maturing cultivars are related to the genetic characteristics of each cultivar, once all the genotypes were exposed to similar growing conditions, indicating the existence of variability. According to Zhu and Khan (2001), the genotype, the environment and the interaction between these factors influence the protein and amino acid composition in grain. Although, the protein content of corn is less expressive than its carbohydrates content, variations in the content and amino acid composition of this nutrient can cause significant effects on animal metabolism, which uses this cereal as a major ingredient of the diet.
CP is the major variable in the chemical composition of food to be determined to adequately balance animal feed (Vieira et al., 2007). In the experiments with the early-maturing and extremely early-maturing corn cultivars, the grain CP content averaged 7.54 and 7.31% RM, respectively. These values were lower than those observed by Has et al. (2010), which ranged from 11.2 to 15.6%, and also lower than those described by Prasanna et al. (2001), which ranged from 8.9 to 10.2%. Aliu et al. (2012) found CP values between 11.02 and 13.02%. However, Vieira et al. (2007) reported levels between 6.73% and 10.04%, similar to those obtained in the present study. Moore et al. (2008) evaluated six maize hybrids, found an average value PB of 5.16%, lower than those observed in this study. Although these results suggested that the cultivars used in the present study, which are available for cultivation in Rio Grande do Sul, have low CP contents, some studies have shown that the grain CP content varies with the level of N fertilizer, making it difficult to obtain convergent values (Shewry, 2007). Furthermore, Singh et al. (2005) determined the variability in corn protein by near-infrared transmission and observed CP values between 5.7 and 11.0%, with the highest protein content in plots who received 202 kg ha-1 of N, close to those used in this study, of 200 kg?ha-1 of N. The protein quality of corn grains depends on their amino acid profile. In this sense, the corn grains were deficient in essential amino acids, such as Lys, Met and Trp (Sofi et al., 2009). The mean values of amino acids obtained in the present study were lower than the levels determined by Abou-Deif et al. (2012), except for Ile, Cys and Trp, which were not quantified by the authors. In a study by Moore et al. (2008), were obtained lower amino acid values that in the present study, with the exception of TRP, ILE and FEN which presented similar values in early genotypes and TRP and ILE in superprecoce genotypes. Values higher than in the present study were reported by Lovatto et al. (2006) for the amino acids LIS, MET, THR, TRP, VAL, IT, READ, FEN, HIS and ARG. Although the corn kernels are use essentially as energetic ingredients in feed, variations in its protein content as well as in its composition in its amino acid profile, can cause significant effects on animal metabolism, which have this cereal as the majority of their diet.
Diagnosis of multicollinearity and elimination of variables
In clustering analysis, the diagnosis of multicollinearity is a basic assumption to be met because the variables will be weighted more heavily in the presence of multicollinearity (Barroso and Artes, 2003; Hair et al., 2009). The degree of multicollinearity of the matrix can be established based on the condition number, which is the ratio between the higher and the smallest eigenvalue of the matrix. Thus, when the number of conditions is less than 100, multicollinearity is weak; between 100 and 1000, multicollinearity is moderate to heavy, and when it is larger than 1000, multicollinearity is severe (Montgomery and Peck, 1982). In the case of the moderately or severe multicollinearity, proceeded to elimination of highly correlated variables.
In the study developed by Alves et al. (2014), was also observed severe multicollinearity among the ethereal extract and nitrogen-corrected apparent metabolizable energy variables in the extremely early-maturing cultivars. In a clustering analysis of corn crops, Cargnelutti Filho and Guadagnin (2011) performed a diagnosis of multicollinearity. These authors found that it was not necessary to discard variables because those used showed a low degree of multicollinearity, whereas certain variables were discarded in the present study because they created multicollinearity.
Analysis and validation of clusters
After variability among the cultivars was verified, the study of genetic diversity in corn was initiated based on a hierarchical method, enabling the classification of cultivars into groups. The formation of groups using hierarchical methods was also utilized by Osorno and Carena (2008), Silva et al. (2009), Subramanian and Subbaraman (2010), Oliboni et al. (2012) and Alves et al. (2014). In clustering analysis, the average linkage between groups hierarchical method (UPGMA), which was considered to be usual and appropriate by Mohammadi et al. (2003) and Cargnelutti Filho and Guadagnin (2011), was used. In the early and extremely early-maturing corn cultivars examined in the present study, five and four groupings, respectively were formed. According to Osorno and Carena (2008), the clustering analysis by UPGMA method showed the presence of five diversity groups based on their breeding origin. Osorno and Carena (2008), used ten early maturity maize improved populations, in a diallel mating design without reciprocals, based on grain quality traits (percentage of protein, oil, and starch) and agronomic traits. They also highlighted that a preliminary classification of the genotypes into groups of genetic diversity could help decide which combinations are the best.
The high and significant values of the CCCs indicated that the dendrograms of the two experiments reported here reliably expressed the similarity of the cultivars within each group and the genetic differences between the groups.
In each experiment, the analysis of variance, the tests to compare means and the cluster analysis indicated genetic variability, in early and extremely early maturing corn cultivars. In the study developed by Kil et al. (2014), significant differences in the coefficients of ileal apparent digestibility of CP and AA (arginine, histidine, lysine, tryptophan, aspartic acid, cysteine, glycine, and proline) among corn sources from different origins were observed. Already Alves et al. (2014), using cluster analysis to assess the variability of grain productivity and energy profile of maize cultivars, observed that groups with more productive genotypes presented lower ethereal extract and higher amylase in grain from early and extremely early-maturing corn cultivars.
Finally, the present study found that the higher the PROD of a particular group of cultivars, the lower the protein and amino acid levels were in these cultivars. This inverse relationship has been described by Duvick (2005), Idikut et al. (2009) and Aliu et al. (2012). Therefore, it is necessary for researchers involved in corn genetic breeding programs to be attentive to the relationship between PROD and the nutritional composition of the grains. It is important to minimize the reduction of nutrient levels in cultivars of high productive potential or seek the development of cultivars with highly nutritious grain without significantly compromising the productivity of the cultivar.
There was genetic variability for PROD and for grain CP and amino acid levels in the early and extremely early maturing corn cultivars. For the early and extremely early maturing corn cultivars, respectively, five and four groupings were formed based on the PROD, CP content and amino acid profile, indicating the possibility of crossbreeding these cultivars to improve grain protein content and amino acid profiles. Groupings of cultivars with higher grain yields generally exhibited lower CP and amino acid levels.
The authors have not declared any conflict of interest.
The authors thank the Brazilian National Council for Scientific and Technological Development (CNPq) and the Coordination for the Improvement of Higher Level Personnel (CAPES), for granting scholarships to the authors. Mr. Washington Neves of the Adisseo Brasil Company who kindly conducted the laboratory tests, Student fellows and volunteers who assisted in data collection were also acknowledged.
REFERENCES
Abou-Deif MH, Mekki BB, Mostafa EAH, Esmail RM, Khattab SAM (2012). The genetic relationship between proteins, oil and grain yield in some maize hybrids. World J. Agric. Sci. 8:43-50. |
|
Aliu S, Rusinovci I, Fetahu S, Simeonovska E (2012). Genetic diversity and correlation estimates for grain yield and quality traits in Kosovo local maize (Zea mays L.) populations. Acta Agric. Slov. 99:121-128.
Crossref |
|
Alves BM, Cargnelutti Filho A, Toebe M, Burin C, Silva LP (2014). Variability of grain productivity and energy profile of maize (Zea mays L.) genotypes. J. Cereal Sci. 60:164-171.
Crossref |
|
Ayres M, Ayres Jr M, Ayres DL, Santos AS (2007). BioEstat: aplicações estatísticas nas áreas das ciências biomédicas. Belém: Sociedade Civil Mamirauá 364 pp. |
|
Barroso LP, Artes R (2003). Análise multivariada. Lavras: UFLA, 151 pp. |
|
Campos H (1983). Estatística experimental não-paramétrica. 4 ed. Piracicaba: Departamento de Matemática e Estatística - ESALQ, 349 pp. |
|
Cargnelutti Filho A, Guadagnin JP (2011). Consistência do padrão de agrupamento de cultivares de milho. Cien. Rural. 41:1503-1508.
Crossref |
|
Cargnelutti Filho A, Storck L (2009). Medidas do grau de precisão experimental em ensaios de competição de cultivares de milho. Pesq. agropec. Bras. 44:111-117. |
|
CONAB (Companhia Nacional de Abastecimento) (2010). Acompanhamento da Safra Brasileira de Grãos 2009/2010 – Sexto Levantamento, Março/2010/ Companhia Nacional de Abastecimento. Brasília - DF: CONAB, 2010. |
|
Cruz CD (1990). Aplicação de algumas técnicas multivariadas no melhoramento de plantas. Tese de doutorado. Escola Superior de Agricultura Luiz de Queiroz, Piracicaba 188 pp. |
|
Cruz CD (2013). GENES - a software package for analysis in experimental statistics and quantitative genetics. Acta Sci. Agron. 35:271-276.
Crossref |
|
Cruz CD, Carneiro PCS (2006). Modelos biométricos aplicados ao melhoramento genético.2.ed, v.2. Viçosa: UFV, 585 pp. |
|
Duvick DN (2005). The contribution of breeding to yield advances in maize (Zea mays L.). Adv Agron. 86:83-145.
Crossref |
|
EMBRAPA - Empresa Brasileira De Pesquisa Agropecuária (2006). Centro Nacional de Pesquisa de Solos. Sistema Brasileiro de Classificação de Solos. Brasília: Embrapa produção de informação, Rio de Janeiro: Embrapa Solos, 306 pp. |
|
Fancelli AL, Dourado-Neto D (2004). Produção de Milho. Guaíba: 2° Ed. Agropecuária, 360 pp. |
|
Flint-Garcia SA, Bodnar AL, Scott MP (2009). Wide variability in kernel composition, seed characteristics, and zein profiles among diverse maize inbreds, landraces, and teosinte. Theor. Appl Genet. 119:1129-1142.
Crossref |
|
Hair JF, Black WC, Babin BJ, Anderson RE (2009). Multivariate Data Analysis. 7 ed. Prentice Hall, 816 pp. |
|
Has V, Has I, Pamfil D, Copandean A (2010). Characterization of turda" maize germplasm for the chemical composition of the grain. Rom. Agric. Res. 27:60-67. |
|
Idikut L, Atalay AI, Kara SN, Kamalak A (2009). Effect of hybrid on starch, protein and yields of maize grain. J. Anim. Vet. Adv. 8:1945-1947. |
|
Jompuk C, Cheuchart P, Jompuk P, Apisitwanich S (2011). Improved tryptophan content in maize with opaque-2 gene using marker assisted selection (MAS) in backcross and selfing generations. Kasetsart J. Nat. Sci. 45:666-674. |
|
Kil DY, Park CS, Son AR, Ji SY, Kim BG (2014). Digestibility of crude protein and amino acids in corn grains from different origins for pigs. Anim. Feed Sci. Technol. 196:68-75.
Crossref |
|
Lawrence TLJ (1972). A review of some aspects on health and performance of variations in the physical form of the diet of the growing pig. I. Cereal processing. Vet. Rec. 91:67-70. |
|
Lovatto PA, Mallmann CA, Ceccantini M, Hauschild L, Storck L, Dilkin P (2006). Relations entre les caractéristiques agronomiques, nutritionnelles et mycotoxicologiques de différents hybrides de maïs. J. Recherche Porcine 38:163-170. |
|
Mittelmann A, Miranda Filho JB, Nass LL, Lima GJMM, Hara-Klein C, Silva RM (2011). Quantitative variation for grain quality in Brazilian maize populations. Sci. Agric. 68:50-56.
Crossref |
|
Mohammadi SA, Prasanna BM (2003). Analysis of genetic diversity in crop plants - Salient statistical tools and considerations. Crop Sci. 43:1235-1248.
Crossref |
|
Montgomery DC, Peck EA (1982). Introduction to linear regression analysis. New York: John Wiley & Sons 504 pp. |
|
Moore SM, Stalder KJ, Beitz DC, Stahl CH, Fithian WA, Bregendahl, K (2008). The correlation of chemical and physical corn kernel traits with production performance in broiler chickens and laying hens. Poult. Sci. 87:665–676.
Crossref |
|
Mwololo JK, Mugo S, Tefera T, Munyiri SW (2013). Evaluation of traits of resistance to postharvest insect pests in tropical maize. Int. J. Agric. Crop Sci. 6:926-933. |
|
Oliboni R, Faria MV, Neumann M, Battistelli GM, Tegoni RG, Resende JTV (2012). Genetic divergence among maize hybrids and correlations with heterosis and combining ability. Acta Sci. Agron. 34:37-44.
Crossref |
|
Osorno JM, Carena MJ (2008). Creating groups of maize genetic diversity for grain quality: implications for breeding. Maydica 53:131-141. |
|
Pollak LM, Scott MP (2005). Breeding for grain quality traits. Maydica 50:247-257. |
|
Prasanna BM, Vasal SK, Kassahun B, Singh NN (2001). Quality protein maize. Curr. Sci. 81:1308-1319. |
|
Resende MDV, Duarte JB (2007). Precisão e controle de qualidade em experimentos de avaliação de cultivares. Pesqui. Agropecu. Trop. 37:182-194. |
|
Scott AJ, Knott M (1974). A cluster analysis method for grouping means in the analysis of variance. Biometrics 30:507-512.
Crossref |
|
Shewry PR (2007). Improving the protein content and composition of cereal grain. J. Cereal Sci. 46:239-250. |
|
Silva TA, Pinto RJB, Scapim CA, Mangolin CA, Machado MFPS (2009). Genetic divergence in popcorn genotypes using microsatellites in bulk genomic DNA. Crop Breed. Appl. Biotechnol. 9:31-36.
Crossref |
|
Singh D (1981). The relative importance of characters affecting divergence. Indian J. Genet. Plant Breed. 41:237-245. |
|
Singh M, Paulsen MR, Tian L, Yao H (2005). Site-specific study of corn protein, oil, and extractable starch variability using NIT spectroscopy. Appl. Engine. Agric. 21:239-251.
Crossref |
|
Sofi PA, Wani SA, Rather AG, Wani SH (2009). Review article: Quality protein maize (QPM): Genetic manipulation for the nutritional fortification of maize. J. Plant Breed. Crop Sci. 1:244-253. |
|
Steel RGD, Torrie JH, Dickey DA (1997). Principles and procedures of statistics a biometrical approach, 3rd ed. McGraw-Hill, Nova York. |
|
Storck L, Garcia DC, Lopes SJ, Estefanel V (2011). Experimentação vegetal. Santa Maria: UFSM. 200 pp. |
|
Subramanian A, Subbaraman N (2010). Hierarchical cluster analysis of genetic diversity in Maize germplasm. Electron J. Plant Breed. 1:431-436. |
|
Vieira RO, Rodrigues PB, Freitas RTF, Nascimento GAJN, Silva ELS, Hespanhol R (2007). Composição química e energia metabolizável de híbridos de milho para frangos de corte. R. Bras. Zootec. 36:832-838.
Crossref |
|
Vilela FO, Júnior ATA, Pereira MG, Scapim CA, Viana AP, Júnior SPF (2008). Effect of recurrent selection on the genetic variability of the UNB-2U popcorn population using RAPD markers. Acta Sci. Agron. 30:25-30. |
|
Zhu J, Khan K (2001). Effects of genotype and environment on glutenin polymers and breadmaking quality. Cereal Chem. 78:125-130.
Crossref |