This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
1. Introduction
In agricultural experiments, control of experimental error is often important to improve research precision. In the field, the existence of systematic spatial variability attributable to the physical-chemical properties of the soil is common, for example, that associated with gradients due to slope, fertility, or humidity that are easily recognized in the field. The sources of this variability are very diverse, so in these situations, it requires an adequate experimental design and data analysis, where blocking and its analysis of variance continue to be the important techniques to consider this variability. The European Journal of Agronomy (2012) published an article related to the review of assumptions of analysis of variance (AOV) in the agronomic context, revealing a large number of inappropriate applications due to the few times they review the assumptions of the technique, implying the revision of the assumption of independence [1].
There are several investigations that generate data that can show dependency structures, for example, those that involve designs in repeated measures, time series data, hierarchical or grouped studies, and the one that we consider in the current investigation that could be associated with the study of the physical or chemical properties of the soil, of which multiple studies have been accomplished showing clear structures of spatial dependence in variables such as apparent electrical conductivity, soil organic carbon content, texture, pH, moisture, apparent density, porosity, and mineral content of soils [2, 3]. This spatial variability caused Fisher [4] almost 100 years ago to address the issue of blocking in agricultural experimentation to minimize the systematic error caused by the inherent variability of the soils or environments where the experiments were conducted.
Despite being a fairly well-known technique, some benefits in data analysis and their role in the assumption of independence of the residuals in the models are still unknown or cursorily considered. The great diversity of blocking modalities, such as complete, incomplete, generalized, fixed, random, solvable, and nested blocking, may not be adequately treated, thus rendering AOV tables not only incomplete but also erroneous. This could include, for example, when there is a generalized blocking where the interaction that could occur between blocks and factors is usually omitted [5–8].
The omission of the role of blocking as a factor for attenuation or suppression of spatial dependence can complicate data analysis. Gotway and Cressie [9] first showed a spatial AOV to consider dependence, pointing out that in the presence of spatial autocorrelation, the usual tests for the presence of treatment effects may no longer be valid. This is where blocking is important as it can eliminate spatial dependency on error terms. This assumption is part of the methods usually considered in the basic courses of experimental design or standard linear models [5, 10], specifically in the design models [11] since in advanced courses techniques to deal with this dependency are usually taught [12].
In this research, multiple AOV were made by Monte Carlo simulation for both a simple factorial design in a completely randomized arrangement (CRD) and a randomized complete block with one observation per cell (RCB). Spatial dependence was induced, and each plot was assumed as an experimental unit and as a response to any indicator of the yield of a crop. In addition, a factor with three levels was considered, and it was blocked at 10 levels for reasons of a hypothetical gradient of some soil property. The imposition of spatial dependency was done in order to note the effect of considering the blocking in the modeling process when comparing the models with and without blocking. The blocking efficiency was evaluated with a new strategy much more convenient to demonstrate the suppressive effect of spatial dependence using the blocking efficiency statistic, since the simulations evidenced a lower probability of suggesting the elimination of the blocking effect due to the misuse of the
2. Materials and Methods
On the same data set, two linear models with a single factor were fitted, which differed by the incorporation of the effect of the blocks to the CRD model, thus generating the RCB model, i.e., the randomized block model with one observation per cell. In general, the matrix form of both models is written as
For the management of spatial information, artificial coordinates were created associated with the distribution of the plots in both rows and columns. The center of the plot was used as a representative coordinate to create the distance matrix of all the neighbors and thus generate the spatial weight matrix (
In order not only to work with synthetic data, several databases of recognized experiments available in the R agridat library [22] were used to show the mitigating or suppressing effect of dependency, specifically the bases: rothamsted.brussels, mead.strawberry, federer.tobacco, and rothamsted.oats, where RCB designs were run. In our case, we ran it with the absence and presence of blocking to see the effect of blocking on those cases and to explain the importance of incorporating other variables in the modeling process to finish correcting the effect of spatial dependence that with only blocking is not possible and manages to correct. It is important to note that the models adjusted in principle are models with blocks, to which the removal of the blocks is done to see their effect, and it is not that they were completely randomized models to which the blocking was done later.
As the analysis of spatial dependence was carried out on the residuals of each model, the dependency relationship on the residuals with the spatial dependence on the response variable was explored, thus generating a result related to this practice. To summarize, the information and scatter diagrams were constructed with the regions associated with the classifications according to the
As the proposal with blocking was used as a spatial dependence attenuator or suppressor and since it is known there is no valid test for the effect of blocking, the proposal of the
2.1. Pseudocode for the Generation of Statistical Analysis
Next, two algorithms were presented in pseudocode, the first (Algorithm 1) to calculate the tests of normality and homoscedasticity of residuals, as well as for the calculation of the MI, and it was associated with the
Algorithm 1: Pseudocode for calculating normality and homoscedasticity tests.
Function tests( r )
Start
pvSW = pvalue( ShapiroWilks_NormalityTest( r ) )
pvSF = pvalue( ShapiroFrancia_NormalityTest( r ) )
pvB = pvalue( Bartlett_HomoscedasticityTest( r ) )
pvFK = pvalue( FlignerKilleen_HomoscedasticityTest( r ) )
MI = MoranIndex( r, W )
pvMI = pvalue( MI )
Return [pvSW, pvSW, pvSF, pvAD, pvB, pvL, pvFk, MI, pvMI]
End
Algorithm 2: Pseudocode to form the structure of the adjusted models.
Var
Trt: vector [1 ... 3] #treatments
Block: vector [1… 10] #blocks
Nsim: #simulations
Start
Solution: matrix [1… Nsim, 1… 20]
For i =1 to Nsim
Resp = vector [1… 30] ~ N (μ =300, σ =20)
Resp = PartialSpatialSorted (Resp)
Mod1 = AnalysisOfVariance (Resp ~ Trt)
Mod2 = AnalysisOfVariance (Resp ~ Bloq + Trt)
H = BlockEfficiency (Mod2)
Solution [i, 1… 9] = tests (Residuals (Mod1))
Solution [i, 10… 18] = tests (Residuals (Mod2))
Solution [i, 19] = H
Solution [i, 20] = pvalue (block (Mod2))
End For
End
3. Results and Discussion
The arrangements of the treatments and blocks were made in the perpendicular direction of the gradient of the soil property (indicated by the arrow in Figure 1). The gradient was indicated from a lower value
[figure omitted; refer to PDF]
The CRD design models were adjusted:
[figure omitted; refer to PDF]
Figure 3 shows the distribution of the
[figure omitted; refer to PDF]
Figure 2 shows the counts and the percentage distribution of the rankings in the four regions in Figure 3. As mentioned before, region D was the most important in the current research, since it corresponds to 788 simulations of 1000 carried out (78.8%) with the effect of interest, namely, reduce or eliminate from a statistical point of view the spatial dependence, thus reaching the fulfillment of one of the important assumptions in the modeling with AOV in the basic experimental designs where independence is assumed in the residuals, which in this case is considered potentially attributable to the gradients of some soil properties that are considered spatial in nature, as shown in various studies [27, 28]. If instead of
Currently, there are many tools available to perform an adequate blocking available for students of basic design courses, such as spectral information using different indices, generated from remote or proximal sensing, which with simple processing can easily define the structure and dimension of the blocks so that the orientation of an important gradient in the modeling process can be simple [29].
MI has undoubtedly proven to be an appropriate measure to assess the spatial dependence of soil properties [30, 31]. Before a spatial inferential analysis, it has become routine to do an exploratory analysis of the variables involved in the modeling process; this usually involves calculating the MI but using the response variable as descriptive statistics prior to spatial modeling or a spatial interpolation process [32]. However, in our research, we found that the application of MI on a response variable in an exploratory way evidencing spatial dependence did not guarantee that this was precisely the result when this variable was incorporated into the modeling, because precisely the effect of the blocking can remove such dependency, something that was not observed before modeling where it had not been blocked.
Figure 4 shows the scatter diagrams of the
[figures omitted; refer to PDF]
Once the positive effect of blocking in the modeling process was evidenced, this result was used to discuss another aspect of interest associated with the
As the
The automatically generated
[figure omitted; refer to PDF]
To give greater validity to the simulations and not to create a confusing effect by suggesting the advantages of blocking in minimizing the effects of spatial dependence, when the results could be due more to the nonfulfillment of other assumptions, it is well known that the departure from normality of residuals and homoscedasticity of treatment variances can lead to nonfulfillment of other assumptions such as independence, for which two tests considered with high power were applied to detect normality of residuals, Shapiro-Wilk (S-W) and Shapiro-Francia (S-F), as well as the homogeneity of variances by power Bartlett (B) and that of Fligner-Kileen (F-K) for being robust to deviations from normality [38–40]. The results were quite satisfactory both in the normality tests (Figure 6) and in the homogeneity of variances (Figure 7).
[figure omitted; refer to PDF]
To validate the simulations performed, we used the databases of the agridat library of R [22] (Table 1), which has available data of a series of real experiments developed in the last decades. This is one of the purposes of this library and is precisely the one that has been given in this research, to be able to contrast the facts with real data with what others could do about the same data set. From these databases, some designs with blocks were selected with the respective spatial coordinates for which the respective weight matrix was calculated and the AOV of the RCB and CRD models were run, from which the residual vectors were extracted and the test associated with Moran’s index was used to test the hypothesis of independence of the residuals. In all cases, the
Table 1
Database | MI-CRD ( | MI-RCB ( |
Rothamsted.brussels | 0.47 | |
Mead.strawberry | 0.47 | |
Rothamsted.oats | 0.48 | |
Federer.tobacco |
The data from real and simulated experiments show the advantages of blocking in the suppression or mitigation of spatial dependence, and above all, the use of the
4. Conclusions
The benefits of blocking must be weighed against any consideration, including the reduction of degrees of freedom for error, since, as demonstrated by the simulations, it had a positive effect on mitigating and even eliminating the effect of spatial dependence that might be present in some response variables associated with some edaphic property. Undoubtedly, although blocking apparently did not seem desirable from the point of view of the
Appropriate blocking by considering some edaphic property or by spatial zoning can make standard linear modeling under the assumption of independence of the residuals of a design model meeting the assumptions for using analysis of variance as a statistical technique to compare the effects of treatments in the presence of blocks.
The strategy presented is appropriate for reviewing another important assumption in modeling using analysis of variance, since a result that has demonstrated efficient blocking may indirectly fit the data to meet the assumption of independence in the residuals and thus may be making the technique valid.
Acknowledgments
The authors are very thankful to Sinha Surendra Prasad, professor at the Universidad de los Andes in Venezuela. The funding came from the employer Universidad Nacional de Colombia.
[1] M. Acutis, B. Scaglia, R. Confalonieri, "Perfunctory analysis of variance in agronomy, and its consequences in experimental results interpretation," European Journal of Agronomy, vol. 43, pp. 129-135, DOI: 10.1016/j.eja.2012.06.006, 2012.
[2] K. K. Mann, A. W. Schumann, T. A. Obreza, M. Teplitski, W. G. Harris, J. B. Sartain, "Spatial variability of soil chemical and biological properties in Florida citrus production," Soil Science Society of America Journal, vol. 75 no. 5, pp. 1863-1873, DOI: 10.2136/sssaj2010.0358, 2011.
[3] M. A. Wani, N. Shaista, Z. M. Wani, "Spatial variability of some chemical and physical soil properties in Bandipora district of Lesser Himalayas," Journal of the Indian Society of Remote Sensing, vol. 45 no. 4, pp. 611-620, DOI: 10.1007/s12524-016-0624-z, 2017.
[4] R. A. Fisher, "The arrangement of field experiments," Journal of the Mtinistry of Agricuilture, vol. 33, pp. 503-513, 1926.
[5] J. P. Queen, G. P. Quinn, M. J. Keough, Experimental Design and Data Analysis for Biologists, 2002.
[6] D. Raghavarao, L. V. Padgett, Block designs: analysis, combinatorics and applications, vol. 17,DOI: 10.1142/5846, 2005.
[7] A. Dean, M. Morris, J. Stufken, D. Bingham, Handbook of Design and Analysis of Experiments, vol. 7, 2015.
[8] J. Lawson, Design and Analysis of Experiments with R, vol. 115, 2014.
[9] C. A. Gotway, N. A. C. Cressie, "A spatial analysis of variance applied to soil-water infiltration," Water Resources Research, vol. 26 no. 11, pp. 2695-2703, DOI: 10.1029/WR026i011p02695, 1990.
[10] M. D. Casler, "Fundamentals of experimental design: guidelines for designing successful experiments," Agronomy Journal, vol. 107 no. 2, pp. 692-705, DOI: 10.2134/agronj2013.0114, 2013.
[11] F. A. Graybill, Theory and Application of the Linear Model, 1976.
[12] M. A. Islam, R. I. Chowdhury, M. A. Islam, R. I. Chowdhury, "Covariate–dependent Markov models," Analysis of Repeated Measures Data, pp. 51-66, DOI: 10.1007/978-981-10-3794-8_5, 2017.
[13] R. Mead, S. G. Gilmour, A. Mead, Statistical Principles for the Design of Experiments: Applications to Real Experiments, vol. 36,DOI: 10.1017/CBO9781139020879, 2012.
[14] D. C. Montgomery, Introduction to Statistical Quality Control, 2012.
[15] R. Christensen, Plane Answers to Complex Questions: The Theory of Linear Models,DOI: 10.1007/978-3-030-32097-3, 2020.
[16] A. R. Hoshmand, Design of Experiments for Agriculture and the Natural Sciences Second Edition, 2006.
[17] T. P. Ryan, J. P. Morgan, "Modern experimental design," Journal of Statistical Theory and Practice, vol. 1 no. 3–4, pp. 501-506, DOI: 10.1080/15598608.2007.10411855, 2007.
[18] N. M. Madhyastha, S. Ravi, A. S. Praveena, A First Course in Linear Models and Design of Experiments,DOI: 10.1007/978-981-15-8659-0, 2020.
[19] K. G. Russell, Design of Experiments for Generalized Linear Models,DOI: 10.1201/9780429057489, 2018.
[20] G. Arbia, A Primer for Spatial Econometrics with Applications in R, 2014.
[21] H. Chen, D. L. Reuss, D. L. S. Hung, V. Sick, "A practical guide for using proper orthogonal decomposition in engine research," International Journal of Engine Research, vol. 14 no. 4, pp. 307-319, DOI: 10.1177/1468087412455748, 2013.
[22] K. Wright, "agridat: agricultural datasets," 2021. R package version, 1.18
[23] M. Lentner, J. C. Arnold, K. Hinkelmann, "The efficiency of blocking: how to use ms(blocks)/ms(error) correctly," The American Statistician, vol. 43 no. 2, pp. 106-108, DOI: 10.1080/00031305.1989.10475629, 1989.
[24] D. K. Cassel, O. Wendroth, D. R. Nielsen, "Assessing spatial variability in an agricultural experiment station field: opportunities arising from spatial dependence," Agronomy Journal, vol. 92 no. 4, pp. 706-714, DOI: 10.2134/agronj2000.924706x, 2000.
[25] R. D. Landes, K. M. Eskridge, P. S. Baenziger, D. B. Marx, "Are Spatial Models Needed with Adequately Blocked Field Trials?," Conference on Applied Statistics in Agriculture,DOI: 10.4148/2475-7772.1227, .
[26] A. Venter, M. F. Smith, D. J. Beukes, A. S. Claassens, M. van Meirvenne, "Spatial variation of soil and plant properties and its effects on the statistical design of a field experiment," South African Journal of Plant and Soil, vol. 26 no. 4, pp. 231-236, DOI: 10.1080/02571862.2009.10639960, 2009.
[27] C. C. Tsui, C. C. Tsai, Z. S. Chen, "Soil organic carbon stocks in relation to elevation gradients in volcanic ash soils of Taiwan," Geoderma, vol. 209-210, pp. 119-127, DOI: 10.1016/j.geoderma.2013.06.013, 2013.
[28] H. Rezaei, A. A. Jafarzadeh, A. Alijanpour, F. Shahbazi, K. V. Kamran, "Effect of slope position on soil properties and types along an elevation gradient of Arasbaran forest, Iran," International Journal on Advanced Science, Engineering and Information Technology, vol. 5 no. 6, pp. 449-456, DOI: 10.18517/ijaseit.5.6.589, 2015.
[29] L. Polidori, M. el Hage, "Digital elevation model quality assessment methods: a critical review," Remote Sensing, vol. 12 no. 21, pp. 3522-3536, DOI: 10.3390/rs12213522, 2020.
[30] C. Zhang, L. Luo, W. Xu, V. Ledwith, "Use of local Moran's I and GIS to identify pollution hotspots of Pb in urban soils of Galway, Ireland," Science of the Total Environment, vol. 398 no. 1–3, pp. 212-221, DOI: 10.1016/j.scitotenv.2008.03.011, 2008.
[31] Q. Liu, W. J. Xie, J. B. Xia, "Using semivariogram and Moran’s I techniques to evaluate spatial distribution of soil micronutrients," Communications in Soil Science and Plant Analysis, vol. 44 no. 7, pp. 1182-1192, DOI: 10.1080/00103624.2012.755999, 2013.
[32] R. Awal, M. Safeeq, F. Abbas, S. Fares, S. K. Deb, A. Ahmad, A. Fares, "Soil physical properties spatial variability under long-term no-tillage corn," Agronomy, vol. 9 no. 11,DOI: 10.3390/agronomy9110750, 2019.
[33] N. Lazar, "Ockham's razor," Wiley Interdisciplinary Reviews: Computational Statistics, vol. 2 no. 2, pp. 243-246, DOI: 10.1002/wics.75, 2010.
[34] G. W. Oehlert, "Comparing models: the analysis of variance," A First Course in Design and Analysis of Experiments, pp. 44-52, 2000.
[35] B. Jones, Avoiding Data Pitfalls: How to Steer Clear of Common Blunders When Working with Data and Presenting Analysis and Visualizations, 2019.
[36] M. S. McIntosh, "Can analysis of variance be more significant?," Agronomy Journal, vol. 107 no. 2, pp. 706-717, DOI: 10.2134/agronj14.0177, 2015.
[37] M. Krzywinski, N. Altman, "Analysis of variance and blocking," Nature Methods, vol. 11 no. 7, pp. 699-700, DOI: 10.1038/nmeth.3005, 2014.
[38] H. R. Lindman, Analysis of Variance in Experimental Design, 2012.
[39] J. Gross, U. Ligges, "nortest: tests for normality," 2015. R package version 1.0-4
[40] A. G. Bunn, "A dendrochronology program library in R (dplR)," Dendrochronologia, vol. 26 no. 2, pp. 115-124, DOI: 10.1016/j.dendro.2008.01.002, 2008.
[41] M. O. Grondona, N. Cressie, "Using spatial considerations in the analysis of experiments," Technometrics, vol. 33 no. 4, pp. 381-392, DOI: 10.2307/1269411, 1991.
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer
Copyright © 2021 Aquiles E. Darghan et al. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0/
Abstract
One of the basic principles of experimental design is blocking, which is an important factor in the treatment of the systematic spatial variability that can be found in the edaphic properties where agricultural experiments are conducted. Blocking has a mitigating or suppressing effect on the spatial dependence in the residuals of a model, something desirable in standard linear modeling, specifically in design models. Some computer programs yield a
You have requested "on-the-fly" machine translation of selected content from our databases. This functionality is provided solely for your convenience and is in no way intended to replace human translation. Show full disclaimer
Neither ProQuest nor its licensors make any representations or warranties with respect to the translations. The translations are automatically generated "AS IS" and "AS AVAILABLE" and are not retained in our systems. PROQUEST AND ITS LICENSORS SPECIFICALLY DISCLAIM ANY AND ALL EXPRESS OR IMPLIED WARRANTIES, INCLUDING WITHOUT LIMITATION, ANY WARRANTIES FOR AVAILABILITY, ACCURACY, TIMELINESS, COMPLETENESS, NON-INFRINGMENT, MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Your use of the translations is subject to all use restrictions contained in your Electronic Products License Agreement and by using the translation functionality you agree to forgo any and all claims against ProQuest or its licensors for your use of the translation functionality and any output derived there from. Hide full disclaimer