|
|
||||||||
Dep. of Crop Sciences, 1102 S. Goodwin Ave., Univ. of Illinois, Urbana, IL 61801
* Corresponding author (dbullock{at}uiuc.edu)
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: EC, electrical conductivity GPS, global positioning system MWD, moderately well-drained soil PD, poorly drained soil SWPD, somewhat poorly drained soil
| INTRODUCTION |
|---|
|
|
|---|
One of the disadvantages of discriminant analysis approach to mapping soil properties is that the information on the data locations is not used in development of the predictive model. However, this information can be utilized by geostatistical methods of data analysis to determine spatial structure in data distribution across the studied area. Combining the predictive capabilities of discriminant analysis with those of geostatistics may provide more accurate tools for soil drainage mapping and prediction.
Bell et al. (1992)(1994) incorporated a variety of topographical and parent material data to map drainage classes of a relatively large study area (about 14000 ha) in south-central Pennsylvania with diverse lithology and topography. They found that soil parent material was one of the most important factors in predicting soil drainage classes. The factors that were found to be significant in discriminating between the drainage classes within the same parent material included slope gradient, elevation above local stream, and distances to local streams and drainageways. Soil drainage mapping and prediction for relatively small areas (e.g., agricultural fields or small watersheds) are also of great importance, particularly for farming and site-specific field management. However, the factors affecting soil drainage classes within farm fields might be different from the factors acting at regional scales. Parent materials within a single field are not usually as diverse as in a relatively large area, but at the same time they might be highly spatially variable. Unfortunately, very often it is not possible to define parent material of a particular site with a considerable degree of accuracy without taking soil coresa task that is time-consuming and expensive. Therefore, it would be beneficial to determine other easily measured factors for quantitative prediction of soil drainage on small scales and to estimate potential accuracy of such prediction.
Topography is one of the easily evaluated drainage affecting factors that can greatly facilitate mapping and prediction of soil drainage (Bell et al., 1992; Bell et al., 1994; Troeh, 1964). Another easily measured factor potentially related to drainage is soil EC. It can be measured in the field using recently developed fast and nondestructive methods (Doolittle et al., 1994; Kitchen et al., 1996; Kitchen et al., 1999). Electrical conductivity depends on a number of soil physical properties, including soil salinity, and soil water and clay content (Rhoades et al., 1989). Sheets and Hendrickx (1995) found a linear relationship between the EC measurements and soil profile water content. Williams and Hoey (1987) observed positive correlations between conductivity and soil clay content. Since both soil profile water contents and clay contents are related to the soil drainage properties, it would be reasonable to expect that EC data might be helpful in mapping and predicting soil drainage classes on a field scale basis.
The objectives of this study are (i) to examine the relationships between soil drainage and various topographical factors and to determine the factors that can be most helpful in predicting soil drainage classes on a field scale using the results from a farm field in Central Illinois, (ii) to study possibility of using soil EC data with or without topographical information for predicting soil drainage classes, (iii) to compare accuracy and efficiency of multivariate statistical methods with geostatistical procedures in mapping soil drainage classes on an agricultural field scale.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Soil survey map (Soil Survey of McLean County, Illinois, 1998) characterizes the soils of the field as the Varna series (fine, illitic, mesic Typic Argiudoll), the Drummer series (fine-silty, mixed, mesic Typic Endoaquoll), and the Chenoa series (fine, illitic, mesic Aquic Argiudoll). According to the soil classification of Illinois, soils of Varna series are described as WD and MWD, Drummer soils are described as PD, and Chenoa soils as SWPD (Fehrenbacher et al., 1984). Since soils of Varna series can belong to both WD and MWD drainage classes, for comparing accuracy in predicting drainage classes of soil survey map with that of quantitative methods we used only two drainage classes: WD/MWD (combined WD and MWD sites) and SWPD/PD. Soil series map unites of the studied area are shown in Fig. 1.
Soil EC data were collected in Fall of 1999 using a Veris 3100 sensor cart (Division of Geoprobe Systems, Salina, KS) which operates on the principle of electromagnetic induction. Georeferenced EC measurements were taken every 3 to 5 m with the distance between cart passes about 10 m, resulting in about 6900 EC measurement points for the studied area. Two sets of EC measurements were collected corresponding to depths of
0 to 30 cm (shallow EC) and 0 to 90 cm (deep EC) (Sudduth et al., 1998).
Survey grid global positioning system (GPS) (Leica 500 RTK, Leica, Heerbrugg, Switzerland) was used to measure elevation. About 1500 elevation measurements were taken on a semiregular grid with a mean distance between measurements of
10 m. The map of field elevation ranges is shown in Fig. 1. ArcView Spatial Analyst (Environmental Systems Research Institute, 1996) was used to analyze elevation data and to derive the topographical land features, namely, slope, curvature, and flow accumulation. Detailed description of the derivation procedures is presented by Kravchenko and Bullock (2000). ArcView was also used to calculate the shortest distance from each of the sampling sites to the drainageway that ran in the middle of the studied area (Fig. 1). This distance was used as one of the variables with potential effect on soil drainage class.
Discriminant Analysis
In this study discriminant analysis was used to find a decision rule for separating sites with different drainage classes based on the values of the topographical and EC variables measured at each site. The decision rule was defined based on the measured data, then, the decision rule was applied to predict drainage classes at unsampled sites where measurements of topographical and EC variables were available.
Two combinations of the variables were considered. The first combination included topographical variables, such as elevation, slope, curvature, flow accumulation, and distance to a drainageway. The purpose of this combination was to assist with the first objective of the study, i.e., to determine how accurate drainage class prediction can be if it is based on topographical data only. The second combination of the variables was related to the second objective of the study, namely, analysis of the possibility of using soil EC data with or without topographical information for predicting soil drainage classes. All the topographical and EC variables measured at each site were used in the second combination.
Since meaningful application of discriminant analysis requires data to be normally distributed, we analyzed the normality of all studied variables. Nonnormally distributed variables were transformed using either log-normal or normal score (Goovaerts, 1997) transformations. Stepwise discriminant procedure, STEPDISC (SAS Institute, Cary, NC) was applied to both combinations of variables to select the variables that had significant influence on the soil drainage class. Only variables significant at 0.15 significance level (Bell et al., 1992) were used in further discriminant analysis.
Based on the values of soil and topographical variables combined in a variable measurement vector x, for each data location discriminant analysis calculated the probability that this location belonged to each of the drainage classes, t, called posterior probability,
. Then, the location was classified into the drainage class for which it had the highest posterior probability (Khattree and Naik, 2000):
![]() | [1] |
The highest posterior probability is equivalent to the smallest squared distance from the studied location to the drainage class t, D2t(x), where k is the total number of drainage classes. Subscript t in Eq. [1] refers to a certain drainage class for which the posterior probability is calculated, subscript j indicates that information on all drainage classes is used to calculate posterior probability for each particular drainage class t. Depending on the variance-covariance matrix of the studied combination of variables, the squared distance is calculated as either a linear discriminant function:
![]() | [2] |
![]() | [3] |
is the common variance-covariance matrix in Eq. [2], St is the variance-covariance matrix of drainage class t in Eq. [3], and
t is the prior probability of occurrence of drainage class t in the studied area. In this study, we calculated prior probabilities based on the measured proportions of drainage classes. Based on the number of locations belonging to a certain drainage class, prior probabilities were equal to 0.499, 0.355, and 0.196 for WD, MWD, and SWPD/PD classes, respectively. Discriminant analysis was conducted using DISCRIM procedure (SAS Institute, Cary, NC). Cross-validation was applied to evaluate accuracy of drainage data prediction and classification by discriminant analysis. For cross-validation each value from the data set was eliminated in turn and, then, estimated using information from the rest of the data (Khattree and Naik, 2000). Posterior probabilities of the three drainage classes obtained for each data point were compared, and the site was assigned a drainage class with the highest posterior probability. Percentage of the correct drainage class estimates was used to compare different combinations of variables with their effectiveness in predicting drainage classes.
Effect of the EC and topographical variables in discrimination between the drainage classes was also studied using logistic discrimination (Khattree and Naik, 2000). Unlike discriminant analysis, logistic discrimination is not sensitive to the departures from normality, hence, no data transformation is necessary. We conducted logistic discrimination for each drainage class separately, using constructed indicator variable i(
) and, then, compared the results from all drainage classes. For each of the three drainage classes, drainage class at each data location
was coded using the indicator approach as (Goovaerts, 1997; Khattree and Naik, 2000):
![]() | [4] |
Value of the indicator variable is equivalent to a probability,
, of finding a certain drainage class at a certain location. For a location with known drainage such probability equals to either 1 (site belongs to a certain drainage class) or to 0 (site belongs to any of the other drainage classes). The response variable in the logistic determination is expressed in terms of probability,
, as a logit function, logit(
):
![]() | [5] |
The response variable is assumed to be a linear function of a number of explanatory variables (x1...xn):
![]() | [6] |
) is determined by the unknown parameter ßi. In this study, EC and topography variables are used as the explanatory variables and the relationship expressed in Eq. [6] produces a model for discriminating between the drainage classes. Similarly to the discriminant analysis, the posterior probabilities of the three drainage classes obtained using logistic discrimination were compared and the location was assigned a drainage class with the highest posterior probability value. Stepwise variable selection was used to determine the variables with the highest discriminative effect (P = 0.15) on drainage classes using logistic discrimination procedure, LOGISTIC (SAS Institute, Cary, NC).
Geostatistical Analysis
For geostatistical analysis, soil drainage class, t, was treated as a categorical variable. At each data location this variable assumed only one of the three mutually exclusive possible states, corresponding to either WD, MWD, or SWPD/PD drainage classes. Indicator transformation was applied to each drainage class data (Eq. [4]) resulting in three indicator variables.
Spatial variability of each indicator variable was described by a sample variogram:
![]() | [7] |
and
+ h were sampling locations separated by a distance h, i(
) and i(
+ h) were values of the indicator variable i at the corresponding locations. The sample variograms were fitted with variogram models and accuracy of the fitting was examined using cross-validation approach (Goovaerts, 1997). The model that produced the cross-validation results of the highest accuracy was retained for further analysis (Kravchenko and Bullock, 1999). Variable values at unsampled locations were obtained using kriging procedure based on the variogram model parameters. For an unsampled location, the kriging estimate was equivalent to the probability of finding certain drainage class at this location. Such probability assumes a value between 0 and 1 depending on the variable values from the surrounding sampled locations and on the spatial structure of variable distribution reflected in the variogram model parameters. For each drainage class kriging produces a map of probabilities of finding this drainage class at each particular location. The cross-validation results for kriging and cokriging procedures were further used to compare their accuracy with accuracy of discriminant analysis. Two geostatistical procedures compared in the study included ordinary indicator kriging and soft indicator cokriging (Goovaerts, 1997). Indicator kriging uses the drainage data only, hence, the estimates of the indicator variables depend only on spatial structure of the available drainage data. Cokriging allows us to combine primary drainage data with any available secondary data that are related to drainage, hence, drainage estimates depend not only on distribution of the primary data, but also on spatial variability of the secondary data and their relationship with the primary variable. In case of a strong correlation between primary and secondary variables and when abundantly sampled secondary data complement sparse primary data sampling, cokriging can produce much more accurate and reliable estimation results comparing with ordinary kriging technique. One of the disadvantages of cokriging is that it becomes extremely cumbersome and time-consuming with large number of secondary variables. In this study, we conducted cokriging with one primary variable (indicator transformed drainage class) and two secondary variables. The variables that were found to have a significant effect on drainage class during stepwise discrimination procedure were further used in cokriging estimation. Variograms and crossvariograms for cokriging were modeled using linear model of coregionalization (Deutsch and Journel, 1998). Geostatistical analysis including variogram calculation, cross-validation, kriging, and cokriging was performed using the geostatistical software package GSLIB (Deutsch and Journel, 1998).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
|
|
Distance to drainageway seemed to be a useful parameter in discriminating between WD and MWD drainage classes in this field. Sites with MWD soils were located generally closer to the drainageway than the sites with WD soils (Table 1). Slopes and deep EC values of WD and MWD soils were similar, hence, these parameters were not effective in separating WD and MWD drainage classes using stepwise discriminant procedure.
Further insight in the factors discriminating between soil drainage classes was achieved using logistic discrimination (Table 2). Since logistic regression procedure was conducted separately for each drainage class, a set of variables was obtained for each drainage class that were significant in discriminating between this class and all the other drainage classes. When only topographical variables were included in the logistic discrimination procedure, the variables significant in discriminating between WD soils and other drainage classes were distance to drainageway and flow accumulation. Well drained soils were characterized by lower flow accumulation values than either MWD or SWPD/PD soils. For SWPD/PD soils significant discriminating variables were distance to drainageway and slope. When shallow, and deep EC measurements were added to the topographical data, then, distance to drainageway, flow accumulation and deep EC were found to be significant for separating WD soils. For SWPD/PD sites, both shallow and deep EC measurements along with the distance to drainageway were significant for separating this drainage class from WD and MWD soils. Slope was a significant variable for discriminating between MWD soils and all the other drainage classes, however, the effectiveness of the logistic discrimination was extremely low for MWD soils (Table 2).
|
|
70% of each being correctly estimated based on topography and topography plus EC data. Separation of WD and MWD classes was less successful because of previously reported insignificant differences between slope and deep EC values corresponding to WD and MWD drainage classes. Discriminant analysis with topography and EC slightly improved the estimation accuracy comparing with topography only because of better separation of WD and MWD drainage classes (Table 4).
|
The most substantial difference between discriminant analysis and kriging and cokriging for creating maps pertains to the fact that kriging and cokriging procedures are exact estimators, while discriminant analysis is not. Hence, each of the sampled locations is estimated correctly in the maps produced by kriging and cokriging, while the values in between the sampled locations are estimated using kriging and cokriging procedure. The maps created based on the discriminant analysis ignore the actual data in sampled locations. Hence, indicator kriging and cokriging are preferable comparing with discriminant analysis for creating maps. Drainage maps obtained from (i) discriminant analysis with deep EC and distance to drainageway, (ii) indicator kriging, and (iii) indicator cokriging with slope and distance to drainageway are shown on Fig. 4a, 4b, and 4c , respectively. The data for the maps were obtained on a 11 by 11 m grid. The grid size was selected in correspondence with sampling density of the secondary data, i.e., elevation and EC.
|
| CONCLUSIONS |
|---|
|
|
|---|
70% of sites with WD and SWPD/PD drainage classes. However, much lower accuracy (
45% correct) was achieved in separating sites with MWD soils from WD and SWPD/PD sites. Accuracy of discriminant analysis (6264% correct) in predicting soil drainage class was comparable with that of cokriging (5865% correct), whereas kriging, which does not use secondary information in data estimation, was less accurate (56% correct). However, since kriging and cokriging are exact estimators, they are more effective for mapping purposes than discriminant analysis. Since cokriging procedure becomes time-consuming and cumbersome with a large number of secondary variables, the most advantageous seems to be an approach to drainage class mapping that would combine both discriminant analysis and geostatistics. In this approach the data are, first, analyzed with discriminant analysis and the limited number of variables with significant effect on the drainage classes is selected among all the available topographical and soil variables. Then, the significant variables are used in the cokriging for most accurate mapping of soil drainage classes.
Received for publication March 1, 2001.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
D. L. Shaner, R. Khosla, M. K. Brodahl, G. W. Buchleiter, and H. J. Farahani How Well Does Zone Sampling Based on Soil Electrical Conductivity Maps Represent Soil Variability? Agron. J., September 8, 2008; 100(5): 1472 - 1480. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Tarquis, N. R. A. Bird, A. P. Whitmore, M. C. Cartagena, and Y. Pachepsky Multiscale Entropy-based Analysis of Soil Transect Data Vadose Zone J., May 27, 2008; 7(2): 563 - 569. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. K. Jung, N. R. Kitchen, K. A. Sudduth, and S. H. Anderson Spatial Characteristics of Claypan Soil Properties in an Agricultural Field Soil Sci. Soc. Am. J., June 21, 2006; 70(4): 1387 - 1397. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Guretzky, K. J. Moore, C. L. Burras, and E. C. Brummer Distribution of Legumes along Gradients of Slope and Soil Electrical Conductivity in Pastures Agron. J., March 1, 2004; 96(2): 547 - 555. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. N. Kravchenko, K. D. Thelen, D. G. Bullock, and N. R. Miller Relationship among Crop Grain Yield, Topography, and Soil Electrical Conductivity Studied with Cross-Correlograms Agron. J., September 1, 2003; 95(5): 1132 - 1139. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Crop Science | |||
| Journal of Natural Resources and Life Sciences Education |
Vadose Zone Journal | ||||
| Journal of Plant Registrations | Journal of Environmental Quality |
The Plant Genome | |||