|
|
||||||||
Australian Cotton Cooperative Research Centre, Department of Agricultural Chemistry and Soil Science, Ross St. Building A03, The University of Sydney, NSW 2006, Australia
* Corresponding author (budiman{at}acss.usyd.edu.au)
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: AIC, Akaike's information criterion ANN, artificial neural network MD, mean deviation MR, mean residuals neuro-m, neural network PTF with objective function that matches the measured values neuro-p, neural network PTF with objective function that matches the parameters P<2, mass particles <2 mm P220, mass particles 2 to 20 mm P202000, mass of particles 20 to 2000 mm PTF, pedotransfer function RMSD, root mean squared deviation RMSR, root mean squared of residuals SSR, sum of square residuals
| INTRODUCTION |
|---|
|
|
|---|
In probably the first research of its kind, Bloemen (1977)(1980) derived the relationships between parameters of the Brooks-Corey model and particle-size distribution. This approach is now called parametric PTFs. For the water-retention curve, we can assume that the water content,
, and the potential, h, relationship can be described adequately by a soil hydraulic model that is a closed-form equation with a certain number of parameters, e.g., Brooks and Corey or van Genuchten equation. A parametric approach is usually preferred to single-point regression (predicting water retention at a specific potential), as it yields a continuous function of the
(h) relationship. Water retention at any potential can be estimated. Many soil-water transport models only require the parameters of the soil hydraulic models as inputs, thus the predicted parameters can be used directly to run them.
The usual steps in deriving parametric PTFs are fitting a soil hydraulic model to individual water-retention data, estimating the parameters of the model, and forming empirical relationships between basic soil properties and parameters. The latter step can be achieved by various mathematical methods, e.g., multiple linear regression (Wösten et al., 1995), or artificial neural networks (ANN) (Schaap et al., 1998).
The most widely used soil hydraulic model is the van Genuchten function (van Genuchten, 1980):
![]() | [1] |
r and
s are the residual and saturated water content,
is the scaling parameter, n is the curve shape factor and m is an empirical constant, which can be related to n by m = 1 -
. Attempts have been made to correlate the parameters of the van Genuchten equation (or other soil hydraulic models) to basic soil properties (Vereckeen et al., 1989), however many researchers have encountered difficulties (Tietje and Tapkenhinrichs, 1993; Scheinost et al., 1997). Van den Berg et al. (1997) suggested that this could be caused by interdependency amongst the parameters. The problems are also likely because the model does not always fit the data, and overparametization of the model (too many parameters to fit over a limited water-retention data). Hence the fitted parameters may carry errors and bear no significant physical meaning. To overcome this problem, Van den Berg et al. (1997) suggested the following approach: fit the model to observed data, apply multiple regression analysis to one of the parameters, fit the model again by fixing the parameter calculated from the regression, continue to fit a regression to another parameter and repeat the process until all the parameters are fitted. Alternatively, Scheinost et al. (1997) proposed: set-up the expected relationship between the parameters of the model and soil properties, and then insert the relationship into the model and estimate the parameters of the relationship by fitting the extended model using nonlinear regression.
The extended nonlinear regression method of Scheinost et al. (1997) has been found to provide good estimates of water retention data (Minasny et al., 1999). However there are limitations, relationships between the parameters and basic soil properties have to be formulated first. Furthermore, since we are fitting the whole data set to a single extended version of the van Genuchten function, the predictions regress to a mean curve. This is demonstrated using the data from the study by Minasny et al. (1999). Figure 1a shows the plot of scaled water content Se =
and scaled potential (
h) of the measured data. Where the symbol
represents a parameter estimated from the raw water retention data and
represents a parameter predicted using extended nonlinear regression. If we plot the values predicted from extended nonlinear regression
e =
vs.
h. (Fig. 1b), the large variations in the original data are scaled to a mean curve. This method would be inadequate if we wish to use the PTFs to characterize the spatial variability within a field.
|
| THEORY |
|---|
|
|
|---|
The mathematical model of a neural network comprises of a set of simple functions linked together by weights. The network consists of a set of input units x, output units y, and hidden units z, which link the inputs to outputs (Fig. 2)
. The hidden units extract useful information from inputs and use them to predict the outputs. The type of ANN considered here is called the multilayer perceptron (Nørgaard, 2000). A network with an input vector of elements xl (l = 1, ... , Ni) is transmitted through a connection that is multiplied by weight, wjl, to give the hidden unit zj(j = 1, ... , Nh):
![]() | [2] |
![]() | [3] |
|
![]() | [4] |
The outputs from hidden units pass another layer of filters:
![]() | [5] |
![]() | [6] |
The weights are adjustable parameters of the network and are determined from a set of data through the process of training (Nørgaard, 2000). The training of a network is accomplished using an optimization procedure (such as nonlinear least squares). The objective is to minimize the sum of squares of the residuals (SSR) between the measured and predicted output.
Schaap et al. (1998) estimated van Genuchten parameters for 1209 soil samples from the USA using a neural network. They distinguished their PTFs based on the level of available information: texture (clay, silt, and sand); texture and bulk density; texture, bulk density, and measured
at -33 (
-33) and -1500 kPa (
-1500). They found that their PTFs performed better than four previously published multiple-regression PTFs.
A New Objective Function for Neural Network Pedotranfer Functions
Neural network analysis is quite powerful and according to Gershenfeld (1999) with one hidden layer that has enough hidden units, it can describe any continuous function. Conventionally, parametric PTFs train the network to fit the estimated van Genuchten parameters. Problems using this method are:
. However, we train the neural networks to minimize the difference between of the predicted and estimated parameter values p.
(Schaap et al., 1998). Since our aim is to predict water retention it will be sensible to train the network to fit the measured water content. We propose a new objective function for neural network training, which predicts the van Genuchten parameters and minimizes the difference between the measured water content and the one calculated from the predicted parameters. This is illustrated in Fig. 3 .
|
r,
s,
, and n.
from basic soil properties by minimizing objective function:
![]() | [7] |
(x) is the predicted parameter from inputs x.
![]() | [8] |
(x, h, p) is the predicted water content at potential h using the parameters p, which is calculated from inputs x. We term this neuro-m, a neural network with an objective function that matches the measured or observed values. | MATERIALS AND METHODS |
|---|
|
|
|---|
-h pairs, and basic soil properties: field bulk density (
b) and particle-size distribution, comprising the mass of particles <2 (P<2), 2 to 20 (P220), and 20 to 2000 µm (P202000), which were normalized to sum to 100%. Assuming a log-normal distribution, the geometric mean particle-size diameter (dg) of the particle-size distribution can be calculated according to Shirazi and Boersma (1984). Usually the data are only reported in terms of clay (P<2), silt (P220), and sand (P202000) which sums to 100%, dg can be simply calculated as:
![]() | [9] |
The water retention data was fitted to the van Genuchten equation using nonlinear regression with the Levenberg-Marquardt algorithm (Marquardt, 1963). The
r,
and n parameters of van Genuchten were estimated with the following constraints imposed:
r
0 m3 m-3, 0.0001
1 hPa-1, and 1.01
n
10. If
r < 0.0001 then its value is fixed at 0. The
s is fixed at water content of 0 kPa.
The prediction set alone was used to train the neural network. Since the prediction and validation data came from a similar source, it will be inequitable to compare the performance of other PTFs that have training sets containing exotic (non-Australian) soil sample. As an independent test of the PTFs, we used a published database GRIZZLY (Haverkamp et al., 1997) from Laboratoire d'Étude des Transfers en Hydrologie et Environnement (LTHE), Grenoble. The data consist of 660 soil samples collected from laboratories and field experiments in different countries. The mass of particles P220, P202000, P250, and P502000 were calculated from the cumulative particle-size distribution function, which has similar form as the van Genuchten equation (Haverkamp and Parlange, 1996), provided in the database. The statistics of the data are given in Table 1.
|
r,
s,
, n. Two levels of inputs were considered: (i) three inputs, when only particle-size distribution are available, the inputs are P<2, P202000, and ln(dg); (ii) four inputs, where bulk density and particle-size distribution are available, the inputs are
b, P<2, P202000, and ln(dg).Because some of the parameters exhibit nonnormal distributions, the following transformations were made: (
r)1/2, ln(
), and ln(n - 1). The network consists of one hidden layer with sigmoidal (tanh) activation function in the hidden layer and linear function in the output. Estimation of the weights were initially carried out using program MATLAB ver. 5.3 with freeware toolbox NNSYSID ver. 2.0 (Nørgaard, 2000). This is the neuro-p method, which minimized objective function Eq. [7]. The trained weights were used as an initial guess for neuro-m. The fine-tuning steps of the neuro-m method were programmed in Fortran. Minimization of the new objective function (Eq. [8]) was carried out using the Levenberg-Marquardt nonlinear least-squares (Marquardt, 1963). We use different initial estimates and select the solution which gives lowest SSR, nevertheless most solutions give similar SSR.
The performance of the proposed method in predicting water retention was compared with a published neural-network PTF. We used the neural network PTF developed by Schaap et al. (1998) which was developed from a wide range of soil types from the USA and was optimized to fit the hydraulic parameters. The network has six hidden units and calculation was performed using the program Rosetta available from the world-wide web (Schaap, 2000). Imam et al. (1999) have found that Rosetta performs better than other multiple regression parametric PTFs for a wide range of soil types from a global soil database. They also found that it is less sensitive to soil texture classes, implying that it is applicable to a wide range of soil materials. Two levels of input information were used (i) three inputs, which only used particle-size distributions: P<2, P250, P502000, (ii) four inputs, where bulk density is included: P<2, P250, P502000,
b. Particle-size data of the Australian system (P220, P202000), were converted to the USDA system (P250, P502000) using the empirical equations of Minasny and McBratney (2001):
![]() | [10] |
Perfomance Criteria
The performance of the PTFs predicting the individual water retention curve was assessed using four criteria. Mean residuals (MR) measures the bias, indicating the tendency of under- or overestimation:
![]() | [11] |
(h) pairs in a water-retention curve. Negative values of MR indicate that the PTFs underestimate the water content and positive values indicate overestimation. The root mean square of residuals (RMSR) calculates mean accuracy of prediction, which represents the expected magnitude of error:
![]() | [12] |
The mean deviation (MD) and root mean squared deviation (RMSD) of Tietje and Tapkenhinrichs (1993) calculate the area difference between the predicted and measured water retention curves:
![]() | [13] |
![]() | [14] |
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
![]() | [15] |
![]() | [16] |
![]() | [17] |
|
The elements of the neural network's weights, W and U, for Neuroman are given in Table 2. With these weights, the van Genuchten parameters can be calculated from basic soil properties (sand, silt, clay, and bulk density) using Eq. [6], or by a simple matrix operation on a spreadsheet. So that researchers can potentially use these PTFs, an example is given in the Appendix. There are different techniques proposed for the interpretation of the neural networks (Abrahart et al., 2001), but the weights are usually difficult to interpret. Neural networks that were trained using different initial values will produce different weights but might yield identical performance measures.
|
-h pairs:
![]() | [18] |
Table 3 shows the RMSR of the three methods applied to the Australian prediction, Australian validation and independent GRIZZLY data sets. In the prediction set, Neuroman clearly performs better than other methods. The relative improvement over Neuropath is 19%. Obviously Rosetta will not perform better as it is trained outside the Australian data set. However in the validation set, which is not used in training, using three inputs (sand, silt, and clay), Rosetta performs better than Neuroman. But when incorporating bulk density (four inputs) the performance of Neuroman is enhanced. The improvement over Neuropath is 11 and 29% over Rosetta. In the independent GRIZZLY data set, which does not come from the same population as the Australian data set, with three inputs, the improvement of Neuroman over Neuropath is 12% and only 5% over Rosetta. Using four inputs, the improvement is 25% over Rosetta. As the prediction improves significantly when incorporating bulk density, it suggests that bulk density is an important factor in predicting volumetric water retention. Overall the improvement of Neuroman is 13% over Neuropath and 30% over Rosetta.
|
|
|
Rosetta, which has more hidden units (six) and was trained using a larger data set, does not perform as well as Neuroman (Fig. 6) . This is confirmed from the analysis on an independent data set GRIZZLY (Tables 3 and 4), which shows that the RMSD is higher (Rosetta = 0.03 m3 m-3, Neuroman = 0.02 m3 m-3). Moreover, it has the tendency to underestimate the water content as shown by the negative values of MD and MR. Figure 5 shows that the predictions tend to lie below the 1:1 line. Figure 6 shows the mean of MR and RMSR as evaluated at each soil sample. A desired PTF should have MR near 0 and small RMSR, thus showing the relative better performance of Neuroman.
|
and n. Schaap et al. (1998) found low correlations between PTFs predicted and fitted parameters, but when used to predict
the performance is quite good. Since the neural networks attempt to predict the water content accurately, the predicted parameters should only be treated empirically.
|
| CONCLUSIONS |
|---|
|
|
|---|
Example of Using Neural-network Weights from Neuroman4 to Predict van Genuchten Parameters
If we have a soil with bulk density (
b) of 1.30 Mg m-3, clay (P<2) content of 45% and sand (P202000) content of 30%, we can calculate ln(dg) according to Eq. [9] which gives us ln
= -4.234. Using the weights from Neuroman4 (Table 2), we apply Eq. [6] as a matrix operation to predict the van Genuchten parameters. We arrange the input in as a vector: x =
T =
T, where 1 serve as bias. We multiply the input vector x by the weight matrix W to give us vector z:
![]() |
![]() |
We apply activation function f(z) = tanh(z) to each element of vector z to form vector r =
T. We add a constant 1 to the last row of r to serve as bias. We multiply weight matrix U with r to give v. Since the output activation is linear F
= v, output vector y is equal to v.
Transforming the result gives us an estimate of
r = 0.0002 m3 m-3,
s = 0.497,
= 0.027 hPa-1 and n = 1.102.
![]() |
![]() |
| ACKNOWLEDGMENTS |
|---|
Received for publication March 6, 2001.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
G. Tranter, B. Minasny, A. B. McBratney, R. A. V. Rossel, and B. W. Murphy Comparing Spectral Soil Inference Systems and Mid-Infrared Spectroscopic Predictions of Soil Moisture Retention Soil Sci. Soc. Am. J., August 20, 2008; 72(5): 1394 - 1400. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Lilly, A. Nemes, W. J. Rawls, and Ya. A. Pachepsky Probabilistic Approach to the Identification of Input Variables to Estimate Hydraulic Conductivity Soil Sci. Soc. Am. J., January 11, 2008; 72(1): 16 - 24. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Minasny and A. B. McBratney Estimating the Water Retention Shape Parameter from Sand and Clay Content Soil Sci. Soc. Am. J., June 8, 2007; 71(4): 1105 - 1110. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. A. Agyare, S. J. Park, and P. L. G. Vlek Artificial Neural Network Estimation of Saturated Hydraulic Conductivity Vadose Zone J., May 17, 2007; 6(2): 423 - 431. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Nemes, W. J. Rawls, Ya. A. Pachepsky, and M. Th. van Genuchten Sensitivity Analysis of the Nonparametric Nearest Neighbor Technique to Estimate Soil Water Retention Vadose Zone J., November 20, 2006; 5(4): 1222 - 1235. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Parasuraman, A. Elshorbagy, and B. C. Si Estimating Saturated Hydraulic Conductivity In Spatially Variable Fields Using Neural Network Ensembles Soil Sci. Soc. Am. J., September 20, 2006; 70(6): 1851 - 1859. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. K. Sharma, B. P. Mohanty, and J. Zhu Including Topography and Vegetation Attributes for Developing Pedotransfer Functions Soil Sci. Soc. Am. J., August 3, 2006; 70(5): 1430 - 1440. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zhu, B. P. Mohanty, and N. N. Das On the Effective Averaging Schemes of Hydraulic Properties at the Landscape Scale Vadose Zone J., March 8, 2006; 5(1): 308 - 316. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Nemes, W. J. Rawls, and Y. A. Pachepsky Use of the Nonparametric Nearest Neighbor Approach to Estimate Soil Hydraulic Properties Soil Sci. Soc. Am. J., February 2, 2006; 70(2): 327 - 336. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Somaratne, G. Seneviratne, and U. Coomaraswamy Prediction of Soil Organic Carbon across Different Land-use Patterns: A Neural Network Approach Soil Sci. Soc. Am. J., August 25, 2005; 69(5): 1580 - 1589. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Schaap, A. Nemes, and M. Th. van Genuchten Comparison of Models for Indirect Estimation of Water Retention and Available Water in Surface Soils Vadose Zone J., November 1, 2004; 3(4): 1455 - 1463. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Minasny, J. W. Hopmans, T. Harter, S. O. Eching, A. Tuli, and M. A. Denton Neural Networks Prediction of Soil Hydraulic Functions for Alluvial Soils Using Multistep Outflow Data Soil Sci. Soc. Am. J., March 1, 2004; 68(2): 417 - 429. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||