Identification of Soil Property Variation Using Spectral ... Output/Mila Luleva.pdf · Identifying...
Transcript of Identification of Soil Property Variation Using Spectral ... Output/Mila Luleva.pdf · Identifying...
Identification of Soil Property Variation Using Spectral and Statistical
Analyses on Field and ASTER Data.
A Case Study of Tunisia
Mila I. Luleva
February, 2007
Course Title: Geo-Information Science and Earth Observation
for Environmental Modelling and Management
Level: Master of Science (MSc)
Course Duration: September 2005 - March 2007
Consortium partners: University of Southampton (UK)
Lund University (Sweden)
University of Warsaw (Poland)
International Institute for Geo-Information Science
and Earth Observation (ITC) (The Netherlands)
GEM thesis number: 2005-26
Identification of Soil Property Variation Using Spectral and Statistical
Analysis on Field and ASTER Data.
A Case Study of Tunisia
by
Mila I. Luleva
Thesis submitted to the International Institute for Geo-information Science and Earth
Observation in partial fulfilment of the requirements for the degree of Master of
Science in Geo-information Science and Earth Observation: Specialisation:
Environmental Modelling and Management
Thesis Assessment Board:
Chairperson Prof. Dr A. K. Skidmore
External examiner Prof. P. M. Atkinson
Supervisor Prof. Dr F. D. van der Meer
Member 1 Prof. P. Pelesjö
Member 2 Dr. H. M. A. van der Werff
International Institute for Geo-Information Science and
Earth Observation, Enschede, The Netherlands
Disclaimer
This document describes work undertaken as part of a programme of study at
the International Institute for Geo-information Science and Earth Observation.
All views and opinions expressed therein remain the sole responsibility of the
author, and do not necessarily represent those of the institute.
i
Abstract
Identifying the most suitable and cost- time effective technique for studying soil
properties variation has been a challenge for many soil scientists. Even though the
spectral characteristics of the soil and its constituents have been widely examined,
the use of satellite remote sensing data and ASTER imagery in particular, has been
limited. The aim of this study was to investigate soil properties such as moisture
content, organic matter, electroconductivity, pH, Calcium, Carbonates, Magnesium,
Manganese, Sodium, Potassium, and Iron within similar soil types by ASTER. The
study area was located in South-East Tunisia, chosen mainly due to the limited
presence of vegetation and extensive regions of bare soil. The methodology consists
of chemical and physical composition determination, moving through fine resolution
soil laboratory spectra measurements and development of Partial Least Squared
Regression (PLSR) models to link these results to ASTER imagery. Two ASTER
scenes were selected- winter season, years 2003 and 2005, in terms of gathering
better overview of the current situation. These two have been pre-processed and
classified by Maximum Likelihood (two different band combinations), using the
same techniques and the same regions of interest (training/testing sites), identified
from reference soil map and ISODATA unsupervised algorithm. Three accuracy
assessment techniques were applied in terms of identifying the performance of the
classifications as well as probability levels for correctly classified pixels. It was
established that with 95% confidence, only one out of ten samples was ‘probably’
correctly classified. The spectra collected under laboratory conditions using ASD
FIELDSPEC spectrometer were stored in a spectral library, which was then
resampled to ASTER and, together with the derived spectra from the ASTER scenes,
was input into the PLSR model collectively with the determined concentrations. The
PLSR models resulted in a set of correlations between the properties and the
reflectance values (R2 >0.9 for EC, Ca, Mg). The PLSR coefficients were plotted in
terms of identifying important wavelengths related to the properties. With the
decrease of spectral resolution, these wavelengths were reduced and the final set was
observed mainly in the shortwave infrared region. Even though the findings were
based on limited data available, the conclusions were optimistic, acknowledging the
potential use of ASTER in soil properties prediction and determination. Further work
can emphasise on the use of more soil samples, implementation of different terrain
parameters and selection of number of homogeneous, but with different nature, soil
clusters for sampling.
ii
Acknowledgements
There are so many people that contributed to the completion of this thesis but
unfortunately I have to limit myself to mentioning only some of them.
First of all, I would like to express my greatest gratitude to the academic team
involved in this research. Prof. Freek Van der Meer, thank you very much for all the
help, support, patience and constructive criticism. Your professionalism has
contributed greatly not only to this project but also to the development of my interest
and knowledge in the area of remote sensing. Dr. Ulrik Martensson and Dr. Harald
Van der Werff, thank you for the help and support throughout this journey. Drs.
Boudewijn de Smeth, thank you for the hours you spend in the laboratory
The Tunisian team- Ali, Nejeh and Abderrazak, thank you for making me feel at
home during my stay in your beautiful country. The long and very hot hours we spent
in the field as well as all the help in general, have been much appreciated.
Enormous ‘Thank you’ goes to everyone involved in the development of GEM MSc.
I am extremely grateful that you gave me the opportunity to travel, study and meet so
many great people at the same time.
To all my amazing GEM classmates: I really cannot believe one year and a half has
gone so fast. You guys made it unforgettable. Thank you for everything! I wish you
all lots of luck and great success! Sarika, Yan, Shawn and Chen- I will miss you guys
a lot, especially when I want a nice coffee with a great company in the “Sandy
place”, or a portion of these amazing mushrooms with Chinese leaves- NO spices!! I
will keep quiet about the long hours we spent cooking and baking...
I would like to thank my family and my friends for all the support during those
difficult little moments that everyone has once in a while. Mom and Dad, without
you none of these would have been possible- ‘Благодаря ви за всичко’. Ivo, thanks
for being the best brother ever! Stef and Iain- you guys—there are no words to
explain how grateful I am to you—if one day I can help somebody at least half as
much, I will be the happiest person on Earth!!!
Everyone who helped me, supported me and believed in me- Thanks for everything,
you people made it possible!
iii
Table of contents
Abstract........................................................................................................................ i
Acknowledgements..................................................................................................... ii
Table of contents .......................................................................................................iii
1. Introduction........................................................................................................ 1
1.1. Background.............................................................................................. 1
1.2. Problem Statement................................................................................... 5
1.3. Research Questions.................................................................................. 6
1.4. Research Objectives ................................................................................ 6
1.5. Research Hypotheses ............................................................................... 7
2. Materials and Methods....................................................................................... 9
2.1. Study Area ............................................................................................... 9
2.2. Soil Types and Characteristics Within the Region of Interest ............... 10
2.3. Field Data Collection............................................................................. 12
2.4. Data Processing ..................................................................................... 13
2.4.1. Laboratory Samples Analyses........................................................... 13
2.4.2. Spectral analysis ............................................................................... 14
2.4.3. Image analysis................................................................................... 17
2.4.4. Soil types identification per sample .................................................. 22
3. Results.............................................................................................................. 25
3.1. Laboratory analysis results .................................................................... 25
3.2. Spectral and image analysis ................................................................... 26
3.2.1. Laboratory spectra and CIE diagram ................................................ 26
3.2.2. PLSR on laboratory spectral library.................................................. 27
3.2.3. Image analysis................................................................................... 28
3.3. Soil types and properties identification per sample ............................... 35
4. Discussion........................................................................................................ 39
4.1. Spectral library analysis......................................................................... 39
4.2. Partial Least Squared Regression models .............................................. 40
4.3. ASTER scenes analysis ......................................................................... 44
4.3.1. Spectral profiles per sample.............................................................. 44
4.3.2. Classifications................................................................................... 45
4.4. Soil types identification within the study area ....................................... 49
5. Conclusions...................................................................................................... 55
6. Recommendations............................................................................................ 57
7. References........................................................................................................ 59
8. Appendices....................................................................................................... 63
iv
8.1. Appendix 1: Selected figures and tables ................................................63
8.1.1. Figures ..............................................................................................63
8.1.2. Tables................................................................................................70
8.2. Appendix 2: Field work report...............................................................76
8.3. Appendix 3: Samples photographs ........................................................83
8.4. Appendix 4: Procedures for soil analysis...............................................84
8.5. Appendix 5: Classification Accuracies ..................................................87
v
List of figures
In Text:
Figure 1: Location of the Study Area
Figure 2: Flowchart of data processing methodology
Figure 3: PLSR coefficients from laboratory spectral library
Figure 4: PLSR coefficients: laboratory spectra vs. resampled to ASTER spectra
Figure 5: PLSR coefficients: ASTER 2003 vs. ASTER 2005
Figure 6: Aster classifications outputs
Figure 7: Soil type coverage defined by the classification techniques at different
confidence levels
Figure 8: Soil types according to classification for the sampling points
Figure 9: Soil types according to classification for the test points
In Appendix:
Figure A1: Soil maps
Figure A2: Flowchart of PLSR data arrangement procedure (Wold et al. 2001)
Figure A3: Distribution of sampling points
Figure A4: Eigen values from MNF transformed bands
Figure A5: a) CIE tristimulus curves; b) CIE Chromaticity diagram
Figure A6: CIE diagram and soil sample distribution according to their colour
Figure A7: Spectral signatures derived under laboratory conditions
Figure A8: Spectral profiles extraction from ASTER at each sampling location
Figure A9: Analysis of the three spectral libraries for selected sampling points
Figure A10: Misclassified pixels associated with Lithosols, Brown Lithosols, and
Lithosols/Regosols
vi
List of tables
In Text
Table 1: Results from laboratory analysis and precision assessment
Table 2: Albedo and soil colour analysis
Table 3: Spectral wavelength regions important for the PLSR prediction model of
the properties
Table 4: Image classifications overall accuracy, kappa coefficients and most
successfully classified soil types
Table 5: Validation matrix for sampling points against soil types from ASTER
scenes, 2 classification methods and different probability levels
In Appendix
Table A1: PLSR prediction model assessment from laboratory results and spectral
library
Table A2: PLSR prediction model assessment for the resampled to ASTER library
Table A3: PLSR prediction model assessment for spectra derived from ASTER scene
2003
Table A4: PLSR prediction model assessment for spectra derived from ASTER scene
2005
Table A5: Extraction of soil types from each classification per sample point
Table A6: Extraction of soil types from each classification per test point
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
1
1. Introduction
Over the last couple of decades, it has been a challenge to find the most appropriate
technique for studying soil properties effectively and, at the same time, reducing the
time and effort involved in field sampling and laboratory analysis. This has been of a
major concern not only for soil scientists but also environmental managers. In this
respect, the scope of remote sensing has been widely studied, looking more closely at
how the spectral soil response can be linked to various soil properties and
characteristics. Furthermore, as stated by Palacios-Orueta and Ustin (1998), the
relatively high spectral resolutions as well as contiguous placement of bands,
together with the shortwave infrared region of the reflected spectrum offered by most
of the newly launched sensors, create opportunities for the specialists in the field.
Even though, the heterogeneous nature of soil creates a lot of difficulties in terms of
assessing its properties from reflectance spectra, if the spectral response is well-
known, multispectral imagery could be successfully used for this purpose (Ben-Dor
et al 2002).
1.1. Background
As stated by Udelhoven et al. (2003), soil parameters are neither static nor
homogenous in space and time; however, the costs of the analytical procedures are
often a limiting factor when address spatial soil variability especially in large-scale
applications. Reflectance spectra have been used for many years as a source of
information about the variation in the Earth's surface composition (Van der Meer et
al., 2001). Consequently, the soil properties derived from spectra have been studied
long before the 1980s. Many studies have drawn their main emphasis on the visible
and infrared regions of the spectrum- between 0.3 and 2.8 µm (Baumgardner et al.
1985), and the trend has been followed to date, with some relationships established
from data in the thermal and microwave regions (Barnes et al. 2003). The majority
of the absorption features, diagnostic for mineral composition, occur in the short-
wave infrared portion of the wavelength spectrum ranging from 2 to 2.5 µm.
Moreover, Shepherd and Walsh (2002) state that most of the basic physical and
chemical soil properties show high correlation with derivative reflectance values
within the visible and short-wave infrared wavelengths. These authors also suggest
that subtle differences in the spectral shape can serve as a valuable base for
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
2
identifying soil properties mainly due to the fact that the soil spectra forms as a result
of the overlap between absorption features of many organic and inorganic
compounds (Shepherd & Walsh, 2002). To a large extent, the changes in the spectral
response, according to Ben-Dor et al. (2003), occur due to changes in soil albedo
and soil chromophores, where the former is strictly related to the physical soil
properties, while the latter – to the chemical. Many researchers have identified clear
relationships between the soil colour, organic matter and iron content, which are also
strongly correlated with the soil albedo. There has been a general consensus that soil
albedo is determined by three main factors: moisture content, surface roughness and
soil colour (Post et al., 2000). In many cases, however, other factors may also
strongly influence the albedo. Soil electroconductivity for example is a property of
the soil affected by not only soil moisture but also dissolved salts content, clay
content, mineralogy and soil temperature. A relationship established between this
soil property and soil albedo can indicate that the latter is influenced by one of the
factors affecting the electroconductivity. Baumgardner et al. (1985) identified that
the differences in moisture content, organic matter, soil texture, iron content as well
as salinity determine most of the spectral responses in this spectrum range.
The presence of organic matter in the soil has been found to decrease the overall
reflectance and result in decrease in the spectral contrast. Consequently, this makes
the relationship between spectral and physicochemical properties much more
difficult to examine (Palacios-Orueta and Ustin, 1998). Furthermore, as outlined by
Palacios-Orueta et al. (1999), organic matter is a crucial indicator not only of soil
erosion but also of soil water holding capacity and permeability. Measurements of
organic matter content in soil have been included in many studies. The wavelengths
identified by Barnes et al. (2003) as ones having strong correlation with organic
matter, are between 0.425 and 0.695 µm, however this is only a case if the soils have
the same parental material. The authors also identified that this relationship could be
affected by the iron and manganese oxides present in the soil. If the soils are from
different parental material, it has been suggested that the middle infrared region
should be examined (Barnes et al. 2003).
On the other hand, the shape of soil reflectance curves, according to Baumgardner et
al. (1985), is affected by the presence of strong water absorption bands at 1.45 and
1.95 µm. Recently, Barnes et al. (2003) have observed that soil moisture is most
effectively correlated to visible and near infrared reflectance of bare soil fields if the
data are collected a few days after rainfall. According to Van der Meer et al. (2001),
the increasing moisture content decreases the overall reflectance of the soil. The
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
3
microwave part of the spectrum has also been suggested by some; however, the
process could be highly limited by the presence of vegetation (Barnes et al. 2003).
Salinity of soils has been studied from spectral reflectance as well, where most of the
features seen in the spectrum of saline minerals can be quantitatively measured by
laboratory or field spectroradiometers (Farifteh et al., 2006). It has been found that
saline soils give lower reflectance than non-saline throughout the range between 0.5
and 2.38 µm (Baumgardner et al., 1985). In the case of salt-affected soils, the white
crust that forms on the surface tends to have higher reflectance in the visible and near
infrared part of the spectrum (Barnes et al. 2003). However, as stated by Barnes et
al. (2003), this should not be considered as a clear indicator due to the fact that soils
with high sand content will have a response curve, which has similar shape, in the
visible and near infrared region, to the saline soils.
Soil texture has been studied by many using analysis of spectral reflectance. To
minimise the effect of other properties, Barnes et al. (2003) suggest the use of
combined data from the visible, near infrared and thermal infrared spectra. The
authors also recommended direct field sampling in terms of improving the accuracy
of interpretation of the bare soil imagery. Particle size has been identified to
influence strongly the soil reflectance. As stated by Baumgardner et al. (1985)
referring to an earlier study done by Bowers and Hanks (1965), with the decrease in
particle size, a strong exponential increase in reflectance can be noted, throughout all
wavelengths in the range of 0.4 to 1.0 µm. Although several techniques have been
found relatively efficient in studying soil texture from reflectance properties, it is
important to note that very often only textural classes can be identified from the
spectral response (Barnes et al., 2003).
Many studies have been conducted relating Potassium (K) to spectral response.
Shepherd and Walsh (2002) applied a non-linear regression technique known as
MARS (multivariate adaptive regression spline), to predict the concentration of
exchangeable K and P (Phosphorus) from soil reflectance. The results from the study
were relatively poor however the authors suggest that the relationships are strong
enough to permit discrimination of soils falling above or below a specific threshold
value. It is known, however, that non-polluted soil has concentrations of around 2-
3% for exchangeable K and P. Therefore, the change in the concentration will not be
easily detected using spectral response curves, unless high amounts of these elements
have been artificially added to the soil.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
4
As indicated so far, there is a great amount of literature covering the in-situ
laboratory procedures for estimating and predicting soil properties. These
procedures, however, cannot always be applied, especially when it comes to large or
remote areas. Interestingly, not much has been done on the use and applications of
remote sensing techniques in this field in terms of identifying soil properties, despite
the fact that, according to Baumgardner et al. (1985), studying soil for mapping
purposes with aerial photographs dates from year 1929. Ben-Dor et al. (2002)
identify two general, but very important limitations related to the use of remote
sensing in soil science: only the top few centimetres can be studied and soil
vegetation masks the response from the soil surface. Recently it has been concluded
that multispectral remote sensing is an advanced technique that allows the
identification of targets, in this case soil properties, based on their extensively
studied spectral absorption features, with a near-laboratory quality detection of
reflectance spectra (Ben-Dor et al. 2002). There have been many efforts to include
aerial photographs and satellite images of bare soil in terms of studying organic
matter and phosphorus levels. Lopez-Granados et al. (2005) pointed out that these
approaches were mainly based on linear regression with brightness values from the
blue, green and near infrared bands. Chabrillat et al. (2002), on the other hand,
outline the fact that, unlike multispectral imagery, hyperspectral remote sensing
provides a continuous spectrum for each pixel, which enables the spectral
identification of minerals, rocks, or soils with laboratory-like reflectance
spectroscopy at the remote sensing scale (Chabrillat et al. 2002). Ben-Dor et al.
(2002) also note that since soil is a very heterogeneous material it is more difficult to
apply quantitative analysis to multispectral remotely sensed data.
ASTER (Advanced Spaceborne Thermal Emission and Reflection Radiometer) is a
cooperative effort between NASA and Japan's Ministry of Economy and Industry.
The instrument captures high spatial resolution data in 14 bands from the visible to
thermal infrared wavelengths and provides stereo viewing capability for digital
elevation model creation. It consists of three separate subsystems- VNIR, SWIR and
TIR. The VNIR (visible and near infrared) operates at the visible and near infrared
wavelengths with a resolution of 15m, while the SWIR (short wave infrared) has a
resolution of 30m and operates in six spectral bands in the near infrared region. The
TIR (thermal infrared) has a resolution of 90m and obtains data in 5 bands all in the
thermal infrared region (California Institute of Technology, 2006).
In a study by Chang & Islam (2000), soil moisture and brightness temperature have
been determined from ASTER imagery, in terms of identifying different soil types by
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
5
examining a sequence of remotely sensed images. A great disadvantage of the
procedure, however, is the fact that the results contain information only about the
near-surface soil properties. The procedure is valuable when performed on remote or
large areas. ASTER imagery has also been used in a study by Apan et al. (2002),
identifying different soil attributes, soil colour and texture in particular, as well as
land cover types. Reasonable separability has been indicated in ASTER Bands 2 and
8, for both soil colour and texture. The authors also point out that SWIR band ratios
are not useful for separability purposes.
1.2. Problem Statement
Spectrometry under laboratory conditions has been proven to be rather effective, and
therefore has been used in many soil related studies. Remote sensing techniques,
however, have not been applied to such extent, especially when it comes to
multispectral imagery such as ASTER. This is mainly due to the difficulties related
to the choice of study area clear from vegetation and containing most of the variation
in the top 10 cm as well as the coarser resolution provided by the sensor compared to
the laboratory spectra. Moreover, as stated by De Jong & Epema (2001), reflection
measurements, when taken in the field, represent a complex mixture of soil
components, varying soil surface conditions and atmospheric effects. The main
purpose of this study, therefore, is to explore the possibilities of using this type of
imagery in identifying direct and indirect relationships between soil constituents
measured in the laboratory, spectrometer measurements also performed in the
laboratory, and ASTER data.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
6
1.3. Research Questions
• Can soil properties, such as moisture content, organic matter and Calcium,
Potassium, Magnesium, Manganese and Sodium concentrations, at trace
level, be determined from the spectral response derived under laboratory
conditions as well as from ASTER images?
• Can relationships obtained from laboratory analysis, both direct and
indirect- via the spectral response, between the above mentioned soil
properties, be established with analyses of ASTER images?
• Is ASTER imagery suitable for detecting variation of the chosen soil
properties over small areas, considering the similar nature of different soil
types?
1.4. Research Objectives
• To identify the most suitable soil properties for analysis according to the
characteristics of the study area as well as the literature.
• To analyse two ASTER scenes in terms of identifying variation in soil types
and respectively soil properties associated with them.
• To create spectral libraries of all soil samples in terms of identifying unique
absorption features and variation between the samples, from spectral
responses obtained under laboratory conditions as well as these extracted
from ASTER imagery.
• To identify relationships between certain wavelengths from the spectrum
and the absorption features associated with them, using PLSR models and
assess their applicability.
• To evaluate and combine the results from the image analysis with those
derived under laboratory conditions, in terms of achieving maximum
performance of the technique.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
7
1.5. Research Hypotheses
• The presence of soil properties such as moisture content, organic matter and
Calcium, Potassium, Magnesium, Manganese and Sodium concentrations in
soil can be derived from the spectral response measured in the laboratory as
well as from ASTER images.
• Relationships can be derived from laboratory analysis, both direct and
indirect, between the above mentioned soil properties, and be established
with analyses of ASTER images.
• ASTER imagery is suitable for detecting variation of the chosen soil
properties over small areas.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
9
2. Materials and Methods
2.1. Study Area
The study site is located in South-Eastern Tunisia (see figure 1). The area is 16.206
km2, with top left and top right coordinates respectively x: 601530 y: 3729154 and x:
602546 y: 3728787 (WGS 84, UTM zone 32). It is located in the Matmata
Mountains through the Jeffara plain into the Gulf of Gabes. The region has been
chosen not only due to the great interest from governmental and non-governmental
organisations, but also because of the relatively large areas of bare soil. It is
characterised by typical Mediterranean climate with maximum temperatures reached
in the period between June and August (48ºC), whereas the coldest temperatures are
measured between December and February. Due to the proximity to the sea, the
climate of the study area slightly differs from the typical arid or semi-arid. The
rainfall is very irregular and ranges between 150-240mm with an average of 30 rainy
days per year (September/October). This low rate of rainfall combined with the high
temperatures in the summer result in very high potential evapotranspiration up to
around 1321mm. The yearly climatic water balance is almost negative. There are two
major geological landscapes- the Djabel Matmata and the Jeffara. The former is
characterized by an average altitude of 500 meters (northern altitudes reach more
than 600). The latter, on the other hand, reaches the highest level of 100m above sea
level. The geology is comprised of continental and marine formations, the most
recent one dating from recent Quaternary. There are two main soil groups- Aridisols
and Entisols, with very small depth, with big patches of bare rock due to extensive
erosion. The area is mainly rural, with livestock and rain fed farming activities. A lot
of the land is agricultural, mainly olive trees plantations. The population of the
region is estimated at around 62 00 inhabitants spread across 13 imadas
(administrative areas) divided into 5 ethnic groups.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
10
Figure 1: Location of the Study area.
Source of map: http://www.africa-expedition.com/images/ct/tunisia-map.jpg [accessed 5/01/2006]
2.2. Soil Types and Characteristics Within the Region of Interest
The diversity of the soil types within the study area is limited. To come to this
conclusion, two soil maps have been studied in terms of initially identifying the
variation. The older version has been produced by the Soil Division of the Tunisian
Ministry of Agriculture in year 1973. The scale of the map is 1:500,000. The second
map, produced in year 1986, has been prepared by the ACSAD service of mapping
in scale 1:1,000,000 (Figure A1- Appendix 1.1). The types of soils as well as their
codes are explained below. The description of the soil maps is provided by the Soil
Survey Staff (1999), and it is done in numerical order per class, taking into account
the units used on the soil map from 1986.
Aridisols are soils with limited moisture content and holding capacity. Due to runoff
or a very low storage capacity the soil moisture regime is aridic, despite the fact that
during periods of heat, the soil water is held at potentials less than the permanent
wilting point or has a great content of soluble salts caused by the extreme imbalance
between evapotranspiration and precipitation. Tunisia’s main soil group is the
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
11
Aridisols, covering 75% of the Southern part of the country, and having 21
subclasses in total. Within the area of interest, covered by the ASTER scenes (60 x
60 km), there are 5 soil sub-classes (see Figure A1-Appendix 1.1). These include
typic Salids (codes 4 and 5-Figure A1b), typic Paleorthids (code 8), which type is
also associated with the sampling study area, typic and petrogypsic Gypsids, or also
known as Gypsiothords (code 10 and 14 respectively), typic Calcids (code16 and
18). The Salids (also referred to as Halomorphs in different classifications) are most
commonly found in depressions or closed basins in the wetter areas bordering the
desserts. These are usually associated with accumulation of excessive amount of
salts, more soluble than gypsum, with typical electroconductivity of 30 µS/cm. If this
type of soil is used for agriculture, the salts are usually leached out. Paleorthids are a
class removed from the revised legend produced by USDA; however the main
characteristics of this type are stony soils, prominent to Calcium Magnesium
encrusting (Ca/Mg) with relatively low salts content. Gypsids are soils that have
gypsic or petrogypsic horizon with its upper boundary within 100cm of the soil
surface. These are characterised by accumulation of gypsum that takes place initially
as crystals aggregates in the voids of the soil. These aggregates build up, which
process results in displacement of the enclosed soil material. Some of the Gypsids
are characterised by a calcic horizon which overlays the gypsic horizon. The last
subclass of the Aridisols found in the area of interest is the Calcids. These are
developed from Calcium Carbonate as a parent material. Precipitation tends to leach
or move the carbonates to great depth. The upper boundary of the calcic horizon is
usually within 50cm of the soil surface. If the soils are irrigated or cultivated,
micronutrient deficiencies are normal.
The second main group of soils is the Entisols (also referred to as Lithosols). These
have little or no evidence of the development of pedogenic horizons, mainly
explained by the fact that in many landscapes the soil material is not in place long
enough to form these horizons. These soils are present either on steep slopes which
are actively eroded, or on flood planes, receiving new deposits of alluvium at
frequent intervals. The only features common to all soils under the order of Entisols
are the virtual absence of diagnostic horizons and the mineral nature of the soils. It is
possible, due to the specific location and different composition, these soils to be
mixed with other soil types resulting in sub-classes such as Brown Lithosols,
Lithosols/Regosols, etc. These, however, are essentially Lithosols, with small
number of additional constituents. The most clearly defined two subclasses
representing this diverse soil group in the region of interest are Torriorthents (code
33) and typic Torrifluvents (code 34), referred to as ‘fluvial soil’ further in the
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
12
discussion. The typic Torriorthents are shallow to weakly cemented rocks. Some are
moderately deep or deep to hard rock and are extremely dry. These are mainly
present on gently slopes where the sediments are recent and have little organic
carbon. On the other hand, the Torrifluvents are the Fluvents of the arid climates.
Characterized by aridic moisture regime, these soils have higher pH and are mainly
calcareous; however some could be salty depending on their location. The typic
Torrifluvents are the driest Fluvents. These are defined by big cracks and
slickensides, or wedge-shaped aggregates. In their upper 75cm, they do not have a
deposit of pyroclastic minerals (Soil Survey Staff, 1999).
2.3. Field Data Collection
The fieldwork was spread over 2 weeks between the 15th
and 30th
of September
2006, when the first was associated with visual inspection and selection of the study
area. Information about particular soil features and processes such as soil encrusting
formations, erosion rate and debris formation, vegetation characteristics and
seasonality (if any), was also obtained. The study area was selected according to the
following criteria: Areas with no mixture of soil types, availability of bare soil (no
vegetation and no bare rock), and easy access.
The sampling procedure is clustered random sampling. Thirty five soil sampling
units were collected. Six duplicate samples for accuracy assessment of the chemical
analysis were also located at every fifth location. The locations of the sampling units
were selected according to randomly generated coordinate pairs within the study area
(defined by the watershed and the Matmata Mountain)- see Figure A3- Appendix
1.1. Around 200g of soil material was collected at each location and placed in paper
and plastic bags for storage.
The measurements in the field included: location the sampling units (using GPS),
measurements of reflectance using ASD FIELDSPEC spectrometer (not included in
the analysis due to the missing measurements in the SWIR region), pH, and
identification of soil colour using Munsell colour chart. A record of these
characteristics as well as GPS accuracy and projection was kept and it is attached in
Appendix 2. The grain size was not measured in the field due to the many lumps of
soil that have occurred because of the high temperatures. A soil map was not
available at the time of sampling, and therefore soil types were identified after the
field work. Any clear remarks that were considered critical for the analysis (e.g.
vegetation, weather conditions) were noted and are also included in Appendix 2.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
13
2.4. Data Processing
The methodology of the data processing is displayed on Figure 2. Each one of the
stages is described in more details further in the text.
Figure 2: Data Processing flowchart. The numbers correspond to the appropriate
section in the text
2.4.1. Laboratory Samples Analyses
Out of the 35 soil samples, the geochemical laboratory analyses were performed on
only 9 (plus one duplicate) of them transported to the Netherlands, due to financial
and time constraints. The analysis of the soil in the laboratory included analysis of
pH, electroconductivity, moisture content, organic matter, as well as chemical
concentration detection of Calcium (Ca), Magnesium (Mg), Manganese (Mn),
Sodium (Na), Potassium (K) and Iron (Fe). The carbonate concentrations have also
been estimated.
A factor, which strongly affects the spectral response of soil, is the soil structure and
grain size. In the laboratory this has been established, where the larger than 2mm
fractions have been placed in dishes and observed. The large lumps, originally
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
14
identified in the field, have not been identified due to the fact that they have broken
down during the transportation and/or sieving. The large particles found in the
laboratory had a diameter of around 1 cm. The procedures applied on the samples
can be found in Appendix 4 or referring to ISRIC Technical paper 9, (Van Reeuwijk,
2002). The concentrations of Ca, Mg, Mn, Na, K and Fe have been detected using
Induced Coupled Plasma- Optical Emission Spectrometer (ICP-OES). The actual
procedures followed during the laboratory analysis are included in Appendix 4,
while the results are summarised in Table 1. Iron concentrations are not included in
the table due to the fact that they have been found to be lower than the detection limit
of the apparatus. Records of the results have been kept and these have then been
implemented into the following stages of the analysis.
2.4.2. Spectral analysis
2.4.2.1. Laboratory spectra
Spectral analyses of the soil have been performed in laboratory conditions using
ASD FIELDSPEC spectrometer. Albedo and soil colour analysis have also been
done using special extension to ENVI software and a device-independent colour
space, originally developed in 1931 by the Commision Internationale de l’Eclairage
(CIE). It is based on the tristimulus colour theory of the human vision (Van der
Werff, 2006) and it has been accepted internationally as a standard for colour
measurements (De Jong & Epema, 2001). The relationships between the bands and
the soil properties have been studied using simple statistical correlations. Partial least
squared regression models (PLSR) using each of the produced spectral libraries
produced, have been run in terms of linking the laboratory spectra to the ASTER
images.
2.4.2.2. CIE diagram
CIE diagram and special extension to the ENVI software have been used. The
program separates the colour form the albedo providing coordinates for referencing
with the CIE diagram and at the same time calculating the percentage albedo. As
described by Mainam (1999), the system uses a set of imaginary XYZ primaries, also
known as tristimulus values that allow most colours, to be defined by a triplet of
these primaries. According to Van der Werff (2006), the CIE standard observer
shows how much of each primary would be used by an average observer to match
each wavelength to light. For the computation of the XYZ tristimulus values, the
tristimulus curves (Figure A5a) are integrated over a spectrum. To plot the colours
on a two-dimensional graph (such as a chromaticity diagram- Figure A5b),
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
15
chromaticity coordinates can be used. These are colour definitions independent of
the luminance (Y) and are calculated by a normalization of the tristimulus values
(XYZ), Van der Werff (2006). More detailed description of the procedure is given
by Brown & Feringa (2003). In this research, the CIE coordinates used are x, y
(chromaticity coordinates) and Y (luminance). The main purpose of these analyses
was to estimate the variation of albedo and soil colour between the samples, which
then determines the pre-processing techniques applied on the spectra prior to Partial
Least Squared Analysis (see next section).
2.4.2.3. PLSR
PLSR (Partial Least Squares Regression) was originally developed by Wold (1981)
and it provides quantitative multivariate modelling methods, with inferential
possibilities similar to multiple regression, t-test and ANOVA. It can analyse data
with strongly correlated x variables, and at the same time model several response
variables. PLSR is a way to estimate parameters in a scientific model, which
basically is linear. This model, like any other scientific models, consists of several
parts: the philosophical, conceptual, the technical, the numerical, the statistical and
so on. It is important to note that before the analysis, the X and Y variables are often
transformed to make their distributions be fairly symmetrical. Furthermore, the result
from the method depends highly on the scale. In this respect, with the appropriate
scaling, the focus of the model could be on more important Y-variables and use
experience to increase the weights of more informative X variables. However, in the
absence of knowledge about the relative importance of the variable, the standard
multivariate approach is to, firstly, scale each variable to unit variance by dividing
them by their standard deviation, and secondly, centre them by subtracting their
averages (a process also called auto-scaling). This corresponds to giving each
variable the same weight or the same prior importance in the analysis (Wold et al.
2001). The latter is also strongly recommended for ease of interpretation and for
numerical stability. The PLSR data arrangement procedure is displayed in Figure A2.
There are a number of assumptions that are made related to PLSR: the investigated
system or process is influenced by just a few underlying variables (latent variables).
Both X and Y variables are assumed to be realisations of these latent variables and
are not therefore considered independent. In spectroscopy, it is clear that the
spectrum of a sample is the sum of the spectra of constituents multiplied by their
concentrations on the sample. Furthermore, any data analysis is based on the
assumption of homogeneity. This means that the investigated system or process must
be in a similar state throughout all the investigation, and the mechanism of influence
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
16
of X and Y must be the same, which limits the diversity of the variables. X should
contain a quantitative description of the variation in chemical structure between the
investigated compounds. The objective is to understand the variation of Y.
There are some limitations related to PLSR models. Udelhoven et al. (2003) point
out that when the spectra is normalised, there is a clear reduction in the spectral
albedo variations due to grain size differences between the soil samples. Typically,
the small soil surface variations and micro shadows have a large impact on the
measurements which cannot be eliminated by simple spectra pre-treatment. Another
reason for different PLS results could be unequal distribution of nutrients and water
content (Udelhoven et al., 2003). It has been found that on no account the achieved
results can be compared to measurements of the soil surface taken by remote
operating system. If up-scaling is planned, factors such as the atmospheric
transmission path as well as larger spectrally mixed footprints are taken into account.
PLSR has been performed on the measured variables and wavelength range between
500 and 2500nm, using the 10 samples analysed in the laboratory. The software
used for the analyses is ParLeS version 2.1a (Viscarra-Rossel, 2005). The produced
prediction model, as stated by Scholte (2005), is a set of equations (or PLSR
factors), computed as linear combinations of the spectral amplitudes, by using a
regression coefficient for each wavelength position. Therefore it is clear that the
quality and accuracy of the prediction model strongly depends on the pre-processing
techniques, such as scaling or auto-scaling of the data, detection of outliers and also
the number of factors used in the model. In more detail the importance of the pre-
processing is discussed in Scholte (2005). The best training set has been achieved
when five factors are used and the spectra is normalised. Here the normalisation has
been applied in terms of removing the albedo effect, due to the fact that from the CIE
analysis, the albedo has shown little variation between the different samples. A
number of techniques for accurate determination of the optimal number of factors
have been discussed in the literature (Davies, 2001; Scholte, 2005). The smoothing
technique applied to the library derived under laboratory conditions is Savitzky-
Golay filter, second polynomial. This function performs a local polynomial
regression on a distribution to determine the smoothed value for each point (Savitzky
& Golay, 1964). The main advantage of this filter is the fact that the features of the
distribution (relative maxima, minima, width) are preserved. The smoothing
technique proven to be more effective for the libraries derived from the two ASTER
scenes is Median Filter. This is a non-linear digital filtering used for noise removal,
by examining a sample and assessing if it is representative or not.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
17
The results from the PLSR analyses are a set of calculated values (summarised in the
results section). These include the root mean square error (RMSE), ratio of
prediction to deviation (RPD) and mean and relative mean absolute errors (MAE),
where only the first two are used for the assessment of the models performance.
Additionally, the R2, or also known as coefficient of determination, has been
estimated, showing the strength of statistical correlations between the measured in
the laboratory properties concentrations and the models predictions. The RPD
values together with the R2 have been used in terms of assessing the accuracy of the
predicted Y variables. According to Williams (2001), to indicate good prediction of
the model the RPD value should be higher than 2.5 while the R2 should be estimated
0.91. Viscarra Rossel (2005) provides a classification which defines the quality of
the predicted value with the PLSR software. This is stated under each of the tables
containing the results from these analyses. The procedure has been applied on the
laboratory derived spectral library, the resampled to ASTER spectral library, as well
as on the spectra measured from the ASTER scenes after processing, in terms of
identifying the effect of the reduction of spectral resolution. The produced PLS
regression coefficients were then plotted for each property of interest in terms of
identifying the most significant wavelength regions for the prediction of the soil
constituents. The results from these analyses have been combined with those from
the following image manipulations (see section 2.4.3 and Figure 2), in terms of
identifying and predicting accurately the soil properties of interest (see section
2.4.4).
2.4.3. Image analysis
Two ASTER scenes have been acquired, same season (winter) but different years
(2003, 2005), covering the same area. The raw 2A type images have been
georeferenced, and the Digital Elevation Models extracted from them. The softwares
used for the analysis are Arc GIS 9.1 and ENVI 4.2. A number of analytical
techniques have been used in terms of identifying soil types and properties, as well as
variation between and within soil classes. The purpose of using two ASTER scenes
is not to detect changes in soil composition or land cover changes, but rather to
minimise effects that were not identified by the pre-processing techniques. This is
done by comparing and combining the information from the two.
2.4.3.1. Atmospheric correction
Atmospheric correction was required in terms of converting the radiance values into
reflectance. The procedure originally chosen for this was ENVI FLAASH (Fast Line-
of-sight Atmospheric Analysis of Spectral Hypercubes). This is known as the first-
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
18
principles atmospheric correction modelling tool for retrieving spectral reflectance
from radiance images. It corrects wavelengths in the visible through near infrared
and short wave infrared range. In terms of achieving faster and more efficient
performance of the operations, originally the image was planned to be resampled to a
smaller area that covers only the study area and some surrounding regions. This
procedure was applied to both scenes; however, due to the extensive cloud cover, not
removed after the atmospheric correction, over the scene from year 2005, it did not
result in useful output and therefore all analyses were performed on full scene
coverage (wherever possible). Unfortunately, the results produced were not
satisfactory and therefore other techniques were examined.
Log Residuals as well as Internal Average Relative Reflectance (IARR) calibration
tools were then applied in terms of obtaining the optimal atmospheric correction
technique. The Log Residuals is used when there is a need of solar irradiance,
atmospheric transmittance, instrument gain, topographic effects and albedo effects
removal from radiance data. The result is a pseudo reflectance image, based on in-
scene statistics that could be used in absorption features identification. In terms of
defining the logarithmic residuals the input spectrum is divided by the spectral
geometric mean, and then divided by the spatial geometric mean. The spectral
geometric mean is the mean of all bands for each pixel and it is used to remove the
topographic effects while the spatial geometric mean is used because the
transmittance and other effects are considered multiplicative. The spatial geometric
mean is the mean of all pixels for each band and accounts for the solar irradiance,
atmospheric transmittance and instrument gain (ENVI 4.2, 2007). On the other hand,
the IARR is used to normalise images to a scene average spectrum. It is found to be
effective for reducing image data to relative reflectance in an area where no ground
measurements exist and little is known about the scene. It has proven to work best in
arid areas with little or no vegetation. The algorithm is designed to calculate an
average spectrum from the entire scene and use it as a reference spectrum, which is
then divided into the spectrum at each pixel of the image. The method found to be
the most useful and give the most accurate (similar to the resampled library) results
is the Log Residuals algorithm which was chosen for the analysis.
2.4.3.2. Histogram matching
In terms of equalising (or make it as close as possible) the brightness of both scenes,
histogram matching was performed. The function automatically matches the
histogram of one image to another, resulting in a histogram of the display, where the
function has started, matching the source histogram (ENVI Help, 2006).
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
19
2.4.3.3. Minimum Noise Fraction
Minimum Noise Fraction (MNF) has been applied to exclude the bands from the
ASTER scenes that contain noise. This is essentially a sequence of two principal
component transformations. It is assumed that the data for each pixel consists of
signal and noise, where the adjacent pixels contain the same signal but different
noise (Zhou et al. 2005). The resulting MNF bands represent linear combination of
the original spectral bands. The first MNF band contains the largest percentage of
data variance and the highest spatial coherence, the second contains the second
largest eigenvalue and coherence and so on until reaching the last band, which is
essentially noise-dominated and has the least variance. The MNF bands included in
this study are bands 1 to 4, which are most coherent and provide noise-filtered
datasets.
2.4.3.4. Spectral profiles per sample
The spectral signatures have been measured per sampling point, from the ASTER
scenes. These, then have been examined in terms of recognising main regions of
variation between the samples as well as common absorption features that coincide
with the peaks and sinks outlined by the PLSR analyses (as stated earlier).
2.4.3.5. Image classifications
A number of classifications have also been performed on both images. The purpose
of this is to define the distribution of the different soil types across the area of
interest. The results are then implemented into the following stages of the method
(see figure 2) in terms of identifying and predicting the soil properties. ISODATA
(Iterative Self-Organising Data Analysis Technique) unsupervised classification was
chosen in terms of identifying areas suitable for training sites, while Maximum
Likelihood supervised classifications have been done using ASTER bands 1-9 and
MNF bands 1-4. The results from the supervised classifications were then compared
in terms of assessing the accuracy.
ISODATA is an unsupervised classification scheme, developed with empirical
knowledge gained through experimentation, which requires relatively little
information. Because of the nature of this approach, many problems may emerge.
One of them, typical for all unsupervised image classification techniques, is the fact
that certain proportion of the generated clusters cannot be labelled as an
informational class and therefore the pixels should be excluded from the image or
reclassified (Klein, 2006). Another problem is that, unless the number of desired
classes is well-known, the choice is usually a guess that does not necessary lead to
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
20
the desired outcome. Nevertheless, the classifier was used so that a general idea
about the area is obtained. Applying unsupervised classification method before the
supervised allowed the identification of the most common and the most insignificant
classes. The result served as a base for the definition of the training sites. The most
homogeneous areas were compared against the topographic map. The ones identified
as bare soil have been checked with the available soil map from year 1973 (scale
1:500000) and have been used as the so-called ‘Regions of Interest’ in the maximum
likelihood classification.
Maximum Likelihood supervised classification algorithm has been chosen for this
stage of the analysis. It considers the cluster centre as well as its size, shape and
orientation, by calculating a statistical distance based on the mean variance and
covariance matrix of clusters. The main assumption behind this classification
technique is that the statistics of the clusters have a Gaussian distribution. This
method for image classification has been used by many and it is known to give the
most accurate results (Gomez et al., 2005). Another reason for the choice of this
classifier over others is the lack of soil variability within the study area. It allows
more accurate classification of the mixed pixels typical for areas located on the
borders between two different soil classes.
Two band selections have been used, chosen according to previous studies on soil
image classifications. The band combination includes all visible and infrared (one to
nine) ASTER bands, since not only it is identified by Apan et al (2002) that all
ASTER bands contain information about soil attributes, but also due to the fact that
different band combinations and ratios were tested and produced poorer results due
to the nature of the soil types in the area. The second method uses the first four
Minimum Noise Fraction (MNF) bands, found to contain 98% of the de-noised
information. To avoid confusion between the results from the two approaches in
ASTER image classification, the former will be referred to as Maximum Likelihood
(MaxLik) and the latter as MNF ASTER, although both procedures are Maximum
likelihood classifications.
The accuracy assessment of the classifications has been performed applying three
different techniques: assessment of the accuracy according to the regions of interest
(training sites for each class), assessment of accuracy according to the soil map from
year 1973 and examination of the rule images produced prior to the classifications.
Realistically, the first two procedures are relatively unrepresentative due to the fact
that the regions of interest (training and testing sites) are selected according to the
same soil map which is later used for the accuracy assessment. Furthermore, this soil
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
21
map is produced 30 years prior to the date of production of the older ASTER scene
(2003). Therefore, differences outlined from the classifications, and considered
inaccuracies, could be simply detected changes. In this sense, the third accuracy
assessment procedure was performed, implementing the use of Rule images. Each
pixel in a rule image represents the similarity between the corresponding pixel in the
image to a reference spectrum (Debba et al. 2005), which in this case are the regions
of interest selected to represent the soil types. The digital number (DN) value
represents the angular distance in radius between each pixel spectrum and the
reference. The user defines a threshold, according to a probability level, and if the
angular distance is smaller than the defined threshold, the pixel is assigned to the
category of the respective reference material (Debba et al. 2005).
Ultimately, to assess what is the probability a particular pixel to be assigned to the
correct soil type class, the classification methods were tested against different
probability levels at 75%, 85% as well as 95% degree of confidence. The limitation
here is also related to the selection of the training and testing sites, which ultimately
leads to the conclusion that the separability and detailed investigation prior to the
selection of the regions of interest are crucial. In ENVI 4.2, this procedure is done,
plotting the DN values on a histogram and calculating the threshold value for each
class. This is followed by calculating the basic statistical parameters such as mean,
median, mode, standard deviation as well as the percentage area covered by the
classes taking into account the probability level.
Due to the limited number of validation points, the rule images have been resized to
the region covered by the selected study area and only the performance of the soil
types known to be found in this area (from the soil map) have been tested. Figure 7
is produced in terms of comparing the results from the classification techniques as
well as the performance of each ASTER scene. It is important to note that the
percentages allocated to each soil type do not add up to 100%, which means that the
rest of the area should be covered by a different soil type, or the pixel simply could
not be allocated to any class due to spectral mixing.
Furthermore, the 9 validation points were tested against the results. For this purpose,
a validation matrix was created (see Table 5). Each point is tested against this matrix.
If the point is classified as one soil type on both images by both classification
techniques (receiving 4 validation units), the soil type would be correct for the
confidence level at which the point is tested. Respectively, if one of the classification
techniques performed differently for only one of the scenes, therefore the point will
be assigned with a validation unit of 3, and will fall into class ‘probably classified
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
22
correctly’. The procedure continues by assigning 2 units (‘possibly classified
correctly’) or 1 ‘not classified correctly or belongs to a different class’. The points
with assigned values of 2 and below are considered to fall on mixed pixels, being
defined by a spectral mixture which prevents the classifier to perform accurately.
The validation plot represents not only the difference between the two ASTER
scenes and the performance of the classifications, but also the number of classes, per
validation point, which are supposed to be found within the study area. The
assessment of this has been done so that if a point falls completely outside any of the
predicted classes, a validation unit of 4 is not assigned to it, but rather it receives a 0.
The accuracy assessments of the classifications play a crucial role when it comes to
soil types/properties identification. The results from these procedures have then been
combined with those from the previous analyses in terms of predicting the presence
of the properties of interest (see Figure 2). Accurate soil type identification allows
better determination of the expected soil properties and therefore contributes to the
explanation of any anomalies that may occur during the process.
2.4.4. Soil types identification per sample
The soil types per sampling unit have been extracted not only from the soil maps but
also from the classifications in terms of identifying the performance capabilities of
the techniques. To evaluate the use of ASTER imagery in identifying soil types and
respectively the soil properties discussed earlier (organic matter, moisture content,
electroconductivity, carbonates, calcium, potassium, magnesium, manganese, and
sodium), the classifications and the reference soil maps have been used. This process
involved outlining the study area onto the classification images and the soil maps
followed by comparison of the results for each of the sampling points.
A number of limitations with this technique should be mentioned before any
conclusion is drawn. These are: only 9 soil samples analysed in a laboratory (plus
one duplicate at location A30), the scale of the map used for validation is 1:500000,
while the resolution of ASTER imagery in the VINR is 15m, and 30m in the SWIR.
Here it is important to note that although two soil maps are available, the one with
finer scale was chosen. Furthermore, the accuracy of the soil map has not been
recorded and consequently, this may result in misinterpretation. Therefore, any
points that fall on, or are located near the border between soil types on the map, may
be misclassified. If there is such case, the laboratory results, the results from the
PLSR models as well as the characteristics of the soil types will be the primary factor
in terms of drawing the appropriate conclusion. Another drawback is the cloud
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
23
formation present on the scene from 2005, because of which the accuracy of the
classifications in the region where the clouds are, is poor. The same procedure has
been applied for the rest of the samples, which have not been analysed in the
laboratory, in terms of predicting to what extent the technique can be applied on
unknown locations. If there is mismatch between the classification results and the
soil map, instead of laboratory results, the PLSR results have been used, relating the
spectral response to the soil properties of interest.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
25
3. Results
3.1. Laboratory analysis results
The results from the laboratory analysis are summarised in Table 1, while the
procedures followed are shown in Appendix 4. It is important to note that the
analysis of only 10 out of 32 samples has been completed. The remaining samples
are currently being analysed, and therefore not included in this study.
Table 1: a) Results from laboratory analysis. Electroconductivity (EC) µS/cm, Moisture
Content (MC) %, Organic Matter (OM) %, Carbonate, Calcium (Ca), Potassium (K),
Magnesium (Mg), Manganese (Mn), Sodium (Na)- mg Kg-1; b) Precision at 95% confidence
ID pH EC MC OM Carb Ca K Mg Mn Na
A28 7.9 167 0.89 1.16 6.03 25.3 9.9 4.3 0.08 4.7
A32 7.9 131 0.84 1.02 5.68 11.7 9.1 5.1 0.08 3.5
A30A 7.9 155 0.71 1.35 9.39 23.0 6.3 2.7 0.31 2.5
A30B 7.7 140 0.85 1.63 6.37 36.1 * 5.3 0.31 *
A25 8.0 135 1.19 2.51 5.78 17.6 6.5 3.9 0.00 3.5
A22 7.9 130 0.86 0.89 9.26 21.6 8.5 3.2 0.29 3.1
A16 8.1 125 1.17 2.38 6.58 20.0 7.6 3.6 0.00 5.4
A18 8.1 121 0.82 1.62 6.29 13.6 9.6 5.6 0.00 3.0
A23 8.0 150 0.59 0.89 9.72 19.0 7.9 3.0 0.22 3.8
A15 8.0 126 0.07 2.05 7.42 18.7 6.5 3.4 0.00 2.5
*Not detected due to bottle contamination
pH EC MC OM Carb Ca Mg Mn
Precision% 3.63 14.38 25.38 26.58 54.20 62.69 91.92 0
The precision of the measurements has been estimated using the duplicate sample
collected at location A30, following the formula: Precision (%)= 200*SD/mean (with
95% confidence). It can be seen that the worst precision has been found for
Magnesium (91.92%) and therefore the estimations for this property will be taken
into consideration only if necessary. Except Magnesium, unsatisfying results are
obtained for nearly all properties except pH and electroconductivity (accepted
precision maximum 15%). This does not necessary state that the results are
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
26
unreliable since a detection of external contamination at sample A30B have been
reported and therefore the results from the analysis of this sample should not be
considered as determining. Larger dataset would have overcome this problem.
3.2. Spectral and image analysis
3.2.1. Laboratory spectra and CIE diagram
Table 2: Albedo and soil colour analysis
x y Albedo %
A23 0.42 0.38 35.01
A22 0.43 0.38 29.48
A18 0.43 0.38 32.28
A16 0.43 0.38 30.05
A32 0.43 0.38 30.55
A15 0.42 0.38 29.57
A25 0.43 0.38 30.34
A28 0.42 0.38 31.67
A30A 0.43 0.38 31.6
All ten samples, analysed in the laboratory, have been found to be pale brown
according to the standard Munsell colour chart used in the field. Furthermore, as it
can be seen from Figure A7- Appendix 1.1, the variation between the samples is
minimal. All absorption features expected due to moisture seen in 1000nm, 1416nm,
1919nm and 2217nm are present. Slight absorption is also noted at 841nm, which is
a wavelength associated with presence of Iron. The spectral library has also been
converted into image format in terms of performing albedo and soil colour analysis.
The original idea behind this is to remove the soil albedo from the measurements so
that soil colour variation could be noted. When the results from these analyses,
summarised in table 2, are plotted on the same diagram (see Figure A6), it can be
seen that there is limited variation and indeed the identified colour is the same as the
one observed in the field. The albedo varies from 29.48% (at location A22) to
35.02% (at A23).
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
27
3.2.2. PLSR on laboratory spectral library
The results from the PLS performed on the spectral library derived under laboratory
conditions are summarised in Table A1- Appendix 1.2. The PLSR prediction model
could not provide reliable results for Magnesium (RPD of 1.71), Manganese (RPD
of 1.96) and Carbonates (1.97). The poor prediction for Mg and Mn could be
explained by the lack of spectral significance of these three elements/ properties, as
well as by the limited number of samples analysed in the laboratory. The low
prediction values for the Carbonates can be explained by the fact that these are
usually bound to other elements (Calcium in this case). The best results emerged for
Moisture Content (RPD of 5.05), Organic Matter (RPD of 2.6), Sodium (RPD of
3.86) and Potassium (RPD of 2.49). Here it is important to note that due to the
limited number of samples (nine plus one duplicate at location A30) some of the
correlations as well as quality of the models may not be as high as expected. Using
the calculated regression coefficients for each wavelength region, plots are produced
(see Figure 3) so that the spectrally active constituents can be identified. Looking at
these plots, the most significant to the prediction model wavelength regions can be
seen, according to the peaks and valleys. These are summarised in Table 3, outlining
the regions typical for each one of the examined properties, and compared to the
existing literature.
Table 3: Spectral wavelength regions important for the PLSR prediction
Wavelengths (nm) Property/Variable
Most Significant Other Literature
Electroconductivity 730, 2031 1001, 1439, 2297 7393, 1952
5
Moisture Content 1478, 1901 1005, 1100, 2200 14001, 1900
1
Organic Matter 1415, 1911 949, 1806, 2468 14002, 1900
2
Carbonate 2333 949,1910, 2210 23331,6
Calcium 1768, 2333 1582, 1768, 2030 23334,6
Manganese 1906 1094, 1408, 2201 19006, 2300
6
Magnesium 2331 1424, 1805, 1921 23312
Potassium 1005 1765, 1917, 2286 10034
Sodium 2022 1435, 1571, 2286 19525
References: 1Ben-Dor et al. 2003, 2Palacios-Orueta & Ustin 1998, 3Ben-Dor et al.2002, 4Sholte 2005, 5Farifteh et al. 2006, 6Udelhoven et al. 2003
The spectral library has been resampled to ASTER and has been input into the PLSR
model in terms of achieving results similar to those derived from the sensor. Table
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
28
A2-Appendix 1.2, contains the values used for the assessment of the prediction
models ran on the resampled library. It has been proven to be very good only for
electroconductivity, with highest RPD of 3.79. Good results are obtained for organic
matter, Calcium and Sodium, while in the RPD range between 1.5 and 2 fall
Moisture Content, Potassium, Magnesium and Manganese. The worst results have
been observed for Carbonates, which could be driven by the same factors that affect
this property in the previous PLS analysis. The regression coefficients from these
PLSR analyses were plotted against those from the first set and the results are
displayed on Figure 4. This has been done in terms of outlining peaks and sinks
commonly present on the graphs from the laboratory derived spectra and the
resampled library. The important wavelengths are: moisture content: 2264nm and
2333nm; Carbonates: 1655, and 2165 to 2208; electroconductivity: 805 and
2333nm; organic matter: 2264nm; Manganese: 805 and 2208nm; Calcium: 2208 and
2264nm; Magnesium: 2333nm; Potassium: range between 2165 and 2208nm;
Sodium: 2208 and 2333nm.
3.2.3. Image analysis
3.2.3.1. PLSR for image library
The spectral signatures have been derived per sample, from the ASTER scenes.
These have then been examined in terms of recognising main regions of variation
between the samples as well as common peaks and sinks that coincide with the peaks
and sinks outlined by the previous PLSR analyses (see Figure 5). The PLSR
prediction models have been performed separately for the two images. The results
are summarised in Tables 6 and 7 (Appendix 1.2) for images from 2003 and 2005
respectively. The regression coefficients have been plotted against each other for
identifying the common wavelengths explaining the properties of interest between
the two scenes (Figure 5), and then compared with already identified wavelength
regions from the previous PLS analyses. The model have been proven suitable (very
good) for electroconductivity (RPD of 3.03), while good results obtained for
moisture content, organic matter, and Sodium, result in RPD values greater than 2.
Lower accuracy results are those for Carbonates concentration, Calcium, Potassium,
Manganese and especially Magnesium (RPD of 1.40).
IDE
NT
IFIC
AT
ION
OF
SO
IL P
RO
PE
RT
Y V
AR
IAT
ION
US
ING
SP
EC
TR
AL A
ND
ST
AT
IST
ICA
L A
NA
LY
SIS
29
Figure 3: PLSR coefficients for Moisture Content, Carbonates, Electroconductivity, Organic Matter, Manganese,
Calcium, Magnesium, Potassium, and Sodium derived from analyses on laboratory spectral library
IDE
NT
IFIC
AT
ION
OF
SO
IL P
RO
PE
RT
Y V
AR
IAT
ION
US
ING
SP
EC
TR
AL A
ND
ST
AT
IST
ICA
L A
NA
LY
SIS
30
Figure 4: PLSR coefficients (resampled to ASTER spectral library vs Library derived under laboratory conditions) for Moisture
Content, Carbonates, Electroconductivity, Organic Matter, Manganese, Calcium, Magnesium, Potassium, and Sodium
IDE
NT
IFIC
AT
ION
OF
SO
IL P
RO
PE
RT
Y V
AR
IAT
ION
US
ING
SP
EC
TR
AL A
ND
ST
AT
IST
ICA
L A
NA
LY
SIS
31
Figure 5: PLSR coefficients: ASTER 2003 vs ASTER 2005 for Moisture Content, Carbonates, Electroconductivity,
Organic Matter, Manganese, Calcium, Magnesium, Potassium, and Sodium
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
32
When PLSR was run on the library extracted from the ASTER scene (year 2005), the
best results were produced for Magnesium (RPD of 15.37), Calcium (RPD of 4.29),
electroconductivity (RPD of 3.18) and Carbonates concentration (RPD of 2.81). The
prediction models for Manganese and Sodium had RPD of 2.35 and 2.02
respectively, while unsuitable prediction has been done in terms of identifying
Potassium, Organic matter and Moisture Content (RPD<2.0). Figure 5 displays the
common peaks and valleys derived from the PLSR coefficients for the two scenes.
Common wavelengths are: carbonates –1656nm and 2262nm, electroconductivity-
2209nm, Calcium- 1656nm, valley at 2336nm, Magnesium at 2209nm, Potassium-
1656nm and Sodium- 1656nm and 2262nm. The rest of the properties do not
indicate clear common features and therefore the model with more accurate result
has been considered. For Moisture Content and Organic Matter, the model taken into
account is the one developed for spectra extracted from ASTER scene (year 2003),
where the valley at 2336nm have been most clearly defined for both properties. The
same wavelength has been associated with Manganese concentration, however here
the model from ASTER scene (year 2005) has been considered.
3.2.3.2. Classifications
Minimum noise fraction (MNF) is a modified version of the procedure Principle
Component Analysis. It is used to identify and isolate noise (and bands containing it)
in an image. It results in eigenvalues, where the larger the eigenvalue is, the higher
the variance of the data in the transformed band is. If the value is 1, this indicates
that the noise is left in the transformed band (Ramsey, 2006). Ideally, this procedure
should indicate potential end-members in the images. The plots on Figure A4-
Appendix 1.1 show the eigenvalues for each MNF transformed band, for each of the
two ASTER scenes. It can be seen that the MNF bands containing the most noise are
6, 7, 8 and 9 for both images. ISODATA was performed on both ASTER scenes,
selecting 5-10 initial classes, in terms of identifying main homogeneous areas on the
images. The scene dating from year 2003 resulted in 7 classes while after running
ISODATA on the one from 2005, the classes were 9. This could be explained with
the extensive presence of clouds and cloud shadows in the latter scene. An important
issue to note here is that the ISODATA results do not necessarily represent soil types
but most likely represent landcover. The results are displayed on Figure 6a.
The image dating from 2003 has been classified into 9 main soil classes and 1 water
class/no data, while the image from 2005 has two additional classes: clouds and
cloud shadows. The first set of Maximum Likelihood supervised classification
performed included the first nine ASTER bands, and the results are illustrated on
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
33
Figure 6b. In terms of improving the accuracy of the classification and at the same
time removing the noise from the scenes, Maximum Likelihood classification has
been applied on the first four Minimum Noise Fraction bands (MNF-ASTER). The
results are displayed in Figure 6c. The full reports of the accuracy assessments,
according to the regions of interest and the reference soil map, as well as the
confusion matrices are included in Appendix 5, while summary of the overall
accuracies of the classifications as well as a list of the most successfully classified
soil types are shown in Table 4.
Table 4: Image classifications overall accuracy, kappa coefficients and most
successfully classified soil types.
Image
classification
Overall
accuracy
Kappa
coefficient
Classes
Maximum
Likelihood-
ASTER 2003
88.05% 0.86 Halomorphic soils, Lithosols,
Water
Maximum
Likelihood-
ASTER 2005
71.34% 0.67 Halomorphic soils, Lithosols,
Fluvial soil, Cloud
MNF- ASTER
2003*
76.12% 0.72 Halomorphic soils, Lithosols
MNF- ASTER
2005*
64.96% 0.59 Halomorphic soils, Fluvial
soils, Clouds, Cloud shadows
*MNF ASTER stands for Maximum Likelihood classification applied on MNF bands 1-4
Another procedure for accuracy assessment has been applied according to the rule
images produced prior to the classifications, and the results for the study area have
been plotted (see Table 5). The results from the two ASTER scenes classified using
all ASTER bands and the first four MNF bands have been combined in terms of
estimating the reliability of the classifications with certain level of confidence. In
general, the overall accuracy of the classifications performed on the image from year
2003 is better than this on the 2005 scene. Explanation behind this is the cloud and
cloud shadow classes, which on its own are classified with good accuracy, however,
they cause problems in terms of the performance of other classes.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
34
Figure 6:ASTER classifications: scene 2003 (left), 2005 (right)a)
ISODATA, b) Maximum Likelihood-all bands; c) MNF bands
a
b
c
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
35
Some interesting patterns emerge, when the plots are studied. According to the
previous accuracy assessment, the overall performance of the classification methods
applied on the ASTER scene from year 2005 have proven less successful than those
performed on the scene from 2003 mainly associated with the presence of cloud
cover. Therefore, it is expected the outcome to show decrease in area covered by a
certain soil type as the confidence level increases. Furthermore, it is also assumed
that the classifications performed on the scene from 2003 should result in higher
number if pixels classified correctly, than this estimated from the scene dating from
2005. This, however, is not always the case and therefore each class should be
discussed separately, taking into account the quality and performance of the two
ASTER images, in terms of soil type and respectively properties identification. The
results from the analysis of the rule images are displayed and summarised in Figure 7
(a and b), while the threshold matrix for each probability level and each sampling
point is included in Table 5.
Table 5: Validation matrix for sampling points against soil types from the two
ASTER scenes, 2 classification methods and different probability levels
3.3. Soil types and properties identification per sample
The results from these analyses, for the samples measured under laboratory
conditions are summarised in Table A5-Appendix 1.2, while the extracted from the
image classifications and soil map soil types and consequently properties of the
samples not tested in laboratory are shown in Table A6- Appendix 1.2 and discussed
later on in the report. The accuracy assessment of the procedure is done according to
the classification accuracy per class. Figure A8 displays the spectra derived from the
two ASTER scenes for the two groups of samples, while figures 8 and 9 visualise the
points of extraction as well as the classes assigned to them by the classification
techniques.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
36
Figure 7: Soil types coverage defined by classification techniques at
confidence levels 75%, 85% and 95%
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
37
Figure 8: Soil types according to classifications for sampling points
Figure 9: Soil Types according to classifications for selected Test points
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
39
4. Discussion
4.1. Spectral library analysis
In this part of the discussion, attention has been paid on the soil colour and albedo
estimates. As it has been determined so far, the difference between the analysed
samples in respect to these two soil characteristics is minimal. However, some
correlations are identified, between these two and the rest of the measured properties.
Correlation analyses have been done so that the variation in albedo, derived from the
CIE diagram and the soil colour analysis, is explained. The results from this clearly
showed that the most significant factor in this case is the electroconductivity (R2
0.78). Even though the electroconductivity has not been found to be the main force
behind any differences in soil albedo, the observations can be explained by the high
salts content associated with the soil types, when compared with the soil maps
described earlier. Correlations of 0.42 and 0.47 have been estimated between the
albedo and organic matter and pH respectively. It has been noted by Post et al.
(2000), that this sort of relationship between soil colour (soil reflectance) and
organic matter is obtained when soils are collected from an extensive geographic
area or the soils have a wide range of soil properties. The small study area as well as
the limited number of sampling units, explain the weak correlation, however, it is
important to acknowledge the fact that the number of tested properties is relatively
small and there might be other, significant ones, not considered in the study.
Interestingly, moisture content, the factor identified in the literature (Post et al.,
2000) as one of the most important when it comes to albedo estimates, has not been
found to be correlated or significant (R2 0.27). This can be explained mainly by the
limited presence of moisture in the soil, due to the geographic location of the study
area as well as the season of sampling.
Overall, it can be seen that the soil colour and albedo can play a significant role if
there is wider dataset and greater variation between the estimated values. However,
what was established so far serves as a base for choice of normalisation technique in
the following PLSR analysis. The lack of significant difference between the albedo
estimates determined the removal of this characteristic from further analysis
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
40
4.2. Partial Least Squared Regression models
PLSR modelling has been applied extensively by many (Viscarra Rossel, 2006;
Scholte, 2005, Lee et al., 2003, Udelhoven et al., 2003), in terms of identifying the
significance of some bands/wavelengths in soil properties identification. Diffuse
reflectance has proven itself as a successful screening method to characterise soil
chemical properties (Ben-Dor & Banin, 1995). It has been made clear that the PLSR
prediction model identifies more significant wavelengths than what can be seen from
the soil spectral signatures alone. It is important to note, however, that many
chemical compounds show overlapping bands in the SWIR region. According to
Scholte (2005), the regions between 1115-1150 nm, 1345- 1417 and 1820-1932nm
should be left out of the discussion because they are associated with major water
vapour absorption areas. Furthermore, a great limitation is the fact that soil chemical
properties, as outlined by Udelhoven et al. (2003), are not independent from each
other. Variables, not studied and therefore not identified, play a major role in
outlining correlation, causing indirect effects on the results. Another limitation is the
bound compounds such as Calcium Carbonate (CaCO3). The PLSR model indirectly
estimates Calcium since in most cases the Carbonate group is stronger in terms of
estimating the values for the model, or in other words, the form of the compounds is
crucial. A constraint, which should not be excluded, is the limited number of
laboratory measurements integrated into the model.
Udelhoven et al. (2003) tested the feasibility of the analysis of certain chemical
properties using PLSR and diffuse reflectance spectrometry. They indicated that the
indirect correlations are those that prevent PLSR models from being applied to other
areas where the correlation structure among the soil constituents significantly differs.
In theory, a possible up-scaling requires the consideration of factors such as the
atmospheric transmission path and larger spectrally mixed footprints. Therefore, the
correlations of certain properties with the spectra, found in the present study, have
been compared with those estimated by Udelhoven et al.( 2003), Farifteh et al.
(2006), and Lee et al. (2003), in terms of testing the overall applicability of the
PLSR technique. Conclusions can be drawn only about the correlations regarding the
spectral library derived in the laboratory due to the fact that no PLSR analyses have
been done on the spectra from satellite imagery in the above three studies. A
correlation of 0.8 between the spectra and electroconductivity (EC) has been found
by Farifteh et al. (2006), which concurs with the R2 estimated for EC by the PLSR
model for the soil spectral library produced in the laboratory. Much better estimates
have been found when the library is resampled to ASTER (R2 of 0.92) and the
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
41
ASTER derived spectral library for the same locations (R2 of 0.88). This could be
explained by the fact that due to the poorer spectral resolution of the scene, the
spectral variations at the image level are much smaller (Farifteh et al. 2006).
Furthermore, within an area of one ASTER pixel, soil moisture, soil type and
reflectance of other scene components affect the spectral variation and therefore
result in an error in the prediction model, even though the RPD and RMSE values do
not indicate this. Lee et al. (2003), performed PLSR analyses on Ca, OM, Mg, K and
P in terms of finding more accurate and advance solution to farm land management.
Generally, however, their findings in terms of accuracy of the PLSR models are
limited. The strongest correlation found between spectra and soil properties is for
Calcium (0.88). The rest performed poorly, which the authors explain by the
complex interaction between soil colour and soil properties.
The correlation found between the spectra and moisture content, has been estimated
by all, and this coincides with the reported results from this study for the laboratory
spectral library analyses (R2 of 0.95). Much weaker correlation have been estimated
for the resampled library as well as for the library derived from ASTER (R2 of 0.63
and 0.73 respectively). The study by Udelhoven et al. (2003) looked at the
correlations between Calcium, Organic Carbon, Manganese and Potassium. The
Calcium correlation with the spectra established in their work (R2 of 0.94) has been
found to be the same as what was estimated in the current report; however, this value
is derived from the analyses of the library from the ASTER scenes. When it comes to
the correlation between the spectra from the laboratory analyses and the Calcium
concentration, the R2 estimate is 0.74. There could be many reasons behind this
including the small sample size, relatively different concentrations of Ca found
within the two studies, climate and atmospheric condition differences. Another
disagreement has been recognised in terms of Potassium even though strong
correlation was found only between the property and the spectra from the library
produced from the spectroscopy measurements obtained in the laboratory (R2 of
0.82). Although, Udelhoven et al. (2003) state that lower value than R2
of 0.5 has
been produced from the PLSR analysis for Potassium, which is the case of the
analysis on the resampled to ASTER library and the derived from ASTER scenes
library. Considering the Manganese concentrations, perfect match between the
derived R2 values in the current study from the laboratory spectral library and the
Udelhoven et al. (2003) team, has been found (R2 of 0.71). Lower correlations were
observed between this property and the spectra included in the resampled and
ASTER spectral libraries. Overall, the highest correlations (R2 higher than 0.9)
derived from the laboratory spectra as outlined in earlier chapters, are those of
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
42
Moisture Content and Sodium (found at 1100nm and 1760nm respectively), from the
resampled library is only the Electroconductivity (at 2333nm), and from ASTER
scenes spectra- Magnesium, Electroconductivity and Calcium (at 2209nm for the
first two and 1656nm for Ca). In terms of assessing if these correlations correspond
to the spectral regions typical for the properties, five sampling locations were used
(see Figure A9- Appendix 1.2). These locations were associated with the highest
concentrations estimated in the laboratory for Moisture Content, Electroconductivity,
Sodium, Calcium and Magnesium- A16, A18, A25, A28 and A30.
The specific wavelengths associated with the properties giving highest correlation
are illustrated on the Figure A9. The first issue that emerges is the missing
absorption feature at 1100nm associated with moisture content. It can be seen
slightly on the spectral response from sample A25, which is also the one with the
highest laboratory estimate. This weak absorption band, according to Baumgardner
et al. (1985) corresponds to the absorption bands observed in transmission spectra of
water of a few millimetres thickness. It is assigned in terms of water vibrations, with
additional combination of librations of the water molecule in the 1770 nm band,
which is also not present. The most reasonable explanation behind this as well as the
determination of the strong correlation by the PLSR model is not only the small
variation between all observations, but also the limited presence of moisture content
(highest of 1.19%) for all soil samples. The absorption feature expected to explain
the Sodium concentrations is present and distinctive for the samples with the highest
correlation estimated by the PLSR model. Obviously, if the spectral library is
considered on its own, without combining the results from all PLSR models, there
are other important wavebands associated with Moisture Content, and picked by the
PLSR model including 1450nm and 1950nm.
When the created spectral library is resampled, as mentioned earlier, only the
electroconductivity showed coefficient of determination higher than 0.9. It was
determined by the model that this property corresponds to 2333nm wavelength and
the absorption feature is present for all samples having high levels of it. On the other
hand, interesting observation is the matching wavelength typical for Magnesium and
Electroconductivity (2209nm) identified by the PLSR model performed on the
library extracted from the ASTER scenes. According to Farifteh et al., 2006, the
accurate prediction of electroconductivity could be used to predict Mg containing
salts. The lack of correlation observed from the laboratory analysis (R2 of 0.33),
between the two properties, suggests that there are other factors that affect the
spectra and the prediction model for ASTER imagery. Mainly, however, the
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
43
insufficient number of samples prevents from determining whether the error occurs
at correlation estimation level, or ASTER scene level. In the case of Calcium, the
corresponding waveband, determined by the PLSR model to be 1656nm, has been
explored in terms of identifying the reliability of the prediction. The deepest
absorption feature has been observed for sample A30, even though the location has
been identified with having the second highest Ca concentration. However, as
clarified earlier, the PLSR model takes into account the Carbonate group that acts as
the stronger property influencing the spectra at the same spectral region, despite the
fact that strong correlation was not identified by the model for this group.
The results from the PLSR analysis in this report illustrate how from a large number
of significant for the prediction model wavelengths per property, extracted from the
laboratory spectral library, the spectral regions of importance have been reduced to
maximum of two or in some cases none common wavelengths, when ASTER
spectral library is studied. The results also show that there are many factors that
should be taken into account when soil samples analysed in laboratory are linked to
satellite images. The most important of all is the change in spectral resolution
ultimately resulting in the removal of the weaker absorption features. Furthermore,
the wavelengths at 2208 and 2333nm have been observed to be the most important
for the PLS prediction modelling, including most of the properties (Moisture
Content, Electroconductivity, Organic Matter, Manganese, Calcium, Potassium and
Sodium). The wavebands identified as the most important for the determination of
the soil properties by the PLSR models can be explained by the occurrence of certain
absorption features or general observations that determine the soil reflectance
spectra. These include the decrease of soil reflectance between 400 and 2500nm as
the organic matter content increases, the two very strong absorptions at 1400 and
2200 nm associated with typical kaolinite reflectance with lack of appreciable bound
water, resulting in only a weak band at 1900nm (Baumgardner et al. 1985). The
presence of kaolinite is also associated with a small amount of ferrous ion resulting
in the slight absorption at 1011nm. Furthermore, the Aridisols, generally high in
soluble salts, determine the high average reflectance observed between 520 and
900nm, which is also confirmed by Baumgardner et al. (1985).
A more accurate results could be observed if the area of sampling is classified
according to main geological features and the PLS models are calibrated with these
results instead of with a global model. To do this, however, as shown with this study
and further proven by Udelhoven et al. (2003), not only a larger data set is required
in terms of producing a stable model, but also water accumulation rate as well as
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
44
other properties influencing the measured, should be taken into consideration.
Further work can also look into a way of sampling one pixel (30 by 30 meters in the
SWIR region) and exclude noise caused by properties that are not of interest.
Generally, the PLSR procedure was applied to the spectra in terms of identifying the
most important for the properties prediction and identification wavelength regions.
Furthermore, the results have been used as a link between the laboratory spectra and
the spectra derived from the two ASTER scenes (discussed in the next section). The
purpose if this is to estimate or predict, as accurately as possible, the level of
properties concentration.
4.3. ASTER scenes analysis
4.3.1. Spectral profiles per sample
According to Figure A6, the feature at 2167nm observed at sampling locations A30
and A32 in year 2003 and at all profiles derived from the scene from year 2005,
could be associated with Potassium concentrations. It is interesting to examine the
behaviour of the spectra at 1656nm observed for all location in year 2003 and deep
absorption at the same wavelength found for all locations in year 2005. This
wavelength is also associated with abundance of carbonate groups, which could be
explained by the limestone presence (CaCO3), Calcium and higher moisture content.
Furthermore, it can be seen that variation between locations is mainly found between
2209 and 2336nm for ASTER scene from 2003 and 2167 and 2262nm for ASTER
scene from 2005. It can also be observed that there are three wavelengths common
for both scenes: 660nm, 2209nm and 2262nm. Referring to the PLS analysis these
three features are associated with Calcium and Magnesium content characteristic for
the type of soil in the region. The variation within the above wavelength regions has
been compared with the actual concentrations of the properties, measured in the
laboratory.
The negative correlation observed between Calcium and Magnesium concentrations
(R2 of -0.67), can be also noted looking at the derived spectral profiles. Interestingly,
however, the fluvial soils with Ca/Mg encrusting do not have defined ratio, typical
for all sampling points. This varies from 8.5:1 at A30 associated with Magnesium
deficiency (Department of Natural Resources, 2007) to 2.3:1 at A32. According to
the soil map of the area, the soil samples at locations A32, A18 and A15 are the three
not associated with Ca/Mg encrusting, which could explain the small ratio between
the elements. Some unexpected findings have been related to 2203nm wavelength,
typical for Calcium rich soils. The pattern that should emerge in terms of absorption
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
45
values does not follow the pattern established from the laboratory results. In other
words, the soil samples showing the highest concentrations of Calcium do not appear
on the spectral response as having the strongest absorption on any of the ASTER
scenes. This could be explained by the undefined form of Calcium in the samples. In
terms of Magnesium concentrations, the spectra at 661nm behave as expected, with
some exceptions at location A23 and A30. It is also very important to note that the
characteristics of the soil types, especially when it comes to soil properties, apply to
the soil horizon, while the samples have been collected from the top 10cm in terms
of appropriate comparison with the results from the satellite imagery.
The spectra derived from the ASTER scenes was not only included in the PLSR
analyses, but also it was used as a determining for the soil composition factor. The
observed differences between the laboratory spectra and these measured on the
images are caused by the decrease in spectral resolution (as discussed earlier). The
strong absorption features identified by the PLSR analyses were studied and
implemented later in the soil properties identification per sampling point.
4.3.2. Classifications
Looking at the Table 4 it can be seen that the soil types most successfully classified
by all classification techniques are the Halomorphic soils and the Lithosols. As
mentioned earlier in the report both types of soils are typically very pale brownish
soils rich in soluble salts. The former are typical for regions close to saline water
bodies while the latter are one of the most commonly found soils in Tunisia and most
of the arid and semi-arid regions. Moreover, the region South-West of the study area
is known as the largest salt marsh in Sahara desert- Chott el Djerid. The problem
associated with this is the high reflectance occurring due to the nature of the salt
marsh. On ASTER imagery this region appears completely white causing problems
in identifying soils or any other characteristics of the surrounding areas. A main
limitation of all classifications is the fact that the regions of interest (training and
testing sites) have been selected according to a soil map dating from 1973. Knowing
the high rate of erosion and desertification in the area, as well as the lack of
sufficient field data for validation, the main challenge behind the image classification
description is to identify whether the difference that appears is due to change, image
noise or inaccurate classification. Furthermore, due to the cloud formation over some
parts of the image from year 2005, the information is limited for some of the soil
classes. Nevertheless, similarities in the classifications as well as patterns do emerge.
In terms of the Halomorphic soils, their location and extent corresponds to what is
expected as an output from the classifications. There is a problem occurring due to
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
46
the extensive cloud cover present on the scene from 2005, however this did not affect
the classification of the Halomorphic soils to a great extent. The Lithosols, Lithosolic
brown and the mixed Lithosols/Regosols classes are more problematic, although the
estimated accuracies for these classes are found to be over 87%. Here, the need of
validation field data is essential, since the rate with which the salt marsh close to the
study area increases and varies over the years, driven by many complex climatic
conditions. However, it could be expected that the change appearing between the two
classifications is not entirely due to noise and faults in the second scene, but also due
to actual change. Further contribution to this statement has the fact that the three
classes are similar in nature (all of them belonging to the group of Lithosols) and
therefore more difficult to differentiate. In terms of the class that includes bare rock,
the change could be explained by increase in the soil erosion rate. The fluvial soils,
on the other hand, characterised by their specific location (near rivers) and the
presence of relatively large stones and sediment deposits, are more difficult to
identified with simple classification technique such as Maximum Likelihood, even
when the noise is removed from the scenes. Moreover, the irregular precipitation and
the extensive evaporation result in the formation of gypsum. The fact that the
gypsum is in sandy form can be an indicator for easy wind transportation of particles.
However, the poor quality of the ASTER scene from 2005, especially in the regions
where gypsum is present, prevents from drawing any conclusions about change or
pattern. To identify the regions on the ASTER scenes, which have been misclassified
due to noise, accuracy assessment have been performed comparing the classification
on the nine ASTER bands with the ones done on the MNF bands 1- 4. The
misclassification mainly applies to the bare rock class, which phenomenon could be
explained by possible difference in rock types not identified due the unavailability of
geological maps for the region. The error, discussed earlier, explaining the
classification of the Lithosols, Brown Lithosols and Lithosols/Regosols classes is
illustrated in Figure A10-Appendix 1.2. The least error has been indicated for the
Halomorphic soil type, which was noted after the performance of accuracy
assessment according to the regions of interest earlier in the report.
From the analyses of the Rule images, it can be seen that there are two dominating
classes in the area and these are the ‘Brown Lithosols’ and ‘Fluvial soil’. Maximum
Likelihood classification including all ASTER bands and the same classification on
the first four Minimum Noise Fraction bands, performed on the ASTER scene from
year 2005 resulted in higher number of accurately classified pixels for these
dominant classes, despite the fact that the overall accuracy for this scene is lower.
The classifications performed on the scene from 2003, on the other hand, resulted in
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
47
more pixels classified correctly at 95% level of confidence for the ‘Bare rock’ and
‘Gypsiorthids’ classes (see Figure A1-Appendix 1.2). There are many reasons behind
this, one of them being the equalisation of the image brightness between the two
scenes (by histogram matching) prior to the classifications, which consequently
resulted in removing the finer detail caused by the albedo from the scene from year
2005, and therefore affecting the classes less present at the scene. Another
explanation is the atmospheric conditions differing for both scenes. Since the
FLAASH algorithm that takes into account the time of acquisition as well as
geographic location of the image did not show satisfactory results, factors such as
atmospheric transmittance, combined with the spectral mixing, could cause reduction
of the accuracy and performance of the scenes.
The expected phenomenon related to the reduction of the number of pixels classified
correctly with the increase in confidence level, can be seen for all classes on both
scenes and classifications. A main limitation is the fact that the four soil types
included in this discussion, tend to have similar spectral response due to the issues
emerging from particle size, soil crusting formation, smaller variation in soil colour
and properties, all discussed earlier. Thus, the results mainly show high spectral
mixing creating a problem in terms of accurately assigning the pixel to the
appropriate class. To overcome this, the results from the two scenes and the
classification of the different ASTER and MNF bands were compared and combined.
Three probability levels were selected (75%, 85% and 95% confidence), and the 9
validation points were tested against the pixel probability. The procedure resulted in
the creation of a validation matrix (Table 5), illustrating the assigning of classes to
the pixels containing the points at each probability level. The results are then
compared with the actual concentrations of the properties of interest measured in the
laboratory, and a conclusion is drawn for each point. A great drawback of the
procedure is the limited number of validation points, which prevents from
interpolating the accuracy results beyond the tested pixels.
As it can be expected, at the lowest confidence level of 75%, more pixels are
assigned to the same class on both scenes by the different classifications, than at
higher level. Referring to Table 5, it can be seen that three of the validation pixels
were assigned the correct soil class not only with 75 but also 85% confidence- A15,
A16 and A18: fluvial soil. With 95% confidence, however, A15 is assigned to a
different from these 4 classes, or it is not classified. The classification of the first
four MNF bands of the ASTER image from 2003 did not allocate the pixel
containing A16 to any of the four classes of interest, while as ‘Fluvial soil’ it was
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
48
recognised by the rest of the classification outputs. A18 was classified as ‘Fluvial
soil’ by the two ASTER scenes, but only when Maximum Likelihood algorithm was
applied on all ASTER bands. The results from this analysis in regards to the rest of
the validation samples were accepted. Evidently moving closer to the border
between two soil types, the number of mixed pixels increases. This is especially the
case with the locations A22, A25 and A32, where the mixing of soil properties
causes the problem associated with the classification of these pixels. Therefore, these
are allocated to class ‘possibly correctly classified’ on Table 5. At 95% confidence,
two locations have not been allocated to any of the classes of interest by any of the
classification techniques- A28 and A30. Reasonable explanations for this could be
many including the slightly higher percentage albedo characteristic for the two
locations (as well as for location A25), preventing the classification algorithm to
assign accurately (or at all) these pixels to a soil class typical for the other locations.
The removal of the albedo from the laboratory spectra prior to the PLSR modelling
seemed to have overcome the problem. Consequently removing it from the image
spectra could improve the accuracy of the technique. A combination of the results
from the classifications performed using different ASTER and MNF bands on both
scenes, provides a reasonable validation for area where not much is known, and not
many validation points are tested. To further improve the accuracy assessment from
Rule images, the correctly classified pixels per class from each of the performed
analyses, can be overlaid and only the ones that coincide on both scenes and both
classification techniques, for particular confidence level, should be taken into
account. In this way, the limited number of validation points can be overcome by
comparing spectral responses from correctly classified pixels with the rest of the
pixels.
Interesting discussion emerges from the issue related to band selection and expected
spectral response for the soil types and consequently the soil properties. As already
established, a number of properties have been associated with the same waveband
from the SWIR region of ASTER by the PLSR models interest (MC, EC, Mg, Na- at
2333nm, Carbonates, EC, Mn, K at 2208nm, etc.). Although this is the case, as
discussed earlier, properties such as EC, OM, salts and sand content affect the whole
spectrum by decreasing or increasing the overall spectral reflectance. Therefore, the
selection of specific ASTER bands prior to the classifications in terms of reducing
the noise from outside factors, is a challenge. A further work can emphasise on
certain bands or band ratios typical for only one of the properties and repeat the
procedure accordingly for the rest of the properties of interest. Unfortunately due to
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
49
limited time available for the completion of the current study, this approach could
not be implemented.
The image classifications were performed so that the soil types within the area of
interest are identified. Due to the limited dataset, sufficient information about the soil
properties was not acquired and therefore the determination of the soil types by the
classifications was used as a reference to the expected soil properties. The results
from the classification techniques are referred to in more detailed in the next section
of the discussion, where soil properties identification has been done per sampling
point taking into account the results from the geochemical laboratory analysis, PLSR
models as well as the derived spectra in the laboratory and from the ASTER scenes.
4.4. Soil types identification within the study area
Looking at Table A5-Appendix 1.2, it can be seen that for some points the
classification results coincide with what is displayed on the soil map, while for other
points this is not the case (illustrated on Figure 8). To identify the correct soil type
of the samples the properties organic matter (OM), electroconductivity (EC) and
Calcium (Ca), Magnesium (Mg), Sodium (Na) and Potassium (K) concentrations are
considered. In this respect, literature has been consulted in terms of identifying the
characteristics of the soil types falling within the study area according to the
classifications as well as the reference soil map-Ministry of Agriculture, 1973. The
same procedure is applied on the 25 samples which have not been analysed in the
laboratory. It is important to note that all the arguments and assumptions used in this
section apply only to the soil surface (0-10cm depth). The characteristics of the soil
profile related to different soil types, have not been studied and therefore may result
in misinterpretation.
With the CIE analysis and the field observations (see Appendix 2) it was shown that
the soil colour at all of these locations is the same as the 9 previously tested.
Considering the small study area and the small variation between the tested samples,
it can be assumed that the additional set have similar spectra and properties and
therefore the PLSR models could be used in terms of interpreting the spectra derived
from the images, if required.
The findings show that the large sediment particles, distinctive for fluvial soils, have
a strong influence on the spectral response. This, therefore, explains the fact that at
locations A16 and A28, the soil type (Fluvial soil) has been allocated most
accurately by all classifications. Moreover, locations A16, as well as A22, are a clear
illustration of the fact that clear borders between soil types do not exist, and the scale
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
50
of the reference soil map does influence the accuracy of the results. The grain size
can cause confusion not only in the case of fluvial soils identification. A good
example for this is location A25, where the classifications of the image from year
2005 recognise the soil type as fluvial soil (match with soil map), while the scenes
from 2003 indicate bare rock. These could not only be related to the limited quality
of the image from year 2005, but also to the stage of formation of the Calci-
Magnesium crust, typical for the region. To identify the main reason behind this,
laboratory data for the actual concentrations of Calcium and Magnesium, from both
years, could have played an important role. Furthermore, it is generally known that
the region is rich on limestone (CaCO3); however, more detailed investigation of the
geology of the region is useful in terms of identifying the properties of the bare
rocks.
When the laboratory analyses are taken into account, the values for Calcium and
Magnesium observed at this sampling point are almost identical to those observed at
A16 (fluvial soil with Ca/Mg encrusting). The spectral curve also follows the same
pattern with difference of 0.01 in reflectance (SWIR region). Looking at the spectra
derived under laboratory conditions (Figure A7), a feature can be observed at
2215nm, which does not appear for any of the other sampling locations. This
wavelength is associated with moisture content according to the PLSR analysis and
the finding corresponds to the laboratory measurements (A16 has the highest
percentage moisture content). Interestingly, referring to the spectra derived from the
ASTER scenes (see figure A8), location A16 is not characterised by a feature at
661nm, which, according to the PLSR analyses, should indicate small Magnesium
concentrations. This could indicate that due to the specific location of this point (on
the border between fluvial soils with Ca/Mg encrusting and Brown lithosols), mixing
of some properties may have occurred. The same problem arises at location A32,
however here the ASTER scene from year 2005 also indicates different class- bare
rock. An explanation behind this is the fact that due to erosion and relatively thin
layer of soil, larger sediment material may have been present. Not as clear is the fact
that the classification performed on the first four MNF bands launches different from
the reference map results. Both scenes identify the soil class as brown lithosols,
while the map indicates fluvial soils with Calcium-Magnesium encrusting. Here,
comparison between the laboratory results at location A16 (fluvial soil), A22 (brown
lithosols) and A32 has been done. A great difference is observed between the values
estimated for Calcium and Magnesium between A32 and the other two reference
locations, most probably due to the mixed nature of soils at location A22, as
explained earlier. Similarities have been observed between A22 and A32 in terms of
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
51
electroconductivity, moisture content, organic matter, potassium and sodium
determined in the laboratory. Moreover, the spectra derived from the ASTER scenes
are for the two locations, are characterised by much higher overall reflectance
values. The reason behind this could be higher sand content or presence of different
salts, not included in the present study. Therefore, it could be concluded that the
difference between the predicted by the MNF classifications and the reference soil
map occurs due to the scale of the map, inaccuracy of the map, problems with
georeferencing or undefined soil constituents. Referring to the probability levels
associated with the classifications accuracies, with 95% confidence only location
A16 falls into ‘probably’ classified correctly group, while A18 and A32 are the
‘possibly’ classified correctly locations. Therefore, a larger dataset is crucial in terms
of accurate prediction of the soil types and consequently soil properties associated
with them.
Nevertheless, the possibility of observing landcover change between the two years is
not excluded, considering the high erosion rates as well as the lack of validation data.
At location A23 bare rock is identified by ASTER scene from 2003, while image
from 2005 recognises the soil type as Brown Lithosols. The problem here, however,
is mainly the fact that the soil reference map shows fluvial soil as characteristic for
the location. To find out whether fluvial soils can be mistaken for bare rock on
ASTER imagery, or this misclassification is simply a result of a limitation of the
technique; the properties estimated in the laboratory, as well as the results from the
PLSR analyses and the derived spectra have been compared for the two locations
recognised as fluvial soils on the soil map, with location A16 correctly classified on
all scenes. Limited difference, mainly associated with brightness, has been observed
in the SWIR region of the spectra, for sample locations A16 and A25. In contrast,
location A23 is defined not only by the highest reflectance values, but also by much
more defined feature at 2262nm related to Calcium carbonate presence (PLSR
analyses section 3.2.3.1.). Furthermore, great similarities can be observed between
the estimated laboratory concentrations at location A25 and those at A16 in terms of
electroconductivity, moisture content, organic matter, carbonates, Calcium,
Magnesium and Manganese, while between A16 and A23 resemblance occurs only
for Calcium, Potassium and Magnesium (refer to table 1, section 3.1). Therefore it
could be concluded that the soil types at A16 and A25 are more likely to be the
same, but to defer from the one at location A23. It is possible, however, that the
presence of Calci-Magnesium encrusting has been very strong in year 2003, however
no validation data is available from this year. Moreover, the possibility of an error on
soil map level is also considered.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
52
Looking at locations characterised by Brown Lithosols soil type and considering that
this soils are mainly found at steep slopes areas with risk of severe erosion (FAO,
2006), the effect from elevation difference should be considered. Referring to the
digital elevation model extracted from the ASTER scenes, interesting observations
can be seen for locations A22 and A23. The uncertainty that is noted in terms of
identifying the soil type from the ASTER scene (year 2003), by supervised
classification on ASTER bands 1-9, can be related to the shadow due to elevation.
Before the noise is removed with Minimum Noise Fraction (MNF), the soil is
recognised as Gypsiorthids, characterised by high gypsum content and appearing
dark on satellite imagery. PLSR and laboratory analyses of gypsum could overcome
the problem. If this is not considered, the soil type at location A22 is classified
correctly according to the soil map.
Before moving on to the testing sites not measured in the laboratory, selected in
terms of assessing performance and applicability of the technique, it is important to
refer to the probability levels associated with the correctly classified pixels. Taking
into account the small number of validation points, as well as the relatively small
amount of correctly classified pixels with 95% confidence, the properties prediction
done on the samples not tested in the laboratory is only hypothetical and cannot be
considered statistically accurate. Soil material collection and laboratory analysis
should be performed prior to the study in terms of drawing statistically significant
conclusions. Figure 9 illustrates the position of the test locations and the soil types
corresponding to each of them, while Table A6 includes the identified from the
classifications and the soil map soil types. It can be seen that for some of the samples
the extensive cloud cover does not allow identification of the soil type from the
ASTER scene (2005). These include B1, B2, B3, B5 and B6. The classification
methods however show similarities between the soil type shown on the soil map and
the one identified by the images, considering that the class “recent soil” represents
soil carried by the river flow included in the class ‘Fluvial soil’. Due to the fact
stated earlier, that the ratio between the Calcium and Magnesium is not stable, no
conclusion can be drawn. Correctly classified, according to the soil map, are also
locations B7, B8, B20, B24 and B38.
On the other hand, the areas classified as “bare rock” have one of the lowest
accuracies after the classifications are performed. As discussed earlier, this is either
due to the larger size deposits or thick Calcium/Magnesium crust, which can explain
the misclassification at locations B9, B21, B27, B37 and B39. It was made clear
from the PLSR models (section 4.2) that most of the variation between the properties
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
53
of interest was noted in the SWIR region and therefore this part of the spectrum was
studied from the extracted spectra in more detail (see Figure A8). The presence of
Carbonates (absorption feature at 2165nm) can be noted from the spectral response
curves at all of the testing locations, and therefore did not play a major role in the
identification process of the soil types. Looking at the spectra derived from ASTER
scene (year 2003), location B29 has been characterised by the highest reflectance
values, while B19 with the lowest. In contrast, the highest brightness values on
ASTER scene from 2005 are associated with location B21, while the lowest with
location B37. Even though the locations differ between the two scenes, the highest
reflectance is associated with points falling within Fluvial soils, while the lowest with
points characterised by Brown Lithosols, on both images. Furthermore, both B29 (on
ASTER 2003) and B21 (ASTER 2005), have different from the rest spectral
behaviour at 2209nm. This wavelength is related to higher salt content and
electroconductivity (according to the PLSR analyses). Another general observation
in regards to spectra is noted at locations B19 (ASTER 2003) and B21 and B26
(ASTER 2006). These three locations have the same behaviour as location A16,
lacking absorption at 661nm on the spectra derived from the ASTER scenes,
defining lower Magnesium content according to the PLSR analyses on the laboratory
spectra.
In addition locations B31 to B39 have fallen on the border between two soil types. In
this respect, the allocation of soil type should be determined not only by the
classification techniques and accuracy but also by the actual concentrations of the
properties. These two locations are characterised by defined peak at 2264nm, which
corresponds to Calcium concentration and very slight absorption feature at 2333nm,
which defines the Magnesium presence. Therefore it could be stated that there is a
presence of Calcium- Magnesium encrusting and the classifications performed on the
scene from 2003 (for location B31) and those on the ASTER scene from 2005 (for
location B39) have identified the class correctly (see table A6).
Similar situation occurs at locations B21, B26 and B27, where the absorption
features associated with Calcium and Magnesium are reported and the correct soil
class has been identified as Fluvial Soil. The bare rock identified by some of the
classifications (see table A6- Appendix 8.1.2) in these cases could be again due to
larger size stones or sediment deposits. Location B34, on the other hand, has been
classified by the Maximum Likelihood on the first nine ASTER bands as well as
MNF classification of the image from year 2003 as “bare rock”, by the 9 ASTER
band classification on the scene from year 2005 as Brown Lithosols, by MNF
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
54
classification of the image from 2005 as Fluvial soil while the map indicates border
between Fluvial and Brown Lithosolic soil. Referring to the spectral response,
however, it is noted that the signature matches perfectly the one at B39 on the image
from 2003, and differs greatly on the image from 2005. The reasons behind this can
be many. The most obvious one is the specific location of both sampling locations on
the border between Fluvial soil and Brown Lithosols. Furthermore, the difference in
the spectra is noted greatly at 807nm, associated with electroconductivity and
magnesium concentration, which consequently may indicate change in soil
composition over the two year period.
According to the reference soil map, sample B35 has been classified by all
classification techniques as “Bare Rock” instead of fluvial soil (as indicated on the
soil amp). The spectral curve indicates deep absorption at 2264nm related to high
Calcium concentration and considering the relatively high Carbonate concentration
defined earlier for all locations from the spectra, it could be assumed that the pixel
containing the location could be comprised of bare rock (probably limestone) or
thick Calcium crusting. To improve the prediction at this, as well as all the other
locations, more detailed geological investigation should be carried out.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
55
5. Conclusions
Overall, ten soil properties were identified as important in relation to the main soil
types typical for the location of the study area- in South-East Tunisia. These include
electroconductivity (EC), Sodium (Na), Calcium (Ca), Carbonates, and Magnesium
(Mg), distinctive for the dominant soil group- Aridisols. In terms of the Entisols, Iron
(Fe), Potassium (K) and organic matter (OM) were studied. In addition, the moisture
content and pH of the soil were selected due to their strong influence on the soil
reflectance in general. Applying the standard techniques for analysis in the
laboratory, the concentrations of most of these properties were estimated as trace,
and for Fe these were even lower than the detection limit of the analytical apparatus,
for nine (plus one duplicate) samples.
Although a limited amount of material was available for laboratory analysis, the
concentration estimation, together with spectral information obtained under
laboratory conditions, was enough for the development of Partial Least Squared
Regression (PLSR) prediction models. The albedo was estimated (by CIE
chromaticity diagram) and since small variation was observed, it has been removed
from the PLS analysis. The ratio of prediction to deviation (RPD), in combination
with the coefficient of determination (R2), was used for the assessment of the
prediction model. These have indicated satisfying results for MC, OM, K and Na. A
number of significant wavelengths for the prediction model were also identified by
plotting the regression coefficients produced by the PLSR and examining the highest
peaks and the deepest sinks on the plot, most of them related to the shortwave
infrared region (2000-2500nm).
Two ASTER scenes (year 2003 and 2005) were pre-processed applying Log
Residual Atmospheric correction, Histogram matching and Minimum Noise
Fraction. ISODATA unsupervised classification was performed in terms of
identifying homogeneous areas and selecting appropriate regions of interest in
combination with soil maps from year 1975 (1:500,000). Maximum Likelihood
algorithm was chosen for the supervised classification. Generally, the classification
techniques applied on the ASTER scene from year 2003 showed better overall
accuracies than those applied on the scene from 2005, applying three different
accuracy assessment techniques: according to regions of interest, according to the
reference soil map and exploring the Rule images created in addition to the
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
56
classifications. It has been established that the selection and separability of the
regions of interest are crucial for achieving accurate results.
To overcome the gap between the laboratory measurements and the information
derived from the ASTER scenes, the spectral library obtained under laboratory
conditions was resampled to ASTER, so that the spectral resolution is reduced. This
was input into the PLSR model. The libraries derived from the scenes were also
tested against the measured properties. The result indicated stronger relationships
between the properties and the spectra as well as reduction in the number of
important wavelengths associated with the properties of interest.
PLSR coefficients were plotted against each other resulting in good predictions and
strong relationships established between spectra and electroconductivity, Calcium
and Magnesium (R2>0.9 and RPD>3.18). The important wavelengths for most of the
properties have been recognised in the shortwave infrared region of the spectrum,
and the absorption features corresponding to these wavelengths have been studied, in
terms of explaining the bands that overlap for different properties.
The link between the classification techniques and the PLSR analyses has been
established by determining the soil types, and the related soil properties assigned to
the sampling locations. The emerging problems are mainly related to the old soil map
used as reference; the scale of 1:500,000 compared to the 30m spatial resolution of
ASTER (in the SWIR region); the similarity between the soil types and general lack
of validation and analytical data. These prevented the study from drawing significant
conclusions. With 95% confidence, only small amount of pixels that belong to the
main soil types present in the study area can be considered as correctly classified,
resulting in one out of ten samples ‘probably’ assigned accurately. With 85%
confidence, the number of samples classified correctly is three. The applicability
and performance of the method have been tested with additional 25 sampling points.
The use of ASTER imagery in soil properties variation detection has indeed a great
potential related to the many applications such study can have in agriculture
management, soil erosion monitoring, soil salinisation, soil pollution detection, etc..
Many drawbacks, mainly associated with the quality of the available data, limit the
scope of this study, however the findings suggest that more detailed study
implementing this analytical methodology could greatly widen the prospective of
using readily available multispectral remotely sensed data.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
57
6. Recommendations
• Further study should have a larger dataset available for analysis and
validation
• More recent soil maps as well as geological maps should be made available
• A larger and more heterogeneous study area or a combination of a number
of homogeneous but different in terms of soil types and properties soil
clusters should be selected, so that the class separability computation could
launch useful results.
• The image classifications should be performed according to soil properties
rather than soil types.
• Future study can also emphasise on the potential of PLSR coefficients in
terms of gaining information about actual concentrations of certain
elements
• More accurate results could be obtained if terrain parameters such as digital
elevation model, slope and aspect are included in the analysis of the
ASTER scenes.
• The potential of different band combinations associated with the properties
of interest could be studied in terms of improving the accuracy of the
prediction.
• The possibilities of better accuracy assessment related to Rule images
analyses are many. Further study could emphasise on analyses related to
overlapping of correctly classified pixels derived from a number of different
classifications.
• Field spectrometer measurements, areal photographs and hyperspectral
imagery could be applied in terms of minimising the gap between the fine
spectral resolution of the laboratory measurements and the coarse resolution
of the ASTER images.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
59
7. References
• ACSAD Service of Mapping, 1986, Soil Map of Tunisia- 1:500,000, Soil
Science Division, Tunis
• Apan, A., Kelly, R., Jensen, T., Butler, D., Strong, W. & Basnet, B., 2002,
Spectral Discrimination and Seperability Analysis of Agricultural Crops and
Soil Attributes Using ASTER Imagery, 11th
Australian Remote Sensing and
Photogrammetry Association Conference, ISBN 0-9581366-0-2.397.
• Barnes, E., Sudduth, K., Hummel, J., Lesch, S., Corwin, D., Yang, C.,
Daughtry, C. & Bausch, W., 2003, Remote- and Ground-based Sensor
Technologies to Map Soil Properties, Photogrammetric Engineering and
Remote Sensing, 69(6): 619-630
• Baumgardner, M., Silva, L., Biehl, L. & Stoner, E., 1985, Reflectance
Properties of Soil, Advances in Agronomy, 38:1-44
• Ben-Dor, E. & Banin, A., 1995, Near Infrared analysis as a rapid method to
simultaneously evaluate several soil properties, American Journal of Soil
Science Society,159: 259-269
• Ben-Dor, E., Patkin, K., Banin, A. & Karnieli, A., 2002, Mapping of Several
Soil Properties Using DAIS-7915 Hyperspectral Scanner Data- A Case Study
Over Clayey Soils in Israel, International Journal of Remote Sensing, 26(6):
1043-1062
• Ben-Dor, E., Goldlshleger, N., Benyamini, Y., Agassi, M. & Blumberg, D.,
2003, The Spectral Reflectance Properties of Soil Structural Crust in the 1.2-2.5
µm Spectral Region, Soil Science Society American Journal, 67:289-294
• Brown, A. & Feringa, W., 2003, Colour Basics for GIS Users, Pearson
Education Limited, England
• California Institute of Technology, 2006, Advanced Spaceborne Thermal
Emission and Reflection Radiometer: Instrument Subsystems,
http://asterweb.jpl.nasa.gov/instrument.asp, [accessed 04/12/2006]
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
60
• Chabrillat, S., Goetz, A., Krosley, L. & Olsen, H., 2002, Use of hyperspectral
images in the identification and mapping of expansive clay soils and the role of
spatial resolution, Remote sensing of Environment, 82: 431-445
• Chang, D. & Islam, S., 2000, Estimation of Soil Physical Properties Using
Remote Sensing and Artificial Neural Network, Remote Sensing of
Environment, 74: 534-544
• Davies, A., 2001, Uncertainty testing in PLS regression, Spectroscopy Europe,
Norwich Near Infrared Consultancy
• De Jong, S. & Epema, G., 2001, Imaging Spectrometry for Surveying and
Modelling land degradation, In: Van der Meer, F. & de Jong, S., 2001,
Imaging Spectroscopy: Basic Principles and Prospective Applications, Kluwer
Academic Publishers, the Netherlands
• Department of Natural Resources, NSW Government 2007, Soil Survey:
Explanation of Exchangeable Cation Analysis and Interpretation,
http://naturalresources.nsw.gov.au/soils/index.shtml, [accessed 24/01/2007]
• ENVI 4.1 Tutorial- HELP, 2007
• Farifteh, J., Van der Meer, F., Atzberg, C. & Carranza, E., 2006, Quantitative
analysis of salt-affected soil reflectance spectra: A comparison of two adaptive
methods- PLSR & ANN, Remote Sensing of Environment, [on review].
• Gomez, C., Delacourt, C., Allemand, P., Ledru, P. & Wackerle, R., 2005, Using
ASTER Remote Sensing Data Set for Geological Mapping in Namibia, Physics
and Chemistry of the Earth, 30: 97- 108
• Klein, C., 2006, Unsupervised Classification: Iterative Self-Organising Data
Analysis Technique,
http://geog.tamu.edu/klein/geog661/lecture_notes/lecture22/lecture22.pdf,
[accessed 10/01/2007]
• Lee, W., Sanchez, J.F., Mylavarapu, R.S. & Choe, J., 2003, Estimating
Chemical Properties of Florida Soils Using Spectral Response, American
Society of Agricultural Engineers, 46(5); 1443-1453
• Lopez-Granados, F., Jurado-Exposito, M., Pena-Barragan, J. & Garcia-Torres,
L., 2005, Using Geostatistical and Remote Sensing Approaches for Mapping
Soil Properties, European Journal of Agronomy, 23: 279-289
• Mainam, F., 1999, Modelling Soil Erodibility in the Semiarid Zone of
Cameroon, PHD Thesis, Ghent University, 90-6164-179-9
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
61
• Ministry of Agriculture, 1973, Soil Map of Tunisia- 1:1,000,000, Soil Science
Division, Tunis
• Palacios- Orueta, A. & Ustin, S., 1998, Remote Sensing of Soil Properties in
the Santa Monica Mountains I, Remote Sensing the Environment, 65: 170-183
• Palacios- Orueta, A., Pinzon, J., Ustin, S. & Roberts, D., 1999, Remote Sensing
of Soils in the Santa Monica Mountains: II. Hierarchical Foreground and
Background Analysis, Remote Sensing the Environment, 68: 138-151
• Post, D., Fimbres, A., Matthias, A., Sano, E., Accioly, L., Batchily, A. &
Ferreira, L., 2000, Predicting Soil Albedo from Soil Colour and Spectral
Reflectance Data, Soil Science Society American Journal, 4: 1027-1034
• Ramsey, 2006, Advanced Image Processing Techniques, Applied Remote
Sensing and GPS Techniques, GEO2460,
http://ivis.eps.pitt.edu/courses/ars/lectures/8.pdf, [accessed 10/01/2007]
• Savitzky, A. & Golay, M., 1964, Smoothing and Differentiation of Data by
Simplified Least Squares Procedures, Analytical Chemistry, 36: 1627-1639
• Scholte, K., 2005, Hyperspectral Remote Sensing and Mud Volcanism in
Azerbaijan, PhD Thesis, ITC, The Netherlands
• Shepherd, K. & Walsh, M., 2002, Division S-8-Nutrient Management and Soil
and Plant analysis: Development of Reflectance Spectral Libraries for
Characterisation of Soil Properties, Soil Science Society American Journal, 66:
988-998
• Soil Survey Staff, 1999, Soil Taxonomy: A Basic System of Soil Classification
for Making and Interpreting Soil Surveys, United States Department of
Agriculture, Washington DC, 20402
• Udelhoven, T., Emmerling, C. & Jarmer, T., 2003, Quantitative analysis of soil
chemical properties with diffuse reflectance spectrometry and partial least-
square regression: A feasibility study, Plant and soil, 251: 319-329
• USGS, 2006, Sensors- ASTER, http://eo1.usgs.gov/ASTER.php , [accessed
12/7/06]
• Van der Meer, F., de Jong & Bakker, W., 2001, Imaging Spectrometry: Basic
Analytical Techniques, In: Van der Meer, F. & de Jong, S., 2001, Imaging
Spectroscopy: Basic Principles and Prospective Applications, Kluwer
Academic Publishers, the Netherlands
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
62
• Van der Werff, H. 2006, Knowledge-based remote sensing of complex objects:
Recognition of spectral and spatial patterns resulting from natural
hydrocarbon seepages, PHD Thesis, ITC, Netherlands
• Van Reeuwijk, L.P., 2002, Procedures for soil analysis, International Soil
Reference and Information Centre- ISRIC, Technical paper 9, Sixth Edition,
USA
• Viscarra Rossel, R.A. 2005. ParLeS version 2.1a. Software for Chemometric
Analysis of Spectroscopic Data.
http://www.usyd.edu.au/su/agric/acpa/people/rvrossel/soft01.htm, [accessed
11/2006]
• Viscarra Rossel, R.A., Walvoort, D.J, McBratney, A.B, Janik, L.J. &
Skiemstad, J.O. 2006, Visible, Near infrared or combined diffuse reflectance
spectroscopy for simultaneous assessment of various soil properties, Geoderma
131:59-75
• Williams, P. 2001, Implementation of Near Infrared technology, In: Williams,
P. & Norris, K., 2001, Near Infrared technology in the agricultural and food
industries, American Association of Cereal Chemists, Minnesota
• Wold, S., Sjostrom, M. & Eriksson, L., 2001, PLS-regression: a basic tool of
chemometrics, Chemometrics and Intelligent Laboratory Systems, 58: 109-130
• Zhou, L., Dickenson, R. & Tian, J., 2005, Derivation of a soil albedo dataset
from MODIS using principle component analysis: Northern Africa and The
Arabian Peninsula, Geophysical research letters, 32, L21407
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
63
8. Appendices
8.1. Appendix 1: Selected figures and tables
8.1.1. Figures
Figure A1:a) Soil map 1:500000- ASTER coverage; b) Soil map 1:1000000- ASTER
coverage; c) Soil map 1:500000- Study area; d) Soil map 1:1000000- Study area
Sources of maps: Ministry of Agriculture, 1973; ACSAD, 1986
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
64
Figure A2: Flowchart of PLSR data arrangement procedure (Wold et al., 2001).
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
65
Figure A3: Distribution of sampling points
Figure A4: Eigen values for MNF transformed bands: a) ASTER image 2003; b)
ASTER image 2005
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
66
Figure A6: CIE diagram and soil samples colour distribution showing no difference
in soil colour
Figure A5: a) CIE tristimulus
curves (Van der Werff, 2006);
b) CIE chromaticity diagram
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
67
Figure A8: Spectral Profile extraction from ASTER at each sampling location:
a)Spectral library for sampling points; b) Spectral library for test point
Figure A7: a) spectra of samples A22
and A23-showing highest brightness
variation; b) Duplicate samples at
location A30-no variation; c) spectral
library of all samples displaying limited
variation between the samples
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
69
Figure A9: Analysis of the three spectral libraries (continuum removed) for selected
sampling points.
Figure A10: Misclassified pixels associated with Lithosols [green], Brown Lithosols
[blue] and Lithosols/Regosols [red]; a) ASTER 2003; b) ASTER 2005
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
70
8.1.2. Tables
Table A1: PLSR prediction model for Electroconductivity (EC), Moisture Content
(MC), Organic Matter (OM), Carbonate, Calcium (Ca), Potassium (K), Magnesium
(Mg), Manganese (Mn), Sodium (Na), using ASD FIELDSPEC spectrometer
measurements under laboratory conditions. SD (standard deviation) RMSE (root
mean square error), RPD (relative performance determinant).
Table A2: PLSR prediction model for Electroconductivity (EC), Moisture Content
(MC), Organic Matter (OM), Carbonate, Calcium (Ca), Potassium (K), Magnesium
(Mg), Manganese (Mn), Sodium (Na), using ASD FIELDSPEC spectrometer
measurements under laboratory conditions. Resampled to ASTER library. SD
(standard deviation) RMSE (root mean square error), RPD (relative performance
determinant)
Element Factors Variance SD R² RMSE RPD*
EC 5 193.35 13.91 0.80 6.64 2.38
MC 5 0.1 0.32 0.95 0.06 5.05
OM 5 0.4 0.63 0.83 0.27 2.60
Carbonate 5 1.6 1.27 0.71 0.79 1.97
Ca 5 13.74 3.71 0.74 2.05 2.09
K 5 1.62 1.27 0.82 0.55 2.49
Mg 5 0.59 0.77 0.62 0.57 1.71
Mn 5 0.01 0.11 0.71 0.07 1.96
Na 5 0.86 0.93 0.93 0.25 3.86
Element Factors Variance SD R2
RMSE RPD*
EC 5 226.11 15.04 0.92 4.17 3.79
MC 5 0.07 0.27 0.63 0.19 1.74
OM 5 0.35 0.59 0.73 0.34 2.04
Carbonate 5 0.87 0.93 0.41 1.13 1.37
Ca 5 14.26 3.78 0.77 1.94 2.22
K 5 1.17 1.08 0.49 0.93 1.48
Mg 5 0.23 0.48 0.16 0.84 1.51
Mn 5 0.01 0.1 0.56 0.08 1.6
Na 5 0.76 0.87 0.75 0.46 2.12
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
71
Table A3: PLSR prediction model for Electroconductivity (EC), Moisture Content
(MC), Organic Matter (OM), Carbonate, Calcium (Ca), Potassium (K), Magnesium
(Mg), Manganese (Mn), Sodium (Na): Spectral library derived from ASTER image
2003. SD (standard deviation) RMSE (root mean square error), RPD (relative
performance determinant)
*RPD classification for indicating the model prediction abilities (<1.5v bad; 1.5-2 bad; 2-2.5 good;
>2.5v good)
Table A4: PLSR prediction model for Electroconductivity (EC), Moisture Content
(MC), Organic Matter (OM), Carbonate, Calcium (Ca), Potassium (K), Magnesium
(Mg), Manganese (Mn), Sodium (Na): Spectral library derived from ASTER image
2005. SD (standard deviation) RMSE (root mean square error), RPD (relative
performance determinant)
Element Factors Variance SD R2
RMSE RPD*
EC 5 238.75 15.15 0.88 5.22 3.03
MC 5 0.08 0.29 0.73 0.16 2.05
OM 5 0.38 0.61 0.80 0.29 2.35
Carbonate 5 1.63 1.28 0.66 0.85 1.82
Ca 5 13.79 3.71 0.68 2.30 1.87
K 5 1.00 1.00 0.59 0.84 1.65
Mg 5 0.37 0.61 0.43 0.69 1.40
Mn 5 0.01 0.11 0.69 0.07 1.90
Na 5 0.75 0.87 0.74 0.47 2.07
Element Factors Variance SD R2
RMSE RPD*
EC 5 203.82 14.28 0.89 4.97 3.18
MC 5 0.05 0.22 0.43 0.24 1.40
OM 5 0.24 0.49 0.46 0.48 1.44
Carbonate 5 1.95 1.40 0.86 0.55 2.81
Ca 5 16.64 4.08 0.94 1.00 4.29
K 5 0.99 0.99 0.66 0.77 1.79
Mg 5 0.94 0.07 0.99 0.06 15.37
Mn 5 0.01 0.11 0.80 0.06 2.35
Na 5 0.68 0.82 0.73 0.48 2.02
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
73
Table A5: Extraction of soil types for each classification per sample point
Point MAXLIK
2003
MAXLIK 2005 MNF ASTER 2003 MNF ASTER 2005 Map scale 1:500000
A15 Fluvial soil Fluvial soil Fluvial soil Brown Lithosols Brown Lithosols
A16 Fluvial soil Fluvial soil Fluvial soil Fluvial soil Brown Lithosols
border with Fluvial
A18 Fluvial soil Fluvial soil Fluvial soil Fluvial soil Brown Lithosols
A22 Gypsiorthids Brown
Lithosols
Lithosols Brown Lithosols Fluvial soil/border
with Brown Lithosols
A23 Bare Rock Brown
Lithosols
Bare Rock Brown Lithosols Fluvial soil/Calci-
Magnesium
encrusting
A25 Bare Rock Fluvial soil Bare Rock Fluvial soil Fluvial soil/Calci-
Magnesium
encrusting
A28 Fluvial soil Fluvial soil Fluvial soil Brown Lithosols Fluvial soil/Calci-
Magnesium
encrusting
A30 Bare Rock Brown
Lithosols
Brown Lithosols Brown Lithosols Fluvial soil/Calci-
Magnesium
encrusting
A32 Gypsiorthids Bare Rock Brown Lithosols Brown Lithosols Fluvial soil/Calci-
Magnesium
encrusting
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
74
Table A6: Extraction of soil types for each classification per test sample point
Point
MAXLIK
2003
MAXLIK
2005 MNF 2003 MNF 2005 Soil Map
B1 Recent soil Cloud Recent soil Cloud Fluvial soil/Ca-Mg
B2 Fluvial Soil Cloud Fluvial Soil Cloud Fluvial soil/Ca-Mg
B3 Recent soil Cloud Recent soil Cloud Fluvial soil/Ca-Mg
B4 Fluvial Soil Fluvial Soil Fluvial Soil Fluvial Soil Fluvial soil/Ca-Mg
B5 Fluvial Soil Cloud Fluvial Soil Cloud Fluvial soil/Ca-Mg
B6 Fluvial Soil Cloud Fluvial Soil Cloud Fluvial soil/Ca-Mg
B7 Fluvial Soil Fluvial Soil Fluvial Soil Fluvial Soil Fluvial soil/Ca-Mg
B8 Fluvial Soil Fluvial Soil Fluvial Soil Fluvial Soil Fluvial soil/Ca-Mg
B9 Fluvial Soil Fluvial Soil Bare Rock Fluvial Soil Fluvial soil/Ca-Mg
B11 Fluvial Soil
Brown
Lithosols Recent soil Brown Lithosols Fluvial soil/Ca-Mg
B17 Bare Rock Bare Rock Bare Rock Bare Rock Brown Lithosols
B19 Fluvial Soil
Brown
Lithosols Fluvial Soil Brown Lithosols Brown Lithosols
B20 Fluvial Soil Fluvial Soil Fluvial Soil Fluvial Soil Fluvial soil/Ca-Mg
B21 Recent soil Bare Rock Recent soil Bare Rock Fluvial soil/Ca-Mg
B24 Fluvial Soil Fluvial Soil Fluvial Soil Fluvial Soil Fluvial soil/Ca-Mg
B26 Bare Rock Fluvial Soil Gypsiorthids Fluvial Soil Fluvial soil/Ca-Mg
B27 Bare Rock Fluvial Soil Bare Rock Fluvial Soil Fluvial soil/Ca-Mg
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
75
B29 Fluvial Soil Bare Rock Recent soil Bare Rock Fluvial soil/Ca
B31 Fluvial Soil
Brown
Lithosols Fluvial Soil Brown Lithosols
Fluvial soil/Ca
B33 Bare Rock
Brown
Lithosols Bare Rock Brown Lithosols
Fluvial border
with Brown Lithosols
B34 Bare Rock
Brown
Lithosols Bare Rock Fluvial Soil
Fluvial border
with Brown Lithosols
B35 Bare Rock Bare Rock Bare Rock Bare Rock
Fluvial border
with Brown Lithosols
B36 Fluvial Soil
Brown
Lithosols Fluvial Soil Brown Lithosols
Fluvial border
with Brown Lithosols
B37 Bare Rock Fluvial Soil Recent soil Bare Rock
Fluvial border
with Brown Lithosols
B38 Fluvial Soil Fluvial Soil Fluvial Soil Fluvial Soil
Fluvial border
with Brown Lithosols
B39 Bare Rock Fluvial Soil Bare Rock Fluvial Soil
Fluvial border
with Brown Lithosols
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
76
8.2. Appendix 2: Field work report
Sample
ID Date X Y Grain size pH
Soil
colour
Visible
contamination/Other
comments
Projectio
n
GPS
accuracy
A15 28/09/2006 600891 3724263
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, some dry
small grass, dry lumps WGS 84 6m
A16 28/09/2006 600550 3723646
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, some
stones WGS 84 7m
A18 28/09/2006 601003 3723984
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, some
stones, dry grass WGS 84 6m
A22 28/09/2006 600483 3723119
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, some
stones, small bushes,
stones WGS 84 6m
A23 28/09/2006 599203 3722903
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture (olive
trees), lumps of dry
soil WGS 84 8m
A25 28/09/2006 599386 3722868
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Dry small bushes,
lumps of dry soil and
stones WGS 84 7m
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
77
A28 28/09/2006 600121 3722139
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, lumps of
dry soil, trees close by WGS 84 6m
A30 27/09/2006 598675 3721309
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, some
palm trees near by,
crust WGS 84 7m
A32 27/09/2006 600159 3721074
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Some dry vegetation,
stones and crust WGS 84 6m
B1 26/09/2006 601576 3728960
Fine:
<2mm
NOT IN
FIELD
VERY
PALE
BROWN
Small bushes, dry
crust, no stones WGS 84 6m
B2 26/09/2006 601558 3728815
Fine:
<2mm
NOT IN
FIELD
VERY
PALE
BROWN
Small bushes, dry
crust, stones are
present WGS 84 6m
B3 26/09/2006 601916 3728174
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, some
stones and dry lumps
of soil WGS 84 6m
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
78
B4 26/09/2006 602259 3727279
Fine:
<2mm,
stones
NOT IN
FIELD
VERY
PALE
BROWN
Big stones, dry
bushes, crust WGS 84 10m
B5 26/09/2006 601346 3726698
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Some dry lichen, crust
and big lumps of dry
soil WGS 84 6m
B6 26/09/2006 600969 3726616
Fine:
<2mm
NOT IN
FIELD
VERY
PALE
BROWN
Some dry vegetation,
stones and crust WGS 84 6m
B7 26/09/2006 601473 3726399
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Crust and a lot of dry
vegetation and roots WGS 84 10m
B8 26/09/2006 601574 3726443
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Crust and dry
vegetation WGS 84 10m
B9 26/09/2006 600998 3726168
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, some
crust, small trees and
some stones WGS 84 6m
B11 26/09/2006 600807 3725665
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN Crust, no vegetation WGS 84 6m
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
79
B17 28/09/2006 600622 3723922
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, some
stones, some small
bushes WGS 84 6m
B19 28/09/2006 600684 3723274
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, some
grass and small
bushes WGS 84 7m
B20 28/09/2006 600316 3723169
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Some grass and small
bushes, and trees WGS 84 6m
B24 28/09/2006 599372 3722846
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Dry small bushes,
lumps of dry soil and
stones WGS 84 8m
B26 28/09/2006 599676 3722231
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, dry grass
and trees near by WGS 84 8m
B27 28/09/2006 598970 3722035
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, some
trees and buildings WGS 84 7m
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
80
B29 28/09/2006 598250 3721413
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, some
small bushes, crust,
lumps WGS 84 6m
B31 27/09/2006 599045 3720967
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Lumps and crust,
some green and dry
vegetation, buildings
near by WGS 84 6m
B33 27/09/2006 598385 3720822
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, lumps of
dry soil, stones WGS 84 7m
B34 27/09/2006 598775 3720577
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Some dry and green
vegetation, lumps of
dry soil, land prepared
for agriculture WGS 84 10m
B35 27/09/2006 599343 3720737
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Land prepared for
agriculture, crust and
lumps of dry soil,
some trees around WGS 84 7m
B36 27/09/2006 599814 3720654
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Some dry bushes and
stones, some trees and
buildings near by WGS 84 7m
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
81
B37 27/09/2006 600087 3720676
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Some dry bushes,
trees and buildings
near by, dry lichen WGS 84 6m
B38 27/09/2006 599374 3720059
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN
Some dry bushes,
trees and buildings
near by, dry lichen WGS 84 7m
B39 27/09/2006 598003 3720844
Fine:
<2mm, dry
lumps
NOT IN
FIELD
VERY
PALE
BROWN Dry vegetation WGS 84 6m
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
83
8.3. Appendix 3: Samples photographs
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
84
8.4. Appendix 4: Procedures for soil analysis
Reference: Van Reeuwijk, L.P. (2002)
pH
1. Weigh 20g fine soil into a 100ml polythene wide-mouth type bottle. Include
a blank
2. Add 50ml water and cap the bottle
3. Shake for 2 hours
4. Before opening the bottle for measurement, shake by hand once more
5. Immerse pH electrode in the upper part of suspension
6. Read pH when reading has stabilised
Electroconductivity
1. Add about 30ml standard 0.01M KCl solution to a 50ml beaker and
measure temperature
2. Rinse and fill pipette cell with the standard KCl solution or insert dip cell in
this solution
3. Set temperature compensation deal at measured temperature and adjust
reading of the meter
4. Measure the temperature of the extract and set temperature compensation
dial at this temperature
5. Fill pipette cell with extract or insert dip cell into extract and read
condictivity
Moisture Content
1. Transfer approximately 5g fine material to a tared moisture tin and weigh
2. Dry overnight at 105 degrees
3. Remove tin from the oven, close with a lid, cool in desiccators and weigh
4. Calculate moisture content (water loss)
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
85
Soil Organic Carbon/Organic Matter
1. 1g of soil material is places into a 500ml flask
2. Add 10ml dichloromate solution
3. Add 20ml sulphuric acid, swirl the flask and allow to stand on a pad for 30
minutes
4. Add 200 ml water and 10ml phosphoric acid
5. Add 1ml indicator solution and titrate with ferrous sulphate solution. Stir
until the brown colour turns purple or violet-blue. Slow down the titration
6. At the end point the colour changes to green
7. Use the formula to convert to Organic Matter:
%Organic Matter= 2x % Organic Carbon
Carbonate
1. Weigh 5g fine material into a shaking bottle
2. Add 100ml 0.2M HCL by pipette and swirl
3. Loosely screw on lid and swirl occasionally during the next hour. Let then
stand over night
4. The next day, indent the bottle by hand, tighten the lid and shake for 2 hours
in reciprocating shaker.
5. Let the suspension settle, pipette 10ml supernatant solution into 100ml
Erlenmeyer flask and add about 25ml water
6. Add a few drops phenolphthalein indicator and titrate with 0.1M NaOH
Calcium, Potassium, Magnesium, Manganese, Sodium by ICP-OES
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
86
1. For the digestion of solid samples, a representative 1 gram (wet weight)
sample is digested with repeated additions of nitric acid (HNO3) and
hydrogen peroxide (H2O2).
2. For ICP-OES analysis, hydrochloric acid (HCl) is added to the initial
digested solution and the sample is refluxed and then diluted to a final
volume of 200 mL. The digested sample is analyzed using the ICP-OES
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
87
8.5. Appendix 5: Classification Accuracies
MAXLIK 2003
Overall accuracy: 88.0482%; Kappa coefficient: 0.86
Ground Truth
(pixels)
Class Eroded rocky Lithosols Lithosols/Rig Brown Lithosols Gypsum/gley
Unclassified 0 0 0 0 0
Eroded rocky 62946 2 2713 754 25362
Lithosols [Gr 0 116602 44 5029 5
Lithosols/Rig 2090 127 105969 297 12702
Brown Lithosols 243 4004 1250 107432 1411
Gypsum/gley [ 17318 38 14225 2459 79141
Halomorphs/ u 0 0 0 0 0
Young soil 0 0 0 0 3
Fluvial soil 6160 0 23 4 805
Sols bruns je 778 1 15 41 81
Water/no data 0 0 0 0 0
Total 89535 120774 124239 116016 119510
Ground Truth
Class Halomorphs Young soil Fluvial soil Sols bruns je
Water/no
data
Unclassified 0 0 0 0 0
Eroded rocky 0 0 5041 370 0
Lithosols [Gr 0 0 0 0 0
Lithosols/Rig 0 0 71 0 68
Brown Lithosols 0 0 5 20 0
Gypsum/gley [ 0 0 548 28 6
Halomorphs/ u 8702 1 0 0 0
Young soil 0 4239 41 136 0
Fluvial soil 0 0 80668 394 0
Sols bruns je 0 10 2174 39546 0
Water/no data 0 0 0 0 182257
Total 8702 4250 88548 40494 182331
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
88
Ground Truth
Class Total
Unclassified 0
Eroded rocky 97188
Lithosols [Gr 121680
Lithosols/Rig 121324
Brown Lithosols 114365
Gypsum/gley [ 113763
Halomorphs/ u 8703
Young soil 4419
Fluvial soil 88054
Sols bruns je 42646
Water/no data 182257
Total 894399
Ground Truth%
Class Eroded rocky Lithosols Lithosols/Rig Brown Lithosols Gypsum/gley
Unclassified 0 0 0 0 0
Eroded rocky 70.3 0 2.18 0.65 21.22
Lithosols [Gr 0 96.55 0.04 4.33 0
Lithosols/Rig 2.33 0.11 85.29 0.26 10.63
Brown Lithosols 0.27 3.32 1.01 92.6 1.18
Gypsum/gley [ 19.34 0.03 11.45 2.12 66.22
Halomorphs/ u 0 0 0 0 0
Young soil 0 0 0 0 0
Fluvial soil 6.88 0 0.02 0 0.67
Sols bruns je 0.87 0 0.01 0.04 0.07
Water/no data 0 0 0 0 0
Total 100 100 100 100 100
Ground Truth%
Class Halomorphs/ uS ols peu evol Fluvial soil Sols bruns je
Water/no
data
Unclassified 0 0 0 0 0
Eroded rocky 0 0 5.69 0.91 0
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
89
Lithosols [Gr 0 0 0 0 0
Lithosols/Rig 0 0 0.08 0 0.04
Brown Lithosols 0 0 0.01 0.05 0
Gypsum/gley [ 0 0 0.62 0.07 0
Halomorphs/ u 100 0.02 0 0 0
Young soil 0 99.74 0.05 0.34 0
Fluvial soil 0 0 91.1 0.97 0
Sols bruns je 0 0.24 2.46 97.66 0
Water/no data 0 0 0 0 99.96
Total 100 100 100 100 100
Ground Thruth %
Class Total
Unclassified 0
Eroded rocky 10.87
Lithosols [Gr 13.6
Lithosols/Rig 13.56
Brown Lithosols 12.79
Gypsum/gley [ 12.72
Halomorphs/ u 0.97
Young soil 0.49
Fluvial soil 9.85
Sols bruns je 4.77
Water/no data 20.38
Total 100
Class Commission Omission Commission Omission
(Percent) (Percent) (Pixels) (Pixels)
Eroded rocky 35.23 29.7 34242/97188 26589/89535
Lithosols [Gr 4.17 3.45 5078/121680 4172/120774
Lithosols/Rig 12.66 14.71 15355/121324 18270/124239
Brown Lithosols 6.06 7.4 6933/114365 8584/116016
Gypsum/gley [ 30.43 33.78 34622/113763 40369/119510
Halomorphs/ u 0.01 0 01 8703 0/8702
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
90
Young soil 4.07 0.26 180/4419 11 4250
Fluvial soil 8.39 8.9 7386/88054 7880/88548
Sols bruns je 7.27 2.34 3100/42646 948/40494
Water/no data 0 0.04 0/182257 74/182331
Class Prod. Acc. User Acc. Prod. Acc. User Acc.
(Percent) (Percent) (Pixels) (Pixels)
Eroded rocky 70.3 64.77 62946/89535 62946/97188
Lithosols [Gr 96.55 95.83 116602/120774 116602/121680
Lithosols/Rig 85.29 87.34 105969/124239 105969/121324
Brown Lithosols 92.6 93.94 107432/116016 107432/114365
Gypsum/gley [ 66.22 69.57 79141/119510 79141/113763
Halomorphs/ u 100 99.99 8702/8702 8702/8703
Young soil 99.74 95.93 4239/4250 4239/4419
Fluvial soil 91.1 91.61 80668/88548 80668/88054
Sols bruns je 97.66 92.73 39546/40494 39546/42646
Water/no data 99.96 100 182257/182331 182257/182257
ASTER 2005
Ground Truth pixel
Class Eroded rocky Lithosols Lithosols/Rig Brown Lithosols Gypsum/gley
Unclassified 0 0 0 0 0
Eroded rocky 53568 1187 2493 8397 19963
Lithosols [Gr 998 106466 5622 16865 11637
Lithosols/Rig 1433 3478 111642 2976 32508
Brown Lithosols 11200 9225 2500 82225 30673
Gypsum/gley [ 2910 362 1847 2304 18851
Cloud [White] 1194 0 0 185 2280
Cloud shadow 255 0 0 0 58
Halomorphs/ u 0 0 0 0 0
Young soil 285 0 0 16 26
Sols bruns je 3354 0 8 170 622
Fluvial soil 14338 56 127 2878 2892
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
91
Total 89535 120774 124239 116016 119510
Ground Truth-
pixel
Class Cloud Cl
oud
shadow
Halomorphs/
u Young soil Sols bruns je
Unclassified 0 0 0 0 0
Eroded rocky 156 128 0 35 220
Lithosols [Gr 0 0 0 0 0
Lithosols/Rig 0 0 0 0 0
Brown Lithosols 9 0 0 1 13
Gypsum/gley [ 9 73 0 9 30
Cloud [White] 22575 41 24 118 673
Cloud shadow 0 23103 0 0 43
Halomorphs/ u 0 0 8678 0 0
Young soil 171 1 0 5479 2114
Sols bruns je 345 347 0 352 15208
Fluvial soil 12 1 0 3 184
Total 23277 23694 8702 5997 18485
Ground Truth-
pixel
Class Fluvial soil Total
Unclassified 0 0
Eroded rocky 6154 92301
Lithosols [Gr 11 141599
Lithosols/Rig 64 152101
Brown Lithosols 1563 137409
Gypsum/gley [ 160 26555
Cloud [White] 57 27147
Cloud shadow 0 23459
Halomorphs/ u 0 8678
Young soil 76 8168
Sols bruns je 1222 21628
Fluvial soil 79241 99732
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
92
Total 88548 738777
Ground Truth%
Class Eroded rocky Lithosols Lithosols/Rig Brown Lithosols Gypsum/gley
Unclassified 0 0 0 0 0
Eroded rocky 59.83 0.98 2.01 7.24 16.7
Lithosols [Gr 1.11 88.15 4.53 14.54 9.74
Lithosols/Rig 1.6 2.88 89.86 2.57 27.2
Brown Lithosols 12.51 7.64 2.01 70.87 25.67
Gypsum/gley [ 3.25 0.3 1.49 1.99 15.77
Cloud [White] 1.33 0 0 0.16 1.91
Cloud shadow 0.28 0 0 0 0.05
Halomorphs/ u 0 0 0 0 0
Young soil 0.32 0 0 0.01 0.02
Sols bruns je 3.75 0 0.01 0.15 0.52
Fluvial soil 16.01 0.05 0.1 2.48 2.42
Total 100 100 100 100 100
Ground Truth%
Class Cloud Cl
oud
shadow
Halomorphs/
u Young soil Sols bruns je
Unclassified 0 0 0 0 0
Eroded rocky 0.67 0.54 0 0.58 1.19
Lithosols [Gr 0 0 0 0 0
Lithosols/Rig 0 0 0 0 0
Brown Lithosols 0.04 0 0 0.02 0.07
Gypsum/gley [ 0.04 0.31 0 0.15 0.16
Cloud [White] 96.98 0.17 0.28 1.97 3.64
Cloud shadow 0 97.51 0 0 0.23
Halomorphs/ u 0 0 99.72 0 0
Young soil 0.73 0 0 91.36 11.44
Sols bruns je 1.48 1.46 0 5.87 82.27
Fluvial soil 0.05 0 0 0.05 1
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
93
Total 100 100 100 100 100
Ground Truth (Percent)
Class Fluvial soil Total
Unclassified 0 0
Eroded rocky 6.95 12.49
Lithosols [Gr 0.01 19.17
Lithosols/Rig 0.07 20.59
Brown Lithosols 1.77 18.6
Gypsum/gley [ 0.18 3.59
Cloud [White] 0.06 3.67
Cloud shadow 0 3.18
Halomorphs/ u 0 1.17
Young soil 0.09 1.11
Sols bruns je 1.38 2.93
Fluvial soil 89.49 13.5
Total 100 100
Class Commission Omission Commission Omission
(Percent) (Percent) (Pixels) (Pixels)
Eroded rocky 41.96 40.17 38733/92301 35967/89535
Lithosols [Gr 24.81 11.85 35133/141599 14308/120774
Lithosols/Rig 26.6 10.14 40459/152101 12597/124239
Brown Lithosols 40.16 29.13 55184/137409 33791/116016
Gypsum/gley [ 29.01 84.23 7704/26555 100659/119510
Cloud [White] 16.84 3.02 4572/27147 702/23277
Cloud shadow 1.52 2.49 356/23459 591/23694
Halomorphs/ u 0 0.28 0/8678 24/8702
Young soil 32.92 8.64 2689/8168 518/5997
Sols bruns je 29.68 17.73 6420/21628 3277/18485
Fluvial soil 20.55 10.51 20491/99732 9307/88548
Class Prod. Acc. User Acc. Prod. Acc. User Acc.
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
94
(Percent) (Percent) (Pixels) (Pixels)
Eroded rocky 59.83 58.04 53568/89535 53568/92301
Lithosols [Gr 88.15 75.19 106466/120774 106466/141599
Lithosols/Rig 89.86 73.4 111642/124239 111642/152101
Brown Lithosols 70.87 59.84 82225/116016 82225/137409
Gypsum/gley [ 15.77 70.99 18851/119510 18851/26555
Cloud [White] 96.98 83.16 22575/23277 22575/27147
Cloud shadow 97.51 98.48 23103/23694 23103/23459
Halomorphs/ u 99.72 100 8678/8702 8678/8678
Young soil 91.36 67.08 5479/5997 5479/8168
Sols bruns je 82.27 70.32 15208/18485 15208/21628
Fluvial soil 89.49 79.45 79241/88548 79241/99732
MNF classification- ASTER 2003
Ground Truth-
pixel
Class Eroded rocky Lithosols Lithosols/Rig Brown Lithosols Gypsum/gley
Unclassified 0 0 0 0 0
Eroded rocky 36125 4 14481 1461 25182
Lithosols [Gr 1 112491 15 7770 2
Lithosols/Rig 21257 130 79731 950 30538
Brown Lithosols 1842 7899 1940 100709 6780
Gypsum/gley [ 20805 250 18009 4823 52051
Halomorphs/ u 0 0 0 0 0
Young soil 49 0 0 1 6
Fluvial soil 7569 0 10044 193 4673
Sols bruns je 1887 0 19 109 278
Water/no data 0 0 0 0 0
Total 89535 120774 124239 116016 119510
Ground Truth-
pixel
Class Halomorphs/ u Young soil Fluvial soil Sols bruns je
Water/no
data
Unclassified 0 0 0 0 0
Eroded rocky 0 0 7706 543 0
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
95
Lithosols [Gr 0 0 0 0 0
Lithosols/Rig 0 0 7605 35 0
Brown Lithosols 0 0 55 13 0
Gypsum/gley [ 0 0 645 13 0
Halomorphs/ u 8702 0 0 0 0
Young soil 0 4221 216 357 0
Fluvial soil 0 1 66656 1734 0
Sols bruns je 0 28 5665 37799 0
Water/no data 0 0 0 0 182331
Total 8702 4250 88548 40494 182331
Ground Truth-
pixel
Class Total
Unclassified 0
Eroded rocky 85502
Lithosols [Gr 120279
Lithosols/Rig 140246
Brown Lithosols 119238
Gypsum/gley [ 96596
Halomorphs/ u 8702
Young soil 4850
Fluvial soil 90870
Sols bruns je 45785
Water/no data 182331
Total 894399
Ground Truth %
Class Eroded rocky Lithosols Lithosols/Rig Brown Lithosols Gypsum/gley
Unclassified 0 0 0 0 0
Eroded rocky 40.35 0 11.66 1.26 21.07
Lithosols [Gr 0 93.14 0.01 6.7 0
Lithosols/Rig 23.74 0.11 64.18 0.82 25.55
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
96
Brown Lithosols 2.06 6.54 1.56 86.81 5.67
Gypsum/gley [ 23.24 0.21 14.5 4.16 43.55
Halomorphs/ u 0 0 0 0 0
Young soil 0.05 0 0 0 0.01
Fluvial soil 8.45 0 8.08 0.17 3.91
Sols bruns je 2.11 0 0.02 0.09 0.23
Water/no data 0 0 0 0 0
Total 100 100 100 100 100
Ground Truth %
Class Halomorphs/ u Young soil Fluvial soil Sols bruns je
Water/no
data
Unclassified 0 0 0 0 0
Eroded rocky 0 0 8.7 1.34 0
Lithosols [Gr 0 0 0 0 0
Lithosols/Rig 0 0 8.59 0.09 0
Brown Lithosols 0 0 0.06 0.03 0
Gypsum/gley [ 0 0 0.73 0.03 0
Halomorphs/ u 100 0 0 0 0
Young soil 0 99.32 0.24 0.88 0
Fluvial soil 0 0.02 75.28 4.28 0
Sols bruns je 0 0.66 6.4 93.34 0
Water/no data 0 0 0 0 100
Total 100 100 100 100 100
Ground Truth %
Class Total
Unclassified 0
Eroded rocky 9.56
Lithosols [Gr 13.45
Lithosols/Rig 15.68
Brown Lithosols 13.33
Gypsum/gley [ 10.8
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
97
Halomorphs/ u 0.97
Young soil 0.54
Fluvial soil 10.16
Sols bruns je 5.12
Water/no data 20.39
Total 100
Class Commission Omission Commission Omission
(Percent) (Percent) (Pixels) (Pixels)
Eroded rocky 57.75 59.65 49377/85502 53410/89535
Lithosols [Gr 6.47 6.86 7788/120279 8283/120774
Lithosols/Rig 43.15 35.82 60515/140246 44508/124239
Brown Lithosols 15.54 13.19 18529/119238 15307/116016
Gypsum/gley [ 46.11 56.45 44545/96596 67459/119510
Halomorphs/ u 0 0 0/8702 0/8702
Young soil 12.97 0.68 629/4850 29/4250
Fluvial soil 26.65 24.72 24214/90870 21892/88548
Sols bruns je 17.44 6.66 7986/45785 2695/40494
Water/no data 0 0 0/182331 0/182331
Class Prod. Acc. User Acc. Prod. Acc. User Acc.
(Percent) (Percent) (Pixels) (Pixels)
Eroded rocky 40.35 42.25 36125/89535 36125/85502
Lithosols [Gr 93.14 93.53 112491/120774 112491/120279
Lithosols/Rig 64.18 56.85 79731/124239 79731/140246
Brown Lithosols 86.81 84.46 100709/116016 100709/119238
Gypsum/gley [ 43.55 53.89 52051/119510 52051/96596
Halomorphs/ u 100 100 8702/8702 8702/8702
Young soil 99.32 87.03 4221/4250 4221/4850
Fluvial soil 75.28 73.35 66656/88548 66656/90870
Sols bruns je 93.34 82.56 37799/40494 37799/45785
Water/no data 100 100 182331/182331 182331/182331
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
98
MNF classification- ASTER 2005
Ground Truth-
pixel
Class Eroded rocky Lithosols Lithosols/Rig Brown Lithosols Gypsum/gley
Unclassified 0 0 0 0 0
Eroded rocky 31142 705 1705 9096 12613
Lithosols [Gr 2034 100725 7440 16837 17013
Lithosols/Rig 1856 5004 107358 6034 33765
Brown Lithosols 15899 12650 3682 75479 32656
Gypsum/gley [ 6240 1483 3268 3640 17456
Cloud [White] 602 0 0 116 1769
Cloud shadow 826 0 7 0 279
Halomorphs/ u 0 0 0 0 0
Young soil 1179 0 0 69 59
Sols bruns je 3508 0 8 161 454
Fluvial soil 26249 207 771 4584 3446
Total 89535 120774 124239 116016 119510
Ground Truth-
pixel
Class Cloud C
loud
shadow
Halomorphs/
u Young soil Sols bruns je
Unclassified 0 0 0 0 0
Eroded rocky 74 103 0 68 514
Lithosols [Gr 2 0 0 0 0
Lithosols/Rig 0 3 0 0 0
Brown Lithosols 42 0 0 2 0
Gypsum/gley [ 27 57 0 6 217
Cloud [White] 22223 0 9 177 567
Cloud shadow 0 23176 0 0 169
Halomorphs/ u 0 0 8676 0 0
Young soil 421 0 15 5160 3587
Sols bruns je 470 352 2 548 13305
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
99
Fluvial soil 18 3 0 36 126
Total 23277 23694 8702 5997 18485
Ground Truth-
pixel
Class Fluvial soil Total
Unclassified 0 0
Eroded rocky 8658 64678
Lithosols [Gr 23 144074
Lithosols/Rig 299 154319
Brown Lithosols 2869 143279
Gypsum/gley [ 128 32522
Cloud [White] 0 25463
Cloud shadow 2 24459
Halomorphs/ u 0 8676
Young soil 643 11133
Sols bruns je 726 19534
Fluvial soil 75200 110640
Total 88548 738777
Ground Trutth %
Class Eroded rocky Lithosols Lithosols/Rig Brown Lithosols Gypsum/gley
Unclassified 0 0 0 0 0
Eroded rocky 34.78 0.58 1.37 7.84 10.55
Lithosols [Gr 2.27 83.4 5.99 14.51 14.24
Lithosols/Rig 2.07 4.14 86.41 5.2 28.25
Brown Lithosols 17.76 10.47 2.96 65.06 27.32
Gypsum/gley [ 6.97 1.23 2.63 3.14 14.61
Cloud [White] 0.67 0 0 0.1 1.48
Cloud shadow 0.92 0 0.01 0 0.23
Halomorphs/ u 0 0 0 0 0
Young soil 1.32 0 0 0.06 0.05
Sols bruns je 3.92 0 0.01 0.14 0.38
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
100
Fluvial soil 29.32 0.17 0.62 3.95 2.88
Total 100 100 100 100 100
Ground Truth %
Class Cloud C
loud
shadow
Halomorphs/
u Young soil Sols bruns je
Unclassified 0 0 0 0 0
Eroded rocky 0.32 0.43 0 1.13 2.78
Lithosols [Gr 0.01 0 0 0 0
Lithosols/Rig 0 0.01 0 0 0
Brown Lithosols 0.18 0 0 0.03 0
Gypsum/gley [ 0.12 0.24 0 0.1 1.17
Cloud [White] 95.47 0 0.1 2.95 3.07
Cloud shadow 0 97.81 0 0 0.91
Halomorphs/ u 0 0 99.7 0 0
Young soil 1.81 0 0.17 86.04 19.4
Sols bruns je 2.02 1.49 0.02 9.14 71.98
Fluvial soil 0.08 0.01 0 0.6 0.68
Total 100 100 100 100 100
Ground Truth %
Class Fluvial soil Total
Unclassified 0 0
Eroded rocky 9.78 8.75
Lithosols [Gr 0.03 19.5
Lithosols/Rig 0.34 20.89
Brown Lithosols 3.24 19.39
Gypsum/gley [ 0.14 4.4
Cloud [White] 0 3.45
Cloud shadow 0 3.31
Halomorphs/ u 0 1.17
Young soil 0.73 1.51
Sols bruns je 0.82 2.64
IDENTIFICATION OF SOIL PROPERTY VARIATION USING SPECTRAL AND STATISTICAL ANALYSIS
101
Fluvial soil 84.93 14.98
Total 100 100
Class Commission Omission Commission Omission
(Percent) (Percent) (Pixels) (Pixels)
Eroded rocky 51.85 65.22 33536/64678 58393/89535
Lithosols [Gr 30.09 16.6 43349/144074 20049/120774
Lithosols/Rig 30.43 13.59 46961/154319 16881/124239
Brown Lithosols 47.32 34.94 67800/143279 40537/116016
Gypsum/gley [ 46.33 85.39 15066/32522 102054/119510
Cloud [White] 12.72 4.53 3240/25463 1054/23277
Cloud shadow 5.25 2.19 1283/24459 518/23694
Halomorphs/ u 0 0.3 0/8676 26/8702
Young soil 53.65 13.96 5973/11133 837/5997
Sols bruns je 31.89 28.02 6229/19534 5180/18485
Fluvial soil 32.03 15.07 35440/110640 13348/88548
Class Prod. Acc. User Acc. Prod. Acc. User Acc.
(Percent) (Percent) (Pixels) (Pixels)
Eroded rocky 34.78 48.15 31142/89535 31142/64678
Lithosols [Gr 83.4 69.91 100725/120774 100725/144074
Lithosols/Rig 86.41 69.57 107358/124239 107358/154319
Brown Lithosols 65.06 52.68 75479/116016 75479/143279
Gypsum/gley [ 14.61 53.67 17456/119510 17456/32522
Cloud [White] 95.47 87.28 22223/23277 22223/25463
Cloud shadow 97.81 94.75 23176/23694 23176/24459
Halomorphs/ u 99.7 100 8676/8702 8676/8676
Young soil 86.04 46.35 5160/5997 5160/11133
Sols bruns je 71.98 68.11 13305/18485 13305/19534
Fluvial soil 84.93 67.97 75200/88548 75200/110640