Predictive Modelling of Amphibian Distribution Using ... Output/Henok Neggal.pdf · Predictive...
-
Upload
vuongduong -
Category
Documents
-
view
216 -
download
0
Transcript of Predictive Modelling of Amphibian Distribution Using ... Output/Henok Neggal.pdf · Predictive...
Predictive Modelling of Amphibian Distribution
Using Ecological Survey Data: a case study of
Central Portugal
Henok Eyob Negga
March, 2007
Course Title: Geo-Information Science and Earth Observation
for Environmental Modelling and Management
Level: Master of Science (Msc)
Course Duration: September 2005 - March 2007
Consortium partners: University of Southampton (UK)
Lund University (Sweden)
University of Warsaw (Poland)
International Institute for Geo-Information Science
and Earth Observation (ITC) (The Netherlands)
GEM thesis number: 2005-25
Predictive Modelling of Amphibian Distribution Using
Ecological Survey Data: a case study of
Central Portugal
By
Henok Eyob Negga
Thesis submitted to the International Institute for Geo-information Science and Earth
Observation in partial fulfilment of the requirements for the degree of Master of
Science in Geo-information Science and Earth Observation: Environmental
Modelling and Management
Thesis Assessment Board
Dr. J. de Leeuw (Chairman)
Prof. P.M Atkinson (External Examiner)
Dr. B. Toxopeus (1st Supervisor)
Dr. P.J.W. Arntzen (2nd
Supervisor)
International Institute for Geo-Information Science and
Earth Observation, Enschede, The Netherlands
Disclaimer
This document describes work undertaken as part of a programme of study at
the International Institute for Geo-information Science and Earth Observation.
All views and opinions expressed therein remain the sole responsibility of the
author, and do not necessarily represent those of the institute.
I certify that although I have conferred with others in preparing for this assignment,
and drawn upon a range of sources cited in this work, the content of this thesis report
is my original work.
Henok Eyob Negga
i
Abstract
The increasing pressure on natural habitats, due to economic development and
climatic changes, has lead to habitat fragmentation and will cause an eventual loss of
biodiversity. The main aims of the study were (1) to estimate the geographic
distribution of T.marmoratus and T.pygmaeus for two study areas (Abrantes and
Fatima) in central Portugal. (2) To find the most important environmental predictor
variable. (3) To seek for similarities and differences in habitat preferences between
the two species. (4) To make a scenario model showing future species distribution.
For (1-3), Maximum Entropy (MaxEnt) modelling technique using presence-only
data with a set of environmental predictor variables and statistical significance tests
were used. For (4), a Multi Criteria Evaluation (MCE) scenario model based on the
current habitat-species relationship established on the MaxEnt model was created.
The MCE model among other variables incorporated a projected land cover map.
Multi Layer Perception (MLP) neural network was used for land cover change
detection and Markov chain model for projecting the land cover to the year 2020.
Predictor variables were added to the MCE model using fuzzy logic approach.
Membership functions were determined based on response curves of each variable
from MaxEnt model.
All the Maxent models performed better than random prediction with high AUC
(area under curve). Land cover was the most important variable in almost all cases
and the distribution of the species was significantly (P<0.0001) associated with it.
Common habitats for both species in Abrantes area were ‘heterogeneous agriculture’
and ‘mixed forest’ land cover classes. In Fatima area, the common habitats were
‘coniferous forests’. T.pygmaeus maintained its strong presence on ‘heterogeneous
agriculture’ classes in both study areas. On the contrary, T.marmoratus was
completely absent from the same class in Fatima area, instead it inhabited permanent
crop lands.
Overall, the MaxEnt model indicated that the presence of both species was very
much dependent on vegetation cover close to its aquatic habitat. However,
T.pygmaeus has also shown higher adaptability to less vegetated and sometimes
urban areas. The MCE scenario model prediction highlighted that the distribution of
both species will become patchy, scattered and reduced in habitat area. Conservation
planners could use the model results as base information to increase the connectivity
of patches within the landscape in order to delineate corridors for the species.
Keywords: Predictive models, MaxEnt, MLP, MCE, Markov Chain
ii
Acknowledgements
First and foremost, I am honoured to be a part of this prestigious Erasmus Mundus
(GEM) program. It has been a privilege and an extraordinary experience – My
heartfelt gratitude to European Union for allocating fund from the tax payer’s
money.
I owe deep gratitude to Program coordinators, Prof. P.M. Atkinson, Prof. P. Pilesjö,
Prof. dr. hab. K. Dabrowska-Zielinska and Prof. A. Skidmore for designing and
coordinating such a unique program…Thanks to you all, i have now better
opportunities. Thanks also go to all who have been involved in the teaching-learning
process. Special thanks to, Steff Webb, Karin Larsson and Jorien Terlouw for
handling all the hectic administration works and for all those nice parties….. You
made us feel at home.
I am most indebted to my first supervisor Dr. Bert Toxopeus for his invaluable
guidance and constructive comments starting from proposal writing, field work and
to the final thesis writing. More importantly, for allowing me to work on my own
pace and giving me the space I needed. I would like to extend my sincere
appreciation to Dr. Kees de Bie for providing and preparing various data and most of
all for his friendly approach all the way from Poland. Many thanks, to Nuno Curado
and his family for making our Portugal field trip comfortable and memorable.
Eu tive uma estadia maramvilhosa. Obrigado.
Exceptional gratitude to all my fellow GEM friends, i have enjoyed your company,
support and above all, for your friendship. I would be unfair if I fail to mention
Abel, Juan, Maina, Matt, Mila, Pengu, Ritesh, Sarika,Shawn,Sukhad and Yan.
Thanks for your willingness to help, for the wonderful coffee and lunch breaks we
had. I am going to miss you all.
Special thanks to my family at home. My father and mother, my little sisters, Meron,
Birsahel, Wengel and Sarah whose continuous support and encouragement have
made me come this far.
I praise the Lord, Alpha and Omega, my source of inspiration, for without Him I can
do nothing.
iii
This thesis is an outcome of the BIOFRAG–ITC internal research project in
collaboration among:
International Institute for Geo-Information Science and Earth Observation
(ITC), Enschede, The Netherlands
Museum of Natural History (Naturalis), Leiden, The Netherlands
Centro de Investigação em Biodiversidade e Recursos Genéticos (CIBIO), Porto,
Portugal
Special Thanks to Dr. J.W. Arntzen and Dr. N. Sillero and Dr. M.A. Carretero.
Without their cooperation and willingness to share data and assist in the field, this
research would not have been possible.
i
Contents
1. Introduction....................................................................................................... 1 1.1. General Background ................................................................................. 1 1.2. Problem Statement and Justification ......................................................... 2 1.3. Research Objectives .................................................................................. 3
1.3.1. General Objective............................................................................. 3 1.3.2. Specific Objectives........................................................................... 3
1.4. Research Questions ................................................................................... 3 1.5. Research Hypotheses ................................................................................ 3 1.6. Research Approaches................................................................................ 4
2. Materials and Methods...................................................................................... 7 2.1. Study Areas ............................................................................................... 7 2.2. Herpetological Species.............................................................................. 8 2.3. Species Occurrence Data........................................................................... 8 2.4. Environmental Predictor Variables ........................................................... 9
2.4.1. Normalized Difference Vegetation Index (NDVI) ........................... 9 2.4.2. Climatic variables............................................................................. 9 2.4.3. Land Cover....................................................................................... 9 2.4.4. Landscape Matrices ........................................................................ 10 2.4.5. Topographic Variables ................................................................... 12 2.4.6. Proximity Variables........................................................................ 12
2.5. Multi-collinearity .................................................................................... 13 3. Species Distribution Modelling (Using MaxEnt Model) ................................ 15
3.1. Introduction............................................................................................. 15 3.2. Methods................................................................................................... 15
3.2.1. Maximum Entropy (MaxEnt Model).............................................. 15 3.2.2. Presence-Absence Maps................................................................. 16 3.2.3. Model Evaluation ........................................................................... 16 3.2.4. Predictor Variable Importance ....................................................... 17 3.2.5. Habitat Preference .......................................................................... 17
3.3. Results..................................................................................................... 18 3.3.1. Abrantes Area................................................................................. 18 3.3.2. Fatima Area .................................................................................... 26
3.4. Discussion ............................................................................................... 34 3.4.1. Habitat – Species Relationship....................................................... 34
3.5. Conclusions............................................................................................. 37 4. Future Species Distribution (Scenario Model)................................................ 39
4.1. Introduction............................................................................................. 39 4.2. Methods................................................................................................... 40
4.2.1. Multi Layer Perception (MLP) Neural Networks........................... 40 4.2.2. Markov Chain Analysis.................................................................. 41 4.2.3. Multi Criteria Evaluation................................................................ 42 4.2.4. Comparison of MaxEnt and MCE Model....................................... 44
4.3. Results..................................................................................................... 45
ii
4.3.1. Land cover change Analysis...........................................................45 4.3.2. Future Scenario Model ...................................................................45
4.4. Discussion ...............................................................................................47 4.4.1. Impacts of Land cover changes ......................................................47 4.4.2. What if land cover continues to change… (Scenario Model).........48
4.5. Conclusions.............................................................................................48 5. General Discussions ........................................................................................51
5.1. The Niche Concept..................................................................................51 5.2. MaxEnt Model Performance ...................................................................52 5.3. Model Uncertainty ..................................................................................52 5.4. Model result and its Implications for conservation activities ..................53
6. General Conclusions and Recommendations ..................................................55 6.1. Conclusions.............................................................................................55 6.2. Recommendations ...................................................................................56
7. References.......................................................................................................57 8. Appendices......................................................................................................65
iii
List of figures
Figure 1 A simplified Schema of the research framework.......................................... 5
Figure 2 Location Map of the Study Area .................................................................. 7
Figure 3a Triturus marmoratus and Figure 3b Triturus pygmaeus ........................... 8
Figure 4 confusion matrix ........................................................................................ 17
Figure 5 Predicted species distribution of (a) T.marmoratus (b) T.pygmaeus. For
comparison and statistical analysis purposes, these maps were converted to binary
presence-absence models using equal training sensitivity-specificity threshold
values........................................................................................................................ 19
Figure 6 a. & 6 b The environmental variable with highest gain when used in
isolation is corine land cover, which therefore appears to have the most useful
information by itself. The environmental variable that decreases the gain the most
when it is omitted is corine land cover, which therefore appears to have the most
information that isn't present in the other variables................................................. 20
Figure 7a &7b Mosaic plot showing the percentage frequency occurrence of
T.marmoratus and T.pygmaeus in land cover classes .............................................. 21
Figure 8a, b & c box plot graphs showing comparison between presence of
T.marmoratus and T.pygmaeus presence against suitable environmental predictor
variables ................................................................................................................... 23
Figure 9 Comparison of landscape area percentage and percentage presence of.. 24
Figure 10 Predicted species distribution of (a) T.marmoratus (b) T.pygmaeus. For
comparison purposes these maps were converted to binary presence-absence models
using equal training sensitivity-specificity threshold values. ................................... 26
Figure 11 The environmental variable with highest gain when used in isolation is
corine land cover, which therefore appears to have the most useful information by
itself. The environmental variable that decreases the gain the most when it is omitted
is again corine land cover which therefore appears to have the most information
that isn't present in the other variables. ................................................................... 28
Figure 12 The environmental variable with highest gain when used in isolation is
drainage distance, which therefore appears to have the most useful information by
itself. The environmental variable that decreases the gain the most when it is omitted
is drainage distance, which therefore appears to have the most information that isn't
present in the other variables. .................................................................................. 28
Figure 13 a &b, Mosaic plot showing the frequency of occurrence of the species in
each land cover class................................................................................................ 29
Figure 14a, b, c & d box plot graphs showing comparison between species presence
against predictor variables....................................................................................... 31
iv
Figure 15 Comparison of Landscape area percentage of each land cover class and
..................................................................................................................................32
Figure 16 Correspondence analysis plot showing a) Heterogeneous
agricultures(class 5) as common habitat for T.marmoratus and T.pygmaeus in
Abrantes area b) coniferous forests(class 7) as common habitat for T.marmoratus
and T.pygmaeus in Fatima area ...............................................................................35
Figure 17a &17b Comparison of permanent crops and heterogeneous agriculture
class patches proximity to vegetation covers. In Abrantes area heterogeneous
classes are closer to vegetation covers where as the opposite is true in Fatima area.
..................................................................................................................................36
Figure 18 Schematic Diagram of the scenario model ..............................................39
Figure 19 a) Neural network topology and b) accuracy of the neural network .......41
Figure 20 Sigmoidal a) monotonically increasing function (b) monotonically
decreasing (c & d) symmetric functions ...................................................................43
Figure 21 Comparison of MaxEnt and MCE scenario model distribution of
T.marmoratus and T.pygmaeus. Mean plus standard deviation of the distribution
was used as a threshold. ...........................................................................................46
Figure 22 adopted from Soberson and Peterson, (2005), region A represents
geographic regions with appropriate set of abiotic factors. Region B represents
regions where right combination of interacting species occurs. Region C represents
areas accessible to the species with out barriers......................................................51
Figure 23 Figure adopted from Peterson, 2006 .......................................................52
v
List of Tables
Table 1 Description of each land cover classes........................................................ 11
Table 2 Landscape Metrics and formula used to calculate ...................................... 12
Table 3 Variance Inflation Factor ............................................................................ 13
Table 4 Equal training sensitivity and specificity threshold used to classify the
MaxEnt models ......................................................................................................... 16
Table 5 ROC curve and Kappa statistics results for T.marmoratus and T.pygmaeus
using training and test data. ..................................................................................... 18
Table 6 Chi-Square test showing the significant association of the species with Land
cover predictor variable. .......................................................................................... 22
Table 7 Landscape indices of corine 2000 land cover ............................................. 25
Table 8 Comparison of suitable environmental range of T.marmoratus and
T.pygmaeus with t-test .............................................................................................. 25
Table 9 ROC (AUC) and Kappa accuracy assessment............................................. 27
Table 10 Chi-Square test showing the significant association of the species with ... 27
Table 11 Landscape indices of Fatima land cover ................................................... 33
Table 12 comparison of suitable environmental range of T.marmoratus and
T.pygmaeus with t-test .............................................................................................. 33
vi
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
1
1. Introduction
1.1. General Background
The increasing pressure on natural habitats due to economic development, coupled
with climatic changes has lead to habitat fragmentation and eventual loss of
biodiversity. Habitat fragmentation alters population viability by decreasing habitat
availability and increasing isolation (decreasing connectivity) of each remaining
habitat patch (Joly et al., 2004). Decreasing habitat area and connectivity results in
reduced population size where as isolation affects viability by impeding immigration
from other populations (Olsen, 2006; Joly et al., 2003; Velázquez et al., 2003;
Linderman et al., 2005). Determining the status of species is of prime importance to
both ecologists and conservation biologists (Hecnar & M’closkey, 1996).
A prerequisite for an effective conservation strategy is a continuous and consistent
monitoring program on the state and spatial extent of sensitive habitats (Bock,
2003). Nonetheless, there is a wide gap between the information available on the
spatial distribution of biodiversity and methods of habitat monitoring. With this
respect, recently in Europe different land use and habitat mapping schemes have
been designed in order to tackle the problem (Weiers, 2004 & Bock, 2004).
Furthermore, national and international legal documents have been adopted at
European level in order to maintain acceptable level of biodiversity and ensure
sustainable use of natural resources (e.g. EU-Habitat Directives, and EU-Bird
Directive (Directive 79/409/CEE) (Calcerrada & Luque, 2006; Bock, 2003; Weiers,
2004).
At a regional level, one of the most important eco-region in Europe is the
Mediterranean Basin characterized by high levels of biodiversity and endemic
species (IUCN, 2005). The Mediterranean Basin is considered as global biodiversity
hot spot mainly due to the area escaped recent ice age and, the presence of
significant massifs (e.g. Atlas, Southern Taurus, Gudar, Javalambre, and Levant).
The main current threats to the Mediterranean Basin are desertification due to
deforestation, urbanization, pollution, economic activities including infrastructure,
expansion of agriculture, industry and tourism (IUCN, 2005). Data for Morocco
shows that out of 3800 plant species present, 829 are endemic. Greece follows by
554 endemic species out of 4000 plant species. Similarly, amphibians which are the
focus of this research are characterized by 35(56%) endemism out of 62 species
present (IUCN, 2005).
Amphibians are relatively immobile and are especially prone to local extinction
caused by human caused transformation and fragmentation of their habitat
Introduction
2
(Woodford, 2003). Movements of amphibians across the landscape result from both
complex life cycles and dispersal (Joly et al., 2003). Complex life cycle implies two
different habitats at a local scale. Breeding occurs at aquatic habitats while other
activities occur on land. Breeding migration connects the terrestrial home of the
amphibians to the aquatic breeding sites. Hence, to ensure functional integrity of the
annual cycle of amphibians, not only the preservation, maintenance of habitats is
important but also all elements of the landscape that could pose as a spatial barrier
for connectivity between habitats (Hehl- Lange, 2001). Large rivers, canals,
highways and human buildings constitute absolute barriers to amphibian movements
and limit the abundance and dispersal (Löfvenhaft et al., 2004; Joly, 2004). With
this respect, Hehl-Lange, (2001) described the costs of movement between habitat as
physiological (energy demand for movement) and mortality risk due to lack of
refuges against predators or demographic sinks such as roads.
1.2. Problem Statement and Justification
Fragmentation and loss of the natural habitats mainly due to anthropogenic causes is
the main threat for the decline in number and species richness of amphibians
(Woodford, 2003; Joly, 2004; Pearson, 1999; Hecnar & M’closkey, 1996).
Determining the status of species is often difficult because of limited knowledge of
population dynamics and distribution (Hecnar & M’closkey, 1996). Lack of
consistent and up-to-date information on type, location, size and quality of natural
habitats has been identified as a major constraint (Weiers et al., 2004). From the
limited information available, most research studies focus on plants, birds and
mammals rather than herpetofaunal distribution (Guisan & Hofer, 2003).
The conventional approaches to observe species distributions are ground surveys,
aerial photography, telemetry and satellite tracking (de leeuw et al., 2002). As most
of these methods involve much field surveys, monitoring at national and regional
scale, can not realistically keep pace with the rate of change in the landscape
(Osborne et al., 2005) and requires a substantial amount of capital, time and
manpower. Thus recently, GIS based predictive modelling approaches coupled with
statistical tools have been used to analyze species distribution (Arntzen, 2006; Joly,
2004; Corsi et al., 2000; Guisan & Zimmermann, 2000; Weiers, 2004; de leeuw et
al., 2002). In this context, predictive modelling approach makes use of the limited
data available to come up with potential areas suitable for the species to survive. It
compares environmental data of all cells (pixels) with that of cells where the species
have been observed (Chefaoui et al., 2005). More importantly, predictive habitat
distribution modelling provides a tool to study the relationship between distribution
of species given a set of environmental conditions (Guisan & Zimmermann, 2000;
Calcerrada & Luque, 2006).
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
3
The Predictive distribution model makes use of environmental predictor variables
such as climate (temperature, rainfall, etc..), altitude, land cover , vegetation/NDVI,
soil moisture, distance from water bodies, drainage, rivers etc…that are assumed to
be important for the distribution of the species. The information derived is vital for
bio-geographical and conservation studies (Chefaoui et al., 2005; Phillips et al.,
2006). With regard to conservation, potential distribution area identification can
help locate sites suitable for reintroduction programs, or faunistic corridors
favouring success in regional conservation planning (Chefaoui et al., 2005).
1.3. Research Objectives
1.3.1. General Objective
The general objectives of this research is to model amphibian distribution using
predictive modelling approach, thereby to contribute for effective conservation and
planning activities of the amphibian species. In order to achieve these general
objectives of the research, the following specific objectives are proposed.
1.3.2. Specific Objectives
• To study habitat-species relationship and to identify important
environmental parameters
• To predict the distribution of T.marmoratus and T.pygmaeus in the future
1.4. Research Questions
The research will try to answer the following questions.
1. Which environmental parameter is the most important for determining the
distribution of Trtiturus marmoratus and Triturus pygamaeus?
2. Are there significant associations of both T.marmoratus and T.pygmaeus to
common land cover classes (habitats)?
3. Are there significant differences in suitable environmental parameters for
presence between T.marmoratus and T.pygmaeus?
4. Is it possible to predict the distribution of the species in the future based on
change analysis of land cover map?
1.5. Research Hypotheses
Hypothesis 1
H0: There are no significant associations of T.marmoratus and T.pygmaeus to
common habitats.
H1: There are significant associations of T.marmoratus and T.pygmaeus to
common habitats.
Introduction
4
Hypothesis 2
H0: There are no significant differences in ranges of environmental conditions
and presence of T.marmoratus and T.pygmaeus.
H1: There are significant differences in ranges of environmental conditions and
presence of T.marmoratus and T.pygmaeus.
1.6. Research Approaches
This research has two main parts: a species distribution model based on presence
only data using Maximum Entropy (MaxEnt) method (Phillips et al., 2006), which
tries to answer research questions 1-3, and a Multi Criteria Evaluation (MCE) model
showing a scenario of future species distribution based on change analysis of land
cover using neural networks and Markov chain analysis (Research question 4).
Furthermore, different landscape indices are calculated in order to characterize the
state and level of fragmentation within the different land cover classes and also to
assess its implication on the distribution of the species.
Presence only data is chosen because often accurate data on absence of species is
lacking (Brotons et al., 2004) and even when available it may be of questionable
value(Anderson et al., 2003). Since this research is based on observations
(occurrences) of herpetofaunal species, mainly an inductive approach will be
adopted. On the other hand, a deductive approach might be followed when there is a
need to use an expert knowledge, e.g. in selecting environmental predictor variables
or in determining threshold values. Figure 1 summarizes the overall conceptual
framework of the research.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
5
Figure 1 A simplified Schema of the research framework
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
7
2. Materials and Methods
2.1. Study Areas
Two study areas located in central Portugal are considered for examining species
distribution (Figure 2). The first study area, herein after known as Fatima lies in the
western coast of Portugal where as the second area hereinafter known as Abrantes is
located in the mainland along Rio Tagus (Tagus River). Geographically the study
areas are bounded by 9o50’22’’W – 7
026’12’’W, 39
036’07’’N – 39
022’33’’N.
Generally, the climate of Portugal is classified as Atlantic-Mediterranean. Central
Portugal is characterized by milder temperature ranging from 140C in January to
270C in August. Topographically, it consists of hills, dunes and valleys. Vegetation
cover is varied including citrus trees, pine forests, eucalyptus, cork and oak trees.
Figure 2 Location Map of the Study Area
Materials and Methods
8
2.2. Herpetological Species
Two amphibian species which are the focus of this research are Triturus marmoratus
(Figure 3a) and Triturus pygmaeus (Figure 3b). Sometimes T.pygmaeus is
considered as a sub species of T.marmoratus. Both have a varying mixture of green
and black colour. T.marmoratus when adult reaches up to 17 cm in length while
T.pygmaeus does not exceed 11 cm. In the red list of IUCN, T.marmoratus is
classified as “least concern while T.pygmaeus is labelled as “Near Threatened”
(www.amphibiaweb.org). Their normal habitat are vegetation covers, where they
can frequently be found in numbers under dead and rotting wood, and other hiding
places. They emerge from these refuges in the late evening and are primarily
nocturnal (IUCN, 2005).
a
b
Figure 3a Triturus marmoratus and Figure 3b Triturus pygmaeus
2.3. Species Occurrence Data
The species occurrence data were collected by Naturalis museum in Leiden, the
Netherlands. The field survey was conducted in 2000 using hand held GPS to
accurately measure the location of ponds where these species live. The presence of
species is confirmed by collection of eggs and other tissue samples for molecular
genetic identification (Themudo & Arntzen, in press).
A total of 69 samples were collected out of which 40 are for Abrantes area while 29
samples are for Fatima area. A second field trip was made from September to
October, 2006 in order to “ground truth” the species occurrence data collected
previously and as well as to observe the natural habitat and environmental
preference of the species. Global Positioning System (GPS) and IPAC instruments
were used to measure, observe and as well as to describe habitat requirements of the
species.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
9
2.4. Environmental Predictor Variables
2.4.1. Normalized Difference Vegetation Index (NDVI)
The NDVI images were obtained from VEGETATION database from
www.VGT.vito.be. NDVI layers were generated from SPOT-4 sensor with a spatial
resolution of 1km. It consists of 300 images (layers) generated every 10 days
(deckade) for a period of 8 years from April, 1998 to July 2006. Long term NDVI
images show the overall productivity of the ecosystem and thus can be used as
surrogate measure for net primary production (NPP) (e.g., Oindo & Skidmore,
2002). Best ground reflectance values are extracted pixel by pixel for each deckadal
image and geo-referenced to Albers equal area projection system. The NDVI layers
were resampled to 15m resolution.
As species- energy theory predicts, the species richness of an area increases with
productivity (Hutchinson, 1959). In order to assess the effect of NDVI on the
species distribution at aquatic and terrestrial phases, separate NDVI indices were
calculated for the dry season (June-August) and the rainy winter (breeding) season
(November-January). Late winter-early spring is reproduction (aquatic phase), larval
is spring, metamorphosis is early summer for both species.
2.4.2. Climatic variables
Different researchers using statistical results indicated that there might be cause and
effect relationship between climatic variables and species distribution, (e.g.O’Brien,
1998; Lennon et al., 2000; Badgley & Fox, 2000; H-acevedo & Currie, 2003). The
effect of climate could be direct or indirect (Lennon et al., 2000). Direct effect
implies that individuals experience less energy for growth and reproduction. Indirect
effect involves climatic variables controlling resource levels (through primary
productivity), creating competition, and as a result influencing population dynamics.
Climatic data was downloaded from WORLDCLIM database where input data were
gathered from variety sources from 1950-2000. All the environmental layers have a
spatial resolution of 1km and were interpolated using thin plate smoothing spline
algorithm (Hijmans, 2005). All climatic layers were resampled to 15m resolution.
2.4.3. Land Cover
Wide spread change on the land cover units due to human and climatic factors play a
significant role in determining the distribution of species. Land cover changes have
led to fragmentation of the natural habitats of species which eventually alters the
population of species (Joly, 2003). Corine land cover map of Portugal, produced in
2000 were used. The geometric and thematic accuracy of Corine land cover map
Materials and Methods
10
was assessed by Universidade Nova De Lisboa, in 2004. It was reported that the
geometric accuracy is better than 100m while its thematic accuracy is greater than
85% (Freire et al., 2004).
2.4.4. Landscape Matrices
Using the Corine 2000 and a projected land cover map (to be discussed in Chapter
4), landscape matrices such as total edge, connectivity, class area and percentage of
landscape were calculated at class level. The landscape matrices were used to assess
level of fragmentation within land cover classes and to investigate the implication
with regard to the presence of species in each land cover classes (Table 2). Fragstats
3.3® software was used for the calculation of landscape matrices. Detailed
description of each index is found on Fragstats 3.3®Help Manual.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
11
Table 1 Description of each land cover classes
Land Cover
Class Names
Class
code
Description
Urban 1 Most of the land is covered by structures and transport
network. Buildings, roads and artificially surfaced areas
cover more than 80% of the total surface.
Mine Area 2 Areas with open-pit extraction of construction material
(sandpits, quarries) or other minerals (open-cast mines).
Includes flooded gravel pits, except for river-bed extraction.
Arable Land 3 Include non-irrigated arable lands, permanently irrigated
lands and rice fields.
Permanent
Crops
4 Include vineyards, fruit trees, berry plantations and olive
groves.
Heterogeneous
Agriculture
5 Include annual crops associated with permanent crops,
complex cultivation patterns; land principally occupied by
agriculture with significant areas of natural vegetation, agro
forestry areas.
Broad Leaved
Forest
6 Vegetation formation composed principally of trees,
including shrub and bush understorys, where broad-leaved
species predominate.
Coniferous
Forest
7 Vegetation formation composed principally of trees,
including shrub and bush understorys, where coniferous
species predominate. Coniferous forest must cover 75% the
total surface of the unit.
Mixed Forest 8 Vegetation formation composed principally of trees,
including shrub and bush understorys, where neither broad-
leaved nor coniferous species predominate.
Shrub lands 9 Natural grassland, moor and heath land, and transitional
woodlands.
Open area 10 Open spaces, beaches, dune sands, bare rocks, burnt areas
and sparsely vegetated areas.
Water Courses 11 Natural or artificial water courses serving as water drainage
channels and also includes canals. Minimum width for
inclusion: 100 m.
Source: CORINE land cover nomenclature
Materials and Methods
12
2.4.5. Topographic Variables
Shuttle Radar Topography Mission (SRTM) digital elevation model (DEM) with a
spatial resolution of 90m was used as elevation data. Slope and aspect layers were
calculated from the DEM data layer using ArcGIS 9.1® spatial analyst. Further
more; the digital elevation layer was used to extract drainage networks of the two
study areas using ArcGIS 9.1® (Arc Hydro) extension.
2.4.6. Proximity Variables
Direct distance function was used to calculate proximity variables using ArcGIS
9.1® spatial analyst. ‘Proximity to vegetation’ cover represents distance from every
pixel to vegetation covers. Similarly, ‘proximity to drainage’ represents distance
from every pixel towards drainage.
Table 2 Landscape Metrics and formula used to calculate
Metrics Description
( )100( 1)
2
ijk
i i
n
j k
c
CONNECTn n
=
= −
∑
cijk = joining between patch j and k
(0 = unjoined, 1 = joined) of the
corresponding patch type (i),
based on a user specified
threshold distance.
ni = number of patches in the landscape
of the corresponding patch type
(class)
1
ij
i
n
j
x
MNn
==
∑
MN (Mean) equals the sum,
across all patches of the
corresponding patch type, of the
corresponding patch metric
values, divided by the number of
patches of the same type. MN is
given in the same units as the
corresponding patch metric
1
ik
m
k
TE e=
=∑
eik = total length (m) of edge in
landscape involving patch type
(class) i; includes landscape
boundary and background
segments involving patch type i.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
13
2.5. Multi-collinearity
Multi-collinearity refers to linear inter-correlation among variables (Dohoo et al.,
1996; Kovacs et al., 2005). It is a problem to separate the effects of two or more
variables on the response variable. The problem primarily occurs when predictor
variables are significantly correlated with each other than they are with the
dependent variable (Aguilera et al., 2006). Multi-collinearity can be detected using
pair-wise correlation between variables but sometimes existing linear dependencies
could be overlooked when only pair wise correlation is used (Mansfield & Helms,
1982). A more common method is to use Variance Inflation Factor (VIF) denoted by
2(1 )VIF R= −
----------------------------------------------------------------------- 4
The VIF indicate the inflation
in the variance of each regression coefficient
compared with a situation of orthogonality. The decision to consider a VIF
to be
large is essentially arbitrary (Giacomelli et al., 1998). Usually, values larger than 10
suggest that multi-collinearity may be causing estimation problems (Giacomelli et
al., 1998). VIF is calculated using the inverse correlation function in SAS-JMP 6.0®
statistical software; and taking the diagonal values that indicate VIF.
Table 3 Variance Inflation Factor
VIF
Abrantes Fatima
Annual Mean 43.8598 98.8054
Aspect 1.9797 1.6134
Altitude 15.7399 151.4647
Max. Temp 6.7107 166.0289
Min. Temp 29.94 197.3421
Precipitation 9.1727 187.7115
Radiation 4.2162 1.7623
Range(Temp) 6.7927 308.8814
Slope 1.9494 2.1211
Proximity to vegetation 1.7562 2.2796
NDVI(dry season) 4.2438 5.0482
NDVI(wet season) 4.8596 5.0716
Proximity to drainage 1.5691 1.7842
Materials and Methods
14
As Table 3 shows, annual mean temperature, altitude, minimum temperature,
precipitation and range of temperature have VIF >10, indicating multi-collinearity.
To overcome multi-collinearity problem and also reduce the number of predictor
variables, Principal Factor Analysis (PFA) (Skei et al., 2006) is used on all climatic
variables. PFA seeks to discover if the correlated variables could be explained by
smaller number of underlying variables called a “factor” (Field, 2000). Factor
analysis transforms each variable in to linear combination of orthogonal common
factors using the correlation matrix instead of the variance-covariance matrix which
is the case in principal component analysis (PCA). As a result PFA explains the
common variance between variables while PCA explains the total variance.
PFA is calculated using ILWIS 3.2 ®. The first principal factor (PFA-1) accounted
for (99.67%) of the common variance and hereinafter can be interpreted as “climatic
factor” of the climatic variables included in the principal factor analysis (Appendix
A).
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
15
3. Species Distribution Modelling (Using MaxEnt Model)
3.1. Introduction
Several approaches have been used for modelling species distribution. A popular and
frequently used method is Generalized Linear Model (GLM), e.g. (Pearce & Ferrier,
2000; Guisan & Zimmermann, 2000; Beck et al., 2005; Guisan et al., 2002). Others
include neural networks (Manel et al., 1999), and models using presence only data
such as Ecological Niche Factor Analysis (ENFA) (Chefaoui et al., 2005; Santos et
al., 2006; Martinez et al., 2006), Genetic Algorithm for Rule-set Production (GARP)
(Stockwell & Peterson, 1999; Sweeney et al., 2007) and Maximum Entropy
(MaxEnt) (Phillips et al., 2004; Phillips et al., 2006).
All research papers reviewed depicted MaxEnt is superior in performance (e.g.
Sergio et al., 2007; Phillips et al., 2006) than ENFA and GARP methods. Phillips et
al., (2006), described MaxEnt method performs well even for small sample size.
Hence, MaxEnt is adopted in this research.
3.2. Methods
3.2.1. Maximum Entropy (MaxEnt Model)
Maximum Entropy (MaxEnt) is a general-purpose machine learning method with a
precise mathematical formulation (Phillips et al., 2006). The basic idea of MaxEnt is
“to estimate (approximate) unknown probability distribution of a species” (Phillips
et al., 2006). The best approach is to ensure that the approximation should have
maximum entropy. Entropy is defined by Shanon, (1948) as how much ‘choice’ is
involved in the selection of an event”. Thus, maximum entropy refers to maximum
choice. Maximum choice is available when there are fewer constraints
(environmental layers), i.e. unnecessary constraints should be avoided (Phillips et
al., 2006).
The technique first constrains the modelled distribution to match certain features
(environmental layers) of empirical data (training data) and choosing the probability
condition that satisfies these constraints being as uniform as possible(Buehler &
Ungar, 2001). Basically, if a pixel in the study has similar distribution as of the
training data, then higher values are assigned and accordingly pixels with different
distribution are assigned lower values.
The user specified parameters were set as the following. Test percentage=25%,
regularization multiplier =0.2, maximum iteration = 1000, Convergence Threshold=
0.001, maximum number of background points=10000.
Species Distribution Modelling (Using MaxEnt Model)
16
3.2.2. Presence-Absence Maps
The out put of Maxent model was a continuous map. Hence, a threshold must be set
in order to determine the presence or absence of a species. Phillips et al., (2006),
used the minimum cumulative value of training sample points as a threshold.
However, in this research, a more conservative cut off value of “equal training
sensitivity and specificity” was used (Table 4). Using the cut off value, models were
classified in to binary (1 or 0) or presence- absence model. Common habitats of the
two species were determined by crossing models of both species for each study area.
Table 4 Equal training sensitivity and specificity threshold used to classify the
MaxEnt models
Study Area T.marmoratus T.pygmaeus
Abrantes 19.7 28.25
Fatima 14.38 28.54
3.2.3. Model Evaluation
Any approach to ecological modelling has little merit if predictions cannot be
assessed (Verbyla & Litvaitis, 1989). In general two independent datasets are
required to develop and test a model as ‘training’ and ‘testing’ data (Fielding, 1997).
Verbyla & Litvaitis, 1989 described jackknife and bootstrap re-sampling methods as
the most precise. However, methods of data partitioning are more often constrained
by limited availability of samples, as it is the case here. As a result, for Abrantes
area, 75% of the sample data were used for training the model and the remaining
samples were used for testing. In case of Fatima area, all the points were used for
training and the accuracy result was based on the same points used for training the
model.
3.2.3.1. Receiver Operating Characteristics (ROC) Curves
Receiver Operating Characteristics (ROC) curves are threshold independent methods
developed in signal processing and is widely used in clinical medicines (Zweig &
Campbell, 1993). An ROC curve is a graphical representation of the trade off
between the false negative (FN) and false positive (FP) rates for every possible cut
off value (Fielding and Bell, 1997). FN and FP are determined by cross tabulating
observed and predicted values of a model in a confusion matrix (Figure 4). Thus,
unlike the modelling phase which used presence only data, absence data were
required for evaluation.
A new approach designed by (Phillips et al., 2006), however, introduced a new
concept of ‘distinguishing presence from random rather than presence from
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
17
absence’. For every pixel in the study area a negative instance (random) is defined
while a positive instance is defined for the pixels containing presence data. Then the
model makes predictions without looking at the instances (positive and negative)
(For further explanation, see (Phillips et al., 2006). Kappa (KHAT) was calculated in
order assess the prediction is better than random prediction.
( )1
o e
e
p pk
p
− =
− ---------------------------------------------------------------------5
Figure 4 confusion matrix
3.2.4. Predictor Variable Importance
Importance of each environmental predictor variable was assessed using jackknife
operation. Jackknife operates by sequentially excluding one environmental variable
out of the model and running a model using the remaining variables. It also runs a
model using only the excluded variable in isolation. As a result, the gain
contribution of each variable to the total gain of the model (inclusive of all variables)
can be calculated. A variable which decreased the total gain of the model higher than
all the other variables when excluded and as well as a variable which contributed the
highest gain when used alone was identified as the most important variable. (For full
description, see MaxEnt tutorial on www. cs.princeton.edu).
3.2.5. Habitat Preference
A stratified random sampling scheme was adopted for extracting values from land
cover, the presence-absence model, altitude, ‘proximity to drainage’, ‘proximity to
vegetation’, slope, and aspect layers. First, random points were generated; then
sampling was supervised to ensure that all classes were represented proportional to
the class area and class distribution within the study area. Overall a total of 3761
points for Abrantes area and 6742 points for Fatima area were generated.
A contingency table test (Pearson’s Chi-Square statistics) depicting the frequency of
occurrence was calculated to assess the significant association of each species with
land cover classes and a t-test to assess whether there is statistically significant
Actual
+ -
Predicted
+
-
a b
c d
Species Distribution Modelling (Using MaxEnt Model)
18
difference on the suitable range of environmental conditions between the two
species and as well as on the presence and absence of each condition.
3.3. Results
The results of MaxEnt model are divided in to two parts: results for Abrantes study
area and Fatima area. The main results of the MaxEnt models include species
distribution maps, model evaluation, importance of predictor variables and habitat
preference.
3.3.1. Abrantes Area
3.3.1.1. Species Distribution Maps
The species distribution maps for the two species produced broadly similar
predictions (Figure 5a & 5b). For both species south eastern corner of the study area
is predicted as absent. The present-absent maps which were derived using equal
training sensitivity and specificity threshold (T.marmoratus=19.7 &
T.pygmaeus=28.25) shows the distribution of T.marmoratus is better from central to
the north-west part of the study area while T.pygmaeus is better predicted on central
south to north west and also on some eastern parts of the study area (Appendix F).
3.3.1.2. ROC Curves
The ROC curves for both species distribution maps indicated high accuracy. For
T.marmoratus, test data AUC=0.887 and training data AUC= 0.961 while for
T.pygmaeus test data AUC= 0.811 and training data AUC=0.925. Kappa statistics
showed a prediction better than expected by random (AUC=0.5) in all cases (Table 5
and Appendix G).
Table 5 ROC curve and Kappa statistics results for T.marmoratus and T.pygmaeus
using training and test data.
ROC Kappa
0.887(T.marmoratus) 0.774
Test Data 0.811(T.pygmaeus) 0.622
0.961(T.marmoratus) 0.922
Training Data 0.925(T.pygmaeus) 0.850
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
19
a
b
Figure 5 Predicted species distribution of (a) T.marmoratus (b) T.pygmaeus.
For comparison and statistical analysis purposes, these maps were converted
to binary presence-absence models using equal training sensitivity-specificity
threshold values.
3.3.1.3. Predictor Variable Importance
The jackknife operation resulted land cover as the most important variable for both
T.marmoratus and T.pygmaeus when used alone (Figure 6a & 6b). Similarly,
excluding the land cover variable significantly decreased the total gain of both
models higher than other variables. Wet and dry season NDVI variables have strong
gain values when used in isolation; however excluding both variables has little effect
on the total gain of the model.
Knowing this, a separate model using only the two NDVI variables was run in order
to assess the distribution of the species. For both species, this model delineated
similar distribution pattern as of the model inclusive of all the variables but as the
NDVI data has an original spatial resolution of 1km, the detail is not as good as the
model inclusive of variables. (Appendix B). For T.pygmaeus altitude has strong gain
when used alone. Other topographic variables such as slope and aspect do not have
much influence on the models as expected.
Species Distribution Modelling (Using MaxEnt Model)
20
a
b
Figure 6 a. & 6 b The environmental variable with highest gain
when used in isolation is corine land cover, which therefore
appears to have the most useful information by itself. The
environmental variable that decreases the gain the most when it is
omitted is corine land cover, which therefore appears to have the
most information that isn't present in the other variables.
3.3.1.4. Habitat Preference
The distribution of the species is significantly associated (P<0.0001) with land
cover for both species. For T.marmoratus, land cover explained 21.5% of the
distribution while for T.pygmaeus, 34.07% is explained (Table 6). Contingency table
test showed that T.marmoratus and T.pygmaeus have a common significant
association with heterogeneous agriculture (Chi Square (X2) =718.891) (Figure 7a
&7b). To a lesser degree but significantly, both species are also associated with
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
21
T.m
arm
ora
tus
0.00
0.25
0.50
0.75
1.00
Ara
ble
Bro
ad
Le
ave
d
Co
nife
rou
s
He
tero
ge
ne
ou
s
Min
eM
ixe
d F
ore
st
Op
en
Are
a
Pe
rma
ne
nt
Sh
rub
Urb
an
Wa
ter
Bo
die
s
Land cover Classes
Absence
Presence
mixed forests. In addition, T.marmoratus has significant presence on coniferous
forests, mixed forests and shrub lands. T.pygmaeus has association with arable land,
permanent crops and mixed forests. T.pygmaeus is completely absent from
coniferous forests. Both species are completely absent from broad leaved forests.
Appendix C shows Chi-square values and frequency of occurrence (count) of
species in each land cover class.
a
T.p
yg
ma
eu
s
0.00
0.25
0.50
0.75
1.00
Ara
ble
Bro
ad
Leaved
Conifero
us
Hete
rogeneous
Min
eM
ixed F
ore
st
Open A
rea
Perm
anent
Shru
b
Urb
an
Wate
r B
odie
s
Land cover classes
Absence
Presence
b
Figure 7a &7b Mosaic plot showing the percentage frequency occurrence of
T.marmoratus and T.pygmaeus in land cover classes
Species Distribution Modelling (Using MaxEnt Model)
22
A significant difference (P<0.0001) in proximity to vegetation between the two
species is found (Figure 9a). The mean distance for T.marmoratus,
µT.marmoratus=52.3m and for T.pygmaeus, µT.pygmaeus=38.23m (Figure 9a). Similarly,
there is a significant difference (P<0.0001) in mean altitude, for T.marmoratus,
µT.marmoratus=241.12m.a.s.l. and for T.pygmaeus, µT.pygmaeus= 219.79m.a.s.l. (Figure 9b)
Table 6 Chi-Square test showing the significant association of the species with Land
cover predictor variable.
The mean proximity to drainage of presence and absence of each species has shown
a significant difference (P<0.0001). T.marmoratus is present at a mean drainage
distance of 162.44m (range: 156.02m-168.86m) and absent at a mean distance of
215.072m (209.17m-220.97m). T.pygmaeus is present at a mean distance of
159.649m (range: 152.78m-166.52m) and absent 214.837m (range: 209.13-220.54).
Comparison between species shows that there is no significant difference (P=0.282)
(Figure 9c). Mean values and range of environmental variables where the newts
(T.marmoratus and T.pygmaeus) are present and absent, and comparison of the
presence and absence with the t-test are summarized in Appendix D.
No significant difference is found between the two species in preference of slope
and aspect. However, it was observed that in general there is a preference by both
species to inhabit relatively flat areas. Response curves of each predictor variable for
both species have more or less similar shape and pattern except for ‘climate factor’
and slope variables (Appendix H).
Species DF Chi-Square Prob>ChiSq RSquare(U)
T.marmoratus 10 749.049 <.0001 0.2125
T.pygmaeus 10 1165.935 <.0001 0.3407
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
23
100
200
300
400
Altitude
T.marmoratus T.pygmaeus
Species
0
100
200
300
Pro
xim
ity t
o V
egeta
tion
T.marmoratus T.pygmaeus
species
a
b
0
100
200
300
400
500
600
Pro
xim
ity to
Dra
ina
ge
T. marmoratus T. pygmaeus
Species
c
Figure 8a, b & c box plot graphs
showing comparison between
presence of T.marmoratus and
T.pygmaeus presence against
suitable environmental predictor
variables
3.3.1.5. Landscape Indices vs. Presence of Species
The largest presence of both species is found in heterogeneous agricultural lands
which constitute 18.2% of the landscape (Table 7). In contrast, broad leaved forest
covers 18.7% but there is no significant presence of species. Shrub lands have the
largest area coverage of 22.7% with a medium presence of T.marmoratus and
T.pygmaeus. Mixed forests have only 5.2% in area coverage but a reasonably strong
presence of both species is found. Coniferous forest covers 16.5% where there is a
moderate presence of T.marmoratus. Connectivity of patches within land cover
Species Distribution Modelling (Using MaxEnt Model)
24
classes shows that urban areas have the highest connectivity followed by coniferous
and mixed forests.
Figure 9 Comparison of landscape area percentage and percentage presence of
species of each land cover class.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
25
Table 7 Landscape indices of corine 2000 land cover
Table 8 Comparison of suitable environmental range of T.marmoratus and T.pygmaeus with t-test
Landscape matrices at Class Level Presence%
Abrantes Land cover
Class
Area
% of
Landscape Connectivity T.marmoratus T.pygmaeus
Urban 614.000 0.533 0.9524 0 0
mining 100.000 0.08 0 0 0
Arable Land 7051.000 6.12 0.085 0 21.67
Permanent Crops 11396.000 9.89 0.1317 1.2 30.63
Heterogeneous
Agriculture 21054.000 18.277 0.062 55.31 59.92
Broad Leaved
Forest 21623.000 18.77 0.1941 2.59 0
Coniferous 19093.000 16.5746 0.5 25.58 0
Mixed Forest 5970.000 5.1826 0.277 24.12 31.76
Shrub lands 26175.000 22.7225 0.115 29.31 12.53
Open Area 517.000 0.44 0 0 0
Water Bodies 1532.000 1.32 0 0 0
T.marmoratus T.pygmaeus
Abrantes Mean Range Mean Range t-value DF Benferroni Adjusted P
Aspect 123.889 117.63-130.15 139.077 132.83-145.33 3.36932 2468.928 0.001
Altitude 242.126 239.03-245.22 219.796 217.15-222.44 -10.7671 2339.116 <0.0001
Drainage Distance 162.433 156.02-168.85 159.644 152.78-166.51 -0.58231 2447.613 0.5604
Slope 3.06201 3.3143-3.2505 3.57965 3.3143-3.845 3.12039 2188.078 0.0018
Vegetation Distance 52.3854 48.62-56.15 38.2344 35.367-41.102 -5.86612 2341.102 <0.0001
Species Distribution Modelling (Using MaxEnt Model)
26
3.3.2. Fatima Area
3.3.2.1. Species Distribution Maps
The species distribution maps in Fatima area show that T.pygmaeus has an extensive
distribution all over the area (Figure 10b). T.marmoratus is absent on a large
proportion of the north eastern part of the study area (Figure 10a) indicating a strong
difference on preference of habitat compared to T.pygmaeus. The present-absent
binary model using the cut off value (for T.marmoratus = 19.46 &
T.pygmaeus=22.814) gives a better picture of the distribution of the two species
(Appendix F).
a
b
Figure 10 Predicted species distribution of (a) T.marmoratus (b) T.pygmaeus.
For comparison purposes these maps were converted to binary presence-
absence models using equal training sensitivity-specificity threshold values.
3.3.2.2. ROC Curves
The ROC curves indicated a training AUC= 0.915 for T.marmoratus and AUC=
0.839 for T.pygmaeus. Kappa statistics has shown a better than random prediction
(AUC=0.5) for both species (Table 9 and Appendix G).
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
27
3.3.2.3. Predictor Variable Importance
Jackknife operation revealed that for T.marmoratus, land cover is the most important
predictor variable when used alone and omitted from the model (Figure 11 & 12).
Similarly, dry and wet season NDVI variables have strong gains when each variable
is used alone but did not decrease the total gain of the model when excluded.
Altitude also has strong gain value but excluding the variable out of the model did
decrease the total gain considerably. Unlike the other cases, ‘proximity to drainage’
is the most important variable for T.pygmaeus when used alone and excluded from
the model. Land cover follows as the second most important variable. For
T.pygmaeus slope, aspect, altitude and NDVI variables have little contribution to the
model and no significant gain is lost when each variable is excluded.
Table 9 ROC (AUC) and Kappa accuracy assessment
ROC Kappa
0.915(T.marmoratus) 0.8 Training Data
0.839(T.pygmaeus) 0.648
3.3.2.4. Habitat Preference
The distribution of the both species has significant (P<0.0001) association with land
cover (Table 10). However, a strong difference in habitat preference was found
between the two species. T.marmoratus primarily prefers to live in mixed,
coniferous forests and also to some degree inhabits permanent crops (Figure 13a).
T.pygmaeus is strongly associated with coniferous forests. To some extent, extent,
T.pygmaeus is also associated with permanent crops and even in urban areas (Figure
13b). However, T.marmoratus avoids heterogeneous agricultures while T.pygmaeus
is strongly associated with the same class. Similar to Abrantes area, both species are
absent from broad leaved forests.
Table 10 Chi-Square test showing the significant association of the species with
land cover
Species DF Chi-Square Prob>ChiSq RSquare(U)
T.marmoratus 11 2577.321 <0.0001 0.4116
T.pygmaeus 11 2031.298 <0.0001 0.2994
Species Distribution Modelling (Using MaxEnt Model)
28
a. T.marmoratus
Figure 11 The environmental variable with highest gain when used in
isolation is corine land cover, which therefore appears to have the most useful
information by itself. The environmental variable that decreases the gain the
most when it is omitted is again corine land cover which therefore appears to
have the most information that isn't present in the other variables.
b. T.pygmaeus
Figure 12 The environmental variable with highest gain when used in
isolation is drainage distance, which therefore appears to have the most
useful information by itself. The environmental variable that decreases the
gain the most when it is omitted is drainage distance, which therefore appears
to have the most information that isn't present in the other variables.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
29
T.m
arm
ora
tus
0.00
0.25
0.50
0.75
1.00
Ara
ble
Land
Bro
ad L
eaved F
ore
st
Conifero
us F
ore
st
Hete
rogeneous
Agriculture
Min
e A
rea
Mix
ed F
ore
st
Open S
pace
Perm
anent
Cro
ps
Shru
bs
Urb
an
Wate
r B
odie
s
Land cover classes
Absence
Presence
a
T.p
yg
ma
eu
s
0.00
0.25
0.50
0.75
1.00
Ara
ble
La
nd
Bro
ad
Le
ave
d F
ore
st
Co
nife
rou
s F
ore
st
He
tero
ge
ne
ou
s
Ag
ricu
ltu
re
Min
e A
rea
Mix
ed
Fo
rest
Op
en
Sp
ace
Pe
rma
ne
nt
Cro
ps
Sh
rub
s
Urb
an
Wa
ter
Bo
die
s
Land cover classes
Absence
Presence
b
Figure 13 a &b, Mosaic plot showing the frequency of occurrence of the
species in each land cover class
Species Distribution Modelling (Using MaxEnt Model)
30
Significant difference (P<0.0001) was found between altitude range of values that
contributes to the distribution gain of T.pygmaeus and T.marmoratus (Table 12).
Mean altitude of presence for T.marmoratus is 107.361m (range: 105-109.72) while
for T.pygmaeus, the mean altitude of presence is 138.716m (range: 133.84-143.59)
(Figure 14b). In addition, altitude response curve of T.marmoratus shows that it has
a convex relationship with the highest gain value at 125m.a.s.l. For T.pygmaeus the
response curve has concave shape, i.e. the gain value decreases at lower altitude then
starts to increase at about 200m.a.s.l. (Appendix I)
On places where T.pygmaeus is present, it occurs within a mean distance of 99.78m
from drainages which significantly (P<0.0001) differ from T.marmoratus having a
mean distance of 179.434m (Figure 14a). The gain value of proximity to drainage
linearly decreases for T.pygmaeus whereas for T.marmoratus it continuously
increases as distance increases.
The response curve shape of slope and aspect predictor variables are similar for both
species and have little gain contribution to the model. The response curves indicate
that maximum gain is attained at 50 slope and aspect of 150
0-250
0 from north. In
terms of proximity to vegetation covers, T.marmoratus has a mean distance of
150.885m (range: 140.17-161.6) while T.pygmaeus is present at a mean distance of
177.458m (range: 168.35-186.57) (Figure 14c).
3.3.2.5. Landscape Indices vs. Presence of Species
Coniferous forests cover only 4.6% of the total landscape area but have the largest
presence of both species (Table 11). Also strong presence of T.marmoratus is
associated with mixed forests which have 9.73% area coverage. Heterogeneous
agricultures have the largest area coverage of 32% with strong presence of
T.pygmaeus (Figure 15). In terms of connectivity of patches, coniferous forests have
the highest connectivity of 1.61 next to water bodies and shrub lands. Mixed forests
have connectivity value of 0.69.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
31
0
100
200
300
400
500
600
Altitu
de
T.marmoratus T.pygmaeus
Species
0
100
200
300
400
500
600
700
800
900
1000P
roxim
ity to
Dra
ina
ge
T. marmoratus T.pygmaeus
Species
a
b
0
100
200
300
400
500
600
700
800
900
1000
Pro
xim
ity t
o V
eg
eta
tio
n
T.marmoratus T.pygmaeus
Species
c
0
2
4
6
8
10
12
14
16
Slo
pe
T.marmoratus T.pygmaeus
Species
d
Figure 14a, b, c & d box plot graphs showing comparison between species
presence against predictor variables
Species Distribution Modelling (Using MaxEnt Model)
32
Figure 15 Comparison of Landscape area percentage of each land cover class and
presence of Species
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
33
Table 11 Landscape indices of Fatima land cover
Landscape matrices at Class Level Presence %
Fatima Land cover
Class
Area
% of
Landscape Connectivity T.marmoratus T.pygmaeus
Urban 7692.590 4.52 0.1057 0 40.28
mining 2119.005 1.245 0.395 0 0
Arable Land 6244.515 3.66 0.6 0 0
Permanent Crops 25419.930 14.9 0.59 51.3 24.25
Heterogeneous Agriculture 54535.590 32.05 0.54 5.98 56.67
Broad Leaved Forest 11989.868 7.04 1.3 0 0
Coniferous Forest 7851.870 4.6145 1.6156 57.61 64.08
Mixed Forest 16556.190 9.73 0.698 57.75 9.65
Shrub lands 27126.203 15.942 0.434 0 2.82
Open Area 1773.585 1.04 2.56 0 0
Water Bodies 8845.335 5.1 7.1 0.28 0
Table 12 comparison of suitable environmental range of T.marmoratus and T.pygmaeus with t-test
T.marmoratus T.pygmaeus
Fatima Mean Range Mean Range t-value DF
Benferroni
Adjusted P
Altitude 111.474 109.36-113.58 138.716 133.84-143.59 10.05469 2518.612 <0.0001
Aspect 188.334 183.57-193.1 204.673 200.31-209.04 4.958072 2855.806 <0.0001
Proximity to
Drainage 179.51 165.2-193.82 99.781 96.3-103.26 -10.6222 1380.288 <0.0001
Slope 5.08017 4.9114-5.249 4.28833 4.1657-4.411 -7.44475 2434.439 <0.0001
Vegetation Distance 150.885 140.17-161.6 177.458 168.35-186.57 3.705408 2726.505 0.0002
Future Species Distribution (Scenario Model)
34
3.4. Discussion
3.4.1. Habitat – Species Relationship
The MaxEnt model result depicted land cover as the most important variable for
both species in Abrantes area. The same applies for T.marmoratus in Fatima area.
However, for T.pygmaeus in Fatima area, ‘proximity to drainage’ is the most
important variable followed by land cover as the second most important variable. In
addition, the contingency table test (Chi-square Pearson’s correlation) indicated that
the distribution of both species is significantly associated with land cover in the two
study areas.
In Abrantes area, the distribution of T.marmoratus and T.pygmaeus species is
significantly associated with heterogeneous agriculture (Figure 16 a). Other
predictor variables such as ‘proximity to vegetation’ cover and ‘proximity to
drainages’ also plays an important role. For example, areas predicted as absent in
most cases are far away from vegetation covers.
In Fatima area, the association of T.marmoratus shifts to mixed and coniferous
forests (Figure 16b). It is interesting to note that T.marmoratus is almost fully absent
from heterogeneous agricultural lands. Instead it is strongly present on permanent
crops. T.pygmaeus maintains its association with heterogeneous agricultural lands in
addition to its significant presence in coniferous forests. Figure 16a & 16b shows the
correspondence analysis plot of the common habitat area (land cover class) in the
two study areas.
The description of permanent crops and heterogeneous agricultural classes indicates
that both classes have strong similarities. For instance, heterogeneous agriculture
among other cover types is comprised of permanent crops such as olives, fruit trees
and vineyards. This implies, at the scale of corine land cover map, there is a strong
possibility that small patches of permanent crops could be mapped as heterogeneous
agriculture. On the contrary, if these two classes have similarities, it raises the
question again, why T.marmoratus is absent from heterogeneous agricultural classes
in Fatima area?
In order to answer this question, distance is calculated from heterogeneous
agriculture and permanent crop classes to vegetation cover (forest) classes. In
Abrantes area, patches of heterogeneous agriculture are significantly closer
(P<0.0001) to vegetation covers (Figure 17a) where as in Fatima area permanent
crop class patches are significantly closer (p<0.0001) to vegetation cover (Figure
17b). Hence, it appears that T.marmoratus prefers to inhabit patches closer to
vegetation covers than T.pygmaeus.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
35
In all the models, topographic variables such as slope and aspect did not have much
contribution to the total gain of the models as expected. A possible reason could be
the resolution of SRTM data (90m), which may not be good enough to resolve the
importance of the variables. But generally the model indicated, T.marmoratus and
T.pygmaeus prefer flat areas (0-100) and aspect ranges between 120
o-220
o, i.e. south
east-south west direction.
-2
-1.5
-1
-0.5
0
0.5
1
1.5
2
C1
1 10 11 2 3 4
5
6 7
8
9 Absence
Presence
1
10
11 2 3 4 5 6 7 8 9
Ab
se
nce
Pre
se
nce
Land Cover Classes
a
-2
-1.5
-1
-0.5
0
0.5
1
1.5
2
C1
1 10 11 2 3
4
5 6
7
89
Absence
Presence
1
10
11 2 3 4 5 6 7 8 9
Ab
se
nce
Pre
se
nce
Land Cover Classes
b
Figure 16 Correspondence analysis plot showing a) Heterogeneous
agricultures(class 5) as common habitat for T.marmoratus and
T.pygmaeus in Abrantes area b) coniferous forests(class 7) as common
habitat for T.marmoratus and T.pygmaeus in Fatima area
Future Species Distribution (Scenario Model)
36
0
500
1000
1500
2000
2500
3000
3500
4000
4500
5000
Dis
tance
hete
rogeneous
agri
culture
perm
anent
cro
ps
Class
0
500
1000
1500
2000
2500
3000
3500
4000
4500
5000
Dis
tance
Hete
rogeneous
Agri
culture
perm
anent
cro
ps
class
Figure 17a &17b Comparison of permanent crops and heterogeneous
agriculture class patches proximity to vegetation covers. In Abrantes area
heterogeneous classes are closer to vegetation covers where as the opposite is
true in Fatima area.
The contribution of NDVI variables (both dry and wet season) is significant when
used in isolation, but the total gain of the model did not decrease when the variables
are omitted. This implies that the information available in the NDVI variables is also
present in some other variables. This may be integrated effects of climate, soil type,
altitude, slope and other factors manifested on NDVI values (Box et al., 1988).
Using only the two NDVI variables on a separate model, delineated the general
distribution of the species but did not outline details as good as the model including
all variables because the original resolution of the data is 1k x1km.
In Abrantes area, T.pygmaeus appears to inhabit areas close to vegetation cover than
T.marmoratus, even though the difference in distance is small (< 12m). In contrast,
in Fatima area, T.marmoratus is present significantly closer to vegetation covers at a
mean distance of 137.5m than T.pygmaeus that is present at a mean distance of
195.1m. In general, both species tend to be present on areas close to vegetation
covers. This agrees with Guerry & Hunter, (2002) that found out positive
associations of nine amphibian species with proximity to forest habitats from ponds.
For amphibians, the aquatic habitat are used for reproduction while vegetation cover
serve as hide outs from predators, richer food availability (Skei et al., 2006) and as a
shade during their terrestrial phase. Preservation of both habitats is therefore vital to
maintain local populations (Porej et al., 2004) and buffer zones surrounding their
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
37
habitat should be protected (Semilitsch & Bodie, 2003). Moreover, forests increase
the connectivity of the landscape and decrease the degree of isolation.
In case of proximity to drainages, no significant difference was found in Abrantes
area (p=0.564) where as in Fatima area there is a clear demarcation showing
T.pygmaeus living very close to drainages (µ=99.7m) compared to T.marmoratus
which is found at a mean distance of 179.5m.
Altitude in the study areas ranges from 0-600m.a.s.l. however T.marmoratus and
T.pygmaeus have shown a tendency to inhabit topographically lower areas. Mean
altitude of presence for Abrantes area is, µT.marmoratus=241.12m and µT.pygmaeus=
219.79m and for Fatima area, µT.marmoratus= 107.361m and µT.pygmaeus= 158.319m. Skei
et al., (2006), explained larval development of amphibians in general is strongly
correlated with water temperature and thus also with altitude.
The distribution map (Figure 10a) shows T.marmoratus is absent from a large area
extent characterized by scrubs, open spaces and mining areas. On the same area,
T.pygmaeus is strongly present. This concurs with Marty et al., (2005) that described
migrating post breeding T.marmoratus were found out to be strongly attracted to
forests and have avoided large barren areas. Also Sztatecsny et al., (2004), described
T.marmoratus occupies hilly and wooded areas in France.
3.5. Conclusions
From the results of the models and the statistical analysis in both study areas, the
following can be concluded. (1) Land cover was the most important variable for both
species in Abrantes area and also for T.marmoratus in Fatima area. For T.pygmaeus
‘proximity to drainage’ was the most important variable in Fatima area followed by
land cover. (2) Heterogeneous agriculture was the primary common habitat in
Abrantes area. To a lesser extent; both species were associated with mixed forest
covers. In Abrantes area, mixed forests and in Fatima area, coniferous forests
constitute a small part of the landscape but still were common habitat to both
species. This implies that mixed and coniferous forests were ecologically important.
(3) Significant difference between the species in terms of mean suitable
environmental ranges was observed. Overall, the MaxEnt model consistently
outlined, the presence of both species among other things appears to be determined
by the availability of vegetation cover within close proximity (< 200m) to its aquatic
habitat. T.marmoratus avoided large barren areas. The same applies for T.pygmaeus,
but it had also shown higher adaptability to less vegetated areas and some times even
on open spaces and urban areas.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
39
4. Future Species Distribution (Scenario Model)
4.1. Introduction
A scenario showing future prediction of the distribution of T.marmoratus and
T.pygmaeus for Abrantes area is predicted based on Multi Criteria Evaluation
(MCE) method. The Maxent model highlighted land cover as the most important
predictor variable for the distribution of T.marmoratus and T.pygmaeus. The
scenario model tries to show one aspect of the future species distribution, i.e. other
variables being unchanged what will happen if land cover continues to change in the
same trend. Therefore, the basic assumption underlying this prediction is the rate
and nature of change on land cover is assumed to continue in a similar manner.
Figure 18 Schematic Diagram of the scenario model
The MCE model incorporated variables such projected land cover map (2020),
altitude, climate, slope, aspect, distance from drainage and distance from vegetation.
First, Multi Layer Perception (MLP) neural network is used to detect the change in
Future Species Distribution (Scenario Model)
40
land cover between 1990 and 2000, and then Markov chain analysis is used to
project the land cover classes to 2020 based on the changed (transition) classes.
IDRISI Andes ®edition (Land change modeller) module was used for all the
procedures in this section. The sections below discuss the methods at each stage in
detail.
4.2. Methods
4.2.1. Multi Layer Perception (MLP) Neural Networks
Multi Layer Perception (MLP) is one of the most widely used neural networks in
remote sensing (Atkinson, 1997). Various researchers applied neural networks for
different applications. For example, prediction of forest deforestation (Mas et al.,
2004, Garzon et al., 2006), mapping species richness and composition of forests
(Carpenter, 1999, Foody & Cutler, 2006 and Boyd et al., 2002), fire prediction
(Fraser & Li, 2002) and environmental damage assessment (Abuelgasim, 1999)
Multiple layer perception (MLP) neural networks were used to detect land cover
classes changed between 1990 and 2000. The MLP procedure creates random
samples from the transition land cover units (changed during 1990 and 2000) and as
well as from land cover units that did not go through transition or are “persistent”
classes. The neural network will be trained with examples of both transition and
persistent land cover units. The MLP uses half of the training pixels as training data
and retains the other half for testing.
Along with transition land cover classes included in the model are explanatory
variables assumed to explain the main causes of land cover change. These include
proximity to roads, proximity to towns, and fire occurrences (from 1990-1999).
Hence, the neural network has a topology of four input nodes along with two hidden
layers and an output layer.
‘Proximity to towns’ and ‘proximity to roads’ variables were set to be dynamic, i.e.
at each iteration stage as new changed areas emerge, the distance from these two
variables will be recalculated and submitted to the MLP neural network. The MLP
then applies a weight to calculate new transition areas. The assumption taken here is
that more changes tend to occur to places where changes have already occurred. In
order to concentrate on major land cover changes rather than small changes, to
minimize the number of changed classes and increase the accuracy of the neural
network, classes showing changes less than 3000ha were not considered as changing
classes. A sigmoidal (s-shape) non linear function was applied and the accuracy of
the training was determined by Root Mean Square Error (RMSE) (Figure 19).
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
41
a
b
Figure 19 a) Neural network topology and b)
accuracy of the neural network
4.2.2. Markov Chain Analysis
A Markov chain is a dynamic process that consists of a finite number of states and
some known probabilities (Logofet & Lesnaya, 2000). The finite states are measured
in discrete intervals and the maximum discrete states do not change with time
(Yemshanov & Perera, 2001). An existing discrete state Wt can be used to predict a
discrete state Wt+1 by multiplying a matrix of transition probabilities (Pt)
corresponding to the current time interval t
t t tW +1=W P ,---------------------------------------------------------------------------6
where Wt is a 1×n state vector at time t, Pt is a n×n matrix of transition probabilities
created of Pij values, n is the maximum number of discrete states in the chain, and
pij gives the probability of transition from discrete state i to state j between time t
and t+1.
Future Species Distribution (Scenario Model)
42
11 12 13
21 22 23
31 32 33
t
p p p
p p p p
p p p
=
-------------------------------------------------------------------------------7
The transition probability matrix (Pt) in this case, is calculated by the MLP neural
networks using land cover maps of 1990 and 2000. Then Markov models have been
applied for projecting land cover changes to the future, i.e. 2020 based on the
transition probabilities. The out put is a land cover map for the year 2020 which will
be used in a multi criteria evaluation (MCE) model along with other explanatory
variables. Various researches have used Markov chain for different applications, e.g.
forest succession (Logofet & Lesnaya, 2000); Lopez et al., 2001) and Aavikoso,
1995).
4.2.3. Multi Criteria Evaluation
Multi-Criteria Evaluation (MCE) is a decision support system primarily concerned
with how to combine the information from several criteria to form a single index of
evaluation. The use of MCE is particularly important when statistical relationships
between target variables and predictor variables could not be established due to
unavailability of empirical data. In such cases, the use of expert knowledge or any
other sources of objective information overcome this problem (Store & Kangas,
2001). Establishing weights of variables is a complicated aspect for which the most
commonly used technique is the pair-wise comparison matrix (Boteva et al., 2004
and Store & Kangas, 2001). Nevertheless, on this research, the weights of each
predictor variable were calculated from MaxEnt model where the relationship
between habitat and species has already been established. The basic premise of this
approach is that the best information available is the current geographic distribution
itself (from MaxEnt model) and based on that the future distribution can be inferred.
Basically, a two step process was taken. First, each predictor variable was
standardized between 0-1 range using fuzzy logic (1) approach, and (2) weights for
each predictor variable were calculated from (the jackknife operation in MaxEnt).
1.0 Fuzzy logic
Zadeh, (1988), described fuzzy logic as ‘every thing including truth is a matter of
degree’. A fuzzy set is a class of objects where there is no sharp boundary between
those object that belong to the class and that do not (Zadeh, 1970). Mathematically it
can be expressed as
Definition: A= ( ){ }),( xAx µ , x ∈X--------------------------------------------------8
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
43
Where )( XAµ is the grade of membership of x in A, and MXA →:µ is a function
of X to a space called M called the membership function space.
Predictor variables were standardized between 0-1 range using fuzzy logic
membership functions. For each predictor variable, the types of membership
functions were determined after analyzing the response curves (Appendix G) from
MaxEnt model. Mainly sigmoidal (S-shape) and J-shaped membership functions
were used in the fuzzy logic. Both membership functions have three different types.
These are monotonically increasing, monotonically decreasing and symmetric
(Figure 20) (Eastman, 2003).
For the projected land cover map which has categorical data, user defined
membership function for each category were assigned based on the degree of
association with the presence of species on each land cover class found in the
contingency table (Figure 7a & 7b). Then each predictor variables were converted to
Byte data type using the corresponding membership functions.
a)
b)
c)
d)
Figure 20 Sigmoidal a) monotonically increasing function (b)
monotonically decreasing (c & d) symmetric functions
Future Species Distribution (Scenario Model)
44
(2) Weights
The weights of each predictor variable were determined based on the gain
contribution of each predictor variable to the model (MaxEnt jackknife operation).
The gain values were standardized so that the weights add up to one (Table 16). The
linear combination of the variables results the out put model.
Presence= f (EV1*(W1) + EV2*(W2) +EV3*(W3) +------EV n (Wn)) ----------------------9
Where EV are environmental layers and Wn are weights of environmental layers.
4.2.4. Comparison of MaxEnt and MCE Model
The MaxEnt and MCE model used different algorithms; a certain value on MaxEnt
model does not represent equal value on the MCE model. However, both models
represent the distribution of species. Thus, for comparison purpose only, we have
taken mean plus standard deviation (µ+ S.D) of the distribution as a threshold to
assess the change between the present (MaxEnt) and the future (MCE) species
distribution. Outputs of the computed MCE model were multiplied by 100 prior to
the comparison.
The accuracy of the MCE model could not be directly assessed as there could not be
species occurrence data of the future distribution. However, since the MCE model is
based on current MaxEnt distribution, a reasonable agreement is expected between
the MaxEnt and MCE scenario model. Hence, a coefficient of agreement, Kappa
was calculated.
Table 16: Weights applied on predictor variables for the MCE model
Weights
Predictor Variables T.marmoratus T.pygmaeus
Altitude 0.159 0.25
Aspect 0.076 0.107
Corine Land cover 0.302 0.285
Climate Factor 0.175 0.178
Proximity to Drainage 0.063 0.035
Proximity to Vegetation 0.127 0.11
Slope 0.098 0.035
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
45
4.3. Results
The results in this section include: projected land cover map, landscape matrices
(calculated from land cover maps of 2000 and 2020 in order to analyze the changes
in land cover during the specified time period) and a scenario model.
4.3.1. Land cover change Analysis
The projected land cover map indicated that significant changes in area extent, mean
patch area and total edge (TE) occur on land cover classes. Heterogeneous
agricultural areas decrease by 25.3% and coniferous forest decreases by 37.9%. On
the contrary, shrub lands, mixed and broad leaved forests increase by 18.3%, 30%
and 36.4% respectively (Table 17).
The total edge and mean patch area in each class gives critical information on the
level of fragmentation within a landscape. Compared with the 2000 land cover map,
the projected land cover (2020) map indicates that fragmentation significantly
increases. Mean patch area of all land cover classes except for urbane area decreases
significantly. For instance, mean patch area of coniferous forests decrease from
545.5 ha to 109.75 ha and heterogeneous agriculture decrease from 131.5ha to
22.8ha. Similarly, total edge of all land cover classes increases except for urban area.
Overall the landscape matrices indicate that if the land cover change continues in the
same pattern as that of during 1990 and 2000, fragmentation significantly increases
within the landscape. The fragmentation will be more pronounced in coniferous
forests and heterogeneous agriculture classes which from the result of the Maxent
model were identified to be primary habitats of T.marmoratus and T.pygmaeus.
4.3.2. Future Scenario Model
Comparison of the scenario model with the current MaxEnt model using Kappa
(coefficient of agreement) indicates a value of 0.756 and 0.704(Appendix J) for
T.marmoratus and T.pygmaeus respectively. Visual comparison also shows strong
similarity. This indicates the weights and fuzzy membership functions used from the
MaxEnt model maintained the relationship between the species and environmental
predictor variables.
For T.marmoratus, areas above the mean plus standard deviation (µ+ S.D) threshold
value become patchy, scattered and reduced in area extent (Figure 21). There are
also new areas on the western part of the study area where T.marmoratus appears to
be present. Total area above the threshold value on the future MCE model is
13609.845ha where as on the current MaxEnt model, area coverage is 16649.88ha.
Similarly, the distribution of T.pygmaeus on the MaxEnt model has area coverage of
Future Species Distribution (Scenario Model)
46
22916.94 ha where as on the MCE model, the area is 19772.43ha. The decrease in
area extent for T.marmoratus is 18.25% where as for T.pygmaeus, it decreased
by13.72%.
a. Current (MaxEnt) model
b. Future scenario model
c. Current (MaxEnt) model
d. Future scenario model
Figure 21 Comparison of MaxEnt and MCE scenario model distribution of
T.marmoratus and T.pygmaeus. Mean plus standard deviation of the
distribution was used as a threshold.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
47
4.4. Discussion
4.4.1. Impacts of Land cover changes
Chapin et al, (2000), described land cover transformation constitutes the greatest
threat to biodiversity. Impacts of changes in patterns of land cover through time
affect the ecological system by altering the proportion and distribution of habitats for
species that these cover types support (Olsen, 2006; Joly et al., 2003; Velázquez et
al., 2003; Linderman et al., 2005). The projected land cover map using Markov
chain analysis thus enables us to assess the trend of change and thereby investigate
the impacts on species distribution.
Markov chain models take several assumptions of which the fundamental approach
is to regard land use/cover change as a stochastic process - one that is governed by
random variables and describable only in probabilistic terms (Bell, 1974). On this
paper, land cover change is assumed to continue in the same pattern and manner as it
did during 1990 and 2000, i.e. stationarity is assumed but not tested. Testing the
stationarity of land cover goes well beyond the scope this paper. On most researches,
stationarity has usually not been tested but assumed (Weng, 2002). Conversely, Bell,
(1974), argued if the process is not stationary, it does not mean all the predictive
power of the model is lost but we shall lose the ability to determine steady state
equilibrium distribution. The out come of the model should be interpreted from the
point of view that land cover continues to change in the same pattern as it did during
1990 and 2000.
Land use/cover change is the leading cause of habitat loss and fragmentation which
leads to population isolation and possibly extinction (Pearson et al., 1999). Fahrig,
(2003), explained fragmentation of habitats is characterized by at least two
phenomena; (a) a reduction of total area of a land cover and mean area coverage of
patches, i.e. at some point there will be insufficient area of habitat to support a viable
population, and with continued loss eventually there will be insufficient area of
habitat to support even a single individual and b) the total edge of the patches within
a landscape increases.
The projected land cover has shown that total area coverage of coniferous forests
and heterogeneous agriculture decreases while broad leaved and mixed forests gain
area. Field observation confirms this trend, in central Portugal where recurrent fire
occurrences are pervasive, burned pine areas are replanted with eucalyptus trees,
primarily because the regeneration time of pine trees is slow compared to
eucalyptus. Mean patch area and total edge (TE) landscape matrices revealed
fragmentation severely increases in most land cover classes. For instance, mean
patch area of heterogeneous agriculture decreases from 131.5ha to 22.5 ha and
Future Species Distribution (Scenario Model)
48
coniferous forests mean patch area decrease from 545 ha to 109.7 ha. Accordingly,
the total edge increases more than two-fold for coniferous forests and in case of
heterogeneous agriculture, it increases by 17.8%. On the contrary, the total area of
mixed forest increases by 1.6%. Even so, the mean patch area significantly reduced
from 127ha to 8.9ha and the total edge increased by more than two-fold. All in all,
the projected map highlighted the fact that fragmentation of habitats is going to be
severe and its implications on the species distribution and diversity is at task for
conservation planners to tackle.
4.4.2. What if land cover continues to change… (Scenario Model)
The accuracy of the model could not be validated since it is a future scenario model.
However, visual comparison and Kappa (coefficient of agreement) with the MaxEnt
model (current distribution) indicated the weights and membership functions used
maintained the relationship established on the MaxEnt model.
The trend of land cover change in Abrantes area appears to affect T.marmoratus
slightly more than T.pygmaeus. This is primarily because the two land cover classes
(heterogeneous agriculture and coniferous forest) that T.marmoratus is strongly
associated faces major fragmentation, in terms of reduction of mean patch area, total
area and significant increase in total edge. However, this is not to imply T.pygmaeus
will not be affected since it is also strongly associated with heterogeneous
agriculture. However, T.pygmaeus is not associated with coniferous forests where
the highest fragmentation occurs. It has also shown association with less vegetated
classes such as arable land, permanent crops and shrub lands of which fragmentation
level is not as strong. Taking in to consideration these land cover changes, the
scenario model also shows the future distribution of T.marmoratus to be more
patchy, reduced in area extent and scattered. The same applies to T.pygmaeus but to
a lesser degree.
4.5. Conclusions
The following conclusions can be drawn,
(1) The projected land cover map highlighted fragmentation of the landscape is
going to increase significantly. (2) The MCE model depicted one possible aspect of
the scenario, where by the geographic distribution of T.marmoratus and T.pygmaeus
is going to be patchy, reduced in area extent and isolated from each other. The effect
appears to be more pronounced on T.marmoratus than T.pygmaeus. Overall, this
signals the need for integrated conservation activities.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
49
Table 17 Changes in landscape matrices between 2000 and 2020
Class name % percentage
2000
% percentage
2020
total
edge(2000)
total
edge(2020)
Mean patch
area 2000
(hectare)
Mean patch
area 2020
(hectare)
Urban 0.530 0.5363 55500 54389.06 40 44.100
mining 0.080 0.0859 6800 6798.63 100 98.960
Arable Land 6.120 4.5716 392800 312137.2 143.89 87.700
Permanent Crops 9.890 9.5224 729000 926313.6 167.58 172.260
Heterogeneous
agriculture 18.270 13.64 1367900 1611975 131.5 22.810
Broad Leaved
Forest 18.770 21.8269 1118500 2236350 211.99 26.800
Coniferous 16.570 10.29 703400 1622873 545.5 109.750
Mixed Forest 5.180 6.739 341300 897319.5 127 8.900
Shrub lands 22.720 31.0169 1454600 3570981 167.7 24.189
Open Area 0.440 0.4487 25600 25994.77 172.33 20.200
Water Bodies 1.320 1.3138 187600 187962.1 766 756.690
Future Species Distribution (Scenario Model)
50
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
51
5. General Discussions
5.1. The Niche Concept
Guisan & Thuiller, (2005), described that species data and environmental predictor
variables are sampled within a limited time and space. The results of species
distribution modelling, i.e. habitat suitability maps represent only the snapshot of the
expected relationship between the species and environment. This approach of
modelling species distribution determines the presence of species by identifying
areas closely associated with observed presences of species and thus termed as
‘Correlative approach’(Soberson & Peterson, 2005). As in this research, out of the
four factors (abiotic, biotic, dispersal and evolutionary capacity) described by
Soberon & Peterson, (2005), to control the presence of species, only abiotic factors
(climate and physical environmental variables) were used (Region A in Figure 21).
Figure 22 adopted from Soberson and Peterson, (2005), region A represents
geographic regions with appropriate set of abiotic factors. Region B represents
regions where right combination of interacting species occurs. Region C
represents areas accessible to the species with out barriers.
As a result the model only shows species occupying their entire suitable habitat - the
fundamental niche (region A) rather than the realized niche (A η B) where a species
might be excluded due to biotic interactions such as competition and presence of
predators (Peterson, 2006; Silvertown, 2004 and Pulliam, 2000). Region C
represents geographic regions accessible within the dispersal capacity of the species
(i.e. with out barriers). These sets of factors are non ecological, and may indeed not
be permanent (Peterson, 2006). The intersection of the three regions (A η B η C)
represents the true geographic distribution of the species. Hence, on the strictest
sense, our model only delineates the fundamental niche rather than the true
geographic distribution of the species. Contrary this view, Guisan & Thuiller,
(2005), argued that the observed species distributions (occurrence data) is already
constrained by biotic interactions and thus quantifies Hutchinson’s realized niche of
species. Accordingly, based on the latter argument, Figure 22 will be modified as
follows.
General Discussions
52
C B
A
Figure 23 Figure adopted from Peterson, 2006
5.2. MaxEnt Model Performance
The Maxent modelling method since based on presence only data has a number of
advantages that increase the model performance. First, in most cases accurate
absence data is not available. Particularly, because occurrence data is collected
within limited time and space; a certain species might be absent, for example, if it
has more than one habitat, or is nocturnal or even due to improper data collection.
More importantly, when estimating the accuracy of models in most modelling
techniques such as GARP and GLM require absence (pseudo-absence) data, which
in most cases generated randomly. Another advantage of MaxEnt model compared
with other species models is that it works well with small sample size (Phillips et al.,
2006).
All the models performed better than random prediction with high ROC (AUC)
values. However, due to small sample size, for T.marmoratus in Fatima area, a re-
substitution method is adopted which according to Verbyla, (1986) produces
optimistically biased estimates of the model’s true classification accuracy, especially
if many predictor variables are used in relation to sample cases.
5.3. Model Uncertainty
Uncertainty arises from different factors such as Natural, Data and Model
uncertainties (Isukapalli et al., 2001). All models use input data (sample
measurements) from various sources to represent the reality or “population” of the
parameter under investigation. In all cases, the relation between population and
samples is subject to uncertainty (Altman, 1990). For instance, in our model climatic
data were derived from climate stations using spatial interpolation (Hijmans, 2005)
methods which might introduce uncertainty. Similarly, due to natural randomness or
variability, some times climate variables are difficult to measure precisely.
Model uncertainty is the major source of uncertainty and can be broadly said to
originate in the understanding of the system which we are modelling, assumptions
and simplifications incorporated in the model (Reinert et al., 2006). Assumptions
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
53
and simplifications determine the model structure, resolution and boundary
(Isukapalli, 2001). For instance, land cover in the Markov chain analysis assumed to
be stationary, but not tested.
Any model has limited boundary in terms of space and time. Model boundary is a
form of simplification but could also be a source of uncertainty. On the MaxEnt,
model only abiotic factors are included out of the four factors that control the
presence of species as discussed in section 5.1. Practically, the model delineates the
fundamental niche rather than the realized niche. This implies that even if a certain
area is mapped as present on the MaxEnt model, the species might be absent due to
competition from other species or predators. However, this does not mean the model
is wrong, because the model could be accurate within its own boundary, i.e. the
fundamental niche. Resolution of data also contributes to uncertainty, for instance,
NDVI variables might have contributed more gain value if higher resolution data
were used.
5.4. Model result and its Implications for conservation activities
Amphibians exhibit complex life cycles (Pope et al., 2000; Joly et al., 2004; Porej et
al., 2004) within a complex landscape matrix. At the aquatic habitat level, predators,
aquatic vegetation (Joly et al., 2001), water depth and pollutants among other things
influence the presence of species. On the landscape level where terrestrial habitats
are used for migration, foraging and shade, connectivity between habitats plays a
vital role. Indeed, several researches indicated habitat loss and fragmentation are the
main threat for the extinction of amphibian populations (e.g. Cushman, 2006;
Bailey, 2007; Denole & Lehmann, 2006). Thus, in fragmented landscapes,
increasing the connectivity between habitats is of prime importance for conservation
planning (Bailey, 2007; Hecnar and M’Closkey, 1996; Hehl-lange, 2001; Joly,
2001).
For amphibians, Houlahan et al., (2000), described effects of land use extend to
large distances from their aquatic habitat. Pressier et al., 2001, also mentioned
juvenile dispersals cover large distances and are essential to maintain connectivity
between populations. For instance, Ambystoma sp. Salamanders travels a distance
up to 670m from a pond. With this regard, Cushman, (2006), suggested that
conservation activities should be based on landscape level instead of only on site
specific level (ponds) to preserve the connectivity between different populations. As
the MaxEnt Modelling process is at landscape scale, favourable habitats identified
could be used as base information for conservation planners to delineate corridors
for T.marmoratus and T.pygmaeus.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
55
6. General Conclusions and Recommendations
6.1. Conclusions
The main objective of this research aimed at finding habitat-species relationships
through significance tests, and also modelling future species distribution scenario
model based on the current species distribution. Further more, landscape indices
were calculated in order to characterize the fragmentation level within land cover
classes at different times with the view of investigating the relation with the
presence of species. From the results found, the following general conclusions can
be drawn.
• Land cover was the most important predictor variable for all cases except
for T.pygmaeus in Fatima area.
• In Abrantes area, heterogeneous agriculture classes were the primary
common habitat for both species and to a lesser degree, mixed forests.
• In Fatima area, coniferous forests were the primary common habitat
followed by permanent crops.
• The fact that mixed and coniferous forests were common habitat for both
species in Abrantes and Fatima area respectively and constitute a small part
of the landscape implies that they are ecologically important.
• Significant differences between T.marmoratus and T.pygmaeus in mean
suitable environmental range was found.
• The presence of T.marmoratus and T.pygmaeus was strongly influenced by
proximity to vegetation covers. However, T.marmoratus appeared to be
closer to vegetation covers than T.pygmaeus as evidenced from the
proximity of heterogeneous agriculture and permanent crops class patches
in Abrantes and Fatima area respectively. Conversely, T.pygmaeus was
sometimes observed to inhabit less vegetated areas, open spaces and urban
areas.
• The fragmentation level within the landscape will continue to increase
significantly in the future.
• The future species distribution scenario model predicted the distribution of
both species to be patchy, scattered and the overall habitat area to decrease
in the future.
• Conservation planners could use the outputs of this research as base
information to increase the connectivity of patches within the landscape in
order to delineate corridors.
General Conclusions and Recommendations
56
6.2. Recommendations
• On this study only abiotic factors were used. It would be interesting to
investigate the distribution of species by incorporating biotic and dispersal
capacity factors.
• Scale is an important issue in ecological studies. At this research scale,
topographic and NDVI variables did not have much contribution to the
model. Large scale study at microhabitat level using higher resolution data
might be useful to assess habitat- species relationships.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
57
7. References
Aaviksoo, K. (1995). Simulating vegetation dynamics and land use in a mire
landscape using a Markov model. Landscape and Urban Planning, 31, 129- 142.
Abuelgasim, A. A., Ross, W. D., Gopal, S. and Woodcock, C. E. (1999). Change
Detection Using Adaptive Fuzzy Neural Networks: Environmental Damage
Assessment after the Gulf war. Remote Sensing of Environment, 70, 208–22.
Aguilera, A.M., Escbias, M., & Valderrama, M.J. (2006). Using Principal
components for estimating logistic regression with high dimensional multicollinear
data. Computational Statistics & Data Analysis, 50, 1905-1924.
Altman, D.G., Machin, D., Bryant, T.N. & Gardner, M.J. (2000). Statistics with
confidence. BMJ books.
Amphibian Database [on line] Available from< (www.amphibiaweb.org)>
[Accessed July, 12, 2006].
Anderson, R. P., Lew, D. & Peterson, A. T. (2003). Evaluating predictive models of
species distributions: criteria for selecting optimal models. Ecological Modelling,
162, 211–232.
Arntzen, J.W. (2006). From Descriptive to predictive distribution models: a working
example with Iberian amphibians and reptiles. Frontiers in Zoology, 3(8)
Atkinson, P.M. & Tatnall, A. R. L. (1997). Neural networks in remote sensing.
International Journal of Remote Sensing, 18(4), 699-709.
Badgley, C. & Fox, D.L. (2000). Ecological biogeography of North American
mammals: species density and ecological structure in relation to environmental
gradients. Journal of Biogeography, 27, 1437–1467.
Bailey S., (2007). Increasing connectivity in fragmented landscapes: An
investigation of evidence for biodiversity gain in woodlands. Forest Ecology and
Management, 238, 7–23.
Beck, P.S.A., Kalmbach, E., Joly, D., Stien, A. & Nilsen, L. (2005). Modelling local
distribution of an Arctic dwarf shrub indicates an important role for remote sensing
of snow cover. Remote Sensing of Environment, 98,110 – 121.
Bell, E. J. (1974). Markov Analysis of Land Use Change-An Application of
Stochastic Processes to Remotely Sensed Data. Socio-Economic Planning Science,
8, 311-316.
References
58
Bellman, R.E. & Zadeh, L.A. (1970). Decision making in a fuzzy Environment.
Management Science, 17(4), 141-164.
Bock, M. (2003). Remote sensing and GIS-based techniques for the classification
and monitoring of biotopes. Journal of Conservation, 11, 145-155.
Boteva, D., Griffiths, G., Dimopoulos, P. (2004). Evaluation and mapping of the
conservation significance of habitats using GIS: an example from Crete, Greece.
Journal for Nature Conservation, 12, 237-250.
Box, E. O., Holben, B. N., & Kalb, V. (1989). Accuracy of AVHRR vegetation
index as a predictor of biomass, primary productivity and net CO2 flux. Vegetatio,
80, 71-89.
Boyd, D. S., Foody, G. M. and Ripple, W. J. (2002). Evaluation of approaches for
forest cover estimation in the Pacific Northwest, USA, using remote sensing.
Applied Geography, 22(4), 375-392.
Brotons, L., Thuiller, W., Araujo, M.B. & Hirzel, A. H. (2004). Presence-absence
versus presence-only modelling methods for predicting bird habitat suitability.
Ecography 27, 437-448.
Buehler, E.C. & Ungar, L.H. (2001). Maximum Entropy Methods for Biological
Sequence Modeling. BIOKDD, 60-64, 2001.
Carpenter, G.A., Gopal, S., Macomber, S., Martens, S., Woodcock, C.E. and
Franklin, J. (1999). A Neural Network Method for Efficient Vegetation Mapping.
Remote Sensing of the Environment, 70, 326–338.
Chapin, F. S., Zavaleta, E. S., Eviner, V. T., Naylor, R. L., Vitousek, P. M.,
Reynolds, H. L., Hooper, D. U., Lavorel, S., Sala, O. E., Hobbie, S. E., Mack, M. C.,
& Diaz, S. (2000). Consequences of Changing Biodiversity. Nature, 405, 234–242.
Chefaoui, R.M., Hortal, J. & Lobo, J.M. (2005). Potential distribution modelling,
niche characterization and conservation status assessment using GIS tools: a case
study of Iberian Copris species. Biological Conservation,122, 327–338.
Corsi, F., Skidmore, A., & de Leeuw, J. (2000). Modelling species distribution with
GIS. In L. Boitani & T. K. Fuller (Eds.), Research techniques in animal ecology (pp.
389-434). New York: Columbia University Press.
Cushman, S.A. (2006). Effects of habitat loss and fragmentation on amphibians: a
review and prospectus. Biological Conservation 128, 231–240.
de Leeuw, A. J., Ottichilo, W. K., Toxopeus, A. G., & Prins, H. H. T. (2002).
Application of remote sensing and geographic information systems in wildlife
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
59
mapping and modelling. In A. Skidmore (Ed.), Environmental modelling with GIS
and remote sensing (pp. 121-145). London: Taylor & Francis.
Denoe, M. & Lehmann, A. (2006). Multi-scale effect of landscape processes and
habitat quality on newt abundance: Implications for conservation. Biological
Conservation, 130, 495–504.
Dohoo, I.R., Ducrot, C., & Donald, A. (1996). An overview of techniques for
dealing with large number of independent variables in epidemiological studies.
Preventive Veterinary Medicine, 29, 221-239.
Eastman, J. R. (2003). IDRISI Kilimanjaro, Guide to GIS and Image Processing.
User's guide (ver. 14), Worcester, MA, Clark University Press.
Field, A. (2000). Discovering Statistics Using SPSS Windows. London:Sage
Bartlett,J.Gani, Ed. London: Academic Press.
Fraser, R. H. and Li, Z. (2002). Estimating fire-related parameters in boreal forest
using SPOT Vegetation. Remote Sensing of Environment, 82(1), 95-110.
Fielding, A.H. & Bell, J.F. (1997). A review of methods for the assessment of
prediction errors in conservation presence/absence models. Environmental
Conservation, 24, 38-49.
Foody, G.M. and Cutler, M.E.J. (2006). Mapping the species richness and
composition of tropical forests from remotely sensed data with neural networks.
Ecological Modelling, 195(1-2), 37-42.
Freire, S., Nunes, A., & Bastos, A. (2004). CORINE Land cover 2000 Portugal Final
results. ISEGI LabNT Meeting.
Garzón, M.B., Blazek, R., Neteler, M., Sánchez de Dios, R, Ollero, H.S. and
Furlanello, C. (2006). Predicting habitat suitability with machine learning models:
The potential area of Pinus sylvestris L. in the Iberian Peninsula, Ecological
Modelling, 197, 3-4, 383-393.
Giacomellia, L., Boggettia, H., Agnellib, H., Anunziataa, J., Silbera, J. J., Cattanaa,
R. (1998). Relevant physicochemical factors in chromatographic separation of
Alternaria alternata mycotoxins. Analytica Chimica Acta. 370, 79-89.
Guerry, A.D. & Hunter Jr., M.L. (2002). Amphibian distributions in a landscape of
forests and agriculture: an examination of landscape composition and configuration.
Conservation Biology, 16, 745–754.
References
60
Guisan, A., Edwards, T.C. Jr, & Hastie, T. (2002). Generalized linear and
generalized additive models in studies of species distributions: setting the scene.
Ecological Modelling 157, 89-100.
Guisan, A. & Hofer, U. (2003). Predicting reptile distributions at the mesoscale:
relation to climate and topography. Journal of Biogeography, 30, 1233–1243.
Guisan, A. & Thuiller, W. (2005). Predicting species distribution: offering more than
simple habitat models. Ecology Letters, 8, 993–1009.
Guisan, A. &. Zimmermann, N.E (2000). Predictive habitat distribution models in
ecology. Ecological Modelling 135, 147–186.
Hecnar, S.J., M’Closkey, R.T. (1996). Regional dynamics and the status of
amphibians. Ecology, 77, 2091–2097.
Hijmans, R.J., S.E. Cameron, J.L. Parra, P.G. Jones and A. Jarvis, 2005. Very high
resolution interpolated climate surfaces for global land areas. International Journal
of Climatology ,25, 1965-1978.
Houlahan, J.E., Findlay, C.S., Schmidt, B.R., Meyer, A.H., Kuzmin, S.L., 2000.
Quantitative evidence for global amphibian population declines. Nature, 404, 752–
755.
Hutchinson, G.E (1959). Homage to Rosalia Why are there so Many Kinds of
Animals? American Naturalist, 870, 145-159.
IUCN-MED. (2005). Mediterranean biodiversity [on line] Available from
http://www.iucn.org/places/medoffice/en/en_conservation.html [Accessed July 15,
2006].
Isukapalli, S.S. & Georgopoulos P. G. (2001). Computational Methods for
Sensitivity and Uncertainty Analysis for Environmental and Biological Models,
EPA/600/R-01-068.
Jehle, R, & Arntzen, J.W. (2000). Post-breeding migrations of newts (Triturus
cristatus and Triturus marmoratus) with contrasting ecological requirements.
Journal of Zoology (London) 251: 297–306.
Joly, P., Morand, C., Cohas, A. (2004). Habitat fragmentation and amphibian
conservation: building a tool for assessing landscape matrix connectivity, 4, 500-
511.
Kovacs, P., Petres, T., & Toth, L. (2005). A New Measure of Multicollinearity in
Linear Regression Models. International Statistical Review, 73, 405-412.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
61
Lennon, J.J., Greenwood, J.J.D. and. Turner, J.R.G. (2000). Bird diversity and
environmental gradients in Britain: a test of the species-energy hypothesis. Journal
of Animal Ecology, 69, 581-598.
Linderman, M.A., An, L., Bearer, S., He, G., Ouyang, Z., Liu, J. (2005). Modelling
the spatio-temporal dynamics and interactions of households, landscapes, and giant
panda habitat. Ecological Modelling, 183, 47–65.
Li, A., Wang, A., Liang, S., Zhou, W. (2006). Eco-environmental vulnerability
evaluation in mountainous region using remote sensing and GIS – A case study in
the upper reaches of Minjiang River, China. Ecological Modelling, 192, 175-187.
Löfvenhaft, K., Runborg, S. & Sjögren-Gulve, P. (2004). Biotope patterns and
amphibian distribution as assessment tools in urban landscape planning. Landscape
and Urban Planning, 68, 403–427.
Logofet, D.O. & Lesnaya, E.V (2000). The mathematics of Markov models: what
Markov chains can really predict in forest successions. Ecological Modelling, 126,
285–298.
López, E., Bocco, G., Mendoza, M. and Duhau, E. (2001). Predicting land-cover and
land-use change in the urban fringe: A case in Morelia city, Mexico. Landscape and
Urban Planning, 55(4), 271-285.
Manel, S., Dias, J.M. & Ormerod, S.J (1999). Comparing discriminant analysis,
neural networks and logistic regression for predicting species distributions: a case
study with a Himalayan river bird, Ecological Modelling, 120, 337-347.
Mansfield, E.R. & Helms, B.H. (1982). Detecting Multicollinearity. The American
Statistician, 36, 158-160.
Martínez, I., Carreño, F., Escudero, A. and Rubio, A. (2006). Are threatened lichen
species well-protected in Spain? Effectiveness of protected areas
network. Biological Conservation, 133, 500-511.
Mas, J.F., Puig, H., Palacio, J.L., Sosa-Lopez, A. (2004). Modelling deforestation
using GIS and artificial neural networks. Environmental Modelling & Software,
19,461–471.
O’Brien, E.M (1998). Water-Energy dynamics, climate and prediction of woody
plant species: an interim general model. Journal of Biogeography, 25, 379-398.
Olsen, L.M, Dale, V.H., Foster, T. (2006). Landscape patterns as indicators of
ecological change at Fort Benning, Georgia, USA. Landscape and Urban Planning,
Article in Press.
References
62
Osborne, P.E. & Sua´rez-Seoane, S. (2002). Should data be partitioned spatially
before building large-scale distribution models? Ecological Modelling, 157, 249-
259.
Pearce, J. & Ferrier, S. (2000). Evaluating the predictive performance of habitat
models developed using logistic regression. Ecological Modelling, 133, 225–245.
Pearson, S.M, Turner, M.G, Drake, J.B (1999). Landscape change and habitat
availability in the southern Appalachians high lands and Olympic Peninsula.
Ecological Applications, 9, 1288-1304.
Peterson, A.T. (2006). Uses and Requirements of Ecological Niche Models and
Related Distributional Models. Biodiversity Informatics, 3, 59-72.
Phillips, S.J., Anderson, R.P. & Schapired, R.E. (2006). Maximum entropy
modelling of species geographic distributions. Ecological Modelling, 190, 231–259.
Pope, S.E., Fahrig, L., & Merriam, H.G. (2000). Landscape complementation and
metapopulation effects on leopard frog populations. Ecology, 81, 2498–2508.
Porej, D., Micacchion, M., Hetherington, T.E. (2004). Core terrestrial habitat for
conservation of local populations of salamanders and wood frogs in agricultural
landscapes, Biological Conservation, 120, 399–409.
Preisser, E.L., Kefer, J.Y., Lawrence, J.D., 2001. Vernal pool conservation in
Connecticut: an assessment and recommendations. Environmental Management 26
(5), 503–513.
Pulliam, H.R. (2000). On the relationship between Niche and distribution. Ecology
Letters, 349-361.
Romero-Calcerrada, R. & Luque, S. (2006). Habitat quality assessment using
Weights of Evidence based GIS modelling: The case of Picoides tridactylus as
species indicator of the biodiversity value of the Finnish forest. Ecological
Modelling, 196, 62-76.
Reinert, J.M, Apostolakis, G.E. (2006). Including Model Uncertainty in Risk-
informed Decision Making. Annals of Nuclear Energy, 33, 354–369.
Santos, X., Brito, J.C, Sillero, N., Pleguezuelos, J.M, Llorente, G.A., Fahd, S. and
Parellada, X.(2006). Inferring habitat-suitability areas with ecological modelling
techniques and GIS: A contribution to assess the conservation status of Vipera
latastei. Biological Conservation, 130, 3,416-425.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
63
Sehl-Lange, S. (2001). Structural elements of the landscape and their ecological
functions. Landscape and Urban Planning, 54, 105-113.
Semlitsch, R.D. & Bodie, J.R. (2003). Biological criteria for buffer zones around
wetlands and riparian habitats for amphibians and reptiles. Conservation Biology,
17, 1219–1228.
Sérgio, C., Figueira, R., Draper, D., Menezes, R. and Sousa, A.J. (2007). Modelling
bryophyte distribution based on ecological information for extent of occurrence
assessment. Biological Conservation, 135(3), 341-351.
Shannon, C.E., (1948). A Mathematical Theory of Communication. Bell Syst. Tech.
J. 27, 379–423, 623–656.
Silvertown, J. (2004). Plant coexistence and the niche. Trends in Ecology and
Evolution, Review Article, 19(11).
Soberon, J. & Peterson, A.T. (2005). Interpretation of Models of Fundamental
Ecological Niches and Species’ Distributional Areas. Biodiversity Informatics, 2, 1-
10.
Skei, J.K., Dolmen, D., Ronning, L and. Ringsby, T.H. (2006). Habitat use during
the aquatic phase of the newts Triturus vulgaris (L.) and T.cristatus (Laurenti) in
central Norway: proposition for conservation and monitoring area. Amphibia-
Reptilia, 27, 309-324.
Stockwell, D. & Peters, D. (1999). The GARP modelling system: problems and
solutions to automated spatial prediction. int. j. geographical information science,
13, 2, 143-158.
Store, R. & Kangas, J. (2001). Integrating Multi-Criteria Evaluation and Expert
Knowledge for GIS Based Habitat Suitability Modelling. Landscape and Urban
Planning, 59, 79-93.
Sweeney, A.W., Beebe, N.W. and Cooper, R.D. (2007). Analysis of environmental
factors influencing the range of anopheline mosquitoes in northern Australia using a
genetic algorithm and data mining methods. Ecological Modelling, In Press,
Corrected Proof.
Themudo, G.E. & Arntzen, J.W. (In Press). Molecular identification of marbled
newts and justification of the Triturus marmoratus and Triturus pygmaeus species
status. Herpetological Journal.
VEGETATION Database [on line] available from www.vgt.vito.be [Accessed on
July 25, 2006].
References
64
Velázquez, A., Durán, E., Ramírez, I., Mas, J., Bocco, G., Ramírez, G. and Palacio,
J. (2003). Land use-cover change processes in highly bio diverse areas: the case of
Oaxaca,Mexico. Global Environmental Change, 13(3), 175-184.
Verbyla, D.L. (1986). Potential prediction bias in regression models. Canadian
Journal of Forest Research 16, 1255-1257.
Verbyla, D.L. and Litvaitis, J.A. (1989). Resampling methods for evaluating
classification accuracy of wildlife habitat models. Environmental Management,
13(6): 783 - 787.
Weiers, S., Bock, M., Wissen, M. & Rossner, G. (2004). Mapping and indicator
approaches for the assessment of habitats at different scales using remote sensing
and GIS methods. Landscape and Urban Planning, 67, 43-65.
Weng, Q. (2002). Land use change analysis in the Zhujiang Delta of China using
satellite remote sensing, GIS and stochastic Modelling. Journal of Environmental
Management, 64, 273–284.
Woodford, J. E. & Meyer, M. W. (2003). Impact of lakeshore development on green
frog abundance. Biological Conservation, 110, 277–284.
Yemshanov, D. & Perera, A.H. (2002). A spatially explicit stochastic model to
simulate boreal forest cover transitions: general structure and properties. Ecological
Modelling, 150,189–209.
Zadeh, L.A. (1988). Fuzzy Logic. IEEE, 83-93.
Zadeh, L.A. (1997). Toward a theory of fuzzy information granulation and its
centrality in human reasoning and fuzzy logic. Fuzzy Sets and Systems, 90, 111-127.
Zweig, M.H. & Campbell, G. (1993). Receiver-Operating Characteristic (ROC)
Plots: A Fundamental Evaluation Tool Clinical Medicine. Clinical Chemistry, 39(4),
561-577.
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
65
8. Appendices
AppendixA. Principal Factor Analysis (PFA) coefficients of climate variables. The
PFA has six factors of which Factor 1 has a variance percentage >
95% for Abrantes area and variance percentage >99% for Fatima
area.
Abrantes
Mean
annual
Temp.
Maximum
Temp.
Minimu
m
Temp.
Range
of
Temp.
Radiation Precipitation Variance
percentage
per band
Factor 1 0.412 0.412 0.408 0.409 0.396 0.412 95.22
Factor 2 -0.382 -0.382 0.195 0.346 0.635 -0.382 3.28
Factor 3 0.13 0.13 -0.775 -0.169 0.565 0.13 0.99
Factor 4 0.017 0.017 0.441 -0.827 0.346 0.017 0.51
Factor 5 0.707 -0.707 0 0 0 0 0
Factor 6 -0.408 -0.408 0 0 0 0.816 0
Fatima
Mean
annual
Temp.
Maximum
Temp.
Minimu
m Temp.
Range
of
Temp.
Radia
tion
Precipitatio
n
Varianc
e
percent
age per
band
Factor
1 0.409 0.409 0.408 0.41 0.41 0.407 99.92 Factor
2 0.177 0.168 0.537 -0 -0.07 -0.805 0.08
Factor
3 0.132 0.158 -0.668 0.34 0.46 -0.428 0
Factor
4 -0.278 -0.361 0.246 -0.3 0.78 -0.038 0
Factor 5 -0.255 0.793 -0.11 -0.5 0.07 0.036 0
Factor
6 -0.801 0.145 0.167 0.55 -0.03 -0.039 0
Appendices
66
Appendix B. Predictive species distribution model of T.marmoratus and
T.pygmaeus in Abrantes and Fatima area using only NDVI variables.
a. T.marmoratus (Abrantes
area)
b. T.pygmaeus (Abrantes area)
c. T.marmoratus (Fatima area)
d. T.pygmaeus (Fatima area)
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
67
Appendix C. Contingency table (Pearson’s Chi-square) test showing the significant association of T.marmoratus and T.pygmaeus in
Abrantes and Fatima area.
Count Cell Chi^2
Absence Presence
Arable 159 0.1064
44 0.4358
Broad Leaved
657 31.4729
0 128.919
Coniferous 735 35.2094
0 144.225
Heterogeneous
287 144.628
429 592.428
Mine 3 0.1437
0 0.5887
Mixed Forest
116 3.1183
54 12.7731
Open Area 18 0.8623
0 3.5320
Permanent 231 5.0204
102 20.5647
Shrub 761 5.4467
109 22.3107
Urban 14 0.6707
0 2.7471
Water Bodies
42 2.0120
0 8.2414
3023 738
b. T.pygmaeus
Abrantes area
Count Cell Chi^2
Absence Presence
Arable 203 15.3206
0 48.6315
Broad Leaved
640 39.4517
17 125.230
Coniferous 547 0.2542
188 0.8070
Heterogeneous
320 92.5443
396 293.759
Mine 3 0.2264
0 0.7187
Mixed Forest 129 0.0006
41 0.0018
Open Area 18 1.3585
0 4.3122
Permanent 329 22.6748
4 71.9754
Shrub 615 3.2795
255 10.4099
Urban 14 1.0566
0 3.3539
Water Bodies
42 3.1698
0 10.0617
2860 901
a. T.marmoratus
Abrantes area
Count Cell Chi^2
Absence Presence
Arable Land 244 13.8948
1 50.7111
Broad Leaved Forest
435 19.7567
13 72.1052
Coniferous Forest 110 72.4314
199 264.350
Heterogeneous Agriculture
2081 88.4728
77 322.895
Mine Area 83 4.4172
1 16.1212
Mixed Forest 164 258.988
520 945.217
Open Space 70 4.1250
0 15.0549
Permanent Crops 403 206.315
632 752.978
Shrubs 1060 60.0491
5 219.158
Urban 286 15.8932
2 58.0047
Water Bodies 356 20.9786
0 76.5648
5292 1450
c. T.marmoratus
Fatima area
Count Cell Chi^2
Absence Presence
Arable Land 245 28.6226
0 70.6437
Broad Leaved Forest
448 52.3385
0 129.177
Coniferous Forest
141 28.3107
168 69.8738
Heterogeneous Agriculture
1014 177.262
1144 437.502
Mine Area 82 8.2597
2 20.3859
Mixed Forest 644 50.7832
40 125.338
Open Space 70 8.1779
0 20.1839
Permanent Crops
694 2.4599
341 6.0713
Shrubs 918 33.8123
147 83.4523
Urban 186 1.7535
102 4.3278
Water Bodies 356 41.5904
0 102.650
4798 1944
d. T.pygmaeus
Fatima area
Appendices
68
Appendix D: Mean environmental ranges of presence and absence of T.marmoratus and T.pygmaeus in Abrantes area compared
with t-test.
T.marmoratus
Newt Present Newt Absent
Abrantes Mean Range N Mean Range N t-value DF P
Elevation 241.07 238.09-244.05 1274 246.713 243.4-250.02 2488 -2.4842 3569.391 0.0136
Aspect 123.891 117.63-130.15 1274 129.532 124.68-134.39 2488 -1.39681 2756.994 0.1626
Drainage Distance 162.44 156.02-168.86 1274 215.072 209.17-220.97 2488 -11.8441 3173.814
<0.0001
Slope 3.06727 2.8791-3.2554 1274 3.28078 3.071-3.49 2488 -1.48842 3572.869 0.1367
Vegetation Distance 52.409 48.64-56.18 1274 124.06 114.65-133.47 2488 -13.8661 3187.477 <0.0001
Abrantes T.pygmaeus
Newt Present Newt Absent
Abrantes Mean Range N Mean Range N t-value DF P
Elevation 219.776 217.13-222.42 1199 256.509 253.29-259.73 2563 -17.2993 3642.457
<0.0001
Aspect 139.08 132.83-145.33 1199 122.261 117.44-127.08 2563 4.180543 2615.28
<0.0001
Drainage Distance 159.649 152.78-166.52 1199 214.837 209.13-220.54 2563 -12.1265 2800.417
<0.0001
Slope 3.59009 3.325--3.8552 1199 3.02995 2.8442-3.2157 2563 3.394663 2394.372 0.0007
Vegetation Distance 38.253 35.38-41.12 1199 128.586 119.44-137.74 2563 -18.4727 3027.301
<0.0001
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
69
Appendix E: Mean environmental ranges of presence and absence of T.marmoratus and T.pygmaeus in
Fatima area compared with t-test.
Fatima T.marmoratus
Newt Present Newt Absent
Altitude 107.361 105-109.72 1234 150.863 147.17-154.56 5514 -18.1446 6573.32 <0.0001
Aspect 188.334
183.57-
193.1 1234 179.166 176.22-182.11 5514 2.97956 2270.192 0.0019
Drainage Distance 179.434
172.82-
186.05 1234 445.441 420.65-470.23 5514 -18.2154 6550.543 <0.0001
Slope 5.08017
4.9114-
5.249 1234 4.68266 4.5668-4.7985 5514 3.80779 2544.826 0.0001
Vegetation Distance 150.885
140.17-
161.6 1234 340.254 325.48-355.03 5514 -20.3452 5739.476 <0.0001
Fatima T. pygmaeus
Newt Present Newt Absent
Altitude 138.716
133.84-
143.59 1884 145.575 141.76-149.39 4864 5.99745 4270.742 0.0299
Aspect 204.673
200.31-
209.04 1884 171.963 168.9-175.03 4864 12.01839 4015.864 <.0001
Drainage Distance 99.781 96.3-103.26 1884 517.117 488.95-545.28 4864 -28.7858 5012.944 <0.0001
Slope 4.28833
4.1657-
4.411 1884 4.93625 4.8067-5.0657 4864 -7.12349 5688.369 <.0001
Vegetation Distance 177.458
168.35-
186.57 1884 355.267 338.71-371.83 4864 -18.4437 6672.722 <0.0001
Appendices
70
Appendix F: Present-absent binary models after introducing equal sensitivity-specificity threshold values on the
continuous MaxEnt model.
a. T.marmoratus, Abrantes area
b. T.pygmaeus, Abrantes Area
c. T.marmoratus, Fatima area
d. T.pygmaeus, Fatima area
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
71
Appendix G: Receiver Operating Characteristic (ROC) curves for T.marmoratus and T.pygmaeus in Abrantes and Fatima
area.
a. T.marmoratus, Abrantes area
b. T.pygmaeus, Abrantes area
c. T.marmoratus, Fatima area
d. T.pygmaeus, Fatima area
Appendices
72
Appendix H: These curves show how each environmental variable affects the Maxent prediction for Abrantes area. The (raw)
Maxent model has the form exp(...)/constant, and the curves show how the exponent changes as each environmental
variable is varied, keeping all other environmental variables at their average sample value.
T.marmoratus, Abrantes Area
T.pygmaeus, Abrantes area
Predictive Modelling of Amphibian Distribution using Ecological Survey Data
73
Appendix I: These curves show how each environmental variable affects the Maxent prediction for Fatima area. The (raw) Maxent
model has the form exp(...)/constant, and the curves show how the exponent changes as each environmental variable is
varied, all other environmental variables at their average sample value.
T.marmoratus, Fatima area
T.pygmaeus, Fatima area
Appendices
74
Appendix J: Kappa, coefficient of agreement between the present MaxEnt and
future MCE model for T.marmoratus and T.pygmaeus
a. T.marmoratus
b. T.pygmaeus