Mapping Species Distributions with MAXENT Using a ...

13
Mapping Species Distributions with MAXENT Using a Geographically Biased Sample of Presence Data: A Performance Assessment of Methods for Correcting Sampling Bias Yoan Fourcade 1 *, Jan O. Engler 2,3 , Dennis Ro ¨ dder 3 , Jean Secondi 1 1 LUNAM Universite ´ d’Angers, GECCO (Groupe e ´ cologie et conservation des verte ´bre ´s), Angers, France, 2 Department of Wildlife Ecology, University of Go ¨ ttingen, Go ¨ ttingen, Germany, 3 Zoological Research Museum Alexander Koenig, Bonn, Germany Abstract MAXENT is now a common species distribution modeling (SDM) tool used by conservation practitioners for predicting the distribution of a species from a set of records and environmental predictors. However, datasets of species occurrence used to train the model are often biased in the geographical space because of unequal sampling effort across the study area. This bias may be a source of strong inaccuracy in the resulting model and could lead to incorrect predictions. Although a number of sampling bias correction methods have been proposed, there is no consensual guideline to account for it. We compared here the performance of five methods of bias correction on three datasets of species occurrence: one ‘‘virtual’’ derived from a land cover map, and two actual datasets for a turtle (Chrysemys picta) and a salamander (Plethodon cylindraceus). We subjected these datasets to four types of sampling biases corresponding to potential types of empirical biases. We applied five correction methods to the biased samples and compared the outputs of distribution models to unbiased datasets to assess the overall correction performance of each method. The results revealed that the ability of methods to correct the initial sampling bias varied greatly depending on bias type, bias intensity and species. However, the simple systematic sampling of records consistently ranked among the best performing across the range of conditions tested, whereas other methods performed more poorly in most cases. The strong effect of initial conditions on correction performance highlights the need for further research to develop a step-by-step guideline to account for sampling bias. However, this method seems to be the most efficient in correcting sampling bias and should be advised in most cases. Citation: Fourcade Y, Engler JO, Ro ¨ dder D, Secondi J (2014) Mapping Species Distributions with MAXENT Using a Geographically Biased Sample of Presence Data: A Performance Assessment of Methods for Correcting Sampling Bias. PLoS ONE 9(5): e97122. doi:10.1371/journal.pone.0097122 Editor: John F. Valentine, Dauphin Island Sea Lab, United States of America Received March 3, 2014; Accepted April 14, 2014; Published May 12, 2014 Copyright: ß 2014 Fourcade et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Funding: This study was supported by a grant to YF and JS from Plan Loire Grandeur Nature and European Regional Development Fund (ERDF). JOE was kindly supported within the German Federal Environmental Foundation fellowship program. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Competing Interests: The authors have declared that no competing interests exist. * E-mail: [email protected] Introduction A key issue in ecology and conservation biology is to determine how species are distributed in space. Since extinction risk is associated with range size [1], a significant reduction of a species range often determines change in conservation status (see for example IUCN criteria [2,3]) and prime conservations actions [4,5]. Likewise, protected areas usually focus on biodiversity hotspots [6] in order to conserve efficiently as many species as possible [7–9]. Therefore, conservationists often need precise assessments of species ranges. Beyond simple range description, identifying which main factors limit distributions is essential to efficiently forecast the benefits of conservation management. In order to deal with these questions, several methods of species distribution modeling (SDM), also known as ecological niche modeling (ENM) [10], have been developed since the 1980s [11]. The principle of SDM is to relate known locations of a species with the environmental characteristics of these locations in order to estimate the response function and contribution of environ- mental variables [12], and predict the potential geographical range of a species [13]. These models estimate the fundamental ecological niche in the environmental space (i.e. species response to abiotic environmental factors [14]) and project it onto the geographical space to derive the probability of presence for any given area or, depending on the method, the likelihood that specific environmental conditions are suitable for the target species [15]. Distribution models are used by conservation practitioners to estimate the most suitable areas for a species and infer probability of presence in regions where no systematic surveys are available [16]. They can also assess the potential expansion of introduced species in newly colonized areas [17,18], estimate the future range of a species under climate change [18,19] or assist in reserve planning [20]. Several statistical models exist to predict the distribution of a species [21]. Beyond classical regression methods (Resource Selection Function RSF [22,23], Generalized Linear Models GLM [24]), algorithmic modeling based on machine learning (for example Artificial Neural Networks [25], Maximum Entropy MAXENT [26], Classification And Regression Trees CART [27]) have become increasingly popular in recent years. Among these, PLOS ONE | www.plosone.org 1 May 2014 | Volume 9 | Issue 5 | e97122

Transcript of Mapping Species Distributions with MAXENT Using a ...

Page 1: Mapping Species Distributions with MAXENT Using a ...

Mapping Species Distributions with MAXENT Using aGeographically Biased Sample of Presence Data: APerformance Assessment of Methods for CorrectingSampling BiasYoan Fourcade1*, Jan O. Engler2,3, Dennis Rodder3, Jean Secondi1

1 LUNAM Universite d’Angers, GECCO (Groupe ecologie et conservation des vertebres), Angers, France, 2 Department of Wildlife Ecology, University of Gottingen,

Gottingen, Germany, 3 Zoological Research Museum Alexander Koenig, Bonn, Germany

Abstract

MAXENT is now a common species distribution modeling (SDM) tool used by conservation practitioners for predicting thedistribution of a species from a set of records and environmental predictors. However, datasets of species occurrence usedto train the model are often biased in the geographical space because of unequal sampling effort across the study area. Thisbias may be a source of strong inaccuracy in the resulting model and could lead to incorrect predictions. Although anumber of sampling bias correction methods have been proposed, there is no consensual guideline to account for it. Wecompared here the performance of five methods of bias correction on three datasets of species occurrence: one ‘‘virtual’’derived from a land cover map, and two actual datasets for a turtle (Chrysemys picta) and a salamander (Plethodoncylindraceus). We subjected these datasets to four types of sampling biases corresponding to potential types of empiricalbiases. We applied five correction methods to the biased samples and compared the outputs of distribution models tounbiased datasets to assess the overall correction performance of each method. The results revealed that the ability ofmethods to correct the initial sampling bias varied greatly depending on bias type, bias intensity and species. However, thesimple systematic sampling of records consistently ranked among the best performing across the range of conditionstested, whereas other methods performed more poorly in most cases. The strong effect of initial conditions on correctionperformance highlights the need for further research to develop a step-by-step guideline to account for sampling bias.However, this method seems to be the most efficient in correcting sampling bias and should be advised in most cases.

Citation: Fourcade Y, Engler JO, Rodder D, Secondi J (2014) Mapping Species Distributions with MAXENT Using a Geographically Biased Sample of Presence Data:A Performance Assessment of Methods for Correcting Sampling Bias. PLoS ONE 9(5): e97122. doi:10.1371/journal.pone.0097122

Editor: John F. Valentine, Dauphin Island Sea Lab, United States of America

Received March 3, 2014; Accepted April 14, 2014; Published May 12, 2014

Copyright: � 2014 Fourcade et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permitsunrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: This study was supported by a grant to YF and JS from Plan Loire Grandeur Nature and European Regional Development Fund (ERDF). JOE was kindlysupported within the German Federal Environmental Foundation fellowship program. The funders had no role in study design, data collection and analysis,decision to publish, or preparation of the manuscript.

Competing Interests: The authors have declared that no competing interests exist.

* E-mail: [email protected]

Introduction

A key issue in ecology and conservation biology is to determine

how species are distributed in space. Since extinction risk is

associated with range size [1], a significant reduction of a species

range often determines change in conservation status (see for

example IUCN criteria [2,3]) and prime conservations actions

[4,5]. Likewise, protected areas usually focus on biodiversity

hotspots [6] in order to conserve efficiently as many species as

possible [7–9]. Therefore, conservationists often need precise

assessments of species ranges. Beyond simple range description,

identifying which main factors limit distributions is essential to

efficiently forecast the benefits of conservation management. In

order to deal with these questions, several methods of species

distribution modeling (SDM), also known as ecological niche

modeling (ENM) [10], have been developed since the 1980s [11].

The principle of SDM is to relate known locations of a species

with the environmental characteristics of these locations in order

to estimate the response function and contribution of environ-

mental variables [12], and predict the potential geographical range

of a species [13]. These models estimate the fundamental

ecological niche in the environmental space (i.e. species response

to abiotic environmental factors [14]) and project it onto the

geographical space to derive the probability of presence for any

given area or, depending on the method, the likelihood that

specific environmental conditions are suitable for the target species

[15]. Distribution models are used by conservation practitioners to

estimate the most suitable areas for a species and infer probability

of presence in regions where no systematic surveys are available

[16]. They can also assess the potential expansion of introduced

species in newly colonized areas [17,18], estimate the future range

of a species under climate change [18,19] or assist in reserve

planning [20].

Several statistical models exist to predict the distribution of a

species [21]. Beyond classical regression methods (Resource

Selection Function RSF [22,23], Generalized Linear Models

GLM [24]), algorithmic modeling based on machine learning (for

example Artificial Neural Networks [25], Maximum Entropy

MAXENT [26], Classification And Regression Trees CART [27])

have become increasingly popular in recent years. Among these,

PLOS ONE | www.plosone.org 1 May 2014 | Volume 9 | Issue 5 | e97122

Page 2: Mapping Species Distributions with MAXENT Using a ...

MAXENT has been described as especially efficient to handle

complex interactions between response and predictor variables

[15,28], and to be little sensitive to small sample sizes [29]. This, as

well as its extreme simplicity of use, has made MAXENT the most

widely used SDM algorithm. In December 2013, 1886 citations of

the article describing the method [30] were reported in Web of

Science.

MAXENT modeling, and SDM in general, is now commonly

implemented in conservation-oriented studies [31]. Regional or

continent-wide studies are facilitated by the recent availability of

global datasets. Environmental layers, such as the global climate

variables developed in the WorldClim project [32], offer

continuous description of very large areas [33]. Similarly, the

development of open biodiversity databases (see for example the

Global Biodiversity Information Facility, GBIF, http://www.gbif.

org) increases manifolds the spatial coverage of fieldwork

observations that could have been collected by a single project.

Such databases usually provide presence-only data that can be

handled by modeling methods like MAXENT.

However, datasets derived from opportunistic observations or

museum records rather than from planned surveys often exhibit a

strong geographic bias [34], some areas being visited more often

than others because of their accessibility [35] or their naturalistic

interest. This unequal survey coverage of a species distribution is

often referred as sampling bias, sample selection bias or survey

bias. The quality of the model can be strongly affected if entire

parts of the environmental space suitable to a species are absent or

poorly represented in the survey dataset [36,37], or alternatively, if

some areas are overrepresented due to locally high sampling

efforts. Several studies questioned the effect of sampling design

[38], or the biased nature of museum and herbarium datasets [39]

on the predictive performance of SDMs. Surprisingly, the issue of

quantifying and correcting sampling bias has been poorly

addressed despite its crucial importance. Although authors pointed

out that the distribution of locations in the geographical and/or

ecological space may impact the reliability of the model [35,36,40–

43], the potential effect of the sampling bias in the dataset is

usually poorly taken into account or not considered at all.

However, very different SDM outputs can be generated that lead

to contrasting conclusions whether sampling bias is corrected or

not [44], making SDM studies that did not incorporate this issue

highly doubtful.

In regard to the considerable influence of sampling bias on

SDM prediction ability, Araujo et al. [45] considered the

improvement of sampling designs as one of the five major

challenges for future development of SDMs. Several bias

correcting methods have been proposed [46–51] but they have

been rarely used so far. Comparison and evaluation of different

methods to correct sampling bias have only been recently carried

out and no consensual guideline emerged to solve it. A few recent

studies explored the consequences of and potential solutions to

correct for sampling bias (by Syfert et al. [41], Kramer-Schadt

[52], Varela et al. [53] and Boria et al. [54]). In spite of their

interest, authors investigated a single case study so that it is not

possible to evaluate the efficiency of a correction method across

species. Second, the empirical bias caused by sampling intensity

[41] was never tested and no more than two correction techniques

have been evaluated at the same time whereas many more have

been proposed or used in the literature [47,49,55]. Therefore, the

influence of the nature and intensity of bias on the capacity of

various techniques to correct for sampling bias has not been

investigated. This remains however a critical issue, especially for

users who need robust and reliable SDM predictions such as

conservation practitioners.

The goal of this comparative study is to test the effect of bias

type, bias intensity, and correction method on MAXENT model

performance. Unlike the previous cited studies [41,52–54] we

assessed the performance of five bias correcting methods among

the most frequently used under various conditions of bias type and

intensity. We used a virtual species to generate four types of

sampling biases and three bias intensities, and applied on these

biased datasets different corrections. We quantified the relative

correction performance across the range of bias conditions and

across species. The same framework was also applied on two real

datasets. The full workflow on which analyses were based is

sketched in Figure 1. Therefore, the present study provides for the

first time a comprehensive multi-species evaluation of the most

common methods of sampling bias correction under different

scenarios of bias and intensities of bias. Intended for conserva-

tionists who use MAXENT on a regular basis, we expect this work

to provide insights on the selection of the most suitable methods to

produce reliable distribution models using biased datasets.

Furthermore, it encourages modelers to develop improvements

of techniques to correct for sampling bias suitable for the vast set of

modeling methods available.

Materials and Methods

Species datasetsIn order to obtain a true unbiased dataset, we created a virtual

species by randomly sampling a set of points in a given

environment determined by a single categorical variable. A similar

approach has been used formerly [52]. We extracted 2000 random

coordinates from the ‘‘Closed to open shrubland’’ category of the

North American Globcover map (Globcover 2009, http://due.

esrin.esa.int/globcover [56]) to generate an unbiased dataset. The

geographical extent of the virtual species was chosen to match the

scale of the two real species ranges which are also both located in

North America. In practice, the virtual species covered the larger

part of western North America, including the lower valleys of the

Rocky Mountains, Mojave Desert, Baja California peninsula, and

Northern Mexico (Figure 2).

We additionally used the occurrence datasets of two real species

(Figure 2). We compiled a set of 1825 occurrences for the painted

turtle (Chrysemys picta) downloaded from the World Turtle

Database (http://emys.geo.orst.edu; accessed on May 2011).

The second species was the white-spotted slimy salamander

(Plethodon cylindraceus) for which we collected a set of 208

observations. A part of the dataset was provided by J. Milanovich

and additional records were obtained from the Global Biodiversity

Information Facility (http://www.gbif.org, accessed on May

2011). According to our knowledge of the distribution of these

species, these original datasets seem relatively unbiased, i.e. the

distribution of records over space reflects the known spatial

distribution of the species.

Environmental predictorsWe used distinct sets of environmental predictors depending on

the modeled species. Climatic and topographic grids were

downloaded from the WorldClim database [32] (http://www.

worldclim.org) at a resolution of 2.5 arc-min (4.63 km at the

equator). The global map of land cover provided by the European

Spatial Agency was downloaded in its 2009 version (Globcover

2009 [56], http://due.esrin.esa.int/globcover) and rescaled to fit

the 2.5 arc-min resolution of the other variables. Finally, we

compiled 5 years of 10-day periods of NDVI (2007–2011), a

measure of vegetation productivity derived from multispectral

remote-sensing images, downloaded from the SPOT-VEGETA-

An Assessment of Methods for Correcting Sampling Bias in SDM

PLOS ONE | www.plosone.org 2 May 2014 | Volume 9 | Issue 5 | e97122

Page 3: Mapping Species Distributions with MAXENT Using a ...

TION project [57] (http://free.vgt.vito.be). We averaged across

these 5 years three layers of mean, minimum and maximum

annual NDVI.

We removed for each species some highly intercorrelated

(correlation coefficient computed by ArcGIS 10; .0.9 or ,20.9)

variables because multicollinearity may violate statistical assump-

tions and may alter model predictions [58]. The resulting variable

sets were composed of 14 predictors (Table 1). Since the

geographical distribution of the virtual species and Chrysemys picta

records covered a large range in North America, we modeled both

species across the same geographic area across North America.

Plethodon cylindraceus occurrences are restricted to a smaller area of

Eastern USA. Accordingly, the geographical range of predictors

was restricted to a narrower area (Table 1).

Generation of sampling biasThe three original datasets were altered to generate four types of

bias that might occur when collecting observations (Figure 3). The

original datasets were thus subsampled so that the remaining

records were biased in the geographical space. We also created

three levels of bias intensity, hereafter referred as ‘‘low’’,

‘‘medium’’ and ‘‘high’’ to assess the effect of this parameter on

model outputs (Figure 3). For each species, each combination of

bias type (4) and bias intensity (3) was replicated 10 times resulting

in a total of 360 biased datasets used to model distribution. The

four types of sampling bias were generated as follows:

(1) TWO AREAS - The original dataset was biased such that its

northern part exhibited a high density of records and the southern

part a low density. This kind of bias is common when a species is

systematically monitored in one part of its range and not surveyed

in the other, for instance in different countries or groups of

countries [44].

(2) GRADIENT - We generated a density gradient of

observations decreasing from the north to the south of the range.

This bias is close to the first one but here record density changed

gradually. Such a bias would not reflect a difference in survey

schemes between administrative divisions but a gradual reduction

of sampling intensity towards a limit of species range.

(3) CENTER - The density of occurrences gradually decreased

from the core of the distribution to the periphery. Such bias

mimics cases in which sampling effort is concentrated in the centre

of the known range of the species, whereas peripheral areas,

potentially less suitable [59], are neglected.

(4) TRAVEL TIME - We used the travel time to the nearest

city, using a map produced by the European Commission [60]

(available at bioval.jrc.ec.europa.eu/products/gam). This variable

integrates both the distance to the city and the presence of road

networks. This map was used as a grid of sampling probability, in

which probability of keeping a record was highest close to cities

and in areas with dense road networks. This bias corresponds to a

common situation where most of records are located around cities

or along roads [35,61].

The full details of the generation of sampling bias are given in

Supporting information, Material S1.

Species distribution modelingWe used for modeling the software MAXENT [30], a machine

learning algorithm that applies the principle of maximum entropy

Figure 1. Workflow used in analyses. Original datasets of a virtual and 2 real species were altered to create 12 bias combining 4 bias types and 3bias intensities. Five methods of sampling bias correction were employed to assess the improvement in the modeled distribution relative to theoriginal distribution using MAXENT. Correction performance was assessed using AUC and 3 measures of overlap between the corrected the originalunbiased model.doi:10.1371/journal.pone.0097122.g001

An Assessment of Methods for Correcting Sampling Bias in SDM

PLOS ONE | www.plosone.org 3 May 2014 | Volume 9 | Issue 5 | e97122

Page 4: Mapping Species Distributions with MAXENT Using a ...

to predict the potential distribution of species from presence-only

data and environmental variables [26]. Currently, this widely used

method is particularly efficient to handle complex interactions

between response and predictor variables [15,28], and is little

sensitive to small sample sizes [29]. All models were computed

using the version 3.3.3k of MAXENT (http://www.cs.princeton.

edu/,schapire/maxent/). Runs were conducted with the default

variable responses settings, and a logistic output format which

results in a map of habitat suitability of the species ranging from 0

to 1 per grid cell, wherein the average observation should be close

to 0.5 [15]. The models were evaluated by the area under the

ROC curve (AUC), and three measures of overlap with the

unbiased model (see below section ‘‘Model evaluation and

statistical analyses’’).

Methods of sampling bias correctionWe applied on all our biased datasets five methods of bias

correction that have been already published. In order to evaluate

their usefulness in real conditions, we used these methods as if the

source, shape or strength of the sampling bias was unknown.

Therefore, we did not select a correction method according to our

knowledge of the bias, as this information is unknown in most

empirical studies.

(1) Systematic Sampling. A subsample of records regularly

distributed in the geographical space was selected [46,54,55,62].

MAXENT already discards redundant records that occur in a

single cell. We removed neighboring occurrences at a coarser

resolution than MAXENT does. We created a grid of a defined

cell size and randomly sampled one occurrence per grid cell. This

subsampling reduces the spatial aggregation of records but does

not correct the lack of data due to low sampling effort in some

areas. This method could also underestimate the contribution of

suitable areas where the high density or records reflects the true

ecological value for the species. The resolution of the reference

grid was 2 degrees for Chrysemys picta and the virtual species, and

0.2 degree for Plethodon cylindraceus.

(2) Bias File. This option is implemented in MAXENT. The

software can be fed with a bias grid [49,63] that is a sampling

probability surface. The cell values reflect sampling effort and give

a weight to random background data used for modeling. An ideal

way of creating biasfiles would be to represent the actual sampling

intensity across the study area. Although it can be roughly

estimated by the aggregation of occurrences from closely related

species [48], in most real modeling situations, this information is

lacking. Thus, instead of using our knowledge of the artificially

created biases, we produced bias grids by deriving a Gaussian

kernel density map of the occurrence locations, rescaled from 1 to

20, following Elith et al. [63]. These maps were implemented in

the biasfile option in MAXENT.

(3) Restricted Background. MAXENT, as most other

presence-pseudoabsence methods, generates a ‘‘background’’ or

‘‘pseudo-absence’’ sample of points [15]. It has been argued that

the selection of background points may strongly affect the resulting

model [64–66]. By default 10000 pseudo-absences are randomly

selected from the whole rectangular study area. This approach was

followed for all the other cases, as most SDM studies keep the

default MAXENT selection of background points. However,

according to Phillips [47], if occurrences are restricted to a fraction

of the study area, model performance can be enhanced by drawing

the background points from this fraction of the area. The

reliability of predictions should be improved when the model is

transferred to the rest of the area. Following this recommendation,

we randomly sampled 10000 pseudo-absences in buffer areas

around occurrences and used them as background samples in

MAXENT. Buffer size was a radius of 500 km for the virtual

species and Chrysemys picta, and 100 km for Plethodon cylindraceus.

(4) Cluster. Biased datasets typically lead to spatial autocor-

relation of records and artificial spatial clusters of observations thus

violating the assumption of independence [67]. This bias can be

circumvented by sampling one point per cluster in environmental

space [53,68,69]. We first performed a principal component

Figure 2. Locations of records used for modeling. (A) Virtualspecies; (B) Chrysemys picta; (C) Plethodon cylindraceusdoi:10.1371/journal.pone.0097122.g002

An Assessment of Methods for Correcting Sampling Bias in SDM

PLOS ONE | www.plosone.org 4 May 2014 | Volume 9 | Issue 5 | e97122

Page 5: Mapping Species Distributions with MAXENT Using a ...

analysis (PCA) on the environmental descriptors of occurrences

using the ‘‘ade4’’ R package [70] in order to define independent

axes in the environmental space. Then, we ran a cluster analysis

based on Euclidean distance on PCA axes space. The resulting

dendrogram was used to define a number of classes corresponding

to half of the occurrences. One record was randomly sampled per

class and the models were run on these subsampled datasets.

(5) Split. When occurrence frequency greatly differs between

two areas because of unequal sampling effort, the area can be split

in two strata within which coverage probability is more

homogeneous, and one model be computed for each stratum.

This method has been used for species occurring over a large

distribution range and extended environmental gradients

[44,51,71]. We split our biased datasets in a northern and a

southern stratum. We combined the model outputs to produce a

composite model for the entire range, keeping the highest value in

pixels where strata overlapped.

Evaluation of correction methodsTo estimate the ability of each correction method to recover the

information contained in the original unbiased data set, we used 4

criteria that correspond to the interests of different end users of

SDMs. We compared the models obtained after applying a bias in

the dataset to the original model, and after applying a sampling

bias correction method. The original dataset of the virtual species

was created to be unbiased. The model computed from it will be

thus referred as the unbiased model. Although we do not formally

know the actual bias in the Plethodon and Chrysemys original

datasets, we will also refer to the models computed from their

original datasets as unbiased models. We should keep in mind that

we compare the change in the resulting model, whatever the

original bias. The models referred as biased were computed after

applying a sampling bias and the corrected models after applying a

correction method to the biased dataset.

(1) AUC. The area under the receiver operating curve (ROC),

known as the AUC is one of the most common statistics to assess

model performance. AUC can be interpreted as the probability

that a presence cell have a higher predicted value than a absence

cell (or pseudo-absence), both of them being chosen randomly

[28]. Although the use of AUC for the evaluation of ecological

models has been criticized, especially when calculating against

background points rather than true-absences [72,73], it should be

reliable to compare models generated for a single species in the

same area and the same predictors.

The calculation of AUC was performed using the R package

‘‘PresenceAbsence’’ [74], using as test points the fraction of the

original dataset which was excluded to create the biased dataset.

The metrics was calculated by the comparison between these test

points and either (i) 10000 true absence points sampled outside the

range, which corresponds here to the true occupancy for the

virtual species, or (ii) 10000 background points randomly sampled

in all the study area for the real species. For original models, in the

absence of true test points, a mean AUC value was computed

Table 1. Environmental predictors included in MAXENT modeling for the virtual species, Chrysmeys picta and Plethodoncylindraceus.

Virtual species Chrysemys picta Plethodon cylindraceus

Variable

Altitude 6 6 6

Annual mean temperature 6 6 6

Mean diurnal range 6 6 6

Isothermality 6 6 6

Temperature annual range 6 6 6

Mean temperature of wettest quarter 6 6 6

Annual precipitation 6 6 6

Precipitation seasonality 6 6 6

Precipitation of warmest quarter 6 6 6

Precipitation of coldest quarter 6 6 6

Land cover 6 6 6

Maximum NDVI 6 6

Mean NDVI 6 6 6

Minimum NDVI 6 6 6

Mean temperature of driest quarter 6

Extent

Longitude

min 2140.00 2140.00 288.63

max 250.00 250.00 270.68

Latitude

min 20.00 20.00 28.91

max 75.00 75.00 45.46

The selected layers are indicated for each species, as well as the extent of modeling (in decimal degrees).doi:10.1371/journal.pone.0097122.t001

An Assessment of Methods for Correcting Sampling Bias in SDM

PLOS ONE | www.plosone.org 5 May 2014 | Volume 9 | Issue 5 | e97122

Page 6: Mapping Species Distributions with MAXENT Using a ...

using 5 random splits of the dataset, each subsample being used in

turn to evaluate the model.

(2) DGEO overlap in the geographical space. Several

metrics of niche overlap are available (e.g. D [75], modified

Hellinger distance I [76] or BC [77]). We used the Schoener’s D

index [75] that has been suggested to be best suited for SDM

outputs [78]. This statistics considers the probability distributions

across space of the difference in the probability of presence of two

species, based on their respective distribution models. Dgeo index is

ranged from 0 (no overlap) to 1 (complete overlap, identical

models).

(3) DENV overlap in the environmental space. We

estimated niche overlap in the environmental space between the

unbiased, biased, and corrected models. We used the PCA-env

approach described by Broennimann [79] to calculate Schoener’s

D index based on the environmental characteristics of two sets of

occurrences. This approach defines the environmental space by

the two first axes of a principal component analysis of all the pixels

of the study area. The niche overlap is calculated from the

smoothed density of occurrences in the environmental space

following a kernel density function applied on each dataset.

Depending on the correction method used, some models can be

based on the same input dataset (e.g. the biasfile correction uses the

same records as the biased model). In order to compare our

SDMs, we applied the PCA-env method on 500 points sampled

from the SDM outputs instead of on the input records. We

selected these points in the output map using the SDM probability

of presence as a sampling probability.

(4) GOVER overlap between binary maps. SDM outputs

are often converted to binary maps that are more tractable for

conservationists who for instance need to delineate protected

areas. In such maps, a pixel is considered as either suitable to the

species or not. We used the non-fixed 10th percentile training

presence threshold value to generate binary maps as proposed by

Liu et al. [80]. We then measured the overlap between biased and

corrected models, and unbiased models. This measure strictly

measures the geographical overlap whereas the D index estimates

the overlap of ecological niches in environmental space (Denv) or

projected onto the geographical space (Dgeo).

To evaluate the performance of bias correction, we derived new

indicators from AUC, Dgeo, Denv, and Gover that quantify the

improvement of the corrected model to the biased model,

standardized by the difference between the unbiased and the

biased models. These indices, named respectively DAUC, DDgeo,

DGover and DDenv were calculated as follows:

DAUC~ AUCcorrected{AUCbiasedð Þ= AUCunbiased{AUCbiasedð Þ

DDgeo~ Dgeocorrected{Dgeobiased

� �.1{Dgeobiased

� �

DGgover~ Govercorrected{Goverbiased

� �.1{Goverbiased

� �

DDenv~ Denvcorrected{Denvbiased

� �.1{Denvbiased

� �

These four indices range from -‘ to 1, a positive value

indicating that the model was actually corrected (with 1

corresponding to perfect correction, i.e. corrected model exactly

similar as the unbiased one) whereas a negative value indicates

that the correction produced a worse model than the biased one.

Results

Effect of bias type and intensity on model outputsThe biased models resulted in a reduction of AUC in all cases

and a deviation from the unbiased model for all overlap measures

(Figure 4). The deviation varied largely depending on species and

bias type though: 0.28–0.91 for Dgeo, 0.18–0.89 for Denv, 0.03–1 for

and Gover. For the virtual species, the bias type yielding the largest

deviation depended on the performance measures considered: ‘‘2

areas’’ for the AUC, ‘‘travel time’’ for Dgeo, ‘‘Center’’ for Denv and

‘‘Gradient’’ for Gover, the latter providing the lowest values of

overlap with the original unbiased models. However, the

differences between unbiased and biased models remained

moderate (mean percentage variation: AUC: 21.74%; Dgeo: 2

24.41%; Denv: 224.62%; Gover: 230.07%). All bias types had also

globally similar effects in terms of deviation from the unbiased

model (Figure 4). For Chrysemys picta, all evaluation measures were

strongly affected by the ‘‘center’’ bias. AUC decreased more than

5%, and overlaps with the unbiased model ranged from 0.26 to

0.49. In contrast, the P. cylindraceus dataset was overall weakly

affected by the biases so that the biased models did not lead to

Figure 3. Generation of sampling bias for the virtual species. Togenerate artificial sampling bias, the original dataset (here the virtualspecies) was altered into 4 different types of bias (rows), each with 3intensities (columns).doi:10.1371/journal.pone.0097122.g003

An Assessment of Methods for Correcting Sampling Bias in SDM

PLOS ONE | www.plosone.org 6 May 2014 | Volume 9 | Issue 5 | e97122

Page 7: Mapping Species Distributions with MAXENT Using a ...

noticeable differences with the unbiased model. The strong effect

of the ‘‘center’’ bias was visible in Plethodon cylindraceus only for the

overlap of binary maps (Gover). This bias in which only the central

zone is sampled may exclude a large part of the original

environmental space and lead to very inaccurate SDM outputs.

Interestingly, the decrease in AUC performance for all bias types

was more pronounced in C. picta than the two other datasets even

when the values of overlap were in the same range.

Relative performance of correction methodSince we evaluated the performance of correction methods

using indices with different sets of assumptions, interpretation may

slightly differ with the measure considered. However, as we mainly

aimed at comparing SDM outputs, i.e. maps of habitat suitability,

we primarily focused our interpretations on DDgeo which truly

evaluates the overlap between standard SDM maps. Moreover,

the two measures based on Schoener’s D, in geographic (DDgeo)

and in environmental space (DDenv), were highly correlated and

outputs of bias correction were qualitatively similar (Supporting

information, Figure S1). We discuss here results for DDgeo only.

Correction performance strongly depended on the species

(Table 2). Considering the three species together, less than half

(29%) of all combinations (species 6 bias type 6 bias intensity 6correction method) yielded corrected models (following DDgeo) with

more accurate predictions than the biased model. For the virtual

species, and considering DDgeo, 57% of corrected models (34 out of

60 combinations of bias type, bias intensity and correction

method) were more similar to the model generated with the

unbiased dataset than the biased model (Table 2). Most cases for

which no method was able to provide bias correction were

‘‘center’’ and ‘‘travel time’’ biases, with medium to high intensities.

Conversely, only 7% of P. cylindraceus models were corrected (4

cases out of 60), while 25% of C. picta models were corrected (15

cases out of 60) and offered a better result than the biased model.

Regardless of the species, the bias type, and the metrics

considered, the restricted background method failed to improve

the biased models in almost all tested cases. The other methods

performed better but were ranked differently depending on bias

type. Systematic sampling performed slightly better and more

consistently among the competing methods as shown by the

relative performance of each method across bias types (Figure 5).

Although systematic sampling was not always ranked first, it

showed very little deviation from the most performing method and

performed on average better than the others (for DDgeo: mean rank

Systematic sampling = 2.1161.08 SD; mean rank Split = 2.5361.08

SD, mean rank Cluster = 2.61361.23 SD; mean rank Biasfile

= 3.3161. 31 SD). In contrast, the restricted background method

recorded the least correction (mean rank Restricted background

= 4.4461.13 SD).

Overall, and considering only DDgeo, the systematic sampling

method was able to correct the bias (DDgeo.0) in 33% of the test

cases. This success rate rose to 66% in the case of the virtual

species, for which we were able to compare to a true unbiased

model. However, the biasfile corrected the initial bias in 23% of

test cases. The cluster and split method were both efficient in 23%

of cases while only 6% of cases were corrected by the restricted

background method.

Interestingly, relative performance between methods was

consistent across metrics (Figure 5). The restricted background

method was always the least performing one in terms of DAUC,

Figure 4. Evaluation indices of biased models across bias intensities. Reduction of AUC between unbiased and biased models (inpercentage), Schoener’s D overlaps between biased and unbiased models computed on SDMs (Dgeo) and in environmental space (Denv), and Gover thebinary distribution overlap (mean 6 SD). Dark grey bars: Chrysemys picta, light grey bars: Plethodon cylindraceus, black bars: virtual species.doi:10.1371/journal.pone.0097122.g004

An Assessment of Methods for Correcting Sampling Bias in SDM

PLOS ONE | www.plosone.org 7 May 2014 | Volume 9 | Issue 5 | e97122

Page 8: Mapping Species Distributions with MAXENT Using a ...

Ta

ble

2.

Me

anco

rre

ctio

np

erf

orm

ance

acro

ss1

0re

plic

ate

s,fo

re

ach

spe

cie

s,b

ias

typ

e,

bia

sin

ten

sity

and

corr

ect

ion

me

tho

d.

2ar

eas

Ce

nte

rG

rad

ien

tT

rave

lti

me

low

me

diu

mh

igh

low

me

diu

mh

igh

low

me

diu

mh

igh

low

me

diu

mh

igh

(a)

DA

UC

Ch

ryse

mys

pic

ta

Bia

sfile

20

.64

20

.12

0.0

12

0.0

72

0.0

30

.25

*2

0.9

32

0.6

12

0.3

12

2.0

92

0.5

72

0.2

Clu

ste

r0

.09

0.0

90

.28

0.2

40

.19

0.1

22

0.0

82

0.0

60

.01

21

.12

20

.22

*2

0.0

5*

Re

stri

cte

db

ackg

rou

nd

20

.02

20

.21

20

.16

20

.43

20

.69

20

.85

20

.12

20

.22

20

.16

20

.96

20

.46

20

.62

Split

0.6

0*

0.6

4*

0.7

4*

20

.12

20

.14

0.1

0.1

0*

0.0

2*

0.1

4*

20

.94

*2

0.2

82

0.4

3

Syst

em

atic

sam

plin

g2

0.8

50

.19

0.4

20

.31

*0

.33

*2

0.8

22

0.9

72

0.5

82

0.1

62

3.7

62

1.1

72

0.2

6

Ple

tho

do

ncy

lin

dra

ceu

s

Bia

sfile

0.1

20

.14

20

.03

0.1

0.0

90

.22

0.0

32

0.2

42

0.1

12

0.0

72

0.0

20

.02

Clu

ste

r0

.17

0.1

10

.15

0.1

90

.19

0.2

60

.03

0.0

0*

20

.12

0.0

20

.02

*2

0.0

8

Re

stri

cte

db

ackg

rou

nd

21

0.0

32

10

.45

21

0.4

92

5.3

62

4.1

22

5.0

42

6.1

92

10

.64

27

.88

21

2.7

42

9.4

72

11

.12

Split

0.0

30

.01

0.2

20

.12

20

.04

0.2

20

.13

20

.09

20

.18

20

.07

20

.07

20

.24

Syst

em

atic

sam

plin

g0

.31

*0

.32

*0

.42

*0

.51

*0

.51

*0

.50

*0

.13

*2

0.0

90

.14

*0

.17

*0

.01

0.0

4*

Vir

tua

lsp

eci

es

Bia

sfile

23

.30

.42

0.4

90

.17

0.3

5*

0.2

62

0.1

0.3

40

.41

22

.87

22

.40

*2

0.2

7*

Clu

ste

r2

0.7

50

.11

0.2

42

0.1

82

0.1

52

0.1

72

0.0

10

.09

0.0

72

1.8

22

3.3

20

.43

Re

stri

cte

db

ackg

rou

nd

22

.09

20

.34

20

.78

21

.97

21

0.0

52

1.0

42

1.2

62

1.5

29

.87

21

8.7

62

6.4

3

Split

0.1

8*

0.7

2*

0.8

2*

20

.22

0.2

92

0.1

70

.38

*0

.42

*0

.60

*2

1.8

0*

22

.48

20

.53

Syst

em

atic

sam

plin

g2

3.2

60

.45

0.7

20

.39

*0

.33

0.3

5*

20

.83

20

.08

0.3

92

2.5

42

2.7

72

0.6

9

(b)

DD

ge

o

Ch

ryse

mys

pic

ta

Bia

sfile

20

.41

20

.21

0.0

30

.30

.28

0.3

12

0.4

92

0.3

22

0.1

32

0.7

22

0.5

62

0.2

8

Clu

ste

r2

0.3

42

0.2

82

0.1

30

.01

0.0

80

.03

20

.27

20

.13

*2

0.0

32

0.1

8*

20

.14

*0

.00

*

Re

stri

cte

db

ackg

rou

nd

20

.31

20

.19

20

.11

20

.11

20

.09

20

.12

20

.19

*2

0.1

62

0.0

92

0.2

82

0.3

32

0.4

4

Split

20

.24

*0

.07

*0

.37

*2

0.1

42

0.0

80

.04

20

.27

20

.23

20

.05

20

.37

20

.28

20

.21

Syst

em

atic

sam

plin

g2

0.4

72

0.1

90

.02

0.4

1*

0.3

9*

0.3

2*

20

.47

20

.25

0.0

2*

20

.59

20

.31

20

.07

Ple

tho

do

ncy

lin

dra

ceu

s

Bia

sfile

21

.17

21

.14

20

.88

21

.12

0.7

52

0.6

22

0.9

62

0.9

82

0.9

42

1.2

20

.76

20

.61

Clu

ste

r2

0.0

5*

20

.07

*2

0.0

2*

20

.17

20

.09

20

.03

20

.15

20

.12

*2

0.1

42

0.0

1*

20

.06

0.0

2

Re

stri

cte

db

ackg

rou

nd

23

.95

22

.72

22

.27

24

.55

23

.64

23

.08

22

.15

22

.32

1.9

52

3.8

23

.08

22

.54

Split

20

.12

20

.23

20

.73

20

.16

20

.05

20

.02

20

.14

20

.23

20

.26

20

.18

20

.09

*2

0.0

7

Syst

em

atic

sam

plin

g2

0.1

62

0.2

92

0.1

92

0.0

6*

0.0

4*

0.0

1*

20

.10

*2

0.1

92

0.0

9*

20

.24

20

.10

.03

*

Vir

tua

lsp

eci

es

Bia

sfile

0.1

40

.46

0.0

90

.61

*2

0.6

4*

21

.35

0.3

60

.25

0.4

70

.79

*2

0.0

9*

20

.07

*

An Assessment of Methods for Correcting Sampling Bias in SDM

PLOS ONE | www.plosone.org 8 May 2014 | Volume 9 | Issue 5 | e97122

Page 9: Mapping Species Distributions with MAXENT Using a ...

Ta

ble

2.

Co

nt.

2ar

eas

Ce

nte

rG

rad

ien

tT

rave

lti

me

low

me

diu

mh

igh

low

me

diu

mh

igh

low

me

diu

mh

igh

low

me

diu

mh

igh

Clu

ste

r0

.28

0.3

22

0.5

50

.22

2.1

22

.44

0.4

10

.18

0.3

10

.75

20

.44

20

.27

Re

stri

cte

db

ackg

rou

nd

00

.22

0.7

62

0.0

72

2.7

42

2.8

90

.25

20

.22

-0.1

40

.53

21

.76

21

.4

Split

0.4

30

.58

*0

.32

*0

.12

2.5

12

2.3

50

.49

*0

.25

0.4

10

.72

20

.64

20

.51

Syst

em

atic

sam

plin

g0

.32

*0

.51

0.1

40

.41

21

.35

20

.99

*0

.45

0.3

0*

0.5

0*

0.7

82

0.1

42

0.4

2

(c)

DG

ov

er

Ch

ryse

mys

pic

ta

Bia

sfile

0.1

80

.25

0.0

90

.26

0.0

70

20

.10

*2

0.0

52

0.2

32

0.1

1*

20

.19

20

.2

Clu

ste

r2

0.3

72

0.1

42

0.0

72

0.0

10

.06

0.0

12

0.3

82

0.0

92

0.1

52

0.5

52

0.4

52

0.8

4

Re

stri

cte

db

ackg

rou

nd

20

.12

20

.09

20

.05

0.0

10

20

.01

20

.14

20

.13

20

.08

20

.22

0.3

22

0.1

0*

Split

20

.01

0.0

80

.57

*2

0.0

42

0.0

50

20

.53

20

.56

20

.18

20

.99

20

.92

1.2

5

Syst

em

atic

sam

plin

g0

.33

*0

.40

*0

.31

0.6

5*

0.4

1*

0.3

4*

20

.25

0.0

5*

0.1

8*

20

.23

0.2

5*

20

.98

Ple

tho

do

ncy

lin

dra

ceu

s

Bia

sfile

0.2

90

.74

*0

.52

0.7

1*

0.7

2*

0.6

3*

0.4

8*

0.3

30

.43

*0

.43

*0

.29

*0

.32

*

Clu

ste

r0

.29

0.3

0.2

30

.39

0.1

40

.11

0.0

80

.39

0.1

50

.26

20

.23

0.1

1

Re

stri

cte

db

ackg

rou

nd

20

.62

20

.19

20

.45

20

.34

20

.17

20

.21

02

0.3

52

0.1

72

1.0

92

0.7

92

0.8

7

Split

0.5

2*

0.4

0.8

3*

0.5

40

.51

0.3

20

.41

0.4

8*

0.2

90

.19

20

.09

20

.03

Syst

em

atic

sam

plin

g0

.48

0.6

50

.62

0.6

70

.49

0.4

60

.26

0.1

20

.21

0.4

3*

0.0

10

.31

Vir

tua

lS

pe

cie

s

Bia

sfile

0.4

1*

0.7

7*

20

.31

20

.25

22

.33

23

.09

0.8

0*

0.6

5*

0.6

60

.86

*2

0.3

92

0.4

4

Clu

ste

r2

0.2

10

.22

2.3

72

1.1

32

3.5

32

3.0

50

.61

0.2

90

.40

.82

0.7

32

0.6

2

Re

stri

cte

db

ackg

rou

nd

00

.34

22

.21

21

.17

23

.72

3.1

70

.69

0.3

90

.41

0.8

32

0.5

32

0.4

2*

Split

0.3

10

.62

0.0

2*

21

.16

23

.68

23

.03

0.6

90

.47

0.6

10

.83

20

.64

20

.57

Syst

em

atic

sam

plin

g0

.05

0.6

22

0.0

70

.02

*2

1.3

7*

22

.01

*0

.62

0.4

20

.73

*0

.86

*0

.17

*2

0.7

2

Po

siti

veva

lue

s,i.e

.ca

ses

wh

ere

the

bia

sw

asac

tual

lyco

rre

cte

d,

are

sho

wn

inb

old

.Fo

re

ach

com

bin

atio

no

fsp

eci

es

and

bia

s(t

ype6

inte

nsi

ty),

the

be

stm

eth

od

(i.e

.th

eo

ne

wh

ich

has

the

hig

he

stva

lue

)is

hig

hlig

hte

db

yan

aste

risk

.C

orr

ect

ion

pe

rfo

rman

ceis

est

imat

ed

by

thre

em

eas

ure

s:(a

)D

AU

C,

(b)D

Dg

eo,

(c)D

Go

ve

r.d

oi:1

0.1

37

1/j

ou

rnal

.po

ne

.00

97

12

2.t

00

2

An Assessment of Methods for Correcting Sampling Bias in SDM

PLOS ONE | www.plosone.org 9 May 2014 | Volume 9 | Issue 5 | e97122

Page 10: Mapping Species Distributions with MAXENT Using a ...

DDgeo and DGover (but for C. picta and DDgeo, for which it is ranked

4/5). The systematic sampling method was among the most

performing methods. However, even if systematic sampling was

overall the most efficient method across species in terms of DDgeo, it

was outperformed by the split or biasfile methods in some cases for

the virtual species, or by the cluster method under some

combinations of bias 6 intensity for the two real species

(Table 2). However, when the systematic sampling was unable to

resolve the bias, this latter was most often equally poorly corrected

by any of the methods tested.

Discussion

As an unexpected first finding, we noticed that the range of

AUC values obtained for biased and corrected models remained

high even for models with the strongest biases. The decrease in

AUC observed after applying the bias was moderate, less than 2%

on average, across species and bias type. Moreover, the AUC

values of the biased models were almost always over 0.8 or 0.9,

which would classify the models as ‘‘good’’ or ‘‘very good’’ (Araujo

et al. [81] adapted from Swets [82]). Together with other studies

[72,73,83] our results highlight that this measure may poorly

reflect model accuracy. Therefore, studies that focus solely on the

AUC value should interpret their results with caution. AUC may

be a good statistical measure of discrimination ability, but it often

fails to quantify the ecological realism of modeled distribution

[72,73,83] especially when estimated from presence-only data.

Because we have a reference model, we will mainly focus on the

overlap indices with the unbiased model as a measure of predictive

accuracy performance.

Contrary to previous studies investigating sampling bias

correction in SDM that focused on a few methods and simple

biases [41,52–54], we reviewed here five different ways to deal

with sampling bias and used both real and virtual datasets under

various bias scenarios. We also considered bias intensity that has

been to our knowledge never assessed and proved to be of as a

high concern as the type of bias. Moreover, instead of relying only

on classical measures of SDM performance such as AUC (as used

in Syfert et al. [41], Varela et al. [53] and Boria et al; [54]) or

omission/commission error (as used in Kramer-Schadt et al. [52]),

we evaluated the correction performance by directly comparing

the SDM outputs. Therefore, we actually assessed the ability of the

tested methods to recover the unbiased model, which is the

expected behavior of an efficient sampling bias correction. In

addition, rather than basing our conclusions on island species

[41,52,54], we used continental species whose distributions are

clearly shaped by climate and not by a geographically bound

space.

Our results clearly evidence that the different methods of

sampling bias correction tested here may have very variable

efficiency depending on the modeling conditions (biases type and

correction method). Interestingly, the correction may have a

positive effect, and actually contributes to correct the bias;

nonetheless, in some cases it may produce a poorest model than

the biased model. These results suggest that the problem of

sampling bias in species distribution modeling has probably

multiple answers depending on the context. We especially

emphasize that the type and intensity of bias influence the ability

of various methods to resolve the initial bias.

However, correction methods did not perform equally across

the various conditions. The less efficient method restricted the

spatial extent of the background whereas in other methods, the

background points were selected from the whole available

environment (i.e. randomly drawn from the area covered by the

environmental grid files). Surprisingly, this method is often used

and have been contributed to improve SDM performance in some

cases [47,48]. However, as suggested by Thuiller et al. [84] and

Vanderwal et al. [85], excessively restricting the geographical

extent of pseudo-absences to a narrow area or selecting them from

a too large area reduces model accuracy. Background selection

may greatly influence the resulting model as it determines the

underlying assumptions of the model to use [64]. Therefore, this

step should be undertaken with caution. The size of the buffer used

for background selection also greatly influences model perfor-

Figure 5. Rank of each method to correct sampling bias. Mean ranks 6 standard-error for the performance of each method to correctsampling bias for each species (Chrysemys picta: left, Plethodon cylindraceus: center, virtual species: right), following 3 measures of correctionperformance: DAUC (left), DDgeo (centre), and DGover (right). For each type of bias and bias intensity, the method which results in the most efficientcorrection is set to 1 whereas the least powerful method is set to 5. The plotted values are the mean rank across the 4 types of bias and 3 intensities.doi:10.1371/journal.pone.0097122.g005

An Assessment of Methods for Correcting Sampling Bias in SDM

PLOS ONE | www.plosone.org 10 May 2014 | Volume 9 | Issue 5 | e97122

Page 11: Mapping Species Distributions with MAXENT Using a ...

mance. For instance, AUC often increases with the size of the

study area because it contributes to include background points that

have environmental characteristics greatly distant from the species

requirement, resulting in artificial increase of SDM validation

[65]. The selection of the training area should therefore be strictly

relevant to the ecology of the species and the objective of the study.

A relevant selection of the training area (the geographic region in

which background points are selected) should reflect the

geographical space accessible to the species over a given time

period [65]. It may thus be essential to carry out a rigorous

investigation of the optimal geographic distance between the set of

occurrences used to train the model and background points. It has

to be both optimal for model training and biologically meaningful.

The interpretation of the modeled distribution must also be

engaged carefully as it may reflect the fundamental niche or the

true occupied range, and often a position between both.

Regarding the high variability in correction performance of the

different methods depending on various factors, it is difficult to

propose a universal guideline to solving sampling bias. It might be

advisable to evaluate first several types of correction. The final

choice of correction method would be then based on their effect in

classical SDM evaluation metrics (e.g. AUC, Kappa, True Skill

Statistics TSS) and possibly the adequacy of output maps to a priori

knowledge of the species distribution. A first useful step might also

be to evaluate bias type to design and select the most appropriate

correction. For instance, the split method makes sense only if the

most and less sampled areas are at least roughly known. Most of

the time, sampling bias is only inferred by the known sampling

effort or empirical knowledge of the species distribution. The true

severity and shape of this bias is almost always lacking. However,

in some cases, bias can be evaluated by comparing the geographic

distribution of the available occurrences to known sources of bias.

An estimation of sampling probabilities across the study area can

provide insights on the potential bias that may affect the collected

observations and help in further choice of the correction method.

The known characteristics of the modeled species may also

condition the strategy to use. A species with an expected very large

geographical and/or environmental range should benefit from

being split into two or more partitions that are combined

afterward [51]. Finally, the five methods tested here are not

necessarily exclusive. For instance, it is possible to split a large

dataset and apply systematic sampling to each dataset in species

with broad distributions [44] or to apply the biasfile method after

using first another correction method

Nevertheless, we found that only systematic sampling constantly

performed well irrespective of species and bias type. Interestingly

enough, it happened to be the simplest and most obvious way to

solve the geographic bias. Beside, this method can be quickly

applied to any dataset, even if the nature of the bias is unknown.

Kramer-Schadt et al. [52] and Boria et al. [54] also identified this

technique, which they named spatial filtering, as the most effective

(but note that Varela et al. [53] found that environmental filtering,

equivalent to our cluster method, may provide better results). We

might keep in mind that this method may have a few drawbacks

though. Subsampling the complete dataset may alter the

distribution of occurrences in the environmental space and

exclude some portion of environmental space from the input

records. This problem may be circumvented by adjusting the grid

size but this may not be possible when the sample size of

occurrences (presences) is too low. On the contrary, smoothing the

distribution of occurrences may lead to overestimating the

probability presence in marginal areas. The grid cell size used to

sample the occurrences may also be a source of problem; it must

be large enough to resolve the bias but not too large to result in a

strong loss in resolution. A fine adjustment of the parameters of the

method (e.g. the resolution of systematic sampling that provides the

optimal trade-off between sampling bias correction and informa-

tion reduction) is thus necessary to maximize its performance.

Our results suggest that the difference in performance between

the systematic sampling method and the optimal method for the

dataset under investigation is slight. Since systematic sampling

seems to be robust enough to differentiate sampling bias and

species, we suggest that this could be the method selected first if no

further attempts to correct sampling bias are to be made. This

method provides a quick and efficient way to improve SDM from

a biased dataset and is likely to be relevant most of the time. Even

if the elaboration of a step-by-step framework would be ideal to

assist in the definition of a reliable strategy of bias correction, we

highlight that most SDMs studies, at least those that use

MAXENT as modeling algorithm, would highly benefit from this

simple tweaking of input occurrences. Regarding the major

impacts that conservation actions are expected to provide

[86,87], they must be based as much as possible on irrefutable

models. Therefore, it is essential to develop simple methods

intended for conservation practitioners, such as techniques of

sampling bias correction, to build robust distribution models. In

this regard, we encourage MAXENT users to carefully take

sampling bias into account, and to use systematic sampling of their

input occurrences as a quick and simple resolution of bias.

Supporting Information

Figure S1 Correlations between measures of correctionperformance.

(PDF)

Material S1 Details of the generation of sampling bias.

(PDF)

Author Contributions

Conceived and designed the experiments: YF JOE DR JS. Analyzed the

data: YF. Wrote the paper: YF JS.

References

1. Purvis A, Gittleman JL, Cowlishaw G, Mace GM (2000) Predicting extinction

risk in declining species. Proc R Soc B Biol Sci 267: 1947–1952.

2. Mace GM, Collar NJ, Gaston KJ, Hilton-Taylor C, Akcakaya HR, et al. (2008)

Quantification of extinction risk: IUCN’s system for classifying threatened

species. Conserv Biol 22: 1424–1442.

3. IUCN (2001) IUCN Red List Categories and Criteria, Version 3.1. Gland,

Switzerland: IUCN – The World Conservation Union.

4. Harris G, Pimm SL (2008) Range size and extinction risk in forest birds. Conserv

Biol 22: 163–171.

5. Rodrigues ASL, Pilgrim JD, Lamoreux JF, Hoffmann M, Brooks TM (2006)

The value of the IUCN Red List for conservation. Trends Ecol Evol 21: 71–76.

6. Myers N, Mittermeier RA, Mittermeier CG, Da Fonseca GA, Kent J (2000)

Biodiversity hotspots for conservation priorities. Nature 403: 853–858.

7. Mittermeier R, Myers N, Thomsen J, Da Fonseca G, Olivieri S (1998)

Biodiversity hotspots and major tropical wilderness areas: approaches to setting

conservation priorities. Conserv Biol 12: 516–520.

8. Reid W (1998) Biodiversity hotspots. Trends Ecol Evol 13: 275–280.

9. Moilanen A, Franco AMA, Early RI, Fox R, Wintle B, et al. (2005) Prioritizing

multiple-use landscapes for conservation: methods for large multi-species

planning problems. Proc R Soc B Biol Sci 272: 1885–1891.

10. Peterson A (2006) Uses and requirements of ecological niche models and related

distributional models. Biodivers Informatics: 59–72.

11. Ferrier S (1984) The status of the Rufous Scrub-bird Atrichornis rufescens:

habitat, geographical variation and abundance. PhD thesis. Armidale, Australia:

University of New England.

An Assessment of Methods for Correcting Sampling Bias in SDM

PLOS ONE | www.plosone.org 11 May 2014 | Volume 9 | Issue 5 | e97122

Page 12: Mapping Species Distributions with MAXENT Using a ...

12. Austin M, Belbin L, Meyers J, Doherty M, Luoto M (2006) Evaluation of

statistical models used for predicting plant species distributions: Role of artificial

data and theory. Ecol Modell 199: 197–216.

13. Elith J, Leathwick JR (2009) Species Distribution Models: Ecological

Explanation and Prediction Across Space and Time. Annu Rev Ecol Evol Syst

40: 677–697.

14. Soberon J, Peterson AT (2005) Interpretation of models of fundamental

ecological niches and species’ distributional areas. Biodivers Informatics 2: 1–10.

15. Elith J, Phillips SJ, Hastie T, Dudık M, Chee YE, et al. (2011) A statistical

explanation of MaxEnt for ecologists. Divers Distrib 17: 43–47.

16. Elith J, Burgman M (2002) Predictions and their validation: Rare plants in the

Central Highlands, Victoria, Australia. Predicting Species Occurrences: Issues of

Accuracy and Scale. Washington, USA: Island Press. pp. 303–313.

17. Jimenez-Valverde A, Peterson AT, Soberon J, Overton JM, Aragon P, et al.

(2011) Use of niche models in invasive species risk assessments. Biol Invasions 13:

2785–2797.

18. Jeschke J, Strayer D (2008) Usefulness of bioclimatic models for studying climate

change and invasive species. Ann N Y Acad Sci 1134: 1–24.

19. Sinclair S, White M, Newell G (2010) How useful are species distribution models

for managing biodiversity under future climates. Ecol Soc 15: 8.

20. Thorn JS, Nijman V, Smith D, Nekaris KAI (2009) Ecological niche modelling

as a technique for assessing threats and setting conservation priorities for Asian

slow lorises (Primates: Nycticebus). Divers Distrib 15: 289–298.

21. Franklin J (2009) Mapping Species Distributions: Spatial Inference and

Prediction. Cambridge, UK: Cambridge University Press.

22. Boyce M, McDonald L (1999) Relating populations to habitats using resource

selection functions. Trends Ecol Evol 14: 268–272.

23. Manly BFJ, McDonald LL, Thomas D (1993) Resource Selection by Animals:

Statistical Design and Analysis for Field Studies. Chapman & Hall.

24. McCullagh P, Nelder JA (1989) Generalized Linear Models. London, UK:

Chapman & Hall.

25. Ripley BD (1996) Pattern Recognition and Neural Networks. Cambridge, UK:

Cambridge University Press.

26. Phillips SJ, Dudık M, Schapire RE (2004) A maximum entropy approach to

species distribution modeling. Proceedings of the twenty-first international

conference on Machine learning. New York, New York, USA: ACM. p. 83.

27. Breiman L, Friedman J, Stone CJ, Olshen RA (1984) Classification and

Regression Trees. Chapman & Hall.

28. Elith J, Graham CH, Anderson RP, Dudık M, Ferrier S, et al. (2006) Novel

methods improve prediction of species’ distributions from occurrence data.

Ecography 29: 129–151.

29. Wisz MS, Hijmans RJ, Li J, Peterson AT, Graham CH, et al. (2008) Effects of

sample size on the performance of species distribution models. Divers Distrib 14:

763–773.

30. Phillips SJ, Anderson RP, Schapire RE (2006) Maximum entropy modeling of

species geographic distributions. Ecol Modell 190: 231–259.

31. Elith J, Leathwick JR (2006) Conservation prioritisation using species

distribution modelling. In: Moilanen A, Wilson KA, Possingham HP, editors.

Spatial conservation prioritization: quantitative methods and computational

tools. Oxford, UK: Oxford University Press.

32. Hijmans RJ, Cameron SE, Parra JL, Jones PG, Jarvis A (2005) Very high

resolution interpolated climate surfaces for global land areas. Int J Climatol 25:

1965–1978.

33. Kozak KH, Graham CH, Wiens JJ (2008) Integrating GIS-based environmental

data into evolutionary biology. Trends Ecol Evol 23: 141–148.

34. Dennis R, Thomas CD (2000) Bias in butterfly distribution maps: the influence

of hot spots and recorder’s home range. J Insect Conserv 4: 73–77.

35. Kadmon R, Farber O, Danin A (2004) Effect of roadside bias on the accuracy of

predictive maps produced by bioclimatic models. Ecol Appl 14: 401–413.

36. Leitao PJ, Moreira F, Osborne PE (2011) Effects of geographical data sampling

bias on habitat models of species distributions: a case study with steppe birds in

southern Portugal. Int J Geogr Inf Sci 25: 439–454.

37. Bystriakova N, Peregrym M, Erkens RHJ, Bezsmertna O, Schneider H (2012)

Sampling bias in geographic and environmental space and its effect on the

predictive power of species distribution models. Syst Biodivers 10: 1–11.

38. Edwards TC, Cutler DR, Zimmermann NE, Geiser L, Moisen GG (2006)

Effects of sample survey design on the accuracy of classification tree models in

species distribution models. Ecol Modell 199: 132–141.

39. Loiselle B, Jørgensen P, Consiglio T, Jimenez I, Blake JG, et al. (2008) Predicting

species distributions from herbarium collections: does climate bias in collection

sampling influence model outcomes? J Biogeogr 35: 105–116.

40. Reddy S, Davalos LM (2003) Geographical sampling bias and its implications

for conservation priorities in Africa. J Biogeogr 30: 1719–1727.

41. Syfert MM, Smith MJ, Coomes DA (2013) The effects of sampling bias and

model complexity on the predictive performance of MaxEnt species distribution

models. PLoS One 8: e55158.

42. Costa GC, Nogueira C, Machado RB, Colli GR (2009) Sampling bias and the

use of ecological niche modeling in conservation planning: a field evaluation in a

biodiversity hotspot. Biodivers Conserv 19: 883–899.

43. Beck J, Boller M, Erhardt A, Schwanghart W (2013) Spatial bias in the GBIF

database and its effect on modelling species’ geographic distributions. Ecol

Inform 19: 10–15.

44. Fourcade Y, Engler JO, Besnard AG, Rodder D, Secondi J (2013) Confronting

expert-based and modelled distributions for species with uncertain conservation

status: a case study from the Corncrake (Crex crex). Biol Conserv 167: 161–171.

45. Araujo MB, Guisan A (2006) Five (or so) challenges for species distribution

modelling. J Biogeogr 33: 1677–1688.

46. Hijmans RJ, Elith J (2012) Species distribution modeling with R. http://cran.r-

project.org/web/packages/dismo/vignettes/dm.pdf.

47. Phillips SJ (2008) Transferability, sample selection bias and background data in

presence-only modelling: a response to Peterson, et al. (2007). Ecography 31:

272–278.

48. Phillips SJ, Dudık M, Elith J, Graham CH, Lehmann A, et al. (2009) Sample

selection bias and presence-only distribution models: implications for back-

ground and pseudo-absence data. Ecol Appl 19: 181–197.

49. Dudık M, Schapire RE, Phillips SJ (2005) Correcting sample selection bias in

maximum entropy density estimation. Appearing in Advances in Neural

Information Processing Systems. Vol. 18.

50. Rodder D, Lotters S (2009) Niche shift versus niche conservatism? Climatic

characteristics of the native and invasive ranges of the Mediterranean house

gecko (Hemidactylus turcicus). Glob Ecol Biogeogr 18: 674–687.

51. Osborne PE, Suarez-Seoane S (2002) Should data be partitioned spatially before

building large-scale distribution models? Ecol Modell 157: 249–259.

52. Kramer-Schadt S, Niedballa J, Pilgrim JD, Schroder B, Lindenborn J, et al.

(2013) The importance of correcting for sampling bias in MaxEnt species

distribution models. Divers Distrib 19: 1366–1379.

53. Varela S, Anderson RP, Garcıa-Valdes R, Fernandez-Gonzalez F (2014)

Environmental filters reduce the effects of sampling bias and improve predictions

of ecological niche models. Ecography: 10.1111/j.1600–0587.2013.00441.x.

54. Boria RA, Olson LE, Goodman SM, Anderson RP (2014) Spatial filtering to

reduce sampling bias can improve the performance of ecological niche models.

Ecol Modell 275: 73–77.

55. Veloz SD (2009) Spatially autocorrelated sampling falsely inflates measures of

accuracy for presence-only niche models. J Biogeogr 36: 2290–2299.

56. Arino O, Bicheron P, Achard F, Latham J, Witt R, et al. (2008) GLOBCOVER

The most detailed portrait of Earth. ESA Bull - Eur Sp Agency 136: 24–31.

57. Maisongrande P, Duchemin B, Dedieu G (2004) VEGETATION/SPOT: an

operational mission for the Earth monitoring; presentation of new standard

products. Int J Remote Sens 25: 9–14.

58. Heikkinen RK, Luoto M (2006) Methods and uncertainties in bioclimatic

envelope modelling under climate change. Prog Phys Geogr 6: 751–777.

59. Sagarin RD, Gaines SD (2002) The ‘‘abundant centre’’ distribution: to what

extent is it a biogeographical rule? Ecol Lett 5: 137–147.

60. Nelson A (2008) Travel time to major cities: A global map of Accessibility.

Luxembourg: Office for Official Publications of the European Communities.

61. McCarthy K, Fletcher RJ (2012) Predicting Species Distributions from Samples

Collected along Roadsides. Conserv Biol 26: 68–77.

62. Ihlow F, Dambach J, Engler JO, Flecks M, Hartmann T, et al. (2012) On the

brink of extinction? How climate change may affect global chelonian species

richness and distribution. Glob Chang Biol 18: 1520–1530.

63. Elith J, Kearney M, Phillips SJ (2010) The art of modelling range-shifting

species. Methods Ecol Evol 1: 330–342.

64. Chefaoui RM, Lobo JM (2008) Assessing the effects of pseudo-absences on

predictive distribution model performance. Ecol Modell 210: 478–486.

65. Barve N, Barve V, Jimenez-Valverde A, Lira-Noriega A, Maher SP, et al. (2011)

The crucial role of the accessible area in ecological niche modeling and species

distribution modeling. Ecol Modell 222: 1810–1819.

66. Acevedo P, Jimenez-Valverde A, Lobo JM, Real R (2012) Delimiting the

geographical background in species distribution modelling. J Biogeogr 39: 1383–

1390.

67. Dormann CF, McPherson JM, Araujo MB, Bivand R, Bolliger J, et al. (2007)

Methods to account for spatial autocorrelation in the analysis of species

distributional data: a review. Ecography 30: 609–628.

68. Rodder D, Kielgast J, Bielby J, Schmidtlein S, Bosch J, et al. (2009) Global

Amphibian Extinction Risk Assessment for the Panzootic Chytrid Fungus.

Diversity 1: 52–66.

69. Stiels D, Schidelko K, Engler JO, Elzen R, Rodder D (2011) Predicting the

potential distribution of the invasive Common Waxbill Estrilda astrild

(Passeriformes: Estrildidae). J Ornithol: 769–780.

70. Dray S, Dufour A (2007) The ade4 package: implementing the duality diagram

for ecologists. J Stat Softw 22: 1–20.

71. Gonzalez S, Soto-Centeno J, Reed D (2011) Population distribution models:

species distributions are better modeled using biologically relevant data

partitions. BMC Ecol 11: 20.

72. Lobo JM, Jimenez-Valverde A, Real R (2008) AUC: a misleading measure of the

performance of predictive distribution models. Glob Ecol Biogeogr 17: 145–151.

73. Jimenez-Valverde A, Acevedo P, Barbosa a M, Lobo JM, Real R (2012)

Discrimination capacity in species distribution models depends on the

representativeness of the environmental domain. Glob Ecol Biogeogr 22: 508–

516.

74. Freeman EA, Moisen G (2008) {PresenceAbsence}: An R Package for Presence

Absence Analysis. J Stat Softw 23: 1–31.

75. Schoener T (1968) The Anolis lizards of Bimini: resource partitioning in a

complex fauna. Ecology 49: 704–726.

An Assessment of Methods for Correcting Sampling Bias in SDM

PLOS ONE | www.plosone.org 12 May 2014 | Volume 9 | Issue 5 | e97122

Page 13: Mapping Species Distributions with MAXENT Using a ...

76. Warren DL, Glor RE, Turelli M (2008) Environmental niche equivalency versus

conservatism: quantitative approaches to niche evolution. Evolution 62: 2868–2883.

77. Bray J, Curtis J (1957) An ordination of the upland forest communities of

southern Wisconsin. Ecol Monogr 7: 325–349.78. Rodder D, Engler JO (2011) Quantitative metrics of overlaps in Grinnellian

niches: advances and possible drawbacks. Glob Ecol Biogeogr 20: 915–927.79. Broennimann O, Fitzpatrick MC, Pearman PB, Petitpierre B, Pellissier L, et al.

(2012) Measuring ecological niche overlap from occurrence and spatial

environmental data. Glob Ecol Biogeogr 21: 481–497.80. Liu C, Berry P, Dawson T, Pearson R (2005) Selecting thresholds of occurrence

in the prediction of species distributions. Ecography 28: 385–393.81. Araujo MB, Pearson R, Thuiller W, Erhard M (2005) Validation of species–

climate impact models under climate change. Glob Chang Biol 11: 1504–1513.82. Swets JA (1988) Measuring the accuracy of diagnostic systems. Science 240:

1285–1293.

83. Jimenez-Valverde A (2014) Threshold-dependence as a desirable attribute for

discrimination assessment: implications for the evaluation of species distribution

models. Biodivers Conserv 23: 369–385.

84. Thuiller W, Brotons L, Araujo MB, Lavorel S (2004) Effects of restricting

environmental range of data to project current and future species distributions.

Ecography 27: 165–172.

85. VanDerWal JJ, Shoo L, Graham CH, Williams S (2009) Selecting pseudo-

absence data for presence-only distribution modeling: How far should you stray

from what you know? Ecol Modell 220: 589–594.

86. Adams WM, Aveling R, Brockington D, Dickson B, Elliott J, et al. (2004)

Biodiversity conservation and the eradication of poverty. Science 306: 1146–

1149.

87. Hoffmann M, Hilton-Taylor C, Angulo A, Bohm M, Brooks TM, et al. (2010)

The impact of conservation on the status of the world’s vertebrates. Science 330:

1503–1509.

An Assessment of Methods for Correcting Sampling Bias in SDM

PLOS ONE | www.plosone.org 13 May 2014 | Volume 9 | Issue 5 | e97122