Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf ·...

14
Estimating aboveground carbon in a catchment of the Siberian forest tundra: Combining satellite imagery and eld inventory Hans Fuchs a, , Paul Magdon a , Christoph Kleinn a , Heiner Flessa b a Chair of Forest Inventory and Remote Sensing, Burckhardt Institute, Georg-August-Universität Göttingen, Büsgenweg 5, 37077 Göttingen, Germany b Soil Science of Temperate and Boreal Ecosystems, Büsgen Institute, Georg-August-Universität Göttingen, Büsgenweg 5, 37077 Göttingen, Germany abstract article info Article history: Received 8 January 2008 Received in revised form 11 July 2008 Accepted 13 July 2008 Keywords: ASTER Carbon estimation Feature selection Forest inventory Forest tundra Global change k-NN regionalization Multiple linear regression Quickbird Siberia This study was part of an interdisciplinary research project on soil carbon and phytomass dynamics of boreal and arctic permafrost landscapes. The 45 ha study area was a catchment located in the forest tundra in northern Siberia, approximately 100 km north of the Arctic Circle. The objective of this study was to estimate aboveground carbon (AGC) and assess and model its spatial variability. We combined multi-spectral high resolution remote sensing imagery and sample based eld inventory data by means of the k-nearest neighbor (k-NN) technique and linear regression. Field data was collected by stratied systematic sampling in August 2006 with a total sample size of n = 31 circular nested sample plots of 154 m 2 for trees and shrubs and 1 m 2 for ground vegetation. Destructive biomass samples were taken on a sub-sample for fresh weight and moisture content. Species-specic allometric biomass models were constructed to predict dry biomass from diameter at breast height (dbh) for trees and from elliptic projection areas for shrubs. Quickbird data (standard imagery product), acquired shortly before the eld campaign and archived ASTER data (Level-1B product) of 2001 were geo-referenced, converted to calibrated radiances at sensor and used as carrier data. Spectral information of the pixels which were located in the inventory plots were extracted and analyzed as reference set. Stepwise multiple linear regression was applied to identify suitable predictors from the set of variables of the original satellite bands, vegetation indices and texture metrics. To produce thematic carbon maps, carbon values were predicted for all pixels of the investigated satellite scenes. For this prediction, we compared the kNN distance-weighted classier and multiple linear regression with respect to their predictions. The estimated mean value of aboveground carbon from stratied sampling in the eld is 15.3 t/ha (standard error SE = 1.50 t/ha, SE% = 9.8%). Zonal prediction from the k-NN method for the Quickbird image as carrier is 14.7 t/ha with a root mean square error RMSE =6.42 t/ha, RMSE r = 44%) resulting from leave-one-out cross- validation. The k-NN-approach allows mapping and analysis of the spatial variability of AGC. The results show high spatial variability with AGC predictions ranging from 4.3 t/ha to 28.8 t/ha, reecting the highly heterogeneous conditions in those permafrost-inuenced landscapes. The means and totals of linear regression and k-NN predictions revealed only small differences but some regional distinctions were recognized in the maps. © 2008 Elsevier Inc. All rights reserved. 1. Introduction Forest ecosystems are an important part of the global carbon cycle because they store a large part of the total terrestrial organic carbon and exchange CO 2 with the atmosphere. In permafrost regions carbon storage and net-exchange of CO 2 may change considerably due to the predicted temperature increase of 6 to 7 °C within the next 100 years (IPCC, 2007). Permafrost thaw and the related deepening of the active layer are expected to increase CO 2 emission from organic matter mineralization in the soil (Hobbie et al., 2000) but there is potential increase of carbon storage in plant biomass (White et al., 2000). There are considerable uncertainties and knowledge gaps regard- ing the inuence of current and future permafrost dynamics on the processes of carbon emission and carbon sequestration in northern ecosystems. Monitoring the vegetation dynamics of the circumpolar boreal forest (northern taiga) and arctic tundra region is important for understanding the consequences of climate-driven changes in these areas (Ranson et al., 2004). The transitional region of the forest tundra is expected to be sensitive to even small changes in environmental variables and thus it can offer early insight into potential changes in vegetation and aboveground carbon stocks driven by climate change. Remote Sensing of Environment 113 (2009) 518531 Corresponding author. Tel.: +49 55139 3469; fax: +49 551 39 9787. E-mail addresses: [email protected] (H. Fuchs), [email protected] (P. Magdon), [email protected] (C. Kleinn), h[email protected] (H. Flessa). 0034-4257/$ see front matter © 2008 Elsevier Inc. All rights reserved. doi:10.1016/j.rse.2008.07.017 Contents lists available at ScienceDirect Remote Sensing of Environment journal homepage: www.elsevier.com/locate/rse

Transcript of Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf ·...

Page 1: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

Remote Sensing of Environment 113 (2009) 518–531

Contents lists available at ScienceDirect

Remote Sensing of Environment

j ourna l homepage: www.e lsev ie r.com/ locate / rse

Estimating aboveground carbon in a catchment of the Siberian forest tundra:Combining satellite imagery and field inventory

Hans Fuchs a,⁎, Paul Magdon a, Christoph Kleinn a, Heiner Flessa b

a Chair of Forest Inventory and Remote Sensing, Burckhardt Institute, Georg-August-Universität Göttingen, Büsgenweg 5, 37077 Göttingen, Germanyb Soil Science of Temperate and Boreal Ecosystems, Büsgen Institute, Georg-August-Universität Göttingen, Büsgenweg 5, 37077 Göttingen, Germany

⁎ Corresponding author. Tel.: +49 551 39 3469; fax: +E-mail addresses: [email protected] (H. Fuchs), pmag

[email protected] (C. Kleinn), [email protected] (H. Fless

0034-4257/$ – see front matter © 2008 Elsevier Inc. Aldoi:10.1016/j.rse.2008.07.017

a b s t r a c t

a r t i c l e i n f o

Article history:

This study was part of an in Received 8 January 2008Received in revised form 11 July 2008Accepted 13 July 2008

Keywords:ASTERCarbon estimationFeature selectionForest inventoryForest tundraGlobal changek-NN regionalizationMultiple linear regressionQuickbirdSiberia

terdisciplinary research project on soil carbon and phytomass dynamics of borealand arctic permafrost landscapes. The 45 ha study area was a catchment located in the forest tundra innorthern Siberia, approximately 100 km north of the Arctic Circle.The objective of this study was to estimate aboveground carbon (AGC) and assess and model its spatialvariability. We combined multi-spectral high resolution remote sensing imagery and sample based fieldinventory data by means of the k-nearest neighbor (k-NN) technique and linear regression.Field data was collected by stratified systematic sampling in August 2006 with a total sample size of n=31circular nested sample plots of 154 m2 for trees and shrubs and 1 m2 for ground vegetation. Destructivebiomass samples were taken on a sub-sample for fresh weight and moisture content. Species-specificallometric biomass models were constructed to predict dry biomass from diameter at breast height (dbh) fortrees and from elliptic projection areas for shrubs.Quickbird data (standard imagery product), acquired shortly before the field campaign and archived ASTERdata (Level-1B product) of 2001 were geo-referenced, converted to calibrated radiances at sensor and used ascarrier data. Spectral information of the pixels which were located in the inventory plots were extracted andanalyzed as reference set. Stepwise multiple linear regressionwas applied to identify suitable predictors fromthe set of variables of the original satellite bands, vegetation indices and texture metrics. To produce thematiccarbon maps, carbon values were predicted for all pixels of the investigated satellite scenes. For thisprediction, we compared the kNN distance-weighted classifier and multiple linear regression with respect totheir predictions.The estimated mean value of aboveground carbon from stratified sampling in the field is 15.3 t/ha (standarderror SE=1.50 t/ha, SE%=9.8%). Zonal prediction from the k-NN method for the Quickbird image as carrier is14.7 t/ha with a root mean square error RMSE=6.42 t/ha, RMSEr=44%) resulting from leave-one-out cross-validation. The k-NN-approach allows mapping and analysis of the spatial variability of AGC. The resultsshow high spatial variability with AGC predictions ranging from 4.3 t/ha to 28.8 t/ha, reflecting the highlyheterogeneous conditions in those permafrost-influenced landscapes. The means and totals of linearregression and k-NN predictions revealed only small differences but some regional distinctions wererecognized in the maps.

© 2008 Elsevier Inc. All rights reserved.

1. Introduction

Forest ecosystems are an important part of the global carbon cyclebecause they store a large part of the total terrestrial organic carbonand exchange CO2 with the atmosphere. In permafrost regions carbonstorage and net-exchange of CO2 may change considerably due to thepredicted temperature increase of 6 to 7 °C within the next 100 years(IPCC, 2007). Permafrost thaw and the related deepening of the active

49 551 39 [email protected] (P. Magdon),a).

l rights reserved.

layer are expected to increase CO2 emission from organic mattermineralization in the soil (Hobbie et al., 2000) but there is potentialincrease of carbon storage in plant biomass (White et al., 2000).

There are considerable uncertainties and knowledge gaps regard-ing the influence of current and future permafrost dynamics on theprocesses of carbon emission and carbon sequestration in northernecosystems. Monitoring the vegetation dynamics of the circumpolarboreal forest (northern taiga) and arctic tundra region is important forunderstanding the consequences of climate-driven changes in theseareas (Ranson et al., 2004). The transitional region of the forest tundrais expected to be sensitive to even small changes in environmentalvariables and thus it can offer early insight into potential changes invegetation and aboveground carbon stocks driven by climate change.

Page 2: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

519H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

1.1. Estimation of aboveground carbon (AGC)

Information about the amount and distribution of abovegroundphytomass is a prerequisite for the evaluation of the role of carbonexchange rates and for realistic predictions of future climate changes.Vaganov et al. (2006) report that in 2006 more than 20 internationalresearch projects dealt with this issue in Siberia. There are twoprimary methods for the assessing of phytomass: (1). field mea-surements, and (2) indirect assessments using remote sensingtechniques.

Camill and Clark (1998) showed that local factors such as slope,aspect and successional patterns have a stronger influence on borealpermafrost peatland dynamics than the regional mean annualtemperature. Therefore, studies describing processes on a local scaleare of great importance for calibration and validation of ecosystemmodels.1. Field measurements: Aboveground carbon stock (AGC) cannot be

measured directly but must be derived from aboveground biomass(AGB). For ecosystem level biomass assessments there are numer-ous techniques two of which have been applied in the context ofboreal and arctic regions:(a.) Biomass expansion factors (BEF) are derived from the relation-

ship between easy to measure variables such as stand volumeand AGB. They are widely used (Alexeyev et al., 1995; Alexeyev& Birdsey, 1998; Houghton et al., 2007; Isaev et al., 1995;Monserud et al., 1996; Nilsson et al., 2000). Alexeyev et al.(2000) distinguishes the geobotanical method and the forestinventory method further.• The geobotanical method uses a stratification based onvegetation and soil criteria. Data from a network of fieldplots serve for the assessment of the mean stock per stratumand the extrapolation to the strata. Selection of field plotsdoes not necessarily follow probability sampling but is basedon geobotanical criteria. For example, this method was usedby Bazilevich (1993) to describe the productivity of thevegetation in Russia.

• The forest inventory method is based on two data sources:probability sampled forest inventory plots and experimentalplots where AGB of different life forms is estimated. The rela-tionship between forest inventory data and measured carbonstocks is used to derive expansion factors. Often growing stockexpressed in terms of tree volume is used as an inventoryvariable (Turner et al., 1995). Alexeyev et al. (1994) for exampleconducted a large study in Russia with 2290 sample plots.

(b.) Allometric models for estimation of AGB are also very common(Camill et al., 2001; Fehrmann, 2006; Kajimoto et al., 2006;Montagu et al., 2005; Ohmann et al., 1976; Jia & Akiyama,2005 ; Wirth et al., 1999). Biomass is predicted by these modelson a per plot or per individual plant basis. Therefore, usually,species or site-specific models are developed by fittingparametric regression models to the relationship betweenAGB and easy to measure plant variables such as diameter atbreast height, dbh.

2. Remote sensing applications: High latitude permafrost regions aredifficult to access. Therefore, since the 1970s, remote sensing datahave been used for monitoring these areas in addition to fieldmeasurements (Lu, 2006). It is important to note, however, thatany remote sensing approach for biomass estimation also dependson field measurements.

Ahern et al. (2000) summarize the capability of optical remotesensing sensors for monitoring carbon cycles of boreal forests andidentify the following features that can be assessed:

• Forest cover and biomass spatial distribution (when supported byfield observations),

• Patterns of disturbance (fire, insect, land clearing),• Pattern and rate of regrowth after disturbance,• Seasonal variations of the net primary production.

The importance of remote sensing for AGB estimation hasincreased considerably during the last years due to better availabilityand coverage and finer geometric resolution of remote sensingdata (Lu et. al., 2002; Muukkonen & Heiskanen, 2005; Nelson et al.,1988; Wulder et al., 2008; Zheng et al., 2004). The application ofremote sensing has the distinct advantage that large areas can bemonitored (Hese et al., 2005) and spatial variability much bettercharacterized than when using field inventory data exclusively—always assuming that imagery with adequate spatial, spectral andradiometric characteristics is available.

There are numerous examples of approaches to estimate AGB fromsatellite data (Lu, 2006). Regression analysis is the most commonmodeling approach (Dong et al., 2003; Lu, 2005; Muukkonen& Heiskanen, 2005; Rahman et al., 2005; Zheng et al., 2004) but anincreasing number of studies use the nonparametric k-nearestneighbor technique (k-NN) for regionalization (Cabaravdic, 2007;Finley et al., 2006; Tomppo et al., 2002; Tuominen & Pekkarinen,2005). The basic idea is to estimate the unknown value of an attributeof an object based on its similarity to objects with known values of theattribute. k-NN methods are easy to implement in computer softwarein principle, although the computational demands are high and theformulation of error components is still a challenge (McRoberts et al.,2007; Stage & Crookston, 2007).

Most applications of k-NN for remote sensing data use mediumresolution imagery such as Landsat images (Finley et al., 2006; Franco-Lopez et al., 2001; Katila & Tomppo, 2001; Mäkelä & Pekkarinen,2001). The availability, low cost and large swath width of Landsatsatellite sensors make them more appropriate for nationwide inven-tories. Only few studies have used fine resolution imagery such asQuickbird or aerial photographs (Muinonen et al., 2001; Tuominen &Pekkarinen, 2005). Since digital aerial photographs are widelyavailable it is interesting to contrast medium resolution ASTERsatellite images (comparable to Landsat) with high resolution Quick-bird satellite images to improve prediction on a regional and localscale.

1.2. Objectives

In the framework of an interdisciplinary research project, soilcarbon stocks and fluxes were analyzed in a relatively small study areain the forest tundra zone of central Siberia. There are three objectivesof this particular study. The primary objective is to estimate andregionalize aboveground dry biomass and carbon stocks in a smallcatchment where no prior forest inventory data are available usingfield observations and remote sensing.

The second objective was to apply the k-NN-technique for estimationand regionalization of AGC as well as to quantify and model ecosystemfluxesandbudgets. For the latter, AGCwas tobemappedwithafinespatialresolution. The prediction accuracy of the k-NN technique was thencompared to the accuracy of multiple regression techniques and to theaccuracy of field-based stratified estimates.

The third objective was to compare the utility of data obtainedfrommedium and high resolution satellite sensors for AGC estimation.

2. Materials and methods

2.1. Study site

The 45 ha study site is the catchment of Little GrawijkaCreek in Central Siberia near the town of Igarka at 67°48′ latitudeand 86°43′ longitude on the east shore of lower Yenissej River,approximately 100 km north of the Arctic Circle. It belongs to the

Page 3: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

Fig.1.Upper: Location of study site Little Grawijka Catchment near the town Igarka in the transition zone between tundra and taiga (ESRI, 2006). Lower: Pre-stratification of study siteinto the southern catchment area consisting mainly of north exposed and flat areas (Stratum 3), the south to south east exposed areas in the northwest of the catchment (Stratum 1),and the open bog areas (Stratum 2).

520 H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

administrative region of Krasnoyarsk Krai of the Russian Federation(Fig. 1). The catchment has a variety of landscape units and vegetationassociations in a distinctive spatial pattern. The eastern part isdominated by raised bogs, thermokarst ponds and little streams,which flow into Little Grawijka Creek carving a small valley on its wayto the Yenissej River. The topography is flat with a height range of just11 m.

South and south-east exposed slopes where the active layer depthis greatest are covered by dense forest stands which are dominated byBetula pendula, whereas forest stands on the north exposed and flatareas are sparser and dominated by Picea obovata. Growing stock of

the forest is low with a mean dbh of 6.5 cm and mean basal area of0.1 m2/ha (results from the field survey). On raised bogs, vegetation iscomposed of dwarf shrubs, herbs, moss and lichen. A detaileddescription of permafrost distribution, soil properties and vegetationwithin this catchment was recently published by Rodionov et al.(2007).

2.2. Structure of the carbon mapping process

Fig. 2 gives an overview of the methodological steps of AGC mapproduction, where field observations of local biomass are combined

Page 4: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

Fig. 2. Workflow of carbon mapping.

521H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

with carrier data from remote sensing imagery. The focus is on themethodological components of remote sensing image processing.Field sampling and estimation of dry biomass of various biomasscompartments on the field plots are briefly summarized in the nextsection.

2.3. Inventory data

Forest inventory data were not available for our study site.So a field survey was conducted in August 2006. An existingsystematic sampling grid (n=168, grid spacing=50 m, (Rodionovet al., 2007)) and 7 additional experimental plots were assigned to L=3strata according to expected AGC. The 7 additional plots are referred toas intensive measurement plots and were established to cover typicallandscape units in the study area (Flessa et al., 2006). On these plots,soil properties were determined and net-exchange of CH4 wasrepeatedly measured. Within each stratum n=8 sample locationswere selected randomly from the base sample grid (Fig. 1). Theinterpretation of AGC estimates needs to be made with cautionbecause we used a mix of two different plot selection strategies.

Fixed area circular sample plots with radius r=7 m wereestablished at all sample points. Geolocation was done with a singlehandheld GPS receiver using Standard Positioning Service (SPS)with typical horizontal accuracy of 10 m at (95% confidence interval).The aboveground fresh biomass was estimated from species specificallometric models that were derived from biomass sub-samplesof sample trees. Moisture content was measured with the moistureanalyzer MA30 (Sartorius Inc.) and allowed conversion of freshbiomass to dry biomass. Dry biomass was then transformed to carbonusing a simple conversion factor of 0.5 as described by Alexeyev andBirdsey (1998) for most of the Siberian plant communities.

2.4. Availability of remotely sensed data

We limited our study to passive electro-optical sensors in thespectrum of near infrared and infrared with high and medium spatial

resolution (that is 30 m ground sampling distance (GSD) or smaller).Limitations for the data search were cloud cover, snow, ice and theshort growing season from June to September.

We also evaluated possibilities for acquiring actual satellite images.Forestry in Russia has a long tradition of using remote sensing. But useof the Russian high resolution imagery is restricted, and such imagerywas not available for this project. We aimed to acquire an actualsatellite image for the vegetation period close to our field campaign. Anew cloudless Quickbird image was taken on 16 July 2006 andobtained as a standard imagery product from Eurimage Inc. with 1panchromatic, GSD=0,6 m, and 4 multispectral bands, GSD=2.4 m(view angle 6.2°, sun elevation/azimuth 44°/179.4°).

We completed our data acquisition using archived ASTER imagedata: Level-1B product, 3 bands in visible and near infrared spectrum(VNIR), GSD=15 m, 6 bands in shortwave infrared spectrum (SWIR),GSD=30 m, 5 bands in thermal infrared spectrum (TIR), GSD=90 m,recorded on 10 September 2001 with a cloud cover of 2% (viewangle 2.7°, sun elevation/azimuth 27.5°/180.7°) from the U.S. Geolo-gical Service.

2.5. Image preprocessing

The image data were converted to 32bit floating point numbers astop-of-atmosphere radiances with standard procedures described inKrause (2003) for Quickbird and in Abrams et al. (2003) for ASTER.Neither atmospheric correction nor topographic normalization wasused because data for fitting of aerosol and atmosphere models attime of image acquisition were not available and relief is low withinthe study area.

Quickbird bands showed high geometric quality compared to fieldbased GPS measurements and were resampled to 1 m pixel size.The horizontal error of the standard imagery product Quickbird isreported with 14 m (DigitalGlobe, 2008). Our GPS measured plotlocations show a root mean square error (RMSEXY) of 5.31 m, derivedfrom n=19 tree group positions using the standard imagery product asreference.

Page 5: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

Table 1Measures for exterior post evaluation of predictions (n=number of observations;xi=observed values for entity i; x̂i=estimated value for entity i;, x―=observed meanvalue, sd=standard deviation)

Bias BIAS = 1n ∑

n

n = 1x̂i−xi

� �

Root Mean Square Error RMSE =

ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi∑n

n = 1xi− x̂i� �2

n

s=

ffiffiffiffiffiffiffiffiffiffiMSE

p

Relative Bias BIASr = BIASx d100k

Relative RMSE RMSEr = RMSEx d100k

Coefficient of variation COVr =sd̂

xx d100k

Fig. 3. Extracted average valueswithin inventory plots for selectedvariables. a) Plot design,b) Overlay on 1 m-resolution Quickbird band (left) and overlay on 15 m-resolution ASTERband (right).

522 H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

We used image to image matching and co-registered the ASTERVNIR bands with the Quickbird scene (RMSEXY=0.28 m). Due todifferences in spatial resolution we used the ASTER VNIR bands as areference for co-registration of ASTER SWIR bands (RMSEXY=0.11.).Nearest neighbor resampling resulted in a pixel size of 15 m for ASTERbands. The ASTER TIR bands were excluded from analysis because GSDwas larger than 30 m.

Additional sets of indices (34 for Quickbird, 27 for ASTER) werederived from the original satellite bands and included vegetationindices, principal components, tasseled cap transforms, texturemeasures (Table 2). Together with the original bands, these imagetransforms served as input for feature selection.

The reference data set was extracted from the images as theaverage of all pixel values (per band)with centers in the location of thecircular inventory plots (Fig. 3). By averaging we aimed at decreasingthe effect of mis-registration errors that originate from imperfectmatching of imagery to the sample plots (Mäkelä & Pekkarinen, 2001;Muukkonen & Heiskanen, 2005).

Thermokarst ponds composed of open water bodies and swim-ming vegetation layers were masked and excluded from the target setusing a semi-automatic segmentation approach. The segmentationwas based on manually located seed points in the Gram–Schmidtenhanced Quickbird image with a seed growing algorithm utilizingsimilarity criteria (spatial distance, spectral Euclidean distance) (LeicaGeosystems Geospatial Imaging, 2006).

2.6. Feature extraction and linear regression modeling

Subsets of variables as input for k-NN and linear regressionpredictionwere selected to keep themodels less complex and to reducecomputational efforts. As variable selection strategy we used stepwiseselection and a linear model AGC=β0+β1x1+β2x2+⋯+βkxk+ε, definingα=0.15 as threshold for adding or removing a predictor variable. Thepartial F-statisticswere used asmodel selection criteria.We checked the

resulting subsetmodels for correct functional specification and analyzedoutliers and residuals using correlation analysis. Model evaluationcriteria were the coefficient of determination R2, the adjusted R2, thesignificance of the regression and the root residual mean square errorffiffiffiffiffiffiffiffiffiffiffiffiffi

MSResp�

, the residual standard error of regression). Because negativecarbon stock is not plausible,we set negative predictions from the linearregression model to zero thus enforcing a non-negative model.

2.7. k-NN prediction

We used the weighted k-NN algorithm implemented in the kknnpackage Version 1.0.3 (Schliep & Hechenbichler, 2006) of the OpenSource statistical software R (R Development Core Team, 2007). Theset of points for which both the independent and dependent variablesare known is referred to as the reference set, whereas the set of pointsfor which only independent variables are known is referred to as thetarget set (Finley et al., 2006). As a first step, the image variables of thereference set were standardized by dividing the variable values bytheir standard deviation. A triangular kernel function was defined asthe weighting function for variable distances (Hechenbichler &Schliep, 2004):

w ið Þ = 1−jdjð ÞdI jdjV1ð Þwhere

d distance;I(.) indicator function: if condition in brackets is true=1,

otherwise=0.

This kernel function needs a window width so that the valuesbecome zero at a certain distance from the maximum value. In thekknn algorithm, the Euclidean distances are standardized by dividingby the distance of the closest neighbor (k+1) not used to calculate theprediction. These standardized distances always take values withinthe interval [0,1] (Hechenbichler & Schliep, 2004).

Calculations for the ASTER data were done on a standard PC,while Quickbird imagery was computed using a 64 bit AMD-Opteroncluster. The evaluation approach for the choice of k and for comparingpredictions to predictions from linear regression was the leave-one-out (LOO) cross-validation. Statistical measures for the exterior postevaluation of accuracy (Table 1) were the absolute and relative rootmean square error (RMSE and RMSEr), the absolute and relative bias(BIAS and BIASr) (Muukkonen & Heiskanen, 2005), the coefficientof variation of predicted values (COV) and the Pearson correlationcoefficient r between observed and predicted values.

3. Results

3.1. Feature selection

We compared the performance of k-NN and linear regression usingonly the original bands as is often done in other studies (Gjertsen,2007; Katila & Tomppo, 2001; Kim & Tomppo, 2006; Mäkelä &

Page 6: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

Table 2Pearson correlation coefficients r of field observed AGC and satellite image variables, n=30, abbreviations are defined in Table 7 of Appendix A

Quickbird ASTER

Variable Correlation Variable Correlation Variable Correlation Variable Correlation

r p r p r p r p

Original bands B1 0.58 0.0008 VNIR1 −0.10 0.6174 SWIR3 −0.57 0.0010B2 −0.55 0.0018 VNIR2 −0.32 0.0843 SWIR4 −0.53 0.0023B3 −0.58 0.0008 VNIR3 −0.37 0.0419 SWIR5 −0.46 0.0099B4 −0.22 0.2345 SWIR1 0.57 0.0010 SWIR6 −0.55 0.0015PAN −0.37 0.0417 SWIR2 −0.52 0.0031

Tasseled Cap, PCA PC1 −0.26 0.1720 TC1 −0.42 0.0221 VPC1 −0.36 0.0511 VSWIRPC2 0.07 0.7287PC2 0.38 0.0401 TC2 −0.11 0.5764 VPC2 0.08 0.6871 VSWIRPC3 −0.54 0.0021PC3 −0.50 0.0048 TC3 −0.05 0.7740 VPC3 −0.45 0.0118 VSWIRPC4 0.20 0.2935PC4 −0.26 0.1587 TC4 0.30 0.1033 VSWIRPC1 −0.37 0.0444

Vegetation-indices NDVI 0.21 0.2583 ARVI 0.50 0.0051 NDVI −0.14 0.4544 TNDVI −0.14 0.4530SR 0.15 0.4164 SARVI −0.25 0.1814 SR −0.14 0.4587 MSI −0.44 0.0153SQRT_SR 0.17 0.3582 MSARVI −0.23 0.2305 SQRT_SR −0.14 0.4575 MIDIR 0.27 0.1453SAVI 0.21 0.2629 TVI −0.01 0.9678 SAVI −0.06 0.7680EVI 0.22 0.2515 NDWI −0.10 0.6127 II2 −0.43 0.0164

Texture measures TEXVAR3 0.11 0.5463 TEXVAR19 0.29 0.1207 TEXCON3 −0.33 0.0787 TEXCON7 −0.27 0.1544TEXVAR5 0.17 0.3622 TEXVAR21 0.29 0.1257 TEXENT3 −0.05 0.7731 TEXENT7 −0.27 0.1535TEXVAR7 0.17 0.3600 TEXVAR23 0.31 0.1004 TEXSAM3 0.03 0.8641 TEXSAM7 0.25 0.1857TEXVAR9 0.25 0.1786 TEXVAR25 0.33 0.0757 TEXCORR3 −0.25 0.1750 TEXCORR7 0.21 0.2653TEXVAR11 0.26 0.1619 TEXCON25 0.68 b .0001 TEXCON5 −0.36 0.0497TEXVAR13 0.25 0.1779 TEXENT25 0.59 0.0006 TEXENT5 −0.28 0.1340TEXVAR15 0.27 0.1569 TEXSAM25 −0.52 0.0035 TEXSAM5 0.25 0.1856TEXVAR17 0.28 0.1392 TEXCORR25 −0.55 0.0016 TEXCORR5 −0.08 0.6841

523H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

Pekkarinen, 2001; Mukkonen & Heiskanen, 2005, 2006) with featureselection resulting from stepwise linear regression using imagetransforms and original bands. To get an overview of the relevanceof the variables we explored the Pearson correlation coefficients rbetween AGC and the analyzed satellite image variables, which aregiven in Table 2. In general, the linear correlations of these indices tofield observed carbon are low while multicollinearity between theimage variables is high. Original bands of both Quickbird and ASTERare negatively correlated to carbon. Only a few vegetation indices aresignificantly correlated to total carbon stock.

The texture measure “variance” (TEXVAR) of the original Quickbirdpanchromatic band showed increasing coefficients for increasingkernel sizes 3×3 to 25×25. The texture measure “contrast” based onthe gray-level co-occurrence matrix (GLCM) (Haralick et al., 1973) wasone of the best variables for Quickbird and ASTER. It can be interpretedas a measure for gray-level differences between neighboring pixelsin a spatial neighborhood of 15×15 m for Quickbird and 75×75 mfor ASTER. Finally, subsets were selected from the image transformsand original bands (Table 2) by stepwise linear regression (Table 3).

3.2. Linear models

We built linear models for Quickbird and ASTER based on theselected variables (Table 3).

For both ASTER and Quickbird the first order interactions betweenthe selected variables were checked and found not to be significant.The goodness of fit of the selected models was examinedwith analysisof scatter diagrams and residual graphs (Fig. 4). The residual graphs for

Table 3Results of stepwise linear regression

Quickbird

Variable Estimate Std. Error p(N|t|) Partial R2

Intercept 59.48 21.79 0.011TEXCON25 2.26 0.64 0.002 0.466NDWI 192.66 80.74 0.025 0.090MSARVI −1.72 0.66 0.014 0.036Adj. R2 0.544ffiffiffiffiffiffiffiffiffiffiffiffiffiMSRes

p5.733

Quickbird and ASTER (Fig. 4a and c) both show increasing residualswith increasing carbon values, a phenomenon which is well knownfor forestry and biological response variables and affects analysesinvolving variances (precision, confidence intervals). However, in thepresent case, logarithmic transformations of dependent and indepen-dent variables did not stabilize the variances but only increased thestandard error of residuals. Comparison of predicted versus observedvalues reveals a clearly curved band pattern for ASTER (Fig. 4d) and aslightly curved pattern for Quickbird (Fig. 4b) raises the question ofthe suitability of the selected linear model.

3.3. k-NN approach

The performance of k-NN was evaluated using two subsets ofvariables. We compared predictions using image transforms andfeature selection by stepwise linear regression (referred to as Quick-bird1 and ASTER1) to predictions based on the original bands only(referred to as Quickbird2 and ASTER2, see Table 4).

The underlying assumption of the k-NN technique, that the totalrange and variation of the variables in the target set are also pre-sent in the reference set, is sensitive to violations. Thus, k-NN pre-diction extrapolates poorly beyond the ranges of the reference data(McRoberts et al., 2007). Therefore we compared the distributionof pixel values for the reference set with that for the target set. Aswe extracted average values within the inventory circles from theQuickbird image, we had to adapt the scale of the target set to thereference set, using a circular mean filter with the radius of theinventory plots. The histograms of the distribution of standardized

ASTER

Variable Estimate Std. Error p(N|t|) Partial R2

Intercept −6.89 23.95 0.776SWIR3 −46.20 11.05 0.000 0.326VSWIRPC2 5.06 1.89 0.013 0.079TEXCON5 −0.26 0.11 0.027 0.104Adj. R2 0.452ffiffiffiffiffiffiffiffiffiffiffiffiffiMSRes

p6.286

Page 7: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

Fig. 4. Graphs of residual and predicted versus observed values for Quickbird (a, b) and ASTER (c, d) linear regression models.

524 H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

variables in Fig. 5 show a similar range of target and reference set, butit is also illustrated that some extreme values of the target data set arenot completely covered by the reference data set.

The comparison of the different subsets of image variables is basedon the LOO cross-validation criterion (see Table 1). RMSEr of allQuickbird and ASTER subsets show a typical decrease with increasingk (Fig. 6). Minimum RMSEr was found for Quickbird 1 at k=10.The relative bias stabilizes at a level of about +8% for all subsetsindicating a systematic overestimation. BIASr is lowest for ASTER1 andQuickbird2 at k=2. For Quickbird1 the relative bias is approximatelyconstant over all observed k.

Smaller k values preserve range and variance better (see coefficientof variation and range in Fig. 6) but result in greater uncertainty ofpredictions as the RMSEr is higher. Because we wanted to maintainprecision while conserving most of the variability of predicted carbonwe used k=5 neighbors as a compromise (compare also Fehrmann,2006; Franco-Lopez et al., 2001; Katila & Tomppo, 2001; McRobertset al., 2002).

Table 4Subset definition for k-NNprediction. Abbreviations are explained inTable 7 of AppendixA

Quickbird ASTER

Set 1 TEXCON25, MSARVI, NDWI SWIR3, VSWIRPC2, TEXCON 5Set 2 B1, B2, B3, B4, PAN VNIR 1,2,3, SWIR 1, 2, 3, 4, 5, 6

For k=5 all quality criteria, except BIASr led to selection of the subsets1 for Quickbird and for ASTER. Comparing both sensors, ASTER1 shows alower BIASr and preserves the range of the reference set better thanQuickbird1, whereas Quickbird1 has lowest RMSEr.

Fig. 7 illustrates the superiority of the indices (subset 1, derivedby image transforms and feature selection) over the original bands(subset 2) for prediction: the correlation coefficients of observed andpredicted values are considerably higher for subsets 1 for both ASTERand Quickbird. Highest correlation was found here for Quickbird1 andk=10.

The residual graphs for the selected subsets 1 (Fig. 8) illustrate that thevarianceof the residuals increaseswith increasingvalues of predicted totalcarbon. The predicted versus observed values give an indication of a clearbut weak relationship for both ASTER and Quickbird.

3.4. Comparison between linear regression, k-NN prediction and stratifiedsampling

Fig. 9 illustrates the results of linear regressionprediction and k-NNprediction as AGC maps with a spatial resolution of 1 m for Quickbirdand 15 m for ASTER based on the same reference set. In general, k-NNand linear regression show similar spatial distributions of carbon. Bogareas are clearly identified with small predicted values for AGC.

A detailed look at the carbon maps, however, reveals differences ofpredicted AGC between the k-NN and the linear regression approach:

Page 8: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

Fig. 5. Comparison of range and distribution of target and reference data set for selected standardized (x =0, sd=1) image bands.

525H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

the k-NN algorithm has a sharpening effect and emphasizes edgeswhereas linear regression predictions are smoother in the spatialdomain. The minimum and maximum of carbon range are betterrepresented by k-NN compared to field based inventory results. Linearregression considerably over- or underestimates carbon near extremevalues. This is evident for regression ASTER model 2 where negativecarbon values were set to zero. The range of carbon for regressionprediction is larger than that for k-NN prediction.

Quickbird k-NN prediction gives the center of predicted carbondensity on flat and north-east exposed slopes. For Quickbird linearregression prediction high carbon density is located along the creekand on south-west exposed slopes that are associated with deeperactive layers or lack of permafrost. This is in agreement with our fieldobservations. Investigation of the relation between soil and terrainvariables, and carbon spatial distribution is ongoing research.

Table 5 gives an overview of the LOO evaluation of Quickbirdand ASTER subsets. Comparing BIASr we conclude that the systematicerrors are larger for the k-NN predictions than for linear regressionmodels. We find a systematic overestimation for all k-NN estimates(positive BIAS) whereas BIAS of linear regression models is small.There is clear evidence for an improvement as a result of feature

selection instead of subsets with the original bands: RMSEr declines by11.9% using Quickbird1 and 5% using ASTER1 instead of the subsetswith the original bands. This underlines the importance and potentialbenefits of a feature selection step prior to k-NN or linear regressionprediction.

We compared average and total predicted carbon within the strataboundaries using data from stratified sampling of field plots. We usedthe estimator for stratified random sampling with proportionalallocation of samples as an approximation to the stratified systematicsampling that had been done. The total populationmeanwas estimatedas 15.3 AGC t/ha (SE=1.50 t/ha, SE%=9.8%). Stratum3 and 1 have similarestimated mean values, whereas Stratum 2 which covers most of thebog area holds only 10% of total estimated AGC in the catchment.

Predictions with Quickbird and ASTER as carrier data come closeto the total and strata means estimated from stratified sampling.The estimated total in stratum h, and estimated mean of stratum h arein all cases within the 95% confidence interval of stratified sampling.k-NN prediction of Quickbird comes closest to the estimates fromstratified systematic sampling with total average y =15.1 t/ha. TheQuickbird subset produced the highest mean carbon predictions instratum 1 (and this is close to the estimates from field sampling),

Page 9: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

Fig. 6. Evaluation criteria versus k=number of neighbors for subset selection using k-NN prediction; “….1”=subset with the selected variables (see Table 4), “….2”=subset with theoriginal bands only.

526 H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

ASTER models in stratum 3. Table 6 lists the stratum-wise results forthe field inventory based estimation and the ASTER and Quickbirdbased predictions.

Fig. 7. Correlation coefficients of predicted to observed carbon for ASTER and Quickbirdsubsets. For legend see Fig. 6.

4. Discussion

Combining data from field sampling and the satellite imagevariables we produced high resolution maps of above ground carbon,preservingmean and total carbon stock estimates while adding spatialvariability. The results of this case study are based on a relatively smallsample size and refer to a study area of 45 ha. The carbon predictionsfrom k-NN and linear regression are well in the range of values re-ported by other studies: Vederova et al. (2002) reported dry phy-tomass values of 25.7 t/ha and 36.9 t/ha for two test sides in the foresttundra (68°20′N, 87°′50′E) at the Yenissej River. Alexeyev & Birdsey(1998) reported an AGC of 15 t/ha for the ecoregion forest tundra.Nilsson et al. (2000) estimated in their full carbon account a AGC of23.9 t/ha in forest tundra. Schlesinger (1997) estimated a mean livingphytomass density (dry matter) of 8 t/ha in the tundra and 95 t/ha inthe boreal forest. The IIASA Forestry program (IASA FOR, 2007)published maps of the aboveground living phytomass density (drymatter) with a range of 8–23 t/ha at the location of the study areaLittle Grawijka Creek Catchment. Compared to other studies using k-NN for estimation of forest variables, in particular volume, (Franco-Lopez et al., 2001; Tuominen & Pekkarinen, 2005) our results exhibit arelatively low RMSEr of 44% for the best Quickbird subset.

The performance of image transforms such as vegetation indices,texture and tasseled cap transformswas compared against the originalimage bands. In this studywe found better results for selected variablesubsets. The improvement was of an order of magnitude of 5–12%

Page 10: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

Fig. 8. Residuals and predicted vs. observed values for Quickbird1 (above) and ASTER1 (below) for k-NN prediction with k=5.

527H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

RMSEr for the observed mean. In addition, considerable improve-ments in computational efficiency of nearest neighbor searches wereachieved; computational efficiency is still an issue for the applicationof the k-NN technique on high resolution satellite images and/orlarger geographical areas.

Stratified systematic sampling with circular field plots was foundto be adequate as a sampling strategy for reference data. The estimateof the population mean revealed a relative standard error below 10%.The stratified sampling produced sufficient reference data to beused for prediction using the satellite imagery, because the spectralrange of the target set was covered well and we were able to compareestimates for strata zones with the regionalization results of k-NNprediction and linear regressions. If image data are available beforethe field campaign we suggest an a-priori stratification based onselected image variables like texture measures of the Quickbird pan-chromatic band or ASTER SWIR bands to cover the whole range ofimage data in the reference set. This could reduce the necessity ofextrapolating outside the range of observed values for both linearregression and k-NN prediction and thus contribute to increasingprediction accuracy.

For error estimation we did not use a second independentreference data set, because establishing field plots and determiningthe fresh weight of the aboveground phytomass are very costly.Relatively small sample sizes are among the common challenges ofbiomass and carbon assessments.

Selecting optimal parameters and error estimation for the k-NNalgorithm are usually based only on the reference set using LOO cross-

validation. Error prediction is nearly unbiased but can be highlyvariable for small samples. (Efron & Tibshirani, 1997; Franco-Lopezet al., 2001).

Gray-level co-occurrence matrix (GLCM) measures for very highresolution images (Franklin et al., 2000, 2001; Hay et al., 1996;Puissant et al., 2005; Tuominen & Pekkarinen, 2005) and for Landsat(Lu, 2005) have been successfully included in our study for AGCprediction improving accuracy.

GLCM contrast is typically found to be among the most effectivedescriptors of image texture (Van der Sanden & Hoekman, 2005). Inthis study, the GLCM contrast based on a kernel of 25×25 pixels of thepanchromatic Quickbird band (0.60 m pixel size) outperformed allother variables in the feature selection process. The contrast at kernel5×5 of the first principal component of ASTER VNIR bands (15 m pixelsize) improved linear regression.

The panchromatic band of high resolution satellite imagesoften yields better results for forest biomass and for prediction ofstand characteristics than multispectral bands; this is in agreementwith our results. Franklin et al. (2001) found best results forseparability of age classes of Canadian coniferous forest stands onpanchromatic Ikonos image for GLCM Homogenity and window sizes15–25 m. Proisy et al. (2007) predicted mangrove biomass fromFourier-based textural ordination of Ikonos images. Leboeuf et al.(2007) classified shadows of panchromatic Quickbird band withsimple thresholding. They found close linear relationships betweenshadow fraction (which we assume to be strongly correlated totexture) and the aboveground biomass in boreal black spruce stands

Page 11: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

Fig. 9. Aboveground carbon maps of k-NN and linear regression prediction for Quickbird subset1 and ASTER subset1 of the study area Little Grawijka Creek Catchment.

528 H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

in the Canadian taiga. They preferred reference squares with sizesfrom 10 m to 30 m and pixel-based estimations of shadows.Greenberg et al. (2005) classified shadows of pansharpened IKONOSscenes and found correlations with DBH, stem number and crownarea.

The Quickbird panchromatic sensor has a spectral range of450 nm–900 nm wavelength (DigitalGlobe, 2008) in the spectralregion of visible including near infrared. More research in the field of

texture analysis of this spectral range for carbon estimation is neededfor determination of appropriate texture measures and movingwindow sizes.

By analyzing the AGCmaps provided by the different methods onecan see that k-NN has the effect of edge preserving and enhancement,while linear regression estimation provides a generalization in thespatial domain. In general, the maps of linear regression and k-NNexhibit strong similarities. This is expected and in agreement with

Page 12: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

Table 5Comparison of k-NN and linear regression prediction for Quickbird and ASTER subset1 (image transforms) and subset2 (original bands) based on LOO cross-validation of thereference set (n=30): r=Pearson correlation coefficient of observed and predicted values

MODEL RMSE RMSEr BIAS BIASr COV r

t/ha % t/ha % % –

k-NN Quickbird1 6.42 43.54 1.170 7.93 43.89 0.66Quickbird2 8.17 55.38 0.715 4.85 42.65 0.40ASTER1 7.92 53.70 0.498 3.38 39.81 0.42ASTER2 8.65 58.71 1.597 1.60 35.67 0.28Quickbird1 6.08 41.21 0.028 0.19 44.50 0.69

Linear regression Quickbird2 7.80 52.94 0.030 0.20 40.49 0.44ASTER1 6.51 44.17 0.077 0.52 42.06 0.63ASTER2 9.13 61.93 0.475 0.47 46.6 0.29

Subset definitions according to Table 4.

529H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

the findings of Labrecque et al. (2006). Studying the range of theAGC maps it is obvious that linear regression estimation performsworse with respect to extreme observations, among the reasonsbeing a misspecification of the multiple linear regression models atthe bounds of the range of observed values. The linear regressionmodel is not appropriate in these intervals. This is severe for theASTER model where negative carbon estimates would result, whichwe consistently set to zero. But still, the mean and total are preservedwell, which corresponds to the findings of Labrecque et al. (2006). Wedid not apply alternative methods of nonlinear regression or locallyweighted regression because our reference data set was consideredtoo small. The issue of specifying model adequacy in regressionanalysis is not an issue for the k-NN technique, although predictionsfor pixels in the target set that are not covered by the reference setmay be biased.

The results from LOO cross-validation and the AGC maps in-dicate a slight superiority of the linear models in our case study; forwhich variance estimators are available, while variance estimationfor k-NN is a challenge and ongoing research (e.g. McRoberts et al.,2007). Systematic errors (bias) of the k-NN method were higherthan those of linear regression. This is in agreement with Gjertsen(2007), McRoberts et al. (2002) and Trotter et al. (1997). Thecomputational effort for both methods can be reduced by usingfeature selection algorithms such as linear stepwise selection, butremains high when k-NN is applied for high resolution imagery likeQuickbird.

Table 6Estimates (stratified systematic field sampling) and predictions (k-NN and regression fromNh=population size of stratum h, a=size of stratum, Ah=area of stratum h, y h=estimatedstratum h; Min=minimum, Max=maximum of estimates

Stratified sampling nNh

Ah (ha)Syh― (t/ha)y―h (t/ha)τ h (tons)min / max (t/ha)

Quickbird k-NN y―h (t/ha)τ h (tons)min / max (t/ha)

Linear model y―h (t/ha)τ h (tons)min / max (t/ha)

ASTER k-NN y―h (t/ha)τ h (tons)min / max (t/ha)

Linear model y―h (t/ha)τ h (tons)min / max (t/ha)

5. Conclusions

1. The k-NN algorithm delivers reliable overall results as compared toestimates from stratified sampling. Although we used only a smallsample size, spatial variability is retained using high resolutionsatellite images. Quickbird appears more suitable than ASTER forecosystem studies at local scale.

2. Stratified systematic sampling was found adequate for fieldsampling of aboveground biomass. If image data are availableprior to field assessment, accuracy of both k-NN and linear modelprediction could be improved by a-priori stratification according toselected image variables in a way that the range of the target set isfully covered by the reference set.

3. Feature selection results in a considerable decrease of estimationerrors. It appears that texture analysis of panchromatic highresolution satellite imagery merits more attention. In this contextresearch into tree shadows and selection of texture measures forcarbon stock estimation is ongoing.

4. Comparison of linear regression and k-NN prediction showedadvantages for the regression approach as bias and RMSE weresmaller (except for the original bands of ASTER) and correlationcoefficients of all linear models were higher than those of thecorresponding k-NN imputations. Judging from a practical point ofview, linear regression needs less computational efforts and canbe performed on standard PCs. But linear regression estimationperforms worse than k-NNwhen it comes to extreme observations:

ASTER and Quickbird imagery) of AGC [t/ha], where nh=sample size in stratum h;mean in stratum h; τ h=estimated total in stratum h; syh estimated standard error in

Stratum Total

1 2 3

9 10 12 3180 34 58 17222.75 8.43 14.25 45.432.59 2.69 2.14 1.5016.45 8.73 17.57 15.30374.24 73.55 250.23 698.025.2 / 31.2 3.3 / 27.2 5 / 30.6 3.3 / 31.214.82 10.14 17.18 14.70337.23 84.53 243.99 665.754.3 / 28.8 4.2 / 28.2 4.4 / 28.4 4.2 / 28.816.86 10.93 16.45 15.64383.62 91.08 233.62 708.320.0 / 38.3 0.0 / 33.5 0.0 / 32.1 0.0 / 38.314.71 11.22 17.46 14.94333.89 92.40 249.06 675.354.0 / 26.1 4.0 / 25.9 5.3 / 26.5 4.0 / 26.514.63 10.55 16.30 14.41332.05 86.92 232.58 651.540.0 / 33.3 0.0 / 24.9 3.9 / 31.9 0.0 / 33.3

Page 13: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

530 H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

a general solution to the problem of negative prediction valuesfor biomass is still to be identified in particular when dealing withsmall sample sizes.

Acknowledgements

This researchwas partially supported by a grant from the DeutscheForschungsgemeinschaft (DFG project KL894/8-2). This support isgratefully acknowledged. Our thanks are due to the Sukachev Institutefor Forest Research, Krasnoyarsk, for efficient collaboration; inparticular, our thanks go to Dr Natalia Mikeyeva for her excellentsupport to field work and vegetation analyses.

We thank Mr. Otto Kuhlmann and Sartorius Inc., Göttingen, forproviding themoisture analyzerMA30 and Dr Frantisek Vilčko from ourresearch group for developing the software for that device. Sincerethanks do also go to four anonymous reviewers for their in-depth ob-servations and to the guest editorDr RonaldMcRoberts forhis continuedsupport which helped improve the manuscript considerably.

Appendix A

Table 7Description and references for the satellite image bands and transformations

Band / index Description Reference

B1–B4 Blue, Green, Red, Near infrared of QuickbirdPan Panchromatic of QuickbirdVNIR1–VNIR3 Green, Red, Near infrared of ASTERSWIR1-6 Shortwave infrared of ASTERPC1–PC4 Principle components of Quickbird bands

B1-4VPC1-3 Principle components of ASTER bands

VNIR1–VNIR3VSWIRPC1-4 Principle components of ASTER bands

VNIR1-3 and SWIR1-6NDVI Normalized Difference Vegetation Index Jensen (2005)SR Simple Ratio Jensen (2005)SQRT_SR Square root of Simple ratio Jensen (2005)SAVI Soil Adjusted Vegetation Index Jensen (2005)EVI Enhanced Vegetation Index Jensen (2005)ARVI Atmospherically Resistant Vegetation Index Jensen (2005)SARVI Soil and Atmospherically Resistant

Vegetation IndexJensen (2005)

MSARVI Modified Soil and Atmospherically ResistantVegetation Index

Jensen (2005)

TVI Triangular Vegetation Index Jensen (2005)TEXVAR3-25 Occurrence texture Variance of

panchromatic band, (3×3–25×25)Jensen (2005)

II2 Infrared Index Jensen (2005)TNDVI Transformed Normalized Vegetation Index Jensen (2005)MSI Moisture Stress Index Jensen (2005)MIDIR Midinfrared Index Jensen (2005)NDWI Normalized Difference Water Index Kääb (2005)TEXCON TEXENTTEXSAMTEXCORR

Gray-level co-occurrence textures Contrast,Entropy, Second Angular Moment,Correlation of panchromatic band(Quickbird) or VPC1 (ASTER), kernel 3×3–25×25 (Quantization level=64)

Haralick and Shapiro(1992) Haralick et al.(1973)

TC1–TC4 Brightness, Greenness, Wetness, Haze Horne (2003)

References

Abrams, M., Hook, S., & Ramachandran, B. (2003). ASTER User Handbook. Version 2.Jet Propulsion Laboratory, Pasadena CA, EROS Data Center, Siuox Falls SD (pp.25−26).

Ahern, F. J., Epp, H., Cahoon, D. R., Jr., French, N. H., Kasischke, E. S., & Michalek, J. L.(2000). Fire, climate change and carbon cycling in the boreal forest. In E. S.Kasischke & B. J. Stocks (Eds.), Fire, climate change, and carbon cycling in the borealforest, volume 138 of ecological studies chapter using visible and near-infrared satelliteimagery to monitor boreal forests (pp. 312−330). : Springer.

`Alexeyev, V., Birdsey, R., Stakanov, V., & Korotkov, I. (1995). Carbon in vegetation ofRussian forests: Methods to estimate storage and geographical distribution. Water,Air, and Soil Pollution, 82(1), 271−282.

Alexeyev, V. A., & Birdsey, R. A. (Eds.). (1998). Carbon storage in forests and peatlands ofRussia Gen. Tech. Rep. NE-244 Radnor, PA: U.S. Department of Agriculture, ForestService, Northeastern Research Station.

Alexeyev, V. A., Birdsey, R. A., Korotkov, R. A., Stakanov, V. D., Yefremov, S., Yefremova, T.,et al. (1994). Sukachev Institute for Forest Research, Krasnoyarsk (pp. 40−145).(in Russian).

Alexeyev, V. A., Birdsey, R. A., Stakanov, V. D., & Korotkov, I. A. (2000). Carbon storage in theAsian boreal forests of Russia. In E. S. Kasischke & B. J. Stocks (Eds.), Fire, climate changeand carbon cycling in the boreal forest, volume 138 of ecological studies (pp. 239−257). :Springer.

Bazilevich, N. (1993). Biological productivity of ecosystems in Northern Eurasia.Moscow:Nauka (in Russian).

Cabaravdic, A., 2007. Efficient Estimation of Forest Attributes with k NN. PhD thesis, Fakultätfür Forst- und Umweltwissenschaften der Albert-Ludwigs-Universität, Freiburg i. Brsg.

Camill, P., & Clark, J. (1998). Climate change disequilibrium of boreal permafrostpeatlands caused by local processes. The American Naturalist, 151(3), 207−222.

Camill, P., Lynch, J., Clark, J., Adams, J., & Jordan, B. (2001). Changes in biomass, above-ground net primary production, and peat accumulation following permafrost thawin the boreal peatlands of Manitoba, Canada. Ecosystems, 4(5), 461−478.

DigitalGlobe (2008). Standard Satellite Imagery. Accessed on May, 15, 2008. URL:http://www.digitalglobe.com/index.php/48/Products?product_id=2

Dong, J., Kaufmann, R. K., Myneni, R. B., Tucker, C. J., Kauppi, P. E., Liski, J., et al. (2003).Remote sensing estimates of boreal and temperate forest carbon pools, sources, andsinks. Remote Sensing of Environment, 84, 393−410.

Efron, B., & Tibshirani, R. (1997). Improvements on cross-validation—the 0.632+bootstrap method. Journal of the American Statistical Association, 92(438), 548−560.

ESRI, 2006. ArcGIS 9. ESRI Data and Maps. Media Kit 2006. Redlands, CA.s.Fehrmann, L. 2006. Alternative approaches for biomass estimation on single-tree level

with special emphasis on the k-Nearest Neighbour (k-NN)method. PhD thesis. Facultyof Forest Sciences and Forest Ecology, Universität Göttingen. 155 p. (in German).

Finley, A. O., McRoberts, R. E., & Ek, A. R. (2006). Applying an efficient k-nearestneighbor search to forest attribute imputation. Forest Science, 52(2), 130−135.

Flessa, H., Rodionov, A., Dykmans, J., & Guggenberger, G. (2006). Landscape controls ofCH4 fluxes and soil organic matter in a catchment of the forest tundra at the lowerYenissej. In R. Hatano & G. Guggenberger (Eds.), Symptom of Environmental Changein Siberian Permafrost Region Sapporo: Hokkaido University Press.

Franco-Lopez, H., Ek, A. R., & Bauer, M. E. (2001). Estimation andmapping of forest standdensity, volume, and cover type using the k-nearest neighbors method. RemoteSensing of Environment, 77(3), 251−274.

Franklin, S. E., Hall, R. J., Moskal, L. M., Maudie, A. J., & Lavigne, M. B. (2000).Incorporating texture into classification of forest species composition from airbornemultispectral images. International Journal of Remote Sensing, 21(1), 61−79.

Franklin, S. E., Wulder, M. A., & Gerylo, G. R. (2001). Texture analysis of IKONOSpanchromatic data for Douglas-fir forest age class separability in British Columbia.International Journal of Remote Sensing, 22(13), 2627−2632.

Gjertsen, A. K. (2007). Accuracy of forest mapping based on Landsat TM data and a kNN-based method. Remote Sensing of Environment, 110, 420−430.

Greenberg, J. A., Solomon, Z. D., & Ustin, S. L. (2005). Shadow allometry: Estimating treestructural parameters using hyperspatial image analysis. Remote Sensing of Environment,97, 15−25.

Haralick, R. M., Shanmugam, K., & Dinstein, I. (1973). Textural Features for ImageClassification.IEEE Transactions on Systems, Man, and Cybernetics, 610−621 Vol. SMC-3.

Haralick, R. M., & Shapiro, L. G. (1992). Computer and robot vision. Vol. I. (pp. 460). :Addison-Wesly.

Hay, G. J., Niemann, K. O., & McLean, G. F. (1996). An object-specific image-textureanalysis of H-resolution forest imagery. Remote Sensing of Environment, 55,108−122.

Hechenbichler, K., & Schliep, K. (2004). Weighted k-nearest-neighbor techniques andordinal classification. Discussion Paper 399, SFB 386, Ludwigs–Maximilians UniversityMunich (pp. 16). http://epub.ub.uni-muenchen.de/1769/

Hese, S., Lucht,W., Schmullius, C., Barnsley,M., Dubayah, R., Knorr, D., et al. (2005). Globalbiomass mapping for an improved understanding of the CO2 balance—the earthobservation mission carbon-3D. Remote Sensing of Environment, 94(1), 94−104.

Hobbie, S. E., Schimel, J. P., & Trumbore, S. E. (2000). Controls over carbon storage andturnover in high-latitude soils. Global Change Biology, 6, 196−210.

Horne, J. H. (2003). A tasseled cap transformation for IKONOS images. ASPRS AnnualConference Proceedings, Anchorage, Alaska (pp. 9).

Houghton, R., Butman, D., Krankina, O. N., Schlesiger, P., & Stone, T. A. (2007). MappingRussian forest biomass with data from satellites and forest inventories. Environ-mental Research Letters, 2, 1−7.

Intergovernmental Channel on Climate Change (IPCC) (2007). Working Group I: Thephysical science basis of climate change. ARA 4 Report, Observations 4: Changes inSnow, Ice and Frozen Ground http://ipcc-wg1.ucar.edu/wg1/wg1-report.html

International Institute of Applied System Analysis, Forestry program (IIASA FOR) 2007.www.iiasa.ac.at/docs/Research/FOR/russia_cd/, Laxenburg, Austria.

Isaev, A., Korovin, G., Zamolodchikov, D., Utkin, A., & Pryaznikov, A. (1995). Carbon stockand deposition in phytomass of the Russian forests.Water, Air and Soil Pollution, 82,247−256.

Jensen, J.R., 2005. Introductory digital image processing: A remote sensing perspective:Upper Saddle River, NJ: Pearson/Prentice Hall.

Jia, S., & Akiyama, T. (2005). A precise, unified method for estimating carbon storage incool-temperate deciduous forest ecosystems. Agricultural and Forest Meteorology,154, 70−80.

Kääb, A. (2005). Remote sensing of mountain glaciers and permafrost creep. Schrif-tenreihe Physische Geographie Glaziologie und Geomorphodynamik 48, UniversitätZürich.

Page 14: Remote Sensing of Environment - Kangwonsar.kangwon.ac.kr/satrs14/Quickbird_carbon.pdf · 2014-03-24 · Remote sensing applications: High latitude permafrost regions are difficult

531H. Fuchs et al. / Remote Sensing of Environment 113 (2009) 518–531

Kajimoto, T., Matsuura, Y., Osawa, A., Abaimov, A., Zyryanova, O., Isaev, A., et al. (2006).Size-mass allometry and biomass allocation of two larch species growing on thecontinuous permafrost region in Siberia. Forest Ecology and Management, 222(1–3),314−325.

Katila, M., & Tomppo, E. (2001). Selecting estimation parameters for the Finishmultisource National Forest Inventory. Remote Sensing of Environment, 76, 16−32.

Kim, H. -J., & Tomppo, E. (2006). Model-based prediction error uncertainty estimationfor k-nn method. Remote Sensing of Environment, 104, 257−263.

Krause, K. (2003). Radiance conversion of Quickbird data. Technical Note, DigitalGlobeLongmont, C. 4 p.

Labrecque, S., Fournier, R. A., Luther, J. E., & Piercey, D. (2006). A comparison of fourmethods to map biomass from Landsat-TM and inventory data in westernNewfoundland. Forest Ecology and Management, 226, 129−144.

Leboeuf, A., Beaudoin, A., Forunier, R. A., Guindon, L., Luther, J. E., & Lambert, M. -C.(2007). A shadow fraction method for mapping biomass of northern boreal blackspruce forests using QuickBird imagery. Remote Sensing of Environment, 110,488−500.

Leica Geosystems Geospatial Imaging (2006). Erdas Imagine 9.1. On-Line Help.Lu, D. (2005). Aboveground biomass estimation using Landsat TM data in the Brazilian

Amazon. International Journal of Remote Sensing, 26(12), 2509−2525.Lu, D. (2006). The potential and challenge of remote sensing-based biomass estimation.

International Journal of Remote Sensing, 27(7), 1297−1328.Lu, D., Mausel, P., Brondizio, E., & Moran, E. (2002). Above-ground biomass estimation of

successional and mature forests using tm images in the Amazon basin. Advances inSpatial Data Handling: 10th International Symposium on Spatial Data Handling.

Mäkelä, H., & Pekkarinen, A. (2001). Estimation of timber volume at the sample plotlevel by means of image segmentation and Landsat TM imagery. Remote Sensing ofEnvironment, 77, 66−75.

McRoberts, R., Nelson, M. D., & Wendt, D. G. (2002). Stratified estimation of forest areasusing satellite imagery, inventory data and the k-Nearest Neighbor technique. RemoteSensing of Environment, 82, 457−468.

McRoberts, R. E., Tomppo, E., Finley, A. O., & Heikkinen, J. (2007). Estimating areal meansand variances of forest attributes using the k-nearest neighbors technique andsatellite imagery. Remote Sensing of Environment, 111(4), 466−480.

Monserud, R. A., Onucchin, A. A., & Tchebakova, N.M. (1996). Needle, crown, stem, and rootphytomass of Pinus sylvestris stands in Russia. Forest Ecology and Management, 82,59−67.

Montagu, K., Düttmer, K., Barton, C., & Cowie, A. (2005). Developing general allometricrelationships for regional estimates of carbon sequestration—an example usingEucalyptus pilularis from seven contrasting sites. Forest Ecology and Management,204(1), 115−129.

Muinonen, E., Maltamo, M., Hyppänen, H., & Vainikainen, V. (2001). Forest standcharacteristics estimation using a most similar neighbor approach and imagespatial structure information. Remote Sensing of Environment, 78, 223−228.

Muukkonen, P., & Heiskanen, J. (2005). Estimating biomass for boreal forests usingASTER satellite data combined with standwise forest inventory data. RemoteSensing of Environment, 99, 434−447.

Muukkonen, P., & Heiskanen, J. (2006). Biomass estimation over a large area based onstandwise forest inventory data and ASTER and MODIS satellite data: A possibilityto verify carbon inventories. Remote Sensing of Environment, 107(3), 617−624.

Nelson, R., Krabill, W., & Tonelli, J. (1988). Estimating forest biomass and volume usingairborne laser data. Remote Sensing of Environment, 24, 247−267.

Nilsson, S., Shvidenko, A., Stolbovoi, V., Gluck, M., Jonas, M., & Obersteiner, M. (2000).Full Carbon Account for Russia. Interim Report IR-00-021. Laxenburg, Austria:International Institute for Applied Systems Analysis (IIASA).

Ohmann, L. F., Grigal, D. F., & Brander, R. B. (1976). Biomass estimation for five shrubsfrom northeasternMinnesota. Research Paper NC-133, U.S. Dept. of Agriculture, ForestService, St. Paul, MN North Central Forest Experiment Station.

Proisy, C., Couteron, P., & Fromard, F. (2007). Predicting andmapping mangrove biomassfrom canopy grain analysis using Fourier-based textural ordination of Ikonosimages. Remote Sensing of Environment, 109, 379−392.

Puissant, A., Hirsch, J., & Weber, C. (2005). The utility of texture analysis to improve per-pixel classification for high to very high spatial resolution imagery. InternationalJournal of Remote Sensing, 26(4), 733−745.

Rahman, M. M., Csaplovics, E., & Koch, B. (2005). An efficient regression strategy forextracting forest biomass information from satellite sensor data. InternationalJournal of Remote Sensing, 26(7), 1511−1519.

Ranson, K. J., Sun, G., Kharuk, V. I., & Kovacs, K. (2004). Assessing tundra–taiga boundarywith multi-sensor satellite data. Remote Sensing of Environment, 93, 283−295.

R Development Core Team (2007). R: A language and environment for statistical computing.Vienna, Austria: R Foundation for Statistical Computing http://www.R-project.org

Rodionov, A., Flessa, H., Grabe, M., Kazansky, O. A., Shibistova, O., & Guggenberger, G.(2007). Organic carbon and total nitrogen variability in permafrost-affected soils ina forest tundra ecotone. European Journal of Soil Science, 58, 1260−1272.

Schlesinger, W. H. (1997). Biogeochemistry: An analysis of global change, 2. ed. San Diego:Academic press.

Schliep, K., & Hechenbichler, K. (2006). The kknn package.The weighted k-NearestNeighbors : Version 1.0.3 http://cran.r-project.org

Stage, A. R., & Crookston, N. L. (2007). Partitioning error components for accuracy-assessment of near-neighbor methods of imputation. Forest Science, 53, 62−72.

Tomppo, E., Nilsson, M., Rosengren, M., Aalto, P., & Kennedy, P. (2002). Simultaneous useof Landsat-TM and IRS-1C WiFS data in estimating large area tree stem volume andaboveground biomass. Remote Sensing of Environment, 82, 156−171.

Trotter, C. M., Dymond, J. R., & Goulding, C. J. (1997). Estimation of timber volume in aconiferous plantation forest using Landsat TM. International Journal of RemoteSensing, 18(10), 2209−2223.

Tuominen, S., & Pekkarinen, A. (2005). Performance of different spectral and textural aerialphotograph features in multi-source forest inventory. Remote Sensing of Environment, 94,256−268.

Turner, D., Koerper, G., Harmon, M., & Lee, J. (1995). A carbon budget for forests of theconterminous United States. Ecological Applications, 5(2), 421−436.

Vaganov, E., Efremov, S., & Onuchin, A. (2006). Part I anthropogenic greenhouse gases inatmosphere. In L. K. Lombardi & S. Altunina (Eds.), Carbon balance and the emission ofgreenhouse gases in boreal forests and bogs of Siberia, NATO Science Series : Springer.

Van der Sanden, J. J., & Hoekman, D. H. (2005). Review of relationships between grey-tone co-occurrence, semivariance and autocorrelation based image texture analysisapproaches. Canadian Journal of Remote Sensing, 31(3), 207−213.

Vederova, E., Pleshikov, F., & Kaplunov, V. (2002). Structure of organic matter inecosystems of Central Siberian northern taiga, (in Russian). Lesovedenie, 6, 3−12.

White, A., Cannell, M. G. R., & Friend, A. D. (2000). The high-latitude terrestrial carbonsink: A model analysis. Global Change Biology, 6, 227−245.

Wirth, C., Schulze, E., Schulze, W., von Stünzner-Karbe, D., Ziegler, W., Miljukova, I., et al.(1999). Above-ground biomass and structure of pristine Siberian scots pine forestsas controlled by competition and fire. Oecologia, 121(1), 66−80.

Wulder, M. A., White, J. C., Fournier, R. A., Luther, J. E., & Magnussen, S. (2008). Spatiallyexplicit large area biomass estimation: Three approaches using forest inventory andremotely sensed imagery in a gis. Sensors, 8, 529−560.

Zheng, D., Rademacher, J., Chen, J., Crow, T., Bresee, M., Le Moine, J., et al. (2004).Estimating aboveground biomass using Landsat 7 ETM+ data across a managedlandscape in northern Wisconsin, USA. Remote Sensing of Environment, 93, 402−411.