Derivation of the Green Vegetation Fraction of the Whole ...

16
Derivation of the Green Vegetation Fraction of the Whole China from 2000 to 2010 from MODIS Data Xiaosong Li * and Jin Zhang Key Laboratory of Digital Earth Science, Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, Beijing, China Received 23 January 2015; in final form 30 December 2015 ABSTRACT: The green vegetation fraction Fg, which represents the hori- zontal density of live vegetation, is an important parameter for the study of global energy, carbon, hydrological, and biogeochemical cycling. A common method of calculating Fg is to create a simple linear mixing model between two NDVI endmembers: bare soil NDVI, NDVI 0 , and full vegetation NDVI, NDVI . However, many uncertainties exist for the determination of these pa- rameters at large scales. The present study investigates how NDVI and NDVI 0 determination can impact Fg calculations for all of China, based on different land-cover datasets, hyperspectral data, and soil type classification maps. The results show the following: 1) The regional ChinaCover dataset, with higher accuracy and more detailed classification, is preferable for calculating Fg in China, compared with the most commonly used MOD12Q1 dataset, although it would not lead to too much difference in NDVI values. 2) The soil NDVI from * Corresponding author address: Xiaosong Li, Key Laboratory of Digital Earth Science, Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, No. 1 Beichen West Road, Chaoyang District, Beijing, 100101, China. E-mail address: [email protected] Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 1 DOI: 10.1175/EI-D-15-0010.1 Copyright Ó 2016, Paper 20-008; 38221 words, 6 Figures, 0 Animations, 3 Tables. http://EarthInteractions.org Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Transcript of Derivation of the Green Vegetation Fraction of the Whole ...

Page 1: Derivation of the Green Vegetation Fraction of the Whole ...

Derivation of the Green VegetationFraction of the Whole China from2000 to 2010 from MODIS DataXiaosong Li* and Jin Zhang

Key Laboratory of Digital Earth Science, Institute of Remote Sensing and Digital Earth,Chinese Academy of Sciences, Beijing, China

Received 23 January 2015; in final form 30 December 2015

ABSTRACT: The green vegetation fraction Fg, which represents the hori-zontal density of live vegetation, is an important parameter for the study ofglobal energy, carbon, hydrological, and biogeochemical cycling. A commonmethod of calculating Fg is to create a simple linear mixing model between twoNDVI endmembers: bare soil NDVI, NDVI0, and full vegetation NDVI,NDVI∞. However, many uncertainties exist for the determination of these pa-rameters at large scales. The present study investigates how NDVI∞ and NDVI0determination can impact Fg calculations for all of China, based on differentland-cover datasets, hyperspectral data, and soil type classification maps. Theresults show the following: 1) The regional ChinaCover dataset, with higheraccuracy and more detailed classification, is preferable for calculating Fg inChina, compared with the most commonly used MOD12Q1 dataset, although itwould not lead to too much difference in NDVI∞ values. 2) The soil NDVI from

*Corresponding author address: Xiaosong Li, Key Laboratory of Digital Earth Science,Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, No. 1 Beichen WestRoad, Chaoyang District, Beijing, 100101, China.

E-mail address: [email protected]

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 1

DOI: 10.1175/EI-D-15-0010.1

Copyright � 2016, Paper 20-008; 38221 words, 6 Figures, 0 Animations, 3 Tables.http://EarthInteractions.org

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 2: Derivation of the Green Vegetation Fraction of the Whole ...

Hyperion datasets shows that soils have highly variable NDVI values (0.006–0.2), and 79.36% of the area studied has a much larger NDVI value than thecommonly used value of 0.05. Therefore, the dynamic NDVI0 values withdifferent soil types are much better for Fg calculation than the invariant NDVI0value (0.05), which would yield a significant overestimation of Fg, especiallyfor areas with low vegetation coverage. 3) A high-quality Fg dataset for Chinafrom 2000 to 2010 was established with NDVI∞ and NDVI0 parameters basedon MOD13Q1 250-m NDVI data.

KEYWORDS: Observational techniques and algorithms; Data mining;Remote sensing; Models and modeling; Ecological models

1. IntroductionTerrestrial vegetation plays a key role in global energy, carbon, hydrological,

and biogeochemical cycling (Potter et al. 2008). The green vegetation fraction Fg,which represents the horizontal density of live vegetation, is of particular impor-tance for regional and global carbon modeling, ecological assessment, and agri-cultural monitoring (Asner and Lobell 2000; Lucht et al. 2002; Parmesan and Yohe2003). At the ecosystem level, the normalized difference vegetation index (NDVI)calculated from coarse spatial resolution satellite data, has been widely utilized toestimate Fg by exploiting the difference in visible and near-infrared (NIR) re-flectance due to the presence of chlorophyll (Reed 2006; Tucker 1979; Xiao andMoody 2005).

Models used to derive Fg based on NDVI are generally simple linear (Gutmanand Ignatov 1998) or quadratic (Carlson and Ripley 1997) combinations of twoendmembers: NDVI from dense (LAI . 3) live vegetation and soil. The simplelinear model developed by Gutman and Ignatov (1998), hereafter referred to as theG–I approach, was widely applied because of its ease of implementation, whichstems partly from preselected values of NDVI for the soil and plant endmembers(Montandon and Small 2008). However, the selection of NDVI values for the twoendmembers is complicated by variations in the spectral signals due to differencesin vegetation type, plant health, leaf water content, and other factors (Elmore et al.2000). Also, the spectral signature of soil varies, depending upon mineralogy,moisture, and grain size (Baumgardner et al. 1985). Considering the difficulties inaddressing the spatial variability of live vegetation and soil endmembers over largeareas, both the linear and quadratic models are normally parameterized usingsingle estimated NDVI values of live vegetation NDVI∞ and soil NDVI0 end-members. The most common technique for estimating the two endmembers is toinfer them from the data themselves. The NDVI∞ can be defined as the highestNDVI value within the scene (Gallo et al. 2001; Li et al. 2002). The NDVI0 iscommonly inferred from the lowest historical NDVI values within the scene (e.g.,the G–I approach). However, many authors opt for the use of popular publishedNDVI0 values of 0.05 or less (Gebremichael and Barros 2006; Sridhar et al. 2003;Zeng et al. 2000). Since single NDVI∞ or NDVI0 values are obviously not validwhen studying large areas, alternative methods for determining more appropriateNDVI∞ and NDVI0 values were developed. For NDVI∞, the NDVI data can besplit into biomes, and the maximum value can be selected from each (Matsuiet al. 2005; Oleson et al. 2000). For NDVI0, combining soil spectral databaseswith temporal NDVI information for each pixel can yield better estimates of Fg

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 2

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 3: Derivation of the Green Vegetation Fraction of the Whole ...

than using global-invariant NDVI0 values estimated from whole scenes(Montandon and Small 2008). Nevertheless, the accuracy of the land-coverdataset and the absence of sufficiently detailed soil reflectance data constrain theaccuracy of Fg calculations.

Here, taking all of China as the study region, we investigate how NDVI∞ andNDVI0 determination can impact Fg calculations for all of China using MODIS16-day NDVI imagery. We limit this study to the problems resulting from invariantassumptions of NDVI∞ and NDVI0 when using NDVI in a linear G–I model andhow these assumptions affect Fg estimations. First, we evaluate the influence ofNDVI∞ determination on Fg calculations by comparing two land-cover datasets:the MODIS land cover using the International Geosphere–Biosphere Programme(IGBP) classification system and the regional land-cover dataset ChinaCover de-veloped by the Chinese Academy of Sciences. It is worth noting that different landcover would have different NDVI∞ values. Second, we evaluate the error in Fgintroduced by the NDVI0 selection by comparing a popular single NDVI0 value(0.05) to values from the soil type dataset and hyperspectral data. Finally, wepresent a green vegetation fraction dataset for all of China from 2000 to 2010 byadopting the optimal combinations of NDVI∞ and NDVI0.

2. Materials and methodology

2.1. Datasets

2.1.1. NDVI data

In this study, we used the MOD13Q1 vegetation index product obtained fromNational Aeronautics and Space Administration (NASA)’s Earth Observation Sys-tem (EOS) with a spatial resolution of 250m. It is calculated from the two-wayatmospherically corrected surface reflectivity that reduces the effect of water, clouds,and heavy aerosols and comes with a cloud shadow mask. A 16-day composite wasused to further improve data quality. The value range of the MODIS NDVI dataset isbetween22000 and 10 000, with a scale conversion factor of 10 000. The time rangeof the dataset is from 2000 to 2010. We used 250 completely different time phasesfor China in the entire time range. The data were stitched and cut off from 19 scenesof MODIS images, covering all mainland China and Taiwan.

First, we transformed the sinusoidal projection generally used in MODIS productsto the Albers equal area projection. Then, a Savitzky–Golay filter-based methodproposed by Chen et al. (2004), which was developed to make data approach the upperNDVI envelope and to reflect the changes in NDVI patterns via an iteration process,was applied to the time series NDVI dataset annually in order to further smooth outnoise in NDVI time series, specifically the depressed NDVI values caused primarily bycloud contamination and atmospheric variability. After these preprocessing steps, weprepared 250 cloudless NDVI images for China with minimum deviations.

2.1.2. Land-cover data

Two land-cover datasets were compared to investigate their influence on Fgcalculations. One was the Collection 5.1 MODIS land-cover product (MCD12Q1),which includes adjustments for significant errors that were detected in Collection 5.

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 3

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 4: Derivation of the Green Vegetation Fraction of the Whole ...

It is an annual product from 2000 to 2011 with a spatial resolution of 500m, and theIGBP classification scheme was used in the study, which includes 11 natural vege-tation classes, 3 developed and mosaicked land classes, and 3 nonvegetated landclasses. The other was the ChinaCover product with a spatial resolution of 30m, anew land-cover product developed by the Chinese Academy of Sciences; this pro-duct was specially designed for China’s ecological change analysis and includes atotal of 38 classes. The most striking characteristic of the ChinaCover product is thatmore vegetation types are considered: 24 vegetation types in ChinaCover comparedto 14 vegetation types in the MCD12Q1 product. Land cover in 2010 in both land-cover datasets were chosen to carry out NDVI∞ determination. The changes in landcover were not considered because there were no real-time land-cover datasets.

2.1.3. Soil data

The soil type data were from China’s 1:4 000 000 soil spatial database digital maps,which were published by the Institute of Soil Science, Chinese Academy of Sciences(ISSCAS) in 1996. The maps were digitized based on China’s soil map published byISSCAS in 1978. The Albers equal area conic projection was used, and differentpolygons with identifiers (codes) represent different soil types. This study used the firstclass of the soil type classification system, and a total of 57 soil types were included.

2.1.4. Hyperspectral data

To obtain sufficient soil reflectance data for retrieving NDVI0, Earth Observing-1 (EO-1) Hyperion data for China (Figure 1) with a 30-m spatial resolution from2000 to 2010 were downloaded (670 scenes). The preprocessing techniques wereapplied to the Hyperion data, including fixing bad and outlier pixels, local des-triping, atmospheric correction, and minimum noise fraction smoothing, whichensures a consistent and standardized time series of data that is compatible withfield-scale and airborne measured indices (Datt et al. 2003). To match them withMODIS NDVI data, the Hyperion spectra were first resampled using MODISspectral response functions, then the NDVI was calculated.

2.2. Methods

2.2.1. Fg calculation model

The G–I approach (Gutman and Ignatov 1998) was used to calculate Fg based onthe dimidiate pixel model, assuming each pixel is composed of only two compo-nents: vegetation and nonvegetation. The spectral information results from linearmixing of these two components. The proportional area of each component in thepixel is the weight of each component. The proportional area of vegetation is the Fgof the pixel, as mathematically expressed using the following formula:

Fg5NDVI2NDVI0NDVI∞ 2NDVI0

, (1)

where Fg is the green vegetation fraction in the mixed pixel, NDVI is the NDVIof the mixed pixel, NDVI0 is the NDVI value for bare soil, and NDVI∞ is the

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 4

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 5: Derivation of the Green Vegetation Fraction of the Whole ...

NDVI value for each land-cover type corresponding to 100% green vegetationcover. It is clear that this dimidiate pixel model is a linear regression model forNDVI, and the accuracy of the results of the dimidiate pixel model depends onthe value of NDVI∞ and NDVI0.

2.2.2. NDVI‘ determination

We used the same method described in Zeng et al. (2000) to calculate NDVI∞for each IGBP and ChinaCover land-cover type, since their method was vali-dated with Fg estimates from 1- and 2-m satellite data and has been wildlyapplied for global Fg mapping (Wu et al. 2014; Broxton et al. 2014). First, wecalculated the annual maximum NDVI image from 2000 to 2010, pixel by pixel.Second, histograms of the annual maximum images from 2000 to 2010 werecalculated for each land-cover type for the corresponding years. Third, NDVI∞was taken as the 90th percentile for closed shrublands and urban and built-uplands, and the 75th percentile for other vegetation types. The NDVI∞ for openshrubland or barren or sparsely vegetated land was taken to be the same as thatfor closed shrubland. Finally, the NDVI∞ for each IGBP and ChinaCover

Figure 1. Distribution of validation samples and Hyperion data. The land cover wasbased on ChinaCover 2010 at 30-m spatial resolution. The 38 land-coverclasses were reclassified into sux dominant categories: 1) woodland,2) grassland, 3) wetland, 4) cropland, 5) settlement, and 6) others. TheHyperion coverage represents the coverages of all 670 EO-1 Hyperionscenes acquired from 2000 to 2010. The stars show the locations of 400validation samples. (Because of the coarse scale of the map, a single starmay represent more than one adjacent validation sample.)

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 5

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 6: Derivation of the Green Vegetation Fraction of the Whole ...

category were determined based on the land cover–specific statistic of the an-nual maximum NDVI distribution.

2.2.3. NDVI0 determination

First, we used the popular NDVI0 value of 0.05 proposed by Zeng et al. (2000)and widely applied in Fg calculations (hereafter referred as constant NDVI0). Also,we generated statistics from the bare land Hyperion NDVI data with soil types tocalculate the NDVI0 for each soil group. The bare lands in Hyperion data forNDVI0 determination were defined through two steps. First, the bare land ofChinaCover 2010 was extracted as the background; then, an NDVI threshold(0.001–0.25) was utilized on the background to refine the pure ‘‘bare land’’ pixels.Finally, the NDVI0 for each soil group was defined as the average of the bare lands’Hyperion NDVI values for corresponding soil groups (hereafter referred as dy-namic NDVI0).

2.2.4. Fg validation

The Fg product is validated against high-resolution images from Google EarthPro (Google, Inc.) in 400 locations with different land-cover types (Figure 1). Thehigh-resolution images were chosen according to the following criteria: 1) theoverpass time was between May and September; 2) our initial, quick look, visualassessment of Fg; and 3) the neighboring landscape was essentially homogeneous(approximates a 2 3 2 MODIS 250-m pixel window size). The third criterion wasincorporated in order to reduce the effects of coregistration errors between theMODIS and Google Earth high-resolution images.

The Fg pixel whose centroid was closest to each location selected in GoogleEarth was identified, and the boundary of each pixel, geolocated according to thefour corners, was overlaid on the Google Earth imagery. The boundaries of theselected pixels for each location were then converted to keyhole markup language(KML) files and displayed in Google Earth, and the high-resolution images cor-responding to each validation pixel were saved with a nominal spatial resolution of2.5m in Google Earth Pro. Also, the overpass time of the images was recorded.

First, the land-cover type of each image was determined through analyzing thehigh spatial resolution image carefully. Then, the images were applied to a seg-mentation algorithm in ENVI 5 (from Exelis Visual Information Solutions, Inc.) todivide them into different homogenous objects. We visually merged the objects thatoverlap with the green vegetation and calculated the green vegetation fraction foreach validation location.

2.2.5. Evaluation of the influence of NDVI‘ and NDVI0 determination

Keeping NDVI∞ or NDVI0 constant, Fg was calculated and compared with the 400samples in the validation data. The root-mean-square errors (RMSE) for all thesamples and different classes were further analyzed. Then, the best combination ofNDVI∞ and NDVI0 was applied to estimate the Fg for all of China from 2000 to 2010.The pixels with negative values were set to zero, and the pixels with values larger thanone were set to one. Permanent snow, water, and unclassified pixels were masked.

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 6

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 7: Derivation of the Green Vegetation Fraction of the Whole ...

3. Results

3.1. Values of NDVI‘ and their influence on Fg calculationThe NDVI∞ values are given in Table 1. Also, the spatial distribution of NDVI∞

is shown in Figure 2. Clearly, there is little difference between IGBP and China-Cover for determining the NDVI∞ value. Deciduous broadleaf forests have thelargest values, 0.90 and 0.89 for IGBP and ChinaCover, respectively, whereasgrasslands have the smallest values, 0.74 for IGBP and 0.74–0.76 for ChinaCover.Because more vegetation types were considered in ChinaCover, one IGBP categorywould correspond to more ChinaCover categories; therefore, more detailed NDVI∞values were assigned in ChinaCover. For example, the NDVI∞ value for croplandsin IGBP is 0.84, while it is 0.84 for paddy fields and 0.87 for dry land in China-Cover; the NDVI∞ value for shrublands in IGBP is 0.86, while it is 0.89 forevergreen shrublands and 0.86 for deciduous shrublands in ChinaCover. Therefore,we can see that the NDVI∞ values were more detailed in ChinaCover than in IGBP,despite the similar distribution characteristics.

Assuming a constant NDVI0 of 0.05, the NDVI∞ values based on IGBP andChinaCover were utilized to calculate the Fg values for China, and the Fg resultswere validated by the Fg data of the 400 samples across China. The validationresults for two estimated Fg results are shown in Figure 3. The Fg calculationaccuracy was slightly better on the whole with NDVI∞ values from ChinaCoverthan from IGBP, with RMSE values of 0.1978 and 0.2079, respectively. However, asignificant improvement was observed for high vegetation coverage samples dueto the alleviation of the overestimation problem. For low vegetation coveragesamples (Fg, 0.5), there was no clear difference for Fg calculation whether eitherChinaCover or IGBP land-cover data were used.

Table 2 shows the validation results calculated according to different landcovers. Whether MODIS or ChinaCover, the Fg calculation accuracy is highest for

Table 1. Values of NDVI∞ with different land cover.

IGBP category NDVI∞ value ChinaCover category NDVI‘ value

Evergreen needleleaf forest 0.81 Evergreen needleleaf forest 0.86Evergreen broadleaf forest 0.86 Evergreen broadleaf forest 0.86Deciduous needleleaf forest 0.88 Deciduous needleleaf forest 0.88Deciduous broadleaf forest 0.90 Deciduous broadleaf forest 0.89Mixed forest 0.87 Mixed forest 0.86Closed shrublands 0.86 Evergreen shrublands 0.89Open shrublands 0.86 Deciduous shrublands 0.86Woody savannas 0.75 Meadow 0.74Savannas 0.76 Steppe 0.74Grasslands 0.74 Grasslands 0.76Permanent wetlands 0.86 Forest wetlands 0.86Croplands 0.84 Shrub wetlands 0.86Urban and built-up 0.86 Grass wetland 0.86Cropland/natural vegetation mosaic 0.82 Paddy fields 0.84Barren or sparsely vegetated 0.86 Dry land 0.87

Plantations 0.87Settlements 0.86Barren or sparsely vegetated 0.86

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 7

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 8: Derivation of the Green Vegetation Fraction of the Whole ...

Figure 2. Distribution of (a) NDVI∞ and (b) NDVI0 values based on ChinaCover andHyperion data.

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 8

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 9: Derivation of the Green Vegetation Fraction of the Whole ...

forest then cropland, grassland, and shrubland. The Fg calculation using NDVI∞values from ChinaCover yielded higher accuracies for forest, grassland, andcropland, but there is no obvious difference for shrubland. The highest RMSEdecrease occurred for grassland (a relative reduction of 6.90%), followed by forest(a relative reduction of 6.67%) and cropland (a relative reduction of 3.70%).

Higher RMSE values resulted when IGBP-based NDVI∞ values were used tocalculate Fg because of the inherent classification inaccuracies in global IGBP landcover. According to the land-cover type of validation sample, 94 of 200 forestsamples were misclassified (mainly between different forest types), 16 of 31shrubland samples were misclassified (mainly as forest or cropland), 13 of 19grassland samples were misclassified (mainly as cropland), and 16 of 67 croplandsamples were misclassified (mainly as grassland). In contrast, the ChinaCover weretotally consistent with the validation sample, which could be attributed in part tothe higher accuracy of ChinaCover where the high spatial resolution image existed,since large amounts of field truth data, including field samples and high spatialresolution images, were incorporated when producing ChinaCover 2010 (Zhanget al. 2014). Also, the Fg calculation accuracy was evaluated for each categoryespecially by using the misclassified sample in IGBP. The results show that theRMSE of Fg calculation decreases from 0.19 to 0.16 (a relative reduction of

Figure 3. Validation of the Fg calculation based on different scenarios: (a) IGBP andconstant NDVI0, (b) ChinaCover and constant NDVI0, and (c) ChinaCoverand dynamic constant NDVI0.

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 9

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 10: Derivation of the Green Vegetation Fraction of the Whole ...

15.79%) for forest, 0.23 to 0.21 (a relative reduction of 8.70%) for cropland, and0.29 to 0.28 (a relative reduction of 3.45%) for grassland, with no obvious dif-ference for shrubland, when ChinaCover was used instead of IGBP. Thus, theaccuracy of land cover was a key factor that affected the Fg calculation. Forexample, the misclassification of grassland into cropland would result in an ob-vious underestimation in the IGBP grassland category (Figure 4), since the NDVI∞values for grassland are significantly smaller than for cropland.

3.2. Values of NDVI0 and their influence on Fg calculation

The NDVI0 values of different soil types are shown in Figure 2. The statisticsshow that the NDVI0 ranges from 0.006 to 0.2, with an average value of 0.1, andmost of the values exceed the constant value of 0.05. With ChinaCover-basedNDVI∞ values, a constant NDVI0 value and dynamic NDVI0 values were used tocompare their influence on Fg calculations. The validation results of two estimatedFg values are shown in Figure 3. The Fg calculation errors using dynamic NDVI0values are lower than those using a constant value of 0.05, with RMSE values of0.1918 and 0.1978, respectively.

Table 2. Comparison of Fg calculation errors in different land-cover types.

Validation category No. RMSE (IGBP) RMSE (ChinaCover) Relative change in RMSE

Forest (Total) 200 0.15 0.14 16.67%Misclassified (IGBP) 94 0.19 0.16 115.79%Shrubland (Total) 31 0.30 0.30 No changeMisclassified (IGBP) 16 0.39 0.39 No changeGrassland (Total) 19 0.29 0.27 16.90%Misclassified (IGBP) 13 0.29 0.28 13.45%Cropland (Total) 67 0.27 0.26 13.70%Misclassified (IGBP) 16 0.23 0.21 18.70%

Figure 4. Underestimation of Fg because of the misclassification of grassland intocropland in IGBP (nine samples).

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 10

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 11: Derivation of the Green Vegetation Fraction of the Whole ...

A total of 22 soil types were covered by Fg validation samples, and the RMSEs ofthe different soil types were analyzed and are shown in Table 3. Note that the RMSEwas lower for 18 of the total 22 soil types when dynamic NDVI0 values were used, andthe accuracy improved the most for castanozem (an RMSE decrease of 0.253), fol-lowed by purplish soil (RMSE decrease of 0.051) and felty soil (RMSE decrease of0.045). For other soil types, including Huanglu soil, Mian soil, fluvo-aquic soil, andcinnamon soil, the RMSE increased by 0.033, 0.008, 0.013, and 0.037, respectively.

Considering that Fg in low vegetation cover areas was more sensitive to theNDVI0 determination, we carried out the validation for low vegetation cover (Fg,0.5) separately. Figure 5 shows that the overestimation that occurs when using aconstant NDVI0 value is dramatically reduced in low vegetation cover areas when adynamic NDVI0 is adopted, and the RMSE also decreases significantly from0.2050 to 0.1584 (a relative reduction of 22.73%), showing that a dynamic NDVI0performs better than a constant NDVI0, especially for low vegetation cover areas.

3.3. Fg distributions

The 16-day Fg in China from 2000 to 2010 was computed with NDVI∞ andNDVI0 parameters calculated from ChinaCover and Hyperion NDVI for differentsoil types, respectively, based on MOD13Q1 250-m NDVI data. After calculatingthe 16-day Fg for China from 2000 to 2010 for each pixel in an image, we com-puted the annual maximum Fg image on a pixel-by-pixel basis from 2000 to 2010(Figure 6). An obvious increasing trend of Fg from west to east could be identified.Moreover, the statistics show that 59.6% of total pixels in China express a high Fg(.60%), and 40.4% of the pixels display a low Fg (�40%). Most of the low Fgareas are located in western China, except for water bodies and settlements.

4. Discussion

4.1. NDVI‘ determination and uncertainty

The most common method for NDVI∞ determination is to calculate the dif-ferent percentiles according to land-cover type. In this study, we adopted the same

Table 3. Comparison of Fg calculation errors in different soil types. RMSEd stands forroot-mean-square error of Fg calculation with dynamic NDVI0, and RMSEc standsfor root-mean-square error of Fg calculation with constant NDVI0.

Soil type RMSEd RMSEc Soil type RMSEd RMSEc

South paddy soil 0.148 0.184 Cinnamon soil 0.108 0.071Shanxue paddy soil 0.1 0.12 Dark brown earth 0.28 0.34Huanglu soil 0.2 0.167 Bleached podzolic soil 0.31 0.31Lou soil 0.123 0.127 Castanozem 0.025 0.273Mian soil 0.09 0.082 Dark meadow soil 0.22 0.24Fluvo-aquic soil 0.128 0.115 Coastal solonchak 0.196 0.214Latosolic red soil 0.143 0.144 Limestone soil 0.164 0.165Red earth 0.082 0.093 Purplish soil 0.124 0.175Yellow earth 0.125 0.125 Aeolian soil 0.022 0.03Yellow-brown earth 0.13 0.136 Mountain meadow soil 0.029 0.029Brown earth 0.16 0.2 Felty soil 0.014 0.059

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 11

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 12: Derivation of the Green Vegetation Fraction of the Whole ...

percentile numbers for different land-cover types as utilized by Zeng et al. (2000),who determined the percentile numbers using a global AVHRR 10-day NDVIcomposite by interpreting the rough percentages of full vegetation coverage areasof different land covers in the conterminous United States based on the high-resolution images. Because of the differences in the study area and NDVI datasetsused, these percentile numbers were certainly not the best values. However, we did

Figure 5. Comparison of Fg calculation accuracy in low vegetation cover area (Fg <0.5) with different NDVI0 values: (a) constant NDVI0 and (b) dynamic NDVI0.

Figure 6. Annual maximum mean Fg in China from 2000 to 2010.

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 12

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 13: Derivation of the Green Vegetation Fraction of the Whole ...

not devote too much effort to fixing this problem, since our main objective was toevaluate the effect on Fg calculation of using a more accurate land-cover dataset.

The NDVI∞ values determined in this study were consistent with those found byMontandon and Small (2008) for the conterminous United States, when the sameNDVI dataset and Zeng et al. (2000) method were used. Deciduous broadleaf forestshave the largest value of 0.89 for the conterminous United States and China, whereasgrasslands have the smallest value of 0.67 for the conterminous United States and0.74–0.76 for China. This consistency shows that our NDVI∞ values were reasonableand are comparable with other Fg studies at the national level. Although the differentland-cover datasets would not lead to a big difference in NDVI∞ values owing to thelarge-scale statistics, the accuracy of land-cover data influences the Fg calculationsignificantly. For example, when grassland was misclassified as cropland, an obviousFg underestimation arose, since the NDVI∞ values for grassland are significantlysmaller than those for cropland. Since ChinaCover has higher spatial resolution(30m) and is supported by several hundred thousand field samples, the Fg resultsusing ChinaCover were substantially better, especially for the misclassified grasslandin MOD12Q1. Therefore, using a regional land-cover dataset (e.g., ChinaCover)makes the Fg calculation in China more reasonable compared with other global land-cover datasets, such as MOD12Q1, which has been criticized for its low accuracy inChina (Ran et al. 2010).

4.2. NDVI0 determination and uncertainty

The soil reflectance data available from Hyperion datasets show that soils havehighly variable NDVI values (0.006–0.2), and 79.36% of the areas have a muchlarger NDVI value than that commonly used (0.05) in Fg models. If the constantNDVI0 value (0.05) was adopted, the underestimation of NDVI0 would yieldsignificant overestimation of Fg, especially for low vegetation coverage areas(Figure 5). Also, given the variability of soil NDVI, using a single NDVI0 value forall of China is not appropriate, particularly for large-scale studies. Therefore, thedynamic NDVI0 values computed from the Hyperion dataset according to differentsoil types were much better for Fg calculation than the invariant NDVI0 value(0.05). Montandon and Small (2008) reached a similar conclusion by analyzing theimpact of NDVI0 determination on the Fg calculation from NDVI.

When no information on soil is available, the most popular method of estimatingthe NDVI0 value is to use the lowest historical NDVI values within the study areaby assuming the pixel with the lowest soil NDVI is free of vegetation (bare soil).However, this assumption is likely to be incorrect because of the coarse resolution(250m) of the MODIS NDVI dataset and high vegetation cover in southeasternChina. Therefore, using a high-resolution soil spectra dataset (e.g., the 30-m Hy-perion dataset used in this study) to determine the NDVI0 value is a more realisticapproach, since the relative high spatial resolution would be beneficial for pure soilpixels extraction. Furthermore, this study incorporates the soil type classificationmap to better constrain NDVI0, making the localized NDVI0 more targeted andreasonable when no detailed soil spectral library exists.

There were some sources of uncertainty inherent in the data and method used forNDVI0 determination in this study. Because of the limited coverage of Hyperion

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 13

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 14: Derivation of the Green Vegetation Fraction of the Whole ...

data, values for three soil types were not retrieved, for which the average of allvalues (0.1) was assigned. Additionally, temporal differences between Hyperionimage acquisitions are also likely to reduce the comparability to some extent;NDVI0 as bare soil spectra changes with soil moisture, which is highly variableseasonally, especially in eastern and southern China. Although the bare land pixelswere carefully determined through combining ChinaCover information and NDVIthreshold, the defined ‘‘bare soil’’ inevitably included some surface features.Choosing the average NDVI value as the NDVI0 for each soil type reduced thesepotential misclassification effects to a certain extent. Finally, artificial surfaceswere not considered for NDVI0 determination individually since they do not be-long to any soil type in the existing classification system. However, the big spectraldifference between artificial surfaces and natural soils would lead to extra errors forFg calculations in urban areas.

4.3. Fg validation and its uncertainty

Because of the widespread distribution of validation samples across China, fieldinvestigations were unrealistic. The use of high spatial resolution imagery fromGoogle Earth circumvents this limitation and provides a means to ‘‘ground truth,’’MODIS-derived Fg values over large spatial extents. Admittedly, this evaluationhas some uncertainty associated with it. To ensure the accuracy as much as pos-sible, we adopted an object-oriented classification technique for each small patch(500m 3 500m), which has proven to be effective and has high accuracy for highspatial resolution images (Mathieu et al. 2007).

Additionally, the validation samples were not selected randomly but focused onthe main vegetation types, such as woodlands, shrublands, grasslands, and crop-lands, and considered the existence of the high spatial resolution data in GoogleEarth. Thus, there is a certain subjectivity in the validation samples’ distribution.However, on the whole, the validation samples were representative because theywere evenly distributed across China and included all the major vegetation types indifferent ecoclimate regions. Since nonvegetated areas were not included in thevalidation samples, the validation results do not represent the overall precision ofthe Fg calculation, which should be higher, because our Fg calculation approachproduced very precise estimates for nonvegetated land covers, such as bare landand water bodies.

5. ConclusionsBased on the MODIS MOD13Q1 NDVI product, we improved the NDVI∞ and

NDVI0 estimations in the G–I model in order to make the Fg calculation morereasonable. Our results show that the different land-cover datasets do not lead tobig differences in NDVI∞ values, but the accuracy of land-cover data influences theFg calculation significantly. The regional ChinaCover dataset is preferred for Fgcalculations in China, compared with the more commonly used MOD12Q1 dataset.Furthermore, the soil reflectance data available from Hyperion datasets show thatsoils have highly variable NDVI values (0.006–0.2), and 79.36% of the areasstudied has a much larger NDVI value than that commonly used in Fg models

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 14

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 15: Derivation of the Green Vegetation Fraction of the Whole ...

(0.05). When the constant NDVI0 value (0.05) was adopted, the underestimation ofNDVI0 would yield significant overestimation of Fg, especially for areas with lowvegetation coverage. Therefore, dynamic NDVI0 values with different soil typesare much better for Fg calculation than the invariant NDVI0 value (0.05). Finally, ahigh-quality Fg dataset for China from 2000 to 2010 was produced, with optimalNDVI∞ and NDVI0 parameters with MOD13Q1 250-m NDVI data, which canbetter support regional carbon modeling, ecological assessment, and agriculturalmonitoring in China.

Acknowledgments. We acknowledge financial support from the Forestry Public In-terest Research Program (201404422) and the National Natural Science Foundation ofChina (41361091). We thank all team members for providing the valuable ChinaCoverproduct supported by the ‘‘National Ecological Environment Dynamic assessment basedon Remote Sensing from 2000 to 2010.’’ Finally, we thank the editor and two anonymousreviewers whose comments helped to improve the quality of this paper.

References

Asner, G. P., and D. B. Lobell, 2000: A biogeophysical approach for automated SWIR unmixing ofsoils and vegetation. Remote Sens. Environ., 74, 99–112, doi:10.1016/S0034-4257(00)00126-7.

Baumgardner, M. F., L. F. Silva, L. L. Biehl, and E. R. Stoner, 1985: Reflectance properties of soils.Advances in Agronomy, Vol. 38, Academic Press, 1–44, doi:10.1016/S0065-2113(08)60672-0.

Broxton, P. D., X. Zeng, W. Scheftic, and P. A. Troch, 2014: A MODIS-based global 1-km max-imum green vegetation fraction dataset. J. Appl. Meteor. Climatol., 53, 1996–2004,doi:10.1175/JAMC-D-13-0356.1.

Carlson, T. N., and D. A. Ripley, 1997: On the relation between NDVI, fractional vegetation cover,and leaf area index. Remote Sens. Environ., 62, 241–252, doi:10.1016/S0034-4257(97)00104-1.

Chen, J., P. Jönsson, M. Tamura, Z. Gu, B. Matsushita, and L. Eklundh, 2004: A simple method forreconstructing a high-quality NDVI time-series data set based on the Savitzky–Golay filter.Remote Sens. Environ., 91, 332–344, doi:10.1016/j.rse.2004.03.014.

Datt, B., T. R. McVicar, T. G. Van Niel, D. L. Jupp, and J. S. Pearlman, 2003: Preprocessing EO-1Hyperion hyperspectral data to support the application of agricultural indexes. IEEE Trans.Geosci. Remote Sens., 41, 1246–1259, doi:10.1109/TGRS.2003.813206.

Elmore, A. J., J. F. Mustard, S. J. Manning, and D. B. Lobell, 2000: Quantifying vegetationchange in semiarid environments: Precision and accuracy of spectral mixture analysis andthe normalized difference vegetation index. Remote Sens. Environ., 73, 87–102, doi:10.1016/S0034-4257(00)00100-0.

Gallo, K., D. Tarpley, K. Mitchell, I. Csiszar, T. Owen, and B. Reed, 2001: Monthly fractional greenvegetation cover associated with land cover classes of the conterminous USA. Geophys. Res.Lett., 28, 2089–2092, doi:10.1029/2000GL011874.

Gebremichael, M., and A. P. Barros, 2006: Evaluation of MODIS gross primary productivity(GPP) in tropical monsoon regions. Remote Sens. Environ., 100, 150–166, doi:10.1016/j.rse.2005.10.009.

Gutman, G., and A. Ignatov, 1998: The derivation of the green vegetation fraction from NOAA/AVHRR data for use in numerical weather prediction models. Int. J. Remote Sens., 19, 1533–1543, doi:10.1080/014311698215333.

Li, X., Y. Chen, P. Shi, and J. Chen, 2002: Detecting vegetation fractional coverage of typical steppein northern China based on multi-scale remotely sensed data. Acta Bot. Sin., 45, 1146–1156.

Lucht, W., and Coauthors, 2002: Climatic control of the high-latitude vegetation greening trend andPinatubo effect. Science, 296, 1687–1689, doi:10.1126/science.1071828.

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 15

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC

Page 16: Derivation of the Green Vegetation Fraction of the Whole ...

Mathieu, R., C. Freeman, and J. Aryal, 2007: Mapping private gardens in urban areas using object-oriented techniques and very high-resolution satellite imagery. Landscape Urban Plann., 81,179–192, doi:10.1016/j.landurbplan.2006.11.009.

Matsui, T., V. Lakshmi, and E. E. Small, 2005: The effects of satellite-derived vegetation covervariability on simulated land–atmosphere interactions in the NAMS. J. Climate, 18, 21–40,doi:10.1175/JCLI3254.1.

Montandon, L., and E. Small, 2008: The impact of soil reflectance on the quantification of the greenvegetation fraction from NDVI. Remote Sens. Environ., 112, 1835–1845, doi:10.1016/j.rse.2007.09.007.

Oleson, K., W. Emery, and J. Maslanik, 2000: Evaluating land surface parameters in the Biosphere–Atmosphere Transfer Scheme using remotely sensed data sets. J. Geophys. Res., 105, 7275–7293, doi:10.1029/1999JD901041.

Parmesan, C., and G. Yohe, 2003: A globally coherent fingerprint of climate change impacts acrossnatural systems. Nature, 421, 37–42, doi:10.1038/nature01286.

Potter, C., S. Boriah, M. Steinbach, V. Kumar, and S. Klooster, 2008: Terrestrial vegetation dynamicsand global climate controls. Climate Dyn., 31, 67–78, doi:10.1007/s00382-007-0339-5.

Ran, Y., X. Li, and L. Lu, 2010: Evaluation of four remote sensing based land cover products overChina. Int. J. Remote Sens., 31, 391–401, doi:10.1080/01431160902893451.

Reed, B. C., 2006: Trend analysis of time-series phenology of North America derived from satellitedata. GISci. Remote Sens., 43, 24–38, doi:10.2747/1548-1603.43.1.24.

Sridhar, V., R. L. Elliott, and F. Chen, 2003: Scaling effects on modeled surface energy-balancecomponents using the NOAH-OSU land surface model. J. Hydrol., 280, 105–123,doi:10.1016/S0022-1694(03)00220-8.

Tucker, C. J., 1979: Red and photographic infrared linear combinations for monitoring vegetation.Remote Sens. Environ., 8, 127–150, doi:10.1016/0034-4257(79)90013-0.

Wu, D. H., H. Wu, X. Zhao, T. Zhou, B. J. Tang, W. Q. Zhao, and K. Jia, 2014: Evaluation ofspatiotemporal variations of global fractional vegetation cover based on GIMMS NDVI datafrom 1982 to 2011. Remote Sens., 6, 4217–4239, doi:10.3390/rs6054217.

Xiao, J., and A. Moody, 2005: A comparison of methods for estimating fractional green vegetationcover within a desert-to-upland transition zone in central New Mexico, USA. Remote Sens.Environ., 98, 237–250, doi:10.1016/j.rse.2005.07.011.

Zeng, X., R. E. Dickinson, A. Walker, M. Shaikh, R. S. DeFries, and J. Qi, 2000: Derivation andevaluation of global 1-km fractional vegetation cover data for land modeling. J. Appl. Me-teor., 39, 826–839, doi:10.1175/1520-0450(2000)039,0826:DAEOGK.2.0.CO;2.

Zhang, L., X. S. Li, Q. Z. Yuan, and Y. Liu, 2014: Object-based approach to national land covermapping using HJ satellite imagery. J. Appl. Remote Sens., 8, 083686, doi:10.1117/1.JRS.8.083686.

Earth Interactions is published jointly by the American Meteorological Society, the American Geophysical

Union, and the Association of American Geographers. Permission to use figures, tables, and brief excerpts

from this journal in scientific and educational works is hereby granted provided that the source is

acknowledged. Any use of material in this journal that is determined to be ‘‘fair use’’ under Section 107 or that

satisfies the conditions specified in Section 108 of the U.S. Copyright Law (17 USC, as revised by P.IL. 94-

553) does not require the publishers’ permission. For permission for any other from of copying, contact one of

the copublishing societies.

Earth Interactions d Volume 20 (2016) d Paper No. 8 d Page 16

Unauthenticated | Downloaded 03/17/22 09:57 AM UTC