Friberg–LEO-GEO & GEO-GEO Stereo Imaging Diurnal Observations
Centre for Geo-Information Thesis Report GIRS-2014-35 ...
Transcript of Centre for Geo-Information Thesis Report GIRS-2014-35 ...
Centre for Geo-Information
Thesis Report GIRS-2014-35
Cotton yield forecasting in Tashkent province by using Remote
sensing techniques
Djakhangir Atakhanov
24,
Septe
mber,
2014
iii
Cotton yield forecasting in Tashkent province by using Remote
sensing techniques
Djakhangir Atakhanov
Registration number: 900327020060
Supervisors:
Dr. Ir. Jan Clevers
Dr. Alim Pulatov
A thesis submitted in partial fulfillment of the degree of Master of Science at Wageningen
University and Research Centre,
The Netherlands.
September, 2014
Wageningen, the Netherlands
Thesis code number: GRS-80436 Thesis Report: GIRS-2014-35 Wageningen University and Research Centre Laboratory of Geo-Information Science and Remote Sensing
iv
Acknowledgements
This study was benefited from constructive discussions and meetings with supervisor
Jan Clevers and Alim Pulatov. I would like to thank Alim Pulatov, for help with choosing the
Thesis topic, for discussions and expressing his knowledge about area and crop growth in study
area.
I would like to appreciate my fellow students who helped me during my thesis research
as well. Who helped with analysis and expressed me their experiences and gave me the hints to
find a solution and solve problems.
Two coordinators Ewa Wiestma and Alim Pulatov who helped to construct Double
degree program and gave me opportunity to be awarded by Erasmus Mundus CASIA 1 project
Action 2. I am thankful also for Wageningen University and Tashkent Institute of Irrigation and
Melioration for being involved for the scholarship. Moreover, I would like to appreciate all people
from both educational institutions for useful experience and knowledge.
MGI department and group, which were involved and supported me during my study and
research have to be appreciated as well. Their knowledge and experience in GIS and remote
sensing helped me and increased my level of knowledge.
I would like to thank my family and friends from home country for the help with obtaining
the data and useful information and for the great support and patience during three years of my
study.
I thank all state agencies, educational institutions and centers being involved in process
of delivering and providing me valuable, cognitive and important information. In addition, I would
like to appreciate the NASA USDA archive for providing images from diverse satellites.
v
Abstract
Crop yield forecasting is very important in every country around the world. Central Asian
countries are enriched with agricultural lands. Uzbekistan’s main sector is agriculture with a
large variety of different crops. Nowadays, 46% of all irrigated land is utilized for cotton. Cotton
is considered as most important crop in Uzbekistan. Production of cotton plays a dominant role
in the economy of Uzbekistan. Remote sensing assists to create information at a large spatial
scale. Therefore, it was used for prediction of cotton yield in Tashkent province, which has 15
administrative districts and 12 of them are located in the agricultural zone. The prediction was
made for Tashkent province (oblast) level and for the administrative district level. Classification
of cotton fields using MODIS (MOD13Q1) was applied by using a decision rule with thresholds
on the Normalized Difference Vegetation Index (NDVI) for day of the year 113, 161 and 225.
NDVI data was acquired as 16-day maximum value product in order to get information about
biomass condition. Different NDVI-based indicators were studied and analyzed in terms of
choosing the best option for that region. A regression model between NDVI indicators and yield
was applied using an overall temporal trend component and yearly deviations from this trend.
The temperature and precipitation were studied in order to find a relationship among yield, NDVI
and weather conditions. The correlations between these factors were low and it is assumed that
at least weekly observations of cotton are required for establishing a better correlation between
NDVI and weather conditions. Research has been successfully done and yield for 2013 was
predicted at the province level as well as at administrative districts level. Province level and
administrative district level showed difference in most valuable indicators, relationships between
weather conditions and yield and correlation between weather conditions and NDVI. As
indicators maximum value of NDVI and seasonal sum of NDVI (iNDVI=NDVI integral) were used
in this research. Results have shown that MODIS 250 m spatial resolution is not the most
suitable satellite sensor for Tashkent region and its administrative districts. In addition, there are
many varieties of fruits, vegetables and other crops that make it very difficult to classify and
make an error assessment of the classification and delineation of the cotton, as the cotton
growing calendar is similar to the one of other crops in the region. The results obtained at the
province level showed a decrease of yield in 2013 with 0.2 c\ha. The model was validated by
the leave-one-out cross-validation (LOOCV) procedure and the calculated RMSE for the model
at the Tashkent province level expressed a low value (RMSE=0.034), which means that error
was low. The best indicator was identified for the province level and for each administrative
district level separately, subsequently the forecasting was done for all research levels. Finally,
the correlation between the ground truth data (historical data) and NDVI forecasting was found
to be R2=0.55, which means that there was moderate correlation. Results obtained from this
research indicate that alternative satellites and indicators have to be assessed and analyzed.
The harvesting of the cotton and its time planning can be improved, as well as agricultural
management, in order to get better outcomes.
vi
Table of contents
Acknowledgements ...................................................................................................................................... iv Abstract ......................................................................................................................................................... v
Table of contents .......................................................................................................................................... vi
Abbreviations ............................................................................................................................................... vii
1. Introduction ........................................................................................................................................ 1
1.1. Background (Literature review) ..................................................................................................... 1
1.2. Yield prediction .............................................................................................................................. 2
1.3. MODIS data................................................................................................................................... 3
1.4. Problem Definition ......................................................................................................................... 4
1.5. Research Questions ...................................................................................................................... 4
1.6. Research objectives ...................................................................................................................... 4
2. Methods and Materials ...................................................................................................................... 5
2.1. Study Area ......................................................................................................................................... 5
2.2. Cotton ................................................................................................................................................. 6
2.3. Materials description .......................................................................................................................... 7
2.3.1. Methodology .............................................................................................................................. 7
2.3.2. MOD13Q1 Production ............................................................................................................... 9
2.3.3. NDVI Metrics description ........................................................................................................... 9
2.4. NDVI Analysis ............................................................................................................................. 10
2.5. Classification ............................................................................................................................... 11
2.6. Summary of methodology (Flow chart) ....................................................................................... 14
3. Results ............................................................................................................................................ 15
3.1. Tashkent Province (Oblast level) ................................................................................................ 15
3.1.1. Classification ........................................................................................................................... 15
3.1.2. Validation of the cotton raster layers ....................................................................................... 16
3.2. Yield analysis .............................................................................................................................. 18
3.3. NDVI Metrics ............................................................................................................................... 19
3.4. Results for administrative districts (District level) ........................................................................ 23
3.5. Validation of forecasting model ................................................................................................... 27
4. Discussion ....................................................................................................................................... 28
5. Conclusions and Recommendations .............................................................................................. 30
6. Reference list .................................................................................................................................. 32
vii
Abbreviations
AVHRR Advanced Very High Resolution Radiometer, LR-sensor on-board of the NOAA-
satellites
B Blue
CGMS Crop Growth Monitoring System, the combination of an agrometeorological crop
growth simulation model WOFOST, a database, and a yield prediction routine.
CropSyst Cropping System simulation model
DEM Digital Elevation Model
DOY Day of the Year
EOS Earth Observation System
EVI Enhanced Vegetation Index
fAPAR Fraction of Absorbed Photosynthetically Active Radiation
fPAR Fraction of Photosynthetically Active Radiation
GDP Gross Domestic Product
GIS Geographical Information System, software for storage of geographical data,
mostly in vector format
iNDVI NDVI integrated time series
LAI Leaf Area Index
LOOCV Leave-one-out cross-validation
MODIS Moderate Resolution Imaging Spectroradiometer
NASA National Aeronautics and Space Administration
NDVI Normalized Difference Vegetation Index. RS-indicator for amount of standing
vegetation
NIR Near infrared range of the spectrum, roughly from 780 nm to 1300nm
NOAA Series of near-polar satellites monitored by the US National Oceanographic and
Atmospheric Administration
NPP Net primary productivity
R Red
viii
Rell.eff. Relative efficiency
RMSE Root mean square error
PAR Photosynthetically Active Radiation
RS Remote Sensing: earth observation with imaging sensors on-board of
space/airborne platforms
SPOT Système Pour l’Observation de la Terre
USDA United States Department of Agriculture
UzHydromet Centre of Hydrometeorological Service at Cabinet of Ministers of the Rebublic of
Uzbekistan
WOFOST World Food Studies crop growth model, simulation model
VI Vegetation Index
1
1. Introduction
1.1. Background (Literature review)
Assessment and forecasting of crop yields are important for each country. Central Asian
countries are very rich of agricultural lands covered by various crops. Uzbekistan has the largest
sector of agriculture among five Central Asian countries. The economy of Uzbekistan mostly
depends on agriculture (Abdullaev et al., 2007). Agricultural accountings of Uzbekistan
economy cover about 30% of Gross Domestic Product (GDP), 40% of employment and 60% of
foreign exchange earnings (Abdullaev et al., 2007). There is 45 million hectares of land, where
60% of the land is used for agricultural purposes and 12% of this area are irrigated (FAO, 2003).
Cotton is the key crop in agricultural production of Uzbekistan. Other major cotton-producing
countries are USA, China, India, Pakistan, Uzbekistan, Turkey and Australia (Reddy et al.,
2000).
Nowadays, cotton is considered as the main crop on Uzbekistan agricultural lands and
its production plays a dominant role in Uzbekistan. Among all irrigated lands in this area, 46% is
covered by cotton (Zhou et al. 2007). Uzbekistan was considered as an important cotton-
growing region even in Russian Imperial times. The cotton lands were enhancing during the
Soviet Union, particularly after 1950. Main external influence to cotton yield is generally caused
by rainfall, temperature, incoming light, and nutrition. According to Muminov (1973), Central
Asia is an area with inadequate moisture and dry type of climate, therefore soil moisture is an
important factor for cotton yield evaluation in Central Asia. Water mainly is taken from two main
rivers, which are Amu Darya and Syr Darya. Amu Darya and Syr Darya are two main tributaries
of the Aral Sea (Abdullaev et al., 2007).
The estimation and forecasting of cotton crop yield is of importance for better targeting of
water resources, proper land-use planning, time management, proper use of labor forces,
enhancement of production and establishment of classification maps at a regional scale
(Ruecker et al., 2007). Remote sensing is one of the technologies which gives an unbiased
vision of large areas, provides spatial information and is widely used in assessment and
forecasting of crop yields at a regional scale (Doraiswamy et al., 2004). Satellite images allows
the accumulation of valuable information for the determination of relationships with ground truth
data by using spectral characteristics of the fields with expected harvest (Terekhov et al., 2007).
The normalized difference vegetation index (NDVI) is an indicator which uses visible and near-
infrared bands of the electromagnetic spectrum and can be useful for forecasting of crop yields.
A vegetation index such as the NDVI is adopted to analyze the remotely sensed measurements
and to assess whether the observed target contains live green vegetation or not.
Large size fields make it possible to use satellite information of medium resolution to
study the spectral characteristics of crops (Terekhov et al., 2007). Forecasting of the crop yield
can be based on empirical data using vegetation indexes, obtained from remote observation of
the fields that take place every year during the earing – flowering time (late July-early August).
According to Terekhov and Kauazov (2007), this period allows observation of the fields due to
the amount of green biomass and close relationship between the amount of plant and
productivity. Moreover, it can be a basis for forecasting model of crop yields. Aim of this
research is to forecast the cotton yield in Tashkent province by using remote sensing
techniques. The current methods of crop yield prediction in Uzbekistan are outdated. To renew
2
methods of prediction in Uzbekistan and in order to improve the time management and amount
of needed labor forces during the cotton harvest, accurate remote sensing prediction is needed.
Precise forecasting can be helpful not only for decreasing redundancy of time planning and
labor forces during cotton picking season, but also for good, accurate and statistically
straightforward prediction.
1.2. Yield prediction
Optical remote sensing techniques work sufficiently with agricultural systems, since
remote sensing provides information about actual status of plants at different stages of growth
through the reflectance and spectral signatures of a crop. This technique helps to identify crop
species of interest and also diseases, weed infestations, density, and other values of
agricultural variables (Soria-Ruiz et al., 2004). These variables can be used as an input for crop
growth models as yield indicators (Clevers et al., 1993). The status of a crop can be obtained by
using different indicators or vegetation indices like NDVI (Tucker, 1979).
There are different purposes for yield prediction in agricultural management. Among the
main goals are to categorize and forecast the amount of crop yield and to guarantee resources
for the population, obtained by agricultural and environmental services. Crop yield prediction for
large regions and for the period before the harvest time is rather an important and big problem
for many countries (Soria-Ruiz et al., 2004). Moreover, there are a lot of different models with
various issues associated with them (Soria-Ruiz et al., 2004), such as models which build and
rely on computer simulations and include weather-related variables. Other models are generally
statistical approaches which require several regression equations. In addition, there is no single
model, which can be suitable for all regions and environments. All crop growth prediction and
estimation models are limited to different conditions due to the reason that they were used and
simulated in specific research contexts and sometimes for areas as small as one field.
Moreover, they have to be used in combination with other inputs provided by various resources
such as geostatistical and geo-information systems (Soria-Ruiz et al., 2004).
Previous empirical studies were done to establish the statistical relationship between
crop yield and climate stresses. These studies had as purpose crop yield estimation and
prediction (Liang et al., 2012). The studies showed that the relationships are very complex and
depend on farming practices, climate and soil characteristics during the growing period.
According to Liang et al. (2012), weather conditions and soil characteristics have a strong
influence on cotton yield.
Different vegetation indexes were used, checked and developed in order to apply in one
of the existing models. Spectral vegetation indexes (VI) were used in order to measure the
green biomass at any given time. Satellite data such as NOAA-AVHRR (National Oceanic and
Atmospheric Administration – Advanced Very High Resolution Radiometer) and MODIS
(Moderate Resolution Imaging Spectroradiometer) have been used for different models run at
daily, monthly and yearly time step (Rucker et al. 2007a). Linear relationships were applied
between the fractions of photosynthetically active radiation (fPAR) and biomass of cotton. To
estimate the fPAR, NDVI was derived from AVHRR and the Monteith model was applied
(Bastiaansen, 2003). Moreover, different satellites and sensors have been extensively used to
monitor the condition of the crop and forecast crop yield and production in many different
countries. Studies were done to predict the yield by using remote sensing techniques.
3
Several studies by using a crop growth model instead of using the regression models
have been applied worldwide. One of the studies was done in order to predict yields using
meteorological information. The crop growth monitoring system was created for Europe and this
study led to the objective of another research, which was to monitor agricultural conditions over
the whole of the European Union and neighboring countries, and because of the importance of
wheat in Uzbekistan, the model was adapted there in order to make a quantitative within season
yield forecast at regional and national scale for specific crops by using the Crop Growth Monitor
System (CGMS) (Pulatov, 2008). Many studies have been performed using the AVHRR satellite
in order to monitor and forecast malting barley yield in Germany (Pulatov, 2008).
Many research works have been made in the last years to investigate the contribution of
remote sensing data for crop monitoring and yield prediction. In terms of forecasting the cotton
yield, MODIS (MOD13Q1) NDVI products have been used for Tashkent Province in this
research. The spatial resolution of MODIS is 250m per pixel. The research is aimed on
identifying the best indicator in order to predict cotton yield at Province (oblast) level and at
administrative district level. Forecasting of the yield can lead to better crop management, labor
forces and time management and to know whether this satellite is suitable and sufficient enough
for the scale of the field in the province.
1.3. MODIS data
According to the National Aeronautics and Space Administration (NASA), there is
MODIS (or Moderate Resolution Imaging Spectroradiometer) instrument, which is a key
instrument on aboard of TERRA and AQUA satellites (Lindsey & Herring, 2011). TERRA
satellite passes from north to south across the equator in the morning, while AQUA passes from
south to north and crosses the equator in the afternoon (Lindsey & Herring, 2011). TERRA and
AQUA MODIS acquire 36 spectral bands (Lindsey & Herring, 2011). MODIS is playing an
essential role in gathering global data and improve our understanding of global dynamics of land
surface and processes in the oceans and lower atmosphere. MODIS is a key instrument for
developing Earth System models able to monitor and predict the changes on the surface
(Lindsey & Herring, 2011).
In order to select a good model for cotton crop yield forecasting, a variety of studies on
this issue which were done in diverse countries were investigated and explored. Generally, use
of MODIS for crop yield estimation is often applied. The majority of studies have been
conducted in order to relate NDVI derived from MODIS with crop yield by monitoring the
vegetation conditions, drought, estimation and forecasting. The pros of MODIS are that it has
good spatial resolution (250m) and for example better radiometric calibration than AVHRR
(Mkhabela et al., 2011). In addition, studies on the relation of MODIS with crop yield have been
conducted in Uzbekistan but in another province, conditions and different environment
(Ruecker, 2007c). Another study, performed in China was done in order to test the suitability of
the methodology to estimate crop yield with MODIS NDVI on a regional level (Ren et al., 2008).
The study on the crop yield forecasting on Canadian prairies was taken as an example. The
objective of abovementioned study was to evaluate the possibility of using MODIS-NDVI to
forecast crop yield on the Canadian prairies and also to identify the best time for making a
reliable crop yield forecasting (Mkhabelaa, 2011). Additionally, another research on the near
real time prediction of corn yield in the U.S. was done using MODIS derived Wide Dynamic
Range Vegetation Index (WRDVI) (Sakamoto et al., 2014).
4
Based on the studies described, in terms of crop yield prediction by using MODIS
satellite, this research explained below is going to be done.
1.4. Problem Definition
The cotton harvest is time consuming and needs many labor forces. Due to outdated
methods of empirical forecasting, the amount of labor forces and needed time is unknown. The
redundancy of labor forces during the cotton harvest and weak time planning used for that,
takes a lot of financial and human resources. In order to improve the time management and
amount of labor forces, which are needed for the production of cotton, remote sensing based
yield forecasting will be done in this research as an example of Tashkent province in
Uzbekistan.
Nowadays, two most used approaches to forecast crop yield are the empirical
regression model and the biophysical crop model (Kogan et al., 2013). Currently, insufficient
input data are available in Uzbekistan for applying a dedicated cotton growth model. So, an
empirical regression model will be used in this research. It requires some selected predictors
such as a vegetation index derived from satellite images, meteorological data and historical data
of cotton yield in last 10 years (Kogan et al., 2013). Satellite data can provide continuous,
timely, human-independently information for large territories (Kussul et al., 2009).
The prediction of cotton yield may help farmers to improve the harvesting time, reduce
the risks, which they may meet with the production, and it also may help the government to
determine harvest plans strategically.
1.5. Research Questions
1. How can remote sensing prediction improve logistics of harvest in Tashkent province?
2. How accurate is the remote sensing prediction in comparison to ground truth data?
3. Which of the variables in NDVI metrics is valuable to identify the growing season?
1.6. Research objectives
1. Build statistical model for prediction of cotton yield
2. Validate the forecasting model
3. Determine the correlation between statistical data and remote sensing data
4. Identify the best variable for NDVI metrics
5. Study harvesting logistics in Tashkent province
5
2. Methods and Materials
2.1. Study Area
The study area of current research is Tashkent province (oblast) of Uzbekistan (Figure
1). Figure 2 illustrates the MODIS tile and the Tashkent province as an image.
There are 14 administrative districts: Bekobod, Bostanliq, Buka, Chinaz, Qibray,
Ohangaron, Oqqurgan, Parkent, Pskent, Quyichirchiq, Urtachirchiq, Yangiyul, Yuqorichirchiq
and Zangiota districts. Tashkent is the capital of the country and capital of the province, which
covers an area of 15300 sq. km. The major cities of the province are Angren, Olmaliq,
Ohangaron, Bekobod, Chirchiq, Tashkent, Yangiobod, and Yangiyul. The study was done for
the agricultural zone of the province. Agricultural zone contains not all the districts. As a result,
the analysis was done for Bekobod, Buka, Chinaz, Qibray, Oqqurgan, Pskent, Quyichirchiq,
Urtachirchiq, Yangiyul, Yuqorichirchiq and Zangiota districts (shown below in Figure 3). The
analysis and forecasting was done for all districts individually and at the province level as a
whole. The Tashkent province is bordered at the northeastern of Uzbekistan by ranges of
mountains named “Tian Shyan”. The main crops are cotton, grapes and grain cultivation, as well
as silkworm breeding, fruits and vegetables, cereals and citrus fruits are increasing. Substantial
parts of the province in the south and southwest are foothill flatlands. Tashkent features a
Mediterranean climate with strong continental climate influences. The weather in Tashkent is
characterized by cold and often snowy winters, but with long dry and hot summers. In total, the
winter consists of about 32 fully covered snowy days. Humidity level of air is around 56% on an
annual basis.
Figure 1 Digital Elevation Model representing Tashkent province location
Figure 2 Illustration of extent of Tashkent province on MODIS tile such as MOD13Q1
(250m)
6
Agricultural zone was taken as study area and the area covered by cotton was taken by
applying a pixel-based NDVI threshold classification on the agricultural zone. Threshold is
explained later in the classification section of the report. In Figure 3, the white color areas are
different administrative districts, which are located in the agricultural zone as explained before,
area with orange lines is an area not taken and not calculated for the prediction.
2.2. Cotton
Local varieties of cotton were planted in the study area. The sowing dates starting from
the beginning of April until the middle of May (Muminov, 1973). First harvesting was in the
middle of September.
According to Centre of Hydrometeorological Service at Cabinet of Ministers of the
Republic of Uzbekistan (UzHydromet), cotton is distinguished for 5 phases:
1. Planting
2. First square
3. Flowering
4. Accumulation of cotton bulb
5. Harvest
Figure 3 Agricultural zones and other area of Tashkent province.
7
Yields of cotton crop depend on the number of mature cotton bulb (balls) on the plant.
For the time of planting temperature plays a big role. The most suitable temperature to sow is
10 degrees °C (Muminov, 1973). Freezing also plays an essential role in the development of the
crop. Sometimes freezing can take place during the starting date of cotton growth,
subsequently requiring re-planting of big areas. According to Muminov (1973) first leaves
appear 10 days after sowing, if the temperature is larger than 10 degrees and next leaves start
to appear in 4-5 days range after first leaves. The rate of growth increases and becomes more
intense and leaves appear with intervals of 2-3 days. The agricultural management (fertilizer,
cultivation, crop rotation, soil organic matter, etc.) plays essential role in cotton growth and
productivity as well.
2.3. Materials description
In this study, following datasets were used:
- MOD13Q1 product of MODIS at 250m resolution for 2000-2013 years
- Yearly meteorological observations from 15 stations in Tashkent province for 2002-2013
- Historical data on cotton yields for Tashkent province for 2002-2013
- Historical data on cotton yields for all administrative districts in Tashkent province for
2002-2013
2.3.1. Methodology
A regression model was used as an approach to predict single dependent variable
(cotton yield) by a set of independent variables such as meteorological data and remote sensing
data.
The ground truth data (historical) were obtained from the State Statistics Committee of
Uzbekistan. This data, on cotton yield for the last 10 years, was used to predict the crop yield
and make a validation of the empirical prediction. Research was done at oblast (Province) level
and administrative district level. Oblast level is a sub-national administrative unit.
Satellite data such as MODIS were used, in order to test the suitability of spatial
resolution for Tashkent province and administrative districts of Tashkent province. Vegetation
index NDVI was derived from Terra MODIS satellite sensors product. MODIS product
MOD13Q1 was downloaded for whole period of planting-flowering-harvest. The months for
cotton calendar are approximated and determined as March-September. The MODIS product
MOD13Q1 will be described and introduced later in this report.
Meteorological data was obtained from the archive Uzhydromet. It includes information
from 15 meteorological stations of Tashkent province. All stations provide monthly
meteorological information such as precipitation, rainfall and average minimum temperature and
average maximum temperature.
All models were validated, calibrated and developed using official data on cotton crop
yield at province/oblast level for the period 2002-2013. I applied repeated procedure, which
means models were first calibrated and developed for the period 2002-2012 and forecasts
made for 2013 and 2014.
Linear regression model for cotton yield forecasting with use of satellite data derived
NDVI were applied. Statistical cotton yield data have been derived from the State Statistics
8
Committee of Uzbekistan for Tashkent province (2002-2013) and administrative district (2003-
2013).
The regression model shown below was used for yield representation:
Yi = Ti +dYi (1)
Where Yi is predicted yield, Ti is agricultural changes and in this study agricultural changes
were taken as a trend component of the yield during 11 years. Time series show yearly
changes, which can be compared with yearly NDVI and, moreover, it shows the deviation of the
yield. DYi is representing the difference of actual yield from the trend line (Kogan et al., 2013).
Deterministic component of trend Ti can be acquired from the equation below:
Ti = a+b (year) (2)
Where a is intercept of the trend and b is slope of the trend line of certain year.
Yearly variations have been estimated using the next regression model, which connects
the cotton yield dYi and NDVI.
dYi = Yi - Ti =f(NDVI)=b0+b1*NDVIij (3)
b0 and b1 was applied to include year effects (Street et al., 1988). NDVIij means NDVI indicator
such as maximum or integrated time series (Sum of NDVIij=iNDVI=NDVI integral) for some year
(i) and j represents the day of the year (DOY).
The deviations of NDVI during the growing season (14 times) for each year were
obtained in order to investigate the strength of the relationship with NDVI and to determine
whether the critical point has a strong correlation, when the crop productivity is highly sensitive
to weather conditions.
Leave-one-out cross-validation procedure was implemented for the prediction model. It
means that the forecasting model were developed for the years 2002-2012 at province level and
the years 2003-2012 at administrative district level (test data) in order to calibrate data for all
years except one. Therefore, for each province, each administrative district and each DOY for
which NDVI values are available, the n predicted values for cotton yield for testing data in this
case n=11 for province level and n=10 for administrative district level. Subsequently, after
calibration, model was used for the one left year (for instance 2013) to make a prediction for, in
this case, 2013. Validation was done by using the statistical data for 2013 obtained from State
Statistics Committee. In order to see the correlation between predicted value and statistical yield
this procedure was repeated for each year, meaning each run one year was left out of the
calibration and only used for validation. Afterwards, statistical data and predicted values were
compared.
Root mean square error (RMSE) was calculated, in order to see an error from the
forecasting model. To estimate the RMSE the official historical data and predicted data values
were used. RMSE was calculated by equation below (Kogan et al., 2013):
𝑅𝑀𝑆𝐸 = √∑((𝑃𝑖)−(𝑂𝑖))2
𝑛 , (4)
Where Pi and Oi are estimated yield data (Ti) and historical data of cotton crop, respectively. Ti
also can be determined as slow agricultural changes indicator.
9
2.3.2. MOD13Q1 Production
Global MODIS vegetation indices are designed to provide spatial and temporal data of
vegetation conditions. B, R, NIR reflectance were used to identify vegetation indices. MODIS
NDVI products provide us with time series for historical applications (USGS, 2014). MOD13Q1
has not only NDVI products but Enhanced Vegetation Index (EVI) products as well. EVI
maintains sensitivity at a dense canopy and minimizes variations of canopy background.
MOD13Q1 is computed from bi-directional surface reflectance and has been masked for clouds,
water, shadows etc. (USGS, 2014). In addition, the product data provides 250 meter pixel size
every 16 days. Due to its ease of application, simplicity and familiarity, VIs have wide range of
users (USGS, 2014), (Solano et al., 2010).
For more precise explanation of MOD13Q1 images downloaded by Julian days, Table 1
was created. It illustrates Julian days, which belongs to acquisition dates and the range of the
day acquired by MODIS satellite. Table 1 Illustration of downloaded MOD13Q1 images with NDVI for growing season available for 2002-2013
years
Day range Acquisition Date Julian Day
61-91 03.06 065
61-91 03.22 081
92-121 04.07 097
92-121 04.23 113
122-152 05.09 129
122-152 05.25 145
153-182 06.10 161
153-182 06.26 177
183-213 07.12 193
183-213 07.28 209
214-244 08.13 225
214-244 08.29 241
245-274 09.14 257
245-274 09.30 273
2.3.3. NDVI Metrics description
NDVI metrics are used to characterize the growing season under study. There are two
approaches to identify the growing period. Phenological metrics can be derived from satellite
data as a threshold-based approach and time-series of NDVI data using a curve derivative
method (USGS, 2011). The threshold-based approach use some values, which can be relative
or pre-defined values at which vegetation activity is assumed to begin. The timing and progress
of the plants may help researchers to make a conclusion about condition of the plants and their
environment (Reed et al., 1994). This study will derive the measures (metrics) from 10 years of
NDVI time series data within the period March-September. These NDVI time series data will be
derived from satellite data product MOD13Q1 with resolution of 250 m. Measurements with
NDVI metrics will provide information, which gives the ability to track the seasonal
characteristics of cotton more closely. NDVI indices will be obtained 14 times for 16-day periods
10
from March to September. Moreover, 14 data values are available for each of the 10 years,
which in total are 140 data values. The past efforts concluded assigning a threshold NDVI value
for the activity events of the plant (Reed et al., 1994). The NDVI values were investigated to
identify the photosynthetic activity and deceleration of photosynthesis. The metrics that may be
used as an indicator are shown in Figure 4:
Start of growing season (OnP)
End of growing season (EndP)
Duration of growing season (DurP)
Maximum NDVI during growing season (MaxV)
Time integrated NDVI during growing season (TINDVI)
Different indicators were described and studied in terms of precipitation and vegetation
dynamics. Two studies were done in two study areas in different climatic zones, but generally
with high amount of precipitation during the growing season of crops (Pulatov, 2008). Finally,
there was undertaken the investigation of the main driving indicators for prediction. Integrated
NDVI, which is equal to the sum of NDVI (iNDVI=sum of NDVI), is assumed as good indicator
for seasonal variation of different land types. In the current study, two indicators were taken in
order to see the relationship of NDVI to the yield and to predict the cotton yield in Tashkent
province and administrative district level. One of the best indicators is NDVI integral, which is
sum of the whole growing period (Kastens et al., 2005). This indicator can help to obtain the
acreage of the area by summation of the most greenness period, and it was found out that the
majority of yields is highly correlated with integrated NDVI summarized around the time of
maximum greenness (Tucker et al., 1980). The maximum NDVI and sum NDVI, which is the
integrated NDVI values over the year, derived from MODIS (MOD13Q1) provide with
information on yield prediction (Pulatov, 2008).
2.4. NDVI Analysis
NDVI values were taken for the Agro district of the Tashkent oblast for Julian days
shown in table 1. All 12 year images were obtained from archive of U.S. Department of Interior
Figure 4 Phenological metrics from temporal NDVI profile (Reed et al., 1994)
11
and Geological survey. NDVI analysis was done by choosing the proper and suitable indicators
(max, integrated time series shown in Figure 4). In addition, in order to obtain the cotton pixels
the thresholds were applied (explained in next section 2.5). The growing season was analyzed
between beginning of March and the end of September. Respectively, all NDVI indicators were
obtained for all years by repeating the whole growing season marked by Julian days. NDVI
indicators were analyzed by applying the threshold for certain Julian days in order to repeat and
suit the classification methodology of Platonov (2008) and Zhang (2011). Spectral signatures of
cotton for MODIS were studied and determined by analysis of Figure 6 (see next section).
Subsequently, certain dates were chosen and cotton was identified. During identification of
cotton pixels, the values of two different indicators were obtained, which helped to analyze,
understand and make calculation shown in results chapter.
2.5. Classification
Classification gives an opportunity to model an earth’s surface. Studies on this research
has a range of topics such as mapping of weed (Glenn, 2005); (Gokhale, 2006), modeling
wildfire (Ercanoglu, 2006), forecasting urban growth, and other applications. According to the
results of different scientists (Richards, 2005) classification accuracy depends on pixel’s location
similarities with training sites. Error assessment is a critical point in analysis and interpretation
process (Stehman, 1998).
Cotton classification was made by applying NDVI threshold values and true-false
decision rules in ArcGIS software. For example, NDVI<0.2 was assigned as bare soil. According
to Platonov et al. (2008), there are several crops in this area, whose spectral signature is very
similar to cotton at certain days of the year. However they can be separated at other days. As
an example, cotton, rice, wheat crop and other crops are shown in Figure 5. This graph was
Figure 5 Spectral signatures of different crops growing in Syrdarya province of Uzbekistan for Landsat 7 ETM+ (Platonov et al., 2008)
12
shown in order to know the similarities of the spectral signatures for different crops in Syrdarya
area. Spectral signature used by Platonov was determined for Landsat ETM+ satellite.
In order to apply NDVI, threshold values and decision rules to MODIS suitable values
were explored and acquired. Figure 5 indicates that rice, cotton and wheat have very big
similarity during the growing season and at a certain DOY. For example, rice has similarities
with cotton spectral signatures on 110-114, 131, 160-162 Julian days and at the end of the
growing season has almost the same signature on 274 DOY, while wheat has close pattern with
cotton on 210, 226 and 274 DOY. So, to delineate cotton from other crops suitable values in
comparison with Landsat 7 ETM+ were applied (Figure 6). In order to differentiate cotton from
other crops the thresholds were examined from research done by Zhang (2011). That research
expresses the cognitive information about the values, which were applied as a threshold for
MODIS NDVI product, to obtain the cotton field pixels from the images. The NDVI thresholds
such as NDVI< 0.2 for 065-161 DOY have been applied. That value indicates bare soil. Second
threshold, which determines winter wheat-summer maize reflects the value such as NDVI> 0.35
and NDVI< 0.4 has been applied, moreover threshold for third critical point (225-273 DOY) of
the flowering and harvesting period of growing season is the period when maximum cotton
biomass reflectance have been caught. This period were indicated as NDVI > 0.63 and NDVI<
0.73. Last critical point helps us to get the spatially distributed cotton fields in that area (Zhang
et al., 2011). The most difficult part of acquisition was the end of growing season (225-273
DOY) when several crop signatures showed crossing or had similarities as shown in the
Platonov (2008) study. Then, some uncertainties had been discovered, such as absence of
required mean value in certain layer for certain DOY.
A pixel based threshold was applied to the images. Only three critical points had been
taken due to three critical periods as advised by Platonov (2008) (Figure 5) and Zhang (2011)
(Figure 6). Figure 5 was taken as an example of the differences of spectral signatures for the
different crops during the growing season in that area for Landsat ETM+. Meanwhile, Zhang
(2011) advised to use higher values for MODIS NDVI, which were applied for this research in
order to delineate cotton with the MODIS (MOD13Q1) NDVI product. The images cover the
growing season and have a frequency of one image close to the critical points of cotton spectral
Figure 6 Illustration of cotton, winter wheat-summer maize, and spring maize value MODIS NDVI temporal patters for China (Zhang et al., 2011)
13
signature such as April (113th day), June (161st day) and August (225th day). The limitations
have been met on this research for images of some days in certain year due to applied
thresholds. The issue of thresholds was not essential and were skipped at the moment of the
research. The problems were in insufficient values of threshold for a time when assumingly
cotton biomass, for instance, had maximum values at DOY 225, and in reality it has been less
than the minimum threshold which was applied for the certain days of certain year. More often it
has been met during the middle of the growing season. The classification gave NDVI values
with lower average pixel values spread out around the whole region. The threshold was done for
three images with NDVI by certain day of the year (NDVI DOY). The first moment was the
beginning of the growing season as 113th day of the year, when the temperature degrees are
equal or higher than the effective temperature (10 degrees Celsius). The minimum effective
temperature for cotton growth has to be equal to 10 degrees Celsius, which means cotton is not
growing in case of the less temperature. In order to record the period of wheat harvesting and
cotton sowing (bare soil), 113th day was applied. The second day has been taken as 161st day
of the year, which represents the middle of the growing season for the agricultural region.
Cotton biomass has not yet grown enough for the high VI values representation, but assumingly
it is possible to see the summer maize growth in order to divide these two crops from each
other. The last spot of the obtained NDVI data have been 225th day, which lies on August 13th
and can be counted as period with maximum NDVI value for cotton in Tashkent province level.
Data from day 65-273 for growing season was gathered for 10 years. 65-273 DOY
illustrates us the cotton growing calendar, which is expressed in Figure 6.
To check the accuracy of classification, geo-referenced points in Urtachirchik district
were taken as an example. All roads and cities were delineated from study area. The same
classification was done for each administrative district.
Validation for all regions was done by observations performed in Urtachirchik district.
GPS points taken by me in the summer of 2012 during my internship and additional GPS points
(coordinates), which were obtained by a first year master student of Tashkent institute of
Irrigation and Melioration, obtained during his internship in 2014 were used. According to
Platonov (2008), crops distributed over the area were recognized and validated with thresholds
applied to maps. Due to visible and recognizable fields of cotton in Google maps, it was
possible to check some area. The Urtachirchik administrative district was checked and validated
by control points obtained by GPS Leica Geosystems. Control points were done near cotton and
maize fields (Figure 9).
As an additional validation, the planted area from statistics was compared with the area
obtained from the classified cotton pixels using the NDVI thresholds. This comparison was done
by using data on planted area under cotton in ha. These data were obtained from Tashkent
statistical agency. Validation was done for all pixels for the whole agricultural area of Tashkent
province.
14
2.6. Summary of methodology (Flow chart)
Generally, the methodology of this research is followed by the flow chart indicated by the
conceptual model in Figure 7. There are two steps: first step is for oblast (province) level and
second step is for administrative district level. Each model has input and output. For instance,
software such as R and Microsoft Excel will be used to acquire NDVI metrics as a time series
with input from MOD13Q1 images of Tashkent province. Moreover, historical data obtained from
Uzbekistan will be analyzed in statistical graphs. Regression analysis will be done to compare
the output results collected from remote sensing and actual yield data. Results will be
represented by the yield forecasting after yield analysis. To see the amendments of NDVI
values, meteorological data were used and analyzed with the NDVI indicators.
Figure 7 Schematic overview of methodology at oblast level and district level
15
3. Results
3.1. Tashkent Province (Oblast level)
3.1.1. Classification
Thresholds used in the combination of DOY 113, 161 and 225 to identify cotton fields in
the various years are given in Table 2. Since the precise cotton distribution map is not available,
there are several different methods to check and validate the classification done in this research
(see section 3.1.2). As an example, raster layers acquired after applying the thresholds for the
signatures is shown for 2013 in Figure 8. Hereby, Figure 8 indicates three different days in three
different colors and blue considered as bare soil (113 DOY) while green color was applied for
the time of wheat harvesting. The time of wheat harvesting, which have been used for threshold
(161 DOY) is the same period as cotton sowing time. So, it means that green color illustrates
the cotton sowing period and wheat harvesting time. Finally, the red color was applied for the
period of 225 DOY, when the cotton biomass was reflecting the maximum NDVI values.
Table 2. Thresholds applied to identify cotton fields for specific DOYs
DOY NDVI
113 < 0.2
161 > 0.35 AND < 0.4
225 > 0.63 AND < 0.73
Figure 8 Cotton map distribution of cotton pixels in the study area for 2013 obtained after classification
16
3.1.2. Validation of the cotton raster layers
The major crops distributed over the agricultural zone in Tashkent province and NDVI of
crops consistent with the growing calendar of crops shown in Figure 5. The maximum NDVI for
wheat and other crops such as maize or orchard pixels is situated mainly on 161 DOY within the
given thresholds. In general, cotton pixels did not recognized on rainfed areas in Figure 8.
Control points obtained during my internship were used for validation of NDVI pixels in the study
area and showed that cotton fields at August 13th had highest NDVI value of 0.675 during the 12
years of study (2002-2013). As a comparison, according to Zhang (2011) 0.73 is the maximum
NDVI value, which was picked for cotton on their research. Red pixels illustrated in Figure 9
means that there are cotton fields on that area. The differences of the red color show
differences in amount of biomass of cotton due to weather and agricultural management
conditions (light tones represent less biomass). The points illustrated in Figure 9 expresses the
territory of study during the internship and points were obtained mainly near the cotton areas.
So, control points illustrate the fields covered nearby cotton fields and, for example, it is shown
that the left top point is located near the cotton field, which had not good green biomass at the
time image was taken. In general, all control points illustrate correct (meaning cotton) pixel
layers on the fields. In the middle of the shape of control points, there were potato, water melon
and other non-cotton, small fields, which correctly are not covered by any red pixels. On the
right hand of the shape, control points were taken in maize fields, which are not covered by
pixels as well. Six pixels illustrated on the bottom left are cotton as was discovered by the first
year student. On the top of the image it is unknown whether the cotton fields are correct.
The pattern was validated in order to compare the special critical days, meanwhile the
comparison showed good and almost similar pattern of cotton. As we can see on DOY 65,
which is 6th of March, the pattern is less than 0.2 and the minimum value on DOY 113, which is
Figure 9 Ground control points obtained by Leica GNSS and GPS systems in 2012 nearby cotton, maize,
orchards and other crops
17
0.000.200.400.600.80
ND
VI
Julian days
Comparison with pattern of Zhong (2011) to validate the classification
23rd April, is around 0.2 but increasing up to the maximum value (Figure 10), while maximum
NDVI value appeared at the same time as in the pattern mentioned by Zhang (2011), on 13th of
August (DOY 225). Those values show the similarities of the cotton pattern with Figure 6 and it
may be concluded that the classification was done appropriately.
The validation mentioned in the end of section 2.5 refers to the area covered by the cotton
pixels classified using the thresholds of table 2 for all the years as compared with the official
area statistics for cotton. Figure 11 shows that points are close to the 1:1 line for most years,
indicating that the threshold approach at least provides a good estimate of the total cotton area.
Deviations can be explained because some cotton fields can have a very low biomass, resulting
in a low NDVI value. The NDVI of cotton pixels was obtained after thresholds were applied.
Meanwhile, the obtained NDVI of pixels is the reflectance of greenness and non-greenness of
vegetation, which depends on solar radiation, photosynthetic activity (Ruecker, 2007), amount
of precipitation, water balance (Muminov et al., 1973), soil organic matter, soil bioactivity and
20
40
60
80
100
120
140
20 40 60 80 100 120 140
Pla
nte
d a
rea
(th
ou
san
d h
a)
NDVI pixel area (thousand ha)
Yearly planted area under cotton vs summarized NDVI pixel area
Figure 10 NDVI pattern for 2011 during four critical points indicates significance with the Zhang (2011) pattern of cotton.
Figure 11. 1:1 Relationship illustration comparing the planted area with NDVI area in thoiusand ha
18
other reasons, which can affect the presence of amount of chlorophyll of vegetation (Ruecker,
2007) or absence of biomass.
3.2. Yield analysis
The statistical data for the whole agricultural zone of Tashkent province expresses the
information, which is valuable for the objectives. Time series of the gross yield has a positive
trend line (Figure 12). Trend line was acquired by equation Y=0.4633x-905.69, where two
variables of trend is slope and intercept. Therefore, the slope is represented by value of 0.4633
and the interception is -905.69. In addition, Figure 12 interprets that in 2002, 2003 and 2008
yield was less than in other points of the time series. Moreover, maximum value was obtained
from 2006, continued by the decrease of the trend until 2008. Then it showed that in two last
years crop growth was decreasing. Afterwards, statistical yield data was compared with
temperature (Figure 13) to presume the influence of weather conditions to crop yield and see
the reason of low yield in some years. After the comparison of monthly average of temperature
with the yield, it interprets that there is no relationship (R2=0.01) with cotton yield (See below).
y = -0.0308x + 14.808R² = 0.0103
5.0
7.0
9.0
11.0
13.0
15.0
17.0
20.0 22.0 24.0 26.0 28.0 30.0
Tem
pe
ratu
re
Cotton Yield c\ha
Relationship among Temperature and cotton growth
Figure 12 Yield (c\ha) against time series for 12 years, Tashkent Province
Figure 13 Relationship between cotton gross yield (c\ha) and temperature average over 12 years
19
There is an assumption that, due to unavailable daily or weekly observations of cotton,
the relationship between variables is absent.
Precipitation in the study area mostly is in autumn-winter-beginning of the spring, while
cotton growth calendar starts in the end of March – beginning of April. Cotton is planted in
irrigated zones in Tashkent province and data such as monthly (growing season) or yearly
average of precipitation is not much playing a role in discovering relationship. The optimal
amount of precipitation for the cotton biomass is 5000-8000 m3\ha (Anonymous, 2014). But
generally, low amount of rainfall has to be fallen during the cotton growth, because at the same
time the amount of precipitation is valuable as well. If water content is not enough, cotton
biomass leave out the lowest layer of the boxes, in order to give the strength for the growth and
productivity to the rest (Anonymous, 2014).
Freezing has bad influence on cotton growth too (Muminov, 1973). Additionally, soil in
Central Asian region is inadequate (Muminov, 1973) and soil has to be moist during the seeding
period. This could be a reason of the low gross yield during 2007-2008 years. Assumptions are
that freezing was longer than usually, and farmers started cotton seeding later than usually.
No relationship trend line can be caused by applying the monthly average data obtained
from Uzhydromet rather than daily observations in the fields.
3.3. NDVI Metrics
The only two indicators studied in this research were the NDVImax and the NDVI
integral. The reason of using these two certain indicators are that different studies investigated
two indicators as the best indicators for such kind of studies and they are most essential for
forecasting.
Figure 14 NDVI max changes over 12 years for cotton
20
Figure 14 illustrates the time series of maximum NDVI values for cotton biomass over 12
years (2002-2013). Thus, changes illustrated in Figure 14 says that NDVI maximum indicator
varied not much and expresses very small range of changes during 12 year time series for
Tashkent province level. These small changes indicating negative decrease of an NDVI value
for cotton biomass, which leads to assumption that the temperature is increasing, which in its
turn can be observed in the statistical data and the biomass slightly became drier, and uniformly
lost chlorophyll content.
Figure 15 expresses NDVI integral (iNDVI) changes over 12 years. iNDVI shows wide
range in changes over 12 years and that graph has cognitive information for research. Figure 15
also illustrates the presence of agricultural changes as cotton growth increased and decreased
over 12 years in comparison with Figure 14. Figure 15 expresses that cotton NDVI integral is
decreasing over 10 years but for the last years the pattern shows development and increase,
which can be the influence of agricultural improvements. NDVI integral values is decreasing
from 2003 (4.890) then the maximum value was caught in 2008 (5.043) and shows more
temporal variability than the weather variability during 12 years.
The changes over 12 years of NDVI values shows that the cotton NDVI values are
decreasing, which means that biomass is not growing in some areas anymore or another
assumption is that farmers started to pay more attention to production of cotton boxes than to
growth of biomass. The causes of this kind of assumptions are that NDVI is the reflection of
greenness and non-greenness of vegetation, absence and presence of chlorophyll amount in
biomass, which is dependent on different variables (solar radiation, dry matter content, soil
organic matter, herbicides, fertilizers, water content), but not cotton bulbs (section 3.2.). The
increase of production in 2013 makes to think that predicted yield can be higher (Figure 15).
4.650
4.700
4.750
4.800
4.850
4.900
4.950
5.000
5.050
5.100
2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013
iND
VI
Year
NDVI integrated time series
Figure 15 Changes of NDVI integrated time series over 12 years
21
Figure 16 and 17 illustrate the relationship between the deviation of cotton yield from the
trend line and NDVI indicators. Both indicators at province (oblast) level showed weak
correlations (R2=0.0067 for maximum NDVI and R2=0.0838 for NDVI integral). There is an
assumption that there is an outlier in both graphs, whereas the indicator relationships with the
outlier included are shown. An outlier is interpreted as the maximum deviated point. iNDVI show
us an outlier with a big deviation on point, where deviation is equal to 4.31 and NDVI indicator is
equal to 4.9, while all other deviation values varying near trend line. The negative trend line can
be understood as the biomass reflectance was high and rich with chlorophyll content, while yield
(cotton boxes) production was low. It means that cotton plant was wet and did not produce the
cotton or production was later than usually, as biomass produces cotton boxes when the plant
becomes dry (Anonymous, 2014). In order to produce cotton, farmers increase dry matter
content with herbicides. So, assumptions are that herbicides were added to the biomass late
August or beginning of September. Generally, there is state standard for cotton growth
management, but each farmer can adjust these schemes according to his own circumstances
and conditions, i.e. skip one irrigation or change the date and time of agricultural input activity
due to weather condition or farmer circumstances (Anonymous, 2014).
For all levels the linear trend was chosen. The assumption consequently obtained after
studying Figure 16 and Figure 17 can state that NDVI indicators and MODIS satellites spatial
resolution of 250m is not the most suitable and good enough for the oblast level.
The best indicator for province level is stated as NDVI maximum, it became known after
RMSE value was calculated for all tested indicators. Obtained RMSE for NDVI maximum is
small and announcing a low value such as 0.14, which means that there is a weak error, and
accurate prediction can be done by using this indicator.
y = -70.003x + 46.719R² = 0.0067
-3.00
-2.00
-1.00
0.00
1.00
2.00
3.00
4.00
5.00
0.664 0.665 0.666 0.667 0.668 0.669 0.670 0.671 0.672
DYi
NDVI max
Illustration of yield deviation and NDVI max
Figure 16 Regression model for Tashkent province. Estimated parameters for Eq.(3) were b0=46.719 and b1=-70.003
22
The forecasting of the yield for 2013 is giving a positive result and it interprets that yield
will decrease in comparison with the previous year but cotton production increase. Figure 18
represents the predicted yield (red line) after using the LOOCV procedure. In order to see the
correlation between original yield and predicted yield the scatter plot Figure 19 is created. After
the LOOCV procedure was applied the prediction was fitted and done for 2013. Correlation
shows good relationship on Figure 19, which means that prediction is accurate.
0.0
5.0
10.0
15.0
20.0
25.0
30.0
20
02
20
03
20
04
20
05
20
06
20
07
20
08
20
09
20
10
20
11
20
12
20
13Pre
dic
ted
an
d o
bse
rved
yie
ld c
\ha
Year
Representation of predicted yield and observed data time series
Observed data
Predicted yield
y = -6.6608x + 32.624R² = 0.0838
-3.00
-2.00
-1.00
0.00
1.00
2.00
3.00
4.00
5.00
4.750 4.800 4.850 4.900 4.950 5.000 5.050 5.100
Dyi
iNDVI
Illustration of yield deviation and NDVI integrated
Figure 18 Illustration of predicted yield and original yield obtained from State Statistics Committee
Figure 17 Regression model for Tashkent province. Estimated parameters for Eq.(3) were b0= 32.624 and b1=-6.6608
23
3.4. Results for administrative districts (District level)
For each individual district various results and relationships between predicted yield and
observed (original) yield were found, but the strength of the relationships depended upon the
amount and quality of the imagery used. Figure 20 illustrates that cotton yield in Bekobod district
increases year by year. It also shows unexpected fluctuations in 2004-2005 and 2007-2008
years. Moreover, it can be stated that NDVI integrated performed well in the relationship with
yield deviation over 11 years for almost all districts. All districts show weak correlation, while the
strongest is for Bekobod district. It shows positive correlation and RMSE is 0.28 (Figure 24).
The linear trend indicates the performed correlation. The NDVI integrated indicator shows the
best results for Bekobod, Buka, Oqqurgan and Urtachirchik districts, while NDVI maximum
shows better results in other districts (table3). Several districts also had obstacles in this
y = 0.6409x - 1264.2R² = 0.3375
0.0
5.0
10.0
15.0
20.0
25.0
30.0
2002 2004 2006 2008 2010 2012 2014
Yiel
d c
\ha
Year
Hystorical data time series for Bekobod district
19.0
21.0
23.0
25.0
27.0
29.0
19.0 21.0 23.0 25.0 27.0 29.0
Ob
serv
ed d
ata
Predicted yield
Corelation between Statistical data and predicted yield
Figure 19 Relationship between original yield data obtained from State Statistical Committee
Figure 20 Historical data for Bekobod district. Estimated a0=0.6409 and a1=-1264.2 for that district as slope and intersect for Eq. (2)
24
research such as not complete set of historical data.
As an example, Bekobod district is taken and shown. The best indicator for that
district is iNDVI. The relationship shown in Figure 21 illustrates the deviation of the yield and
iNDVI, where it was explored very low with negative linear trend. Figure 21 expresses the
cognitive information such as illustration of the period when the production of the yield is high,
NDVI expresses low values, or when the yield is low but biomass reflectance is high. The
assumptions are that the agricultural management of cotton requires different inputs such as
herbicides, before harvesting to drive the biomass for drought. If biomass reflects well, the
assumption can appear such as there was rainfall in that area and cotton had increased water
content, which subsequently leads to less or late production while NDVI reflectance is high.
R² = 0.1807
-6.00
-4.00
-2.00
0.00
2.00
4.00
6.00
4.6500 4.7000 4.7500 4.8000 4.8500 4.9000 4.9500
DYi
NDVI integrated
Illustration of yield deviation and NDVI integrated
4.5000
4.6000
4.7000
4.8000
4.9000
5.0000
18.019.020.0
21.022.023.0
24.0
iND
VI
Tem
per
atu
re
Year
Illustration of temperature relationship with NDVI integrated in Bekobod district
Temperature mean iNDVI
Figure 21 Illustration of yield deviation and NDVI integrated
Figure 22 Temperature influence on NDVI integrated in Bekobod district
25
Comparison of the influence between weather conditions and NDVI (reflectance of
biomass) shows similarities from 2007. Meanwhile, temperature shows low relationship with
NDVI integrated. The reason could be that data used in this research, is an average of the
temperature for the growing season of the cotton (Figure 22), which means there are no critical
dates with high or low temperature inside. Positive correlation (Figure 23) with value of R2=0.34
indicates that there is correlation between temperature and cotton growth. According to
personal communication and interview (2014) with an anonymous interviewer, cotton growth
depends on an optimal temperature, and positive trend proves this statement. Statement says
that there is optimal temperature for good and normal tense of cotton growth. In case of
decrease and increase of the temperature, cotton plant decreases or stop growing.
R² = 0.3416
4.6500
4.7000
4.7500
4.8000
4.8500
4.9000
4.9500
20.0 20.5 21.0 21.5 22.0 22.5 23.0 23.5
iND
VI
Temperature mean
Illustration of temperature relationship with NDVI integrated in Bekobod district
15.0
20.0
25.0
30.0
15.0 20.0 25.0 30.0
Ob
serv
ed y
ield
Predicted yield
Correlation between predicted yield and observed historical yield
Figure 23 Correlation of the temperature and NDVI integrated in Bekobod district
Figure 24 Correlation between predicted yield (Yi) and observed yield
26
This district shows good relationship and RMSE is 0.28 between predicted yield and
observed yield (Figure 24). Figure 20 illustrates the historical data time series, which was
correlated with predicted date in Figure 24.
Table 3 illustrates the results of RMSE values estimation for all administrative districts of
Tashkent province. The most suitable for all regions, as was mentioned before, is NDVI
integrated. The correlation (R2) between predicted yield and observed yield is low for almost all
districts except the Tashkent province, which shows better results. Moreover, Bekobod and
Buka indicate the best results among all administrative districts.
Table 3 Results of Root Mean Square Error (RMSE) for all administrative districts in Tashkent province
MODIS (MOD13Q1) indicators
Name NDVI max NDVI
integrated R2
Tashkent province 0.14 0.16 0.45
Bekobod 0.31 0.28 0.47
Buka 0.02 0.005 0.51
Chinaz 0.08 0.12 No
correlation
Kybray NO NO No trend
Kuyichirchik 0.59 0.61 No
correlation
Oqqurgan 0.0078 0.0072 0.28
Pskent 0.054 0.12 0.28
Urtachirchik 0.007 0.002 0.25
Yangiyul NO NO No trend
Yukorichirchik 0.07 0.11 0.37
Zangiota NO NO No trend
The Best
27
Table 4 Illustrates overall results for all districts and all indicators. It expresses the predicted
yield in comparison with observed yield for 2013. The best results are shown for the NDVI
indicators and as correlation between predicted yield and observed yield are Buka, Oqqurgan,
Pskent, Urtachirchik and Yukorichirchik districts.
3.5. Validation of forecasting model
The validation at the province level and administrative district level was done for an
individual year. Root mean square error for Tashkent province and district level was calculated
for the calibration and validation with the results. The results indicate that predicted yield and
observed yield are relatively efficient. The correlation between all of the regions is shown in
Table 3 as R2. By the table and the results of RMSE done according to Eq.(4), it can be
concluded that cotton prediction can be done by using and validating this model. The accuracy
has been calculated as RMSE for the administrative districts as well.
Table 4 Illustration of all indicators for all distrits in 2013 predicted yield.
28
4. Discussion A cotton plant is dependent on many external variables like other crops in the study
area. In addition, cotton plant growth is very sensitive to various inputs and outputs. The results,
which were obtained by models in previous sections, proves it. The personal communication
helped to understand different aspects of this influence. There is a schematic plan, which is
called “technological map of cotton growth” (Ibragimov et al., 2007). According to interviews
(2014), cotton productivity depends on agricultural management.
Different farmers try to keep and follow the rules of the technological map, which
indicates proper and accurate cotton growth methods. In order to explore the influence of
weather conditions on cotton yield and the best NDVI indicator available from MOD13Q1
(MODIS NDVI product) for cotton yield prediction in the study area, regression models and
NDVI prediction models were applied. Due to different crops growing in Tashkent province, the
research needs to delineate the cotton fields from other crops. Classification methods were
applied and some obstacles during this step have been met. According to Platonov, for cotton
classification some thresholds can be applied. The reason of choosing that methodology was
that the research of Platonov was done in a neighboring province, but for the scale of a farm.
The plot made by Platonov gave information about different crops in the region growing at the
same time. Method of Platonov says that in order to classify cotton some decision rules (section
2.4.) can be applied. That method expresses that NDVI pixel value for DOY 113 should be <0.2,
which can be considered as bare soil or water. Then, NDVI pixel value was limited to <0.2 in
order to see rice spectral differences with cotton, as rice is growing at the same time and with
almost the same spectral signatures by DOY. Differences between cotton and rice can be seen
in section 2.4, where spectral signature of rice rises much higher during the period of the
maximum reflectance value of cotton (DOY 225). Previously mentioned DOY 225 has been
chosen in order to classify cotton from other crops as this day has to be the period when the
reflected MODIS NDVI value reflects maximum value, as NDVI at DOY 225 is <0.3 and NDVI at
DOY 225 is >0.5. After applying these thresholds and limits have been met, the question was
whether MODIS NDVI product needs the same NDVI values for the classification and the same
reflections for cotton as Landsat ETM+. Meanwhile, Zhang et al. (2011) stated that MODIS
pixels need higher thresholds for decision rule classification, as NDVI(@DOY113)>0.2,
0.35>NDVI(@DOY161)>0.4 and the latest critical point was NDVI(@DOY225)<0.63 and
NDVI(@DOY225)>0.73. Two methodologies were compared and the results in this research
showed similarities with Zhang’s methodology (Figure 6), where the thresholds for DOY were
taken from.
The reason of deviating outcomes was that cotton at that time had weak biomass, which
just had been grown up and not reflected a strong greenness value. In order to know it, the
thresholds were lower than at the end of cotton calendar and higher than at the beginning of the
growing calendar. So, results illustrated that due to different temporal and spatial resolutions of
both satellites thresholds to distinguish cotton biomass from rice, wheat and maize, has to be
different. MODIS NDVI values are higher than for the Landsat ETM+ satellite. In order to make
an assumption the MODIS spatial resolution and temporal range of images against the Landsat
temporal and spatial resolution were studied and taken into account.
The delineation of the cotton biomass from the rest drove us to more difficulties, as
wheat, maize, rice and cotton have the same growing calendar and the most similar spectral
signature has been studied between rice and cotton. The differences between them were
29
discovered only in one critical point at the end of the growing season for cotton plant and
harvesting time. This period is 225 DOY and acquisition date is equal to 13th of August. Rice
reflects much higher greenness than the cotton because rice is growing on the water, and water
increases the water content of the biomass. Meanwhile cotton is stopping to grow as farmers
apply different agricultural management, such as herbicide inputs in order to increase dry matter
content of the cotton (Anonymous, 2014). The reason of using the herbicides in the end of
cotton is that cotton flowers do not become boxes and seeds are not appearing as well as
cotton wool, when biomass is green and full of water content (Anonymous, 2014).
The reason of choosing the 225th DOY as a critical point is, that farmers chase the
cotton plants on July when 10-12th leaves have appeared in order to stop the growing strength
and drive the plant to give power to the flowers and bulbs. Subsequently, farmers irrigate the
fields in July as a last time (Ibragimov et al., 2007), in order to give water content and then start
to dry it (Anonymous, 2014). After chasing and increasing the water content, herbicides have to
be applied to increase dry matter content as was mentioned before. So, the period between
irrigation was done and herbicides are applied is DOY 225. In case of chasing time was delayed
the herbicides input will delay as well, which means that cotton will give good reflectance but
productivity will decrease (linear trend line illustrated in Figure 16 and Figure 17). Chasing time
and herbicides input time is dependent on money, labor forces, appropriate time management
and time of the last freezing in the beginning of the cotton calendar (Anonymous, 2014).
According to personal communication with the cotton classifier (2014), the results of NDVI
indicators for Tashkent province and for administrative district level leads to make an
assumption that current agricultural management is not leading to better outcomes or MODIS
NDVI indicators are not the most suitable for these regions. The reason of MODIS’
unsuitableness for this area is that the spatial resolution (250m) of the MOD13Q1 NDVI product
is too large to apply it on Tashkent province and district level. Because the fields in Tashkent
province are less than 60 ha (Djanibekov et al., 2010).
Different literature indicates that better classification can be made by using the cotton
distribution map or ground truth data in order to know the province and district. For this research
several points had been used in order to validate cotton fields and it illustrates (Figure 10) that
mostly cotton pixels lay on cotton fields and include other fields as well because of the
resolution of the satellite. In theory, the ideal approach for classification for the purpose of crop
yield forecasting the cotton distribution map is needed.
The optimal temperature for good cotton plant growth during the season is 25-30
degrees. Data used in order to find a correlation between weather conditions and cotton yield
were monthly and yearly averages for 12 years. Using monthly average data is not efficient for
such discovery. For finding better relationships, data has to be daily, because during the month,
there are critical temperatures and there might be critical days with precipitations, which are not
influence to an average number, and data represents small values. The weak negative
relationship between cotton yield and weather conditions helps to find explanations about
limitations and bottlenecks of cotton growth. Assumptions carry information not only in non-
sufficient and no quality data, but in addition it leads to an understanding of the impact of
weather conditions on the growth of cotton. In case the amount of precipitation increases the
adequate and optimal limit of good cotton growth or precipitation decreases below limit, the
cotton biomass stops growing and throw out the lower leaves, in order to give the strength to the
30
leaves on the top of the plant. It means the production will be low but plant will reflect well to the
sensors.
This research was done on a limited area, with restricted time range, with use of specific
methodology and model. Thus, this research might not indicate precise information for the
whole Republic.
5. Conclusions and Recommendations The prediction of the annual cotton yield can lead to knowledge of acreage under cotton
and of gross yield, which gives the opportunity to use proper logistics such as time planning the
amount of labor and others. Better schema of agricultural management will help to increase and
improve the production of the cotton, since cotton is very essential for Uzbekistan and is playing
one of the important roles of economy. The methods used in this research can help the
government of The Republic of Uzbekistan to create their own new-modern forecasting
methodology and improve time management on spending and planning the cotton harvest
(picking). The information and methodology helps to obtain cognitive information, which
subsequently might express the amount of labor forces needed in order to harvest and pick the
cotton without using excessive time and labor. Moreover, the results of this study could be
helpful in time management for use of another sensor in order to predict cotton yield by
accurately delineation of the cotton on that area. Therefore, this research expresses the
selection of other alternative indicators or satellites in order to forecast crop yield in Tashkent
province.
This forecasting used in this research for the cotton crop can effectively and accurately
lead to information about the amount of the yield before the harvesting time and gives to
farmers and others, the hints and the opportunity to make better, accurate and more efficient
cotton management and market decision with good time planning, and govern the harvesting.
The results show negative results for the prediction, which means that cotton product will
decrease in 2013, which is confirmed by historical data. This research leads to the assumption
that remote sensing prediction can transfer better and larger information than the ground truth
prediction because remote sensing gives information not only about crop yield, but it provides
more information such as vision of larger areas and subsequently helps to create the spatial
maps of the area. However, the most crucial things for precise prediction is to select and use
the good, suitable and sufficient indicators for the Tashkent region and moreover the better
sensors with lower resolution.
The best indicators for the province and its administrative districts are NDVI integrated
and NDVI maximum, which had been identified during the research. Other indicators should be
checked as different regions and different scales might require different variables. The districts,
which did not show good results, have to be studied better and more precisely in order to find
the most suitable sensor, model and indicator.
So, methods used in this research can conclude that farmers do not have enough
knowledge about production rather than to increase biomass, which in this case is cotton wools.
In addition, for better cotton prediction, there is need for a cotton distribution map or
ground truth data, which will help for more accurate cotton delineation from other crops.
However, classification was studied and learnt by literature review, which recommends cotton
31
NDVI values for MODIS sensors to apply decision rules. Classification was applied successfully
and it was concluded that other satellites have to be checked in that region as well because
cotton is growing on small scale fields. Therefore for better production, the transfer of
knowledge of the scientist has to be done and farmers have to study all schemes and
agricultural management to improve production. Farmers have to solve all obstacles and
problems to earn or take a loan from a bank to cover financial expenses such as for buying
fertilizers and herbicides, fuels and other costs. However, cotton growing procedure has to be
done according to state technological map (scheme for proper plant growth). This will lead to
better agricultural management, time planning and logistics.
More assumptions are that daily observations have to be done for temperature and
precipitation against cotton growth. Secondly, biweekly cotton observation has to be done in
order to see the influences of the daily changes of the temperature or precipitation instead of
comparing monthly measurements of temperature and precipitations against cotton gross yield,
which can show us the influence of weather conditions on daily growing and the time of
harvesting.
This research can help to understand cotton growth. Moreover, research can derive
valuable information such as to know the amount of data and the valuable data for prediction in
that area. Therefore, this study explains which indicators are the best ones for prediction in
Tashkent province level and at administrative level.
Before making any conclusions regarding cotton yield prediction in Tashkent province
and its administrative districts more research should be done, like collection of ground truth
data, time series analysis and cotton distribution mapping.
32
6. Reference list Abdullaev, I., Giordano, M., Rasulov, A. (2007). Cotton in Uzbekistan: Water and welfare.
Cotton sector in Central Asia (pp. 112-128). London: SOAS.
Anonymous. (2014, September 15). Personal communication with certified classifier of cotton.
(D. Atakhanov, Interviewer)
Asrar, G. Fuchs, M., Kanemasu, E.T., Hatfield, J.L. (1984). Estimating absorbed photosynthetic
radiation and leaf area index from spectral reflectance in wheat. Agronomy Journal 76,
300-306.
Bastiaanssen, W. G., & Ali, S. (2003). A new crop yield forecasting model based on satellite
measurements applied across the Indus Basin, Pakistan. Agricultural, Ecosystem and
Environment, Volume 94, Issue 3, 321-340.
Clevers, J.G.P.W., Bouwman, B., Buker, C., van Leuween, H.J.C. (1993). A conceptual
framework for estimating crop growth using optical remote sensing data. International
archive of phtogrammetry and remote sensing, 29, 18.
Djanibekov, N., Rudenko, I., Lamers, J.P.A., Bobodjonov, I. (2010). Pros and Cons of Cotton
Production in Uzbekistan: Case study #7-9. Ithaca, New York: Cornell University .
Doraiswamy, P.C., Hatfield, J.L., Jackson , T.J., Akhmedov, B., Prueger, J., Stern, A. (2004).
Crop condition and yield simulations using Landsat and MODIS. Remote Sensing of
Environment 92(4), 548-559.
Ercanoglu, M., Weber, K.T., Langille, J., Nevers, R. (2006). Modeling Wildland Fire
Susceptibility Using Fuzzy Systems. GIScience & Remote Sensing 43, №3, 268-282.
FAO. (2003). Fertilizer use of crop in Uzbekistan. Rome: Foor and Agricultural Organisation of
the United Nations.
Gallego, F.J., Carfagna, E., Peedell, S. (1999). The use of CORINE Land Cover to improve area
frame survey estimates. Research in Official Statistics 2 (2),, 99-122.
Glenn, N.F., Munt, T.J., Weber, K.T., Prather, T.S., Lass, L.W., Pettingill, J. (2005).
Hyperspectral data processing for repeat detection of small infestations of leafy spurge.
Remote Sensing of Environment 95, 399-412.
Gokhale, B., Weber,K.T. (2006). Modeling Cheatgrass using Quickbird Imagery. Final Report:
Detection, Prediction, Impact, and Management of Invasive Plants using GIS.
Goward, S.N., Markhan, B., Dye, D.G., Dulaney, W., Yang, J. (1991). Normalized difference
vegetation index measurements from the Advanced Very High Resolution Radiometer.
Remote sensing of Environment 35, 257-277.
Ibragimov, N., Evett, S.R., Esanbekov, Y., Kamilov, S.B., Mirzaev, L., Lamers, P.A.J. (2007).
Water use efficiency of irrigated cotton in Uzbekistan under drip and furrow irrigation.
Agricultural Water Management 90, 112-120.
33
Investopedia US, a. d. (2014). Investopedia US. Retrieved from (I) Investopedia web site:
http://www.investopedia.com/terms/p/polynomial_trending.asp
Kastens, J.H., Kastens, T.L., Kastens, D.L.A., Martinko, E.A., Lee, R.Y. (2005). Image masking
for crop yield forecasting using AVHRR NDVI time series imagery. Remote Sensing of
Environment 99 , 341-356.
Kogan, F., Kussul, N., Adamenko, T., Skakun, S., Kavchenko, O., Kryvobok, O., Shelestov, A.
Kolotii, A., Kussul, O., Lavrenyuk, A. (2013). Winter wheat yield forecasting in Ukraine
based on Earth observation, meteorological data and biophysical models. Journal of
Applied Earth Observation and Geoinformation, 23, 192-203.
Kussul, N., Shelestov, A., Skakun, S. (2009). Grid and sensor web technologies for
environmental monitoring. Earth science informatics 2(1), 37-51.
Liang, X.Zh., Xu, M. Gao, W., Reddy, K.R., Kunkel, K., Schmoldt, D.L., Samel, A.N. (2012).
Physical Modeling of U.S. Cotton Yields and Climate Stresses during 1979 to 2005.
Agronomy Journal, Volume 104, Issue 3, 675-683.
Lindsey, R., Herring, D. (2011). NASA Earth Observing System. Retrieved from NASA:
earthobservatory.nasa.gov
Mkhabela, M.S., Bullock, P., Raj, S., Wang, S., Yang, Y. (2011). Crop yield forecasting on the
Canadian Prairies using MODIS NDVI data. Agricultural and Forest Meteorology 151,
385-393.
Muminov, F.A., Karnaukhov, V.V. (1973). Methods of observing the cotton harvest. SARNIGMI.
Paruelo, J.M., Lauenroth, K.W. (1998). Interannula variability of NDVI and its relationship to
climate for North America Shrublands and grasslands. Biogeograhy 25, 721-733.
Platonov, A., Thenkabail, P.S., Biradar, Ch.M., Cai, X., Muralikrishna, G., Venkateswarlu, D.,
Cohen, Y., Alchanatis, V., Goldshlager, N., Ben-Dor, E., Vithanage, J., Manthrithilake,
H., Kendjabaev, Sh., Isaev, S. (2008). Water Productivity Mapping (WPM) Using
Landsat ETM+ Data for the Irrigated Croplands of the Syrdarya River Basin in Central
Asia. Sensors 8, 8156-8180.
Pulatov, B. (2008). Prediction of wheat yield in Uzbekistan by using the CGMS model and
SPOT-Vegetation data. Wageningen, The Netherlands: Wageningen University and
Research center.
Reddy, K.R., Hodjes, H.F., Kimball, B.A. (2000). Crop Ecosystem Responses to Climatic
Change: Cotton. In J. B. Nösberger, Climate change and global crop productivity (pp.
161-187). Wallingford: CAB International Publishing.
Reed, B.C., Brown, J.F., VanderZee, D., Loveland, Th.R., Merchant, J.W., Ochlend, D.O.
(1994). Measuring ohenological variablity from satellite imagery. Journal of Vegetation
Science 5(5), 703-714.
34
Ren, J., Chen, Zh., Zhou, Q., Tang, H. (2008). Regional yield estimation for winter wheat with
MODIS-NDVI data in Shandong, China. International Journal of Applied Earth
Observation and Geoinformation 10, 403-413.
Ressl, R. Dech, S.W., Ptichnikov, A., Novikova, N., Micklin, P., (1998). Desertification
monitoring and land use optimization in Aral Sea area with GIS. Geoinformationsysteme,
6:25-32.
Richards, J.A. (2005). Remote Sensing Digital Image Analysis. New York: Springer-Verlag.
Ruecker, G.R., Shi, Z., Muller, M., Conrad, C., Ibragimov, N. (2007a). Regional Scale
Estimation of Cotton Yield in Uzbekistan by Integrationg Remote Sensing and Field Data
into an Agrometeorological Model. Tashkent: Uzbekistan Cotton Research Institute.
Ruecker G.R., Wouter, A.D., Lamers,J., Ibragimov, N., Kienzler, K. (2007b). Regional
Estimation of Leaf Chlorophyll in Cotton in Uzbekistan by Upscaling a Vegetation Index
From Plant Scale to PROBA-1/CHRIS Hyperspectral Staellite Data.
Ruecker, G.R., Shi, Z., Muller, M., Conrad, C., Ibragimov, N., Lamers, J.P.A., Martius, C.,
Strunz, G., Dech, S.W. (2007c). Cotton Yield Estimation in Uzbekistan Integrating
MODIS, Landsat ETM+ and Field Data. 123-129. Khorezm province, Uzbekistan.
Sakamoto, T., Gittelson, A.A., Arkebauerc, T.J. (2014). Near real-time prediction of U.S. corn
yields based on time-series MODIS data. Remote Sensing of Environment 147,, 219-
231.
Solano, R., Didan, K. Jacobson, A., Huete, A. (2010). MODIS Vegetation Index User’s Guide.
Arizona: The University of Arizona.
Soria-Ruiz, J. Fernandez-Ordonez, Y., Granados-Ramirez, R. (2004). Methodology for
prediction of corn yield using remote sensing satellite data in Central Mexico.
Investigaciones Geograficas, Boletin del Instituto de Geografia, UNAM, 55, 61-78.
Stehman, S.V.. Czaplewski, R.L. (1998). Design and Analysis for Thematic Map Accuracy
Assessment: Fundamental Principles. Remote sensing and environment 64, 331-344.
Street, J.O., Carroll, R.J., Ruppert, D. (1988). A note on computing robust regression estimates
via iteratively reweighted least squares. American Statistician 42, 152-154.
Terekhov, A.G., Kauazov, A.M. (2007). Sub-satellite MODIS-oriented analysis informative
vegetation indices for the description of the state of spring wheat in Northern
Kazakhstan. Modern problems of remote sensing of the Earth from space 2(4), 352-357.
Tucker, C.J., Holben, B.N., Elgin, J.H.Jr, McMurtrey, J.E. (1980). Relationship of spectral data
to grain yield variation. Photogrammetric Engineering and Remote Sensing 45, 657-666.
Tucker, C.J. (1979). Red and photographic infrared linear combinations for monitoring
vegetation. Remote Sensing of Environment, 8(2), 127-150.
35
USGS. (2011, January 06). USGS Science for a changing the world. Retrieved from USGS
Science for a changing the world: http://phenology.cr.usgs.gov/methods_deriving.php
USGS, (2014, April 14). Retrieved June 17, 2014, from LAND PROCESSES DISTRIBUTED
ACTIVE ARCHIVE CENTER: https://lpdaac.usgs.gov
Zhang, Sh., Lei, Y., Wang, L., Li, H., zhao, H. (2011). Crop Classification Using MODIS NDVI
Data Denoised by Wavelet: A Case Study in Hebei Plain, China. China Geografical
science, 21(3), 322-333.
Zhou, Sh., Ruecker, G.R., Shi, Z., Muller, M., Conrad, C., Ibragimov, N., Lamers, J.P.A.,
Martius, C., Strunz, G., Dech, S.W., Vlek, P.L.G. (2007). Modeling of Cotton Yields in the
Amu Darya River Floodplains of Uzbekistan Integrating Multitemporal Remote Sensing
and Minimum Field Data. Agronomy Journal 99(5), 1317-1326.