Estimation of Oil Thickness

download Estimation of Oil Thickness

of 41

Transcript of Estimation of Oil Thickness

  • 7/26/2019 Estimation of Oil Thickness

    1/41

    AEAT-5279 Issue 1

    Estimation of oil thickness

    A report produced for the Maritime and Coastguard

    Agency

    Louise Davies

    Jenny Corps

    Tim LunelKaren Dooley

    November 1999

  • 7/26/2019 Estimation of Oil Thickness

    2/41

    AEAT-5279 Issue 1

    Estimation of oil thickness

    A report produced for Maritime and Coastguard

    Agency

    Louise Davies

    Jenny Corps

    Tim LunelKaren Dooley

    November 1999

  • 7/26/2019 Estimation of Oil Thickness

    3/41

    - ssue Estimation of Oil Thickness

    AEA Technology ii

    Title Estimation of oil thickness

    Customer Maritime and Coastguard Agency

    Customer reference MSA 10/9/111

    Confidentiality,

    copyright and

    reproduction

    Copyright AEA Technology plc 1999

    All rights reserved.

    Enquiries about copyright and reproduction should be addressed to the

    Commercial Manager, AEA Technology plc.

    File reference EERA 20738001

    Report number AEAT-5279

    Report status Issue 1

    AEA Technology Environment

    National Environmental Technology Centre

    Culham

    Abingdon

    Oxfordshire

    OX14 3ED

    Telephone +44 (0)1235 463117

    Facsimile +44 (0)1235 463030

    AEA Technology is the trading name of AEA Technology plc

    AEA Technology is certificated to BS EN ISO9001:(1994)

    Name Signature Date

    Author Louise Davies

    Reviewed by Tim Lunel

    Approved by Jenny Corps

  • 7/26/2019 Estimation of Oil Thickness

    4/41

    - ssue Estimation of Oil Thickness

    AEA Technology iii

    Executive Summary

    This project was undertaken on behalf of the Maritime and Coastguard Agency to devise a method

    of measuring the total volume of oil on the sea surface using airborne remote sensing techniques.

    This would provide considerable information for those planning a counter pollution response as itwould allow near real time estimates of the change in oil volume over time as a result of natural

    processes and response actions. Such a response tool would revolutionise oil spill response by

    providing a real time measure of the success of the response technique employed.

    Thermal infra-red (IR) is able to detect a different thermal response from the heterogeneous (non-

    uniform) thickness of an oil slick spreading on the sea surface. However, the relationship between oil

    thickness and thermal response is complex. To overcome the complexities of the inter-relationships,

    the National Environmental Technology Centre (NETCEN) developed a method of analysing IR

    data from experimental oil slicks in the North Sea using neural network processing (Wood et al,

    1997). This work demonstrated that it was possible to use a neural network approach to predict the

    oil thickness of the test data set on the basis of the IR response. This initial development project was

    followed with a validation exercise carried out by NETCEN (Davies et al, 1998). The 1998 project

    concluded that the neural network was unable to classify oil thickness reliably for the validation data

    set.

    This report details the re-calibration of the neural network to establish whether it has the ability to

    predict oil thickness from a wider range of input variables taking account of factors such as solar

    heating of the surface oil slick. The neural network was retrained with two data sets: a primary data

    set containing only field trial data from 1994, 1995 and 1997 with oil thickness measured by pad

    sampling, and a secondary data set containing the primary data set and a Sea Empress data set with

    estimated slick thickness. The secondary data set included thermal IR imagery and parameters from

    a wider range of environmental conditions that represent both winter and summer.

    The re-calibrated neural network was subjected to a verification process through testing with a

    selection of data from each of the data sets that had not been used to re-train the network. This

    verification process has established that the neural network is internally consistent and is able to

    predict oil thickness representative of the data sets used to train the network in 50-75% of cases,

    even under a wide range of environmental conditions.

    A committee of networks approach was able to correctly assign the thickness of the slick into one of

    four ranges of slick thickness in 76% of cases. In the majority of the misclassifications, the thicknesswas assigned to an adjacent thickness range. Whilst these results are promising, it must be noted that

    the number of data available for training the network is relatively small compared to the number of

    variables in the network and the network is over-fitted. Therefore the networks derived during this

    project should not be considered as being sufficiently robust to be relied upon for operational use at

    present. In addition it is important to emphasise that there is no guarantee that the data sets being

    used to re-train the network have covered the complete range of the parameters (such as the

    meteorological conditions and oil type) which will be encountered in the event of a spill. The

  • 7/26/2019 Estimation of Oil Thickness

    5/41

    - ssue Estimation of Oil Thickness

    AEA Technology iv

    importance of using a training data set which includes the full range of parameter values was

    emphasised by the fact that the network trained solely with the primary data set (summer field trial)

    was not able to predict the thickness for the Sea Empress imagery taken during the winter.

    However, the results are sufficiently encouraging, we believe, to develop a pilot system to deploy as

    an experimental system at the next real incident or experimental field trial. This will allow the

    technique to continue development at every available opportunity while providing an additional tool

    to the MCA in the event of an oil spill.

    To develop the current experimental system into a pilot operational system would require the

    following relatively minor developments:

    Collaboration with Air Atlantique to standardise on image analysis software (for example, using

    ER Mapper) and to devise procedures for downloading the thermal IR imagery to a suitable

    format for analysis.

    Determine suitable techniques for collecting all the variables required by the neural network, i.e.

    air and sea temperature, irradiance.

    Develop a sorbent pad sampling technique that includes real-time analysis for calibration of the

    neural network.

    We believe that this pilot system could be developed within one month. If the pilot system is

    developed and deployed (potentially using the FLIR system on the SAR helicopters as well as the

    MCAs remote sensing planes) at future incidents then the system can be used to give estimates of

    volumes, alongside the normal visual estimates, for operational use. The pilot system can also be

    evaluated in terms of the practicalities of operating the system and the benefits it provides to MCA in

    an oil spill incident.

    In addition, the neural network could be continually re-trained with the new sets of field data taken indifferent environmental conditions to maximise the chances of the operational data falling in the range

    of environmental parameters used in the training process. This will be particularly useful if pad

    sampling is used to provide in-situ calibration of the slick thickness.

  • 7/26/2019 Estimation of Oil Thickness

    6/41

    - ssue Estimation of Oil Thickness

    AEA Technology v

    Contents

    1 Introduction 1

    2 Thermal Infra-red 2

    2.1 THERMAL IR CAMERA 2

    2.2 THERMAL BEHAVIOUR OF SPILLED OIL 2

    2.2.1Thin Oil Layers 3

    2.2.2Thick oil layers 3

    2.2.2Heat exchange and surface temperature effects 4

    3 Previous Neural Network Research Programmes 5

    3.1 INITIAL DEVELOPMENT OF NEURAL NETWORK SYSTEM 5

    3.2 NEURAL NETWORK VALIDATION 8

    4 Research Programme 9

    5 Data Collation 10

    5.1 DATA SELECTION 10

    5.2 TRAINING PARAMETERS 11

    5.3 PRIMARY DATA SET 135.4 SECONDARY DATA SET 15

    6 Neural Network Re-Training 17

    6.1 REGRESSION APPROACH 18

    6.2 CLASSIFICATION APPROACH 19

    6.3 SUMMARY 21

    7 Neural Network Verification 22

    7.1 PRIMARY DATA SET 227.2 SECONDARY DATA SET 23

    7.3 COMMITTEE OF NETWORKS 25

    7.4 SUMMARY 28

  • 7/26/2019 Estimation of Oil Thickness

    7/41

    - ssue Estimation of Oil Thickness

    AEA Technology vi

    8 Conclusions 29

    9 Recommendations 30

    9 References 33

    Appendices

    APPENDIX 1 PRIMARY AND SECONDARY DATA SETS

    VERIFICATION RESULTS FOR SEA EMPRESS DATA USING

    NEURAL NETWORK TRAINED ON PRIMARY DATA

    THERMAL IR IMAGE OF 1997 FORTIES BLEND EXPERIMENT

  • 7/26/2019 Estimation of Oil Thickness

    8/41

    - ssue Estimation of Oil Thickness

    AEA Technology 1

    1 Introduction

    The ability to measure the total volume of oil on the sea surface using airborne remote sensing

    techniques would provide considerable information for those planning a counter pollution response.This would allow near real time estimates of the change in oil volume over time as a result of natural

    processes and response actions. Such figures would also be invaluable in quantifying the success of

    the response technique employed. For example, a quantification of the volume of oil remaining on

    the sea surface before and after a test run of dispersant spraying would enable the responder to

    rapidly assess whether the spilt oil is responding to the dispersant under the given environmental

    conditions.

    Thermal infra-red (IR) is able to detect a different thermal response from the heterogeneous non-

    uniform thickness of an oil slick spreading on the sea surface. However, the relationship between oil

    thickness and thermal response is complex. To overcome the complexities of the inter-relationships,

    the National Environmental Technology Centre (NETCEN) developed a method of analysing IR

    data from experimental oil slicks in the North Sea using neural network processing on behalf of the

    Maritime and Coastguard Agency (Wood et al, 1997). The 1997 work demonstrated that it was

    possible to use a neural network approach to predict the oil thickness of the test data set on the basis

    of the IR response.

    This initial development project was followed with a validation exercise carried out by NETCEN and

    sponsored by the Maritime and Coastguard Agency (Davies et al, 1998). This exercise consisted of

    a set of field experiments to collect the data that was subsequently used to test the validity of the

    trained neural network. This project concluded the neural network was unable to classify oil

    thickness reliably for the validation data set.

    It was also concluded that the failure of the neural network was mainly due to the data being outside

    the data range the network was originally trained on for a number of the input variables. The neural

    network was also unable to account for the thermal effects of the oil as a result of solar heating of

    the surface oil during this set of field trials. The thermal effects in the IR imagery were not observed

    in the field data sets used to train the network.

    In the report by Davies et al (1998) it was suggested that it might be possible to re-train the neural

    network with the new 1997 which included solar heating effects. However it was emphasised in the

    conclusions that there was no guarantee that even with this new additional data set that the full rangeof environmental variables would be taken into account.

    This report details a study involving the re-calibration of the neural network with all the data sets

    obtained during the initial training of the neural network (Wood et al, 1997) and the validation

    exercise (Davies et al, 1998). The data used in the re-calibration includes additional core variables

    that were considered important for predicting the effect of solar heating on the thermal IR imagery

    during the validation exercise. These additional core variables were not previously included as core

    variables in the initial training programme. The re-calibrated neural network was subjected to a

  • 7/26/2019 Estimation of Oil Thickness

    9/41

    - ssue Estimation of Oil Thickness

    AEA Technology 2

    verification process through testing with a selection of data that had not been used to re-train the

    network. This verification process has established whether the neural network is internally consistent

    and is able to predict oil thickness representative of the data sets used to train the network. This

    process does not provide a validation of the network, since the verification data set is a subset of the

    training data set.

    2 Thermal Infra-red

    2.1 THERMAL IR CAMERA

    A thermal IR camera, such as the Talytherm camera in the Air Atlantique Cessna 404, detects

    thermal IR radiation in the 8-13 m range. The detector measures apparent black body

    temperature, which is a function of actual temperature (expressed as C) modified by the emissivity

    of the material that is omitting the IR radiation. Emissivity is the property of a material that indicates

    the proportion of thermal IR emitted by the material when it is at a particular temperature.

    The system detects very small differences in thermal IR emission. For display on a video output, the

    signal is amplified (the degree of amplification is expressed as gain) and the signal base-level is

    expressed as offset. The gain and offset may be adjusted automatically or set manually. Changing

    the offset will alter the response of the camera image to temperature; manually setting the sea to a

    mid-grey on the screen establishes the prevailing sea temperature as being in the middle of the

    detector range. Increasing the gain causes maximum discrimination in the image. The brightness of a

    recorded thermal IR image is therefore a function of the objects temperature, but is an indication of

    relative and not actual temperature.

    2.2 THERMAL BEHAVIOUR OF SPILLED OIL

    A thermal IR image of an oil slick is a representation of apparent temperature differences in the oil

    slick and these can give an indication of relative oil layer thickness. An oil surface at the same

    temperature as a water surface will have a lower equivalent black body temperature than the water

    surface. This will cause the oil to appear cooler in the thermal IR image, although it is at an identical

    temperature as the water surface. On the basis of the difference in emissivity values of oil and water,

    oil should always appear cooler than water when both are at the same temperature. However, oilslicks tend to appear as two distinct areas in a thermal IR image:

    Areas that are apparently cooler than the sea.

    Areas that are apparently warmer than the sea.

    The apparently cooler areas of the slick are areas of relatively thin oil, from 50 m up to

    approximately 250 to 500 m thick and the warmer areas are thicker or emulsified oil, greater than

    approximately 250 to 500 m (Goodman, 1994).

  • 7/26/2019 Estimation of Oil Thickness

    10/41

    - ssue Estimation of Oil Thickness

    AEA Technology 3

    2.2.1 Thin Oil Layers

    Thin layers of oil will be in intimate thermal contact with the sea surface. The thermal capacity of a

    thin oil film will be very low because there is not much oil per unit area. It is therefore unlikely that the

    oil could be at a significantly different temperature from the sea for any length of time before the

    temperature equilibrates. Under most circumstances, the thinner layers of oil are likely to be at almost

    exactly the same temperature as the sea and appear cooler in the thermal IR image because of the

    differences in emissivity between oil and water. There are two factors that may slightly alter the

    actual temperature of thin oil films:

    In the very early stages of oil weathering, the rapid evaporation of large quantities of volatile

    components from the oil layer may cause cooling, leading to the oil film being at a lower

    temperature than the sea for a very short period.

    Layers of oil of any significant thickness are likely to absorb solar radiation more effectively than

    the surrounding sea because they are opaque and dark coloured. This could cause a slight

    increase in temperature of the oil film.

    Evaporative cooling and solar heating will tend to cancel each other out some time after the oil has

    been released onto the sea. If the oil layer is being exposed to solar heating, the rise in temperature

    may still be insufficient to cause the oil to appear warmer than the sea in the thermal IR image

    because of the effect of emissivity.

    2.2.2 Thick oil layers

    While the thinner layers of oil are essentially in thermal equilibrium with the sea, the areas of thick oil

    or emulsion will not be in thermal equilibrium with the sea and can have a higher temperature. The

    higher temperature of the thicker layers of oil will be due to the balance of heat transfer into and out

    of the spilled oil layer. The precise temperature rise will depend on several factors:

    The temperature rise produced by constant heat input will depend on the thermal capacity of the

    oil layer, which is proportional to its volume per unit area (thickness). The actual temperature of

    an oil layer is the result of the balance between heat input from solar radiation and heat lost to the

    air or sea. Thicker oil layers are therefore capable of sustaining a higher temperature difference

    than thinner oil layers.

    The thermal capacity of an oil layer depends on its mass per unit area (thickness) and its specific

    heat capacity. Oil has a specific heat that is about half that of water. As oil emulsifies its specific

    heat will increase in proportion to the water content. Emulsified oil is therefore capable of holding

    more heat than non-emulsified oil.

    The heat input and output is a result of temperature differences between the oil layer, the sea and

    air, and the degree of solar heating. Thick layers of oil or emulsion will absorb heat from the sun

  • 7/26/2019 Estimation of Oil Thickness

    11/41

    - ssue Estimation of Oil Thickness

    AEA Technology 4

    more effectively than the sea because the oil is opaque or of a dark colour. This will cause a

    significant rise in temperature if the rate of heat input is greater than the rate of heat loss to the sea.

    2.2.2 Heat exchange and surface temperature effects

    Unless the sea and air are at precisely the same temperature there will be heat transfer between

    them. If spilled oil is present it will act as a buffer to this heat exchange. The oil will adsorb heat

    from the warmer of the two and lose heat to the cooler. The thermal capacity of the oil layer is

    minuscule compared to that of the sea and atmosphere and it is inevitable that the temperature of the

    oil layer will change. The temperature differentials between air, sea and oil plus the relative thermal

    conductivities of the system will determine the rate of change of temperature of the oil layer.

    The sea surface temperature will tend to remain almost constant during the period of a day, due to its

    massive thermal capacity. However the oil layer temperature will oscillate; the thick layers of oil

    warm to quite high temperatures during the course of the day as heat is absorbed from the sun and

    cool slowly at night as heat is lost to the sea and air. The oil will be cooler than the sea in the early

    morning, but rise to a temperature higher than that of the sea during the day. At nightfall the

    temperature of the oil will start to drop, but it will still be higher than that of the sea for some time in

    the evening. During the night it will lose heat to the cooler air which is not replaced by solar heating,

    and the temperature of the oil will then drop below that of the sea. Therefore typically the greatest

    extent of thick oil is detected in imagery recorded around midday to early afternoon when it is the

    warmest part of the day.

    Similar effects will occur on a shorter time-scale if there is intermittent sunlight or cloud shadow

    effects. The oil will heat up rapidly in the sun, but start to cool when the sun is obscured. The major

    determinants of the heating and cooling of the oil layers will be the intensity and duration of sunlight

    and the relative temperature differences of the air and sea.

    The temperature attained by an emulsified oil layer will be proportional to its thickness under the

    prevailing conditions; thinner layers of oil will lose heat more rapidly, and therefore have a lower

    temperature, than thicker layers. However, this proportionality only holds true for a specific set of

    conditions and the actual temperature attained by the oil in thicker layers will depend on the heating

    and cooling effects that have occurred previously, rather than those that prevail at the time of

    observation.

    In a thermally equilibrated system with no heat exchange, surface temperature will be a very good

    indicator of bulk temperature as heat will be conducted to the surface from the bulk of the oil.

    However, in situations where the temperatures are constantly changing, such as the dynamic heatexchange (heat input and output) which gives rise to temperature differences in spilled oil layers, the

    surface temperature might not be a direct indicator of bulk temperature. In these circumstances, the

    IR brightness (or relative equivalent black temperature) as measured by a thermal IR camera, will be

    an indication of the surface temperature of the oil layer and not of the internal temperature. This can

    occur if there are large differences in sea and air temperature. Heat transfer from the inside of the oil

    layer to the surface might be slower than heat transfer from the surface of the layer to cold air. The

    surface of the oil layer may be chilled by extremely cold air to lower temperatures than that of the

    sea, even though the internal temperature of the oil is higher due to previous heat input.

  • 7/26/2019 Estimation of Oil Thickness

    12/41

    - ssue Estimation of Oil Thickness

    AEA Technology 5

    3 Previous Neural Network Research

    Programmes

    The National Environmental Technology Centre (NETCEN), have completed two previous research

    projects relating the response of thermal IR to oil slick thickness on behalf of the Maritime and

    Coastguard Agency. The initial research project developed the neural network approach to

    measuring oil slick thickness (Wood et al, 1997) and the following research project validated the

    neural network developed in the initial project (Davies et al., 1998).

    3.1 INITIAL DEVELOPMENT OF NEURAL NETWORK SYSTEM

    Neural networks are computing methods that learn from empirical data rather than being

    programmed explicitly. Neural networks are particularly useful in the modelling of complex

    relationships where non-linear effects could be involved. Instead of being programmed with a series

    of equations, the neural network can be trained to learn a cause-effect relationship. During training,

    examples of input and output are fed to the network, which learns the relationships by building up

    interconnecting paths in an iterative training process.

    A range of object-oriented neural network software exists, some of which are appropriate for the

    analysis of IR imagery data. For this type of problem, a neural network may be regarded as a non-

    linear regression tool. It predicts output values from a number of inputs. The inputs are weighted in

    a similar manner to linear regression, and then passed to a set of non-linear basis functions (each

    with a different set of weights). Then a least squares fit is performed on the outputs of the basis

    functions.

    The initial experimentation (Wood et al., 1997) has trained a Multi-Layer Perceptron (MLP) neural

    network architecture (Figure 1). The MLP consists of a number of interconnected layers of

    processing units. The first layer (left on the diagram) is called the input layer. Each circular node

    represents an input to the network. Each input is a single number. The processing units in the other

    layers all perform a similar function, which is to take a weighted sum of the outputs from the

    previous layer, add a bias term, and apply an activation functionto the result. In the diagram, the

    weights are represented by the connections between the circular nodes.

    The data set used to develop the neural network was collected in experimental field trials carried out

    in 1994 and 1995 by NETCEN. The inputs to the neural network included the thermal IR response

    (brightness of the oil and sea in the imagery), oil type, time of day the data was recorded and various

    meteorological parameters. The resulting oil thickness predictions were compared to measured oil

    thickness values which were obtained in the field through sorbent pad sampling.

  • 7/26/2019 Estimation of Oil Thickness

    13/41

    - ssue Estimation of Oil Thickness

    AEA Technology 6

    Figure 1. Schematic Diagram of an MLP

    The initial development work carried out by NETCEN (Wood et al., 1997) suggested an optimal

    configuration for prediction of oil thickness, which is shown in Figure 2. Bias weights are omitted for

    clarity.

    Figure 2. Optimal Configuration for Prediction of Oil Thickness

    B

    Bs

    cos (Time)

    sin (Time)

    Oil T e

    Wind Speed

    Sea Temp

    log( )

    brightness of oil B

    brightness of sea + oil Bsobrightness of sea Bsoil thickness Note that B B Bso s= .

    This optimum configuration was then used to predict oil thickness on a proportion of the field data

    set from the 1994 and 1995 experimental field trials. Figure 3 shows the typical performance of the

  • 7/26/2019 Estimation of Oil Thickness

    14/41

    - ssue Estimation of Oil Thickness

    AEA Technology 7

    network; predicted oil thickness is plotted against the actual oil thickness. The straight line is the line

    y=xand corresponds to a perfect prediction. The R2value is 0.846.

    Figure 3. Scatter Plot of Predicted Against Actual Thickness for Optimal Configuration

    a

    -0.5 0 0.5 1 1.5 2 2.5 3 3.5

    Actual log( )

    -0.5

    0

    0.5

    1

    1.5

    2

    2.5

    3

    Predicted log( ) R

    = 0.846221

    Additionally, a classification approach was used to categorise the data into four thickness ranges:

    0-100 m

    101-500 m

    501-1000 m

    1001-2500 m.

    Table 1 presents typical results from the combined data set obtained in analysing the previous sea

    trial data sets (Wood et al. 1997). When all available sea trial data were used to train the network it

    was possible to classify correctly 80 % of data into one of four thickness classes for the previous

    sea trial data sets.

    Table 1. Confusion Matrix for Combined Sea Trial Data Set

    Bin 0-100

    m

    101-500

    m

    501-1000

    m

    1001-2500

    m

    Correct

    %

    0-100 m

    136

    3

    0

    1

    97.1%

    101-500 m 8 15 3 0 57.7%

    501-1000 m 2 1 7 0 70%

    1001-2500 m 0 1 0 8 88.9%

  • 7/26/2019 Estimation of Oil Thickness

    15/41

    - ssue Estimation of Oil Thickness

    AEA Technology 8

    Once the initial neural network system was completed, it was recognised that the validity of this

    approach had to be tested against data sets which were totally unrelated to the data sets used to

    initially train the network. This led to the validation exercise as described in the following section.

    3.2 NEURAL NETWORK VALIDATION

    The validation exercise consisted of collecting a new data set though an experimental field trial. This

    was carried out in 1997 by NETCEN, a data set was obtained containing thermal IR imagery and

    meteorological parameters for input into the neural network trained in the initial project. Oil

    thickness was also measured during the field trial using sorbent pad sampling for comparison to the

    predicted thickness values. The data sets collated from the 1997 field trial for the validation of the

    neural network were processed by the neural network in order to obtain predicted thickness values

    (Davies et al., 1998).

    As previously stated, the core input variables required by the neural network trained in 1997 were :

    Thermal infrared sea and oil brightness Time of day

    Wind speed

    Sea temperature

    Oil type

    The 1997 field trial data set consisted of data collected from three oil types; Forties Blend crude oil,

    Alaska North Slope crude oil, and IFO-180 heavy fuel oil. The network was originally developed

    with data from the 1994 and 1995 field trials obtained from Medium Fuel Oil and Forties Blend

    crude oil experiments. The neural network requires specification of oil type so the 1997 Forties data

    set was analysed using only the trained network for Forties. The data sets from the Alaska NorthSlope Crude oil slick and IFO-180 were processed by the network with oil type as both MFO and

    Forties.

    Using these input data, the neural network analysis was unable to accurately predict the measured oil

    thickness. For example, the data set from the experimental Forties blend slicks showed that

    measured slick thickness data were representative of all four thickness categories. Yet the neural

    network analysis categorised all the thickness measurements into just two, non-consecutive,

    categories 0-100 m and 501-1000 m. Further, the categorisation was purely a function of time:

    before midday all the IR data was categorised into 0-100 m despite a measured thickness of 336-

    2625 m; and after midday all the data was categorised into 501-1000 m despite a measuredthickness of 139-2359 m.

    The poor performance of the neural network classification was likely to be due to the degree of solar

    heating of the surface oil during this set of field trials which was not observed in the field data sets

    used to train the network. In addition, in the training data sets the air temperature was above the

    seawater temperature in nearly all cases whereas in the validation data set, air temperature was

    always below sea temperature thereby reversing the net thermal flux.

  • 7/26/2019 Estimation of Oil Thickness

    16/41

    - ssue Estimation of Oil Thickness

    AEA Technology 9

    4 Research Programme

    The validation exercise demonstrated that the neural network trained in 1997 was incapable of

    reliably predicting oil slick thickness. Therefore, a research programme was devised to re-calibrate

    the neural network using both the initial training data set and the validation data set. This would

    indicate whether the neural network is able to predict oil slick thickness from a more varied data set.

    The re-calibrated neural network would then be tested with a subset of the data that was not used

    for training as a verification of whether the neural network is internally consistent and therefore has

    the ability to predict oil slick thickness.

    The research programme was divided into three stages;

    1. Data collation

    2. Neural network re-training

    3. Neural network verification

    The data collation stage involved collating all the data sets that had been used in the previous two

    neural network research projects. This was followed by a data selection procedure to ensure that

    the data used to retrain the neural network was as accurate and robust as possible. This involved re-

    examination of the thermal IR imagery and the thickness measurements. Specific meteorological

    factors were selected that were considered essential to the prediction of the oil thickness, i.e.

    emissivity, irradiance, air and sea temperature. Once the final data set was complete, a proportion of

    the data was removed for the verification stage and the remaining data set was used to re-train the

    neural network.

    The neural network was re-trained using the brightness measurements from the thermal IR imagery,

    the thickness measurements and the selected meteorological parameters. The optimum neural

    network settings for the data set were devised based on the work undertaken in the initial training of

    the neural network. Once the neural network was trained to the best output for the data set, the

    verification stage was undertaken.

    A spreadsheet was developed to enable the trained network to be tested using the verification data

    that was not used for the training. The network was then tested and the results examined todetermine the accuracy of the neural network.

    Each of these stages is described in more details in the following three sections.

  • 7/26/2019 Estimation of Oil Thickness

    17/41

    - ssue Estimation of Oil Thickness

    AEA Technology 10

    5 Data Collation

    There are two sub-sections of data that have been used in the neural network research programmes;

    the 1994 and 1995 sea trial data, which was used to train the original neural network, and the 1997

    sea trial data which was used to validate the trained neural network.These combined data sets consist of a total of 225 data observations from four oil types (Alaska

    North Slope crude oil, Forties Blend crude oil, Medium Fuel Oil/Gas Oil blend and IFO-180 Heavy

    Fuel Oil). The data sets contain the following parameters for each data observation:

    1. Oil slick thickness (as measured by sorbent pad sampling)

    2. Thermal IR response (brightness values)

    3. Altitude of aircraft

    4. Time of day

    5. Camera offset and gain6. Oil type

    7. Wind speed and direction

    8. Air and sea temperature

    9. Irradiance

    10.Surface current (at 1 metre depth)

    All of these parameters were studied in the initial research programme and it was determined that

    many of these parameters have no effect on the prediction of oil thickness using the thermal IR

    brightness data. The studies undertaken as part of the earlier validation of the neural network

    (Davies et al, 1998) highlighted the importance of solar heating on the prediction of oil thicknesstherefore certain parameters were identified as essential to predict the changes in the thermal IR

    imagery as a result of heating. This is discussed further in Section 4.2.

    These data sets have undergone different levels of data processing and interpretation. The latest data

    set, from the 1997 sea trial, is the most accurate and robust as a result of the continuing development

    of our data processing and sampling techniques. Therefore it has been vital to the retraining

    programme that all the data sets were reassessed to ensure all the data used in the retraining was as

    accurate and robust as possible.

    5.1 DATA SELECTION

    The data selection process was aimed at reducing the likelihood of data being included in the

    retraining which represents a false relationship between the oil thickness and IR response.

    The first stage of the data selection procedure was to re-calculate the thickness measurements made

    in 1994 and 1995 using sorbent pad samples to ensure that the increased thickness of emulsions

    was considered.

  • 7/26/2019 Estimation of Oil Thickness

    18/41

    - ssue Estimation of Oil Thickness

    AEA Technology 11

    The data sets were then subjected to a range of data selection criteria, listed as follows:

    1. Thermal IR imagery must have been obtained at an altitude of less than 3000ft (as the resolution

    of the image reduces significantly with increasing altitude).

    2. Removal of spurious data when:

    The brightness value is taken from a poor quality image

    The brightness value is not taken in the vicinity of the sampling vessel.

    The thickness sampling position is unknown.

    3. Data sets of a particular oil type, which because of the nature of the field experiment, over-

    represented data of particularly low thickness value were reduced in number to prevent the data

    from biasing the training of the neural network toward these very low thickness values.

    This data selection process reduced the number of data observations from 225 to 96. The majority

    of data removed was that obtained during the 1994 sea trials during the Medium Fuel Oil/Gas Oil

    blend continuous release trials, where the data obtained consisted of thickness measurements of less

    than 10 m for more than 80 data observations. This level of thickness is difficult to accurately

    measure with sorbent pad samples and is at the limit of detection for the thermal IR camera.

    Therefore, the relationship between the thermal IR brightness, oil thickness and other variables may

    not be represented as accurately as the data observations taken at higher oil thickness levels. So by

    including such a large number of samples of less than 10 m, this may have biased the training of the

    neural network toward the low oil thickness values, which are of little interest in operational

    response. The majority of the other data observations removed were as a result of low resolution

    (high altitude) and poor quality thermal IR imagery.

    5.2 TRAINING PARAMETERS

    The neural network was trained in the initial research programme (Wood et al., 1997) using the

    following key parameters:

    Brightness of oil

    Brightness of oil and sea

    Time

    Oil type

    Wind speed

    Sea temperature

    During the validation studies in the previous research project, it was observed that there were further

    parameters that may be important in predicting oil thickness from thermal IR imagery. It was

    observed in the validation data that the extent of solar heating significantly affected the brightness of

    the thermal IR imagery, therefore it would be important to include irradiance and air temperature

    data as key parameters in the neural network.

  • 7/26/2019 Estimation of Oil Thickness

    19/41

    - ssue Estimation of Oil Thickness

    AEA Technology 12

    Thermal IR emission is related to both the way a surface emits IR radiation as a function of

    temperature and the actual temperature of the surface. This relationship is described by the

    emissivity, therefore emissivity is an important parameter when studying thermal IR response. The

    emissivity values for the oils were not included in the data sets used to train the original neural

    network. Therefore the inclusion of emissivity data as a parameter in the re-training was considered

    essential to improve the accuracy of the thickness predictions.

    The emissivity value for each of the four oils in the data set was obtained and is detailed as follows.

    The directional reflectance of the oil was measured over a wavelength range of 2-25 m. The

    thermal IR camera operates over a range of 8-13 m, so the emissivity values over this range were

    noted and the average emissivity value taken for each oil type. The emissivity values are shown in

    the following table.

    Table 2. Emissivity Values for the Oils.

    Wavelength

    (m)

    Alaska North

    Slope crude oil

    Forties Blend

    crude oil

    Medium Fuel

    oil/Gas oil blend

    IFO-180 Heavy

    Fuel Oil

    8 0.957 0.963 0.960 0.953

    9 0.958 0.964 0.960 0.953

    10 0.957 0.963 0.960 0.953

    11 0.957 0.963 0.960 0.953

    12 0.957 0.963 0.959 0.952

    13 0.956 0.962 0.958 0.951

    Average 0.957 0.963 0.960 0.953

    The parameters selected for re-training the neural network are as follows.

    1. As used in the original neural network;

    Brightness of oil

    Brightness of oil and sea

    Sea Temperature

    2. Additional core variables identified during the validation exercise; Emissivity

    Irradiance

    Air Temperature

    The additional core variables in the current training parameters were identified during the validation

    exercise.

  • 7/26/2019 Estimation of Oil Thickness

    20/41

    - ssue Estimation of Oil Thickness

    AEA Technology 13

    It is important to keep the number of key training parameters to a minimum to reduce over-fitting in

    the neural network training. Note that the emissivity values replaced the oil type input variable in

    the neural network. Also the time of day and wind speed parameters have been excluded as these

    are considered to have a lesser effect on the brightness and the effects are accounted for through the

    other parameters (irradiance, sea and air temperature).

    The change in the parameters selected for re-training the neural network in comparison to the

    previous training are summarised in the following table.

    Table 3. Previous and current training parameters

    Previous training parameters Current training parameters

    Brightness of oil Brightness of oil

    Brightness of oil and sea Brightness of oil and seaSea temperature Sea Temperature

    Oil type Emissivity

    Time of day Irradiance

    Wind speed Air Temperature

    5.3 PRIMARY DATA SETAfter the data had undergone the data selection procedure and the key training parameters were

    selected, the data, labelled the primary data set, was divided into a training data set and a

    verification data set. The selection of the verification data set was carried out by taking a

    representative sample from each thickness prediction range to:

    Obtain a verification data set that was representative of a range of oil thickness with different oil

    types, at different temperatures, different brightness ranges and different irradiance levels.

    Ensure the input variables for the verification data were within the limits of the training data set.

    The following table details the total number of data observations in each of the thickness classification

    ranges and the amount of data taken for training and testing the neural network.

    Table 4. Number of thickness observations in each prediction range for the primary data set.

    Thickness Range TOTAL Training Verification

  • 7/26/2019 Estimation of Oil Thickness

    21/41

    - ssue Estimation of Oil Thickness

    AEA Technology 14

    0-100 28 23 5

    101-500 33 28 5

    501-1000 14 11 3

    >1001 21 18 3

    TOTAL 96 80 16

    This table can be divided into each oil type as follows:

    Table 5. Number of thickness observations in each prediction range for each oil type.

    Thickness

    Range

    Alaska North Slope crude oil Forties Blend crude oil

    Total Training Verification Total Training Verification

    0-100 3 3 0 5 4 1

    101-500 9 8 1 5 4 1501-1000 3 2 1 5 4 1

    >1001 6 5 1 12 10 2

    TOTAL 21 18 3 27 22 5

    Thickness

    Range

    IFO-180 Heavy fuel oil Medium Fuel Oil/Gas Oil

    Total Training Verification Total Training Verification

    0-100 0 0 0 20 16 4

    101-500 2 2 0 17 14 3

    501-1000 0 0 0 6 5 1>1001 1 1 0 2 2 0

    TOTAL 3 3 0 45 37 8

    Tables 4 and 5 demonstrate that the primary data set consists of data observations in each of the

    thickness ranges for all of the oil types specified (except for IFO-180 heavy fuel oil). There is a

    fairly even spread of data observations for each thickness range, thereby reducing the chance of

    biasing the training of the neural network to certain thickness ranges during the re-training.

    It is important that the limits of the training set are defined clearly if the trained neural network is to beapplied to other data sets. The neural network can not reliably predict outside the limits of the

    training data set. The primary training data set consists of the following upper and lower boundaries

    for each of the parameters.

    Table 6. Primary Training Set Range of Parameters.

  • 7/26/2019 Estimation of Oil Thickness

    22/41

    - ssue Estimation of Oil Thickness

    AEA Technology 15

    Parameter All Data

    Max Min

    Thickness 5079 4

    Brightness of oil & sea 230 32

    Brightness of sea 215 87

    Brightness of oil 85 -126

    Sea Temperature 18.15 14.8Air Temperature 18 14.12

    Irradiance 940 -11.7

    Emissivity 0.963 0.953

    5.4 SECONDARY DATA SET

    The primary data set consists of data from the 1994, 1995 and 1997 sea trials only. These trials

    were all carried out during the summer months (July, August and September) when the ambient

    temperature was fairly warm (14-18C). As a result, once trained with this data, the neural network

    may only be able to predict oil thickness within these summer temperatures. Therefore the neuralnetwork was also trained with a secondary data set containing data collected at colder temperatures

    (i.e. during the winter months). This secondary data set allows the robustness of the neural network

    to be established.

    The secondary data set consists of the primary data set (summer data) and data from the Sea

    Empress incident (winter data). The Sea Empress incident occurred in February 1996 and

    throughout the incident, thermal IR imagery and a range of meteorological parameters were

    recorded. This information was compiled into a data set to train the neural network. However, it

    must be noted that this data should not be considered an accurate training set because of the absence

    of in-situ measurements of oil thickness and will only be used to determine the robustness of thetrained neural network.

    The data required by the neural network trained in 1999 is:

    Thermal IR imagery brightness values

    Sea temperature

    Air temperature

    Irradiance

    Emissivity

    Oil thickness

    The thermal IR imagery, sea and air temperature were recorded during the Sea Empress incident.

    The irradiance data were obtained from Met Office for the dates required specifically for this

    project. The emissivity of the oil spilt during the Sea Empress (Forties Blend crude oil) was obtained

    for the primary data set so further emissivity analysis was not required. The oil thickness was not

    measured at specific locations during the incident, however visual estimates were made during the

    early stages of the incident so can be used as a guide to the range of oil thickness observed during

    the incident.

  • 7/26/2019 Estimation of Oil Thickness

    23/41

    - ssue Estimation of Oil Thickness

    AEA Technology 16

    A data set was created for the neural network from thermal IR imagery taken from three dates in

    February 1996 (22, 27 and 29). As no thickness measurements were made during the incident, the

    data set was created through taking the brightness level in the thermal IR image from different areas

    of the slick that are most likely to have a different oil thickness. The oil thickness at the specific

    location where the brightness level was recorded was estimated into one of four categories as

    follows:

    1. 0-100 m = specific thickness applied to the data observation was 50 m

    2. 101-500 m = specific thickness applied to the data observation was 250 m

    3. 501-1000 m = specific thickness applied to the data observation was 750 m

    4. 1000-2500 m = specific thickness applied to the data observation was 1750 m

    These estimates of thickness were based on the relative thickness indicated in the thermal IR imagery

    and our knowledge of the behaviour of Forties Blend crude oil. The thickness estimates underwent a

    series of checks through independent estimates of the thickness by three experienced staff. This

    resulted in more than 60% of the data being classified into the same thickness category. However, it

    is important to note that this method of estimating the thickness is by no means accurate and has

    been used as a method of allowing the use of the Sea Empress data, because it is invaluable in testing

    the neural network at this stage.

    The final Sea Empress data set consisted of 84 data observations from the three dates in February.

    The data included imagery from thick and thin oil layers, ambient temperatures ranging from 2 to 6C

    and a range of irradiance levels. A total of 18 data observations were removed from the data set for

    the testing phase of the project. These were representative of the range of the parameters

    represented in the whole data set.

    The following table details the thickness observations in the Sea Empress data set.

    Table 7. Number of observations in each thickness category for the Sea Empress data set.

    Thickness Range Forties Blend crude oil (Sea Empress)

    Total Training Verification

    0-100 7 5 2

    101-500 19 15 4

    501-1000 31 26 5

    >1001 27 20 7

    TOTAL 84 66 18

    The Sea Empress data set was added to the primary data set to create the secondary data set. This

    data set contains a total of 180 data observations of which 146 are for the training and 34 are for the

    verification. The following table details the number of data observations for each thickness range for

    the secondary data set.

  • 7/26/2019 Estimation of Oil Thickness

    24/41

    - ssue Estimation of Oil Thickness

    AEA Technology 17

    Table 8. Number of data observations in each thickness category for the secondary data set.

    Thickness Range Secondary data set

    Total Training Verification

    0-100 35 28 7101-500 52 43 9

    501-1000 45 37 8

    >1001 48 38 10

    TOTAL 180 146 34

    The secondary data set, including observations of the Forties Blend crude oil spilt at the Sea

    Empress incident, significantly extends the range of environmental parameters in comparison to the

    primary data set. The following table shows the range of parameters the total of the secondary data

    set covers (primary and Sea Empress). The range of parameters for the other oil types are as in the

    primary data set detailed in Table 6.

    Table 9. Secondary Data Set Range of Parameters for the Secondary Data.

    Parameter All Data

    Max Min

    Thickness 5079 1

    Brightness of oil & sea 255 0

    Brightness of sea 230 32

    Brightness of oil 215 -170

    Sea Temperature 18 8Air Temperature 18 2

    Irradiance 940 -11.7

    Emissivity 0.963 0.953

    6 Neural Network Re-Training

    The primary and secondary training data sets were used in a Multi-Layer Perceptron (MLP) neural

    network architecture, as used in the initial development of the neural network. Our previous work

    (Wood et al., 1997) has suggested this is the best system for approaching these types of complex

    relationships. This methodology is described in Section 3.1.

    A problem anticipated at the start of the neural network training process was the danger of over-

    fitting the network - a problem which frequently arises when the number of observations is small

    compared to the number of adjustable parameters. The MLP has the property of being a universal

  • 7/26/2019 Estimation of Oil Thickness

    25/41

    - ssue Estimation of Oil Thickness

    AEA Technology 18

    function approximator, which means that given the right architecture (number of processing units) it

    can learn any functional mapping. This is its strength and also its weakness. The ability to learn

    mapping means that it can learn to model the noise in the data used to train it. Thus, training

    examples can be reproduced well, but the model is influenced by the noise in the measurement data.

    The model will not then produce sensible answers when tested on a new data set.

    As detailed in Section 5.2, 7 input variables have been selected for the revised training of the

    network. The network is forced to define the relationships between each of these variables and each

    of the nodes in the inner layers (described in Section 3.1) and between the elements of the hidden

    layer and the final output variable. The two sets of data available for training contain just 80 (primary

    data set) and 146 (secondary data set) data observations. These small numbers of data were not

    expected to be sufficient to avoid the problems of over-fitting.

    As discussed previously (Wood et al., 1997) an initial solution is to train the network on only a

    specified fraction of the available training data (the training data set) and to retain the remainder of

    the training data set for testing the network during the training process (the training test data set).

    The ability of the network to reproduce the training data set improves throughout the training

    process. However, the parameters used to define the trained network are not those obtained when

    the best reproduction of the training data set is achieved, instead they are those obtained when the

    best performance against the training test data set is achieved.

    Once the best configuration of the neural network was defined for the primary and secondary data

    sets, it was then tested against the verification data set. The MLP network allows for two

    approaches to be adopted during training i.e. regression and classification. Both of these have been

    explored during this project.

    6.1 REGRESSION APPROACH

    The network was trained using 75% of the available training data from the Primary data set and, as a

    separate process, using 75% of the available training data from the Secondary data set. Whilst the

    data span the range from 4-5000 m, only 3 out of 146 data are in excess of 2500 m. The

    network was, therefore, trained to estimate log (thickness) to prevent errors in the estimation of these

    few large values biasing the error analysis.

    To determine the optimum number of hidden neurons for the network, 15 training runs (trials) were

    carried out with the network configured with 2-10 hidden neurons. The performance of the network

    in reproducing the test data set was compared for each of the groups of 15 trials using a T-test. The

    results indicated that the network did not perform significantly better with any one number of hiddenneurons - this is again an indication that the problem was over-fitted.

    In general, the network was able to provide an acceptable reproduction of the measured thickness in

    the training data set. The typical performance of the network is shown in Fig 4 for the network

    configured with 7 hidden neurons and trained against the Primary data set. Whilst acceptable, this is

    not as good as the performance reported during the initial training of the network (Wood et al.,

    1997) which had an r2of 0.85.

  • 7/26/2019 Estimation of Oil Thickness

    26/41

    - ssue Estimation of Oil Thickness

    AEA Technology 19

    Figure 4. Scatter plot for predicted against actual thickness for the primary data set.

    6.2 CLASSIFICATION APPROACH

    Instead of treating the prediction of oil thickness as a regression problem, it is also possible to treat it

    as a classification problem. Using this approach, the network does not attempt to predict the exact

    thickness, but to assign the data into one of a series of bins representing thickness ranges. The

    thickness ranges used in this study were selected to be consistent with those used in this previous

    study (Wood et al, 1997). Alternative ranges, e.g. based on equal bandwidths, would have been

    equally valid.

    Training the network on both the Primary and Secondary data sets resulted in the same optimalarchitecture for both data sets. In both cases, 8 hidden neurons gave the optimal performance,

    although the numerical values assigned to each of the weights in the network differed. The optimal

    architecture is shown below:

    Figure 5. Optimal architecture of neural network for both data sets as a classification problem.

    Emissivity

    Brightness of oil and sea

    0 - 100 m

    Brightness of sea

    101 - 500 m

    Brightness of oil

    501 - 1000 m

  • 7/26/2019 Estimation of Oil Thickness

    27/41

    - ssue Estimation of Oil Thickness

    AEA Technology 20

    Sea temperature

    1001 - 6000 m

    Air temperature

    Irradiance

    Hidden

    Neurons

    The bias weights are omitted for clarity on Figure 5.

    The results from a typical training run of the network trained on the Primary data set are summarisedin thematrix below. The numerical values relate to the number of data points predicted to lie in each

    combination of predicted and measured thickness categories. Diagonal elements indicate a correct

    classification, whilst off-diagonal elements indicate an incorrect classification.

    Table 10. Classification matrix using the Primary data set

    Predicted

    Measured 0-100 m 101-500 m 501-1000 m 1001-6000 m % Correct

    0-100 m 17 6 0 0 74%

    101-500 m 6 20 1 1 71%

    501-1000 m 1 2 6 2 55%1001-6000 m 2 1 0 15 83%

    73%

    When trained with data from the Primary data set, the network is able to correctly classify over 70%

    of the available training data.

    The corresponding matrix for the network trained with the Secondary data set is summarised below.

    Table 11. Classification matrix for the Secondary data set

    PredictedMeasured 0-100 m 101-500 m 501-1000 m 1001-6000 m % Correct

    0-100 m 17 7 1 3 61%

    101-500 m 2 36 2 3 84%

    501-1000 m 3 3 31 0 84%

    1001-6000 m 1 1 4 32 84%

    79%

  • 7/26/2019 Estimation of Oil Thickness

    28/41

    - ssue Estimation of Oil Thickness

    AEA Technology 21

    Using data from a wider range of environmental conditions, the network is able to correctly classify

    almost 80% of the available training data.

    6.3 SUMMARY

    In summary, the neural network was trained with two data sets; the primary and secondary data sets.

    The primary data set contained reasonably accurate in-situ thickness observations. The secondary

    data set consisted of the primary data set and data from the Sea Empress incident which consisted of

    estimates of the in-situ thickness by three members of the NETCEN team.

    The optimum training approach was the classification method. The neural network was trained with

    each of the data sets and the test errors of the training are as follows:

    Correct Classification (%)

    Primary Data Set 73

    Secondary Data Set 79

    This indicates the training using the secondary data set was able to produce a higher percentage of

    correct classifications than for the primary data set.

  • 7/26/2019 Estimation of Oil Thickness

    29/41

    - ssue Estimation of Oil Thickness

    AEA Technology 22

    7 Neural Network Verification

    This phase consists of testing the trained neural network with a selection of data that was not used to

    re-train the network. This exercise can not be used as a true validation of the network since the

    verification data set is a subset of the data being used to train the network. This verification processis used to establish whether the neural network is internally consistent and is able to predict oil

    thickness representative of the data sets used to train the network.

    A spreadsheet was developed to run the neural network model produced in the training phase. The

    results of the verification are detailed as follows.

    7.1 PRIMARY DATA SET

    The following table details the results of the verification of the neural network trained as a

    classification problem with the primary data set. The table includes the measured oil thickness values

    and those predicted by the trained neural network (the correctly classified thickness values are

    shaded in solid grey).

    Table 12. Test data and thickness predictions for the primary data set.

    Oil type Oil

    Brightness

    Sea

    Temp

    (C)

    Air

    Temp

    (C)

    Irradiance Measured

    Thickness

    (m)

    Predicted

    Thickness

    (m)

    Alaska North Slope 13 18.01 14.12 742 222 0-100

    Alaska North Slope -72 18.05 14.58 184 525 1001-6000

    Alaska North Slope 52 18.00 15.00 24 1556 1001-6000

    Forties Blend 25 17.68 16.83 410 7 1001-6000

    Forties Blend 37 18.10 17.02 375 310 101-500

    Forties Blend 27 18.10 16.08 86 954 101-500

    Forties Blend 10 14.99 16.93 270 1531 1001-6000

    Forties Blend 19 14.80 16.88 20 3316 1001-6000

    MFO/GO 15 14.99 17.02 5 7 0-100

    MFO/GO 35 16.44 17.75 359 21 101-500MFO/GO 15 16.83 17.71 412 23 0-100

    MFO/GO 8 15.29 16.68 120 29 0-100

    MFO/GO 12 15.24 16.97 260 104 0-100

    MFO/GO 30 16.73 17.71 451 204 101-500

    MFO/GO 25 14.99 17.17 840 435 0-100

    MFO/GO 10 15.04 17.17 930 537 101-500

  • 7/26/2019 Estimation of Oil Thickness

    30/41

    - ssue Estimation of Oil Thickness

    AEA Technology 23

    The results of the verification are summarised in the matrix below.

    Table 13. Classification matrix for the verification of the network trained on the Primary data set

    Predicted

    Measured 0-100 m 101-500 m 501-1000 m 1001-6000 m % correct0-100 m 3 1 0 1 60%

    101-500 m 3 2 0 0 40%

    501-1000 m 0 2 0 1 0%

    1001-6000 m 0 0 0 3 100%

    50%

    The network correctly classifies 50% of the verification data into the appropriate thickness bin.

    However the verification data set contains a number of datum for which the measured thickness lies

    on the borderline of two bins (e.g. 104 m has been classified as 0-100 m). These are indicated inTable 12 by shading in grey diagonal lines. If these values are also included, the network can be

    regarded as correctly classifying 63% of the values into the correct thickness bin.

    On examination of the correct and incorrect thickness predictions (Table 13), the neural network

    does not appear to be predicting thickness consistently incorrectly for specific oil types, thickness

    categories, thermal IR brightness or temperature. There is no systematic bias to the 37-50%

    incorrect classification.

    It was observed in the validation project (Davies et al, 1998) that once the data was outside the

    training range of the neural network, the network could not predict oil thickness reliably. As an

    additional verification, the neural network trained with the primary data set was used to predict theoil thickness using the data from the Sea Empress incident. The primary network is trained with data

    collected during summer conditions and the Sea Empress data was collected during the winter.

    Therefore the Sea Empress data set will not fall in the range of the primary training data set. The

    result of this verification is that the neural network classifies the oil thickness for all the Sea Empress

    data observations into the 1001-6000 m thickness range despite the estimated thickness ranging

    from 50-1750 m. This demonstrates that the neural network cannot reliably predict oil thickness

    outside the range of the training data set. The results of this verification are shown in more detail in

    Appendix 1.

    7.2 SECONDARY DATA SET

    The following table details the results of the verification of the neural network trained as a

    classification problem with the secondary data set. The correct thickness classifications are

    indicated with a solid grey shaded area.

  • 7/26/2019 Estimation of Oil Thickness

    31/41

    - ssue Estimation of Oil Thickness

    AEA Technology 24

    Table 14. Test data and thickness predictions for the secondary data set.

    Oil type Oil

    Brightness

    Sea

    Temp

    (C)

    Air

    Temp

    (C)

    Irradiance Measured

    Thickness

    (m)

    Predicted

    Thickness (m)

    Alaska North Slope 13 18.01 14.12 742 222 0-100Alaska North Slope -72 18.05 14.58 184 525 501-1000

    Alaska North Slope 52 18.00 15.00 24 1556 1001-6000

    Forties Blend 25 17.68 16.83 410 7 101-500

    Forties Blend 37 18.10 17.02 375 310 101-500

    Forties Blend 27 18.10 16.08 86 954 1001-6000

    Forties Blend 10 14.99 16.93 270 1531 501-1000

    Forties Blend 19 14.80 16.88 20 3316 1001-6000

    MFO/GO 15 14.99 17.02 5 7 0-100

    MFO/GO 35 16.44 17.75 359 21 101-500

    MFO/GO 15 16.83 17.71 412 23 101-500MFO/GO 8 15.29 16.68 120 29 0-100

    MFO/GO 12 15.24 16.97 260 104 0-100

    MFO/GO 30 16.73 17.71 451 204 0-100

    MFO/GO 25 14.99 17.17 840 435 0-100

    MFO/GO 10 15.04 17.17 930 537 101-500

    Forties Blend 15 7.75 2.14 127 50 101-500

    Forties Blend 24 7.95 7.00 251 50 101-500

    Forties Blend 32 7.65 2.14 400 250 501-1000

    Forties Blend 38 7.75 2.14 127 250 101-500

    Forties Blend 39 7.80 6.69 75 250 101-500

    Forties Blend 58 7.95 7.00 251 250 501-1000

    Forties Blend 54 7.65 2.14 400 750 501-1000

    Forties Blend 44 7.65 2.14 400 750 501-1000

    Forties Blend 59 7.75 2.14 127 750 501-1000

    Forties Blend 88 7.80 6.69 75 750 501-1000

    Forties Blend 60 7.95 7.00 251 750 501-1000

    Forties Blend 71 7.65 2.14 400 1750 1001-6000

    Forties Blend 59 7.65 2.14 400 1750 1001-6000

    Forties Blend 83 7.65 2.14 400 1750 1001-6000

    Forties Blend -40 7.75 2.14 127 1750 1001-6000

    Forties Blend 98 7.80 6.69 75 1750 501-1000

    Forties Blend -62 7.95 7.00 251 1750 501-1000

    Forties Blend -51 7.95 7.00 251 1750 1001-6000

    The verification results are summarised in the following matrix.

  • 7/26/2019 Estimation of Oil Thickness

    32/41

    - ssue Estimation of Oil Thickness

    AEA Technology 25

    Table 15. Classification matrix for the network trained on the Secondary data set

    Predicted

    Measured 0-100 m 101-500 m 501-1000 m 1001-6000 m % correct

    0-100 m 2 5 0 0 29%

    101-500 m 4 3 2 0 33%

    501-1000 m 0 1 6 1 75%1001-6000 m 0 0 3 7 70%

    53%

    Overall, the network is able to correctly classify 53% of the data into the appropriate thickness bin.

    As with the primary data set, the verification data contains a number of datum for which the

    measured thickness lies on the borderline of two bins (e.g. 104 m has been classified as 0-100m).

    These are indicated in Table 14 by shading in grey diagonal lines. If these values are also included,

    the network can be regarded as correctly classifying 59% of the values into the correct thickness bin.

    In some situations, the network returns a high probability that the thickness lies in one particular bin,indicating a high confidence that the result lies in that bin. In other cases, a more even distribution of

    probabilities may be returned, indicating a lower confidence in the prediction of the network. If the

    analysis also takes account of the predictions where the probability that the thickness lying in the

    correct bin is only slightly lower than the probability that the thickness lies in an alternative bin, then

    the results improve and the network comes close to correctly classifying almost 70% of the data.

    The overall percentage error in the verification results for the secondary data set is very similar to that

    for the primary data set. In the secondary data set, the verification data classified incorrectly

    accounted for 16 of the 34 data observations whereas in the primary data set 8 of the 16 data

    observations were incorrectly classified. As discussed previously there is no apparent bias to theincorrect predictions, i.e. they are not oil type, thickness or specific variable dependent.

    It is worthy of note that the majority of the data set incorrectly classified by the network trained into

    the primary data set were also incorrectly classified by the network trained with secondary data set.

    This implies that the network is consistently failing to learn the relationship between certain variables

    as a result of the lack of training data. It is also possible that the input variables used in the training

    do not adequately represent the relationship between the input variables and the thickness, therefore

    it may be that another input variable is required to predict the thickness in some instances.

    7.3 COMMITTEE OF NETWORKS

  • 7/26/2019 Estimation of Oil Thickness

    33/41

    - ssue Estimation of Oil Thickness

    AEA Technology 26

    To further address the problem of over-fitting the data, another approach was adopted in which a

    number of networks were trained using the Secondary data set. To make a prediction, the outputs

    from these trained networks were combined, summed and normalised. As each network has a

    slightly different training set (because only 75% of the available training data is used for training, as

    described in Section 6.1) each one will over-fit the data in a different way and hence when averaged

    the effects of over-fitting should be averaged out. This approach (termedforming a committee of

    networks) is a well-established technique and was also used in the earlier studies (Wood et al.,

    1997).

    The neural network was re-trained using the committee of networks approach with the secondary

    data set. The resultant committee underwent testing with the verification data set and the results are

    detailed in the Table 16.

  • 7/26/2019 Estimation of Oil Thickness

    34/41

    - ssue Estimation of Oil Thickness

    AEA Technology 27

    Table 16. Test data and thickness predictions for the secondary data set using a committee of

    networks.

    Oil type Oil

    Brightness

    Sea

    Temp

    (C)

    Air

    Temp

    (C)

    Irradiance Measured

    Thickness

    (m)

    Predicted

    Thickness (m)

    Alaska North Slope 13 18.01 14.12 742 222 101-500

    Alaska North Slope -72 18.05 14.58 184 525 501-1000

    Alaska North Slope 52 18.00 15.00 24 1556 1001-6000

    Forties Blend 25 17.68 16.83 410 7 101-500

    Forties Blend 37 18.10 17.02 375 310 101-500

    Forties Blend 27 18.10 16.08 86 954 1001-6000

    Forties Blend 10 14.99 16.93 270 1531 0-100

    Forties Blend 19 14.80 16.88 20 3316 1001-6000

    MFO/GO 15 14.99 17.02 5 7 0-100

    MFO/GO 35 16.44 17.75 359 21 500-1000MFO/GO 15 16.83 17.71 412 23 0-100

    MFO/GO 8 15.29 16.68 120 29 0-100

    MFO/GO 12 15.24 16.97 260 104 0-100

    MFO/GO 30 16.73 17.71 451 204 0-100

    MFO/GO 25 14.99 17.17 840 435 101-500

    MFO/GO 10 15.04 17.17 930 537 101-500

    Forties Blend 15 7.75 2.14 127 50 0-100

    Forties Blend 24 7.95 7.00 251 50 0-100

    Forties Blend 32 7.65 2.14 400 250 101-500

    Forties Blend 38 7.75 2.14 127 250 101-500

    Forties Blend 39 7.80 6.69 75 250 101-500

    Forties Blend 58 7.95 7.00 251 250 101-500

    Forties Blend 54 7.65 2.14 400 750 501-1000

    Forties Blend 44 7.65 2.14 400 750 501-1000

    Forties Blend 59 7.75 2.14 127 750 501-1000

    Forties Blend 88 7.80 6.69 75 750 501-1000

    Forties Blend 60 7.95 7.00 251 750 501-1000

    Forties Blend 71 7.65 2.14 400 1750 1001-6000

    Forties Blend 59 7.65 2.14 400 1750 1001-6000

    Forties Blend 83 7.65 2.14 400 1750 1001-6000

    Forties Blend -40 7.75 2.14 127 1750 1001-6000

    Forties Blend 98 7.80 6.69 75 1750 501-1000

    Forties Blend -62 7.95 7.00 251 1750 1001-6000

    Forties Blend -51 7.95 7.00 251 1750 1001-6000

  • 7/26/2019 Estimation of Oil Thickness

    35/41

    - ssue Estimation of Oil Thickness

    AEA Technology 28

    The data are summarised into the following matrix.

    Table 17. Classification matrix for the committee of networks trained on the Secondary data set

    Predicted

    Measured 0-100 m 101-500 m 501-1000 m 1001-6000 m % correct

    0-100 m 5 1 1 0 71%

    101-500 m 2 7 0 0 78%

    501-1000 m 0 1 6 1 75%

    1001-6000 m 1 0 1 8 80%

    76%

    The network is able to correctly classify 76% of the data into the appropriate thickness bin.

    As previously, the verification data contains a number of datum for which the measured thickness lieson the borderline of two bins (e.g. 104 m has been classified as 0-100 m). These are indicated in

    Table 16 by shading in grey diagonal lines. If these values are also included, the network can be

    regarded as correctly classifying 82% of the values into the correct thickness bin.

    The committee of networks approach should decrease the noise in the network training and thereby

    decrease the error in the predictions. On comparison of the results of the verification using the single

    classification training (Tables 14 and 15) and using the committee of networks training (Tables 16

    and 17), the percentage error in the predictions has decreased as expected. There was a significant

    improvement in the number of correct predictions for the lower thickness categories (0-100 m and

    101-500 m) and a slight improvement in the high thickness category (1001-6000 m). This is atotal of 26 correct predictions of the 34 data observations in the verification data set.

    The incorrect predictions are the same data observations incorrectly predicted in the verification of

    the single classification training. This supports the theory that the network is consistently failing to

    learn the relationship between certain variables and the oil thickness.

    7.4 SUMMARY

    The verification tests of three neural networks trained resulted in the following verification results:

  • 7/26/2019 Estimation of Oil Thickness

    36/41

    - ssue Estimation of Oil Thickness

    AEA Technology 29

    Table 18. Summary of the verification tests for the neural networks

    Network Training

    Type

    Data Set % Correct % Correct within

    10% error

    Single Classification Primary 50 63

    Single Classification Secondary 53 59Committee of

    networks classification

    Secondary 76 82

    The best verification results were obtained from the committee of networks classification with the

    secondary data set which was shown to be capable of correctly classifying the thickness of the

    verification data in over 76% of cases. A 10% error is included in the % correct to allow for

    occasions where the actual thickness is close to the boundary of the thickness categories, for

    example an actual thickness of 104m is classified as 0-100m. This raises the percentage correctly

    predicted for all the training types, the highest % correct remains the committee of networks at 82%.

    It was observed with each of the trained neural networks that the same verification data observations

    were being incorrectly classified for the majority of cases. Some of the data observations had similar

    input variables, so the network classified them into the same thickness category despite sometimes

    having actual thickness values from different thickness categories.

    This implies that the network is consistently failing to learn the relationship between certain input

    variables and the thickness. This could be due to the size of the training data set, if more data was

    available for training the network would have a better chance of fully learning the relationship

    between the input variables and the thickness. It is also possible that the input variables used in the

    training do not adequately represent the relationship between the input variables and the thickness,

    therefore another input variable may be required to predict the thickness in some cases.

    8 Conclusions

    The re-calibration suggests that the re-trained neural network is internally consistent in predicting oil

    thickness based on the input data used. This analysis includes data sets consisting of a wider range

    of environmental variables (sea and air temperature) and thermal IR imagery recorded in both winter

    and summer that has not been previously included in the training.

    The errors identified in the predictions by the network show no clear pattern with oil type, brightness

    or thickness categories. This suggests the network does not have a systematic bias. The 1999 re-

    trained network demonstrates the ability to account for solar heating which was not the case for the

    1997 trained network used for the validation project (Davies et al., 1998). This is evident through

    correct predictions from data taken throughout the day (i.e. at different irradiance levels and ambient

    temperatures) and from different oil thickness levels.

  • 7/26/2019 Estimation of Oil Thickness

    37/41

    - ssue Estimation of Oil Thickness

    AEA Technology 30

    However it is important to emphasise that there is no guarantee that the data sets that have been used

    to retrain the network will cover the complete range of the parameters which will be encountered in

    the event of a spill, such as the meteorological conditions and oil type. In addition, some of the data

    used are estimated rather than measured. It is still not clear whether the size of the training data set

    required to produce a robust system makes this neural network technique impracticable in the long

    term.

    Using the committee of networks approach it was possible to obtain the correct classification in 76%

    of cases. For the majority of the remaining cases, the misclassification was only by one thickness

    category. However, the verification studies carried out in this project indicate that there is still a

    significant error in the thickness predictions. This may be improved with:

    A larger training data set

    Or/and

    More input variables

    This additional data could be obtained during actual incidents and experimental field trials in the

    future. In this way, the neural network can be continually re-trained and updated to take account of

    additional validated data taken in different environmental conditions. This would maximise the

    chances of the operational data falling in the range of environmental parameters used in the training

    process.

    9 Recommendations

    The ultimate aim of this research is to develop a technique for MCA to measure oil slick thickness.

    Having knowledge of the thickness of the slick allows calculation of the volume of oil on the sea

    surface in an oil spill incident. This enhances the operational response to a spill by allowing

    assessments to be made on the efficiency of a response technique and ensures that response

    equipment is used to its maximum effectiveness.

    The re-training of the neural network has shown that this technique is partially successful in measuring

    oil thickness. However, the networks derived during this project can not be considered as beingsufficiently robust to be reliable during operational use at present. In order to continue the

    development of this technique and to provide some benefit of the research to MCA at this stage, we

    recommend a pilot system is developed which can be deployed as an experimental system at the

    next spill or experimental field trial.

    On deployment, the pilot system can provide MCA with an additional tool for the operational

    response. The practicalities of the system and its usefulness to the MCA can be evaluated. This

    includes the data analysis and collection facilities and procedures. The accuracy of the neural

  • 7/26/2019 Estimation of Oil Thickness

    38/41

    - ssue Estimation of Oil Thickness

    AEA Technology 31

    network can be studied further and re-training undertaken as and when additional data sets are

    collected to continue development of this technique.

    To develop the current experimental system into a pilot operational system would require the

    following relatively minor developments:

    Collaboration with Air Atlantique to standardise on image analysis software (for example, using

    ER Mapper) and to devise procedures for downloading the thermal IR imagery to a suitable

    format for analysis.

    Determine suitable techniques for collecting all the variables required by the neural network, i.e.

    air and sea temperature, irradiance.

    Develop a sorbent pad sampling technique that includes real-time analysis for calibration of the

    neural network.

    If the pilot system is developed and deployed (potentially using the FLIR system on the SAR

    helicopters as well as the MCAs remote sensing planes) at future incidents then the system can be

    used to give estimates of volumes, alongside the normal visual estimates, for operational use.

    As an illustration of how the volume could be calculated from the oil thickness predictions from the

    neural network, a thermal IR image was taken from the Forties Blend experiment in the 1997

    experimental sea trials. This image (shown in Appendix 1) was divided into four brightness bands

    and the surface area of the oil slick in each brightness band determined. The thickness relating to a

    single brightness value in each brightness band was determined using the trained committee of

    networks. For each of the brightness bands, the neural network predicted the thickness in the range

    of 1001-6000 m. This is not unreasonable because the release of the Forties Blend was completed

    only 10 minutes before the thermal IR was recorded, therefore the oil is likely to be at a fairly

    uniform thickness. To calculate the volume of oil, a single thickness value was assigned to the

    thickness range. The value for the 1001-6000 m range was taken to be 1750 m based on thespread of the training data in the thickness range. Table 18 details the surface area and predicted

    volume using the neural network.

    Table 19. Volume prediction using the committee of networks.

    Neural Network

    prediction

    IR

    Brightness

    Range

    Area

    (m2)

    Thickness

    (m)

    Volume

    (m3)

    0-126 155 1750 0.3

    126-183 6035 1750 10.6

    183-208 10324 1750 18.0

    208-255 11116 1750 19.5

    TOTAL 27630 TOTAL 48.4

  • 7/26/2019 Estimation of Oil Thickness

    39/41

    - ssue Estimation of Oil Thickness

    AEA Technology 32

    The actual volume of the oil slick when the image was taken is estimated at 44-51 m3. The neural

    network volume estimate is 48 m3, which is within this estimated range.

  • 7/26/2019 Estimation of Oil Thickness

    40/41

    - ssue Estimation of Oil Thickness

    AEA Technology 33

    9 References

    Davies L., Dooley K., Lunel T and Lewis A., Field validation of neural network analysis to measure

    oil slick thickness, AEAT-3474, AEA Technology, Culham, Oxfordshire, UK, 1998.

    Goodman R., Overview and future trends in oil spill remote sensing, Spill Science and Technology

    Bulletin, Vol 1, No 1, pp11-21, 1994.

    Wood P, Strachan I, Davies L and Lunel T., Determination of oil thickness by neural network

    analysis, AEAT-1151, AEA Technology, Culham, Oxfordshire, UK, 1997.

  • 7/26/2019 Estimation of Oil Thickness

    41/41

    Appendix 1

    Data Sets

    CONTENTS

    Table 1 Primary Data Set

    Table 2 Secondary Data Set

    Table 3 Verification results using Sea Empress data on the neural network

    trained with the primary data set

    Figure 1 Thermal IR image of Forties Blend experiment from 1997 seatrial