Assessment of the parameter estimation capabilities of gPROMS ...

77
Graduation report Professor: Prof. Dr. Ir. M. Steinbuch Supervisors: Prof. Dr. Ir. A.C.P.M. Backx Ir. G.M.P. Perkins Eindhoven Technical University Department Mechanical Engineering Dynamics and Control Technology Group Amsterdam, July 2005 Assessment of the parameter estimation capabilities of gPROMS and Aspen Custom Modeler, using the Sec-Butyl- Alcohol stripper kinetics case study Peter Tijl DCT 2005.96

Transcript of Assessment of the parameter estimation capabilities of gPROMS ...

Graduation report

Professor: Prof. Dr. Ir. M. Steinbuch

Supervisors: Prof. Dr. Ir. A.C.P.M. Backx

Ir. G.M.P. Perkins

Eindhoven Technical University

Department Mechanical Engineering

Dynamics and Control Technology Group

Amsterdam, July 2005

Assessment of the parameter estimation

capabilities of gPROMS and Aspen

Custom Modeler, using the Sec-Butyl-

Alcohol stripper kinetics case study

Peter Tijl

DCT 2005.96

2

Summary

In order to optimize the chemical process that occurs in the Sec-Butyl-Alcohol (SBA)

stripper operated by Shell in Pernis, the kinetic parameters of the relevant reaction

scheme are required. For this purpose laboratory experiments have previously been

performed and modelled with Aspen Custom Modeler (ACM), a commercially available

software package developed by AspenTech. The objective of this graduation project is to

build a second model of the experiments using the gPROMS software package, which is

developed by Process Systems Enterprise (PSE), subsequently perform parameter

estimation with both software tools and assess their capabilities. Both experiment models

should be as similar as possible to allow for a comparative assessment. Therefore, the

physical and thermodynamic properties of the components in ACM are made available to

the gPROMS model via the CAPE-OPEN interface, which is successfully applied. The

experiment model developed in gPROMS consists of a vapour-liquid equilibrium that

describes the distribution of nine components with reactions taking place in the liquid

phase. The model responses are found to be very similar to the existing ACM model.

During parameter estimation it became clear that not all the kinetic parameter of the

proposed reaction scheme are individually observable in combination with the available

experiment data. Therefore, the system is partially reparameterised and the parameters

are determined with sufficient accuracy. Various aspects of parameter estimation are

assessed, such as: experiment data input, output interpretation, available combinations of

objective functions and optimization solvers and their ability, speed and accuracy of

obtaining a solution. From this work it is concluded that the parameter estimation

capabilities of gPROMS are better than ACM. Additionally, it is investigated what type

of experiments are required in order to obtain parameters that remained unobservable

with the current experiment data. Attempts to apply the gPROMS Sequential Experiment

Design (SED) were unsuccessful and the added value of the SED functionality could not

be assessed. Alternatively designed experiments are simulated with a model that contains

known kinetic parameters and subsequently this synthetic experiment data is used to

estimate the parameters. The effect of data reduction on the observability of the

parameters is investigated. It proved to be possible to estimate all parameters in three

different reaction systems using a reduced set of synthesized experiment data.

3

Table of contents

SUMMARY ................................................................................ 2

TABLE OF CONTENTS.............................................................. 3

1 INTRODUCTION ................................................................. 5

2 PROJECT OBJECTIVES ....................................................... 6

3 ASSESSMENT CRITERIA .................................................... 7

3.1 MODEL BUILDING ........................................................................................................................ 8 3.2 PARAMETER ESTIMATION............................................................................................................ 8 3.3 EXPERIMENT DESIGN................................................................................................................... 9

4 CASE STUDY .....................................................................11

4.1 PROCESS DESCRIPTION ............................................................................................................ 11 4.2 LABORATORY EXPERIMENTS .................................................................................................... 12 4.3 REACTION SCHEME ................................................................................................................... 13 4.4 COMPONENTS AND INTERMEDIATES ......................................................................................... 14

5 MODEL BUILDING.............................................................16

5.1 ESSENTIAL SYSTEM EQUATIONS............................................................................................... 16 5.2 THE CAPE-OPEN INTERFACE ................................................................................................ 19 5.3 THE VAPOUR-LIQUID EQUILIBRIUM (VLE) ................................................................................ 19

5.3.1 Implementation ..................................................................................................................... 21 5.4 EXPERIMENT DATA INPUT ......................................................................................................... 23 5.5 ASSESSMENT ON MODEL BUILDING .......................................................................................... 26

6 PARAMETER ESTIMATION ................................................28

6.1 REACTION KINETICS .................................................................................................................. 28 6.1.1 Reaction kinetics reduced scheme ........................................................................................ 29 6.1.2 Reaction kinetics complete scheme....................................................................................... 34

6.2 SOLVING METHODS ................................................................................................................... 42 6.2.1 Objective functions ............................................................................................................... 42 6.2.2 Optimisation solvers ............................................................................................................. 45

6.3 PERFORMANCE ......................................................................................................................... 47 6.3.1 Ability and speed .................................................................................................................. 47 6.3.2 Accuracy ............................................................................................................................... 48 6.3.3 Effect of optimisation tolerance............................................................................................ 50

6.4 OUTPUT INTERPRETATION ........................................................................................................ 51 6.4.1 Overlay plots......................................................................................................................... 52 6.4.2 Statistical analysis ................................................................................................................ 54

6.5 ASSESSMENT ON PARAMETER ESTIMATION ............................................................................. 56

7 EXPERIMENT DESIGN.......................................................58

7.1 INTRODUCTION .......................................................................................................................... 58 7.2 APPROACH ................................................................................................................................ 59 7.3 TRIANGLE REACTION SCHEME .................................................................................................. 60 7.4 CONSTRAINTS ........................................................................................................................... 64

7.4.1 Reduced measurement data .................................................................................................. 65

4

7.4.2 Effect of measurement error ................................................................................................. 67 7.5 CONCLUSIONS ON EXPERIMENT DESIGN................................................................................... 67

8 CONCLUSIONS..................................................................69

REFERENCES ..........................................................................70

NOMENCLATURE.....................................................................72

POINTING FACTOR .....................................................................72

APPENDIX A CHEMICAL STRUCTURES OF COMPONENTS...75

APPENDIX B HAMMETT ACIDITY ..........................................76

SAMENVATTING .....................................................................77

5

1 Introduction

In the petrochemical process industry there is an increasing challenge to improve

efficiency and reduce costs due to tight margins, a maturing business and environmental

legislation. An important method to face this challenge is process modeling with the use

of so-called Computer Aided Process Engineering (CAPE) tools. These CAPE tools are

extensively used in the design, operation and optimisation of chemical processes. In order

to obtain useful models that can be applied to design optimisation, values of unknown

parameters relating to chemical kinetics must be obtained from laboratory or plant data.

The process of formulating a model and determining unknown parameters is referred to

as model development and is investigated in this project.

Multiple commercial developers of CAPE tools exist and each software package has it’s

own specific functionalities. In this project the model development and parameter

estimation capabilities of two widely used commercial software packages are assessed

and compared from a user point of view. The two packages are Aspen Custom Modeler

11.1 developed by AspenTech and gPROMS 2.3.3 developed by Process Systems

Enterprise. Both software programs have previously been reviewed by the Eurokin

consortium [8] however, with test cases that were not very challenging. The case study

applied to assess and compare the model development capabilities of both CAPE tools in

this project is a realistic and industrially relevant example. In the case study the aim is to

obtain the unknown kinetic parameters of the reactions taking place in the Sec-Butyl

Alcohol (SBA) stripper, operated by Shell. Experiments have been performed on

laboratory scale and a dynamic model of these experiments is built to fit the experiment

data.

The outline of this report is as follows: In chapter 2 the project objectives are identified.

Next, the process of model development is presented and the criteria used to asses both

software packages are defined at the various model development stages in chapter 3. In

chapter 4, the case study that is applied to investigate the model development capabilities

is described in terms of the actual process and the laboratory experiments. Chapter 5

elaborates on how the model is developed in gPROMS. The unknown parameters in this

model are estimated as described in chapter 6. Most of this work is also performed in

Aspen Custom Modeler (ACM) and at the end of chapters 5 and 6 the model

development capabilities of gPROMS are assessed and where possible compared with

ACM. In chapter 7 it is investigated what type of experiments are required to obtain the

kinetic parameters. Finally, the conclusions that are drawn from the assessment are

presented.

6

2 Project objectives

An important objective of a company in the petrochemical process industry is to optimize

chemical processes and their operation. Therefore, it is essential to have insight into the

process of interest, in particular, to understand which chemical reactions take place and

what the accessory reaction kinetics are. A method frequently used in practice to obtain

the reaction scheme and its kinetic parameters is to model the system and perform

parameter estimation with the use of CAPE software tools [4]. A number of such

commercial software packages exist; two widely used tools are gPROMS and Aspen

Custom Modeler. The main objective of this project is to:

Assess the parameter estimation capabilities of the gPROMS software and

compare them with the Aspen Custom Modeler software, using a realistic

industrial process as case study, which is the Sec-Butyl-Alcohol stripper.

At the start of this project, laboratory experiment data and an experiment model in the

ACM software with the necessary physical properties of the components were available.

Furthermore, a reaction scheme was proposed that was assumed to describe the reactions

taking place both in the actual stripper process and in the laboratory experiments. The

main objective is divided into three underlying objectives, followed by a brief

explanation:

1. Develop an experiment model in gPROMS similar to the existing model in ACM,

applying the CAPE-OPEN interface to ensure identical physical properties.

In order to compare the model development capabilities of both software packages, it is

essential to use two identical experiment models with identical physical properties. The

exchange of physical properties between the two software packages should be possible

via the so-called CAPE-OPEN interface.

2. Perform parameter estimation to obtain a unique reaction scheme and its kinetic

parameters with both software packages and compare their performance and

usability.

A unique reaction scheme implies that a model response is the result of a unique set of

model parameters only. The criteria used to assess and compare the performance and

usability of both software packages are introduced in the next chapter.

3. Investigate what type of experiments is required to obtain the parameters

preferably by applying the gPROMS Sequential Experiment Design functionality

The gPROMS software has an extra functionality compared to the ACM software, which

is Sequential Experiment Design (SED) [20]. This would enable the user to sequentially

design future experiments such that the experiment data, used to improve the accuracy of

previously estimated parameters, has maximal information content. Then the response of

the system is such that the parameters become more observable.

7

3 Assessment criteria

In this chapter the criteria used to assess and compare the model development capabilities

of both software packages are introduced by walking through the process of model

development as schematically presented in Figure 3.1. Each step involving the use of a

CAPE tool is briefly discussed and the criteria relevant for that step are defined.

Figure 3.1 Schematic representation of model development.

Start

Model

kin data

phys.

Parameter

Estimation

Good fit?

Parameter

accuracy ok?

One clearly best

candidate model?

Finished

Alternative

kinetic scheme

Perform

experiment

Experiment

Design

parameter

precision

model

discrimination

yes

no

no

yes

yes

no

no

8

3.1 Model building

Assuming an experiment is already performed, the first stage in the model development

process is the building of a model that describes the performed experiments. In this

context, a model is referred to as a mathematical system of equations and variables that

describes the behaviour of a physical system. Typical items present in a model that

describes chemical experiments carried out to obtain a unique reaction scheme and its

kinetic parameters are: equations of conservation of mass and individual components

according to a reaction scheme with kinetic parameters, physical properties of the

components and the experiment data required to perform parameter estimation. The

criteria used to evaluate both packages in the model building section are related to the

latter two typical items of an experiment model:

• CAPE-OPEN compliancy; can the physical properties available in the ACM

environment be applied in the gPROMS software via the CAPE-OPEN interface?

• Interaction with MS Excel for experiment data input.

Other aspects such as programming effort and syntax are not taken into account since

they are very similar. Furthermore, the user is expected to have basic knowledge and

skills in model building, which is required for applying both software tools.

3.2 Parameter estimation

When the experiments are performed and a model of these experiments is built, the next

stage in the model development process is the estimation of unknown model parameters.

The software tools apply a mathematical routine that optimizes the fit of the model

response to measured values from the experiment data by varying certain model

parameters. The criterion for what is an optimal fit is defined by a so-called objective

function. The mathematical routine used to minimize the objective function by moving

the model parameters from an initial guess to their optimal values is referred to as the

optimisation solver. The first criterion used to evaluate both packages in the parameter

estimation section is:

• Ability and speed of obtaining optimal values for kinetic parameters

If the parameter estimation routine has found an optimal solution of the model

parameters, critical questions need to be asked that can hopefully be answered by correct

interpretation of the output provided by the software package. Correct interpretation of

the parameter estimation output depends on the skills and knowledge of the user, but also

on the amount, type and presentation of the statistical output provided by the software

package.

Good fit? The first aspect of the parameter estimation solution that has to be

investigated is the goodness of fit. This can be done by making an overlay plot of the

9

measured and predicted values of the experiment values. When the predicted values do

not match the measured values at all, this indicates that the model cannot describe the

actual process. Assuming other model aspects are correct, the proposed solution is to

generate an alternative kinetic scheme that does represent the reaction mechanism taking

place in the chemical process. The criterion to evaluate both software tools in the output

interpretation stage with regard to this first critical question is:

• Accessibility and functionalities of overlay plots.

Parameter accuracy ok? If the goodness of fit is satisfactory, the second aspect in

interpreting the parameter estimation output is the parameter accuracy. The accuracy is

usually presented in terms of the standard deviation of an estimated parameter and a

confidence interval with a percentage of certainty that the actual parameter value is

within that interval. The reason for possible unsatisfactory parameter accuracy is that the

experiment data does not contain sufficient information to accurately determine the

model parameters. There are two approaches to tackle this problem; the first one is to

design one or more new experiments that, in combination with the already available

measurement data, do contain sufficient information. This approach is discussed in more

detail in the next stage of model development, experiment design. Secondly, the model

can be re-parameterised, reducing the number of parameters to be estimated. Assuming

other model aspects are correct this can result in a reduced kinetic scheme with less or

other reactions then the initial scheme. However, it can also be the case that some

parameters cannot be determined individually because they become clustered in the rate

equations, this then becomes apparent by high cross-correlations of the parameters

involved. The group of clustered parameters can be replaced by a single parameter that

can probably be determined with sufficient accuracy. Alternatively, all but one of the

parameters that are clustered can be set to a fixed value. All these forms of re-

parameterisation reduce the number of parameters to be estimated. To indicate the reason

for unsatisfactory parameter accuracy and deciding which parameters are redundant

requires detailed knowledge of the reaction system and is usually not straightforward.

The criteria used to assess and compare both software packages in the output

interpretation stage regarding parameter accuracy are:

• Level of accuracy achieved compared to estimated value of parameter.

• Available statistical information.

3.3 Experiment design

When the parameter accuracy is unsatisfactory or when it is not clear which of multiple

candidate models is superior, one or more new experiments need to be performed. Since

performing experiments is usually costly and time consuming, it is required they are

designed as efficiently as possible. Here, efficiently means that the experiment data

contains maximal information to either improve the parameter accuracy or distinguish

10

between candidate models. The criteria used to assess and compare both software

packages in the experiment design stage of the model development process are:

• Availability and functionality of experiment design for parameter precision.

• Added value of the experiment design feature.

The last criterion is difficult to assess without walking through the following steps:

perform an experiment designed by the software packages and an experiment designed

intuitively, make an iteration by performing parameter estimation with the extra data

from both experiments and finally evaluating whether the parameter accuracy or model

discrimination is significantly better in the case of the experiments designed by the

software package based on statistical methods. It was however not possible to perform

new experiments in the scope of this project, therefore other methods need to be found to

evaluate the added value of experiment design by CAPE tools. This is discussed in more

detail in chapter 7.

11

4 Case study

The case study investigated to compare the model development capabilities of the two

software packages gPROMS and ACM, is a realistic industrial case. It involves the

stripper section of the production process of Secondary-Butyl Alcohol (SBA). In order to

obtain the reaction scheme and the intrinsic kinetics parameters of the reactions taking

place in the SBA stripper section, batch experiments have been performed in a laboratory

and a model that describes these experiments has been developed and validated with the

available experiment data. In the model development process, the kinetic parameters are

determined using the parameter estimation functionalities in both commercial modeling

software tools gPROMS and ACM.

In this chapter, first a short description of the actual production process of SBA is

presented, followed by a description of the conducted laboratory experiments used in this

case study. Next, the kinetic scheme of the reactions expected to take place during both

the actual process and the experiments is discussed and finally an overview is given of

the components involved in these reactions.

4.1 Process description

The COF/2 unit at Pernis produces SBA, precursor to Methyl Ethyl Ketone (MEK), by

hydration of Butene (C4) in the presence of sulphuric acid (H2SO4). In the reaction

section liquid C4 is reacted with 72% sulphuric acid at 40 to 55 ºC, thereby forming so-

called fat acid containing SBA in free form and as Mono-Butyl Sulphate (MBS). This acid

is hydrolysed by the addition of water in the hydrolysis section, after which it is fed into

the stripper section. Low-pressure steam as a stripper medium enters the stripping

columns from the bottom. SBA and other light components, such as various butenes, are

drawn off at the top vapour outlet and diluted sulphuric acid or spent acid is drawn off at

the bottom liquid outlet. All light components drawn off are fed into the next section,

where caustic is used to neutralize any residual acid. Spent acid is sent to the Mantius,

where it is re-concentrated to approximately 72% acid strength, after which it can be re-

used in the reaction section [7].

The focus in this project is on the stripper section. The SBA stripper is a reactive

distillation column in which, as the products are being removed by distillation, the MBS

to SBA hydrolysis equilibrium is shifted towards the alcohol. Not only does the desired

hydrolysis reaction take place in the stripper, but also the undesirable reversion of MBS

and SBA to butanes. Roughly 25 to 30% of the potential alcohol is turned into reversion

gas that must be recycled to the reactors.

The objective in the case study is to obtain the reaction scheme and its kinetic parameters,

which will be used in a later stage to improve understanding in how to manipulate the

stripper operating variables such that the formation of reversion gas is minimized.

12

4.2 Laboratory experiments

The laboratory batch experiments were carried out using a “Hastelloy-C bolted-head”

autoclave with a total blocked-in volume, which includes the liquid phase and the vapour

cap, of 1033.2 cc. A “Welker Cylinder” with a capacity of 500 cc was used as a feed

charging system for the transfer of butane/butene liquid feed into the autoclave. Samples

taken from the liquid phase in the autoclave were collected in stainless steel sample

bombs, using an automated sampling system, and were diluted with cyclohexane and

water [16]. A schematic representation of the experiment set-up is presented in Figure

4.1.

Figure 4.1 Schematic representation of experiment set-up.

Two types of experiments were carried out to determine the reaction kinetics. The first

type involves the (desired) hydrolysis of MBS via an intermediate to SBA. The second

type of experiments is the reverse of the first type, it is the (undesired) sulfation of SBA

via an intermediate to MBS. Both temperature and pressure are maintained constant

during individual experiments. In order to raise the temperature, a heating jacket was

used and the reactor was pressurized by adding nitrogen (N2) gas, which is inert.

In order for the experiment data to obtain information on the reaction activation energies,

it is required to have experiment data at multiple temperatures. Therefore, both the

hydrolysis and the sulfation experiments are performed at three temperatures: 60, 75 and

90 °C with various initial compositions. Unfortunately, the experiments could not be

performed at the actual process temperature of about 115 °C. At this temperature, the

reactions occur too fast for the equipment to take multiple samples during the transient

behaviour of the reactions. Capturing the transient behaviour is essential to determine the

kinetic parameters that describe the reaction dynamics.

13

A total of 19 experiments have been performed, named “exp097” increasing in number

up to “exp115”. They are distinguished into two different types and three different

temperatures, as presented in Table 4.1. Five samples from the liquid phase are taken

during each experiment and the composition in terms of weight percentage of seven

components is measured, resulting in 665 data points. The weight percentage of the eight’

component (water) is determined indirectly, satisfying the constraint that the weight of all

eight components together is 100% of the total weight.

Table 4.1 Overview of experiments used in case study.

Sulfation Hydrolysis

exp097 exp104

exp099 exp105

exp112 exp106 T = 60 °C

exp115

exp098 exp107

exp100 exp108

exp101 exp109

exp113

T = 75 °C

exp114

exp102 exp110 T = 90 °C

exp103 exp111

Additional experiments have been performed to investigate the vapour-liquid phase

distribution, in this case study only the experiments in Table 4.1 are made use of. Details

of the experiment procedures and the data obtained from the experiments in Table 4.1

and the additional vapour-liquid phase distribution experiments can be found in [16].

4.3 Reaction scheme

Multiple reaction schemes have been proposed to describe the reaction kinetics in both

the batch experiments and the actual stripper process. The reaction scheme applied for the

reaction section in previous work was adopted to describe the reactions in the stripper

section [16]. However, during model development it became clear that the kinetic

parameters of the initially proposed reaction scheme could not be obtained in

combination with the available measurement data. The original reaction scheme is

simplified by removing one reversible reaction in order to obtain a unique fit with the

available measurement data. In Chapter 7 it is investigated what type of experiments

would provide the data that is required to estimate the parameters of this reaction. The

reaction scheme used in this case study to compare both software tools is schematically

displayed in Figure 4.2.

14

C4⊕ SBE

SBA

H⊕H⊕

H⊕

H⊕

1-C4= T2-C4

= C2-C4=

SBAMBS

H2SO4 H2O

H⊕

H⊕

C4⊕ SBE

SBA

H⊕H⊕

H⊕

H⊕

1-C4= T2-C4

= C2-C4=

SBAMBS

H2SO4 H2O

H⊕

H⊕

Figure 4.2 Schematic representation of reaction scheme

All the reactions that occur in the reaction scheme as presented in Figure 4.3 are

reversible and take place in the liquid phase. The reaction rate constants ki are presented

in brackets where the first of the two reaction rate constants belongs to the forward

reaction rate.

1-C4 + H+ C4

+ (k1 and k2)

c2-C4 + H+ C4+ (k3 and k4)

t2-C4 + H+ C4+ (k5 and k6)

C4+ + H2SO4 MBS + H+ (k7 and k8)

C4+ + H2O SBA + H+ (k9 and k10)

C4+ + SBA SBE + H+ (k11 and k12)

Figure 4.3 Reaction scheme.

A description of the components and intermediates that occur in the reaction scheme is

presented in the following paragraph.

4.4 Components and intermediates

In Table 4.2 an overview is provided of the components and intermediates C4+ and H

+

that are present in the reaction scheme. During the experiments the tank is pressurized

with the inert nitrogen gas, therefore this component is included in the overview as well.

15

Table 4.2 Overview of components and intermediates.

Abbreviation Full name Chemical formula

H2O Water H2O

H2SO4 Sulphuric acid H2SO4

SBA Sec-Butyl Alcohol C4H10O

MBS Mono-Sec-Butyl Sulfate C4H10O4S

SBE Sec-Butyl Ether C8H18O

C4+

Protonated Butene C4H9

1-C4 1-Butene C4H8

c2-C4 Cis-2-Butene C4H8

t2-C4 Trans-2-Butene C4H8

H +

Proton H

N2 Nitrogen N2

The chemical structures of the components in Table 4.2 are displayed in Appendix A.

16

5 Model building

The first stage in the model development process is model building. A model is defined

as a system of equations that describes the behaviour of a physical system in terms of

inputs, states and outputs. Before constructing a model it should be clear what it’s

purpose and requirements are. In this case, the purpose of the model is to determine the

unknown kinetic parameters of the reaction scheme that describes the SBA stripper

process. Therefore, batch experiments have been performed and the experiment model

will be validated using the experiment data, thereby determining optimal estimates of the

values for the intrinsic kinetic parameters.

The batch experiment model is required to describe the vapour-liquid equilibrium (VLE)

that exists in the CSTR and the reactions that take place in the liquid phase. Such an

experiment model has previously been built in ACM and a new, identical model needs to

be developed in gPROMS. Some challenging aspects in modelling the batch experiments

are the use of the CAPE-OPEN interface to describe the VLE and the fact that the amount

of nitrogen is a free variable that is determined by satisfying the constraint that the

pressure is fixed in every experiment. The VLE determines the component distribution

over a vapour and a liquid phase, which is commonly referred to as a flash calculation. In

this chapter, first the essential system of equations is described that is required to solve a

tank reactor model consisting of a vapour-liquid equilibrium, N components and R

reactions, making use of a CAPE-OPEN interface. Next, the implementation and use of

the CAPE-OPEN interface is described. Finally, the underlying theory of vapour-liquid

equilibria is addressed followed by a consideration on how the VLE is best implemented

in the gPROMS model in order to result in a component distribution over both phases

similar to what the existing ACM model predicts.

5.1 Essential system equations

The system of equations presented in this paragraph is the minimal set of equations

required to solve a tank reactor model consisting of a vapour-liquid equilibrium, N

components and R reactions, making use of a CAPE-OPEN interface. In order to be able

to perform a Degree Of Freedom (DOF) analysis, the variables introduced in Table 5.1

are assigned a certain status (Known, Unknown: Algebraic, Unknown: Differential or

Initial) according to the terminology used in gPROMS. In this system, the number of

moles of the Nth component (N2) has status Unknown: Algebraic and is a free variable.

Otherwise the system would be overspecified, since the temperature, the pressure and the

total volume of the tank all have status Known and are thus fixed. The reader is referred

to the nomenclature for a description of the symbols.

17

Table 5.1 Status of essential model variables according to gPROMS terminology.

Indices Variable status Symbol

component reaction

Known T ,P , TV , Li ,φ , Vi ,φ , Ln,υ , Vn,υ , ji ,υ , jia , , jA , jEa , gR Ni ,...,1= Rj ,...,1=

Unknown: Algebraic Tn , Nn , ix , iy , Lin , , Vin , , Ln , Vn , LV , VV , LiC , , jk Ni ,...,1= Rj ,...,1=

Unknown: Differential t

ni

∂ 1,...,1 −= Ni

Initial in 1,...,1 −= Ni

Equations (5.1) up to and including (5.16) represent the core of the experiment model

developed in gPROMS. As follows from the DOF analysis in Table 5.2 they form a

consistent set, which is a necessary requirement since all model equations are solved

simultaneously and most equations are implicitly solved. The conservation of the number

of moles is given by

∑=

=N

i

iT nn1

(5.1)

∑=

=N

i

LiL nn1

, (5.2)

∑=

=N

i

ViV nn1

, (5.3)

The liquid and vapour mole fractions are required to satisfy equations (5.4) up to and

including (5.11).

L

Li

in

nx

,= 1,...,1 −= Ni (5.4)

V

Vi

in

ny

,= 1,...,1 −= Ni (5.5)

11

=∑=

N

i

ix (5.6)

11

=∑=

N

i

iy (5.7)

18

ViLii nynxn ⋅+⋅= Ni ,...,1= (5.8)

The liquid and vapour mole fractions also appear as input variables for the CAPE-OPEN

interface, as is the case in equations (5.9) up to and including (5.11). The vapour-liquid

equilibrium is described by (5.9), as will be discussed in detail in paragraph 5.3.

( ) ( )iViiiLii yPTyxPTx ,,,, ,, φφ ⋅=⋅ Ni ,...,1= (5.9)

The liquid volume, the vapour volume and the total volume satisfy equations (5.10),

(5.11) and (5.12) respectively

( )iLnLL xPTnV ,,,υ⋅= Ni ,...,1= (5.10)

( )iVnVV yPTnV ,,,υ⋅= Ni ,...,1= (5.11)

VLT VVV += (5.12)

Equations (5.13) up to and including (5.16) are required to describe the change of the

number of moles in the liquid phase due to chemical reactions.

∑=

⋅⋅=∂

∂ R

j

jjiL

i rVt

n

1

,υ 1,...,1 −= Ni (5.13)

jiaN

i

Lijj Ckr,

1

,∏=

⋅= Rj ,...,1= (5.14)

TR

Ea

jj

j

eAk ⋅

⋅= Rj ,...,1= (5.15)

LiLLi CVn ,, ⋅= Ni ,...,1= (5.16)

In Table 5.2 a DOF analysis of equations (5.1) up to and including (5.16) is presented.

Table 5.2 DOF analysis.

Unknown: Algebraic 5N + 2R + 6

Differential N - 1 +

Total 6N + 2R + 5

Initial N - 1

Equations 6N + 2R + 5

19

As can be seen from the DOF analysis in Table 5.2, the total number of Unknown

variables is equal to the number of Equations. Furthermore, the number of Initial

variables equals the number of Differential variables. These two requirements have to be

met in order to have a well-posed system.

5.2 The CAPE-OPEN interface

The CAPE-OPEN (CO) thermodynamic and physical properties interface is applied in

order to guarantee that both the ACM and the gPROMS model make use of identical

thermodynamic methods and physical properties. This interface allows software

components that provide thermodynamic and physical properties calculations to be used

in a CAPE-OPEN compliant Simulation Environment (COSE). The essential

thermodynamic and physical properties data used in CAPE tools is also referred to as

basic data.

The basic data for the SBA process is exported from AspenPlus as a CO file with the

extension *.cota. While exporting, information is written in the MS Windows Registry

containing name, location and ProgId of the so-called Property Package (PP). A Property

Package is a complete, consistent, reusable and ready-to-use collection of methods,

chemical compounds and model parameters for calculating any of a set of known

properties for the phases of a material. It includes all the pure compound methods and

data, together with the relevant mixing rules and interaction parameters [19]. A collection

of PP’s is called a Property System (PS). In order to access the AspenTech PP in

gPROMS, it has to be introduced as a Foreign Object (FO). It then behaves as a sub-

model that makes use of methods (pre-defined functions) that usually require a certain

input (e.g. temperature, pressure, mole fraction) and return the value of the property

corresponding to the method.

The methods used in the gPROMS experiment model are: MolecularWeight,

LiquidFugacityCoefficient, VapourFugacityCoefficient, LiquidVolume and

VapourVolume. Not all existing methods are available in the CO PP that is generated by

AspenPlus. For instance, the method Kvalues that is commonly used to perform a flash

calculation and determine the Vapour-Liquid Equilibrium cannot be accessed. However,

as will follow in the next paragraph, there are many ways to describe the Vapour-Liquid

Equilibrium.

5.3 The vapour-liquid equilibrium (VLE)

For a mixture consisting N components, the liquid and vapour phase are in equilibrium if

the chemical potential of component i in the liquid phase is equal to the chemical

potential of component i in the vapour phase. Furthermore, it must hold that the

temperature and the pressure of both phases are equal [15]. Instead of the chemical

potential, another quantity is introduced named fugacity, for it is more directly related to

20

pressure and therefore often preferred in engineering calculations [15]. The fugacity is

related to the chemical potential by its definition

( ) ( ) ( )( )0,00

00,00

0

,,

,,ln,,,,

ii

ii

iiiixTPf

xTPfRTxTPxTP += µµ (5.17)

From which it follows that for a mixture to be in phase equilibrium, the fugacity of

component i in the liquid phase must be equal to the fugacity of component i in the

vapour phase, this is referred to as the isofugacity condition. Summarizing, a mixture

consisting of a liquid and a vapour phase is in phase equilibrium if the following

conditions are met:

( ) ( )iViiLi yTPfxTPf ,,,, ,, = (5.18)

VL TTT == (5.19)

VL PPP == (5.20)

Next, the fugacity coefficient is introduced for both the liquid and vapour phase defined

by respectively

Pxf iLiLi ⋅⋅= ,, φ ; Pyf iViVi ⋅⋅= ,, φ (5.21)

Combining (5.18) and (5.21) results in

iViiLi yx ⋅=⋅ ,, φφ (5.22)

The fugacity coefficients take account for the non-ideal behaviour of the phases. For an

ideal gas it holds that the vapour fugacity coefficient is one and that the vapour fugacity

of component i is equal to its partial pressure.

1, =Viφ ; iiVi PPyf ≡⋅=, (5.23)

The liquid fugacity is usually expressed in terms of an activity coefficient. This approach

makes the system suited for describing the interaction between multiple liquid phases that

occurs in liquid-liquid and vapour-liquid-liquid systems. The activity coefficient is

derived from the quantity activity, which is defined as the ratio of the actual fugacity and

a reference fugacity of the pure liquid at saturation pressure. The definitions for

respectively activity and activity coefficient are

( )( )0,0

0 ,,

,,

ii

ii

ixTPf

xTPfa ≡ (5.24)

21

i

i

ix

a≡γ (5.25)

Combining (5.24) and (5.25) results in the following expression for the liquid fugacity

0

,, LiiiLi fxf ⋅⋅= γ (5.26)

Furthermore it holds that

ifLi

sat

iLi Pf ,

0

,

0

, Ρ⋅⋅= φ (5.27)

Combining (5.26) and (5.27) results in

ifLi

sat

iiiLi Pxf ,

0

,, Ρ⋅⋅⋅⋅= φγ (5.28)

We now have two different types of expressions for the liquid and the vapour fugacity,

which still need to satisfy the isofugacity condition in (5.18). Therefore, combining and

reorganising (5.18), (5.21) and (5.28) results in a more explicit form of the vapour-liquid

equilibrium equation

ifLiii

sat

iVii xPyP ,

0

,, Ρ⋅⋅⋅⋅=⋅⋅ φγφ (5.29)

This equation is generally valid at pressure conditions below 10 bar and it is the basis

from which approximations can be derived, such as Raoult’s law. Another feature of

equation (5.29) is that it reveals which types of Basic Data are used.

5.3.1 Implementation

The existing ACM model makes use of the Non-Random Two Liquid (NRTL) model that

computes the activity coefficient of the various components. This model is based on the

idea that on microscopic scale the local composition in a mixture deviates from the

overall composition due to intermolecular interaction. An important aspect of the NRTL

model is that it assumes the gas phase is ideal, resulting in a vapour fugacity coefficient

of one for all present components. Furthermore, the Poynting factor if ,Ρ , which is a

pressure correction factor, is also one for all present components. Thus when applying the

NRTL model, the vapour-liquid equilibrium equation (5.29) is reduced to

0

,Liii

sat

ii xPyP φγ ⋅⋅⋅=⋅ (5.30)

The most logical approach would be to implement equation (5.30) in the gPROMS model

as well. Therefore, all the variables apart from the mole fractions in both phases xi and yi

need to be known, since the relation between these fractions for all components is

22

required. The total pressure is known from the experiments and the liquid activity

coefficient together with the liquid fugacity coefficient at reference pressure are available

in the CO package exported from AspenPlus. The saturation pressure can also be

exported from AspenPlus in the CO package, it is however not available. In the ACM

model, the saturation pressure or vapour pressure of a pure component i is calculated as a

function of temperature and a set of seven coefficients, using the extended Antoine

equation. This correlation is also applied in the gPROMS model together with equation

(5.30) to describe the VLE. In order to test that the ACM and the gPROMS model give

similar results, flash calculations without any reactions have been performed. The results

of the vapour-liquid distribution are compared in terms of the liquid and vapour mole

fractions of all components. The relative deviation of the liquid mole fraction of each

component is determined between the ACM and the gPROMS results using

%100, ⋅−

=ACM

i

gPROMS

i

ACM

iiL

x

xxDeviation (5.31)

The relative deviations of the vapour mole fractions are defined equivalently. The results

are displayed in Table 5.3.

Table 5.3 Comparing flash results using the NRTL equation

Component Relative deviation

vapour [%]

Relative deviation

liquid [%]

H2O -4.65E-01 -4.36E-01

H2SO4 7.61E-01 -4.36E-01

SBA -2.14E+00 -4.35E-01

MBS 1.92E+00 -4.38E-01

c2-C4 -1.36E+01 1.75E+00

SBE -1.28E+01 -3.66E-01

t2-C4 -1.29E+01 1.82E+00

1-C4 -1.26E+01 2.26E+00

N2 7.69E-01 9.36E+01

As can be concluded from the results presented in Table 5.3, the flash calculation in

ACM and gPROMS with the NRTL equation show very poor similarity. This is most

likely caused by the fact that the applied saturation pressure correlation has an upper

temperature limit for N2 of 126.2 K, while the simulated experiment takes place at a

constant temperature of 333.15 K. Alternatively, the VLE in gPROMS is described with

the iso-fugacity equation (5.6), which is generally valid. The results of the comparison

between the ACM and the gPROMS model using the iso-fugacity equation are presented

in Table 5.4.

23

Table 5.4 Comparing flash results using the iso-fugacity equation

Component Relative deviation

vapour [%]

Relative deviation

liquid [%]

H2O -8.54E-04 -6.93E-05

H2SO4 -3.20E-04 1.31E-04

SBA 4.03E-04 8.28E-05

MBS < 1E-08 -1.57E-03

c2-C4 -6.61E-04 -1.33E-03

SBE -5.68E-03 -3.35E-03

t2-C4 < 1E-08 8.61E-04

1-C4 < 1E-08 -3.36E-03

N2 -5.12E-05 < 1E-08

The flash calculation output of the ACM model and the gPROMS model using the iso-

fugacity equation is very similar, as can be seen in Table 5.4. The deviations are

considered to be small enough in order to assume that the flash calculation in both

models is identical. Summarizing, the vapour-liquid equilibrium is successfully modelled

in gPROMS using the iso-fugacity equation (5.22), where the fugacity coefficients are

provided through the CAPE-OPEN interface. The developed gPROMS experiment model

and the existing ACM experiment model are considered to be similar enough to perform

a comparative assessment with.

5.4 Experiment data input

In this paragraph the input of experiment data in both gPROMS and ACM is described.

This aspect is considered to be important, since large data sets are frequently encountered

and minor differences in the data input sequence can result in a large difference in

required time and effort.

ACM

Experiment data input in ACM is found under Tools, Estimation. This opens a window

with five tabs, one of which is named Dynamic Experiments. Behind this tab, the user can

introduce an experiment with a weighting factor. Furthermore there is a tick box that

indicates if the experiment is active, which means whether it is taken into account when

performing a parameter estimation run. Experiments can be copied as a whole within this

window, however not to another model. It is not possible to have more than one model

opened in an ACM session in general, but even when two sessions are opened it is not

possible to copy an experiment from one model to the other.

When an experiment is added, it can be edited by clicking Edit. This opens a window

with three tabs, being Measured Variables, Fixed Variables and Initial Variables. Behind

the latter tab, the user specifies the names and values of the variables that are indicated as

initial in the model. It is not possible to copy more than one value, from for instance MS

Excel, into this window. Behind the Fixed Variables tab, the user can specify variables

24

and corresponding values that maintain constant during an experiment, but can vary

between experiments. It is possible to specify time dependant values by selecting a Fixed

Variable and clicking Edit. This enables the user to assign certain values at certain points

in time for the Fixed Variable, with a linear change in time between the various points.

This is indicated with the word ramped where otherwise the value of the Fixed Variable

would appear, as can be seen in Figure 5.1.

Figure 5.1 Screenshot of ACM experiment data input for Fixed Variables.

Behind the Measured Variables tab the user can introduce the variable names followed

by clicking Edit, which opens a window where the actual measurement data input for a

specific experiment and a specific measured variable is required. A table appears from

which three columns can be edited: Time, Weight, and Observed Value. The measurement

weight is optional and can be specified for each individual measurement point. The other

columns (Predicted Value, Absolute Residual, % Residual and Standardized Residual)

are reserved for the results of a parameter estimation run and contain N/A when results

are not available. It is possible to paste two columns with values for the measurement

time and the corresponding observed value for each variable in each experiment.

However, it is not possible to paste measurement data for all measured variables of an

experiment at once, since the data for each measured variable is behind a separate

window.

gPROMS

A so-called project in gPROMS consists of multiple entities such as in this case the

variable types, a model entity with the system equations, a process entity to perform

simulation, an estimation entity and also experiment entities. The measured experiment

data is to be imported in these experiment entities. There is a distinction between

experiment entities for parameter estimation and for experiment design, in this paragraph

the setup of an experiment entity for parameter estimation is described.

25

One of aspects in which gPROMS differs from ACM is its accessibility and open code. A

good example is the experiment entity, which can be accessed and edited both through a

Graphical User Interface (GUI) and by using the gPROMS language. Both are different

representations of one item, therefore an action performed via the GUI is immediately

visible in the language and vise versa. Similar to ACM, the experiment entity contains

three tabs to specify initial conditions, constant or fixed variables and the measured data

itself. These three tabs are accessible via the GUI and the fourth tab contains the

gPROMS language representation of the first three tabs and thereby the entire experiment

entity. The fifth tab named properties contains general file information about the

experiment entity.

In the first tab named initial conditions, the user specifies the names and initial values of

the variables that have a time derivative in the model. In the second tab named controls,

the variable names and values of so-called time-invariant or piecewise constant controls

can be specified. The piecewise constant option enables the user to define a discontinuous

change of a variable with a zero order hold, opposed to a ramped change (first order hold

behaviour) in ACM. In the third tab named measured data, all measured variables and

their values at various points in time during one experiment can be defined in one table.

The complete table with values for all variables can be pasted from a spreadsheet in e.g.

MS Excel. There is an option to transpose the table in gPROMS, which can be useful

depending on how the raw experiment data is available. A measured variable and its data

can be ignored by commenting it out in the gPROMS language representation of the

experiment entity, which is behind the fourth tab. In the estimation entity the user

specifies which experiments to use for parameter estimation and it is straightforward to

exclude complete experiments here. Part of two experiment entities of experiments 97

and 98 is displayed in Figure 5.2. The measured data for experiment 97 is represented

through the GUI, for experiment 98 it is represented in the gPROMS language. The

project tree with its various previously discussed entities can be seen on the left.

Figure 5.2 Screenshot of gPROMS experiment data input entities for experiments 97 and 98.

26

5.5 Assessment on model building

A model of the laboratory batch experiments performed in a CSTR was available in

ACM and an equivalent model has been built in gPROMS. In these batch experiments

some components were significantly present in two phases and therefore a vapour-liquid

equilibrium is taken into account in both models. Part of a gPROMS library model is

used to incorporate this vapour-liquid equilibrium. The required thermodynamic and

physical properties of the components are made available through the CAPE-OPEN

interface. Furthermore, the dynamic experiment data is imported in both software tools.

In this paragraph an assessment is made of both software packages concerning the work

performed in the model building stage.

CAPE-OPEN

Both AspenTech and PSE claim that their software is fully CAPE-OPEN compliant,

however not all methods are supported by the property package exported from

AspenPlus. This was encountered when attempting to describe the vapour-liquid

equilibrium with the commonly used method named K-values. Furthermore, the method

Vapourpressure that is required when using the NRTL approach of describing the VLE,

returned values of 1e35 for all components, which cannot be trusted. Fortunately, the

methods LiquidFugacitycoeffient and VapourFugacitycoefficient are supported by the

property package, which makes it possible to describe the VLE with the iso-fugacity

equation. No compliancy problems were faced on the gPROMS side.

Another issue encountered using the CAPE-OPEN interface was the inability to set the

initial mass holdup of one or more components to zero. Even values smaller than 1e-5

caused gPROMS to fail during initialisation. No further investigation has been made to

whether the problem is on the gPROMS or on the AspenPlus side. The most robust

solution to this problem is to remove the redundant components and their thermodynamic

parameters in AspenPlus and export a new CAPE-OPEN package containing only the

components required in the model. When this solution is not possible for some reason, a

workaround is to run a simulation with the smallest values allowed and save the complete

variable set immediately after a successful initialisation. Use the saved variable set to

initialise a new simulation with smaller values for the components mass holdup where

necessary and again save the variable set just after initialisation. Repeating this procedure

eight times resulted in minimal values for the mass holdup of the relevant components of

1e-13 kg. The first, most robust solution is preferred and is applied in this work.

Summarizing, regarding the use of the CAPE-OPEN interface it is concluded that:

• The CAPE-OPEN thermodynamic and physical properties interface is

successfully applied to describe the vapour-liquid equilibrium.

• AspenPlus does not support all CAPE-OPEN methods when exporting a property

package.

27

• It is advised to remove redundant components and their binary interaction

parameters from a property package.

Experiment data input

What ACM and gPROMS have in common regarding experiment data input are the three

tabs where initial conditions, constant variables and measured variables and their

respective values are specified. For both software programs this can be done by copying

and pasting the values from a spreadsheet in e.g. MS Excel. A difference in the constant

variables tab is that ACM allows to specify constant and ramped (first order hold)

behaviour of a process variable e.g. temperature, where gPROMS has the option to

prescribe constant and piecewise constant (zero order hold) behaviour.

The most significant difference appears in the measured variables tab, where in gPROMS

it is possible to define all the variables and their values that are measured at various

points during the experiment. In ACM these values can be defined of one variable only as

a result of the structure where each variable in each experiment is accessed and edited in

a separate window. Due to this structure, 499 windows would have to be opened, edited

and closed in ACM opposed to 57 in gPROMS in order to perform the experiment data

input for this case study.

The software tools also differ regarding their options to exclude and re-include single

measurements, variables or complete experiments from a parameter estimation run.

Complete experiments can easily be in- or excluded in ACM by means of a tick-box

indicating whether an experiment is active. In gPROMS this can be done with similar

effort by placing or removing comment symbols {…} around or #… in front of the

experiment name in the estimation entity. The advantages of the having an open language

in gPROMS, which can also be represented and edited via a GUI, becomes clear when

excluding and re-including single measurements or measured variables. This can be done

in the language tab of an experiment entity, again by placing or removing comment

symbols. In ACM measured variables or single measurements can be removed from an

experiment via the GUI, this is however permanent and the measurement data is lost.

The conclusions drawn from comparing the experiment data input functionalities in ACM

and gPROMS are:

• Sets of measured values can be copied from a spreadsheet in e.g. MS Excel and

pasted into both software tools.

• The structure in ACM where each measured variable in each experiment is in a

separate window is very inconvenient for large data sets, in gPROMS all

measured variables and their values of an experiment are in a single table.

• Experiment data can easily be excluded and re-included in a parameter estimation

run in gPROMS due to its open language. Apart from complete experiments,

measured data can only be removed permanently and not re-included in ACM.

28

6 Parameter estimation

The various aspects of parameter estimation are discussed in this chapter. First the

development of the reaction scheme throughout the process of parameter estimation in

gPROMS is treated. Next, the solving methods of the two software packages are

highlighted in terms of the objective functions and the solvers that can be used to

minimize the objective function value. The performance of the various solving methods

is evaluated for three cases. The ability and the speed of the available solving methods to

find an optimal solution and the accuracy of the estimated parameters compared to their

standard deviations is assessed. For both software packages, their standard features for

the interpretation of the parameter estimation output results are investigated. Finally, the

conclusions drawn from these various aspects are summarized in the assessment

paragraph.

6.1 Reaction kinetics

The complete reaction scheme as introduced in paragraph 4.3 can be reduced to a

reaction scheme only taking into account the major components that are present in large

quantities in the liquid phase. Furthermore, from flash calculations it is concluded that

more than 99% of the total mixture is in the liquid phase. As a result of these

considerations, it is justified to approximate the kinetics of the reactions that involve the

major components by ignoring the reactions that are involved in forming the volatile

components. The reasons to simplify the kinetic scheme and perform model development

with this reduced scheme first, are to obtain insight in the process of model development

and to get initial guesses for the kinetic parameters of the reactions in the reduced scheme

when making the step to perform model development with the complete kinetic scheme.

Measurements at the end of a typical experiment show that the composition of the

mixture is such that four components are present in relatively large amounts, as can be

seen in Figure 6.1.

H20

H2SO4

SBA

MBS

SBE

t2-C4=

c2-C4=

1-C4=

Figure 6.1 Liquid mole composition in exp114 at t = 75 min.

29

6.1.1 Reaction kinetics reduced scheme

The reaction scheme is reduced to the reactions that involve the four major components

only, as presented in Figure 6.2. In order to maintain consistency with the complete

kinetic scheme, the numbering of the reaction rate constants is not changed.

C4

+ + H2SO4 MBS + H + (k7 and k8)

C4 + + H2O SBA + H + (k9 and k10)

Figure 6.2 Reduced reaction scheme.

The rate expressions for the components and the C4+ intermediate in the reduced scheme

are

]][[]][[ 8424742

++ +−= HMBSkSOHCkr SOH (6.1)

]][[]][[ 84247

++ −= HMBSkSOHCkrMBS (6.2)

]][[]][[ 102492

++ +−= HSBAkOHCkr OH (6.3)

]][[]][[ 10249

++ −= HSBAkOHCkrSBA (6.4)

=+4C

r ]][[]][[ 84247

++ +− HMBSkSOHCk (6.5)

]][[]][[ 10249

++ +− HSBAkOHCk

The concentration of the H+ intermediate is given by (6.6) where the Hammett acidity H0

is introduced, which follows from a correlation as presented in Appendix B.

010][

HH

−+ = (6.6)

The concentration of the C4+ intermediate cannot be measured directly or indirectly.

Therefore, the so-called Bodenstein approximation [11] is applied, which assumes that

the reaction rate of the intermediate is small and constant. In both the reduced and the

complete kinetic scheme these requirements are assumed to hold, resulting in

04

=+Cr (6.7)

Combining (6.5) with (6.7) and rearranging, results in the following expression for the

C4+ intermediate concentration

][][

][][][][

29427

108

4OHkSOHk

SBAkMBSkHC

+

+= ++ (6.8)

30

Next, this expression obtained as a result of the Bodenstein approximation is substituted

in the component rate expressions. Substituting (6.8) in (6.1) up to and including (6.4)

and rearranging results in component rate expressions that can be described by one

overall reaction rate r.

][][

]][[]][[][

29427

42107298

OHkSOHk

SBASOHkkMBSOHkkHr

+

−= + (6.9)

The component rate expressions are

rrr SBASOH ==42

(6.10)

rrr MBSOH −==2

(6.11)

The overall rate expression (6.9) is overparameterised, which becomes clear after

rearranging by dividing both numerator and denominator by k7. As can be seen from

equation (6.12), only three independent clusters of parameters appear, where there are

four individual parameters.

][][

]][[]][[

][

2

7

942

42102

7

98

OHk

kSOH

SBASOHkMBSOHk

kk

Hr

+

= + (6.12)

Applying the Bodenstein approximation reduced the number of independently observable

kinetic parameters by one. Instead of k7 and k9, only their ratio can be observed since

neither of both parameters appears individually in rate expression (6.12) and therefore the

parameter 9k ′ is introduced for which the following holds

7

9

9k

kk =′ (6.13)

Combining (6.12) and (6.13) results in

][][

]][[]][[][

2942

4210298

OHkSOH

SBASOHkMBSOHkkHr

′+

−′= + (6.14)

This overall rate is used to describe the component rate expressions and parameter

estimation is performed with the measurement data at a temperature of 60 °C. The first

aspect that should be considered with regards to interpreting parameter estimation output

is the ability to fit trends in the measurements. The overlay plots show that the predicted

values agree sufficiently with the measured values to confirm a good fit. Next, the

parameter accuracy is investigated. In this work it is considered that a parameter is

31

estimated with sufficient accuracy if the standard deviation is at least one order of

magnitude lower than the value of the estimated parameter. Furthermore, gPROMS

performs a t-test to investigate the individual parameter accuracy and a summary of the

parameter estimation output file is presented in Table 6.1.

Table 6.1 Summary of gPROMS parameter estimation output at T = 60 °C

A 95% t-value for a parameter component smaller than the reference t-value indicates that the data is not sufficient to estimate this parameter precisely.

Parameter Optimal Estimate 95% Confidence

Interval 95% t-value

Standard Deviation

k8 4.818019E+00 2.431668E+05 1.981364E-05 1.228733E+05

k’9 1.564989E-07 7.854225E-03 1.992545E-05 3.968777E-03

k10 3.603384E-07 1.533570E-08 2.349671E+01 7.749199E-09

Reference t-value (95%): 1.657715E+00

The gPROMS output file indicates that not all parameter are estimated with sufficient

accuracy. This conclusion can be drawn both from applying the previously mentioned

criterion on the standard deviation and from the information about the performed t-test.

The reason for the parameter inaccuracy becomes clear when considering the

denominator of the overall rate equation (6.14). Both concentrations that appear in the

denominator are of equal order of magnitude in the applied experiment data, 9k ′ is estimated to be in the order of 1e-7, thus for this dataset the following holds

][][ 4229 SOHOHk <<′ (6.15)

Allowing the simplification of (6.14) to

][

]][[]][[][

42

4210298

SOH

SBASOHkMBSOHkkHr

−′= + (6.16)

Rate expression (6.16) is over-parameterised due to the clustering of k8 and 9k ′ and therefore these two parameters are not individually observable. It is important to note that

the reason for the unobservability in this case is the lack of information in the experiment

data, which is different from the reason why parameters k7 and k9 were individually

unobservable. This was a direct effect of applying the Bodenstein approximation as

previously explained. In order to obtain a unique fit the parameter 9k ′′ is introduced, where

7

98

989k

kkkkk

⋅=′⋅=′′ (6.17)

Combining (6.16) and (6.17) leads to

32

][

]][[]][[][

42

421029

SOH

SBASOHkMBSOHkHr

−′′= + (6.18)

With overall rate equation (6.18) an alternative kinetic scheme is proposed and again

parameter estimation is performed. The overlay plots show a good fit and next the

parameter accuracy is investigated. A summary of the output is presented in Table 6.2.

Table 6.2 Summary of gPROMS parameter estimation output

Parameter Optimal Estimate 95% Confidence

Interval 95% t-value

Standard Deviation

k’’9 7.540134E-07 6.935782E-09 1.087135E+02 3.504683E-09

k10 3.603376E-07 2.781764E-09 1.295357E+02 1.405638E-09

Reference t-value (95%): 1.657715E+00

The 95% t-values of both estimated parameters are larger than the reference t-value,

indicating sufficient accuracy. This is confirmed by considering that the standard

deviation is more than two orders of magnitude lower than the estimated value. Note that

multiplying the previously estimated values of k8 and 9k ′ approximately gives the

estimated value of 9k ′′ . This explains why in estimating k8, 9k ′ and k10 the overlay plots

showed a good fit, although the parameters could not be obtained with sufficient

accuracy. We can now conclude that rate equation (6.18) in combination with (6.10) and

(6.11) describes the behaviour of the major components and that the kinetic parameters

9k ′′ and k10 can be determined accurately. Next, the values of 9k ′′ and k10 are estimated

using the experiments performed at the two other temperatures of 75 and 90 °C. An

overview of the results at the three different temperatures is presented in Table 6.3.

Table 6.3 Parameter estimation results for reduced scheme.

Temperature [°C] Parameter Optimal estimate Standard deviation

60 9k ′′ 7.540134E-07 3.504683E-09

k10 3.603376E-07 1.405638E-09

75 9k ′′ 6.282991E-06 5.263236E-08

k10 3.373953E-06 2.997604E-08

90 9k ′′ 3.320208E-05 4.056078E-07

k10 1.536291E-05 2.040826E-07

33

The parameter accuracy of both 9k ′′ and k10 is sufficient at all three temperatures since the

order of magnitude of the standard deviations is two orders of magnitude less than the

estimated values. Finally, the kinetic parameters of the Arrhenius equation that describes

the temperature dependant behaviour of ki need to be determined. The reaction rate ki of

reaction i depends on the reaction temperature T according to the Arrhenius equation

TR

E

ii

ia

ekk ⋅−

⋅=,

,0 (6.19)

Where R is the universal gas constant and k0,i and Ea,i respectively are the pre-exponential

factor and the activation energy of reaction i. As can be concluded from (6.19), the pre-

exponential factor k0,i is a reference ki at a temperature of infinity. It is better to re-

parameterise (6.19), introducing a different reference kref,i that is related to a reference

temperature Tref that can be chosen within the range of the actual experiment

temperatures [4], [8]. The re-parameterised Arrhenius equation is given by

⋅=TTR

E

irefi

ref

ia

ekk

11

,

,

(6.20)

This re-parameterised Arrhenius equation is introduced in the model and the kinetic

parameters kref,i and Ea,i are estimated using the experiment data from all three

temperatures and a reference temperature of Tref = 75 °C. The results are presented in

Table 6.4.

Table 6.4 Parameter estimation results for reduced scheme, all temperatures

Parameter Optimal estimate Standard deviation

9,refk ′′ 5.696409E-06 3.005909E-08

kref,10 2.850522E-06 1.594955E-08

9,aE ′′ 1.330211E+05 3.907733E+02

Ea,10 1.322812E+05 3.906087E+02

Again sufficient accuracy is achieved and with these four parameters the time dependent

and the temperature dependant behaviour of the major components in all experiments

using the reduced reaction scheme is described.

34

6.1.2 Reaction kinetics complete scheme

For the complete kinetic scheme a similar approach is used as for the reduced kinetic

scheme. The complete kinetic scheme is displayed again in Figure 6.3, to aid the reader in

the derivation of the resulting component rate expressions.

1-C4 + H+ C4

+ (k1 and k2)

c2-C4 + H+ C4+ (k3 and k4)

t2-C4 + H+ C4+ (k5 and k6)

C4+ + H2SO4 MBS + H+ (k7 and k8)

C4+ + H2O SBA + H+ (k9 and k10)

C4+ + SBA SBE + H+ (k11 and k12)

Figure 6.3 Complete reaction scheme.

The resulting component rate expressions are

][]][1[ 42411 4

++− +−−= CkHCkr C (6.21)

][]][2[ 44432 4

++− +−−= CkHCckr Cc (6.22)

][]][2[ 46452 4

++− +−−= CkHCtkr Ct (6.23)

]][[]][[ 8424742

++ +−= HMBSkSOHCkr SOH (6.24)

]][[]][[ 84247

++ −= HMBSkSOHCkrMBS (6.25)

]][[]][[ 102492

++ +−= HSBAkOHCkr OH (6.26)

]][[]][[ 12411

++ −= HSBEkSBACkrSBE (6.27)

=SBAr ]][[]][[ 10249

++ − HSBAkOHCk (6.28)

]][[]][[ 12411

++ +− HSBEkSBACk

35

=+4C

r +−−+−− ++++ ][]][2[][]][1[ 44434241 CkHCckCkHCk

+−−− +++ ]][[][]][2[ 42474645 SOHCkCkHCtk

(6.29)

]][[]][[]][[ 102498

+++ +− HSBAkOHCkHMBSk

]][[]][[ 12411

++ +− HSBEkSBACk

010][

HH

−+ = (6.30)

Applying the Bodenstein approximation for the complete scheme, combining (6.7) and

(6.29), results in the following expression for the concentration of C4+ as a function of the

kinetic parameters and the concentration of the other components

][][][

][][][]2[]2[]1[][][

1129427642

12108454341

4SBAkOHkSOHkkkk

SBEkSBAkMBSkCtkCckCkHC

+++++

+++−+−+−= ++

(6.31)

Substituting the expression for [C4+] in the original component rate expressions (6.21) up

to and including (6.28), rearranging and dividing numerator and denominator by k7

(similar to the approach for the reduced scheme), leads to the following component rate

expressions (6.32) to (6.39). These expressions are presented such that the clustering of

individual parameters is clearly visible.

=− 41 Cr (6.32)

+++−+−+ ][][][]2[]2[][7

122

7

102

7

82

4

7

52

4

7

32 SBEk

kkSBA

k

kkMBS

k

kkCt

k

kkCc

k

kkH

+

+++⋅−− ][][][]1[

7

1112

7

91421

7

61

7

414 SBA

k

kkOH

k

kkSOHk

k

kk

k

kkC

_____________________________________________________________

][][][7

112

7

9

42

7

6

7

4

7

2 SBAk

kOH

k

kSOH

k

k

k

k

k

k+++++

36

=− 42 Ccr (6.33)

+++−+−+ ][][][]2[]1[][7

124

7

104

7

84

4

7

54

4

7

41 SBEk

kkSBA

k

kkMBS

k

kkCt

k

kkC

k

kkH

+

+++⋅−− ][][][]2[

7

1132

7

93423

7

63

7

324 SBA

k

kkOH

k

kkSOHk

k

kk

k

kkCc

_____________________________________________________________

][][][7

112

7

9

42

7

6

7

4

7

2 SBAk

kOH

k

kSOH

k

k

k

k

k

k+++++

=− 42 Ctr (6.34)

+++−+−+ ][][][]2[]1[][7

126

7

106

7

86

4

7

63

4

7

61 SBEk

kkSBA

k

kkMBS

k

kkCc

k

kkC

k

kkH

+

+++⋅−− ][][][]2[

7

1152

7

95425

7

54

7

524 SBA

k

kkOH

k

kkSOHk

k

kk

k

kkCt

_____________________________________________________________

][][][7

112

7

9

42

7

6

7

4

7

2 SBAk

kOH

k

kSOH

k

k

k

k

k

k+++++

=42SOHr (6.35)

+

+++⋅++ ][][][][

7

1182

7

98

7

86

7

84

7

82 SBAk

kkOH

k

kk

k

kk

k

kk

k

kkMBSH

( )}][][]2[]2[]1[][ 121045434142 SBEkSBAkCtkCckCkSOH ++−+−+−⋅−

____________________________________________________________

][][][7

112

7

9

42

7

6

7

4

7

2 SBAk

kOH

k

kSOH

k

k

k

k

k

k+++++

37

=MBSr (6.36)

+

+++⋅−+ ][][][][

7

1182

7

98

7

86

7

84

7

82 SBAk

kkOH

k

kk

k

kk

k

kk

k

kkMBSH

( )}][][]2[]2[]1[][ 121045434142 SBEkSBAkCtkCckCkSOH ++−+−+−⋅+

____________________________________________________________

][][][7

112

7

9

42

7

6

7

4

7

2 SBAk

kOH

k

kSOH

k

k

k

k

k

k+++++

=OHr 2 (6.37)

++−+−+−⋅−+ ][]2[]2[]1[][][

7

984

7

954

7

934

7

912 MBS

k

kkCt

k

kkCc

k

kkC

k

kkOHH

++++⋅+

][][][][

7

111042

7

106

7

104

7

102

7

129 SBAk

kkSOH

k

kk

k

kk

k

kkSBASBE

k

kk

_____________________________________________________________

][][][7

112

7

9

42

7

6

7

4

7

2 SBAk

kOH

k

kSOH

k

k

k

k

k

k+++++

=SBEr (6.38)

+

+−+−+−⋅+ ][]2[]2[]1[][][

7

1184

7

1154

7

1134

7

111 MBSk

kkCt

k

kkCc

k

kkC

k

kkSBAH

++++⋅−

][][][][ 2

7

1294212

7

126

7

124

7

122

7

1110 OHk

kkSOHk

k

kk

k

kk

k

kkSBESBA

k

kk

_____________________________________________________________

][][][7

112

7

9

42

7

6

7

4

7

2 SBAk

kOH

k

kSOH

k

k

k

k

k

k+++++

38

=SBAr (6.39)

( )

+−+−+−⋅

−+ ][]2[]2[]1[][][][ 8454341

7

112

7

9 MBSkCtkCckCkSBAk

kOH

k

kH

( )

+++⋅−− ][][][ 42

7

6

7

4

7

21210 SOH

k

k

k

k

k

kSBEkSBAk

_____________________________________________________________

][][][7

112

7

9

42

7

6

7

4

7

2 SBAk

kOH

k

kSOH

k

k

k

k

k

k+++++

Similar to the approach for the reduced scheme, applying the Bodenstein approximation

reduces the number of independently determinable parameters by one. For the complete

scheme, the parameters that have become observable only in relation to k7 as a result of

applying the Bodenstein approximation are: k2, k4, k6, k9 and k11. A new set of parameters

is introduced: 2k ′ , 4k ′ , 6k ′ , 9k ′ and 11k ′ for which it holds that

7k

kk i

i =′ ; i = 2, 4, 6, 9, 11 (6.40)

The complete set including the new kinetic parameters is estimated with the experiment

data at 60 °C. A summary of the output is presented in Table 6.5.

Table 6.5 Parameter estimation output complete scheme at T = 60 °C.

A 95% t-value for a parameter component smaller than the reference t-value indicates that the data is not sufficient to estimate this parameter precisely.

Parameter Optimal Estimate 95% Confidence

Interval 95% t-value

Standard Deviation

k1 4.761434E-06 8.556002E-08 5.565022E+01 4.346984E-08

k’2 7.150141E-05 1.478812E-01 4.835058E-04 7.513289E-02

k3 7.534677E-06 1.312637E-07 5.740109E+01 6.669015E-08

k’4 9.864279E-04 2.040164E+00 4.835043E-04 1.036531E+00

k5 4.694857E-06 8.984409E-08 5.225559E+01 4.564642E-08

k’6 9.517142E-04 1.968360E+00 4.835061E-04 1.000050E+00

k8 2.213487E+00 4.577868E+03 4.835192E-04 2.325843E+03

k’9 4.886781E-07 1.010665E-03 4.835214E-04 5.134810E-04

k10 5.232387E-07 2.367798E-09 2.209812E+02 1.202989E-09

k’11 1.584490E-08 3.277003E-05 4.835180E-04 1.664922E-05

k12 5.170035E-08 2.045581E-08 2.527417E+00 1.039283E-08

Reference t-value (95%): 1.649902E+00

39

From Table 6.5 it becomes clear that 2k ′ , 4k ′ , 6k ′ , k8, 9k ′ and 11k ′ cannot be estimated with sufficient accuracy, as indicated by their 95% t-values and standard deviations with

respect to the estimated values. The reason for the inability to estimate these parameters

accurately is clustering, which is indicated by the correlation matrix presented in Table

6.6.

Table 6.6 Correlation matrix complete scheme at T = 60 °C.

k1 2k ′ k3 4k ′ k5 6k ′ k8 9k ′ k10 11k ′ k12

k1 1

2k ′ 0.34 1

k3 0.15 0.34 1

4k ′ 0.34 1.00 0.34 1

k5 0.13 0.28 0.15 0.28 1

6k ′ 0.34 1.00 0.34 1.00 0.28 1

k8 -0.34 -1.00 -0.34 -1.00 -0.28 -1.00 1

9k ′ 0.34 1.00 0.34 1.00 0.28 1.00 -1.00 1

k10 -0.02 -0.07 -0.02 -0.07 0.01 -0.07 0.07 -0.07 1

11k ′ 0.34 1.00 0.34 1.00 0.28 1.00 -1.00 1.00 -0.07 1

k12 0.00 -0.02 0.00 -0.02 0.00 -0.02 0.02 -0.02 -0.03 -0.02 1

The cross-correlations of the parameters 2k ′ , 4k ′ , 6k ′ , k8, 9k ′ and 11k ′ are all one or minus one, indicating that they are all completely coupled due to the clustering in the

component rate expressions. From the components rate expressions (6.32) up to and

including (6.39), it can be concluded that all except one of the coupled parameters are

also present outside a cluster of parameters. The one parameter that only appears

clustered is k8. Therefore, similar to the reduced scheme a new set of parameters is

introduced for which the following holds

7

8

8k

kkkkk i

ii

⋅=′⋅=′′ ; i = 2, 4, 6, 9, 11 (6.41)

Parameter estimation is performed with the complete set, including the newly defined

parameters and the overlay plots show a good fit. The result for the parameter estimates

and their accuracy is presented in Table 6.7.

40

Table 6.7 Parameter estimation output complete scheme at T = 60 °C.

Parameter Optimal Estimate 95% Confidence

Interval 95% t-value

Standard Deviation

k1 4.761457E-06 8.203025E-08 5.804514E+01 4.167649E-08

k’’2 1.582677E-04 1.493950E-06 1.059391E+02 7.590201E-07

k3 7.534717E-06 1.260763E-07 5.976313E+01 6.405466E-08

k’’4 2.183451E-03 2.453116E-05 8.900725E+01 1.246336E-05

k5 4.694869E-06 8.781254E-08 5.346468E+01 4.461426E-08

k’’6 2.106610E-03 2.148849E-05 9.803437E+01 1.091749E-05

k’’9 1.081686E-06 8.733718E-09 1.238517E+02 4.437275E-09

k10 5.232403E-07 2.405954E-09 2.174773E+02 1.222375E-09

k’’11 3.507204E-08 6.196273E-10 5.660184E+01 3.148094E-10

k12 5.168586E-08 2.045040E-08 2.527377E+00 1.039008E-08

Reference t-value (95%): 1.649902E+00

All parameters can be determined with sufficient accuracy according to the t-test. Except

for k12, the standard deviations are all small enough compared to the estimated values of

the parameters. Furthermore, introducing new parameters had the desired effect of

decoupling the complete set of kinetic parameters as can be seen in the correlation matrix

in Table 6.8.

Table 6.8 Correlation matrix complete scheme at T = 60 °C.

k1 2k ′′ k3 4k ′′ k5 6k ′′ 9k ′′ k10 11k ′′ k12

k1 1

2k ′′ 0.88 1

k3 0.08 -0.01 1

4k ′′ 0.01 0.00 0.92 1

k5 0.07 0.00 0.09 0.03 1

6k ′′ 0.00 0.01 0.01 0.03 0.90 1

9k ′′ -0.15 0.12 -0.17 0.05 -0.13 0.10 1

k10 0.04 0.12 0.04 0.10 0.06 0.11 0.73 1

11k ′′ -0.04 0.00 -0.04 -0.01 -0.04 0.00 0.06 -0.06 1

k12 0.01 0.00 0.01 0.00 0.00 0.00 -0.02 -0.03 0.82 1

The results of parameter estimation with the complete reaction scheme and the newly

introduced parameters are satisfactory for the experiment data at 60 °C. Next, the kinetic

parameters at the other temperatures of 75 °C and 90 °C are estimated. This proved

unsuccessful for the experiment data at 90 °C, due to lack of information in the data. At

higher temperatures, the reactions occur faster and steady state is reached quickly. The

experiment data at 90 °C shows little variation, which suggest that the reactions were

41

near steady state during those measurements. As in the reduced scheme, the final step is

to estimate the kinetic parameters kref and Ea in equation (6.42), which describes the

temperature dependent behaviour of the reactions. Again the reference temperature is

chosen in the middle of the investigated range, at 75 °C.

⋅=TTR

E

irefi

ref

ia

ekk

11

,

,

(6.42)

The results of parameter estimation with the complete scheme with the newly introduced

parameters, using all experiment data are presented in Table 6.9.

Table 6.9 Parameter estimation results for complete scheme.

Parameter Optimal

estimate

Standard

deviation

kref,1 1.654031E-05 1.166450E-07

2,refk ′′ 9.634294E-04 4.961189E-06

kref,3 2.733107E-05 2.336426E-07

4,refk ′′ 1.270317E-02 9.044965E-05

kref,5 1.401267E-05 9.320541E-08

6,refk ′′ 1.098498E-02 4.971996E-05

9,refk ′′ 1.067818E-05 7.025021E-08

kref,10 5.656640E-06 3.539302E-08

11,refk ′′ 3.317467E-07 1.451926E-09

kref,12 1.161906E-06 2.716320E-08

Ea,1 7.109098E+04 6.497152E+02

2,aE ′′ 1.133150E+05 4.041160E+02

Ea,3 7.630520E+04 6.953930E+02

4,aE ′′ 1.113993E+05 5.319371E+02

Ea,5 5.974739E+04 6.347992E+02

6,aE ′′ 1.022666E+05 3.654522E+02

9,aE ′′ 1.574787E+05 4.402616E+02

Ea,10 1.557858E+05 4.108734E+02

11,aE ′′ 1.507085E+05 3.836134E+02

Ea,12 1.499830E+05 2.077247E+03

As the results in Table 6.9 indicate, the accuracy is sufficient for all parameters. It is

concluded that the kinetic parameters of the complete reaction scheme with the newly

introduced parameters can be determined sufficiently accurate in combination with the

available measurement data. Although the parameter values are accurately determined, it

42

is important to recognize that the model with this set of parameters will probably not

accurately describe experiments under conditions where the reduction of the set 2k ′ , 4k ′ ,

6k ′ , k8, 9k ′ and 11k ′ to the set of parameters 2k ′′ , 4k ′′ , 6k ′′ , 9k ′′ and 11k ′ is no longer valid. From the analysis of the reduced scheme it follows that the case that equation (6.15) no

longer holds, could lead to such a the situation.

6.2 Solving methods

In both software tools, the mathematical problem to be solved in order to obtain optimal

parameter estimates is the minimization of an objective function. The combination of an

objective function with a solver is referred to as a solving method. For gPROMS, one

objective function is available, which is minimized with one specifically designed solver,

resulting in one solving method. For ACM, the user can choose from two objective

functions, each of which can be solved by two solvers, leaving four solving methods. A

schematic overview of the objective functions and their solvers of both software tools is

presented in Figure 6.4.

Software

package

gPROMS ACM

Objective

function

Max log

likelihood

Least

squares

Max log

likelihood

Optimisation

solver

MXLKHD

(indirect)

NL2SOL

(indirect)

Nelder-Mead

(direct)

FEASOPT

(indirect) Figure 6.4 Overview of solving methods for both software packages.

In this chapter, first the objective functions will be discussed for each software package,

followed by the various optimisation solvers.

6.2.1 Objective functions

In general, the process of parameter estimation is obtaining values for a set of model

parameters θ such that the set of predicted model output ( )θz agrees best with the set of

43

measured values z~ . The qualification ‘agrees best’ can be specified in various ways and

depends on how the objective function is defined. The difference between the measured

values and the predicted model output for measurement i is

( )iii zz θξ −= ~ (6.43)

The principle of maximum likelihood estimation takes into account disturbances and

measurement noise, assuming stochastic behaviour for iξ [5]. The Gaussian probability

density function, which assumes iξ is independent and normally distributed with zero

means and standard deviation iσ , for the total number of measurements N is

( )∑

=

−N

i i

i

ep

N

i

i

2

2

2

2

1 σ

ξ

σπξ (6.44)

Combining with (6.43) gives

( )( )( )

=

−−

N

i i

ii zzN

i

i ezp2

2

2

~

2

1;~

σ

θ

σπθ (6.45)

This function describes the probability of iz~ for a known set of model parameters θ. In

the reverse case when iz~ is known and θ unknown, (6.45) is changed from the á priori

probability density function to the following á posteriori likelihood function

( )( )( )

=

−−

N

i i

ii zzN

i

i ezL2

2

2

~

2

1;~

σ

θ

σπθ (6.46)

The maximum of this likelihood function ( )θ;~izL with corresponding argument θ̂ , gives the values for the set of unknown model parameters θ for which the plausibility is

maximal that the corresponding iz~ would have been measured. The algebraic maximum

of L is found in the set θ̂ , where the first derivate with respect to θ is zero and around θ̂ the sign of the first derivative changes from positive to negative. Finding the maximum

of L is equivalent to finding the minimum of )ln(L− , as follows from

( )( ) ( )( )

θθθ ∂∂−

=∂∂

∂−∂

=∂

−∂ L

L

L

L

LL 1lnln (6.47)

The objective function to be minimized when applying maximum likelihood is thus

44

( )( ) ( ) ( )( )( )

2

2

2

~

ln2ln2

;~lni

N

i ii

ii

zzN

NzL

σ

θσπθ ∑ −

++=− (6.48)

Minimizing (6.48) for the special case of a constant standard deviation σi is equivalent to

minimizing the weighted least squares objective (6.49)

( )( )

−∑N

i ii

i

zz2

2

~1min θ

σθ (6.49)

When the standard deviation is equal for all measurements, the weighting factor is

unnecessary, reducing the problem to the minimization of the normal least squares

function.

In both software programs a distinction in the measurements is made in three levels, each

with a separate index. The experiment number is indicated by i, the measured variable

has index j and for dynamic experiments the measurement number is labeled k.

Furthermore, the inequality constraints on the parameters to be estimated are given by

lower and upper bounds

UL θθθ ≤≤ (6.50)

The model equations provide the equality constraints to which the optimisation of θ is

subjected.

gPROMS

The maximal log likelihood objective function is implemented in gPROMS as follows

( ) ( ) ( )

−++=Φ ∑∑∑

= = =

NE

i

NV

j

NM

k ijk

ijkijk

ijk

i ij zzN

1 1 12

2

2

~

lnmin2

12ln

2 σσπ

θ (6.51)

This is identical to what is expected from theory as presented in equation (6.48). Various

variance models can be applied as presented in Table 6.10.

Table 6.10 Variance models in gPROMS.

Variance model Mathematical description

Constant variance 22 ωσ =

Constant relative variance predicted values ( )εωσ += z22

Constant relative variance measured values ( )εωσ += z~22

Heteroscedastic predicted values ( )γεωσ += z22

Heteroscedastic measured values ( )γεωσ += z~22

45

Any of the available variance models can be described by the accompanying

heteroscedastic formulation, where the value of γ is bounded between 0 and 1. The

heteroscedasticity parameters ω and γ can be given fixed values or they can be

determined as a part of the estimation.

ACM

The maximal log likelihood objective function for dynamic experiments is implemented

in ACM as

( )( ) ( )

+

−++ ∑∑∑ ∑∑

= == = =

NE

j

NM

k

ijkj

NV

i

i

NE

j

NM

k ijk

ijkijk

jiii

ijij

i

zWz

zzWnnn

1 11

2

1 1

2 ~ln~

~

ln12ln2

1min λπ λθ

(6.52)

Due to an alternative implementation, the heteroscedasticity parameter is bounded

between 0 and 2 and therefore indicated with λ.

A second option is the weighted least squares objective function, implemented as the

following minimization problem

( )

−∑∑∑= = =

NE

i

NV

j

NM

k

ijkijkijk

i ij

zzW1 1 1

22 ~minθ

(6.53)

It is advised in the documentation to set the weights of the measured variables to the

reciprocal of the standard deviation. The resulting minimization problem is identical to

(6.48) as derived from the maximal likelihood formulation with a constant standard

deviation.

6.2.2 Optimisation solvers

gPROMS MXLKHD

One solver named MXLKHD is available for parameter estimation, which is specifically

designed for solving maximal likelihood optimisation problems. This solver applies a

sophisticated sequential quadratic programming (SQP) method to find the global

optimum. It is an indirect solver since it calculates the objective function gradient with

respect to the parameters to be estimated and uses this 1st order derivative information to

determine its search direction. Several settings of the solver can be adjusted as is clearly

explained in [21].

46

ACM

Nelder-Mead

Both objective functions can be minimized using the direct Nelder-Mead solver. This is a

direct solver, as it does not make use of any 1st order derivative information. It is a

simplex method for the minimization of a function of n variables, which depends on

comparison of function values at the (n + 1) vertices of a general simplex, followed by

the replacement of the vertex with the highest value [17]. The general idea can be

illustrated by an example where n = 2, as shown in Figure 6.5.

Figure 6.5 Nelder-Mead simplex method for n = 2.

The function values of the three vertices of the simplex are determined by function

evaluation and the function value of vertex p1 is the highest. This vertex is replaced by p4,

which is the reflection of p1 through the centroid of the initial simplex, multiplied by a

reflection factor. Next, the function value of p4 is computed and again the highest

function value is replaced. Apart from reflection, the simplex can also contract or expand,

depending on whether the new point is a new minimum or still the maximum function

value of the vertices.

This solver requires many iterations, which itself are computationally cheap since they

only involve function evaluations and no gradient or Hessian information [17].

Furthermore, it is a robust solver, which will most certainly find a solution. The simplex

can however be trapped in a local minimum and it is advised to perform the optimisation

starting from various initial points to make sure that the proposed solution is indeed the

global minimum. The Nelder-Mead solver proved to be fast for estimating the four

parameters of the reduced reaction scheme, but very slow for the 20 parameters of the

complete reaction scheme.

NL2SOL

This solver can be applied only in combination with the Least Squares objective function.

It is a dense adaptive non-linear least squares algorithm developed by Dennis, Gay and

Welsch [6]. It is an indirect solver since it calculates gradient information. NL2SOL

maintains a secant approximation to the second-order part of the least squares Hessian

and adaptively decides when to use this approximation. The step choice algorithm is

p1

p2

p3

p2

p3

p4

47

based on minimizing a local quadratic model of the sum of squares function constrained

to an elliptical trust region centred at the current approximate minimiser.

FEASOPT

This feasible optimisation solver can only be applied in combination with the maximal

log likelihood objective function. The actual solver is VF02AD, which is a dense

sequential quadratic programming method from the Harwell subroutine library [10]. It is

based on the BFGS algorithm that minimizes a Lagrange multiplier penalty function,

subject to nonlinear equality and inequality constraints. FEASOPT makes use of

derivative information and is therefore an indirect solver.

6.3 Performance

The performance of the parameter estimation in both software packages is first assessed

in terms of ability and speed of obtaining an optimal solution. This is followed by an

assessment on the accuracy of the solution.

6.3.1 Ability and speed

An overview of the performance of the solving methods in terms of ability, and CPU time

is presented in Table 6.11. The computer central processing unit is 2.66 GHz with 1 GB

RAM. In the first case where two parameters are estimated in the reduced scheme, the

experiments at 60 °C are used.

Table 6.11 CPU time [h] for various parameter estimation cases.

Software package

gPROMS ACM

Objective function

MLL LSQ MLL

Optimisation

solver

MXLKHD NL2SOL NM NM FEASOPT

Reduced scheme

(2 parameters)

0.0291 0.1944 0.4717 0.3781 1.2242

Reduced scheme

(4 parameters)

0.2902 1.1511 3.9886 3.1522 3.5011

Complete scheme

(20 parameters)

0.7053 2.954 29.833 33.559 failed

48

From Table 6.11 it can be concluded that gPROMS performs best in terms of ability and

required CPU time to find the optimal solution. Furthermore, for ACM the combination

of the Least Squares objective function with the NL2SOL optimisation solver performs

better than the other combinations possible. The Nelder-Mead optimisation solver is

reasonably fast for a small problem (with little parameters), but the required CPU-time

increases significantly with the number of parameters to be estimated.

6.3.2 Accuracy

Apart from the ability and speed of finding an optimal solution to the parameter

estimation problem, also the accuracy of the solution is of interest. A measure for the

parameter accuracy is the ratio between the estimated value and corresponding the

standard deviation, according to

2,

ˆ

i

iiaccr

σθ

= (6.54)

In this case study, an accuracy ratio equal to 10 is considered to be sufficient, where a

ratio larger than 100 is qualified as very good. The parameter estimation results for both

software packages, with the solving methods in ACM that resulted in the highest and the

lowest accuracy ratios for that case, are presented in Tables 6.12 up to and including

6.14, corresponding to the three cases in Table 6.11.

Table 6.12 Optimal estimate and accuracy ratio for case: Reduced scheme (2 parameters).

gPROMS ACM LSQ NM ACM MLL NM

θ̂ racc θ̂ racc θ̂ racc

9k ′′ 7.540E-07 215 7.946E-07 35.0 7.728E-07 28.4

k10 3.603E-07 265 3.721E-07 41.0 3.760E-07 22.2

For this case it is concluded that for both software packages the estimated parameters are

very similar and that all accuracy ratios indicate sufficient accuracy. For gPROMS the

accuracy ratios are higher then for the best result in ACM.

Table 6.13 Optimal estimate and accuracy ratio for case: Reduced scheme (4 parameters).

gPROMS ACM LSQ NL2SOL ACM MLL NM

θ̂ racc θ̂ racc θ̂ racc

9,refk ′′ 5.696E-06 190 6.117E-06 27.0 5.872E-06 14.7

kref,10 2.851E-06 179 3.040E-06 25.4 2.943E-06 13.3

9,aE ′′ 1.330E+05 340 1.343E+05 48.7 1.295E+05 26.5

Ea,10 1.323E+05 339 1.343E+05 48.0 1.291E+05 23.4

49

Table 6.13 shows that again the values of the estimated parameters are similar for the

three solving methods and that all accuracy ratios are larger than 10 indicating sufficient

accuracy.

Table 6.14 Optimal estimate and accuracy ratio for case: Complete scheme (20 parameters).

gPROMS ACM LSQ NL2SOL ACM MLL NM

θ̂ racc θ̂ racc θ̂ racc

kref,1 1.654E-05 142 2.316E-05 10.0 4.993E-02 0.000276

2,refk ′′ 9.634E-04 194 1.345E-06 13.3 1.965E-03 0.000276

kref,3 2.733E-05 117 3.629E-05 9.20 1.879E-05 0.795

4,refk ′′ 1.270E-02 140 1.690E-05 11.0 9.382E-06 0.945

kref,5 1.401E-05 150 1.846E-05 10.9 1.243E-05 1.23

6,refk ′′ 1.098E-02 221 1.452E-05 16.1 1.052E-05 1.70

9,refk ′′ 1.068E-05 152 1.082E-05 12.2 3.965E-06 22.5

kref,10 5.657E-06 160 5.786E-06 12.7 2.450E-06 23.2

11,refk ′′ 3.317E-07 228 3.341E-07 15.5 3.801E-07 2.05

kref,12 1.162E-06 42.8 1.168E-06 3.06 1.593E-06 0.867

Ea,1 7.109E+04 109 1.029E+05 10.5 1.000E+05 0.000340

Ea,2 1.133E+05 280 1.348E+05 21.6 1.000E+05 0.000395

Ea,3 7.630E+04 110 8.642E+04 9.56 1.000E+05 0.890

Ea,4 1.114E+05 209 1.204E+05 17.4 1.050E+05 1.22

Ea,5 5.975E+04 94.1 6.982E+04 7.89 1.000E+05 1.18

Ea,6 1.023E+05 280 1.102E+05 21.5 1.011E+05 1.95

Ea,9 1.575E+05 358 1.588E+05 27.7 1.000E+05 28.5

Ea,10 1.558E+05 379 1.572E+05 29.3 1.065E+05 32.5

Ea,11 1.507E+05 393 1.508E+05 26.3 1.000E+05 1.61

Ea,12 1.500E+05 72.2 1.484E+05 4.73 1.000E+05 0.782

Comparing the optimal estimates in Table 6.14 from the gPROMS and the ACM LSQ

NL2SOL parameter estimation, it is concluded that the values for the parameters in

reactions 9 up to and including 12, which are related to the major components are very

similar; their relative difference is less than 3%. The parameters 2,refk ′′ , 4,refk ′′ and 6,refk ′′ ,

which are related to the formation of the three butenes, differ significantly with about

7.5e4 %. This is probably due to the fact that different objective functions are applied in

combination with different optimisation solvers. The accuracy ratio is sufficient for most

parameters in the ACM LSQ NL2SOL method and very good for gPROMS.

50

In all three cases the ACM MLL NM method resulted in the poorest accuracy ratio. In the

first two cases, the optimal estimates are close to the values found with the best method

in ACM for that case. This does not hold for the last case with 20 parameters and in

combination with the very low accuracy ratios, this leads to distrust the results for the

ACM MLL NM method in this case. The most accurate method in ACM is considered to

be the LSQ NL2SOL combination, with the highest accuracy ratios for the last two cases

and near-best ratios in the first case. In all three cases, gPROMS has the highest accuracy

ratios, which are roughly a factor of 10 higher than the ACM LSQ NL2SOL method.

6.3.3 Effect of optimisation tolerance

The sensitivity of the optimal solution with respect to the optimisation tolerance is

investigated for gPROMS and the most accurate solving method in ACM, which is the

LSQ NL2SOL combination. Both software packages offer a variety of tolerances in their

solvers used for parameter estimation, in this analysis the optimisation tolerance that

determines the optimisation termination criterion is of interest. The results for parameter

kref,3 in the complete scheme are displayed in Figure 6.6, showing the typical behaviour

that is observed for all parameters. On the vertical axis the value of the optimal estimate

is displayed and the numbers 1 up to 9 on the horizontal axis correspond to the elements

in the vector of values used for the optimisation tolerance according to (6.55)

[ ]1119171615141312111 −−−−−−−−− eeeeeeeee (6.55)

0.00E+00

1.00E-05

2.00E-05

3.00E-05

4.00E-05

5.00E-05

0 2 4 6 8 10

Kre

f 3 [s^-1

]

gPROMS ACM

Figure 6.6 Optimal estimate as a function of optimisation tolerance specified in (6.55)

The time required to obtain the solution as a function of the optimisation tolerance

specified in (6.55) is presented in Figure 6.7.

51

0

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

0 2 4 6 8 10

tim

e [h]

gPROMS ACM

Figure 6.7 Time required to obtain solution as a function of optimisation tolerance specified in (6.55)

For gPROMS the estimator optimisation tolerance named OptTol has an effect on the

solution over the entire range of applied values in vector (6.55). Tightening the tolerance

beyond 1e-6 no longer has significant effect on the estimated value, it does have effect

however on the time required to obtain the optimal solution.

For ACM it was found that tightening the estimator optimisation tolerance named

Solution convergence tolerance beyond 1e-4 has no effect on the values of the parameter

estimates and on the time required to obtain the solution. The reason for this was found to

be that ACM LSQ NL2SOL uses three termination criteria and whichever one is met

first, causes the optimisation to terminate and present the solution. These criteria are:

Solution convergence tolerance, Relative function tolerance and Absolute function

tolerance. Tightening the Solution convergence tolerance beyond 1e-4 caused the

optimisation to terminate on the Relative function tolerance for which the default value of

1e-6 was used. It was attempted to tighten the Relative function tolerance, this resulted in

failing optimisations and required adjusting other solver settings and was not further

investigated.

6.4 Output interpretation

Once the parameter estimation with one of the CAPE tools is successfully completed, the

output should be carefully examined. One cannot just take the values of the predicted

parameters and claim they are correct “because the software program calculated it”. The

output interpretation can be divided into two parts. First, what should be investigated is

how the predicted model outputs compare with the measured values from experiments.

This is usually done graphically with the use of so-called overlay plots. Secondly, the

statistical significance and accuracy of the estimated parameters should be analyzed.

Further statistical information, such as a correlation matrix and confidence ellipsoids, can

provide additional insight in the problem under investigation. Both parts of output

interpretation are discussed for both software tools in this paragraph.

52

6.4.1 Overlay plots

In order to have a general idea whether the model is capable of describing the observed

behaviour in the experiments, the predicted model output is compared with measured

values in overlay plots. These plots give a good indication if the model can predict trends

in the measured data. When this is not the case, we speak of a bad fit and an alternative

model should be suggested. In case of good resemblance at first sight, the next step is to

investigate the residuals, which are the differences between the predicted and measured

values. Patterns in the residuals indicate a systematic mismatch and a lack of encaptured

system behaviour in the model. Ideally, the residuals should appear to be randomly

distributed around a zero mean. For both software tools, the accessibility and

functionality of overlay and residual plots is discussed.

ACM

The functionality of making overlay plots is available in ACM. It is found under the tab

Tools, Estimation, Dynamic Experiments. Here the user can choose an experiment and

click Edit, than choose a measured variable and Edit again. The resulting window is the

same where the experiment data input takes place and where the predicted values are

stored after an estimation run is successfully completed. Furthermore, provided the

predicted values are available after a successful parameter estimation run, the absolute,

relative and standardized residuals are calculated and presented in the same table.

Clicking the Plot button generates an overlay plot of the measured and predicted values.

If the predicted values available are not available, the user is made aware of this via a

pop-up message and a plot is generated of the measured values only. There is an option to

plot the predicted values at measurement times connected by straight lines, or to plot a

continuous curve, which requires a simulation of that particular experiment with the

estimated model parameters to calculate the specific model output over the entire

experiment duration. An example of an overlay plot with a continuous curve of the liquid

mass fraction of component MBS during experiment 105 is presented in Figure 6.8.

Estimation: Measured Variable - R_105

Time: Hours

B1.xmf1("MBS") kg/kg

0 0.5 1 1.5 2 2.5 3 3.5 4

0.1

0.15

0.2

Observed

Predicted

Figure 6.8 Continuous overlay plot of liquid mass fraction MBS in experiment 105.

53

The overlay plot gives a good indication if the measured values and predicted model

output agree; in this case the fit is good. Although the residuals are calculated and

presented in a table together with the measured and predicted values, there is no option to

generate residual plots. Since the option to generate overlay plots is in the measurement

data input window, the same inconvenience that was encountered during data input exists

in the generation of overlay plots. A separate window needs to be opened for each

variable in each experiment, therefore only one overlay plot such as Figure 6.8 can be

created at a time. If the user would like to get an overview of all available overlay plots,

133 in this case study, he is required to repeat a sequence of at least four mouse-clicks,

resulting in a minimum of 532 mouse-clicks. This is considered to be very user-

unfriendly, especially since it can be expected that the user is interested in all the

available overlay plots. The generated overlay plots can easily be customized and copied

to other programs such as MS Word and Excel.

gPROMS

When a parameter estimation run is successfully completed in gPROMS, an output file is

created containing all the available information including the option to generate overlay

plots. The machine-readable estimation file with extension *.stat-mr can be imported in

MS Excel via a pre-defined workbook that is available in the Process Systems Enterprise

directory from the start-menu. This workbook contains a macro that adds a button named

Parameter Estimation to the MS Excel toolbar and clicking on this button gives a menu

where the user can chose Import file… and Plot, which gives more options. Once the file

is imported, all the available statistical information from the parameter estimation run is

accessible on a number of spreadsheets; these will be discussed in the next paragraph.

Behind the Parameter Estimation -> Plot button, there are three options available, these

are: Overlay Chart, Residual Chart and Confidence Ellipsoids. Clicking Overlay Chart,

gives a dialog box with the options to include the standard deviations in the figures that

will be displayed and the format of these figures. The user can specify for which

variable(s) and experiment(s) the overlay plots need to be generated and also if they

should appear on the sheet with the corresponding variable, each chart on a new sheet or

all charts on a new sheet. A convenient option is to select multiple charts on a new

worksheet and click Plot all, this generates all 133 overlay plots at once. An example of

an overlay plot of the liquid mass percentage of component MBS during experiment 105,

with the measurement standard deviation included is presented in Figure 6.9.

54

Overlay Plot

EXP105

TANK.COMP_LIQ_MASS_PERC(4)

0

5

10

15

20

0 5000 10000 15000

Measurement Time

Predicted Experimental

Figure 6.9 Overlay plot of liquid mass percentage MBS in experiment 105.

The standard deviations around the measured values are indicated with error bars, which

are not clearly visible here due to the small standard deviations. The generated plots can

be edited according to all the available MS Excel features. As mentioned, also residual

plots can be generated with three options: absolute, relative and weighted. Finally, there

is the feature of generating two-dimensional confidence ellipsoids for any pair of

estimated parameters using one of the following confidence levels: 90%, 95% or 99%.

Ideally, the ellipsoids are spherical and small, indicating low cross-correlation and high

accuracy respectively.

6.4.2 Statistical analysis

After investigating the overlay plots to see if the model with the estimated parameters has

the ability to match the measured values, a statistical analysis on the estimated parameter

is performed. The minimal types of statistical information required to analyze if the

parameters are estimated with sufficient accuracy are: the estimated values, their standard

deviation or a confidence interval and the correlation matrix. In this paragraph, the

availability and type of statistical information required to perform such an analysis is

assessed for both ACM and gPROMS.

ACM

When performing an activity such as initialisation, simulation or parameter estimation,

information about this activity is created in a window at the bottom in the screen named

Simulation Messages. A right-mouse-click in this window gives a pop-up menu where

the output destination can be specified. The user choose to send the output to the screen

and also to send it directly to a log-file with the extension *.txt. This log-file contains

55

details about the optimisation iterations and at the end the statistical information is

available.

gPROMS

As mentioned in the previous paragraph, all available statistical information from a

successful parameter estimation run is stored in a *.stat-mr output file, which can be

imported in MS Excel. The workbook contains multiple spreadsheets where the statistical

information is presented in a clear way.

An overview of the available statistical information in the parameter estimation output of

both software packages is presented in Table 6.15

Table 6.15 Overview of available statistical information in parameter estimation output.

ACM gPROMS

Parameters Estimated value Yes Yes

Initial guess Yes Yes

Lower and upper bounds No Yes

Accuracy Standard deviation Yes Yes

Confidence interval Yes (95%) Yes (90%, 95% and

99%)

Student t-test No Yes (95%)

Fischer matrix No Yes

RMS error Yes No

Objective function Value Yes Yes

Residual sum Yes, total and per

variable

Yes, total

Contribution Per variable Per experiment,

per variable

Correlation Correlation matrix Yes Yes

Confidance ellipsoids No Yes

Variance Variance-covariance matrix Yes Yes

Model ranking F-values Yes Yes (90%, 95% and

99%)

Measured variables Measured values Yes Yes

Predicted values Yes Yes

Residual values Yes (absolute,

relative and

weighted)

Yes (absolute,

relative and

weighted)

56

From Table 6.15 it is concluded that the essential types of statistical information required

for analysis are present in both software packages. Furthermore, both tools have very

similar types of statistical information available, for gPROMS it is slightly more

extensive.

The student t-test in gPROMS provides an objective indication if the individual

parameter accuracy is sufficient [21]. A 95% t-value is calculated, which is the ratio of an

estimated parameter and the distance to a bound of its corresponding 95% confidence

interval. A reference 95% t-value is determined according to t(0.95, N – Np), where N is

the total number of measurements and Np is the number of parameters to be estimated,

resulting in N – Np degrees of freedom. When the 95% t-value is larger than the reference

value, the parameter accuracy is statistically significant. Without the reference value, the

user has to make an own judgment to how high the ratio should be for a parameter to be

considered statistically significant.

6.5 Assessment on parameter estimation

In this paragraph the assessment of both software packages on the various aspects of

parameter estimation is summarized. Regarding the first assessment criterion defined in

chapter 3, which is the ability and speed of obtaining the optimal values for the kinetic

parameters, it is concluded that:

• In all three cases gPROMS was able to obtain optimal values for the parameters.

With ACM the parameters were found in all cases with the different solving

methods, except for the MLL FEASOPT solving method in the third case with 20

parameters.

• In all three cases gPROMS obtained the optimal solution faster than ACM. In

ACM the LSQ NL2SOL solving method proved to be fastest.

An important aspect of parameter estimation is the individual parameter accuracy of the

solution. This is quantified in terms of the ratio between the estimated value and the

standard deviation. From the determined accuracy ratios in the three parameter estimation

cases the following conclusions are drawn:

• The results from gPROMS and the most accurate ACM solving method proved to

have sufficient accuracy ratios in all three cases. The ratios are roughly a factor 10

higher for gPROMS, indicating very good accuracy.

• The most accurate solving method in ACM proved to be LSQ NL2SOL with

sufficient accuracy, the most inaccurate is MLL NM resulting in insufficient

accuracy.

57

Output interpretation

Apart from the performance of the parameter estimation capabilities in both software

packages, the features involving the availability and interpretation of the output are

assessed. A strong aspect of gPROMS in this context is that a parameter estimation

output file with all available statistical information is created, which can be imported in

MS Excel from where overlay, residual and confidence ellipsoid plots can be generated

with predefined macro’s. In ACM there is a separation between statistical information

that can be stored in a text file and the feature of generating overlay plots. This

functionality is lost when the project is closed since the predicted values are not stored

when the project is saved. Furthermore, The overlay plots can only be generated

separately for one variable in one experiment at a time, whereas in the gPROMS output

file there is the option to generate all plots in one sheet. These and other findings are

summarized in the following conclusions:

• The standard overlay plots functionality is good in gPROMS and poor in ACM, in

gPROMS also residual and confidence ellipsoid plots can be generated.

• Individual parameter accuracy is evaluated with a t-test in gPROMS and a

warning is created in case of insufficient accuracy, no such feature is available in

ACM

• Further available statistical information is similar and sufficient for both tools, it

is slightly more extensive for gPROMS

58

7 Experiment design

7.1 Introduction

During the process of parameter estimation as described in the previous chapter it was

found that, for both the complete and the reduced reaction scheme, parameter k8 was

unobservable with the existing set of experiment data. Given the available experiment

data a reduced kinetic model was derived by replacing the parameters that were coupled

with k8 with new parameters according to

7

88

k

kkkkk i

ii

⋅=′⋅=′′ (7.1)

All the parameters ik ′′ proved to be observable now, but remain coupled to k8 and their

use is only valid in combination with that specific value for k8. It is believed from

investigating the reduced reaction scheme that it should be possible to determine k8 and

the parameters it is coupled with individually, since the parameters with which k8 is

coupled also appear separately in the rate expressions. This is explained with the reduced

reaction scheme as presented in Figure 7.1

C4

+ + H2SO4 MBS + H + (k7 and k8)

C4 + + H2O SBA + H + (k9 and k10)

Figure 7.1 Reduced reaction scheme.

As discussed in the paragraph 6.1.1, the component rate expressions are

rrr SBASOH ==42

(7.2)

rrr MBSOH −==2

(7.3)

with

][][

]][[]][[][

29427

42107298

OHkSOHk

SBASOHkkMBSOHkkHr

+

−= + (7.4)

As explained in paragraph 6.1.1, one parameter becomes unobservable as a result of

applying the Bodenstein approximation. Therefore, a new parameter is introduced

according to

7

9

9k

kk =′ (7.5)

59

Combining (7.4) and (7.5) results in

][][

]][[]][[][

2942

4210298

OHkSOH

SBASOHkMBSOHkkHr

′+

−′= + (7.6)

In principle, all three parameters in rate expression (7.6) should be individually

determinable. However, this was not the case since in the available measurement data the

following holds

][][ 2942 OHkSOH ′>> or 7

9

2

42

][

][

k

k

OH

SOH>> (7.7)

Allowing the simplification of (7.6) to

][

]][[]][[][

42

4210298

SOH

SBASOHkMBSOHkkHr

−′= + (7.8)

In the resulting rate expression k8 and 9k ′ cannot be determined individually, and the

parameter 9k ′′ was introduced for which the following holds

7

98

989k

kkkkk

⋅=′⋅=′′ (7.9)

However, if there would be more variation in the ratio of the concentration of H2SO4 and

H2O and the condition in (7.7) would not be satisfied, then both k8 and 9k ′ are expected to be individually observable.

7.2 Approach

In order to test this hypothesis, a set of kinetic parameters is arbitrarily chosen and

simulations with the model are used to synthesize experiment data. This approach

provides perfect data without measurement error and allows measurements to be taken as

frequently as desired. It is recognized that these conditions are very different from those

when performing actual experiments. In a later stadium, the simulation conditions can be

changed to resemble the actual experiments by e.g. adding constraints on the minimal

time between measurements and by adding noise to the data.

Attempts have been made to apply the gPROMS SED feature to design experiments

using the reduced scheme in Figure 7.1 such that they have maximal information content

[20]. This approach was abandoned due to the problem that the designed experiments

were either identical to the previous one after a few designs or they were very close to the

initial guess of the initial composition, which is the degree of freedom in the design. It

60

was initially suspected that these problems might be caused by the CAPE-OPEN

interface, however this was found not to be the case in subsequent trials of a model that

did not use the CAPE-OPEN interface. At the moment of writing this report, it is still

under investigation how the gPROMS SED feature can be applied successfully.

Alternatively, the experiments have been designed manually, by characterizing the design

space with ratios of the component concentrations. This was also found to be

unsuccessful and a different set of values for the kinetic parameters is chosen, with a ratio

of k9 over k7 in the order of 1 instead of 1e-4, as was the case in the previous set.

Furthermore, the number of experiments and the number of measurements was increased.

Due to the Bodenstein approximation one parameter is unobservable and k7 is fixed to its

chosen value. With these modifications, the parameters k8, k9 and k10 were observable.

Next, the reversible reaction with parameters k1 and k2 forming one of the butenes (1-C4)

is added to the reaction scheme. Again new experiments are simulated and the

measurements are used to estimate the five parameters, which also proved to be

successful. As mentioned in Chapter 4, the complete scheme was a simplification to the

original scheme. The reversible direct reaction between MBS and SBA, which was

omitted from the original scheme, is now re-introduced resulting in what will be referred

to as the semi-original scheme.

The process of performing simulations with the designed experiments and applying the

measurements for parameter estimation is repeated and the complete set of parameters

(except for k7 due to the Bodenstein approximation) was re-obtained. The procedure for

the three cases (reduced scheme, reduced scheme including 1-C4 and the triangle scheme)

is similar, although becoming more extensive with increasing number of kinetic

parameters. It is described for the triangle scheme in the following paragraph.

7.3 Triangle reaction scheme

The reaction scheme that is used in the final case is referred to as the triangle scheme. It

incorporates the reversible direct reaction between MBS and SBA from the scheme

proposed originally for this work by the chemist who designed and supervised the

experimental program. A schematic of the triangle scheme is presented in Figure 7.2.

61

C 4 ⊕

SBA

H ⊕

H 2 O

H ⊕

SBA MBS

H 2 SO 4

H 2 SO 4 H 2 O

H ⊕

C 4 ⊕

SBA

H ⊕

H 2 O

H ⊕

1 - C 4 = 1 - C 4 =

SBA MBS

H 2 SO 4

H 2 SO 4 H 2 O

H ⊕

Figure 7.2 Schematic representation of the triangle scheme

Instead of three, only one of the reversible reactions that form the butenes is included

here, since this is sufficient to capture the principle behaviour of the presence of butenes

in the reaction scheme. The triangle reaction scheme with the kinetic parameters is

presented in Figure 7.3

1-C4 + H+ C4

+ (k1 and k2)

C4+ + H2SO4 MBS + H+ (k7 and k8)

C4+ + H2O SBA + H+ (k9 and k10)

MBS + H2O SBA + H2SO4 (k15 and k16)

Figure 7.3 Triangle reaction scheme.

In Figure 7.2 it can be seen that the reactions involving k7, k8 ; k9, k10 and k15, k16 form a

reaction loop. The presence of this loop in the reaction network enables an additional

constraint to be developed which relates the chemical equilibriums of each reaction. The

equilibrium relationships of these three reversible reactions are described by (7.10) to

(7.12)

8

7

442

87]][[

]][[

k

k

CSOH

HMBSK ==

+

+

− (7.10)

10

9

42

109]][[

]][[

k

k

COH

HSBAK ==

+

+

− (7.11)

62

16

15

2

421615

]][][[

]][][[

k

k

HOHMBS

HSOHSBAK ==

+

+

− (7.12)

The above equilibrium relationships are not independent since the following holds

87

1091615

−− =

K

KK (7.13)

This can be used to derive the relationship that must exist between all of the reaction rates

for a well-posed problem. This relationship is found to be

7

8

10

9

16

15

k

k

k

k

k

k= (7.14)

One of the kinetic parameters in this relationship is not independent and in this work the

value of k16 is derived from Equation 7.14 as

8

7

9

101516

k

k

k

kkk = (7.15)

This brings the set of kinetic parameters for the triangle scheme to the seven as presented

in Table 7.1. Their values are chosen such that time to steady state of the reactions is in

the order of hours, to avoid very fast or very slow reactions.

Table 7.1 Chosen parameter values

Parameter Value

k1 1.5e-4

k2 2.5e-4

k7 1e-4

k8 2e-4

k9 3e-4

k10 4e-4

k15 5e-8

From the Bodenstein approximation it follows that one kinetic parameter becomes

unobservable and therefore k7 is fixed to its chosen value and not estimated. In this

synthetic case, the other parameters can be determined without having to introduce new

parameters.

Now that the model parameters are defined, the next aspect is the design of the

experiments. In this work, we considered that experiments are conducted under

isothermal and isobaric conditions. The design freedom for the experiments to be

designed is the initial composition of the five components H2O, H2SO4, SBA, MBS and 1-

C4. The design space will be characterized by introducing four ratios according to

63

][

][

42

21

SOH

OHr = (7.16)

][

][2

MBS

SBAr = (7.17)

][

][ 23

SBA

OHr = (7.18)

][

][ 424

MBS

SOHr = (7.19)

The fifth dimension of the design space is the amount of 1-C4. In every experiment three

of the five dimensions are assigned a value that can be either 1, high or low

corresponding to Table 7.2

Table 7.2 values for high and low design points

high low

ri 100 0.01

1-C4 [kg] 0.01 0

The initial composition of the set of designed experiments has the characteristics as

presented in Table 7.3

Table 7.3 Characteristics of initial composition of designed experiments

Experiment r1 r2 r3 r4 1-C4

1 high high high

2 high low high

3 low high high

4 low low high

5 high high high

6 high low high

7 low high high

8 low low high

9 1 1 1 1 high

10 high high low

11 high low low

12 low high low

13 low low low

14 high high low

15 high low low

16 low high low

17 low low low

18 1 1 1 1 low

64

The measurements during simulation of the experiments are taken very frequently,

especially in the first period of the reaction. A specification of the measurement times, 66

in total, is presented in Table 7.4

Table 7.4 Specification of measurement times

Start [s] End [s] Interval [s]

0 30 1

35 50 5

60 100 10

150 300 50

400 1000 100

1250 2000 250

2500 5000 500

6000 10000 1000

With this setup of synthesizing experiments it was found that all observable parameters

could be determined. The criterion to indicate whether the parameters are successfully

determined is chosen to be a 10% range around the original parameter value. This

criterion can off course only be applied in this synthetic case where the original

parameter values are known. In practice, the confidence interval and standard deviation

provide information to judge the accuracy of the solution. The parameter estimation

results are presented in Table 7.5

Table 7.5 parameter estimation results triangle scheme with synthesized experiments.

Correlation matrix Parameter

Optimal

estimate

Standard

deviation k1 k2 k8 k9 k10 k15

k1 1.499998E-04 4.558409E-08 1.00

k2 2.499997E-04 4.072335E-07 0.07 1.00

k8 1.999995E-04 3.360493E-07 0.01 -0.95 1.00

k9 3.000008E-04 3.541149E-07 -0.01 0.42 -0.39 1.00

k10 4.000007E-04 5.854902E-07 0.01 -0.70 0.72 0.30 1.00

k15 5.000024E-08 4.016255E-11 -0.01 0.43 -0.44 0.10 -0.40 1.00

As can be observed in Table 7.5 the parameters are estimated with sufficient accuracy

and well within a 10% range of the original values. However, a high correlation exists

between k2, k8 and k10. This indicates that these parameters did not become sufficiently

observable even from the large set of synthesized experiment data.

7.4 Constraints

The procedure described in section 7.3 represents an ideal case that would never be

encountered in practice. In actual experiment data a finite amount of measurement error

will be associated with all measurements. In addition, resource and equipment constraints

65

mean that only a small number of experiments in which a subset of the possible

experiment data points can ever be measured. Given these constraints, it is thus useful to

understand which experiments and which experiment data can provide maximal

information. In this section, the amount of measurement data is reduced to investigate

what the required minimum set of measurement points is, in order to estimate the

parameters with sufficient accuracy. Next, the effect of measurement error is investigated

by adding noise to the perfect data.

7.4.1 Reduced measurement data

The total data set of 18 experiments containing 66 measurement points each is reduced

until the parameters were no longer individually observable. This is done in various ways

for all three reaction systems (reduced scheme, reduced scheme including 1-C4 and the

triangle scheme) that are mentioned in paragraph 7.2. First the number of measurement

points per experiment is reduced, keeping the number of experiments used for estimation

at the maximum of 18. For all three reaction systems the minimum was found to be one

single measurement point, provided that it is located in the first part of the reaction,

before steady state is reached. This first part of the reaction contains most information

about the time-dependent behaviour of the system, which is required to obtain the kinetic

parameters. The steady-state part contains only information on the reaction equilibrium

constants.

Next, the number of complete experiments is reduced, using all 66 measurement points in

the remaining experiments to estimate the parameters. Finally, the number of experiments

containing one single measurement point is reduced until the parameters were no longer

individually observable. An overview of the results is presented in Table 7.6.

Table 7.6 Minimal number of experiments required to obtain parameters.

The type of experiments is also of influence, experiments with more variation in the

measured variables are found to contain more information. For the triangle scheme using

66 measurement points per experiment it was not possible to obtain the parameters with

experiments 1 and 13, although the overlay plots showed a good fit. Parameter estimation

Reaction system

Number of

measurement

points per

measured variable

Number of

measured

variables per

experiment

Minimal

number of

experiments

Total number of

measurement

points

66 4 2 528 Reduced scheme

1 4 3 12

66 5 2 660 Reduced scheme

including 1-C4 1 5 4 20

66 5 2 660 Triangle scheme

1 5 4 20

66

proved to be successful with experiments 3 and 11. Typical overlay plots of both

parameter estimation runs are presented in Figures 7.4 and 7.5.

91.4

91.5

91.6

91.7

91.8

91.9

92

0 2000 4000 6000 8000 10000

Time [s]

weig

htp

erc

enta

ge H

2O

[%

]

Predicted Experimental

Figure 7.4 Typical variation of a measured value in experiment 1

0

1

2

3

4

5

0 2000 4000 6000 8000 10000

Time [s]

Weig

htp

erc

enta

ge H

2O

[%

]

Predicted Experimental

Figure 7.5 Typical variation of a measured value in experiment 3

The low variation in the measured values as can be seen in Figure 7.4 was observed in

experiments with both ratios either high or low, as specified in Table 7.3. In the case of

experiments 9 and 18 where all four ratios have a value of 1, it was observed that little

variation occurred for components H2O and H2SO4 and much more for components SBA

and MBS. High variation was encountered in experiments with a high value for one ratio

and a low value for the other. The typical variation of a measured variable in such an

experiment, is observed in Figure 7.5.

67

This is explained by looking at the definition of the ratios in equations (7.16) to (7.19)

and the triangle reaction scheme in Figure 7.3. When r1 and r2 are both high or low, this

results in respectively low or high concentration on both sides of reactions 7, 8 and high

or low concentrations on both sides of reactions 9, 10. Also the difference in

concentration in reactions 15 and 16 is small in both cases. As the driving force for

reaction is the difference in concentration on both sides of the reaction, a small difference

results in small variation of the measured variables. In the case that r3 and r4 are both

high or low, the concentration difference for reactions 7, 8 and 9, 10 may be high, but

both reaction consume the same intermediate, which now becomes the limiting

component. Reaction 15, 16 lacks driving force due to little difference in the

concentrations on both sides of the reaction. Neither of these reaction limiting aspects

occur in the cases that a combination of two ratios has different values. This explains why

it is observed that experiments 2, 3, 6, 7, 11, 12, 15 and 16 contain more variation than

experiments 1, 4, 5, 8, 10, 13, 14 and 17.

7.4.2 Effect of measurement error

The effect of measurement error is mimicked by adding noise to the perfect data that was

obtained from the simulations. The measurement error is assumed to be normally

distributed and therefore the noise is created with a Gaussian random number generator,

which is based on the polar Box-Muller transformation [3]. The input parameters are the

mean µ and the standard deviation σ of the Gaussian random number to be generated. The

synthesized data is the value for µ and one value for σ is assigned for the complete

dataset. The approach to investigate the effect of measurement error on the ability to

obtain the parameters is to vary σ of the Gaussian noise. The case that σ = 0 is equivalent

to having no noise and returns the synthesized data point obtained from simulation. Using

the complete dataset for parameter estimation, it was found that σ = 1e-8 resulted in good

overlay plots and in parameters values within a 10% range of the original values.

However, σ = 1e-7 resulted in parameter estimates outside a 10% range of the original

values for k8, k9 and k10 and poor overlay plots. In the case of σ = 1e-6 the overlay plots

are equally poor and the parameter estimates of k2, k8, k9 and k10 are outside their 10%

range. It is regarded to be suspicious that already for σ =1e-7 the parameters cannot all be

obtained within the 10% range of their original values This leads to believe that either the

applied method of adding noise to the synthesized data is not suitable for this purpose or

that the system is indeed extremely sensitive to measurement error. It is recommended to

further investigate what the cause of these observations is. Due to lack of time this was

not possible before the moment of finalizing this report.

7.5 Conclusions on experiment design

The conclusions that are drawn from investigating the design of experiments for the three

reaction systems are the following:

• The gPROMS SED feature has not been applied and is therefore not assessed.

68

• It proved to be possible to estimate all independent parameters in the three

reaction systems using the synthesized experiment data.

• Experiments with a high and a low initial concentration ratio exhibit more

variation in the measured variable than experiments with both ratios either high or

low.

• Applying perfect synthesized measurement data allows to estimate the parameters

with very little measurements.

No conclusions can be drawn on the part of adding noise to the synthesized data. It is

recommended that this topic is further investigated.

69

8 Conclusions

The main project objective is to assess the parameter estimation capabilities of the

gPROMS software and compare them with the Aspen Custom Modeler software, using a

realistic industrial process as case study, which is the Sec-Butyl-Alcohol stripper. The

assessment criteria are divided into three parts, the conclusions with respect to these

criteria are:

Model building

• The CAPE-OPEN interface is successfully applied in gPROMS and an

experiment model has been built that is equal to the existing model in ACM.

• AspenPlus does not support all CAPE-OPEN methods when exporting a property

package.

• Experiment data input requires significantly less effort in gPROMS for data sets

with multiple measured variables per experiment.

Parameter estimation

• gPROMS showed better ability, speed and accuracy in obtaining optimal

estimates compared to ACM.

• From the four solving methods in ACM, the LSQ NL2SOL method performed

best with reasonable speed and sufficient accuracy.

• The gPROMS output file provides better features to generate overlay plots and

has additional features to create residual and confidence ellipsoid plots.

• The available statistical information is sufficient in both tools, but slightly more

extensive in gPROMS.

Experiment design

• The gPROMS SED feature has not been applied and is therefore not assessed.

• It proved to be possible to estimate all independent parameters in three reaction

systems using synthesized experiment data.

• Experiments with one high and one low initial concentration ratio exhibit more

variation in the measured variables than experiments with both ratios either high

or low.

70

References [1] AspenTech, “Introduction to Aspen Custom Modeler”, 2002

[2] Bard, Y., Nonlinear Parameter Estimation, 1974

[3] Box, G.E.P., Muller, M.E., A note on the generation of random normal deviates,

Annals Math. Stat, V. 29 : 610-611, 1958

[4] Chewter, L.A., Colantonio, M.C., McBrien, J., Mooijer-van den Heuvel, M., de

Noord, O.E., Pingen, J., de Wolf, S., Weve, D.N.M.M., User guide to the

modelling of chemical processes, CT.04.20611 (Confidential), 2004

[5] Damen, A.A.H., Physiological Processes and Parameter Estimation, TUE reader,

2003

[6] Dennis, J.E., Gay, D.M., Welsch, R.E., An adaptive Nonlinear Least Squares

Algorithm. ACM Transactions on Mathematical Software, 7 : 348-383, 1981

[7] During, F., de Jong, J., Jonkers, G., Pernis COF/2 Strippers Performance Study –

Data Report., Shell International Chemicals B.V., CT.04.20655 (Confidential),

2004

[8] Eurokin, URL: http://www.dct.tudelft.nl/eurokin , 2005

[9] Fogler, Scott H., Elements of Chemical Reaction Engineering (Third Edition),

2002

[10] Harwell subroutine library, URL: http://www.cse.clrc.ac.uk/nag/hsl , 2005

[11] Helfferich, F.G., Kinetics of homogenous multistep reactions, 2001

[12] Hendriks, E.M., Mooijer, M.M., Meijer, H., Engineering data and

thermodynamics, Shell course “Thermodynamics for Distillation”, 2004

[13] Himmelblau, D.M., Applied Nonlinear Programming, 1972

[14] Koplos, G.J., French, R.N., Development of a Kinetic Model for the Shell Acid-

Olefin Process Employed for the Manufacture of Sec-Butyl Alcohol – Part 3,

Shell Global Solutions (U.S.), OG.03.80039 (Confidential), 2003

[15] Loos, T.W.d. & Kooi, H.J.v.d., Toegepaste thermodynamica en fasenleer, TUD

reader, 1977

71

[16] Muthusamy, D., Development of a Kinetic Model for the Shell Acid-Olefin

Process Employed for the Manufacture of Sec-Butyl Alcohol – Part 1, Shell Oil

Company, Technical Progress Report WTC 54-02 (Confidential), 2002

[17] Nelder, J.A., Mead, R., A simplex method for function minimization. Computer

Journal, 7 : 308-313, 1965

[18] Perkins, G., Progress report: Parameter estimation of the reaction kinetics of the

SBA stripper, 2004

[19] Piñol, D. (AspenTech), Halloran, M. (AspenTech Ltd.), Szczepanski, R.

(Infochem), Pons, M. (TotalFinaElf), Drewitz, W. (BASF), Banks, P. (BP),

CAPE-OPEN Open Interface Specifications, Thermodynamics and Physical

Properties Version 1.1, 2003

[20] Process Systems Enterprise, Experiment Design for Parameter Precision in

gPROMS, 2004

[21] Process Systems Enterprise, gPROMS Advanced User Guide, 2004

[22] Process Systems Enterprise, gPROMS Introductory User Guide, 2004

[23] Reman, W.G., The Sec-Butyl Alcohol / Methyl Ethyl Ketone Process., Shell

International Chemicals B.V., AMGR.96.212 (Confidential), 1996

[24] Rogers, A.N., Development of a Kinetic Model for the Shell Acid-Olefin Process

Employed for the Manufacture of Sec-Butyl Alcohol – Part 2, Shell International

Chemicals B.V., CA.02.20793 (Confidential), 2003

[25] Spiering, W., Muthusamy, D., Rogers, A.N., Shell Acid-Olefin Process for

SBA/MEK Modelling Plant Behaviour, CT.03.20656 (Confidential), 2003

[26] Walas, Stanley M., Phase Equilibria in chemical engineering, 1984

72

Nomenclature

symbol quantity units

a activity [-]

a order of concentration [-]

A correlation parameter various

B correlation parameter various

C correlation parameter various

C concentration [mole · m3]

D correlation parameter various

E correlation parameter various

Ea activation energy [J · mole-1]

f fugacity [Pa]

F correlation parameter various

H0 Hammett acidity [-]

k reaction rate constant [s-1]

kp pre-exponential factor [s-1]

kref referrence pre-exponential factor [s-1]

L( ) likelihood [-]

n molar holdup [mole]

N number of components [-]

N number of measurements [-]

Np number of parameters [-]

NE number of experiments [-]

NM number of measurements [-]

NV number of measured variables [-]

p( ) probability [-]

P pressure [Pa]

fΡ Pointing factor [-]

Pi partial pressure [Pa]

P sat

saturation pressure (vapour pressure) [Pa]

r reaction rate [mole · s-1]

R number of reactions [-]

R universal gas constant (≈8.3145) [J · mole-1 · K

-1]

t time [s]

T temperature [K] or [°C]

V volume [m3]

W weighting factor [-]

w% weight percentage [%]

x liquid mole fraction [-]

73

y vapour mole fraction [-]

z predicted model response [-]

z~ measured response [-]

Greek

ε absolute tolerance [-] γ activity coefficient [-] γ heteroscedasticity parameter [-]

φ fugacity coefficient [-]

µ chemical potential [J] µ mean [-]

υ stoichiometric factor [-]

nυ molar specific volume [m3 · mole

-1]

θ set of model parameters [-]

ξ measurement error [-]

π pi constant (≈3.1416) [-]

σ standard deviation [-]

Φ gPROMS objective function value [-]

ω heteroscedasticity parameter, standard deviation [-]

superscripts

+ protonated

0 reference state

sat saturated

´ unobservable parameter due to Bodenstein approximation.

´´ unobservable parameter in combination with available data.

subscripts

L liquid

L lower

U upper

V vapour

T total

74

indices

0 reference state (P = P sat)

i ith component

j jth reaction

k kth measurement

abbreviations

ACM Aspen Custom Modeler

CAPE Computer Aided Process Engineering

CO CAPE-OPEN

COSE CAPE-OPEN compliant Simulation Environment

CPU Central Processing Unit

CSTR Continuous Stirred Tank Reactor

FatnessDBI Fatness Dissolved Butenes Included

FO Foreign Object

GB Giga Byte

gPROMS general Process Modelling System

HCFA HydroCarbon Free Acid

LSQ Least Squares

MEK Methyl Ethyl Ketone

MLL Maximal log likelihood

NM Nelder-Mead

NRTL Non-Random Two Liquids

PP Property Package

PS Property System

PSE Process Systems Enterprise

RAM Random Access Memory

SBA Secondary-Butyl Alcohol

SED Sequential Experiment Design

SQP Sequential Quadratic Programming

VLE Vapour-Liquid Equilibrium

75

Appendix A Chemical structures of components

H H

O

H2O H2SO4

O

O

HO OH S CH2

CH

CH3

CH3

OH

SBA

MBS

CH2

CH

CH3

CH3

O

O

O OH S

CH2

CH +

CH3

CH3

C4+

SBE

CH2

CH

CH3

CH3

O

CH3

CH2

CH

CH3

H +

H +

N2

N

N

1-C4

H

C

CH2

H

H CH3

C

c2-C4

H

C

CH3

H

C

CH3

t2-C4

H

C

CH3 H

C

CH3

76

Appendix B Hammett acidity

In reactions involving the protonation of a weak base (butylenes in this case) by a strong

acid such as the 72% sulphuric acid used in the SBA process, the catalytic efficiency of

the acid is not a linear function of the acid concentration present in the reaction medium.

A measure of the ability of the acid to donate a proton to a weak base is given by the

Hammett acidity function. Experiments have been performed to obtain a correlation for

the Hammett acidity H0 as a function of two other variables: HCFA and FatnessDBI. The

Hammett acidity correlation is presented in equation (B.1).

2438

0 1071408.41035172.3 HCFAHCFAH ⋅⋅+⋅⋅=− −− 312 107997.31023125.5 FatnessDBIHCFA ⋅⋅−⋅⋅+ −− (B.1)

42162.06003.377132.1 2 −⋅−⋅+ FatnessDBIFatnessDBI

The variables ‘hydrocarbon free acid’ (HCFA) and the ‘Fatness dissolved butenes

included’ (FatnessDBI) are both a function of the liquid weight percentage of some of the

components as presented in (B.2) and (B.3) respectively. HCFA (Hydrocarbon-Free

Acidity) is a measure for the capability of the liquid to absorb hydrocarbons. The

mathematical expression for HCFA is

%100

213.1330

015.18%

123.74

015.18%%

182.154

074.98%%

182.154

074.98%%

,,,,4,

,4,

22

2

SBELSBALOHLMBSLSOHL

MBSLSOHL

wwwww

ww

HCFA

++++

+=

(B.2)

FatnessDBI is defined as the number of moles of C4 equivalents, including the dissolved

butane and butene isomers, per mole of H2SO4 equivalent present in the acid phase. The

mathematical expression for FatnessDBI is

182.154

%

074.98

%231.130

%2

123.74

%

108.56

%%%

182.154

%

,4,

,,2,2,1,,

2

444

MBSLSOHL

SBELSBALCtLCcLCLMBSL

ww

wwwwww

FatnessDBI

+

++++

+=

−−−

(B.3)

77

Samenvatting

De kinitiek parameters behorend bij het relevante reactieschema zijn vereist om een

chemische proces te optimaliseren dat plaatsvindt in de Sec-Butyl-Alcohol stripper die

bedreven wordt door Shell in Pernis. Hiervoor zijn in het verleden

laboratoriumexperimenten uitgevoerd, die gemodelleerd zijn met Aspen Custom Modeler

(ACM), een commercieel software pakket dat ontwikkeld wordt door AspenTech. Het

doel van dit afstudeerproject is de parameterschatting capaciteiten het softwarepakket

gPROMS, dat ontwikkeld wordt door Process Systems Enterprise, te beoordelen en te

vergelijken met die van ACM. Hiertoe dient een model van de laboratoriumexperimenten

ontwikkeld te worden met de gPROMS software dat zo equivalent mogelijk dient te zijn

aan het bestaande model in ACM. Daartoe zijn de fysische en thermodynamische

eigenschappen van de componenten in het ACM model beschikbaar gemaakt voor het

gPROMS model via de CAPE-OPEN interface, die succesvol toegepast is. Het model van

de experimenten dat ontwikkeld is in gPROMS bevat een vloeistof-damp evenwicht dat

de verdeling van negen componenten over beide fasen beschrijft die zeer goed

overeenkomt met het ACM model. Tijdens parameterschatten werd duidelijk dat niet alle

kinitiek parameters individueel waarneembaar zijn in combinatie met de beschikbare data

van de laboratoriumexperimenten. Het systeem is gedeeltelijk ge-herparameteriseerd en

de nieuwe set parameters is vervolgens bepaald met voldoende nauwkeurigheid.

Verschillende aspecten van parameterschatten met beide software pakketten zijn

behandeld, onder andere: invoer van experiment data, interpretatie van de uitvoer,

beschikbare combinaties van doelfuncties en optimalisatie methoden alsmede het

vermogen, de snelheid en de nauwkeurigheid van het verkrijgen van een oplossing van

deze combinaties. Aan de hand van de beoordeling van deze aspecten is geconcludeerd

dat de parameterschatting capaciteiten van gPROMS beter zijn dan die van ACM.

Vervolgens is onderzocht welk type experimenten vereist zijn om de kinetiek parameters

te schatten die niet waarneembaar zijn met de tot dan toe beschikbare data. Pogingen om

de gPROMS “Sequential Experiment Design” (SED) functionaliteit toe te passen waren

zonder succes, waardoor geen uitspraak gedaan kan worden over de toegevoegde waarde

van deze functionaliteit. Alternatief ontworpen experimenten zijn gesimuleerd met het

model dat bekende parameters bevat, vervolgens wordt de synthetische data gebruikt

voor het schatten van de parameters. Het effect van datareductie op de waarneembaarheid

van de parameters is tevens onderzocht. Er is gebleken dat het mogelijk is alle parameters

te schatten van drie reactie systemen met een gereduceerde set synthetische data.