
Constraint-based Equilibrium and Stiffness Control of Variable Stiffness Actuators

Matthew Howard, David J. Braun and Sethu Vijayakumar

Abstract: Considerable research effort has gone into the design of variable passive stiffness actuators (VSAs). A number of different mechanical designs have been proposed, aimed at either a biomorphic (i.e., antagonistic) design, compactness, or simplified modelling and control. In this paper, we propose a (model-based) unified control methodology that is able to exploit the benefits of variable stiffness independent of the specifics of the mechanical design. Our approach is based on forming constraints on commands sent to the VSA to ensure that the equilibrium position and stiffness of the VSA are tracked to the desired values. We outline how our approach can be used for tracking stiffness and equilibrium position both in joint and task space, and how it may be used in the context of constrained local optimal control. In our experiments we illustrate the utility of our approach in the context of online teleoperation, to transfer compliant human behaviour to a variable stiffness device.

I. INTRODUCTION

In recent years, considerable research effort has gone into the design of variable passive stiffness actuators (VSAs). A number of designs have been proposed, directed at different applications, each with their own benefits and disadvantages. For example, several designs have focused on imitating the human musculoskeletal system [11], often resulting in antagonistic actuation systems. These have the benefit that transferring behaviour from human to robot is relatively simple (e.g., by drawing a correspondence between EMG signals and actuator commands), but the disadvantage that they have complex dynamics and can be hard to build into multi-joint devices. Other proposed designs have focused on simplifying the dynamics (and thereby the control) [5] or improving scalability, e.g., with joint-internal VSA designs [15]. These often have several benefits, such as compactness, but the difficulty then lies in finding appropriate controllers, especially when trying to mimic the capabilities of humans [4] and exploit the benefits of variable stiffness.

In general, the final choice of VSA mechanism for a particular robotic device will depend on many different factors related to the specific application. However, if we can find an appropriately general control framework that can be applied to a number of different designs, we can ease this design decision. In other words, we would like to find a unified control methodology that is able to exploit the benefits of variable stiffness independent of the specifics of the mechanical design.

In previous work, several approaches have been suggested to achieve this goal. For example, De Luca et al. [10] proposed using inverse dynamics and feedback linearisation in order to cancel out non-linearities and track joint stiffness and equilibrium position profiles. They showed that, assuming sufficiently smooth reference trajectories (differentiable up to 4th order), and assuming a diagonal stiffness matrix, these profiles can be tracked in a similar way to that used for position and torque control. Albu-Schaeffer et al. [1] used a similar approach with the introduction of torque feedback terms to improve disturbance rejection, with demonstrations of the method on a 7-axis robot arm.

More recently, Tahara et al. [13] suggested an approach for resolving redundancy in the actuation of antagonistically actuated systems, with a view to understanding the control of the human musculoskeletal system. Their approach was based on defining Cartesian-space force controllers with virtual spring dynamics, and using the inverse mapping from Cartesian space to joint space, and then on to the muscle space, in order to find appropriate muscle activations to realise the desired movement. In this case, stiffness was implicitly controlled according to the choice of the redundant internal forces.

In this paper, we propose a novel constraint-based framework that allows us to control equilibrium position and stiffness of an arbitrary VSA, given appropriate information about the actuator dynamics. The approach is similar to popular constraint-based schemes in kinematic [9] or torque control [12], but is applicable to redundantly actuated robotic VSA devices to give accurate, closed-loop tracking of desired stiffness and position profiles. In addition, our approach allows us to design (i) hierarchical controllers where actuation redundancy can be explicitly resolved in a prioritised framework, and (ii) controllers in which constraints on stiffness can be imposed (e.g., enforcing a particular stiffness profile for safety reasons). Furthermore, our approach can also be used for assessing the benefits of VSAs over conventional fixed-stiffness actuators, as we will show in the context of constrained optimal control. In our investigations we test our approach for tracking pre-specified equilibrium position and stiffness profiles on a number of different simulated VSAs, and in an online teleoperation task on hardware using a MACCEPA joint [5].

M. Howard, D. J. Braun and S. Vijayakumar are with the Institute of Perception, Action and Behaviour, University of Edinburgh, Scotland, UK.

II. PROBLEM DEFINITION

Our aim is to derive joint stiffness and equilibrium position controllers for VSAs, independently of the mechanism used for varying stiffness. For example, we may have a control law determining a stiffness profile for one VSA (e.g., the MACCEPA, Fig. 1(b)) and wish to transfer it to a second, different VSA (e.g., the Edinburgh SEA, Fig. 1(c)) for comparison. Alternatively, we may wish to transfer the stiffness from a human, measured during some task, to reproduce the same compliant behaviour on a robotic device.

Specifically, we assume that the VSA that we wish to control has state $\mathbf{x} \in \mathbb{R}^p$ (e.g., joint positions $\mathbf{q} \in \mathbb{R}^n$ and velocities $\dot{\mathbf{q}} \in \mathbb{R}^n$), and applying command $\mathbf{u} \in \mathbb{R}^m$ results in a joint torque of the form

$$\boldsymbol{\tau} = \boldsymbol{\tau}(\mathbf{x}, \mathbf{u}) \tag{1}$$

which, for a variable stiffness actuator, may be re-written in the form

$$\boldsymbol{\tau}(\mathbf{x}, \mathbf{u}) = -\mathbf{K}(\mathbf{x}, \mathbf{u})\,\big(\mathbf{q} - \mathbf{q}_0(\mathbf{x}, \mathbf{u})\big) \tag{2}$$

where $\mathbf{K}(\mathbf{x}, \mathbf{u}) \in \mathbb{R}^{n \times n}$ is the joint stiffness matrix and $\mathbf{q}_0(\mathbf{x}, \mathbf{u}) \in \mathbb{R}^n$ are the equilibrium positions of the joints. Note that both of these quantities may, in general, have non-linear dependence on $\mathbf{x}$ and $\mathbf{u}$, depending on the design of the mechanism.

Our goal is to derive appropriate commands $\mathbf{u}$ that realise a desired stiffness $\mathbf{K}_d$ and equilibrium position $\mathbf{q}_{0,d}$. These may be given either as fixed values (e.g., enforcing a fixed stiffness for the joint) or as variables to be tracked (e.g., in online tracking of equilibrium position and stiffness in a teleoperation task). Alternatively, we may wish to prioritise control of the position over that of stiffness. For example, in a punching task, we may want a high stiffness at the time of impact (for a hard punch), but since the first priority is to hit the target, we may have to sacrifice some stiffness in order to achieve that goal. The extent to which this sacrifice needs to be made will depend on the design of the variable stiffness mechanism, and any coupling that exists between position and stiffness. In the next section, we clarify these issues with reference to different example implementations of variable stiffness actuation.

Example: Ideal VSA, MACCEPA and Edinburgh SEA

To illustrate the influence that different mechanical designs have on the control of stiffness and equilibrium position, we consider three possible designs for a single-joint VSA.

The first and simplest of the three is the idealised VSA (see Fig. 1(a)), in which we assume that the stiffness and equilibrium position are directly controllable in the command vector, i.e., $\mathbf{u} = (q_0, k)^T$. In this case, the control of equilibrium position and stiffness is orthogonal, enabling us to select any combination of position and stiffness. This is illustrated in Fig. 1(a), right panel, where, for example, moving along the y-axis (corresponding to $u_2$) adjusts the stiffness, but has no effect on the equilibrium position, and vice versa. Unfortunately, in real mechanisms it is rarely possible to achieve such ideal behaviour.

In contrast, consider the MACCEPA [5] and the Edinburgh SEA [11] as examples of actuators of competing designs that have both been realised in hardware. For the MACCEPA, the applied joint torque (2) is given by

$$\tau(\mathbf{x}, \mathbf{u}) = \kappa B C \sin\alpha \left( 1 + \frac{r u_2 - (C - B)}{\sqrt{B^2 + C^2 - 2BC\cos\alpha}} \right) \tag{3}$$

where $\mathbf{x} = (q, \dot{q})^T$, $\alpha = u_1 - q$, $\kappa$ is the spring constant, $B$ and $C$ are the distances illustrated in Fig. 1(b) and $r$ is the radius of the winding drum (mounted on the servo that extends the spring). Note that, due to the multiplication of terms dependent on $u_1$ and $u_2$, there exists a coupling between equilibrium position and stiffness. In particular, while the equilibrium position is only influenced by controlling the angle of the lever arm ($u_1$), away from equilibrium the stiffness is influenced both by the pre-tensioning of the spring ($u_2$) and by the lever arm angle ($u_1$). To illustrate this, we may make a similar plot of the equilibrium position and stiffness as a function of motor commands as for the ideal VSA (see Fig. 1, middle row). In this case we can see that, though the equilibrium position is only influenced by the position of the first motor ($u_1$), there is a rather complex, non-linear relationship between the motor commands and joint stiffness, making independent control of stiffness difficult.

A similar argument applies to control of the Edinburgh SEA [11], for which the torque relationship is

$$\tau(\mathbf{x}, \mathbf{u}) = -\mathbf{z}^T (\mathbf{a} \times \mathbf{F}_1 - \mathbf{a} \times \mathbf{F}_2) \tag{4}$$

where $\mathbf{z}$ is the unit vector along the joint rotation axis, $\mathbf{a} = (a\cos q, a\sin q, 0)^T$, $\mathbf{F}_i = \kappa (s_i - s_0) \frac{\mathbf{s}_i}{s_i}$, $i \in \{1, 2\}$ are the forces due to the two springs (both with spring constant $\kappa$), $\mathbf{s}_1 = (-h - L\sin u_1, -d + L\cos u_1, 0)^T + \mathbf{a}$ and $\mathbf{s}_2 = (h + L\sin u_2, -d + L\cos u_2, 0)^T - \mathbf{a}$ are the extensions of the two springs, and all other quantities are illustrated in Fig. 1(c). In this case, due to the antagonistic actuation, there is a strongly coupled, non-linear relationship between the motor commands and the joint equilibrium position and stiffness (as illustrated in Fig. 1(c), right), making it difficult to control these quantities directly.

As illustrated by these examples, even for relatively simple VSA designs there is considerable difficulty in directly regulating the position and stiffness. At first glance, it would seem that, in order to exploit the dynamic properties of these actuators, it would be necessary to develop specialised controllers for each design. However, in the next section we will outline a general method for controlling arbitrary VSAs with a constraint-based framework.

III. METHOD

Motivated by the examples in the preceding section, here we outline our method for constraint-based control of equilibrium position and stiffness. We will first outline the basic approach, and then illustrate how such an approach can be used in the context of constrained local optimal control.

A. Model-based Equilibrium and Stiffness Prediction

Our approach is a model-based control scheme in which we assume that we have knowledge of the relationship between the robot state $\mathbf{x} \in \mathbb{R}^p$ (e.g., $\mathbf{x} = (\mathbf{q}^T, \dot{\mathbf{q}}^T)^T \in \mathbb{R}^{2n}$), the command vector $\mathbf{u} \in \mathbb{R}^m$, and the resultant joint torque $\boldsymbol{\tau}(\mathbf{x}, \mathbf{u})$, either in closed form or as a non-parametric model (e.g., from non-parametric regression). Note that, in general, variable stiffness actuators are redundantly actuated (since they have at least one additional degree of freedom per joint for changing the stiffness), so $m > n$.

Given (1) we can derive an expression for the equilibrium position vector as a function of state and command

$$\mathbf{q}_0 = \mathbf{q}_0(\mathbf{x}, \mathbf{u}) \in \mathbb{R}^n \tag{5}$$

by solving $\boldsymbol{\tau}(\mathbf{x}, \mathbf{u}) = 0$ for $\mathbf{q}$. This may be derived analytically, or calculated numerically with a root-finding algorithm (e.g., the Newton-Raphson method).

Also using (1), we can derive the joint stiffness matrix

$$\mathbf{K} = \mathbf{K}(\mathbf{x}, \mathbf{u}) = -\left.\frac{\partial \boldsymbol{\tau}(\mathbf{x}, \mathbf{u})}{\partial \mathbf{q}}\right|_{\mathbf{q}} \in \mathbb{R}^{n \times n} \tag{6}$$

Again, in many cases we may obtain (6) in closed analytical form, or alternatively use numerical finite differences. For convenience, we can write the stiffness in vector form as $\mathbf{k}(\mathbf{x}, \mathbf{u}) = \mathrm{vec}(\mathbf{K}(\mathbf{x}, \mathbf{u})) \in \mathbb{R}^{n^2}$.

Fig. 1. Left: Geometry, dynamics and hardware implementation of the 1-link variable stiffness actuators used in the numerical simulations and experiments: (a) Ideal 1-link VSA, (b) MACCEPA, (c) Edinburgh SEA. Right: Equilibrium position (rad) and stiffness (Nm/rad) as a function of commands $u_1$ (rad) and $u_2$ (rad), evaluated at $q = 0$, $\dot{q} = 0$.

Note that in general (5) and (6) are non-linear functions of the state and commands. Note also that, depending on the system, the dimensionality of $\mathbf{k}(\mathbf{x}, \mathbf{u})$ may vary. For example, the stiffness of each joint may be coupled so that $\mathbf{K}$ is symmetric, or, alternatively, the stiffness of individual joints may be independent (e.g., as would be the case in a chain of MACCEPA actuators). In the latter case, $\mathbf{K}$ reduces to a diagonal matrix and we can omit the off-diagonal elements, resulting in $\mathbf{k} \in \mathbb{R}^n$.
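As a minimal sketch of the numerical route to (5) and (6), the following Python snippet evaluates the MACCEPA torque in the form of (3), finds the equilibrium position with Newton-Raphson iterations, and estimates the stiffness by central finite differences. The parameter values (`KAPPA`, `B`, `C`, `R`), the step size `h` and the function names are illustrative assumptions, not values or code from the paper.

```python
import math

# Hypothetical MACCEPA parameters (illustrative values only, not from the paper).
KAPPA = 100.0  # spring constant kappa [N/m]
B = 0.03       # lever arm length B [m]
C = 0.125      # joint-to-attachment distance C [m]
R = 0.01       # winding drum radius r [m]


def torque(q, u1, u2):
    """Joint torque following the form of (3), with alpha = u1 - q."""
    alpha = u1 - q
    length = math.sqrt(B**2 + C**2 - 2.0 * B * C * math.cos(alpha))
    return KAPPA * B * C * math.sin(alpha) * (1.0 + (R * u2 - (C - B)) / length)


def stiffness(q, u1, u2, h=1e-5):
    """Stiffness (6) as K = -d(tau)/dq, by central finite differences."""
    return -(torque(q + h, u1, u2) - torque(q - h, u1, u2)) / (2.0 * h)


def equilibrium(u1, u2, q_init=0.0, tol=1e-10, max_iter=50):
    """Equilibrium (5): solve tau(q) = 0 by Newton-Raphson iterations."""
    q = q_init
    for _ in range(max_iter):
        tau = torque(q, u1, u2)
        if abs(tau) < tol:
            break
        # Newton step q - tau / (d tau / dq), using d tau / dq = -K.
        q += tau / stiffness(q, u1, u2)
    return q


q0 = equilibrium(u1=0.5, u2=2.0)
k = stiffness(q0, u1=0.5, u2=2.0)
```

Under these assumed parameters (with C > B), the root of (3) lies at q = u1, and differentiating (3) at that point gives a stiffness of κBCr·u2/(C − B), which the finite-difference estimate should reproduce closely.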

B. Resolved Equilibrium and Stiffness Tracking Control

Having derived (5) and (6) for estimating the equilibrium position and stiffness, we are now in a position to design constraint-based controllers. We note that, in general, for VSAs with an actuation relationship of the form (2), we cannot find a linear, orthogonal decomposition in the direct control space since the multiplication of stiffness with equilibrium position introduces a quadratic dependence on $\mathbf{u}$. For this reason, we must instead move to the command velocity space for control.

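In particular, we can take the time derivative of (5) and (6) to find the linearised forward impedance dynamics

$$\dot{\mathbf{q}}_0 = \mathbf{J}_{q_0}(\mathbf{x}, \mathbf{u})\,\dot{\mathbf{u}} + \mathbf{P}_{q_0}(\mathbf{x}, \mathbf{u})\,\dot{\mathbf{x}}, \tag{7}$$

$$\dot{\mathbf{k}} = \mathbf{J}_k(\mathbf{x}, \mathbf{u})\,\dot{\mathbf{u}} + \mathbf{P}_k(\mathbf{x}, \mathbf{u})\,\dot{\mathbf{x}}, \tag{8}$$

where $\dot{\mathbf{q}}_0$, $\dot{\mathbf{k}}$ are the rates of change of equilibrium position and stiffness with respect to time, $\dot{\mathbf{u}} \in \mathbb{R}^m$ is the rate of change of motor commands, $\mathbf{J}_{q_0} \in \mathbb{R}^{n \times m}$ and $\mathbf{J}_k \in \mathbb{R}^{n^2 \times m}$ are the Jacobians of the equilibrium position and the stiffness with respect to motor commands, while $\mathbf{P}_{q_0} \in \mathbb{R}^{n \times p}$ and $\mathbf{P}_k \in \mathbb{R}^{n^2 \times p}$ are the corresponding Jacobians with respect to the state.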

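To simultaneously control equilibrium position and stiffness, we can invert this relationship to yield¹

$$\dot{\mathbf{u}} = \mathbf{J}^{\dagger}\mathbf{r} + (\mathbf{I} - \mathbf{J}^{\dagger}\mathbf{J})\,\mathbf{u}_0 \tag{9}$$

where $\mathbf{r} = ((\dot{\mathbf{q}}_0 - \mathbf{P}_{q_0}\dot{\mathbf{x}})^T, (\dot{\mathbf{k}} - \mathbf{P}_k\dot{\mathbf{x}})^T)^T \in \mathbb{R}^{n + n^2}$, $\mathbf{J} = (\mathbf{J}_{q_0}^T, \mathbf{J}_k^T)^T$ is the combined (stacked) Jacobian, $\mathbf{I}$ is the identity matrix, $\mathbf{J}^{\dagger}$ denotes the Moore-Penrose pseudoinverse of $\mathbf{J}$ and $\mathbf{u}_0$ is an arbitrary vector. The latter can be used to resolve any further redundancy in the actuation (such as additional actuators used for varying damping [8]).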

Application of (9) requires state derivatives, provided by feedback, or calculated from the analytical model of the system dynamics. To avoid the requirement on analytical modelling, and also to circumvent the noise and phase-lag issues related to the feedback on $\dot{\mathbf{x}}$, we use on-line feedback from the current stiffness and equilibrium state, i.e., we choose $\mathbf{r}$ according to the difference in the desired and actual equilibrium and stiffness values, $\mathbf{r} = ((\mathbf{q}_0^* - \mathbf{q}_0)^T, (\mathbf{k}^* - \mathbf{k})^T)^T$. This solution is similar to Closed-Loop Inverse Kinematic (CLIK) control [3], and also mitigates instabilities due to constraint drift [2]. For this, since we cannot directly measure the stiffness and equilibrium position on-line, we use (5) and (6) to estimate the current values based on the current estimate of the state.
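The closed-loop form of (9) can be sketched on a toy single-joint actuator as follows. The coupled model in `impedance` (equilibrium set by `u1`, stiffness depending on both commands), the feedback `gain` and the Euler integration are assumptions made purely for illustration; they are not one of the actuators or the exact update used in the paper. Because the toy Jacobian is square and full-rank, its pseudoinverse coincides with the ordinary inverse and the null-space term of (9) vanishes.

```python
import math


def impedance(u):
    """Hypothetical coupled VSA: returns (equilibrium position, stiffness)."""
    u1, u2 = u
    return u1, u2 * (1.0 + 0.5 * math.cos(u1))


def jacobian(u):
    """Analytic Jacobian of (q0, k) with respect to the commands (u1, u2)."""
    u1, u2 = u
    return [[1.0, 0.0],
            [-0.5 * u2 * math.sin(u1), 1.0 + 0.5 * math.cos(u1)]]


def solve2(J, r):
    """Solve J x = r for a full-rank 2x2 J (here the pseudoinverse is the inverse)."""
    (a, b), (c, d) = J
    det = a * d - b * c
    return [(d * r[0] - b * r[1]) / det, (-c * r[0] + a * r[1]) / det]


def track(u, q0_des, k_des, gain=20.0, dt=0.02, steps=200):
    """Integrate the command velocity from the tracking error (CLIK-style)."""
    u = list(u)
    for _ in range(steps):
        q0, k = impedance(u)
        # Assumed proportional feedback on the equilibrium and stiffness errors.
        r = [gain * (q0_des - q0), gain * (k_des - k)]
        u_dot = solve2(jacobian(u), r)
        u = [u[0] + dt * u_dot[0], u[1] + dt * u_dot[1]]
    return u


u_final = track([0.0, 0.5], q0_des=0.4, k_des=1.2)
q0_final, k_final = impedance(u_final)
```

With the assumed gain and a 50 Hz update, the estimated equilibrium position and stiffness converge to the requested values even though the commands are coupled through the stiffness model.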

¹ We omit the dependence on $\mathbf{x}$ and $\mathbf{u}$ for readability.

C. Equilibrium and Stiffness Tracking in Task Space

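The approach described so far can also be extended to equilibrium and stiffness tracking in task (e.g., end-effector) space coordinates. In task space, the restoring force in response to a perturbation is

$$\mathbf{F}_s = -\mathbf{K}_s(\mathbf{x}, \mathbf{u})\,\delta\mathbf{s} \in \mathbb{R}^q \tag{10}$$

where $\mathbf{s} \in \mathbb{R}^q$ is a vector of task space coordinates and $\mathbf{K}_s \in \mathbb{R}^{q \times q}$ is the task space stiffness. This force is related to the joint torques through the relationship

$$\boldsymbol{\tau} = \mathbf{W}(\mathbf{q})^T \mathbf{F}_s \tag{11}$$

where $\mathbf{W}(\mathbf{q}) \in \mathbb{R}^{q \times n}$ is the Jacobian from joint to task space²: $\delta\mathbf{s} = \mathbf{W}\delta\mathbf{q}$. Substituting this and (10) into (11), we find

$$\boldsymbol{\tau} = -\mathbf{W}^T \mathbf{K}_s \mathbf{W}\,\delta\mathbf{q} = -\mathbf{K}\,\delta\mathbf{q}.$$

Assuming that $\mathbf{W}$ is square and full-rank (i.e., $q = n$), by elimination we can then identify the task space stiffness

$$\mathbf{K}_s = (\mathbf{W}^T)^{-1} \mathbf{K} \mathbf{W}^{-1}. \tag{12}$$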

Similar to the case of joint space stiffness tracking, we can conveniently write $\mathbf{k}_s = \mathrm{vec}(\mathbf{K}_s) \in \mathbb{R}^{q^2}$ and then derive the task space stiffness Jacobian $\mathbf{J}_{k_s} \in \mathbb{R}^{q^2 \times m}$ with respect to the motor commands $\mathbf{u}$.

The task space equilibrium position $\mathbf{s}_0$ can be found by solving

$$\mathbf{F}_s = (\mathbf{W}^T)^{-1} \boldsymbol{\tau} = 0. \tag{13}$$

For non-redundant robots³, if we have the expression (5), we can calculate $\mathbf{s}_0$ directly by mapping the joint space equilibrium position $\mathbf{q}_0$ through the forward kinematics function, since at this point $\boldsymbol{\tau} = 0$, which implies $\mathbf{F}_s = 0$ through (13). Again the Jacobian $\mathbf{J}_{s_0} \in \mathbb{R}^{q \times m}$ may be derived either in closed form or numerically through finite differences.

For equilibrium position and stiffness tracking in task space, we then take a similar approach to that described in Sec. III-B, replacing $\mathbf{J}_{q_0}, \mathbf{J}_k$ with $\mathbf{J}_{s_0}, \mathbf{J}_{k_s}$ in (9).
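To make the mapping (12) concrete, the sketch below computes the task space stiffness of a planar 2-link arm from a joint stiffness matrix. The link lengths, joint configuration and stiffness values are hypothetical, and the small matrix helpers are written out only to keep the example self-contained.

```python
import math


def matmul(A, B):
    """Product of two matrices given as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def transpose(A):
    return [list(row) for row in zip(*A)]


def inv2(A):
    """Inverse of a full-rank 2x2 matrix."""
    (a, b), (c, d) = A
    det = a * d - b * c
    if abs(det) < 1e-12:
        raise ValueError("Jacobian is singular at this configuration")
    return [[d / det, -b / det], [-c / det, a / det]]


def arm_jacobian(q, l1=0.3, l2=0.3):
    """Jacobian W(q) of a planar 2-link arm (hypothetical link lengths in m)."""
    s1, c1 = math.sin(q[0]), math.cos(q[0])
    s12, c12 = math.sin(q[0] + q[1]), math.cos(q[0] + q[1])
    return [[-l1 * s1 - l2 * s12, -l2 * s12],
            [l1 * c1 + l2 * c12, l2 * c12]]


def task_space_stiffness(K, W):
    """K_s = (W^T)^{-1} K W^{-1}, as in (12), for square full-rank W."""
    W_inv = inv2(W)
    return matmul(matmul(transpose(W_inv), K), W_inv)


K = [[2.0, 0.0], [0.0, 1.0]]      # assumed joint stiffness [Nm/rad]
W = arm_jacobian([0.5, 1.0])      # assumed joint configuration [rad]
K_s = task_space_stiffness(K, W)
# Mapping back through tau = -W^T K_s W dq should recover the joint stiffness.
K_back = matmul(matmul(transpose(W), K_s), W)
```

Note that even a diagonal joint stiffness generally produces a task space stiffness with off-diagonal terms, which is why the configuration-dependent Jacobian has to be accounted for when tracking end-effector stiffness.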

D. Optimal Control with Constrained Stiffness

Finally, we briefly describe how the framework developed so far can be used in combination with optimal control techniques to place constraints on the change in stiffness.

In general, the optimal control problem is to find commands $\mathbf{u}^*$ that minimise a cost function of the form

$$J = h(\mathbf{x}(T)) + \int_0^T l(\mathbf{x}, \mathbf{u}, t)\,dt \in \mathbb{R} \tag{14}$$

under dynamics of the form

$$\dot{\mathbf{x}} = \mathbf{f}(\mathbf{x}, \mathbf{u}) \in \mathbb{R}^p. \tag{15}$$

In many cases, we may wish to constrain the optimisation, subject to higher priority considerations, for example, by seeking the optimal controls subject to maintaining a particular stiffness profile for safety reasons. We can impose such a constraint using our framework by reformulating the above problem in the command velocity domain.

² We omit the dependence on $\mathbf{q}$ for readability.

³ Note that, for redundant robots, a similar analysis applies but with the inverses in (12)-(13) replaced by pseudo-inverses. This has the implication that there may exist multiple joint-space equilibria $\mathbf{q}_0$ for a given task space equilibrium $\mathbf{s}_0$.

Specifically, we use the augmented state $\mathbf{y} = (\mathbf{y}_1, \mathbf{y}_2)^T = (\mathbf{x}, \mathbf{u})^T \in \mathbb{R}^{p+m}$ and command $\mathbf{v} = \dot{\mathbf{u}} \in \mathbb{R}^m$, and seek the optimal controls $\mathbf{v}^*$ in command velocity space with respect to (14) under the constrained dynamics

$$\dot{\mathbf{y}} = \mathbf{g}(\mathbf{y}, \mathbf{v}) = \begin{pmatrix} \mathbf{f}(\mathbf{y}_1, \mathbf{y}_2) \\ \mathbf{J}_k^{\dagger}\,\mathbf{r}(\mathbf{y}_1, \mathbf{y}_2) + (\mathbf{I} - \mathbf{J}_k^{\dagger}\mathbf{J}_k)\,\mathbf{v} \end{pmatrix} \in \mathbb{R}^{p+m} \tag{16}$$

where $\mathbf{J}_k$ is the stiffness Jacobian, $\mathbf{r} = \mathbf{k}^* - \mathbf{k}(\mathbf{y}_1, \mathbf{y}_2)$ and $\mathbf{k}^*$ is the desired stiffness. Reformulating the problem in this way ensures that the control sequence is optimised in the null-space of the stiffness Jacobian. This means that whatever control sequence comes out of the optimisation will have no effect on the stiffness profile. In Sec. IV-D we briefly illustrate how such an optimisation can be used in the context of establishing the benefits of variable stiffness designs over a fixed stiffness actuator.
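The lower block of (16) can be illustrated for a single stiffness value and two commands, where the stiffness Jacobian is a 1×2 row vector and its pseudoinverse has a simple closed form. The numerical values of `J_k` and `v` below are arbitrary placeholders, and the function name is hypothetical.

```python
def constrained_command_rate(J_k, r, v):
    """Command velocity J_k^+ r + (I - J_k^+ J_k) v for a single-row Jacobian J_k."""
    jj = sum(j * j for j in J_k)           # J_k J_k^T (a scalar here)
    J_pinv = [j / jj for j in J_k]         # J_k^+ = J_k^T (J_k J_k^T)^{-1}
    Jv = sum(j * vi for j, vi in zip(J_k, v))
    # Particular solution tracking r, plus v with its stiffness-changing part removed.
    return [p * r + vi - p * Jv for p, vi in zip(J_pinv, v)]


J_k = [-0.2, 1.4]                           # assumed stiffness Jacobian dk/du
v = [3.0, -1.0]                             # arbitrary command proposed by an optimiser
u_dot = constrained_command_rate(J_k, r=0.0, v=v)
k_dot = sum(j * ud for j, ud in zip(J_k, u_dot))   # resulting rate of change of stiffness
```

Whatever `v` the optimiser proposes, the projected command velocity changes the stiffness only at the rate `r` (zero here), which is the sense in which the optimisation runs in the null-space of the stiffness Jacobian.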

IV. EXPERIMENTS

In this section, we report numerical simulations and experiments applying our method to the control of several variable stiffness devices. We first illustrate the basic tracking capability of our method on three simulated, single-joint VSAs given desired equilibrium position and stiffness profiles. We then test the scalability to plants of higher dimensionality in the context of task-space position and stiffness tracking. We illustrate our method's use in online, interactive control of stiffness in a teleoperation task in hardware. Finally, we illustrate how our method can be applied in the optimal control setting, for analysing the benefits of variable stiffness actuation over traditional fixed-stiffness devices.

A. Basic Tracking Behaviour

We first test the tracking quality achieved when applying our approach to a number of different VSAs with differing designs in simulation. For this, we apply the approach outlined in Sec. III-B to control the three single-joint VSAs described in Sec. II, namely, the Ideal VSA, the MACCEPA and the Edinburgh SEA, to track simple, pre-specified joint equilibrium position and stiffness trajectories. Specifically, the task here is to track a desired stiffness profile of the form

$$k^*(t) = \tfrac{1}{2} A_k (\sin(\omega_k t) + 1) + b_k \tag{17}$$

and equilibrium position

$$q_0^*(t) = \tfrac{1}{2} A_{q_0} (\sin(\omega_{q_0} t) + 1) + b_{q_0} \tag{18}$$

where the amplitude $A_k = k_{max} - k_{min}$ corresponds to the full range of stiffness values (between minimum stiffness $k_{min}$ and maximum stiffness $k_{max}$ for the specific actuator) and similarly $A_{q_0} = q_{0,max} - q_{0,min}$ is the amplitude for the equilibrium position. The offsets $b_k = k_{min}$ and $b_{q_0} = q_{0,min}$ ensure that the desired trajectories stay within the admissible limits of the VSAs. We arbitrarily selected $\omega_k = 2$ and $\omega_{q_0} = 3$ rad/s. The trajectories were tracked for 4 s, with control at a rate of 50 Hz.

In Fig. 2, we show the tracking performance in terms of the equilibrium position and stiffness, the command sequence generated with the tracking controller (9), and the resultant trajectory of the joint, for the three VSAs. Looking at the command sequence (bottom row), we note that the controller generates a different sequence of commands for each of the three VSAs. However, when we look at the equilibrium position (top row) and stiffness profiles (middle row) we see that there is good agreement between the realised trajectories $(q_0, k)$ and the desired $(q_0^*, k^*)$ in all cases. This is confirmed further by the RMSE between the desired and actual trajectories estimated over the duration of the movement (see Table I), which is uniformly low.

Fig. 2. Simultaneous tracking of sinusoidal equilibrium position and stiffness profiles on the three single-joint VSAs (columns: Ideal VSA, MACCEPA, Edinburgh SEA). Top row: desired equilibrium position $q_0^*$ (light red), realised equilibrium position $q_0$ (dashed black) and actual joint position $q$ (medium grey), in rad against t (s). Middle row: desired stiffness $k^*$ (light red) and realised stiffness $k$ (dashed black), in Nm/rad against t (s). Bottom row: Motor commands $u_1$ (solid black) and $u_2$ (dashed black) against t (s).

TABLE I. Error in tracked equilibrium position and stiffness for the three simulated single-joint VSAs.

                    RMSE(q0, q0*)   RMSE(k, k*)
Ideal 1-link VSA    0.000000        0.000000
MACCEPA             0.000000        0.000218
Edinburgh SEA       0.000132        0.000308

It is also interesting to note how, due to the different dynamics of the three plants, the actual position of the joint $q$ lags or oscillates around the commanded equilibrium position, especially when the joint stiffness is very low (around 2 s into the movement). This is to be expected since, in the low gain control realised here, the response of the system comes as a combined effect of the plant dynamics and the control actions.
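The reference profiles (17) and (18) share one functional form, which can be sketched as below. The frequencies (2 and 3 rad/s), duration (4 s) and control rate (50 Hz) follow the text, whereas the admissible ranges `K_MIN`, `K_MAX`, `Q0_MIN` and `Q0_MAX` are placeholder values, since the actuator limits are not listed here.

```python
import math


def reference(t, lo, hi, omega):
    """Sinusoid of the form (17)/(18): amplitude hi - lo, offset lo."""
    amplitude = hi - lo
    return 0.5 * amplitude * (math.sin(omega * t) + 1.0) + lo


RATE_HZ = 50                      # control rate from the text
DURATION_S = 4                    # tracking duration from the text
K_MIN, K_MAX = 0.1, 0.9           # assumed stiffness limits [Nm/rad]
Q0_MIN, Q0_MAX = -1.0, 1.0        # assumed equilibrium position limits [rad]

times = [i / RATE_HZ for i in range(DURATION_S * RATE_HZ + 1)]
k_ref = [reference(t, K_MIN, K_MAX, omega=2.0) for t in times]
q0_ref = [reference(t, Q0_MIN, Q0_MAX, omega=3.0) for t in times]
```

Because the sinusoid is shifted by one and scaled by half the range, both profiles start at the midpoint of their range and never leave the interval between the minimum and maximum values.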

B. End-effector Stiffness Tracking on Multi-link Systems

Our second numerical investigation tests the scalability of our approach for end-effector stiffness tracking (cf. Sec. III-C) on two higher-dimensional VSA systems. For this, we used (i) an ideal 2-link VSA, and (ii) a biologically plausible model of the human arm with muscle-like actuators (schematic diagrams are provided in Fig. 3). The former can be considered a generalisation of the ideal single-joint VSA, in which the joint torques due to the controls are given by

$$\boldsymbol{\tau} = -\mathbf{K}(\mathbf{q} - \mathbf{q}_0) \tag{19}$$

and the equilibrium position and stiffness are directly controllable, i.e., $\mathbf{u} = (\mathbf{q}_0, \mathrm{vec}(\mathbf{K}))^T$. The latter is controlled through a system of 6 muscles with Kelvin-Voigt muscle dynamics [7]. Specifically, the control vector $\mathbf{u} \in \mathbb{R}^6$ represents muscle activations with a non-linear relationship to the applied torques

$$\boldsymbol{\tau} = -\mathbf{A}^T \mathbf{T}(\mathbf{q}, \dot{\mathbf{q}}, \mathbf{u}) \tag{20}$$

Fig. 3. Dynamics models for the two 2-link variable stiffness actuators: (a) Ideal 2-link VSA, (b) 6-muscle arm model.

Fig. 4. Tracking of equilibrium position and stiffness profiles in end-effector space on the ideal 2-DOF VSA (top) and the 6-muscle arm model (bottom). Shown are: (a) & (d) desired end-effector equilibrium positions $\mathbf{s}_0^*$ (light red), realised end-effector equilibrium positions $\mathbf{s}_0$ (dashed black) and actual end-effector positions $\mathbf{s}$ (grey) during the movement, in m against t (s); (b) & (e) desired end-effector stiffness $\mathbf{k}_s^*$ and realised end-effector stiffness $\mathbf{k}_s$, in N/m against t (s); (c) & (f) stroboscopic plots of the resultant behaviour with the end-effector path shown in light grey.

where $\mathbf{A} \in \mathbb{R}^{2 \times 6}$ is a matrix of moment arms and $\mathbf{T} \in \mathbb{R}^6$ are the muscle tensions (for space reasons, we refer the reader to [7] for full details of the model).

The desired equilibrium position trajectory $\mathbf{s}_0^*(t) \in \mathbb{R}^2$ is a figure-8 in end-effector space, while the desired end-effector stiffness $\mathbf{K}_s^*(t)$ was designed to switch from low to high stiffness in the x-direction, and from high to low in the y-direction (see Fig. 4). Note that only the diagonal elements of $\mathbf{K}_s$ were controlled (i.e., $\mathbf{k}_s = \mathrm{diag}(\mathbf{K}_s) \in \mathbb{R}^2$); the remaining redundancy⁴ was resolved through a null-space policy $\mathbf{u}_0 = -\alpha(\mathbf{u} - \mathbf{u}_d)$ where $\mathbf{u}$ is the active command, $\mathbf{u}_d$ is a default command vector (selected for each of the plants individually) and $\alpha$ is a scaling factor. The trajectories were tracked for 12 s at a control rate of 150 Hz.

The results are shown in Fig. 4. As can be seen, for the ideal 2-link VSA there is excellent agreement between the desired and tracked equilibrium positions and stiffness (compare light red and dashed black trajectories in Fig. 4(a)-(b)). The tracking for the muscle model is also fairly accurate (Fig. 4(d)-(e)), although there are some 'perturbations' away from the desired profiles. Upon examination, we found this to be due to the controller hitting command limits (in the muscle model the commands are constrained such that $\mathbf{u} \geq 0$, see [7]). This may be alleviated by taking command constraints into account explicitly in the controller design (e.g., using unilateral constraints). However, we note that even without this, the controlled trajectory rapidly converges back to the desired profile when the configuration moves away from these limits.

⁴ Note that here the full control dimensionality of the two actuators under consideration is $\mathbf{u} \in \mathbb{R}^6$; however, since we only control $\mathbf{s}_0 \in \mathbb{R}^2$ and the diagonal elements $\mathbf{k}_s \in \mathbb{R}^2$, the effective control is $\mathbf{r}^* = (\mathbf{s}_0, \mathbf{k}_s)^T \in \mathbb{R}^4$. This effectively leaves two dimensions of redundancy.

Looking at the behaviour (stroboscopic plots in Fig. 4(c), (f)), we see that the actual trajectory of the arm behaves like that of a variable-gain controller. For example, when the stiffness in x is low (first 6 s of movement), tracking of the figure-8 suffers in this dimension (see light grey line $s_x$ in Fig. 4(a)), but is regained once the stiffness in x returns to a high value (final 6 s). The opposite trend can be observed in the y dimension, where stiffness starts high and switches to low. This confirms our expectations about the behaviour under variable end-effector stiffness.

C. Tracking Human Impedance Profiles

In this section we illustrate how the proposed approach can be applied to online control of VSAs in an experiment in hardware. For this, we chose to investigate a teleoperation task in which a human operator controls equilibrium position and stiffness of a MACCEPA joint via recordings of his muscle activations. The experimental setup is as follows.

The human operator was fitted with a pair of surface EMG sensors on the wrist extensor and flexor muscles of the forearm (see Fig. 5) that provide streaming data on the activation of the muscles. The raw data was filtered through a band pass filter to remove the lowest and highest frequency components and smooth out noise. The resultant activation data $\mathbf{a} = (a_{ext}, a_{flex})^T$ was then pre-processed in such a way as to predict the human-commanded equilibrium position $q_0^*$ and stiffness $k^*$ of the wrist. Specifically, as a measure of stiffness we used the co-contraction level

$$k^* = g_k \min(a_{ext}, a_{flex}) + c_k, \tag{21}$$

and as a measure of equilibrium positions we used the signal difference

$$q_0^* = g_{q_0}(a_{ext} - a_{flex}) \tag{22}$$

where $g_{q_0}$ and $g_k$ are gain parameters that scale the effect of the EMG on the commanded stiffness and equilibrium positions, and $c_k = k_{min}$ is an offset parameter that ensures the commanded stiffness never falls below the minimum achievable stiffness for the VSA.
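The mapping (21)-(22) from filtered EMG activations to impedance targets amounts to a few lines of code. In the sketch below the gains and the minimum stiffness are placeholder values chosen for illustration, and the function name is hypothetical; the paper does not report the values used.

```python
def emg_to_impedance(a_ext, a_flex, g_k, g_q0, k_min):
    """Map filtered EMG activations to (q0*, k*) following (21) and (22)."""
    k_des = g_k * min(a_ext, a_flex) + k_min     # co-contraction level plus offset c_k = k_min
    q0_des = g_q0 * (a_ext - a_flex)             # signal difference
    return q0_des, k_des


# Assumed gains and minimum stiffness (illustrative only).
GAINS = dict(g_k=2.0, g_q0=1.5, k_min=0.05)

relaxed = emg_to_impedance(0.0, 0.0, **GAINS)          # no muscle activity
extension = emg_to_impedance(0.6, 0.1, **GAINS)        # extensor dominates
co_contracted = emg_to_impedance(0.7, 0.7, **GAINS)    # equal, high activation
```

Equal activation of both muscles leaves the commanded equilibrium at zero while raising the stiffness, whereas unequal activation shifts the equilibrium and uses the weaker of the two signals as the co-contraction level.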

In Fig. 5 we show results over 20 s of operation. These are broken into three phases: (i) alternating left-right hand movement with low stiffness (muscles relaxed), (ii) alternation between low and high stiffness at $q = 0$ (relaxed/co-contracted muscles, respectively) and (iii) alternating left-right hand movement with high stiffness (muscles co-contracted). The first and last conditions are indicated by the shaded regions in the plots.

As can be seen, during phase (i), the EMG signals indicate alternating activation between the two muscles, resulting in a left-right movement of the desired equilibrium position. The robot tracks this movement with considerable accuracy, albeit with a slight time delay, which we attribute to the limited speed of the servos used in the device. We also note that there is some small level of co-activation in the EMG signals even in this relaxed state: this is also tracked as small increases in stiffness. During phase (ii), the hand remains at the rest position $q = 0$ and the operator co-contracts twice. As can be seen, this causes two spikes in the stiffness profile, which are also accurately tracked. It is interesting to note that, in the plot of the commands to the MACCEPA, the controller primarily relies on the second (pre-tensioning) motor for this, since there is a linear dependence between $u_2$ and stiffness at equilibrium. Finally, during phase (iii) we again see good tracking of the equilibrium position with increased overall stiffness, despite the relatively high noise in the recorded EMG. The performance of the controller can be further verified in the accompanying video.

D. Constrained-stiffness Optimal Control

In our final investigation, we briefly illustrate the use of our approach for analysing the benefits of VSAs over traditional, fixed-stiffness actuators. For this, we use the method described in Sec. III-D to compare the optimal behaviour of the three example 1-DOF VSAs described in Sec. II in a ball-hitting task, similar to that described in [6]. The set-up was as follows.

We used the iterative Linear Quadratic Regulator (iLQR) algorithm [14] to seek local optimal controllers (OCs) that minimise the objective

$$J = w_1 (q(T) - q^*)^2 - w_2\,\dot{q}(T) + \int_0^T w_3 \tau^2\,dt \tag{23}$$

where $q^* = 30^\circ$ is the target angle (corresponding to the angle of impact with the ball), $\tau$ is the applied joint torque and $w_i$, $i \in \{1, 2, 3\}$ are weight factors that determine the relative importance of the three terms. These, respectively, correspond to (i) minimising the distance to the target (ball) at the time of impact $T$, (ii) maximising the angular velocity at $T$, and (iii) minimising effort during the movement.

We performed the optimisation for each of the VSAs, (i) with no constraint, and (ii) with the constraint that the stiffness must remain fixed throughout the movement at the initial value $k_0$. In the latter case, the constraint is enforced in the manner described in Sec. III-D by specifying $k^*(t) = k_0$ throughout the movement.

In Fig. 6 we plot the stiffness profile, joint positions and joint velocities generated by the OCs for the three VSAs, under the two conditions.

The first thing we see, looking at Fig. 6 (top row), is that the stiffness constraint is successfully enforced with good accuracy, albeit with some small deviation from the desired stiffness toward the end of the movement. We attribute these small errors to constraint drift, and to the online feedback not fully compensating for the non-linear dynamics (both of which can be exacerbated in high velocity movements, as here). These errors, however, are negligible in comparison to the overall variation in stiffness seen in the unconstrained stiffness profiles (compare light red and black lines in Fig. 6, top row).

Looking at the behaviour (Fig. 6, middle and bottom rows), we see that if the stiffness is allowed to vary freely (light red lines), the OCs exploit this additional degree of redundancy to improve task performance compared to the case where the stiffness is fixed (black lines). For example, comparing the variable against the fixed stiffness behaviour we see that (i) the variable stiffness controllers come closer

Fig. 5. Teleoperated control of equilibrium position and stiffness on the MACCEPA. Shown are human EMG signals, equilibrium position, stiffness, robotmotor commands. Light-red and black-dashed lines denote desired trajectories (predicted from the human data via (21) & (22)) and realised trajectories,respectively.

Ideal VSA MACCEPA Edinburgh SEA

Stiffness

0 0.1 0.2 0.3 0.40.09

0.1

0.11

k (

Nm

/rad)

t (s)

0 0.1 0.2 0.3 0.4

0.08

0.09

k (

Nm

/ra

d)

t (s)0 0.1 0.2 0.3 0.4

0.3

0.32

k (

Nm

/rad)

t (s)

Position

0 0.1 0.2 0.3 0.4

−0.4

−0.2

0

0.2

q (

rad

)

t

0 0.1 0.2 0.3 0.4

−0.4

−0.2

0

0.2

q (

rad

)

t

0 0.1 0.2 0.3 0.4

−0.4

−0.2

0

0.2

q (

rad

)

t

Velocity

0 0.1 0.2 0.3 0.4−2

02468

dq

/dt

(ra

d/s

)

t (s)

Variable k

Fixed k

0 0.1 0.2 0.3 0.40

2

4

dq/d

t (r

ad/s

)

t (s)0 0.1 0.2 0.3 0.4

−2

0

2

4

6

dq/d

t (r

ad/s

)

t (s)

Fig. 6. Optimised ball hitting behaviour for the three single-joint VSAswhen stiffness is (i) fixed (constrained) to the initial value (black), and (ii)allowed to vary freely (light red). Shown are: joint stiffness k (top row),joint positions q (middle row) and joint velocity q (bottom row) during themovement. The position of the target (ball) is indicated by ‘o’.

Fixed k Variable kIdeal 1-link VSA −0.242372 −0.489173

MACCEPA −0.075511 −0.274747Edinburgh SEA −0.076414 −0.356527

TABLE II

COST INCURRED BY OPTIMAL CONTROLLERS FOR THE THREE

SINGLE-JOINT VSAS PERFORMING THE BALL-HITTING TASK UNDER

CONDITIONS OF FIXED OR VARIABLE STIFFNESS.

to the target (Fig. 6, middle row) and (ii) their end-timevelocity is greater (Fig. 6, bottom row). The benefit ofvariable stiffness is further confirmed by comparing the costincurred during the movement under the different conditions,which is uniformly lower when the stiffness is allowed tovary (see Table II).
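To make the comparison of costs concrete, the objective (23) can be evaluated on a sampled trajectory as sketched below. This is an illustrative sketch only: the function name `ball_hitting_cost`, the default weights and the use of the trapezoidal rule for the effort integral are assumptions, since the weights and the discretisation used in the experiments are not stated here.

```python
import math


def ball_hitting_cost(q, dq, tau, dt, q_target=math.radians(30.0), w=(1.0, 1.0, 1.0)):
    """Evaluate objective (23) on a uniformly sampled trajectory.

    q, dq, tau : sequences of joint angle [rad], joint velocity [rad/s] and
                 joint torque [Nm], sampled every dt seconds from t = 0 to t = T
    q_target   : target angle q* [rad]
    w          : weights (w1, w2, w3); the defaults are hypothetical
    """
    if not (len(q) == len(dq) == len(tau)) or len(q) < 2:
        raise ValueError("q, dq and tau need the same length (at least 2 samples)")
    w1, w2, w3 = w
    # Terminal terms: squared distance to the target minus the rewarded final velocity.
    terminal = w1 * (q[-1] - q_target) ** 2 - w2 * dq[-1]
    # Effort term: trapezoidal approximation of the integral of tau^2 over [0, T].
    effort = sum(0.5 * (a * a + b * b) * dt for a, b in zip(tau[:-1], tau[1:]))
    return terminal + w3 * effort


if __name__ == "__main__":
    # A made-up two-sample trajectory that reaches the target with velocity
    # 2 rad/s and no effort, so only the velocity reward contributes.
    q_star = math.radians(30.0)
    print(ball_hitting_cost([0.0, q_star], [0.0, 2.0], [0.0, 0.0], dt=0.4))
```

Because the velocity term enters with a negative sign, the total cost can be negative when a high end-time velocity is reached near the target with little effort, which is why the entries of Table II are negative and why a lower (more negative) value indicates better performance.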

V. CONCLUSION

In conclusion, we have presented a novel model-based method for control of variable stiffness actuators using constraints on equilibrium positions and stiffness in task and joint space. The proposed approach is generic by its formulation, and can be applied to many different designs of variable stiffness devices for accurate tracking of desired stiffness and equilibrium position profiles. Furthermore, as shown in simulation and experiment, it is fast to compute and can be used with ease for online stiffness control, such as in the teleoperation setting explored here.

In future work, we intend to exploit this method as a tool for (i) assessing in detail the benefits of variable stiffness actuation as compared to traditional, fixed-stiffness actuators, and (ii) evaluating methods for the transfer of human impedance behaviour to artificial systems. Furthermore, we intend to explore extensions of the method, for example to incorporate unilateral constraints, so that safety limits (such as limits on the maximum admissible stiffness) may be realised with arbitrary VSA hardware designs.

VI. ACKNOWLEDGEMENTS

This work was funded by the EU Seventh Framework Programme (FP7) as part of the STIFF project.

REFERENCES

[1] Alin Albu-Schaeffer, Christian Ott, and Gerd Hirzinger. A unified passivity-based control framework for position, torque and impedance control of flexible joint robots. Int. J. Robotics Res., 26:23–39, 2007.

[2] David J. Braun and Michael Goldfarb. Eliminating constraint drift in the numerical simulation of constrained dynamical systems. Computer Methods in Appl. Mech. and Engineering, 198:3151–3160, 2009.

[3] Pasquale Chiacchio, Stefano Chiaverini, Lorenzo Sciavicco, and Bruno Siciliano. Closed-loop inverse kinematics schemes for constrained redundant manipulators with task space augmentation and task priority strategy. Int. J. Robotics Res., 10:410–425, 1991.

[4] Gowrishankar Ganesh, Alin Albu-Schaeffer, Masahiko Haruno, Mitsuo Kawato, and Etienne Burdet. Biomimetic motor behavior for simultaneous adaptation of force, impedance and trajectory in interaction tasks. ICRA, 2010.

[5] R. Van Ham, B. Vanderborght, M. Van Damme, B. Verrelst, and D. Lefeber. MACCEPA, the mechanically adjustable compliance and controllable equilibrium position actuator: Design and implementation in a biped robot. Robotics and Auton. Systems, (10):761–768, 2007.

[6] M. Howard, D. Mitrovic, and S. Vijayakumar. Transferring impedance control strategies between heterogeneous systems via apprenticeship learning. Humanoids, 2010.

[7] Masazumi Katayama and Mitsuo Kawato. Virtual trajectory and stiffness ellipse during multijoint arm movement predicted by neural inverse models. Biol. Cybern., 69:353–362, 1993.

[8] Matteo Laffranchi, Nikos G. Tsagarakis, and Darwin G. Caldwell. A variable physical damping actuator (VPDA) for compliant robotic joints. ICRA, 2010.

[9] A. Liegeois. Automatic supervisory control of the configuration and behavior of multibody mechanisms. IEEE Trans. Sys., Man and Cybernetics, SMC-7:245–250, 1977.

[10] Alessandro De Luca, Riccardo Farina, and Pasquale Lucibello. On the control of robots with visco-elastic joints. ICRA, 2005.

[11] Djordje Mitrovic, Stefan Klanke, and Sethu Vijayakumar. Exploiting sensorimotor stochasticity for learning control of variable impedance actuators. Int. J. Robotics Res., 2010.

[12] Jan Peters, Michael Mistry, Firdaus Udwadia, Rick Cory, Jun Nakanishi, and Stefan Schaal. A unifying methodology for the control of robotic systems. IROS, 2005.

[13] Kenji Tahara, Suguru Arimoto, Masahiro Sekimoto, and Zhi-Wei Luo. On control of reaching movements for musculo-skeletal redundant arm model. Applied Bionics and Biomechanics, 6:11–26, 2009.

[14] Emanuel Todorov and Weiwei Li. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems. American Control Conference, 2005.

[15] Sebastian Wolf and Gerd Hirzinger. A new variable stiffness design: Matching requirements of the next robot generation. ICRA, 2008.
