GEOMETRIC CONTROL OF A BINOCULAR HEAD

Eduardo Bayro-Corrochano and Julio Zamora-Esquivel

Department of Electrical Engineering and Computer Science

CINVESTAV, unidad Guadalajara. Jalisco, Mexico

Keywords:

Conformal Geometry, Kinematics, Tracking.

Abstract:

In this paper the authors use geometric algebra to formulate the differential kinematics of a binocular robotic

head and reformulate the interaction matrix in terms of the lines that represent the principal axes of the camera.

This matrix relates the velocities of 3D objects and the velocities of their images in the stereo images. Our

main objective is the formulation of a kinematic control law in order to close the loop between perception and

action, which allows to perform a smooth visual tracking.

1 INTRODUCTION

In this work we formulate the problem of visual track-

ing and design a control law by velocity feedback

that allows us to close the loop between perception

and action. Geometric algebra allow us to work

with geometric entities like points, lines and planes

and helps in the representation of rigid transforma-

tions. In this mathematical framework we straightfor-

wardly formulate the direct and differential kinemat-

ics of robotic devices like the binocular robot head.

On the other hand we show a reformulation of vi-

sual Jacobean which relates the velocity of a tridimen-

sional object with the velocity of its projection onto

the stereo camera images. Finally we write an ex-

pression that relates the joint velocities in the pan-tilt

unit and velocities of the points in the camera image.

We start this work presenting a brief description of

the geometric entities and conformal transformations

them we show the kinematics of a pan-tilt unit and

formulate its control law.

In contrast to other authors like (C. Canudas de

Wit and Bastin, 1996), (Ruf, 2000) or (Kim Jung-Ha.,

1990) we will use multivectors instead of matrices to

formulate the control law it reduces the computation

and improve the performance of the controller.

2 GEOMETRIC ALGEBRA: AN

OUTLINE

Let

denote the geometric algebra of n-dimensions,

this is a graded linear space. As well as vector

addition and scalar multiplication we have a non-

commutative product which is associative and dis-

tributive over addition – this is the geometric or Clif-

ford product.

The inner product of two vectors is the standard

scalar or dot product and produces a scalar. The outer

or wedge product of two vectors is a new quantity

which we call a bivector. We think of a bivector as a

oriented area in the plane containing a and b, formed

by sweeping a along b.

Thus, b∧a will have the opposite orientation mak-

ing the wedge product anti-commutative. The outer

product is immediately generalizable to higher di-

mensions – for example, (a ∧ b) ∧ c, a trivector, is

interpreted as the oriented volume formed by sweep-

ing the area a∧ b along vector c. The outer product of

k vectors is a k-vector or k-blade, and such a quantity

is said to have grade k. A multivector (linear combi-

nation of objects of different type) is homogeneous if

it contains terms of only a single grade.

2.1 The Geometric Algebra of n-D

Space

In this paper we will specify a geometric algebra

of the n dimensional space by G

p,q,r

, where p, q and r

183

Bayro-Corrochano E. and Zamora-Esquivel J. (2007).

GEOMETRIC CONTROL OF A BINOCULAR HEAD.

In Proceedings of the Fourth International Conference on Informatics in Control, Automation and Robotics, pages 183-188

DOI: 10.5220/0001628001830188

 SciTePress

stand for the number of basis vector which squares to

1, -1 and 0 respectively and fulﬁll n = p+q+ r.

We will use e

to denote the vector basis i. In a Ge-

ometric algebra G

p,q,r

, the geometric product of two

basis vector is deﬁned as











1 for i = j ∈ 1, ··· , p

−1 for i = j ∈ p+ 1, ·· · , p+ q

0 for i = j ∈ p+ q+ 1, ·· · , p+ q+ r.

∧ e

for i 6= j

This leads to a basis for the entire algebra:

{1}, {e

}, {e

∧ e

}, {e

∧ e

}, . . . , {e

∧ e

∧ . . . ∧ e

} (1)

Any multivector can be expressed in terms of this

basis.

3 CONFORMAL GEOMETRY

Geometric algebra G

4,1

can be used to treat confor-

mal geometry in a very elegant way. To see how this

is possible, we follow the same formulation presented

in (H. Li, 2001) and show how the Euclidean vector

space R

is represented in R

4,1

. This space has an or-

thonormal vector basis given by {e

} and e

= e

∧ e

are bivectorial basis and e

, e

and e

correspond to

the Hamilton basis. The unit Euclidean pseudo-scalar

:= e

∧ e

, a pseudo-scalar I

:= I

E and the

bivector E := e

∧ e

= e

are used for computing

the inverse and duals of multivectors.

3.1 The Stereographic Projection

The conformal geometry is related to a stereographic

projection in Euclidean space. A stereographic pro-

jection is a mapping taking points lying on a hyper-

sphere to points lying on a hyperplane. In this case,

the projection plane passes through the equator and

the sphere is centered at the origin. To make a projec-

tion, a line is drawn from the north pole to each point

on the sphere and the intersection of this line with the

projection plane constitutes the stereographic projec-

tion.

For simplicity, we will illustrate the equivalence

between stereographic projections and conformal ge-

ometric algebra in R

. We will be working in R

2,1

with the basis vectors {e

, e

} having the usual

properties. The projection plane will be the x-axis

and the sphere will be a circle centered at the origin

with unitary radius.

Given a scalar x

representing a point on the x-

axis, we wish to ﬁnd the point x

lying on the circle

that projects to it (see Figure 1). The equation of the

line passing through the north pole and x

is given by

f(x) = −

x + 1 and the equation of the circle x

Figure 1: Stereographic projection for 1-D.

f(x)

= 1. Substituting the equation of the line on the

circle, we get the point of intersection x

which can be

represented in homogeneous coordinates as the vector

= 2

+ 1

− 1

+ 1

+ e

. (2)

From (2) we can infer the coordinates on the circle for

the point at inﬁnity as

∞

= lim

→∞

{

}

= e

+ e

, (3)

lim

→0

{

}

(−e

+ e

), (4)

Note that (2) can be rewritten to

= x

∞

+ e

, (5)

3.2 Spheres and Planes

The equation of a sphere of radius ρ centered at point

∈ R

can be written as (x

− p

)

= ρ

. Since

· y

= −

− y

)

and x

· p

= −

., we can

rewrite the formula above in terms of homogeneous

coordinates as. Since x

· e

∞

= −1 we can factor the

expression above to

· (p

−

∞

) = 0. (6)

Which ﬁnally yields the simpliﬁed equation for the

sphere as s = p

−

∞

. Note from this equation

that a point is just a sphere with zero radius. Alterna-

tively, the dual of the sphere is represented as 4-vector

∗

= sI

. The advantage of the dual form is that the

sphere can be directly computed from four points as

∗

= x

∧ x

. (7)

If we replace one of these points for the point at inﬁn-

ity we get the equation of a plane

∗

= x

∧ x

∧ e

∞

. (8)

So that π becomes in the standard form

π = I

∗

= n+ de

∞

(9)

Where n is the normal vector and d represents the

Hesse distance.

ICINCO 2007 - International Conference on Informatics in Control, Automation and Robotics

184

3.3 Circles and Lines

A circle z can be regarded as the intersection of two

spheres s

and s

as z = (s

∧ s

). The dual form of

the circle can be expressed by three points lying on it

∗

= x

∧ x

. (10)

Similar to the case of planes, lines can be deﬁned

by circles passing through the point at inﬁnity as:

∗

= x

∧ x

∧ e

∞

. (11)

The standard form of the line can be expressed by

L = l + e

∞

(t · l), (12)

the line in the standard form is a bivector, and it has

six parameters (Plucker coordinates).

4 RIGID TRANSFORMATIONS

We can express rigid transformations in conformal ge-

ometry carrying out reﬂections between planes.

4.0.1 Reﬂection

The reﬂection of conformal geometric entities help us

to do any other transformation. The reﬂection of a

point x respect to the plane π is equal x minus twice

the direct distance between the point and plane see

the image 2, that is x = x− 2(π·x)π

−1

to simplify this

expression recalling the property of Clifford product

of vectors 2(b· a) = ab+ ba.

Figure 2: Reﬂection of a point x respect to the plane π.

For any geometric entity Q, the reﬂection respect to

the plane π is given by

′

= πQπ

−1

(13)

4.0.2 Translation

The translation of conformal entities can be done by

carrying out two reﬂections in parallel planes π

and

see the image (3), that is

′

= (π

)

{z}

Q(π

−1

)

{z }

(14)

= (n+ de

∞

)n = 1+

∞

= e

−

∞

(15)

With a = 2dn.

Figure 3: Reﬂection about parallel planes.

4.0.3 Rotation

The rotation is the product of two reﬂections between

nonparallel planes, (see image (4))

Figure 4: Reﬂection about nonparallel planes.

′

= (π

)

{z}

Q(π

−1

)

{z }

(16)

Or computing the conformal product of the normals

of the planes.

= n

= Cos(

) − Sin(

)l = e

−

(17)

GEOMETRIC CONTROL OF A BINOCULAR HEAD

185

With l = n

∧ n

, and θ twice the angle between the

planes π

and π

. The screw motion called motor re-

lated to an arbitrary axis L is M = TR

′

= (TR

{z }

Q((T

T))

{z }

(18)

= TR

T = Cos(

) − Sin(

)L = e

−

(19)

4.1 Kinematic Chains

The direct kinematics for serial robot arms is a succes-

sion of motors as you can see in (Bayro-Corrochano

and Kahler, 2000) and it is valid for points, lines,

planes, circles and spheres.

′

∏

i=1

∏

i=1

n−i+1

(20)

5 DIRECT KINEMATICS OF A

PAN-TILT UNIT

We implement algorithm for the velocity control of a

pan-tilt unit (PTU Fig. 5) assuming three degree of

freedom. We consider the stereo depth as one virtual

D.O.F. thus the PTU has a similar kinematic behavior

as a robot with three D.O.F.

Figure 5: Pan tilt unit.

In order to carry out a velocity control, we need

ﬁrst to compute the direct kinematics, this is very easy

to do, as we know the axis lines:

= −e

(21)

= e

+ d

∞

(22)

= e

∞

(23)

Since M

= e

−

and

= e

, The position

of end effectors is computed as

(q) = x

′

= M

, (24)

The state variable representation of the system is

as follows











˙x

′

= x

′



′











y = x

′

(25)

where the position of end effector at home po-

sition x

is the conformal mapping of x

= d

+ d

(see eq. 5), the line L

′

is the current posi-

tion of L

and u

is the velocity of the i-junction of the

system. As L

is an axis at inﬁnity M

is a translator,

that is, the virtual component is a prismatic junction.

5.1 Exact Linearization via Feedback

Now the following state feedback control law is cho-

sen in order to get a new linear an controllable system.











′

· L

′

· L

′

· L

′



−1









(26)

WhereV = (v

, v

)

is the new input to the lin-

ear system, then we rewrite the equations of the sys-

tem



˙x

′

= V

y = x

′

(27)

5.2 Asymptotical Output Tracking

The problem of follow a constant reference x

solved computing the error between end effectors po-

sition x

′

and the target position x

as e

= (x

′

∧x

)·e

∞

the control law is then given by.

V = −ke (28)

This error is small if the control system is doing

it’s job, it is mapped to an error in the joint space using

the inverse Jacobian.

U = J

−1

V (29)

Doing the Jacobian J = x

′



′



= x

′

· (L

) (30)

= x

′

· (M

) (31)

= x

′

· (M

) (32)

ICINCO 2007 - International Conference on Informatics in Control, Automation and Robotics

186

Once that we have the Jacobian is easy to compute

the dq

using the crammer’s rule.









= ( j

∧ j

)

−1





V ∧ j

∧ j

∧V ∧ j

∧ j

∧V





(33)

This is possible because j

∧ j

= det(J)I

Finally we have dq

which will tend to reduce these

errors.

5.3 Visual Jacobian

A point in the image is given by s = (x, y)

whereas

a 3-D point is represented as X. The relationship be-

tween ˙s and

S is called visual Jacobian.

Taking a camera in general position his projection

matrix is represented by the planes π

, π

y π

details in (Hartley and Zisserman, 2000).

P =









, (34)

The point X is projected in the image in the point

s =

·X

(35)

To simplify the explanation the x variable is intro-

duced and his time derivative ˙x deﬁned as

x =





· X





˙x =









(36)

Now s is given by s

and his derivative

˙s

= ˙x

+ x



− ˙x



(37)

˙s

˙x

− x

˙x

(38)

By sustitution of x and ˙x in the equation 38 is obtained

˙s

= κ[(π

· X)π

− (π

· X)π

] ·

X (39)

˙s

= κ[X ·(π

∧ π

)] ·

X (40)

where κ =

. Doing the same steps for s

it possible

to write the equation

˙s = κX ·



∧ π



X (41)

Geometrically π

∧π

represents a line of intersection

of the planes π

and π

. Denoting by L

and L

the

lines of this intersection as

= π

∧ π

(42)

= π

∧ π

(43)

It is posible to rewrite 41 as

˙s = κX ·





X (44)

In order to close the loop between the perception

and action, the relationship between velocities in the

points of the image and the velocities in the joints of

the pan-tilt unit is computed.

Taking the equation of differential kinematics 25

and visual Jacobian 44 it is possible to write a new

expression

˙s = κ



′

· L

′

) · (X

′

· L

′

) (X

′

· L

′

) · (X

′

· L

′

)



′

· L

′



· (X

′

· L

′

)



′

· L

′



· (X

′

· L

′

)



˙q (45)

We can write a similar expression using the differen-

tial kinematics of the Barrett Hand. The equation 45

is very useful to design a control law to track an object

or to grasp it.

5.4 Exact Linearization via Feedback

Now the following state feedback control law is cho-

sen in order to get a new linear an controllable sys-

tem.







′

· L

′

) ·



′

· L

′



′

· L

′

) ·



′

· L

′





′

· L

′





′

· L

′

 

′

· L

′





′

· L

′





−1





Where V = (v

, v

)

is the new input to the linear

system, then we rewrite the equations of the system



˙s

′

= V

y = s

′

(46)

5.5 Experimental Results

In this experiment the binocular head should

smoothly track a target. The ﬁgure (6) show the 3D

coordinates of the focus of attention. The ﬁgure (7)

show examples of the image sequence. We can see

that the curves of the 3D object trajectory are very

rough, however the control rule manages to keep the

trajectory of the pan-tilt unit smooth.

In the experiment the coordinate system is in the

center of the camera. Then the principal planes of the

camera are given by

= f

+ x

(47)

= f

+ y

(48)

= e

(49)

whre f

, f

, x

y y

are the camera’s parameters. Us-

ing this planes we compute the lines L

y L

, by the

way the axis of the pan-tilt are known.

= e

+ d

(50)

= e

+ d

(51)

GEOMETRIC CONTROL OF A BINOCULAR HEAD

187

0 50 100 150 200 250

−30

−25

−20

−15

−10

−5

x−PTU

x−Object

0 50 100 150 200 250

−15

−10

−5

y−Object

y−PTU

0 50 100 150 200 250

−20

−15

−10

−5

z−Object

z−PTU

Figure 6: x, y and z coordinate of the focus of attention.

Figure 7: Sequence of tracking.

Note that the tilt axis is called L

and the pan axis

is L

, because the coordinate system is in the camera.

also L

′

is a function of the tilt angle L

′

= M

with M

= cos(θ

tilt

)+sen(θ

tilt

. In this experiment

a point over the boar was selected and using the KLT

algorithm was tracked the displacement in the image

is transformed to velocities of the pan-tilt’s joint using

the visual-mechanical Jacobean Eq. 45.

As result in the image (8) we can see a sequence

of pictures captured by the robot. In these images the

position of the board do not change while the back-

ground is in continuous change.

Figure 8: Sequence of tracking.

6 CONCLUSION

The authors show that is possible and easy to write

a control law using the lines of the camera’s axes as

bivectors (Plcker coordinates) in the conformal geom-

etry instead of the interaction matrix. This formula-

tion combines the information of the camera’s param-

eters with the axes of the pan tilt unit in order to create

a matrix of the visual-mechanical Jacobian useful to

write a velocity control law. The experiments conﬁrm

the effectiveness of our approach.

REFERENCES

Bayro-Corrochano, E. and Kahler, D. (2000). Motor algebra

approach for computing the kinematics of robot ma-

nipulators. In Journal of Robotic Systems. 17(9):495-

516.

C. Canudas de Wit, B. S. and Bastin, G. (1996). Theory of

Robot Control. Springer, Berlin, 1st edition.

H. Li, D. Hestenes, A. R. (2001). Generalized Homoge-

neous coordinates for computational geometry. pages

27-52, in (Somer, 2001).

Hartley and Zisserman, A. (2000). Multiple View Geometry

in Computer Vision. Cambridge University Press, UK,

1st edition.

Kim Jung-Ha., K. V. R. (1990). Kinematics of robot manip-

ulators via line transformations. In Journal of Robotic

Systems. pp. 647-674.

Ruf, A. (2000). Closing the loop between articulated mo-

tion and stereo vision: a projective approach. In PhD.

Thesis, INP, GRENOBLE.

Somer, G. (2001). Geometric Computing with Clifford Al-

gebras. Springer-Verlag, Heidelberg, 2nd edition.

ICINCO 2007 - International Conference on Informatics in Control, Automation and Robotics

188