A Multiscale Circum-ellipse Area Representation for Planar Shape

Retrieval

Taha Faidi

1,2

, Faten Chaieb

and Faouzi Ghorbel

Cristal Laboratory, ENSI, Manouba University, 2010, Manouba, Tunisia

Research and Studies Telecommunications Center (CERT), Technology Parc, Ariana, Tunisia

{taha.faidi, faten.chaieb}@ensi-uma.tn, faouzi.ghorbel@ensi.rnu.tn

Keywords:

Shape Signature, Multiscale Representation, Afﬁne Invariant, Triangle Circumscribed Ellipse.

Abstract:

In this paper, we propose a new Multiscale Circum-ellipse Area Representation (MCAR) for planar contours.

The proposed representation deals with a multiscale shape signature deﬁned from the local area delimited by

the circumscribed ellipse of the triangle formed by three contour points and the contour. This shape signature

describes, at each scale level, the concavity/convexity at each contour point. Then, Fourier descriptors are

obtained by applying Fourier transform to the proposed multiscale signature. Thus, the proposed MCAR based

Fourier Descriptors handle the local and global shape characteristics. Furthermore, it is invariant to afﬁne

transformation and robust to local deformations. The performance of our proposed method was evaluated

through the precision recall and bull’s-eye tests on the two well-known databases (MCD and MPEG7-setB).

Obtained results indicate that our method outperforms the shape signatures based Fourier descriptor proposed

in the literature.

1 INTRODUCTION

Shape representation and description of planar ob-

jects, which are subjected to certain viewpoint vari-

ation and partially occultation, is widely considered

as a fundamental subject in many applications of pat-

tern recognition and computer vision, such as robotic

vision, content-based image retrieval, and pose esti-

mation.

Deformations induced by capturing a planar object

from the real space in different viewpoint is often ap-

proximated by an afﬁne transformation when the ob-

ject is far away from the camera. Thus, a shape de-

scriptor should be invariant under afﬁne transforma-

tions which includes scaling, changes in orientation,

shearing and translation.

A variety of shape descriptors have been proposed in

the literature during the last decades that can be di-

vided in two main classes: contour based-techniques

and region based techniques.

In region based technique, all the pixel within a shape

are used to derive the shape representation, but only

the boundary points are used to obtain the contour

based shape representation technique.

Common region-based shape descriptors are,

moment based techniques including geometric,

Zernike, pseudo Zernike and Legendre moments (Hu,

1962; Lin and Chou, 2003), Angular radial transform

(ART) (Bober, 2001), shape matrix (Bober, 2001)

and generic Fourier descriptor.

In recent years, several contour based-shape descrip-

tors have been proposed in the literature due to its

good performance in different applications.

Fourier descriptors is a promising contour based

approach for shape retrieval. In general, the planar

contour is ﬁrstly converted to a periodic 1-D signa-

ture and followed by the application of the Fourier

transform. Many signatures have been proposed as

a Fourier descriptor in the literature (T.Zahn, 1972;

D.S.Zhang, 2005; I.Kunttu, 2007). Some of them

are, the complex coordinates (CC), the radial distance

(RD), the triangular centroid area (TCA),Angular ra-

dial coordinates (ARC) and the farthest point distance

(FPD)(A.El-Ghazal, 2007). Most of Fourier descrip-

tor are based on the magnitude of the Fourier trans-

form and ignore the phase information in order to

make descriptor invariant to rotation and independent

to starting point. To maintain the phase informa-

tion,(Bartolini et al., 2005) have proposed a Fourier

descriptor by using the magnitude and the phases of

Fourier transform. In (F.Chaker, 2003), an afﬁne

and complete based Fourier Descriptors has been pro-

398

Faidi T., Chaieb F. and Ghorbel F.

A Multiscale Circum-ellipse Area Representation for Planar Shape Retrieval.

DOI: 10.5220/0006172803980404

In Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2017), pages 398-404

ISBN: 978-989-758-225-7

posed.

Multiscale approaches are widely studied in the re-

cent years and they are based on a natural representa-

tion of shape information at multiple level of details.

In general multiscale descriptors could be classiﬁed

in two main categories: Those based on a multiscale

geometric contour representation and those based on

a multiscale signature deﬁnition. In the ﬁrst category,

the contour is gradually smoothed and a shape sig-

nature is obtained from each contour scale. We can

cite the well-known MPEG 7 shape descriptor, called

Curvature scale space descriptor (CSS) (Abbasi et al.,

1999; Abbasi et al., 2000) where the initial contour is

gradually smoothed with an increasing value of Gaus-

sian kernel, and the shape signature is computed by

the ranking of the inﬂexion points at different scales.

Similar to the CSS, multiscale concavity/convexity

(MCC) (Adamek T., 2004), represent the shape con-

tour by the degree of the concavity/convexity between

two consecutive Gaussian contour scales. This rep-

resentation is invariant to Euclidian transformation

and robust to partial occultation but suffers from the

high time complexity needed to solve shape match-

ing problem. The Multi-resolution afﬁne invariant

Fourier descriptor (FD-APS) proposed in (T.Faidi,

2015), is based on a multi resolution representation

of the contour. For each contour resolution, the

shape signature is computed as the area of the triangle

formed by the centroid points of the original contour

and two given points from respectively original con-

tour and the contour at a given scale. This representa-

tion is invariant to afﬁne transformation and robust to

small occultation.

In the second approach, the descriptor is derived from

a multiscale deﬁnition of the signature such as the

Triangular area representation (TAR) (Alajlan et al.,

2007) and the multiscale contour ﬂexibility based

Fourier descriptor(Xin Shu, 2015). The Triangular

area representation (TAR) deﬁne the multiscale

representation by a progressive triangle side length

at each point and take the area of each triangle as a

shape signature. The contour ﬂexibility concept is

ﬁrstly introduced in (C.Xu, 2009) where an interior

and exterior neighboring regions centered to a land-

marks points in the contour are used to measure the

contour ﬂexibility by an Euclidean distance function.

Recently, Xin Shu and al are extending ﬂexibility

signature to a multiscale approach by taking different

level of bendable parameter in order to describe the

shape by a local and global representation (C.Xu,

2009). This representation it can reﬂect how ex-

tensively the neighborhood of a given point in the

contour are connected to the main body but it is not

invariant to afﬁne transformation.

In this work we propose a new multiscale shape de-

scriptor, denoted MCAR, derived from a multiscale

signature deﬁnition based on the area of triangle

circum-ellipse. Furthermore, the proposed descriptor

is by deﬁnition invariant to afﬁne transformations.

This paper is organized as follows: Section 2 in-

troduces the proposed multiscale shape signature

(MCAR). Section 3 describes the MCAR based

Fourier Descriptors obtained by applying Fourier

transform to the proposed multiscale signature. In

Section 4, experimental results are presented to eval-

uate our descriptor and compare it to state of the art

methods.

2 MULTISCALE

CIRCUM-ELLIPSE SHAPE

SIGNATURE

Let Γ = (x(t), y(t)) be a closed contour of a 2D planar

shape.

Figure 1: MCAR shape descriptor bloc diagram.

In ﬁgure1 we show the main steps needed to com-

pute our proposed multiscale shape signature. First,

the contour Γ is normalized by translating the con-

tour centroid to the origin of the 2D coordinate sys-

tem. Furthermore, the initial contour parametrization

would not be necessary the same for different views.

The descriptors computed from two different parame-

terizations of the same geometric curve are generally

different. This is due to parametrization dependance

on transformations. One solution to this problem con-

sist in performing a G-invariant reparametrization of

the curve where G is the considered geometric trans-

formations group.

In the case of afﬁne transformations group, we carry

out a reparametrization by the normalized afﬁne arc-

length deﬁned as:

¯s

(t) =

(||det(γ

(t), γ

(t))||)

dt, t ∈[0, T ].

(1)

where L

is the curve afﬁne length.

A Multiscale Circum-ellipse Area Representation for Planar Shape Retrieval

399

Finally, a multiscale shape signature is computed.

This latter is detailed in the next section.

2.1 Formulation

In this section, we denote by {P

= {x

, y

}}

i=1···N

the discrete afﬁne arc-length reparametrization of Γ

where N is the number of obtained contour points.

Let E

n,t

, the circumscribed ellipse of the triangle

n,t

= P

n−t

n+t

where P

n−t

= (x

n−t

, y

n−t

= (x

, y

) and P

n+t

= (x

n+t

, y

n+t

) are three

consecutive contour points and t

is the triangle side

length (See ﬁgure 2. a).

The multiscale signature at scale t

denoted by

W (n, t

) is deﬁned according to the intersection be-

tween the arc γ

n,t

n−ts

n+t

of the ellipse E

n,t

and

the shape contour. Two cases can occur:

• case1 (See ﬁgure 2.b): γ

n,t

γ =

The shape signature at scale t

is deﬁned as

the area of the region delimited by γ

n,t

and the

segment [P

n−t

n+t

]. It’s given by:

W (n, t

) =

4π

√

Area(4

n,t

) (2)

• case 2 (See ﬁgure 2.c): γ

n,t

γ = {p

}

i=1···m

The shape signature at scale t

is deﬁned as the

sum of the area of the regions {S

n,t

}

i=1···m

de-

limited by the elliptic arc \p

i+1

and the segment

i+1

]. It’s given by the following equation:

W (n, t

) =

∑

i=0

Area(S

n,t

) (3)

2.2 Properties

Afﬁne Invariance. The proposed shape descriptor

is invariant to afﬁne transformations. Given a three

points A, B and C from the contour and A

, B

, C

their

image by an afﬁne transformation. If E is the circum-

scribed ellipse of the triangle 4 = ABC then the cir-

cumscribed ellipse E

of the triangle 4= A

exist

and it’s the image of E by the same afﬁne transforma-

tion. This property is veriﬁed due to the following

claims :

• The afﬁne transformation maps an ellipse to an

ellipse and preserves the intersection between

curves.

• The uniqueness of the triangle circumscribed el-

lipse.

(a) E

: Circumscribed

ellipse

(b) W (n, t

) : Shape signa-

ture (case 1)

) : Shape signa-

ture (case 2)

Figure 2: Shape signature steps.

Discrimination Property. Figure3 shows the

MCAR shape signature of three shapes from two

different classes (ﬁgures 3 (a), (c) and (e)). We can

see that their corresponding shape signatures (ﬁgures

3(b), (d) and (f)) are similar for the same class shapes

(camel shape) and dissimilar for those from different

classes (camel and star shape).

Relation Between MCAR Signature and Contour

Concavity/Convexity. Figure 4 shows the signature

at different scales. We can notice that at the scales

(c, d, e and f) all the concavity/convexity parts of the

contours are described by the signature and only dom-

inant features persist over the scales (f and g).

3 MULTISCALE FOURIER

SHAPE DESCRIPTOR

Fourier Descriptors of the multiscale signature are

computed. The discrete Fourier transform of the

signature W (t, t

) is given by:

(ts)

N−1

∑

t=0

W (t, t

)exp(

−j 2πnt

), n = 0, ··· , N −1

. A Fourier descriptor of the signature W(t, t

) are

derived from the Fourier coefﬁcients a

)

as follows:

VISAPP 2017 - International Conference on Computer Vision Theory and Applications

400

(a) (b)

(e) (f)

Figure 3: MCAR Signature of For 3 shapes at scale ts=40.

(a) Shape boundary (b) Concavity/convexity at the

scale:ts=25

(f) ts=20 (g) ts=25

Figure 4: MCAR Signature at different scales

ts={5,10,15,20,25}.

)

= DF(W (t, t

)) =

(

)

n=1···p

, (4)

where p is the number of Fourier coefﬁcients.

Therefore, the proposed descriptor is deﬁned by

}

k=1···N

}

=1···T

= {DF

)

}

=1···T

, (5)

where T

is the number of scales. In this work, we

consider

scales.

The similarity measure used to compare two shape

contours Γ1 and Γ2 is formalized as follows:

d(Γ1, Γ2) =

∑

k=1

kJ1

−J2

(6)

where T

is the number of contour scales and k.k

the L

norm.

Since the Fourier Descriptors of the signature are in-

variant to rotation and starting point, the proposed

descriptor is invariant to afﬁne transformations and

starting point.

4 EXPERIMENTAL RESULTS

In this section, the performance of our proposed

method is shown using two standard shape datasets

MPEG7-setB and MCD.

Also, it’s compared with some commonly used sig-

natures such as Multiscale Contour ﬂexibility shape

signature for Fourier descriptor, Curvature scale

space(CSS), Perimeter area function(PAF) and the

Multi-resolution Afﬁne Invariant Planar Contour De-

scriptor(DFAP).

4.1 Datasets Description

The well-known MPEG-7 setB dataset (ﬁgure5), con-

sists of 1400 images classiﬁed into 70 classes. It’s

used for similarity-based retrieval accuracy and shape

descriptors robustness under various arbitrary shape

distortions, that include rotation, scaling, arbitrary

skew, stretching, defection, and indentation. The

Multiview Curve dataset (ﬁgure 6) is composed of

40 shape classes selected from MPEG-7 database,

where Each class contains 14 shape samples that cor-

respond to different perspective examples of the orig-

inal curve. This dataset is used in order to evaluate

shape descriptors under afﬁne transformation.

4.2 Performance Evaluation

In order to evaluate the performance of our shape de-

scriptor in the context of image retrieval, we deal with

A Multiscale Circum-ellipse Area Representation for Planar Shape Retrieval

401

(a) (b)

Figure 5: MPEG7 setB dataset.

(a) (b)

Figure 6: MCD dataset.

the precision-recall and the bull’s-eye test which are

the most commonly used test measures.

The precision is deﬁned as the number of relevant

shapes retrieved divided by the total number of shape

retrieved, while the recall is deﬁned as the number of

relevant shapes retrieved divided by the total number

of relevant shapes in the class. The average of the pre-

cision and recall over all the database is used to plot

the precision-recall curve. In the case of the Bull’s-

eye test each shape from the database is used as a

query, if the retrieved shape is in the same class as the

query one then it is considered as a correct response.

The number of correct retrievals in the top 2M (where

M is the size of the class) ranks is counted, includ-

ing the query. Retrieval rate is the percentage of the

maximal possible number of correct retrievals shapes.

In order to be in the same conditions when comparing

our approach with other methods in the MPEG-7 Set

B data set, 128 points are sampled from each contour.

The bulls-eye test results obtained for the the

MPEG7-SetB database are shown in table 1 and

indicate that our descriptor outperforms Fourier

based shape signatures proposed in the literature such

as Fourier Decriptors of TAR, multiscale contour

ﬂexibility descriptors and the DFAP proposed in

(T.Faidi, 2015). The MCAR signature’s performance

is equal to 71.41%. In fact, this is tied to the the-

oretical invariance property of our descriptor under

afﬁne transformations and it’s capability to perfectly

capture the local and global information thanks to the

multiscale technique.

However, the CSS outperforms our descriptor in term

of bulls-eye test. It’s important to note that this de-

scriptor requires high complexity time to solve shape

matching problem. Our proposed descriptor is easy to

compute and uses a simple L

distance to measure the

distance between two shape signatures.

Table 1: The Bull-eyes test for our descriptor and other sig-

natures using MPEG-7 database setB.

Shape descriptor Bull-eyes

CSS 75.44

PAF+CD 74.36%

MCAR (proposed) 71.41%

TAR + Fourier descriptor 68.67%

Multiscale Contour ﬂexibility

Fourier descriptor (W+)

67.57%

PAF 66.49 %

DFAP 66.46%

Multiscale Contour ﬂexibility

Fourier descriptor (W+)

65.39%

Figure 7 shows ten random retrieval results from the

MCD database based on the proposed MCAR de-

scriptor. Incorrect responses are obtained for only one

query at the 8, 11, 12 and 13 ranks.

In ﬁgure 8, we show the average precision and recall

curves of MCD database for different shape descrip-

tors : Fourier Descriptors of TAR, DFPAS Descriptor

and the CSS descriptor. Our proposed descriptor out-

performs the others.

Figure 8: Average precision and recall of MCD database

retrieval.

5 CONCLUSION

In this paper, a closed-boundary multiscale shape sig-

nature invariant to afﬁne transformation has been in-

troduced. It is deﬁned from the local area delimited

by the circumscribed ellipse of the triangle formed by

three contour points and the contour. At each scale

level, the proposed shape signature, denoted MCAR,

depicts the contour point concavity/convexity. Then,

Fourier descriptors of the MCAR signature are gen-

erated. The multiscale approach combined with the

VISAPP 2017 - International Conference on Computer Vision Theory and Applications

402

Query Retrieved Results

Figure 7: 10 random retrieval results from MCD database.

Fourier Descriptors captures both global and local ge-

ometric characteristics of the shape. Furthermore,

it is invariant to afﬁne transformation and robust to

partial occlusion. The performance of our proposed

method was evaluated through the precision recall

and bull’s-eye tests on the two well-known databases

(MCD and MPEG7-setB). Obtained results indicate

that our method outperforms the shape signatures

based Fourier descriptor proposed in the literature.

ACKNOWLEDGEMENTS

This work is carried out as part of a MOBIDOC the-

sis ﬁnanced by the European Union under the PASRI

program in partnership with the Research and Studies

Telecommunications Center CERT

and Cristal

lab-

oratory and was supported by the PHC Utique pro-

gram for the CMCU DEFI project (N

16G 1403).

REFERENCES

Abbasi, S., Mokhtarian, F., and Kittler, J. (1999). Curvature

scale space image in shape similarity retrieval. Multi-

media Syst, 7(6):467–476.

Abbasi, S., Mokhtarian, F., and Kittler, J. (2000). Enhanc-

ing css-based shape retrieval for objects with shallow

concavities. Image and Vision Computing, 18(3):199–

211.

Adamek T., O. N. (2004). A multiscale representation

method for nonrigid shapes with a single closed con-

http://www.pasri.tn/

http://www.cert.nat.tn/

http://www.arts-pi.org.tn/cristal.php/

tour. IEEE TRANSACTIONS, Circuit Syst,Video Tech-

nol, 14:742–753.

A.El-Ghazal, O.Basir, S. (2007). A new shape signature

for fourier descriptors. 14 th IEEE International Con-

ference on Image Processing SanAntonio,Texas,USA,

pages 161–164.

Alajlan, N., Rube, I., M.S.Kamel, and G.Freeman (2007).

Shape retrieval using triangle-area representation

and dynamic space warping. Pattern Recognition,

40(7):1911–1920.

Bartolini, I., Ciaccia, P., and Patella, M. (2005). Warp: ac-

curate retrieval of shapes using phase of fourier de-

scriptors and time warping distance. Pattern Analy-

sis and Machine Intelligence, IEEE Transactions on,

27(1):142–147.

Bober, M. (2001). Mpeg-7 visual shape descriptors. IEEE

Trans. Circuits and Systems for Video Technology,

11(6):716–719.

C.Xu, J.Liu, X. (2009). 2d shape matching by contour ﬂex-

ibility. IEEE TRANSACTIONS ON PATTERN ANAL-

YSIS AND MACHINE INTELLIGENCE, 31.

D.S.Zhang, G. (2005). Study and evaluation of different

fourier methods for image retrieval. Image and Vision

Computing, 23:33–49.

F.Chaker, M.Bannour, F. (2003). A complete and stable

set of afﬁne-invariant fourier descriptors. 12th In-

ternational Conference on Image Analysis and Pro-

cessing, pages 578–581.

Hu, M.-K. (1962). Visual pattern recognition by moment

invariants. Information Theory, IRE Transactions on,

8(2):179–187.

I.Kunttu, L. (2007). Shape-based retrieval of industrial sur-

face defects using angular radius fourier descriptor.

IET Image Processing, 1:231–236.

Lin, T.-W. and Chou, Y.-F. (2003). A comparative study of

zernike moments. In Web Intelligence, 2003. WI 2003.

Proceedings. IEEE/WIC International Conference on,

pages 516–519.

T.Faidi, F.Chaieb, F. (2015). A new multi-resolution afﬁne

A Multiscale Circum-ellipse Area Representation for Planar Shape Retrieval

403

invariant planar contour descriptor. Image Analysis

and Processing-ICIAP 2015, 9280:494–509.

T.Zahn, R. (1972). Fourier descriptors for plane closed

curves. IEEE Transactions on Computers, 21:269–

281.

Xin Shu, Lei Pan, X.-J. W. (2015). Multi-scale contour ﬂex-

ibility shape signature for fourier descriptor. Journal

of Visual Communication and Image Representation,

26:161167.

VISAPP 2017 - International Conference on Computer Vision Theory and Applications

404