annotation techniques. The proposed technique show 
a good improvement although must be said that a 
straight comparison is impossible due to the 
different adopted features and the number of classes. 
Table 3: Comparision of proposed technique with state of 
art annotation techniques. 
 
TM 
CMR
M 
ME MBRM 
Propo
sed 
Tech. 
Prec 0.06 0.10 0.09  0.24  0.36 
Recall 0.04  0.09  0.12  0.25  0.36 
6  CONCLUSION AND FUTURE 
WORKS 
Image annotation needs to exploit information from 
different orthogonal features to capture the visual 
elements carrying a symbolic meaning matched with 
the text labels. 
The shown techniques use information from 
different features and merge together visual 
information represented in term of scores related to 
different labels. Different information fusion 
techniques have been compared showing that, for 
this application, the weighted sum of g-scores 
produces better results than other fusion techniques.  
The information fusion produced putting a HL 
LDF to summarize the results of the first stage 
LDFs, allows an improvement in performance when 
the characterization of input images, through g-units 
scores, is adherent to their content. Decision trees 
have a reduced utility in this case mainly due to the 
reduced generalization capability. 
Further investigations will be focused on the 
training of the images in terms of more specific 
classes or sub-classes that despite a reduced number 
of samples for each category are more specific as 
content. The application of more complex models 
instead of LDF can also allow capturing the positive 
and negative classes in a more flexible way and 
allow a better performance for fusion algorithms. 
ACKNOLEDGEMENTS 
Authors would like to thank Kobus Barnard and 
Shen Gao for their help with the images data set and 
Rulequest company for the evaluation version of the 
See5/C5.0 software for decision trees building. 
REFERENCES 
Barnard K., Duygulu P., Forsyth D., de Freitas N., Blei D., 
Jordan. M., 2003, “Matching words and pictures”, 
Journal of Machine Learning Research, Vol.3, pp 
1107-1135. 
Bellegarda J.-R., 2000, “Exploiting latent semantic 
information in statistical language modelling”, Proc. 
of the IEEE, Vol. 88, No. 8, pp 1279-1296. 
Blei D., Jordan M.-I., 2003, “Modeling annotated data”, 
ACM SIGIR. 
Carbonetto P., de Freitas N., Barnard K., 2004, “A 
statistical model for general contextual object 
recognition”, Proc. of ECCV. 
Duygulu P., de Freitas N., Barnard K., Forsyth D., 2002, 
“Object recognition as machine translation: Learning a 
lexicon for a fixed vocabulary”, Proc. of ECCV. 
Feng S.-L., Manmatha R., Lavrenko V., 2004, “Multiple 
Bernoulli relevance models for image and video 
annotation,” , Proc of CVPR’04. 
Gao S., Wang D.-H., Lee C.-H., 2006, “Automatic Image 
Annotation through Multi-Topic Text Categorization”, 
Proc. of ICASSP. 
Gao S., Wu W., Lee C.-H., Chua T.-S. , 2004, “A MFoM 
learning approach to robust multiclass multi-label text 
categorization”, Proc. of ICML. 
He X.-M., Zemel R. S., Carreira-Perpiñán M. A., 2004, 
“Multiscale conditional random fields for automatic 
image annotation”, Proc. of CVPR 
Jeon J., Manmatha R., 2004., “Using maximum entropy 
for automatic image annotation”, Proc of ICVR. 
Jeon J., Manmatha R., 2003, “Automatic image annotation 
and retrieval using cross-media relevance models”, 
ACM SIGIR. 
Landgrebe T.C.W., Paclik P., Duin R.P.W., Bradley A.P., 
2006, "Precision-recall operating characteristic (P-
ROC) curves in imprecise environments", Proc. of the 
18th Int. Conf. on Pattern Recognition  
Linde Y., Buzo A., Gray R., 1980. “An Algorithm for 
Vector Quantizer Design”. IEEE Transaction on 
Communications, vol. 28 (1), pp 84–94. 
Mitchell T.M., 1997, Machine Learning, McGrawHill 
Mori Y, Takahashi H., Oka R., 1999, Image-to-word 
transformation based on dividing and vector 
quantizing images with words, In Proc of MISRM'99  
Quinlan J.R., 2006, Data Mining Tools See5 and C5.0, 
from Rule Quest web site: www.rulequest.com/ see5-
info.html 
Salton G., 1971, The SMART Retrieval System, Prentice-
Hall, Englewood Cliffs, NJ 
Sebastiani F., 2002, “Machine Learning in Automated 
Text Categorization”, ACM Computer Surveys, Vol. 
34, No. 1, pp 1-47. 
Wang D.-H, Gao S., Tian Q., Sung W.-K, 2005, 
“Discriminative fusion approach for automatic image 
annotation”,  Proc. of IEEE 7
th
 Workshop on 
Multimedia Signal Processing 
INFORMATION FUSION TECHNIQUES FOR AUTOMATIC IMAGE ANNOTATION
67