Authors:
Obeida ElJundi ¹; Mohamad Dhaybi ¹; Kotaiba Mokadam ²; Hazem Hajj ¹ and Daniel Asmar ³
Affiliations:
¹ American University of Beirut, Electrical and Computer Engineering Department, Lebanon
² American University of Beirut, Civil and Environmental Engineering Department, Lebanon
³ American University of Beirut, Mechanical Engineering Department, Lebanon
Keyword(s):
Deep Learning, Computer Vision, Natural Language Processing, Image Captioning, Arabic.
Abstract:
Image Captioning (IC) is the process of automatically augmenting an image with semantically laden descriptive text. While English IC has made remarkable strides forward in the past decade, very little work exists on IC for other languages. One possible solution to this problem is to bootstrap off existing English IC systems for image understanding, and then translate the outcome into the required language. Unfortunately, as this paper will show, translated IC falls short because errors accumulate across the two tasks: IC and translation. In this paper, we address the problem of image captioning in Arabic. We propose an end-to-end model that directly transcribes images into Arabic text. Because Arabic resources are scarce, we develop an annotated dataset for Arabic image captioning (AIC). We also develop a baseline model for AIC that relies on translating English image captions. The two models are evaluated on the new dataset, and the results show the superiority of our end-to-end model.