loading
Documents

Research.Publish.Connect.

Paper

Authors: Birhanu Belay 1 ; Tewodros Habtegebrial 2 ; Gebeyehu Belay 3 and Didier Stricker 4

Affiliations: 1 Technical University of Kaiserslautern, Kaiserslautern, Germany, Bahir Dar Institute of Technology, Bahir Dar, Ethiopia ; 2 Technical University of Kaiserslautern, Kaiserslautern, Germany ; 3 Bahir Dar Institute of Technology, Bahir Dar, Ethiopia ; 4 Technical University of Kaiserslautern, Kaiserslautern, Germany, DFKI, Augmented Vision Department, Kaiserslautern, Germany

ISBN: 978-989-758-397-1

ISSN: 2184-4313

Keyword(s): Amharic Document Image, Automatic Feature, Binary SVM, CNN, Handwritten, Machine Printed, OCR, Pattern Recognition.

Abstract: In many documents, ranging from historical to modern archived documents, handwritten and machine printed texts may coexist in the same document image, raising significant issues within the recognition process and affects the performance of OCR application. It is, therefore, necessary to discriminate the two types of texts so that it becomes possible to apply the desired recognition techniques. Inspired by the recent successes CNN based features on pattern recognition, in this paper, we propose a method that can discriminate handwritten from machine printed text-lines in Amharic document image. In addition, we also demonstrate the effect of replacing the last fully connected layer with a binary support vector machine which minimizes a margin-based loss instead of the cross-entropy loss. Based on the results observed during experimentation, using Binary SVM gives significant discrimination performance compared to the fully connected layers.

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.237.94.109

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Belay, B.; Habtegebrial, T.; Belay, G. and Stricker, D. (2020). Using Automatic Features for Text-image Classification in Amharic Documents.In Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-397-1, ISSN 2184-4313, pages 440-445. DOI: 10.5220/0008940704400445

@conference{icpram20,
author={Birhanu Belay. and Tewodros Habtegebrial. and Gebeyehu Belay. and Didier Stricker.},
title={Using Automatic Features for Text-image Classification in Amharic Documents},
booktitle={Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2020},
pages={440-445},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008940704400445},
isbn={978-989-758-397-1},
}

TY - CONF

JO - Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Using Automatic Features for Text-image Classification in Amharic Documents
SN - 978-989-758-397-1
AU - Belay, B.
AU - Habtegebrial, T.
AU - Belay, G.
AU - Stricker, D.
PY - 2020
SP - 440
EP - 445
DO - 10.5220/0008940704400445

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.