loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Birhanu Belay 1 ; 2 ; Tewodros Habtegebrial 2 ; Gebeyehu Belay 1 and Didier Stricker 3 ; 2

Affiliations: 1 Bahir Dar Institute of Technology, Bahir Dar, Ethiopia ; 2 Technical University of Kaiserslautern, Kaiserslautern, Germany ; 3 DFKI, Augmented Vision Department, Kaiserslautern, Germany

Keyword(s): Amharic Document Image, Automatic Feature, Binary SVM, CNN, Handwritten, Machine Printed, OCR, Pattern Recognition.

Abstract: In many documents, ranging from historical to modern archived documents, handwritten and machine printed texts may coexist in the same document image, raising significant issues within the recognition process and affects the performance of OCR application. It is, therefore, necessary to discriminate the two types of texts so that it becomes possible to apply the desired recognition techniques. Inspired by the recent successes CNN based features on pattern recognition, in this paper, we propose a method that can discriminate handwritten from machine printed text-lines in Amharic document image. In addition, we also demonstrate the effect of replacing the last fully connected layer with a binary support vector machine which minimizes a margin-based loss instead of the cross-entropy loss. Based on the results observed during experimentation, using Binary SVM gives significant discrimination performance compared to the fully connected layers.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.149.252.37

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Belay, B.; Habtegebrial, T.; Belay, G. and Stricker, D. (2020). Using Automatic Features for Text-image Classification in Amharic Documents. In Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-397-1; ISSN 2184-4313, SciTePress, pages 440-445. DOI: 10.5220/0008940704400445

@conference{icpram20,
author={Birhanu Belay. and Tewodros Habtegebrial. and Gebeyehu Belay. and Didier Stricker.},
title={Using Automatic Features for Text-image Classification in Amharic Documents},
booktitle={Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - ICPRAM},
year={2020},
pages={440-445},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008940704400445},
isbn={978-989-758-397-1},
issn={2184-4313},
}

TY - CONF

JO - Proceedings of the 9th International Conference on Pattern Recognition Applications and Methods - ICPRAM
TI - Using Automatic Features for Text-image Classification in Amharic Documents
SN - 978-989-758-397-1
IS - 2184-4313
AU - Belay, B.
AU - Habtegebrial, T.
AU - Belay, G.
AU - Stricker, D.
PY - 2020
SP - 440
EP - 445
DO - 10.5220/0008940704400445
PB - SciTePress