loading
Documents

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Yaakov Hacohen-Kerner 1 ; Nadav Schweitzer 2 and Yaakov Shoham 1

Affiliations: 1 Jerusalem College of Technology, Israel ; 2 Bar-Ilan University, Israel

ISBN: 978-989-8425-28-7

Keyword(s): Hebrew-Aramaic Texts, Information Retrieval, Quotation Identification.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Clustering and Classification Methods ; Information Extraction ; Knowledge Discovery and Information Retrieval ; Knowledge-Based Systems ; Symbolic Systems

Abstract: Quotations in a text document contain important information about the content, the context, the sources that the author uses, their importance and impact. Therefore, automatic identification of quotations from documents is an important task. Quotations included in rabbinic literature are difficult to identify and to extract for various reasons. The aim of this research is to automatically identify Biblical quotations included in rabbinic documents written in Hebrew-Aramaic. We deal with various kinds of quotations: partial, missing and incorrect. We formulate nineteen features to identify these quotations. These features were divided into seven different feature sets: matches, best matches, sums of weights, weighted averages, weighted medians, common words, and quotation indicators. Several features are novel. Experiments on various combinations of these features were performed using four common machine learning methods. A combination of 17 features using J48 (an improved version of C 4.5) achieves an accuracy of 91.2%, which is an improvement of about 8% compared to a baseline result. (More)

PDF ImageFull Text

Download
CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 34.239.158.107

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Hacohen-Kerner, Y.; Schweitzer, N. and Shoham, Y. (2010). AUTOMATIC IDENTIFICATION OF BIBLICAL QUOTATIONS IN HEBREW-ARAMAIC DOCUMENTS.In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010) ISBN 978-989-8425-28-7, pages 320-325. DOI: 10.5220/0003106703200325

@conference{kdir10,
author={Yaakov Hacohen{-}Kerner. and Nadav Schweitzer. and Yaakov Shoham.},
title={AUTOMATIC IDENTIFICATION OF BIBLICAL QUOTATIONS IN HEBREW-ARAMAIC DOCUMENTS},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)},
year={2010},
pages={320-325},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003106703200325},
isbn={978-989-8425-28-7},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2010)
TI - AUTOMATIC IDENTIFICATION OF BIBLICAL QUOTATIONS IN HEBREW-ARAMAIC DOCUMENTS
SN - 978-989-8425-28-7
AU - Hacohen-Kerner, Y.
AU - Schweitzer, N.
AU - Shoham, Y.
PY - 2010
SP - 320
EP - 325
DO - 10.5220/0003106703200325

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.