A Digital Palaeographic Approach towards Writer Identification in the Dead Sea Scrolls

Maruf A. Dhali, Sheng He, Mladen Popović, Eibert Tigchelaar, Lambert Schomaker

2017

Abstract

To understand the historical context of an ancient manuscript, scholars rely on the prior knowledge of writer and date of that document. In this paper, we study the Dead Sea Scrolls, a collection of ancient manuscripts with immense historical, religious, and linguistic significance, which was discovered in the mid-20th century near the Dead Sea. Most of the manuscripts of this collection have become digitally available only recently and techniques from the pattern recognition field can be applied to revise existing hypotheses on the writers and dates of these scrolls. This paper presents our ongoing work which aims to introduce digital palaeography to the field and generate fresh empirical data by means of pattern recognition and artificial intelligence. Challenges in analyzing the Dead Sea Scrolls are highlighted by a pilot experiment identifying the writers using several dedicated features. Finally, we discuss whether to use specifically-designed shape features for writer identification or to use the Deep Learning methods on a relatively limited ancient manuscript collection which is degraded over the course of time and is not labeled, as in the case of the Dead Sea Scrolls.

References

  1. Adak, C. and Chaudhuri, B. B. (2015). Writer identification from offline isolated bangla characters and numerals. In ICDAR, pages 486-490. IEEE.
  2. Baird, H. S. (1992). Document image defect models. In Structured Document Image Analysis, pages 546-556. Springer.
  3. Belongie, S., Malik, J., and Puzicha, J. (2002). Shape matching and object recognition using shape contexts. IEEE PAMI, 24(4):509-522.
  4. Benhamou, S. (2004). How to reliably estimate the tortuosity of an animal's path: straightness, sinuosity, or fractal dimension? Theoretical Biology, 229(2).
  5. Brink, A., Smit, J., Bulacu, M., and Schomaker, L. (2012). Writer identification using directional ink-trace width measurements. PR, 45(1):162-171.
  6. Bulacu, M. and Schomaker, L. (2007). Text-independent writer identification and verification using textural and allographic features. IEEE PAMI, 29(4):701-717.
  7. Bulacu, M., Schomaker, L., and Brink, A. (2007). Textindependent writer identification and verification on offline arabic handwriting. In ICDAR, volume 2, pages 769-773. IEEE.
  8. Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and FeiFei, L. (2009). ImageNet: A large-scale hierarchical image database. In CVPR.
  9. Fisher, R. A. (1925). Statistical methods for research workers. Genesis Publishing Pvt Ltd.
  10. Ha, T. M. and Bunke, H. (1997). Off-line, handwritten numeral recognition by perturbation method. IEEE PAMI, 19(5):535-539.
  11. He, S., Samara, P., Burgers, J., and Schomaker, L. (2016). A multiple-label guided clustering algorithm for historical document dating and localization. IEEE Transactions on Image Processing, 25(11):5252-5265.
  12. He, S. and Schomaker, L. (2014). Delta-n hinge: Rotationinvariant features for writer identification. In ICPR, pages 2023-2028.
  13. He, S. and Schomaker, L. (2016). Writer identification using curvature-free features. PR.
  14. He, S. and Schomaker, L. (2017). Beyond ocr: Multifaceted understanding of handwritten document characteristics. PR, 63:321-333.
  15. He, S., Wiering, M., and Schomaker, L. (2015). Junction detection in handwritten documents and its application to writer identification. PR, 48(12):4036-4048.
  16. Ito, S. and Kubota, S. (2010). Object classification using heterogeneous co-occurrence features. In European Conference on Computer Vision, pages 701-714. Springer.
  17. Karunakara, K. and Mallikarjunaswamy, B. (2011). Writer identification based on offline handwritten document images in kannada language using empirical mode decomposition method. Writer, 30(6).
  18. LeCun, Y., Bengio, Y., and Hinton, G. (2015). Deep learning. Nature, 521(7553):436-444.
  19. Leuzzi, F., Tessitore, G., Delfino, S., Fusco, C., Gneo, M., and Zambonini, G. (2016). A statistical approach to speaker identification in forensic phonetics field.
  20. Li, Y., Wang, S., Tian, Q., and Ding, X. (2015). Feature representation for statistical-learning-based object detection: A review. PR, 48(11):3542-3559.
  21. Lim, T. and Alexander, P. (1995). Volume 1. In The Dead Sea Scrolls Electronic Library. Brill.
  22. Long, M. and Wang, J. (2015). Learning transferable features with deep adaptation networks. CoRR, abs/1502.02791, 1:2.
  23. Mikolajczyk, K. and Schmid, C. (2005). A performance evaluation of local descriptors. IEEE PAMI, 27(10):1615-1630.
  24. Monk (2016). Medieval palaeographic scale data set (online collection).
  25. Otsu, N. (1975). A threshold selection method from graylevel histograms. Automatica, 11(285-296):23-27.
  26. Plamondon, R. and Lorette, G. (1989). Automatic signature verification and writer identification - the state of the art. Pattern recognition, 22(2):107-131.
  27. Popovic, M. (2012). Qumran as scroll storehouse in times of crisis? a comparative perspective on judaean desert manuscript collections 1. Journal for the Study of Judaism, 43(4-5):551-594.
  28. Popovic, M. (2015). The ancient 'library' of qumran between urban and rural culture. In The Dead Sea Scrolls at Qumran and the Concept of a Library, pages 155- 167. Brill.
  29. Prasad, D. K., Quek, C., Leung, M. K., and Cho, S.-Y. (2011). A parameter independent line fitting method. In ACPR, pages 441-445.
  30. Qi, X., Xiao, R., Li, C.-G., Qiao, Y., Guo, J., and Tang, X. (2014). Pairwise rotation invariant co-occurrence local binary pattern. IEEE PAMI, 36(11):2199-2213.
  31. Schomaker, L. (2016). Design considerations for a largescale image-based text search engine in historical manuscript collections. it-Information Technology, 58(2):80-88.
  32. Schomaker, L. and Bulacu, M. (2004). Automatic writer identification using connected-component contours and edge-based features of uppercase western script. IEEE PAMI, 26(6):787-798.
  33. Shor, P., Manfredi, M., Bearman, G. H., Marengo, E., Boydston, K., and Christens-Barry, W. A. (2014). The leon levy dead sea scrolls digital library: The digitization project of the dead sea scrolls. Journal of Eastern Mediterranean Archaeology and Heritage Studies, 2(2):71-89.
  34. Siddiqi, I. and Vincent, N. (2010). Text independent writer recognition using redundant writing patterns with contour-based orientation and curvature features. PR, 43(11):3853-3865.
  35. Sobel, I. (1990). An isotropic 3× 3 image gradient operator. Machine Vision for three-demensional Sciences.
  36. Stokes, P. A. (2015). Digital approaches to paleography and book history: some challenges, present and future. Frontiers in Digital Humanities, 2:5.
  37. Tigchelaar, E. (2002). In search of the scribe of 1qs. In Emanuel, pages 339-352. Brill.
  38. Tigchelaar, E. (2010). Dead sea scrolls. In The Eerdmans Dictionary of Early Judaism, pages 163-180. Eerdmans.
  39. Van der Zant, T., Schomaker, L., and Haak, K. (2008). Handwritten-word spotting using biologically inspired features. IEEE PAMI, 30(11):1945-1957.
  40. Varga, T. and Bunke, H. (2003). Effects of training set expansion in handwriting recognition using synthetic data. In Proc. 11th Conf. of the Int. Graphonomics Society, pages 200-203. Citeseer.
  41. Yardeni, A. (2002). The book of Hebrew script: history, palaeography, script styles, calligraphy & design. Oak Knoll Pr.
Download


Paper Citation


in Harvard Style

Dhali M., He S., Popović M., Tigchelaar E. and Schomaker L. (2017). A Digital Palaeographic Approach towards Writer Identification in the Dead Sea Scrolls . In Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-758-222-6, pages 693-702. DOI: 10.5220/0006249706930702


in Bibtex Style

@conference{icpram17,
author={Maruf A. Dhali and Sheng He and Mladen Popović and Eibert Tigchelaar and Lambert Schomaker},
title={A Digital Palaeographic Approach towards Writer Identification in the Dead Sea Scrolls},
booktitle={Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2017},
pages={693-702},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006249706930702},
isbn={978-989-758-222-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - A Digital Palaeographic Approach towards Writer Identification in the Dead Sea Scrolls
SN - 978-989-758-222-6
AU - Dhali M.
AU - He S.
AU - Popović M.
AU - Tigchelaar E.
AU - Schomaker L.
PY - 2017
SP - 693
EP - 702
DO - 10.5220/0006249706930702