Tree Distance in Answer Retrieval and Parser Evaluation

Martin Emms

Abstract

The use of syntactic tree-distance as a surrogate for semantic distance in an answer retrieval task is investigated. The feasibility of this is confirmed by showing that retrieval performance increases with parse quality, and an application of this to parser evaluation is discussed. Variant definitions of tree-distance involving parameters such as whole vs sub-tree, node weighting, wild-card trees and lexical emphasis are compared with each other and with sub-string distance.



Paper Citation


in Harvard Style

Emms M. (2005). Tree Distance in Answer Retrieval and Parser Evaluation. In Proceedings of the 2nd International Workshop on Natural Language Understanding and Cognitive Science - Volume 1: NLUCS, (ICEIS 2005) ISBN 972-8865-23-6X, pages 155-160. DOI: 10.5220/0002565101550160


in Bibtex Style

@conference{nlucs05,
author={Martin Emms},
title={Tree Distance in Answer Retrieval and Parser Evaluation},
booktitle={Proceedings of the 2nd International Workshop on Natural Language Understanding and Cognitive Science - Volume 1: NLUCS, (ICEIS 2005)},
year={2005},
pages={155-160},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002565101550160},
isbn={972-8865-23-6X},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 2nd International Workshop on Natural Language Understanding and Cognitive Science - Volume 1: NLUCS, (ICEIS 2005)
TI - Tree Distance in Answer Retrieval and Parser Evaluation
SN - 972-8865-23-6X
AU - Emms M.
PY - 2005
SP - 155
EP - 160
DO - 10.5220/0002565101550160