AUTOMATIC TEXT ANNOTATION FOR QUESTIONS

Gang Liu, Zhi Lu, Tianyong Hao, Liu Wenyin

2010

Abstract

An automatic annotation method for annotating text with semantic labels is proposed for question answering systems. The approach first extracts the keywords from a given question. Semantic label selection module is then employed to select the semantic labels to tag keywords. In order to distinguish multi-senses and assigns best semantic labels, a Bayesian based method is used by referring to historically annotated questions. If there is no appropriate label, WordNet is then employed to obtain candidate labels by calculating the similarity between each keyword in the question and the concept list in our predefined Tagger Ontology. Experiments on 6 categories show that this annotation method achieves the precision of 76% in average.

References

  1. Cheng, P.J., Chiao, H.C., Pan, Y.C. and Chien, L.F. 2005. Annotating text segments in documents for search. Proceedings of the 2005 IEEE/WIC/ACM International Conference on Web Intelligence, pp. 317- 320.
  2. Hao, T.Y., Hu, D.W., Liu, W.Y. and Zeng, Q.T. 2007. Semantic patterns for user-interactive question answering, Journal of Concurrency and Computation: Practice and Experience 20(1), 2007.
  3. Lin, D. 2003. Dependency-based evaluation of MINIPAR. Treebanks: Building and Using Parsed Corpora, 2003.
  4. Prager, J., Brown, E. and Coden, A. 2000. Questionanswering by predictive annotation, Proceedings of the 23rd Annual International ACM SIGIR conference, Athens, 2000.
  5. Sfihari, R. and Li, W. 1999. Question answering supported by information extraction, Proceedings of the Eighth Text REtrieval Conference (TREC8), Gaithersburg, Md., 1999.
  6. Carr, L., Bechhofer, S., Goble, C. and Hall, W. 2001. Conceptual linking: ontology-based open hypermedia, Proceedings of the 10th International World Wide Web Conference, pp. 334-342, Hong Kong, 2001.
  7. Handschuh, S., Staab, S. and Ciravegna, F. 2002. SCREAM - semiautomatic creation of metadata, Proceedings of the 13th International Conference on Knowledge Engineering and Management (EKAW 2002), Springer Verlag, 2002.
  8. Vargas-Vera, M., Motta, E., Domingue, J., Lanzoni, M., Stutt, A. and Ciravegna, F. 2002. MnM: ontology driven semi-automatic and automatic support for semantic markup, Proceedings of the 13th International Conference on Knowledge Engineering and Management (EKAW 2002), Springer Verlag, 2002.
  9. Kiryakov, A., Popov,B., Ognyanoff,D., Manov, D. and Goranov, K.M. 2004. Semantic annotation, indexing, and retrieval, Journal of Web Semantics, pp. 49-79, 2004.
  10. Reeve, L. and Han H. 2005. Survey of semantic annotation platforms, Proceedings of the 2005 ACM Symposium on Applied Computing, Santa Fe, New Mexico, March 13 - 17, 2005.
  11. Veale, T. 2002. Meta-knowledge annotation for efficient natural-language question-answering, Proceedings of the 13th Irish International Conference (AICS 2002), Limerick, Ireland, pp. 115-128, September 12-13, 2002.
  12. Prager, J., Radev D. and Czuba K. 2001. Answering whatis questions by virtual annotation, Proceedings of the first International Conference on Human Language Technology Research 2001, San Diego, March 18 - 21, 2001.
  13. Hays, D. 1964. Dependency theory: a formalism and some observations, Language, Linguistic Society of America, Vol. 40, No. 4, pp. 511-525, 1964.
  14. Miller, G. A. 1995. WordNet: a lexical database for English, Communications of the ACM, Vol. 38, Issue 11, 1995.
  15. Li, Y.H., Bandar, Z.A. and McLean, D. 2003. An approach for measuring semantic similarity between words using multiple information sources, IEEE Transactions on Knowledge and Data Engineering Vol. 15, No. 4, July/August, 2003.
  16. Cowie, J., Ludovik, E., Molina-Salgado, H., Nirenburg, S. and Sheremetyeva, S. 2000. Automatic question answering, Proceedings of the Rubin Institute for Advanced Orthopedics Conference, Paris, 2000.
  17. Álvez, J., Atserias, J., Carrera, J., Climent, S., Laparra, E., Oliver, A. and Rigau, G. 2008. Complete and consistent annotation of wordNet using the top concept ontology. Proceedings of Sixth International Language Resources and Evaluation (LREC'08), European Language Resources Association (ELRA), 2008.
  18. Hao, T.Y., Ni, X.L., Quan, X.J., W.Y. Liu 2009. Automatic Construction of Semantic Dictionary for Question Categorization, Proceedings of The 13th World Multi-Conference on Systemics, Cybernetics and Informatics: WMSCI 2009, Orlando, pp. 220-225, July 10-13, 2009.
Download


Paper Citation


in Harvard Style

Liu G., Lu Z., Hao T. and Wenyin L. (2010). AUTOMATIC TEXT ANNOTATION FOR QUESTIONS . In Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST, ISBN 978-989-674-025-2, pages 227-236. DOI: 10.5220/0002796702270236


in Bibtex Style

@conference{webist10,
author={Gang Liu and Zhi Lu and Tianyong Hao and Liu Wenyin},
title={AUTOMATIC TEXT ANNOTATION FOR QUESTIONS},
booktitle={Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST,},
year={2010},
pages={227-236},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002796702270236},
isbn={978-989-674-025-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST,
TI - AUTOMATIC TEXT ANNOTATION FOR QUESTIONS
SN - 978-989-674-025-2
AU - Liu G.
AU - Lu Z.
AU - Hao T.
AU - Wenyin L.
PY - 2010
SP - 227
EP - 236
DO - 10.5220/0002796702270236