Predictive Text System for Bahasa with Frequency, n-gram, Probability Table and Syntactic using Grammar

Derwin Suhartono, Garry Wong, Polim Kusuma, Silviana Saputra

2014

Abstract

Predictive text system is an alternative way to improve human communication, especially in matter of typing. Originally, predictive text system was intended for people who have flaws in verbal and motor. This system is aimed to all people who demands speed and accuracy in typing a document. There were many similar researches which develop this system that had their own strengths and weaknesses. This research attempts to develop the algorithm for predictive text system by combining four methods from previous researches and focus only in Bahasa (Indonesian language). The four methods consist of frequency, n-gram, probability table, and syntactic using grammar. Frequency method is used to rank words based on how many times the words were typed. Probability table is a table designed for storing data such as predefined phrases and trained data. N-gram is used to train data so that it is able to predict the next word based on previous word. And syntactic using grammar will predict the next word based on syntactic relationship between previous word and next word. By using this combination, user can reduce the keystroke up to 59% in which the average keystrokes saving is about 50%.

References

  1. Ganslandt, S., Jorwall, J., Nugues. P., 2009. Predictive Text Entry using Syntax and Semantics. Proceedings of the 11th International Conference on Parsing Technologies (IWPT). Association for Computational Linguistics.
  2. Gong, J., Tarasewich, P., MacKenzie, I. S., 2008. Improved Word List Ordering for Text Entry on Ambiguous Keypads. Proceedings of the Fifth Nordic Conference on Human-Computer Interaction - NordiCHI 2008. ACM.
  3. Jurafsky, D., Martin, J. H., 2008. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Pearson, Prentice Hall. New Jersey, 2nd edition.
  4. MacKenzie, I. S., 2002. KSPC (keystrokes per character) as a characteristic of text entry techniques. Proceedings of the Fourth International Symposium on Human-Computer Interaction with Mobile Devices. Springer-Verlag.
  5. MacKenzie, I. S., Ishii, K. T., 2007. Text entry systems: mobility, accessibility, universality. Morgan Kaufmann Publishers. San Francisco.
  6. Nalavade, D., Mahule, T., Ketkar, H., 2008. PreText: A Predictive Text Entry System for Mobile Phones. Proceedings of the World Congress on Engineering 2008 Vol III. International Association of Engineers.
  7. Soukoreff, R. W., MacKenzie, I. S., 2003. Metrics for text entry research: An evaluation of MSD and KSPC, and a new unified error metric. Proceedings of the ACM Conference on Human Factors in Computing Systems - CHI 2003. ACM.
  8. Trnka, K., McCoy, K. F., 2008. Evaluating Word Prediction: Framing Keystroke Savings. HLT-Short 7808 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers Pages 261-264. Association for Computational Linguistics.
  9. Verberne, S., Bosch, A. V. D., Strik, H., Boves, L., 2012. The effect of domain and text type on text prediction quality. EACL 7812 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics.
  10. Vitoria, N. G., Abascal, J., 2006. Text prediction system: a survey. Universal Access in the Information Society. Springer-Verlag.
Download


Paper Citation


in Harvard Style

Suhartono D., Wong G., Kusuma P. and Saputra S. (2014). Predictive Text System for Bahasa with Frequency, n-gram, Probability Table and Syntactic using Grammar . In Proceedings of the 6th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-758-015-4, pages 305-311. DOI: 10.5220/0004756603050311


in Bibtex Style

@conference{icaart14,
author={Derwin Suhartono and Garry Wong and Polim Kusuma and Silviana Saputra},
title={Predictive Text System for Bahasa with Frequency, n-gram, Probability Table and Syntactic using Grammar},
booktitle={Proceedings of the 6th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2014},
pages={305-311},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004756603050311},
isbn={978-989-758-015-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 6th International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - Predictive Text System for Bahasa with Frequency, n-gram, Probability Table and Syntactic using Grammar
SN - 978-989-758-015-4
AU - Suhartono D.
AU - Wong G.
AU - Kusuma P.
AU - Saputra S.
PY - 2014
SP - 305
EP - 311
DO - 10.5220/0004756603050311