loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Kazuko Takahashi 1 ; Hirofumi Taki 2 ; Shunsuke Tanabe 3 and Wei Li 4

Affiliations: 1 Keiai University, Japan ; 2 Hosei University, Japan ; 3 Waseda University, Japan ; 4 Tokyo Institute of Technology, China

Keyword(s): Automatic Coding System, Answers to Open-Ended Question, Occupation and Industry Coding, Natural Language Processing, Machine Learning, Confidence Level.

Related Ontology Subjects/Areas/Topics: Applications ; Applications and Case-studies ; Artificial Intelligence ; Knowledge Engineering and Ontology Development ; Knowledge-Based Systems ; Natural Language Processing ; Pattern Recognition ; Symbolic Systems

Abstract: We develop a new automatic coding system with a three-grade confidence level corresponding to each of the national/international standard code sets for answers to open-ended questions regarding to respondent’s occupation and industry in social surveys including a national census. The “occupation and industry coding” is a necessary task for statistical processing. However, this task requires a great deal of labor and time-consuming. In addition, inconsistent results occur if the coders are not experts of coding. In formal research, various automatic coding systems have been developed, which are incomplete and generally unfriendly to a non-developer user. Our new system assigns three candidate codes to an answer for coders by SVMs (Support Vector Machines), and attaches a three-grade confidence level to the first-ranked predicted code by using classification scores to support a manual check of the results. The system is now open to the public through the Website of the Social Science J apan Data Archive (SSJDA). After the submitted data file which followed the specified format is approved, the users can obtain files of codes for up to four kinds with a three-grade confidence level. In this paper, we describe our system and evaluate it. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.119.122.125

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Takahashi, K.; Taki, H.; Tanabe, S. and Li, W. (2014). An Automatic Coding System with a Three-Grade Confidence Level Corresponding to the National/International Occupation and Industry Standard - Open to the Public on the Web. In Proceedings of the International Conference on Knowledge Engineering and Ontology Development (IC3K 2014) - KEOD; ISBN 978-989-758-049-9; ISSN 2184-3228, SciTePress, pages 369-375. DOI: 10.5220/0005131703690375

@conference{keod14,
author={Kazuko Takahashi. and Hirofumi Taki. and Shunsuke Tanabe. and Wei Li.},
title={An Automatic Coding System with a Three-Grade Confidence Level Corresponding to the National/International Occupation and Industry Standard - Open to the Public on the Web},
booktitle={Proceedings of the International Conference on Knowledge Engineering and Ontology Development (IC3K 2014) - KEOD},
year={2014},
pages={369-375},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005131703690375},
isbn={978-989-758-049-9},
issn={2184-3228},
}

TY - CONF

JO - Proceedings of the International Conference on Knowledge Engineering and Ontology Development (IC3K 2014) - KEOD
TI - An Automatic Coding System with a Three-Grade Confidence Level Corresponding to the National/International Occupation and Industry Standard - Open to the Public on the Web
SN - 978-989-758-049-9
IS - 2184-3228
AU - Takahashi, K.
AU - Taki, H.
AU - Tanabe, S.
AU - Li, W.
PY - 2014
SP - 369
EP - 375
DO - 10.5220/0005131703690375
PB - SciTePress