Towards Unsupervised Word Error Correction in Textual Big Data

Joao Paulo Carvalho; Sérgio Curto

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Towards Unsupervised Word Error Correction in Textual Big Data

Topics: Fuzzy Information Processing, Fusion, Text Mining

In Proceedings of the International Conference on Fuzzy Computation Theory and Applications - Volume 0IJCCI, 181-186, 2014 , Rome, Italy

Authors: Joao Paulo Carvalho and Sérgio Curto

Affiliation: Universidade de Lisboa, Portugal

Keyword(s): Fuzzy Text Preprocessing, Medical Text Reports, Natural Language Processing, Word Similarity, MIMIC II.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Computational Intelligence ; Fuzzy Information Processing, Fusion, Text Mining ; Fuzzy Systems ; Soft Computing

Abstract: Large unedited technical textual databases might contain information that cannot be properly extracted using Natural Language Processing (NLP) tools due to the many existent word errors. A good example is the MIMIC II database, where medical text reports are a direct representation of experts’ views on real time observable data. Such reports contain valuable information that can improve predictive medic decision making models based on physiological data, but have never been used with that goal so far. In this paper we propose a fuzzy based semi-automatic method to specifically address the large number of word errors contained in such databases that will allow the direct application of NLP techniques, such as Bag of Words, to the textual data.

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 3.143.4.181

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Carvalho, J. and Curto, S. (2014). Towards Unsupervised Word Error Correction in Textual Big Data. In Proceedings of the International Conference on Fuzzy Computation Theory and Applications (IJCCI 2014) - FCTA; ISBN 978-989-758-053-6, SciTePress, pages 181-186. DOI: 10.5220/0005140401810186

@conference{fcta14,
author={Joao Paulo Carvalho. and Sérgio Curto.},
title={Towards Unsupervised Word Error Correction in Textual Big Data},
booktitle={Proceedings of the International Conference on Fuzzy Computation Theory and Applications (IJCCI 2014) - FCTA},
year={2014},
pages={181-186},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005140401810186},
isbn={978-989-758-053-6},
}

TY - CONF

JO - Proceedings of the International Conference on Fuzzy Computation Theory and Applications (IJCCI 2014) - FCTA
TI - Towards Unsupervised Word Error Correction in Textual Big Data
SN - 978-989-758-053-6
AU - Carvalho, J.
AU - Curto, S.
PY - 2014
SP - 181
EP - 186
DO - 10.5220/0005140401810186
PB - SciTePress