Authors:
Tolgahan Cakaloglu (1,2), Xiaowei Xu (1) and Roshith Raghavan (3)
Affiliations:
(1) University of Arkansas, Little Rock, Arkansas, U.S.A.
(2) Walmart Labs, Dallas, Texas, U.S.A.
(3) Walmart Labs, Bentonville, Arkansas, U.S.A.
Keyword(s):
Natural Language Processing, Information Retrieval, Deep Learning, Learning Representations, Text Matching.
Abstract:
Learning hierarchical representations has been vital in natural language processing and information retrieval. With recent advances, the importance of learning the context of words has been underscored. In this paper we propose EmBoost, i.e., Embedding Boosting, which combines word or document vector representations learned from multiple embedding models. The advantage of this approach is that the resulting higher-order embedding represents documents at multiple levels of abstraction. The performance gain from this approach is demonstrated by comparison with various existing text embedding strategies on retrieval and semantic similarity tasks using the Stanford Question Answering Dataset (SQuAD) and Question Answering by Search And Reading (QUASAR). The multilevel abstract word embedding is consistently superior to existing solo strategies, including GloVe, FastText, ELMo, and BERT-based models. Our study shows that further gains can be made when a deep residual neural model is specifically trained for document retrieval.
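As a rough illustration of the idea described in the abstract (a sketch, not the paper's actual method or configuration), vectors for the same token from several pre-trained models can be combined, for example by normalizing and concatenating them into one multi-level representation. All model names and dimensions below are assumptions for illustration:

```python
import numpy as np

# Hypothetical per-model embeddings for the same token; the models
# and dimensions are illustrative assumptions, not the paper's setup.
glove_vec = np.random.rand(300)     # e.g. a 300-d GloVe vector
fasttext_vec = np.random.rand(300)  # e.g. a 300-d FastText vector
elmo_vec = np.random.rand(1024)     # e.g. a 1024-d ELMo vector

def boost_embeddings(vectors):
    """Concatenate L2-normalized vectors from multiple embedding
    models into a single multi-level representation (one simple
    combination scheme; the paper may use a different one)."""
    normed = [v / (np.linalg.norm(v) + 1e-12) for v in vectors]
    return np.concatenate(normed)

boosted = boost_embeddings([glove_vec, fasttext_vec, elmo_vec])
print(boosted.shape)  # (1624,)
```

Normalizing each segment before concatenation keeps one model's larger vector magnitudes from dominating downstream similarity computations; the combined vector can then be fed to a retrieval or similarity model.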