loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Hugo Alonso 1 ; 2 and Teresa Candeias 1

Affiliations: 1 Universidade Lusófona – Centro Universitário do Porto, Rua Augusto Rosa, n.º 24, 4000-098 Porto, Portugal ; 2 Universidade de Aveiro, Campus Universitário de Santiago, 3810-193 Aveiro, Portugal

Keyword(s): Wine, Classification, Data Imbalance, Re-Sampling, Learning Methods, Predictive Models.

Abstract: The wine industry has becoming increasingly important worldwide and is one of the most significant industries in Portugal. In a previous paper, the problem of predicting how much a Portuguese consumer is willing to pay for a bottle of wine was considered for the first time ever. The problem was treated as a multi-class ordinal classification task. Although we achieved good prediction results, globally speaking, it was difficult to identify rare cases of consumers who are interested in paying for more expensive wines. We found that this was a direct consequence of data imbalance. Therefore, here, we present a first attempt to deal with this issue, based on the use of re-sampling strategies to balance the training data, namely random under-sampling, random over-sampling with replacement and the synthetic minority over-sampling technique. We consider several learning methods and develop various predictive models. A comparative study is carried out and its results highlight the importanc e of a careful choice of the re-sampling strategy and the learning method in order to get the best possible prediction results. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.117.85.200

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Alonso, H. and Candeias, T. (2023). Predicting How Much a Consumer Is Willing to Pay for a Bottle of Wine: Dealing With Data Imbalance. In Proceedings of the 12th International Conference on Data Science, Technology and Applications - DATA; ISBN 978-989-758-664-4; ISSN 2184-285X, SciTePress, pages 263-270. DOI: 10.5220/0012068800003541

@conference{data23,
author={Hugo Alonso. and Teresa Candeias.},
title={Predicting How Much a Consumer Is Willing to Pay for a Bottle of Wine: Dealing With Data Imbalance},
booktitle={Proceedings of the 12th International Conference on Data Science, Technology and Applications - DATA},
year={2023},
pages={263-270},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012068800003541},
isbn={978-989-758-664-4},
issn={2184-285X},
}

TY - CONF

JO - Proceedings of the 12th International Conference on Data Science, Technology and Applications - DATA
TI - Predicting How Much a Consumer Is Willing to Pay for a Bottle of Wine: Dealing With Data Imbalance
SN - 978-989-758-664-4
IS - 2184-285X
AU - Alonso, H.
AU - Candeias, T.
PY - 2023
SP - 263
EP - 270
DO - 10.5220/0012068800003541
PB - SciTePress