Food Recognition: Can Deep Learning or Bag-of-Words Match Humans?

Pedro Furtado

2020

Abstract

Automated smartphone-based food recognition is a useful basis for applications targeted at dietary assessment. Dish recognition is a necessary step in that process. One of the possible approaches to use is deep learning-based recognition, another one is bag-of-words based classification. Deep learning has increasingly become the preferred approach to use in either this or other image classification tasks. Additionally, if humans are better recognizing the dish, the automated approach is useless (it will be less error-prone for the user to identify the dish instead of capturing the photo). We compare the alternatives of Deep Learning (DL), Bag-of-words (BoW) and Humans (H). The best deep learner beats humans when on few food categories, but looses if it has to learn many more food categories, which is expected in real contexts. We describe the approaches, analyze the results, draw conclusions and design further work to evaluate further and improve the approaches.

Download


Paper Citation


in Harvard Style

Furtado P. (2020). Food Recognition: Can Deep Learning or Bag-of-Words Match Humans?. In Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 2: BIOIMAGING; ISBN 978-989-758-398-8, SciTePress, pages 102-108. DOI: 10.5220/0008893301020108


in Bibtex Style

@conference{bioimaging20,
author={Pedro Furtado},
title={Food Recognition: Can Deep Learning or Bag-of-Words Match Humans?},
booktitle={Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 2: BIOIMAGING},
year={2020},
pages={102-108},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008893301020108},
isbn={978-989-758-398-8},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020) - Volume 2: BIOIMAGING
TI - Food Recognition: Can Deep Learning or Bag-of-Words Match Humans?
SN - 978-989-758-398-8
AU - Furtado P.
PY - 2020
SP - 102
EP - 108
DO - 10.5220/0008893301020108
PB - SciTePress