Deep Learning-Powered Assembly Step Classification for Intricate Machines

Luca Rodiga, Luca Rodiga, Eva Eggeling, Ulrich Krispel, Torsten Ullrich, Torsten Ullrich

2024

Abstract

Augmented Reality-based assistance systems can help qualified technicians by providing them with technical details. However, the applicability is limited by the low availability of real data. In this paper, we focus on synthetic renderings of CAD data. Our objective is to investigate different model architectures within the machine-learning component and compare their performance. The training data consists of CAD renderings from different viewpoints distributed over a sphere around the model. Utilizing the advantages of transfer learning and pre-trained backbones we trained different versions of EfficientNet and EfficientNetV2 on these images for every assembly step in two resolutions. The classification performance was evaluated on a smaller test set of synthetic renderings and a dataset of real-world images of the model. The best Top1-accuracy on the real-world dataset is achieved by the medium-sized EfficientNetV2 with 57.74%, while the best Top5-accuracy is provided by EfficientNetV2 Small. Consequently, our approach has a good classification performance indicating the real-world applicability of such a deep learning classifier in the near future.

Download


Paper Citation


in Harvard Style

Rodiga L., Eggeling E., Krispel U. and Ullrich T. (2024). Deep Learning-Powered Assembly Step Classification for Intricate Machines. In Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP; ISBN 978-989-758-679-8, SciTePress, pages 500-507. DOI: 10.5220/0012376300003660


in Bibtex Style

@conference{visapp24,
author={Luca Rodiga and Eva Eggeling and Ulrich Krispel and Torsten Ullrich},
title={Deep Learning-Powered Assembly Step Classification for Intricate Machines},
booktitle={Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP},
year={2024},
pages={500-507},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012376300003660},
isbn={978-989-758-679-8},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP
TI - Deep Learning-Powered Assembly Step Classification for Intricate Machines
SN - 978-989-758-679-8
AU - Rodiga L.
AU - Eggeling E.
AU - Krispel U.
AU - Ullrich T.
PY - 2024
SP - 500
EP - 507
DO - 10.5220/0012376300003660
PB - SciTePress