HD-VoxelFlex: Flexible High-Definition Voxel Grid Representation

Igor Vozniak, Pavel Astreika, Philipp Müller, Nils Lipp, Christian Müller, Philipp Slusallek

2024

Abstract

Voxel grids are an effective means to represent 3D data, as they accurately preserve spatial relations. However, the inherent sparseness of voxel grid representations leads to significant memory consumption in deep learning architectures, in particular for high-resolution (HD) inputs. As a result, current state-of-the-art approaches to the reconstruction of 3D data tend to avoid voxel grid inputs. In this work, we propose HD-VoxelFlex, a novel 3D CNN architecture that can be flexibly applied to HD voxel grids with only a moderate increase in training parameters and memory consumption. HD-VoxelFlex introduces three architectural novelties. First, to improve the model’s generalizability, we introduce a random shuffling layer. Second, to reduce information loss, we introduce a novel reducing skip connection layer. Third, to improve modelling of local structure that is crucial for HD inputs, we incorporate a kNN distance mask as input. We combine these novelties with a “bag of tricks” identified in a comprehensive literature review. Based on these novelties, we propose six novel building blocks for our encoder-decoder HD-VoxelFlex architecture. In evaluations on the ModelNet10/40 and PCN datasets, HD-VoxelFlex outperforms the state of the art in all point cloud reconstruction metrics. We show that HD-VoxelFlex is able to process high-definition (128³, 192³) voxel grid inputs at much lower memory consumption than previous approaches. Furthermore, we show that HD-VoxelFlex, without additional fine-tuning, demonstrates competitive performance in the classification task, proving its generalization ability. As such, our results underline the neglected potential of voxel grid input for deep learning architectures.
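To make the memory and sparseness argument in the abstract concrete, the following is a minimal sketch that voxelizes a point cloud and reports the dense grid size and occupancy at several resolutions. It is not taken from the paper: the helper names `voxelize` and `dense_grid_bytes`, the 2048-point random cloud, the unit-cube normalization, and the 4 bytes per voxel (one float32 channel) are all assumptions chosen for illustration.

```python
import random


def voxelize(points, resolution):
    """Map points in the unit cube [0, 1]^3 to a set of occupied voxel indices.

    Hypothetical helper for illustration; assumes coordinates are already
    normalized to the unit cube.
    """
    occupied = set()
    for x, y, z in points:
        # Clamp so that a coordinate of exactly 1.0 falls into the last cell.
        i = min(int(x * resolution), resolution - 1)
        j = min(int(y * resolution), resolution - 1)
        k = min(int(z * resolution), resolution - 1)
        occupied.add((i, j, k))
    return occupied


def dense_grid_bytes(resolution, bytes_per_voxel=4):
    """Bytes needed to store one dense single-channel grid (float32 assumed)."""
    return resolution ** 3 * bytes_per_voxel


random.seed(0)
# Assumed stand-in for a real point cloud: 2048 uniformly random points.
points = [(random.random(), random.random(), random.random()) for _ in range(2048)]

for res in (32, 64, 128, 192):
    occupied = voxelize(points, res)
    total = res ** 3
    mib = dense_grid_bytes(res) / 2 ** 20
    print(f"{res}^3: {total} voxels, {mib:.1f} MiB dense, "
          f"occupancy {len(occupied) / total:.4%}")
```

The number of occupied voxels is bounded by the number of points, while the dense grid grows cubically with resolution, so the occupied fraction shrinks quickly at 128³ and 192³. Feature maps inside a 3D CNN multiply the per-grid cost by the channel count and batch size, which is the pressure the abstract refers to.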


Paper Citation


in Harvard Style

Vozniak I., Astreika P., Müller P., Lipp N., Müller C. and Slusallek P. (2024). HD-VoxelFlex: Flexible High-Definition Voxel Grid Representation. In Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP; ISBN 978-989-758-679-8, SciTePress, pages 204-219. DOI: 10.5220/0012374800003660


in Bibtex Style

@conference{visapp24,
author={Igor Vozniak and Pavel Astreika and Philipp Müller and Nils Lipp and Christian Müller and Philipp Slusallek},
title={HD-VoxelFlex: Flexible High-Definition Voxel Grid Representation},
booktitle={Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP},
year={2024},
pages={204-219},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012374800003660},
isbn={978-989-758-679-8},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP
TI - HD-VoxelFlex: Flexible High-Definition Voxel Grid Representation
SN - 978-989-758-679-8
AU - Vozniak I.
AU - Astreika P.
AU - Müller P.
AU - Lipp N.
AU - Müller C.
AU - Slusallek P.
PY - 2024
SP - 204
EP - 219
DO - 10.5220/0012374800003660
PB - SciTePress