
Li, Y., Zhang, Y., Razmjoo, A., and Calinon, S. (2024b).
Representing robot geometry as distance fields: Ap-
plications to whole-body manipulation. In 2024 IEEE
International Conference on Robotics and Automation
(ICRA), page 15351–15357. IEEE.
Liu, P., Zhang, K., Tateo, D., Jauhri, S., Peters, J., and
Chalvatzaki, G. (2022). Regularized deep signed
distance fields for reactive motion generation. In
2022 IEEE/RSJ International Conference on Intelli-
gent Robots and Systems (IROS). IEEE.
Mescheder, L., Oechsle, M., Niemeyer, M., Nowozin, S.,
and Geiger, A. (2019). Occupancy networks: Learn-
ing 3d reconstruction in function space. In 2019
IEEE/CVF Conference on Computer Vision and Pat-
tern Recognition (CVPR). IEEE.
Mildenhall, B., Srinivasan, P. P., Tancik, M., Barron, J. T.,
Ramamoorthi, R., and Ng, R. (2020). NeRF: Rep-
resenting Scenes as Neural Radiance Fields for View
Synthesis, page 405–421. Springer International Pub-
lishing.
Ortiz, J., Clegg, A., Dong, J., Sucar, E., Novotny, D., Zoll-
hoefer, M., and Mukadam, M. (2022). isdf: Real-
time neural signed distance fields for robot perception.
In Robotics: Science and Systems XVIII, RSS2022.
Robotics: Science and Systems Foundation.
Osher, S. and Fedkiw, R. (2003). Level Set Methods and
Dynamic Implicit Surfaces. Springer New York.
Park, J. J., Florence, P., Straub, J., Newcombe, R., and
Lovegrove, S. (2019). Deepsdf: Learning continuous
signed distance functions for shape representation. In
2019 IEEE/CVF Conference on Computer Vision and
Pattern Recognition (CVPR), page 165–174. IEEE.
Ratliff, N., Toussaint, M., and Schaal, S. (2015). Under-
standing the geometry of workspace obstacles in mo-
tion optimization. In 2015 IEEE International Con-
ference on Robotics and Automation (ICRA), page
4202–4209. IEEE.
Sitzmann, V., Martel, J., Bergman, A., Lindell, D., and Wet-
zstein, G. (2020). Implicit neural representations with
periodic activation functions. Advances in neural in-
formation processing systems, 33:7462–7473.
Vlachos, C. and Moustakas, K. (2024). High–fidelity hap-
tic rendering through implicit neural force representa-
tion. In International Conference on Human Haptic
Sensing and Touch Enabled Computer Applications,
pages 493–506. Springer.
Yariv, L., Gu, J., Kasten, Y., and Lipman, Y. (2021). Vol-
ume rendering of neural implicit surfaces. Advances
in Neural Information Processing Systems, 34:4805–
4815.
APPENDIX
During the initial stages of the presented research,
a debate in the literature between signed and un-
signed distance functions and their advantages, made
us wonder whether choosing a signed representation
was the better option. While a signed representation
was more intuitive to us and we felt that the continu-
ous derivatives was a very attractive feature, unsigned
CDFs had been successfully used already. After care-
ful testing, the signed representation appeared as a
more robust and stable option for our use cases. It
is worth mentioning that under certain circumstances
using an unsigned representation might be desirable,
if one is willing to sacrifice stability for marginally
higher peaks. For the sake of completeness, results for
an unsigned C-space Distance Function represented
using the same SIRENs are included in Table 4 for the
batch size of 20000 × 100 and Table 5 for the batch
size of 10 × 100. These may be directly compared to
the results of Table 1 and Table 2, respectively. Ad-
ditionally, Table 6 serves as an expansion to Table 3
that includes training time for the unsigned cases.
Table 4: Additional results of the contact experiment. Calculation of the C-space distances and gradients is performed by an
unsigned variant of our SIREN-based approach, and by a lighter version of the same network. Both networks were trained on
a batch size of 20000 × 100 for 50000 epochs. Results after 1, 2, 3, 4, 5, or 10 iterations are displayed. For Mean Average
Error (MAE) and Root Mean Squared Error (RMSE), lower is better. For Success Rate (SR), higher is better.
Projection
Iterations
SIREN (unsigned) SIREN light (unsigned)
MAE (cm) ↓ RMSE (cm) ↓ SR (%) ↑ MAE (cm) ↓ RMSE (cm) ↓ SR (%) ↑
1 4.58 ± 1.80 8.97 ± 3.49 70.5 ± 9.7 4.94 ± 1.93 9.25 ± 3.64 67.5 ± 10.0
2 1.37 ± 0.46 2.78 ± 1.19 93.9 ± 6.3 1.46 ± 0.45 2.99 ± 1.26 93.0 ± 5.8
3 1.15 ± 0.34 1.68 ± 0.63 95.6 ± 4.6 1.20 ± 0.35 1.75 ± 0.65 95.3 ± 5.5
4 1.23 ± 0.41 1.65 ± 0.66 94.6 ± 6.9 1.22 ± 0.37 1.63 ± 0.60 94.5 ± 5.8
5 1.27 ± 0.45 1.71 ± 0.75 93.9 ± 7.8 1.24 ± 0.40 1.65 ± 0.59 93.8 ± 6.6
10 1.37 ± 0.54 1.86 ± 0.91 92.8 ± 8.1 1.34 ± 0.52 1.75 ± 0.78 92.8 ± 9.2
CSDF-by-SIREN: Learning Signed Distances in the Configuration Space Through Sinusoidal Representation Networks
89