Vectorization of Bias in Machine Learning Algorithms

Sophie Bekerman; Eric Chen; Lily Lin; George Monta Nez

doi:10.5220/0010845000003116

Vectorization of Bias in Machine Learning Algorithms

Sophie Bekerman, Eric Chen, Lily Lin, George Monta Nez

2022

Abstract

We develop a method to measure and compare the inductive bias of classifications algorithms by vectorizing aspects of their behavior. We compute a vectorized representation of the algorithm’s bias, known as the inductive orientation vector, for a set of algorithms. This vector captures the algorithm’s probability distribution over all possible hypotheses for a classification task. We cluster and plot the algorithms’ inductive orientation vectors to visually characterize their relationships. As algorithm behavior is influenced by the training dataset, we construct a Benchmark Data Suite (BDS) matrix that considers algorithms’ pairwise distances across many datasets, allowing for more robust comparisons. We identify many relationships supported by existing literature, such as those between k-Nearest Neighbor and Random Forests and among tree-based algorithms, and evaluate the strength of those known connections, showing the potential of this geometric approach to investigate black-box machine learning algorithms.

Download

Paper Citation

in Harvard Style

Bekerman S., Chen E., Lin L. and Monta Nez G. (2022). Vectorization of Bias in Machine Learning Algorithms. In Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART, ISBN 978-989-758-547-0, pages 354-365. DOI: 10.5220/0010845000003116

in Bibtex Style

@conference{icaart22,
author={Sophie Bekerman and Eric Chen and Lily Lin and George Monta Nez},
title={Vectorization of Bias in Machine Learning Algorithms},
booktitle={Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,},
year={2022},
pages={354-365},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0010845000003116},
isbn={978-989-758-547-0},
}

in EndNote Style

TY - CONF

JO - Proceedings of the 14th International Conference on Agents and Artificial Intelligence - Volume 2: ICAART,
TI - Vectorization of Bias in Machine Learning Algorithms
SN - 978-989-758-547-0
AU - Bekerman S.
AU - Chen E.
AU - Lin L.
AU - Monta Nez G.
PY - 2022
SP - 354
EP - 365
DO - 10.5220/0010845000003116