Model Card Metadata Collection from Hugging Face to Foster Multidisciplinary AI Research: A Dataset

Muhammad Asif Suryani, Saurav Karmakar, Brigitte Mathiak, Philipp Mayr

2025

Abstract

Metadata features carry valuable information that can support researchers in their tasks. Several studies have incorporated scholarly metadata, highlighting its usefulness at a certain granularity for assisting numerous research tasks. The emergence of Large Language Models (LLMs) has brought an exciting change to the fields of Artificial Intelligence (AI) and Machine Learning (ML), a change equally supported by the Open Science initiative and the FAIR principles. One of the prominent platforms that makes these models available to research communities is Hugging Face. It provides democratized access to models while experiencing rapid growth as a repository. As of March 2025, Hugging Face hosts more than 1.4 million models, up from approximately 0.5 million in February 2024. In this dataset paper, we provide information on a large fraction of Hugging Face model cards. Our dataset comprises a wide range of metadata features that describe each model card. In this work, we aim to provide democratized access to a collection of diverse metadata features from Hugging Face model cards and to present an insightful overview of these cards, leveraging the metadata to support research communities by facilitating model adoption.

Paper Citation


in Harvard Style

Suryani M., Karmakar S., Mathiak B. and Mayr P. (2025). Model Card Metadata Collection from Hugging Face to Foster Multidisciplinary AI Research: A Dataset. In Proceedings of the 14th International Conference on Data Science, Technology and Applications - Volume 1: DATA; ISBN 978-989-758-758-0, SciTePress, pages 583-590. DOI: 10.5220/0013571800003967


in Bibtex Style

@conference{data25,
author={Muhammad Asif Suryani and Saurav Karmakar and Brigitte Mathiak and Philipp Mayr},
title={Model Card Metadata Collection from Hugging Face to Foster Multidisciplinary AI Research: A Dataset},
booktitle={Proceedings of the 14th International Conference on Data Science, Technology and Applications - Volume 1: DATA},
year={2025},
pages={583-590},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013571800003967},
isbn={978-989-758-758-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 14th International Conference on Data Science, Technology and Applications - Volume 1: DATA
TI - Model Card Metadata Collection from Hugging Face to Foster Multidisciplinary AI Research: A Dataset
SN - 978-989-758-758-0
AU - Suryani M.
AU - Karmakar S.
AU - Mathiak B.
AU - Mayr P.
PY - 2025
SP - 583
EP - 590
DO - 10.5220/0013571800003967
PB - SciTePress