loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Authors: Muhammad Asif Suryani ; Saurav Karmakar ; Brigitte Mathiak and Philipp Mayr

Affiliation: Knowledge Technologies for the Social Sciences GESIS – Leibniz-Institut für Sozialwissenschaften, Köln, Germany

Keyword(s): Hugging Face, Metadata Exploration, Metadata Collection, Large Language Models, Research Data Management, Multidisciplinary Research, Dataset.

Abstract: Metadata features generally exhibit valuable meta information which may facilitate researchers in their tasks. Several studies incorporated scholarly metadata by highlighting its usefulness in certain granularity to assist numerous research tasks. The emergence of Large Language Models (LLMs) has brought an exciting change in the field of Artificial Intelligence (AI) and Machine Learning (ML), which is equally supported by Open Science initiative and FAIR principles. One of the prominent platforms, which ensures the availability of these models to research communities is the Hugging Face. It provides democratized access to models while experiencing rapid growth as a repository. As of March 2025, Hugging Face hosts more than 1.4 million models, which were 0.5 million approximately in February 2024. In this dataset paper, we provide information on a large fraction of Hugging Face model cards. Our dataset comprises of a wide range of metadata features which showcase the meta information about each model card. In this work, we aim to provide democratized access to a collection of diverse metadata features from Hugging Face model cards and present an insightful overview of these cards by leveraging the metadata to support the research communities by facilitating model adoption. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.108

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Suryani, M. A., Karmakar, S., Mathiak, B., Mayr and P. (2025). Model Card Metadata Collection from Hugging Face to Foster Multidisciplinary AI Research: A Dataset. In Proceedings of the 14th International Conference on Data Science, Technology and Applications - Volume 1: DATA; ISBN 978-989-758-758-0; ISSN 2184-285X, SciTePress, pages 583-590. DOI: 10.5220/0013571800003967

@conference{data25,
author={Muhammad Asif Suryani and Saurav Karmakar and Brigitte Mathiak and Philipp Mayr},
title={Model Card Metadata Collection from Hugging Face to Foster Multidisciplinary AI Research: A Dataset},
booktitle={Proceedings of the 14th International Conference on Data Science, Technology and Applications - Volume 1: DATA},
year={2025},
pages={583-590},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0013571800003967},
isbn={978-989-758-758-0},
issn={2184-285X},
}

TY - CONF

JO - Proceedings of the 14th International Conference on Data Science, Technology and Applications - Volume 1: DATA
TI - Model Card Metadata Collection from Hugging Face to Foster Multidisciplinary AI Research: A Dataset
SN - 978-989-758-758-0
IS - 2184-285X
AU - Suryani, M.
AU - Karmakar, S.
AU - Mathiak, B.
AU - Mayr, P.
PY - 2025
SP - 583
EP - 590
DO - 10.5220/0013571800003967
PB - SciTePress