Randout-KD: Finetuning Foundation Models for Text Classification via Random Noise and Knowledge Distillation

Pervaiz Khan; Pervaiz Khan; Andreas Dengel; Andreas Dengel; Sheraz Ahmed

Research.Publish.Connect.

*Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

*Please fill out at least one Field.

Name:
Country:
Subject:

Advanced Search Affiliations Search

If you're looking for an exact phrase use quotation marks on text fields.

Proceedings

Proceedings Search *Please fill out at least one Field. *Value must be an number!

Title:
ISBN:
Year:
Acronym:
Subject:

Advanced Search Proceedings Search

If you're looking for an exact phrase use quotation marks on text fields.

Papers

Papers Search *Please fill out at least one Field.

Title:
Author:
Affiliation:
Subject:

Advanced Search Papers Search

If you're looking for an exact phrase use quotation marks on text fields.

Authors

Authors Search *Please fill out at least one Field.

Name:
Affiliation:
Country:
Conference:
Subject:

Advanced Search Authors Search

If you're looking for an exact phrase use quotation marks on text fields.

Advanced Search

Paper

Randout-KD: Finetuning Foundation Models for Text Classification via Random Noise and Knowledge Distillation

Topics: Deep Learning; Natural Language Processing

In Proceedings of the 15th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART, 457-465, 2023 , Lisbon, Portugal

Authors: Pervaiz Khan ^{1

;

2} ; Andreas Dengel ^{1

;

2} and Sheraz Ahmed ¹

Affiliations: ¹ German Research Center for Artificial Intelligence (DFKI), 67663 Kaiserslautern, Germany ; ² Department of Computer Science, TU Kaiserslautern, 67663 Kaiserslautern, Germany

Keyword(s): Random Noise, Knowledge Distillation, Text Classification.

Abstract: Finetuning foundation models effectively on downstream tasks is ongoing research. In this paper, we present a finetuning method “Randout-KD” that enhances the performance of a student model for text classification. We specifically propose a noise-injecting method in the representations of the transformer model during its finetuning that works as regularization. Moreover, we integrate the knowledge distillation and noise injection methods and show that combining these approaches boosts the baseline model performance. We evaluate the proposed method on two datasets namely “CODA-19” and “RHMD” using PubMedBERT and RoBERTa Large as teacher models, and data2vec as a student model. Results show that the proposed approach improves the accuracy up to 1.2% compared to the baseline methods.

CC BY-NC-ND 4.0

Guest: Register as new SciTePress user now for free.

SciTePress user: please login.

My Papers

You are not signed in, therefore limits apply to your IP address 216.73.216.157

In the current month:

Recent papers: 100 available of 100 total

2⁺ years older papers: 200 available of 200 total

Paper citation in several formats:

Khan, P., Dengel, A., Ahmed and S. (2023). Randout-KD: Finetuning Foundation Models for Text Classification via Random Noise and Knowledge Distillation. In Proceedings of the 15th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART; ISBN 978-989-758-623-1; ISSN 2184-433X, SciTePress, pages 457-465. DOI: 10.5220/0011687800003393

@conference{icaart23,
author={Pervaiz Khan and Andreas Dengel and Sheraz Ahmed},
title={Randout-KD: Finetuning Foundation Models for Text Classification via Random Noise and Knowledge Distillation},
booktitle={Proceedings of the 15th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART},
year={2023},
pages={457-465},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0011687800003393},
isbn={978-989-758-623-1},
issn={2184-433X},
}

TY - CONF

JO - Proceedings of the 15th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART
TI - Randout-KD: Finetuning Foundation Models for Text Classification via Random Noise and Knowledge Distillation
SN - 978-989-758-623-1
IS - 2184-433X
AU - Khan, P.
AU - Dengel, A.
AU - Ahmed, S.
PY - 2023
SP - 457
EP - 465
DO - 10.5220/0011687800003393
PB - SciTePress