Publication Details

Fighting Randomness With Randomness: Mitigating Optimisation Instability of Fine-Tuning Using Ensemble and Noise Regularisation

PECHER Branislav, ČEGIŇ Ján, BELANEC Róbert, SRBA Ivan, ŠIMKO Jakub and BIELIKOVÁ Mária. Fighting Randomness With Randomness: Mitigating Optimisation Instability of Fine-Tuning Using Ensemble and Noise Regularisation. In: Findings of the Association for Computational Linguistics: EMNLP 2024. Miami: Association for Computational Linguistics, 2024, pp. 11005-11044. ISBN 979-8-8917-6168-1.

Czech title

Boj proti náhodnosti náhodnosťou: Zmierňovanie nestability optimalizácie pri doladení pomocou ansámblov a regularizácie šumom

Type

conference paper

Language

english

Authors

Pecher Branislav, Ing. (DCGM FIT BUT)
Čegiň Ján, Ing. (DCGM FIT BUT)
Belanec Róbert, Ing. (DCGM FIT BUT)
Srba Ivan
Šimko Jakub, doc. Ing., Ph.D. (DCGM FIT BUT)
Bieliková Mária, prof. Ing., PhD. (DCGM FIT BUT)

Keywords

NLP in resource-constrained settings, parameter-efficient-training, data-efficient training, data augmentation, fine-tuning, mitigating randomness, ensembling

Abstract

While fine-tuning of pre-trained language models generally helps to overcome the lack of labelled training samples, it also displays model performance instability. This instability mainly originates from randomness in initialisation or data shuffling. To address this, researchers either modify the training process or augment the available samples, which typically results in increased computational costs. We propose a new mitigation strategy, called Delayed Ensemble with Noisy Interpolation (DENI), that leverages the strengths of ensembling, noise regularisation and model interpolation, while retaining computational efficiency. We compare DENI with 9 representative mitigation strategies across 3 models, 4 tuning strategies and 7 text classification datasets. We show that: 1) DENI outperforms the best performing mitigation strategy (Ensemble), while using only a fraction of its cost; 2) the mitigation strategies are beneficial for parameter-efficient fine-tuning (PEFT) methods, outperforming full fine-tuning in specific cases; and 3) combining DENI with data augmentation often leads to even more effective instability mitigation.

Published

2024

Pages

11005-11044

Proceedings

Findings of the Association for Computational Linguistics: EMNLP 2024

Conference

Conference on Empirical Methods in Natural Language Processing, Miami, Florida, US

ISBN

979-8-8917-6168-1

Publisher

Association for Computational Linguistics

Place

Miami, US

DOI

10.18653/v1/2024.findings-emnlp.644

BibTeX

@INPROCEEDINGS{FITPUB13220,
   author = "Branislav Pecher and J\'{a}n \v{C}egi\v{n} and R\'{o}bert Belanec and Ivan Srba and Jakub \v{S}imko and M\'{a}ria Bielikov\'{a}",
   title = "Fighting Randomness With Randomness: Mitigating Optimisation Instability of Fine-Tuning Using Ensemble and Noise Regularisation",
   pages = "11005--11044",
   booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2024",
   year = 2024,
   location = "Miami, US",
   publisher = "Association for Computational Linguistics",
   ISBN = "979-8-8917-6168-1",
   doi = "10.18653/v1/2024.findings-emnlp.644",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/13220"
}