L3Masking: Multi-task Fine-tuning for Language Models by Leveraging Lessons Learned from Vanilla Models

Yusuke Kimura Takahiro Komamizu Kenji Hatano
雑誌・プロシーディングス名: Proceedings of the Workshop on Customizable NLP in conjunction with the 2024 Conference on Empirical Methods in Natural Language Processing
開催地(都道府県): florida
国名(英語): US
言語: English
ページ: 53--62
出版年: 2024
出版月: 11
出版日: 2024-11-14
DOI: 10.18653/v1/2024.customnlp4u-1.6
📄 PDFを開く 🌐 詳細ページへ
       

概要

When distributional differences exist between pre-training and fine-tuning data, language models (LMs) may perform poorly on downstream tasks. Recent studies have reported that multi-task learning of downstream task and masked language modeling (MLM) task during the fine-tuning phase improves the performance of the downstream task. Typical MLM tasks (e.g., random token masking (RTM)) tend not to care tokens corresponding to the knowledge already acquired during the pre-training phase, therefore LMs may not notice the important clue or not effective to acquire linguistic knowledge of the task or domain. To overcome this limitation, we propose a new masking strategy for MLM task, called L3Masking 1, that leverages lessons (specifically, token-wise likelihood in a context) learned from the vanilla language model to be fine-tuned. L3Masking actively masks tokens with low likelihood on the vanilla model. Experimental evaluations on text classification tasks in different domains confirms a multi-task text classification method with L3Masking performed task adaptation more effectively than that with RTM. These results suggest the usefulness of assigning a preference to the tokens to be learned as the task or domain adaptation.

引用情報

Yusuke Kimura, Takahiro Komamizu, , Kenji Hatano, L3Masking: Multi-task Fine-tuning for Language Models by Leveraging Lessons Learned from Vanilla Models, Proceedings of the Workshop on Customizable NLP in conjunction with the 2024 Conference on Empirical Methods in Natural Language Processing, pp.53--62, 2024-11-14, DOI: 10.18653/v1/2024.customnlp4u-1.6.

Iconic One Theme | Powered by Wordpress