Replication Data for: On Finetuning Large Language Models
收藏DataONE2023-10-12 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:a56c7be92673f08870bd5018f62f61a4ccaf45c3e29aba31dd5b0c9a9e38f177
下载链接
链接失效反馈官方服务:
资源简介:
A recent paper by Häffner et al. 2023 introduces an interpretable deep learning approach for domain specific dictionary creation, where it is claimed that the dictionary-based approach outperforms finetuned language models in predictive accuracy while retaining interpretability. We show that the dictionary-based approach's reported superiority over large language models, BERT specifically, is due to the fact that most of the parameters in the language models are excluded from finetuning. In this letter, we first discuss the architecture of BERT models, then explain the limitations of finetuning only the top classification layer, and lastly we report results where finetuned language models outperform the newly proposed dictionary-based approach by 27% in terms of R2 and 46% in terms of mean squared error once we allow these parameters to learn during finetuning. Researchers interested in large language models, text classification, and text regression should find our results useful. Our code and data are publicly available. Code to replicate \"On Finetuning Large Language Models.\"
创建时间:
2023-12-16



