MTCue: Model Checkpoints
收藏DataCite Commons2024-02-12 更新2025-04-16 收录
下载链接:
https://orda.shef.ac.uk/articles/dataset/MTCue_Model_Checkpoints/22956002/1
下载链接
链接失效反馈官方服务:
资源简介:
The model checkpoints contained here are associated with an ACL 2023 paper entitled "MTCue: Learning Zero-Shot Control of Extra-Textual Attributes by Leveraging Unstructured Context in Neural Machine Translation" (citation is to be added when the Proceedings are published).
<br>
Each .zip file here contains a checkpoint to a baseline translation model and MTCue for the language pair in the name (e.g. en.de is the English-to-German language pair). How to use them is described in detail in the associated GitHub repository.
<br>
The models (checkpoints.zip) were trained in PyTorch and via the Fairseq toolkit:
Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., Desmaison, A., Köpf, A., Yang, E., DeVito, Z., Raison, M., Tejani, A., Chilamkurthy, S., Steiner, B., Fang, L., … Chintala, S. (2019). PyTorch: An imperative style, high-performance deep learning library. Advances in Neural Information Processing Systems, 32(NeurIPS).
<br>
Myle Ott, Sergey Edunov, Alexei Baevski, Angela Fan, Sam Gross, Nathan Ng, David Grangier, and Michael Auli. 2019. fairseq: A Fast, Extensible Toolkit for Sequence Modeling. In <em>Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations)</em>, pages 48–53, Minneapolis, Minnesota. Association for Computational Linguistics.
<br>
Full documentation to how to use the resources is included in the [GitHub repository](https://github.com/st-vincent1/MTCue) which contains a link to this ORDA page.
本仓库提供的模型检查点与2023年国际计算语言学协会年会(ACL 2023)论文《MTCue:利用神经机器翻译(Neural Machine Translation)中的非结构化上下文实现零样本(Zero-shot)额外属性控制学习》相关,论文集正式出版后将补充完整引用信息。
此处的每个.zip压缩文件均包含对应语言对的基线翻译模型与MTCue模型的检查点,例如文件名中的en.de代表英语到德语的语言对。具体使用方法详见配套的GitHub仓库。
本仓库的模型检查点(checkpoints.zip)基于PyTorch框架与Fairseq工具包训练完成,相关引用如下:
Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., Desmaison, A., Köpf, A., Yang, E., DeVito, Z., Raison, M., Tejani, A., Chilamkurthy, S., Steiner, B., Fang, L., … Chintala, S. (2019). PyTorch:一款命令式风格的高性能深度学习库。《神经信息处理系统进展》,32(NeurIPS)。
Myle Ott、Sergey Edunov、Alexei Baevski、Angela Fan、Sam Gross、Nathan Ng、David Grangier及Michael Auli. 2019. fairseq:一款快速可扩展的序列建模工具包。收录于《2019年北美计算语言学协会年会(演示论文集)》,第48–53页,美国明尼苏达州明尼阿波利斯市。国际计算语言学协会。
完整的资源使用说明详见配套GitHub仓库[https://github.com/st-vincent1/MTCue],该仓库亦包含本ORDA页面的链接。
提供机构:
The University of Sheffield
创建时间:
2023-06-20



