five

MTCue: Model Checkpoints

收藏
DataCite Commons2024-02-12 更新2025-04-16 收录
下载链接:
https://orda.shef.ac.uk/articles/dataset/MTCue_Model_Checkpoints/22956002/1
下载链接
链接失效反馈
官方服务:
资源简介:
The model checkpoints contained here are associated with an ACL 2023 paper entitled "MTCue: Learning Zero-Shot Control of Extra-Textual Attributes by Leveraging Unstructured Context in Neural Machine Translation" (citation is to be added when the Proceedings are published). <br> Each .zip file here contains a checkpoint to a baseline translation model and MTCue for the language pair in the name (e.g. en.de is the English-to-German language pair). How to use them is described in detail in the associated GitHub repository. <br> The models (checkpoints.zip) were trained in PyTorch and via the Fairseq toolkit: Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G.,  Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., Desmaison, A., Köpf,  A., Yang, E., DeVito, Z., Raison, M., Tejani, A., Chilamkurthy, S.,  Steiner, B., Fang, L., … Chintala, S. (2019). PyTorch: An imperative  style, high-performance deep learning library. Advances in Neural  Information Processing Systems, 32(NeurIPS). <br> Myle Ott, Sergey Edunov, Alexei Baevski, Angela Fan, Sam Gross, Nathan Ng, David Grangier, and Michael Auli. 2019. fairseq: A Fast, Extensible Toolkit for Sequence Modeling. In <em>Proceedings  of the 2019 Conference of the North American Chapter of the Association  for Computational Linguistics (Demonstrations)</em>, pages 48–53, Minneapolis, Minnesota. Association for Computational Linguistics. <br> Full documentation to how to use the resources is included in the [GitHub repository](https://github.com/st-vincent1/MTCue) which contains a link to this ORDA page.

本仓库提供的模型检查点与2023年国际计算语言学协会年会(ACL 2023)论文《MTCue:利用神经机器翻译(Neural Machine Translation)中的非结构化上下文实现零样本(Zero-shot)额外属性控制学习》相关,论文集正式出版后将补充完整引用信息。 此处的每个.zip压缩文件均包含对应语言对的基线翻译模型与MTCue模型的检查点,例如文件名中的en.de代表英语到德语的语言对。具体使用方法详见配套的GitHub仓库。 本仓库的模型检查点(checkpoints.zip)基于PyTorch框架与Fairseq工具包训练完成,相关引用如下: Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., Desmaison, A., Köpf, A., Yang, E., DeVito, Z., Raison, M., Tejani, A., Chilamkurthy, S., Steiner, B., Fang, L., … Chintala, S. (2019). PyTorch:一款命令式风格的高性能深度学习库。《神经信息处理系统进展》,32(NeurIPS)。 Myle Ott、Sergey Edunov、Alexei Baevski、Angela Fan、Sam Gross、Nathan Ng、David Grangier及Michael Auli. 2019. fairseq:一款快速可扩展的序列建模工具包。收录于《2019年北美计算语言学协会年会(演示论文集)》,第48–53页,美国明尼苏达州明尼阿波利斯市。国际计算语言学协会。 完整的资源使用说明详见配套GitHub仓库[https://github.com/st-vincent1/MTCue],该仓库亦包含本ORDA页面的链接。
提供机构:
The University of Sheffield
创建时间:
2023-06-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作