Sohy/de_en

Name: Sohy/de_en
Creator: Sohy
Published: 2024-05-08 13:43:20
License: 暂无描述

Hugging Face2024-05-08 更新2024-06-12 收录

下载链接：

https://hf-mirror.com/datasets/Sohy/de_en

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: apache-2.0 task_categories: - text2text-generation language: - de - en configs: - config_name: default data_files: - split: train path: de_en_train.jsonl - split: test path: de_en_tst.jsonl - split: validation path: de_en_dev.jsonl --- # Dataset Card for Dataset Name  This dataset card aims to be a base template for new datasets. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/datasetcard_template.md?plain=1). ## Dataset Details ### Dataset Description  - **Curated by:** [More Information Needed] - **Funded by [optional]:** [More Information Needed] - **Shared by [optional]:** [More Information Needed] - **Language(s) (NLP):** [More Information Needed] - **License:** [More Information Needed] ### Dataset Sources [optional]  - **Repository:** [More Information Needed] - **Paper [optional]:** [More Information Needed] - **Demo [optional]:** [More Information Needed] ## Uses  ### Direct Use  [More Information Needed] ### Out-of-Scope Use  [More Information Needed] ## Dataset Structure  [More Information Needed] ## Dataset Creation ### Curation Rationale  [More Information Needed] ### Source Data  #### Data Collection and Processing  [More Information Needed] #### Who are the source data producers?  [More Information Needed] ### Annotations [optional]  #### Annotation process  [More Information Needed] #### Who are the annotators?  [More Information Needed] #### Personal and Sensitive Information  [More Information Needed] ## Bias, Risks, and Limitations  [More Information Needed] ### Recommendations  Users should be made aware of the risks, biases and limitations of the dataset. More information needed for further recommendations. ## Citation [optional]  **BibTeX:** [More Information Needed] **APA:** [More Information Needed] ## Glossary [optional]  [More Information Needed] ## More Information [optional] [More Information Needed] ## Dataset Card Authors [optional] [More Information Needed] ## Dataset Card Contact [More Information Needed]

--- 许可证：Apache-2.0 任务类别： - 文本到文本生成（text2text-generation）语言： - 德语（de） - 英语（en）配置项： - 配置名称：默认（default）数据文件： - 拆分方式：训练集（train），文件路径：de_en_train.jsonl - 拆分方式：测试集（test），文件路径：de_en_tst.jsonl - 拆分方式：验证集（validation），文件路径：de_en_dev.jsonl --- # 数据集卡片模板  本数据集卡片旨在作为新建数据集的基础模板。本卡片基于[此原始模板](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/datasetcard_template.md?plain=1)生成。 ## 数据集详情 ### 数据集描述  - **整理者：** [需补充更多信息] - **资助方（可选）：** [需补充更多信息] - **共享方（可选）：** [需补充更多信息] - **自然语言处理所用语言：** [需补充更多信息] - **许可证：** [需补充更多信息] ### 数据集来源（可选）  - **代码仓库：** [需补充更多信息] - **相关论文（可选）：** [需补充更多信息] - **演示项目（可选）：** [需补充更多信息] ## 数据集用途  ### 直接使用场景  [需补充更多信息] ### 不适宜使用场景  [需补充更多信息] ## 数据集结构  [需补充更多信息] ## 数据集构建 ### 整理依据  [需补充更多信息] ### 源数据  #### 数据收集与处理流程  [需补充更多信息] #### 源数据生产者是谁？  [需补充更多信息] ### 标注信息（可选）  #### 标注流程  [需补充更多信息] #### 标注者是谁？  [需补充更多信息] #### 个人与敏感信息  [需补充更多信息] ## 偏差、风险与局限性  ### 相关建议  用户应知晓该数据集存在的风险、偏差与局限性。如需进一步建议，需补充更多信息。 ## 引用信息（可选）  **BibTeX：** [需补充更多信息] **APA：** [需补充更多信息] ## 术语表（可选）  [需补充更多信息] ## 更多信息（可选） [需补充更多信息] ## 数据集卡片作者（可选） [需补充更多信息] ## 数据集卡片联系方式 [需补充更多信息]

提供机构：

Sohy

原始信息汇总

数据集概述

基本信息

许可证: Apache-2.0
任务类别: 文本到文本生成
语言: 德语, 英语

数据配置

配置名称: default
数据文件:
- 训练集: de_en_train.jsonl
- 测试集: de_en_tst.jsonl
- 验证集: de_en_dev.jsonl

5,000+

优质数据集

54 个

任务类型

进入经典数据集