kardosdrur/folketinget-discussions
收藏Hugging Face2023-12-04 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/kardosdrur/folketinget-discussions
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
dataset_info:
features:
- name: comment
dtype: string
- name: response
dtype: string
splits:
- name: train
num_bytes: 7032676.035654362
num_examples: 3814
- name: test
num_bytes: 1759090.9643456375
num_examples: 954
download_size: 4898174
dataset_size: 8791767.0
---
# Discussions in Folketinget
The dataset is based on data from Folketinget in the Danish Gigaword corpus.
Comment-response pairs are purely extracted on the basis of heuristics, and have not been manually evaluated.
The dataset was created for aiding the training of sentence transformer models in the Danish Foundation Models project.
The dataset is currently not recommended for production use.
提供机构:
kardosdrur
原始信息汇总
数据集概述
许可证
- MIT许可证
配置
- 默认配置
- 数据文件
- 训练集:
data/train-* - 测试集:
data/test-*
- 训练集:
- 数据文件
数据集信息
- 特征
- 评论:字符串类型
- 回复:字符串类型
- 分割
- 训练集
- 字节数:7032676.035654362
- 样本数:3814
- 测试集
- 字节数:1759090.9643456375
- 样本数:954
- 训练集
- 下载大小:4898174字节
- 数据集大小:8791767.0字节



