modernlegal/sodnapraksa
收藏Hugging Face2025-04-02 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/modernlegal/sodnapraksa
下载链接
链接失效反馈官方服务:
资源简介:
Sodna Praksa是一个斯洛文尼亚语的数据集,包含的任务类别有文本摘要和文本生成。数据集的大小在10K到100K之间,标签为法律。它包含的列有id、source_db、source_url、title、date、metadata、paragraphs(包括content、kind和order)、html和markdown。数据集分为训练集,共有27664个样本,大小为881,684,096字节。提供了默认配置,训练数据文件路径为data/train-*。
Sodna Praksa is a Slovenian dataset that includes task categories of summarization and text generation. The dataset size is between 10K and 100K, tagged with legal. It contains columns such as id, source_db, source_url, title, date, metadata, paragraphs (including content, kind, and order), html, and markdown. The dataset is split into a training set with a total of 27,664 samples, totaling 881,684,096 bytes. A default configuration is provided, with the training data file path as data/train-*.
提供机构:
modernlegal



