djamina/relatives_psr

Name: djamina/relatives_psr
Creator: djamina
Published: 2024-06-12 23:57:06
License: 暂无描述

Hugging Face2024-06-12 更新2024-06-12 收录

下载链接：

https://hf-mirror.com/datasets/djamina/relatives_psr

下载链接

链接失效反馈

官方服务：

资源简介：

--- language: - fr size_categories: - n<1K task_categories: - token-classification dataset_info: features: - name: id dtype: int32 - name: texts dtype: string - name: tokens sequence: string - name: labels dtype: class_label: names: '0': O '1': DET '2': APPO '3': AMBIGUE - name: psr_tags sequence: class_label: names: '0': O '1': DET '2': APPO '3': AMBIGUE - name: psr_seq_tags sequence: class_label: names: '0': O '1': B-DET '2': I-DET '3': B-APPO '4': I-APPO '5': B-AMBIGUE '6': I-AMBIGUE splits: - name: train num_bytes: 367878 num_examples: 392 - name: validation num_bytes: 100723 num_examples: 99 - name: test num_bytes: 54069 num_examples: 55 download_size: 138886 dataset_size: 522670 configs: - config_name: default data_files: - split: train path: data/train-* - split: validation path: data/validation-* - split: test path: data/test-* --- # Dataset Card for Dataset Name  This dataset card aims to be a base template for new datasets. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/datasetcard_template.md?plain=1). ## Dataset Details ### Dataset Description  - **Curated by:** [More Information Needed] - **Funded by [optional]:** [More Information Needed] - **Shared by [optional]:** [More Information Needed] - **Language(s) (NLP):** [More Information Needed] - **License:** [More Information Needed] ### Dataset Sources [optional]  - **Repository:** [More Information Needed] - **Paper [optional]:** [More Information Needed] - **Demo [optional]:** [More Information Needed] ## Uses  ### Direct Use  [More Information Needed] ### Out-of-Scope Use  [More Information Needed] ## Dataset Structure  [More Information Needed] ## Dataset Creation ### Curation Rationale  [More Information Needed] ### Source Data  #### Data Collection and Processing  [More Information Needed] #### Who are the source data producers?  [More Information Needed] ### Annotations [optional]  #### Annotation process  [More Information Needed] #### Who are the annotators?  [More Information Needed] #### Personal and Sensitive Information  [More Information Needed] ## Bias, Risks, and Limitations  [More Information Needed] ### Recommendations  Users should be made aware of the risks, biases and limitations of the dataset. More information needed for further recommendations. ## Citation [optional]  **BibTeX:** [More Information Needed] **APA:** [More Information Needed] ## Glossary [optional]  [More Information Needed] ## More Information [optional] [More Information Needed] ## Dataset Card Authors [optional] [More Information Needed] ## Dataset Card Contact [More Information Needed]

提供机构：

djamina

原始信息汇总

数据集概述

数据集基本信息

语言: 法语 (fr)
数据集大小: 小于1K (n<1K)
任务类别: 分词分类 (token-classification)

数据集特征

id: 整数型 (int32)
texts: 字符串型 (string)
tokens: 字符串序列
labels: 分类标签，包括 O, DET, APPO, AMBIGUE
psr_tags: 分类标签序列，包括 O, DET, APPO, AMBIGUE
psr_seq_tags: 分类标签序列，包括 O, B-DET, I-DET, B-APPO, I-APPO, B-AMBIGUE, I-AMBIGUE

数据集划分

训练集: 392个样本，367878字节
验证集: 99个样本，100723字节
测试集: 55个样本，54069字节

数据集大小

下载大小: 138886字节
数据集总大小: 522670字节

配置信息

默认配置: 包含训练、验证和测试数据文件的路径配置

5,000+

优质数据集

54 个

任务类型

进入经典数据集