
rntc/biomed-fr-v3-enriched-softmin-tres_agressif

Source: Hugging Face. Updated 2025-10-06; indexed 2025-10-25.
Download link:
https://hf-mirror.com/datasets/rntc/biomed-fr-v3-enriched-softmin-tres_agressif
Description:
This is a quality-upsampled version of the rntc/biomed-fr-v3-enriched dataset, produced with soft-min bottleneck sampling. It covers the medical and biomedical domain, and the text is in French. Preprocessing consists of three steps: soft-min calculation, weight computation, and resampling. Four quality scores are used (educational score, content richness, terminology precision, and writing quality), and samples with missing scores are excluded. Resampling leaves the dataset size unchanged. This variant uses the preset configuration named tres_agressif.
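
The three preprocessing steps can be illustrated with a minimal sketch. The listing does not give the exact soft-min formula, the weighting function, or the parameters behind the tres_agressif preset, so everything below is an assumption for illustration: the log-mean-exp form of the soft minimum, the exponential weighting, and the names `soft_min`, `resample_by_quality`, `tau`, and `sharpness` are hypothetical and not taken from the dataset's own code. The sketch also assumes "size unchanged" means one draw per sample that has all four scores.

```python
import numpy as np


def soft_min(scores, tau=0.5):
    """Smooth approximation of the row-wise minimum.

    Assumed form: -tau * log(mean(exp(-s / tau))). It approaches the true
    minimum as tau -> 0 and the mean as tau grows, so a sample's weakest
    score (its "bottleneck") dominates the result.
    """
    scores = np.asarray(scores, dtype=float)
    z = -scores / tau
    z_max = z.max(axis=1, keepdims=True)  # subtract the max for numerical stability
    log_mean_exp = z_max[:, 0] + np.log(np.exp(z - z_max).mean(axis=1))
    return -tau * log_mean_exp


def resample_by_quality(scores, tau=0.5, sharpness=2.0, seed=0):
    """Return row indices drawn with replacement, favouring high soft-min rows.

    Rows with any missing (NaN) score are excluded before sampling.
    `sharpness` is a hypothetical knob: larger values upsample
    high-quality rows more aggressively.
    """
    scores = np.asarray(scores, dtype=float)
    keep = np.flatnonzero(~np.isnan(scores).any(axis=1))
    bottleneck = soft_min(scores[keep], tau)
    logits = sharpness * bottleneck
    weights = np.exp(logits - logits.max())  # assumed exponential weighting
    probs = weights / weights.sum()
    rng = np.random.default_rng(seed)
    # Assumption: output has as many rows as there are fully scored samples.
    return rng.choice(keep, size=len(keep), replace=True, p=probs)


if __name__ == "__main__":
    # Columns: educational score, content richness, terminology precision, writing quality
    demo = np.array([
        [4.0, 4.0, 4.0, 4.0],
        [5.0, 5.0, 5.0, 1.0],     # one weak score drags the soft-min down
        [3.0, np.nan, 4.0, 4.0],  # missing score: excluded
    ])
    print(soft_min(demo[:2]))
    print(resample_by_quality(demo))
```

Because the soft minimum is dominated by the lowest of the four scores, a sample must do reasonably well on every criterion to be upsampled; a single weak score is enough to reduce its sampling probability.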
Provider:
rntc