
davanstrien/aud-lfm2.5-350m-20260428

Source: Hugging Face. Updated 2026-04-28. Indexed 2026-05-03.
Download link:
https://hf-mirror.com/datasets/davanstrien/aud-lfm2.5-350m-20260428
Description:
This dataset was annotated by a large language model (LLM) and produced with the classify-and-augment tool. Labels were assigned by the LiquidAI/LFM2.5-350M model and take one of two sentiment values: positive or negative. The original input has 180 rows and the processed output has 195 rows. Of the output rows, 124 are labeled negative (all real data) and 71 are labeled positive (56 real, 15 synthetic). According to the synthesis audit, the positive class needed 44 samples; 58 candidates were generated, 44 passed validation, and 15 synthetic samples were ultimately kept. The reported acceptance rate of 75.9% matches 44 validated candidates out of 58 generated.
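The counts in the description can be cross-checked with a few lines of arithmetic. The sketch below is only a consistency check on the published numbers, not part of the classify-and-augment tool; the variable names and the `summarize` helper are hypothetical, and reading the acceptance rate as validated candidates divided by generated candidates is an assumption that happens to reproduce the stated 75.9%.

```python
# Counts copied from the dataset description (not read from the dataset itself).
INPUT_ROWS = 180
OUTPUT_ROWS = 195
NEGATIVE = {"real": 124, "synthetic": 0}
POSITIVE = {"real": 56, "synthetic": 15}
CANDIDATES_GENERATED = 58
CANDIDATES_VALIDATED = 44


def summarize():
    """Recompute the totals implied by the per-label counts (hypothetical helper)."""
    real_rows = NEGATIVE["real"] + POSITIVE["real"]
    synthetic_rows = NEGATIVE["synthetic"] + POSITIVE["synthetic"]
    positive_rows = POSITIVE["real"] + POSITIVE["synthetic"]
    # Assumption: acceptance rate = validated candidates / generated candidates.
    acceptance_rate_pct = round(100 * CANDIDATES_VALIDATED / CANDIDATES_GENERATED, 1)
    return {
        "real_rows": real_rows,
        "synthetic_rows": synthetic_rows,
        "output_rows": real_rows + synthetic_rows,
        "positive_rows": positive_rows,
        "acceptance_rate_pct": acceptance_rate_pct,
    }


summary = summarize()
# The real rows should equal the input size, and real + synthetic the output size.
assert summary["real_rows"] == INPUT_ROWS
assert summary["output_rows"] == OUTPUT_ROWS
print(summary)
```

Under these assumptions the real rows sum to the 180 input rows and adding the 15 synthetic positives gives the 195 output rows, so the label distribution is internally consistent.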
Provider:
davanstrien