five

wsdsda/Chinese-English-Product-Review-Dataset

收藏
Hugging Face2026-03-27 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/wsdsda/Chinese-English-Product-Review-Dataset
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit --- # 📦 Chinese-English Product Review Dataset (Sentiment Tagged) A high-quality bilingual dataset containing **1000+ real-world style product reviews** in both Chinese 🇨🇳 and English 🇬🇧, each annotated with a sentiment label: **Positive**, **Neutral**, or **Negative**. --- ## ✨ Features - **Languages**: Chinese & English - **Domains**: Smartphone, Headphones, Tablets - **Fields**: `id`, `category`, `zh_review`, `en_review`, `sentiment` - **Sentiments**: Positive / Neutral / Negative - **Dataset Size**: 1000 samples - **License**: MIT (commercial use allowed) --- ## 📊 Data Sample | id | category | zh_review | en_review | sentiment | |--------|------------|---------------------------------------------|------------------------------------------------------------------|-----------| | id_001 | smartphone | 信号偶尔会跳格子,在地下停车场特别明显。 | The signal drops randomly, especially in underground parking lots. | negative | | id_002 | headphone | 耳机戴着很舒服,长时间也不会觉得夹耳朵。 | These headphones are super comfy, even after wearing them for hours. | positive | | id_003 | tablet | 拍照功能出乎意料地好,夜景模式惊艳。 | The camera is surprisingly good, especially in night mode. | positive | --- ## 📚 Use Cases - Fine-tuning multilingual sentiment classification models - Training Chinese-English cross-lingual LLMs - Benchmarking: translation, intent understanding, classification - Pretraining or adapting sentiment-aware QA systems --- ## 📥 Contents This dataset includes: - ✅ `comment_dataset.csv` – Clean, ready-to-use format - ✅ `raw_01.txt` – Original generation source - ✅ `sample_preview.png` – Visual preview - ✅ `LICENSE.txt`, `README.md` --- ## 🛡 License This dataset is released under the [MIT License](LICENSE.txt). ✔️ **Free for academic and commercial use** --- ## 🔗 Full Pack Download Need the ZIP version with preview images, raw files and everything ready to use? 👉 [Access the full packaged dataset on Gumroad](https://kaiwen45.gumroad.com/l/hntgcp) --- ## 📢 Access Policy 🆓 The **full dataset is currently open access** on Hugging Face. 🚧 **Future versions** with larger samples, categories or tasks may switch to preview-only here, with complete distributions on Gumroad. 🧠 Your support enables more high-quality, multilingual data products. Feel free to **explore, cite, and share**! --- Enjoy and happy training! 🚀

--- 许可证:MIT --- # 📦 中英双语带情感标注产品评论数据集(Chinese-English Product Review Dataset (Sentiment Tagged)) 本数据集为高质量双语数据集,包含**1000余篇真实场景产品评论**,涵盖中文与英文两种语言,每条评论均标注有情感标签:**正面(Positive)、中性(Neutral)或负面(Negative)**。 --- ## ✨ 数据集特性 - **语言覆盖**:中文与英文 - **应用领域**:智能手机、头戴式耳机、平板电脑 - **数据字段**:`id`、`category`、`zh_review`、`en_review`、`sentiment` - **情感类别**:正面(Positive)、中性(Neutral)、负面(Negative) - **数据集规模**:1000条样本 - **许可证**:MIT(允许商业使用) --- ## 📊 数据样例 | id | category | zh_review | en_review | sentiment | |--------|------------|---------------------------------------------|------------------------------------------------------------------|-----------| | id_001 | smartphone | 信号偶尔会出现跳格现象,在地下停车场中尤为明显。 | The signal drops randomly, especially in underground parking lots. | negative | | id_002 | headphone | 这款耳机佩戴体验极佳,长时间佩戴也不会产生夹耳不适感。 | These headphones are super comfy, even after wearing them for hours. | positive | | id_003 | tablet | 拍照功能表现超出预期,夜景模式效果尤为惊艳。 | The camera is surprisingly good, especially in night mode. | positive | --- ## 📚 应用场景 - 微调多语言情感分类模型 - 训练中英跨语言大语言模型(Large Language Model, LLM) - 基准测试:可用于翻译、意图理解及分类任务 - 预训练或适配情感感知问答系统 --- ## 📥 数据集内容 本数据集包含以下文件: - ✅ `comment_dataset.csv`:经清洗整理、可直接投入使用的格式文件 - ✅ `raw_01.txt`:原始生成源文件 - ✅ `sample_preview.png`:数据样例可视化预览图 - ✅ `LICENSE.txt`、`README.md` --- ## 🛡 许可证 本数据集采用[MIT许可证(MIT License)](LICENSE.txt)发布。 ✔️ **可免费用于学术研究与商业用途** --- ## 🔗 完整打包版下载 需要包含预览图片、原始文件及所有可用资源的ZIP压缩包?👉 [前往Gumroad平台获取完整打包数据集](https://kaiwen45.gumroad.com/l/hntgcp) --- ## 📢 访问政策 🆓 **完整数据集目前可在Hugging Face平台免费获取**。 🚧 **未来版本**(包含更多样本、品类或任务类型)可能仅在此处提供预览,完整版本将仅在Gumroad平台发布。 🧠 您的支持将助力我们产出更多高质量多语言数据产品。 欢迎**探索、引用与分享**! --- 祝您使用愉快,训练顺利!🚀
提供机构:
wsdsda
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作