wsdsda/Chinese-English-Product-Review-Dataset
收藏Hugging Face2026-03-27 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/wsdsda/Chinese-English-Product-Review-Dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
---
# 📦 Chinese-English Product Review Dataset (Sentiment Tagged)
A high-quality bilingual dataset containing **1000+ real-world style product reviews** in both Chinese 🇨🇳 and English 🇬🇧, each annotated with a sentiment label: **Positive**, **Neutral**, or **Negative**.
---
## ✨ Features
- **Languages**: Chinese & English
- **Domains**: Smartphone, Headphones, Tablets
- **Fields**: `id`, `category`, `zh_review`, `en_review`, `sentiment`
- **Sentiments**: Positive / Neutral / Negative
- **Dataset Size**: 1000 samples
- **License**: MIT (commercial use allowed)
---
## 📊 Data Sample
| id | category | zh_review | en_review | sentiment |
|--------|------------|---------------------------------------------|------------------------------------------------------------------|-----------|
| id_001 | smartphone | 信号偶尔会跳格子,在地下停车场特别明显。 | The signal drops randomly, especially in underground parking lots. | negative |
| id_002 | headphone | 耳机戴着很舒服,长时间也不会觉得夹耳朵。 | These headphones are super comfy, even after wearing them for hours. | positive |
| id_003 | tablet | 拍照功能出乎意料地好,夜景模式惊艳。 | The camera is surprisingly good, especially in night mode. | positive |
---
## 📚 Use Cases
- Fine-tuning multilingual sentiment classification models
- Training Chinese-English cross-lingual LLMs
- Benchmarking: translation, intent understanding, classification
- Pretraining or adapting sentiment-aware QA systems
---
## 📥 Contents
This dataset includes:
- ✅ `comment_dataset.csv` – Clean, ready-to-use format
- ✅ `raw_01.txt` – Original generation source
- ✅ `sample_preview.png` – Visual preview
- ✅ `LICENSE.txt`, `README.md`
---
## 🛡 License
This dataset is released under the [MIT License](LICENSE.txt).
✔️ **Free for academic and commercial use**
---
## 🔗 Full Pack Download
Need the ZIP version with preview images, raw files and everything ready to use?
👉 [Access the full packaged dataset on Gumroad](https://kaiwen45.gumroad.com/l/hntgcp)
---
## 📢 Access Policy
🆓 The **full dataset is currently open access** on Hugging Face.
🚧 **Future versions** with larger samples, categories or tasks may switch to preview-only here, with complete distributions on Gumroad.
🧠 Your support enables more high-quality, multilingual data products.
Feel free to **explore, cite, and share**!
---
Enjoy and happy training! 🚀
---
许可证:MIT
---
# 📦 中英双语带情感标注产品评论数据集(Chinese-English Product Review Dataset (Sentiment Tagged))
本数据集为高质量双语数据集,包含**1000余篇真实场景产品评论**,涵盖中文与英文两种语言,每条评论均标注有情感标签:**正面(Positive)、中性(Neutral)或负面(Negative)**。
---
## ✨ 数据集特性
- **语言覆盖**:中文与英文
- **应用领域**:智能手机、头戴式耳机、平板电脑
- **数据字段**:`id`、`category`、`zh_review`、`en_review`、`sentiment`
- **情感类别**:正面(Positive)、中性(Neutral)、负面(Negative)
- **数据集规模**:1000条样本
- **许可证**:MIT(允许商业使用)
---
## 📊 数据样例
| id | category | zh_review | en_review | sentiment |
|--------|------------|---------------------------------------------|------------------------------------------------------------------|-----------|
| id_001 | smartphone | 信号偶尔会出现跳格现象,在地下停车场中尤为明显。 | The signal drops randomly, especially in underground parking lots. | negative |
| id_002 | headphone | 这款耳机佩戴体验极佳,长时间佩戴也不会产生夹耳不适感。 | These headphones are super comfy, even after wearing them for hours. | positive |
| id_003 | tablet | 拍照功能表现超出预期,夜景模式效果尤为惊艳。 | The camera is surprisingly good, especially in night mode. | positive |
---
## 📚 应用场景
- 微调多语言情感分类模型
- 训练中英跨语言大语言模型(Large Language Model, LLM)
- 基准测试:可用于翻译、意图理解及分类任务
- 预训练或适配情感感知问答系统
---
## 📥 数据集内容
本数据集包含以下文件:
- ✅ `comment_dataset.csv`:经清洗整理、可直接投入使用的格式文件
- ✅ `raw_01.txt`:原始生成源文件
- ✅ `sample_preview.png`:数据样例可视化预览图
- ✅ `LICENSE.txt`、`README.md`
---
## 🛡 许可证
本数据集采用[MIT许可证(MIT License)](LICENSE.txt)发布。
✔️ **可免费用于学术研究与商业用途**
---
## 🔗 完整打包版下载
需要包含预览图片、原始文件及所有可用资源的ZIP压缩包?👉 [前往Gumroad平台获取完整打包数据集](https://kaiwen45.gumroad.com/l/hntgcp)
---
## 📢 访问政策
🆓 **完整数据集目前可在Hugging Face平台免费获取**。
🚧 **未来版本**(包含更多样本、品类或任务类型)可能仅在此处提供预览,完整版本将仅在Gumroad平台发布。
🧠 您的支持将助力我们产出更多高质量多语言数据产品。
欢迎**探索、引用与分享**!
---
祝您使用愉快,训练顺利!🚀
提供机构:
wsdsda



