Sentiment Analysis Dataset

Name: Sentiment Analysis Dataset
Creator: Datadome LLC
Published: 2026-01-08 16:50:32
License: 暂无描述

Snowflake2026-01-08 更新2026-01-11 收录

下载链接：

https://app.snowflake.com/marketplace/listing/GZU6Z2CS62I

下载链接

链接失效反馈

官方服务：

资源简介：

The **Sentiment Analysis Dataset** is a high-quality corpus of realistic, labeled text samples designed to support training and evaluation of sentiment classification models. The dataset consists of natural-language sentences that reflect real-world customer feedback and user opinions, each annotated with a discrete sentiment label (positive, neutral, negative) and a continuous sentiment score to capture emotional intensity. All records are synthetically generated to resemble authentic user-generated text while remaining privacy-safe and suitable for commercial use. The dataset is delivered as native Snowflake tables, enabling immediate integration into analytics, machine learning, and NLP workflows.<br/><br/>**Scope** Spans multiple common feedback sources—including product reviews, social interactions, customer support conversations, and product feedback—to reflect how sentiment is expressed across different communication channels and contexts. **Scale** Tens of thousands of labeled text examples, with balanced sentiment distribution, provide sufficient coverage for model training, experimentation, benchmarking, and proof-of-concept development. Additional volumes are available for advanced use cases via a private offer. **Value** - **High-quality sentiment labels** with both categorical polarity and numeric sentiment scores for classification and regression tasks. - **Realistic language patterns** that closely resemble real customer feedback while remaining fully synthetic and privacy-safe. - **Built-in metadata** (source domain) to enable segmentation, filtering, and domain-specific analysis. - **Zero-friction integration** via Snowflake tables—query, join, and train models directly without additional preprocessing. Use this dataset to accelerate the development of sentiment-aware NLP systems—whether you’re fine-tuning transformer models, building customer feedback analytics, developing conversational AI, or validating sentiment pipelines in a secure, production-friendly environment.

提供机构：

Datadome LLC

创建时间：

2026-01-08

原始信息汇总

Sentiment Analysis Dataset 数据集概述

数据集基本信息

数据集名称: Sentiment Analysis Dataset
提供商: Datadome LLC
数据集描述: 这是一个高质量、带标签的文本样本语料库，旨在支持情感分类模型的训练和评估。数据集包含反映真实世界客户反馈和用户意见的自然语言句子，每个句子都标注了离散的情感标签（积极、中性、消极）和连续的情感分数以捕捉情感强度。所有记录均为合成生成，模拟真实用户生成文本，同时保持隐私安全并适合商业用途。数据集以原生 Snowflake 表的形式交付，可立即集成到分析、机器学习和 NLP 工作流中。

数据范围与规模

范围: 涵盖多种常见的反馈来源，包括产品评论、社交互动、客户支持对话和产品反馈，以反映不同沟通渠道和背景下情感的表达方式。
规模: 包含数万个带标签的文本示例，情感分布平衡，为模型训练、实验、基准测试和概念验证开发提供了足够的覆盖范围。可通过私人报价获取更多数量以用于高级用例。

数据价值

高质量情感标签: 包含分类极性（积极/中性/消极）和数值情感分数，适用于分类和回归任务。
逼真的语言模式: 与真实客户反馈高度相似，同时完全合成且隐私安全。
内置元数据: 包含来源领域信息，支持细分、过滤和特定领域分析。
零摩擦集成: 通过 Snowflake 表实现，可直接查询、连接和训练模型，无需额外预处理。

业务需求

情感分析: 帮助组织从大量非结构化文本中提取清晰的情感信号（积极、中性、消极），识别满意度趋势和新兴问题。涵盖产品评论、社交互动、客户支持对话和产品反馈，支持跨渠道的情感分析。预标记的示例和数值情感分数可跳过手动标注，加速模型训练和评估。

数据字典

表名: SENTIMENT_ANALYSIS_DATASET

字段名	数据类型	描述
SENTIMENT	Varchar	情感标签
SENTIMENT_SCORE	Float	情感分数
SOURCE	Varchar	来源
TEXT	Varchar	文本内容

使用示例

预览查询: SELECT * from SENTIMENT_ANALYSIS_DATASET limit 10;

定价方案

Basic Plan: $1/月
Starter: $1/月

试用信息

试用类型: 限时功能试用
试用时长: 30天

数据集技术详情

更新频率: 每月
时间覆盖范围: 最近1个月（按月）
地理覆盖范围: 美国（所有州），以及51个县
云区域可用性 (AWS): US East (N. Virginia)
法律条款: Standard

分类标签

AI & ML
Sentiment Analysis

提供商联系方式

销售: info@datadome.io
支持: support@datadome.io

提供商简介

Datadome.io 是一个 API 优先的自然语言处理平台，通过简单的 REST 端点提供情感分析、文本相似性评分、命名实体提取和主题分类功能。它使企业和开发人员能够快速“将文本转化为洞察”，适用于大规模评论分析、相关内容发现、自动化元数据标记或定制 NLP 应用等用例。

5,000+

优质数据集

54 个

任务类型

进入经典数据集