neuralspace/NSME-COM
收藏Hugging Face2022-09-13 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/neuralspace/NSME-COM
下载链接
链接失效反馈官方服务:
资源简介:
---
annotations_creators:
- other
language_creators:
- other
language:
- en
expert-generated license:
- cc-by-nc-sa-4.0
multilinguality:
- monolingual
size_categories:
- n<1K
source_datasets:
- original
task_categories:
- question-answering
- text-retrieval
- text2text-generation
- other
- translation
- conversational
task_ids:
- extractive-qa
- closed-domain-qa
- utterance-retrieval
- document-retrieval
- closed-domain-qa
- open-book-qa
- closed-book-qa
paperswithcode_id: acronym-identification
pretty_name: Massive E-commerce Dataset for Retail and Insurance domain.
train-eval-index:
- config: nsds
task: token-classification
task_id: entity_extraction
splits:
train_split: train
eval_split: test
col_mapping:
sentence: text
label: target
metrics:
- type: nsme-com
name: NSME-COM
config:
nsds
tags:
- chatbots
- e-commerce
- retail
- insurance
- consumer
- consumer goods
configs:
- nsds
---
# Dataset Card for NSME-COM
## Table of Contents
- [Dataset Description](#dataset-description)
- [Dataset Summary](#dataset-summary)
- [Supported Tasks](#supported-tasks)
- [Languages](#languages)
- [Dataset Structure](#dataset-structure)
- [Data Instances](#data-instances)
- [Data Fields](#data-fields)
- [Data Splits](#data-splits)
- [Dataset Creation](#dataset-creation)
- [Curation Rationale](#curation-rationale)
- [Source Data](#source-data)
- [Annotations](#annotations)
- [Personal and Sensitive Information](#personal-and-sensitive-information)
- [Considerations for Using the Data](#considerations-for-using-the-data)
- [Social Impact of Dataset](#social-impact-of-dataset)
- [Discussion of Biases](#discussion-of-biases)
- [Other Known Limitations](#other-known-limitations)
- [Additional Information](#additional-information)
- [Dataset Curators](#dataset-curators)
- [Licensing Information](#licensing-information)
- [Citation Information](#citation-information)
- [Contributions](#contributions)
### Dataset Description
- **Homepage**: [NeuralSpace Homepage](https://huggingface.co/neuralspace)
- **Repository:** [NSME-COM Dataset](https://huggingface.co/datasets/neuralspace/NSME-COM)
- **Point of Contact:** [Ankur Saxena](mailto:ankursaxena@neuralspace.ai)
- **Point of Contact:** [Ayushman Dash](mailto:ayushman@neuralspace.ai)
- **Size of downloaded dataset files:** 10.86 KB
### Dataset Summary
In this digital age, the E-Commerce industry has increasingly become a vital component of business strategy and development. To streamline, enhance and take the customer experience to the highest level, NLP can help create surprisingly massive value in the E-Commerce industry.
One of the most popular NLP use-cases is a chatbot. With a chatbot you can automate your customer engagement saving yourself time and other resources. Offering an enhanced and simplified customer experience you can increase your sales and also offer your website visitors personalized recommendations.
The NSME-COM dataset (NeuralSpace Massive E-Comm) is a manually curated dataset by data engineers at [NeuralSpace](https://www.neuralspace.ai/) for the insurance and retail domain. The dataset contains intents (the action users want to execute) and examples (anything that a user sends to the chatbot) that can be used to build a chatbot. The files in this dataset are available in JSON format.
### Supported Tasks
#### nsme-com
### Languages
The language data in NSME-COM is in English (BCP-47 `en`)
## Dataset Structure
### Data Instances
- **Size of downloaded dataset files:** 10.86 KB
An example of 'test' looks as follows.
``` {
"text": "is it good to add roadside assistance?",
"intent": "Add",
"type": "Test"
}
```
An example of 'train' looks as follows.
```{
"text": "how can I add my spouse as a nominee?",
"intent": "Add",
"type": "Train"
},
```
### Data Fields
The data fields are the same among all splits.
#### nsme-com
- `text`: a `string` feature.
- `intent`: a `string` feature.
- `type`: a classification label, with possible values including `train` or `test`.
### Data Splits
#### nsme-com
| |train|test|
|----|----:|---:|
|nsme-com| 1725| 406|
### Contributions
Ankur Saxena (ankursaxena@neuralspace.ai)
提供机构:
neuralspace
原始信息汇总
数据集概述
数据集名称
- 名称: NSME-COM
- 别名: NeuralSpace Massive E-Comm
数据集描述
- 领域: 零售和保险
- 用途: 用于构建聊天机器人,自动化客户服务
- 内容: 包含意图(用户想要执行的操作)和示例(用户发送给聊天机器人的信息)
- 格式: JSON
数据集特性
- 语言: 英语 (
en) - 多语言性: 单语
- 大小: 小于1K
- 来源: 原始数据
- 任务类别:
- 问答
- 文本检索
- 文本到文本生成
- 翻译
- 对话
- 任务ID:
- 抽取式问答
- 封闭领域问答
- 话语检索
- 文档检索
- 开放书本问答
- 封闭书本问答
数据集结构
- 数据实例: 包含文本和意图,以及类型(训练或测试)
- 数据字段:
text: 字符串类型intent: 字符串类型type: 分类标签,值为train或test
- 数据分割:
- 训练集: 1725条
- 测试集: 406条
许可证
- 许可证: CC-BY-NC-SA-4.0
标签
- 标签:
- 聊天机器人
- 电子商务
- 零售
- 保险
- 消费者
- 消费品
贡献者
- 贡献者: Ankur Saxena (ankursaxena@neuralspace.ai)



