five

neuralspace/NSME-COM

收藏
Hugging Face2022-09-13 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/neuralspace/NSME-COM
下载链接
链接失效反馈
官方服务:
资源简介:
--- annotations_creators: - other language_creators: - other language: - en expert-generated license: - cc-by-nc-sa-4.0 multilinguality: - monolingual size_categories: - n<1K source_datasets: - original task_categories: - question-answering - text-retrieval - text2text-generation - other - translation - conversational task_ids: - extractive-qa - closed-domain-qa - utterance-retrieval - document-retrieval - closed-domain-qa - open-book-qa - closed-book-qa paperswithcode_id: acronym-identification pretty_name: Massive E-commerce Dataset for Retail and Insurance domain. train-eval-index: - config: nsds task: token-classification task_id: entity_extraction splits: train_split: train eval_split: test col_mapping: sentence: text label: target metrics: - type: nsme-com name: NSME-COM config: nsds tags: - chatbots - e-commerce - retail - insurance - consumer - consumer goods configs: - nsds --- # Dataset Card for NSME-COM ## Table of Contents - [Dataset Description](#dataset-description) - [Dataset Summary](#dataset-summary) - [Supported Tasks](#supported-tasks) - [Languages](#languages) - [Dataset Structure](#dataset-structure) - [Data Instances](#data-instances) - [Data Fields](#data-fields) - [Data Splits](#data-splits) - [Dataset Creation](#dataset-creation) - [Curation Rationale](#curation-rationale) - [Source Data](#source-data) - [Annotations](#annotations) - [Personal and Sensitive Information](#personal-and-sensitive-information) - [Considerations for Using the Data](#considerations-for-using-the-data) - [Social Impact of Dataset](#social-impact-of-dataset) - [Discussion of Biases](#discussion-of-biases) - [Other Known Limitations](#other-known-limitations) - [Additional Information](#additional-information) - [Dataset Curators](#dataset-curators) - [Licensing Information](#licensing-information) - [Citation Information](#citation-information) - [Contributions](#contributions) ### Dataset Description - **Homepage**: [NeuralSpace Homepage](https://huggingface.co/neuralspace) - **Repository:** [NSME-COM Dataset](https://huggingface.co/datasets/neuralspace/NSME-COM) - **Point of Contact:** [Ankur Saxena](mailto:ankursaxena@neuralspace.ai) - **Point of Contact:** [Ayushman Dash](mailto:ayushman@neuralspace.ai) - **Size of downloaded dataset files:** 10.86 KB ### Dataset Summary In this digital age, the E-Commerce industry has increasingly become a vital component of business strategy and development. To streamline, enhance and take the customer experience to the highest level, NLP can help create surprisingly massive value in the E-Commerce industry. One of the most popular NLP use-cases is a chatbot. With a chatbot you can automate your customer engagement saving yourself time and other resources. Offering an enhanced and simplified customer experience you can increase your sales and also offer your website visitors personalized recommendations. The NSME-COM dataset (NeuralSpace Massive E-Comm) is a manually curated dataset by data engineers at [NeuralSpace](https://www.neuralspace.ai/) for the insurance and retail domain. The dataset contains intents (the action users want to execute) and examples (anything that a user sends to the chatbot) that can be used to build a chatbot. The files in this dataset are available in JSON format. ### Supported Tasks #### nsme-com ### Languages The language data in NSME-COM is in English (BCP-47 `en`) ## Dataset Structure ### Data Instances - **Size of downloaded dataset files:** 10.86 KB An example of 'test' looks as follows. ``` { "text": "is it good to add roadside assistance?", "intent": "Add", "type": "Test" } ``` An example of 'train' looks as follows. ```{ "text": "how can I add my spouse as a nominee?", "intent": "Add", "type": "Train" }, ``` ### Data Fields The data fields are the same among all splits. #### nsme-com - `text`: a `string` feature. - `intent`: a `string` feature. - `type`: a classification label, with possible values including `train` or `test`. ### Data Splits #### nsme-com | |train|test| |----|----:|---:| |nsme-com| 1725| 406| ### Contributions Ankur Saxena (ankursaxena@neuralspace.ai)
提供机构:
neuralspace
原始信息汇总

数据集概述

数据集名称

  • 名称: NSME-COM
  • 别名: NeuralSpace Massive E-Comm

数据集描述

  • 领域: 零售和保险
  • 用途: 用于构建聊天机器人,自动化客户服务
  • 内容: 包含意图(用户想要执行的操作)和示例(用户发送给聊天机器人的信息)
  • 格式: JSON

数据集特性

  • 语言: 英语 (en)
  • 多语言性: 单语
  • 大小: 小于1K
  • 来源: 原始数据
  • 任务类别:
    • 问答
    • 文本检索
    • 文本到文本生成
    • 翻译
    • 对话
  • 任务ID:
    • 抽取式问答
    • 封闭领域问答
    • 话语检索
    • 文档检索
    • 开放书本问答
    • 封闭书本问答

数据集结构

  • 数据实例: 包含文本和意图,以及类型(训练或测试)
  • 数据字段:
    • text: 字符串类型
    • intent: 字符串类型
    • type: 分类标签,值为traintest
  • 数据分割:
    • 训练集: 1725条
    • 测试集: 406条

许可证

  • 许可证: CC-BY-NC-SA-4.0

标签

  • 标签:
    • 聊天机器人
    • 电子商务
    • 零售
    • 保险
    • 消费者
    • 消费品

贡献者

  • 贡献者: Ankur Saxena (ankursaxena@neuralspace.ai)
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作