DataOceanAI/Off_the_self_dataset
收藏Hugging Face2023-10-11 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/DataOceanAI/Off_the_self_dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
license: unknown
task_categories:
- conversational
- text-generation
tags:
- datasets
- dataoceanai
- speechocean
- ASR
- TTS
pretty_name: DataOcean AI - Off the self datasets
---
# Introduction
<!-- Provide a quick summary of the dataset. -->
DataOcean AI (SHA stock code: 688787), founded in 2005, is one of the earliest AI training data solution providers in China.
As the first listed enterprise in AI training data domestically, DataOcean AI is committed to providing AI datasets and services for AI enterprises and R&D institutions.
DataOcean AI specializes in delivering comprehensive, multilingual, cross-domain, and multimodal AI datasets, along with a range of data-related services. Our offerings include data annotation, data collection, data design, and modal evaluation, catering to the diverse needs of enterprises across various industries. Our services encompass essential domains such as smart voice (including voice recognition and voice synthesis), computer vision, and natural language processing, spanning a wide array of approximately 200 primary languages and dialects from around the globe.
DataOcean AI has been actively involved in the industry for nearly two decades and has developed close to 700 deep partnerships with leading IT companies, academic institutions, and emerging AI enterprises. It has delivered thousands of customized projects successfully and gained the deep trust of customers by focusing on competent, dependable, and safe data services. The company’s superior resources which cover 190+ languages and dialects in more than 70 countries, as well as its technologically leading algorithm R&D team and well-experienced project teams, are valuable assets of the company that contribute to the overall successful implementation of frontier AI projects around the world.
### Dataset Description
<!-- Provide a longer summary of what this dataset is. -->
- **Curated by:** [DATAOCEAN AI](https://en.dataoceanai.com/)
- **License:** Commercial
Check out the [files](https://huggingface.co/datasets/DataOceanAI/Off_the_self_dataset/tree/main) or visit our website for details
## Contact
You can alwasy contact us via email "contact@dataoceanai.com" or fill up the [contact form](https://en.dataoceanai.com/?m=index&c=dsvoice&a=consult&aboutus_id=9619) in our website ' https://en.dataoceanai.com/ '
<!-- Address questions around how the dataset is intended to be used. -->
license: 未知
task_categories:
- 对话式
- 文本生成
tags:
- 数据集
- DataOcean AI
- speechocean
- 自动语音识别(ASR)
- 文本转语音(TTS)
pretty_name: DataOcean AI - 通用现成数据集
---
# 数据集介绍
<!-- 简要概述该数据集。 -->
DataOcean AI(上海证券交易所股票代码:688787)成立于2005年,是中国最早的人工智能训练数据解决方案提供商之一。
作为国内人工智能训练数据领域首家上市企业,DataOcean AI 致力于为人工智能企业与研发机构提供AI数据集及相关服务。
DataOcean AI 专注于提供全面、多语言、跨领域、多模态的人工智能数据集,以及一系列数据相关服务。我们的服务涵盖数据标注、数据采集、数据设计与模态评估,可满足各行业企业的多样化需求。业务覆盖智能语音(包含语音识别与语音合成)、计算机视觉、自然语言处理等核心领域,支持全球约200种主要语言及方言。
DataOcean AI 深耕行业近二十年,已与头部IT企业、学术机构及新兴AI企业建立近700家深度合作关系,成功交付数千个定制化项目,并凭借专业可靠、安全合规的数据服务赢得客户的深度信赖。公司拥有覆盖全球70余个国家、190余种语言及方言的优质资源,同时配备技术领先的算法研发团队与经验丰富的项目团队,这些都是助力全球前沿AI项目顺利落地的宝贵资产。
### 数据集详情
<!-- 详细说明该数据集的具体情况。 -->
- **出品方:** [DataOcean AI](https://en.dataoceanai.com/)
- **许可证:** 商业许可
可查看[数据集文件](https://huggingface.co/datasets/DataOceanAI/Off_the_self_dataset/tree/main)或访问官方网站获取详细信息
## 联系方式
您可随时通过邮箱"contact@dataoceanai.com"联系我们,或填写我方官网https://en.dataoceanai.com/ 上的[联系表单](https://en.dataoceanai.com/?m=index&c=dsvoice&a=consult&aboutus_id=9619)
提供机构:
DataOceanAI
原始信息汇总
数据集描述
- 数据集名称: DataOcean AI - Off the self datasets
- 任务类别:
- 对话
- 文本生成
- 标签:
- 数据集
- dataoceanai
- speechocean
- ASR
- TTS
- 许可证: 商业
- 数据集提供方: DATAOCEAN AI
联系信息
- 电子邮件: contact@dataoceanai.com
- 联系表单: 联系表单



