five

[SAMPLE] Nexdata | Unsupervised Text Data | 1 PB | Foundation Model | Pre-training Data | Large ...

收藏
Databricks2025-01-04 收录
下载链接:
https://marketplace.databricks.com/details/f6e13c88-730a-4093-a633-8964f9320c2a/Nexdata_SAMPLE-Nexdata-Unsupervised-Text-Data-1-PB-Foundation-Model-Pre-training-Data-Large-
下载链接
链接失效反馈
官方服务:
资源简介:
1. Test Questions Data Volume: 50 Millions Data Filed: contains title, answer, parse, subject, grade, question type; Format: jsonl; Language: English, Korean, French, German, Spanish 2. e-books Data Volume: 10 million books with ISBN Formats: Epub, PDF Language: English, Korean, French, German, Spanish 3. About Nexdata Nexdata owns off-the-shelf PB-level Large Language Model(LLM) Data, 1 million hours of Audio Data and 800TB of Annotated Imagery Data. These ready-to-go data supports instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/llm?source=Datarade
提供机构:
Nexdata
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作