
Salesforce/cota-mantis

Source: Hugging Face | Updated: 2025-01-06 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/Salesforce/cota-mantis
Description:
The TACO dataset contains 293K synthetic chains of thought and action (CoTA) generated by GPT-4o. The chains draw on 15 different actions, such as OCR, depth estimation, and calculation. Its primary use is fine-tuning multimodal language models to produce chains of thought and action when answering complex visual questions. The source data comes from Cauldron and Mantis-Instruct, which were in turn collected from existing datasets including COCO, AOKVQA, ScienceQA, and Visual Genome. Limitations include biases inherited from GPT-4o and a limited action set, which mostly covers vision-centric tools plus a few general-purpose ones.
Provider:
Salesforce
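
For readers who want to inspect the records behind the download link, the following is a minimal sketch using the Hugging Face `datasets` library. It is not taken from the dataset's own documentation: the helper names `dataset_page_url` and `load_cota_mantis` are hypothetical, and the split name `"train"` and the optional configuration `name` are assumptions that should be checked against the dataset page.

```python
import os

REPO_ID = "Salesforce/cota-mantis"         # repository name from this listing
MIRROR_ENDPOINT = "https://hf-mirror.com"  # mirror host from the download link


def dataset_page_url(repo_id: str = REPO_ID, endpoint: str = MIRROR_ENDPOINT) -> str:
    """Build the dataset page URL for a given Hub endpoint."""
    return f"{endpoint.rstrip('/')}/datasets/{repo_id}"


def load_cota_mantis(name=None, split="train", use_mirror=False):
    """Stream records from the Hub (needs `pip install datasets` and network access).

    Assumptions: the split name "train" and the optional configuration
    `name` are guesses; check the dataset page for the actual values.
    """
    if use_mirror:
        # HF_ENDPOINT is read when huggingface_hub is first imported, so it
        # must be set before the import below to take effect.
        os.environ["HF_ENDPOINT"] = MIRROR_ENDPOINT
    from datasets import load_dataset

    # streaming=True iterates over records without downloading every file first.
    return load_dataset(REPO_ID, name=name, split=split, streaming=True)
```

Calling `dataset_page_url()` reproduces the link shown above. Iterating over `load_cota_mantis()` and printing `record.keys()` for the first few records is a cautious way to discover the field names rather than assuming a schema.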