five

ebowwa/relative-postioning

收藏
Hugging Face2024-05-20 更新2024-06-15 收录
下载链接:
https://hf-mirror.com/datasets/ebowwa/relative-postioning
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: prompt dtype: string - name: response dtype: string splits: - name: train num_bytes: 916486 num_examples: 10000 download_size: 164700 dataset_size: 916486 configs: - config_name: default data_files: - split: train path: data/train-* license: apache-2.0 task_categories: - feature-extraction language: - en tags: - code pretty_name: relative-positioning size_categories: - 10K<n<100K --- ### NOTE: this is a huggingface duplicate for further processing. Here's the originial readme # Dataset Card for Dataset Name This dataset aims to teach LLMs relative positioning (e.g. above, left from, below, etc.), which in my findings most LLMs, even SOTA where not able to produce under all circumstances. Will be pushing a fine-tuned mixtral-7x8B with this dataset. ## Dataset Details ### Dataset Description Contains Data for relative positioning on a grid(256, 256). Assumes Origin [0, 0] is in the bottom left. Two Objects (Object 1, Object 2) are randomly created. Answer is there relative position to one another. - **Curated by:** [Antoine Angert] - **Language(s) (NLP):** [English] - **License:** [apache-2.0] ## Uses ### Direct Use Can be used to fine-tune Language Models. (Althought so far not been tested, will update) ## Dataset Structure Features: Prompt(String), Response(String) ## Dataset Creation ### Curation Rationale I did some testing to see how well LLMs are able to handle positional data(2D, 3D). I found that most small models (tested: llama-7B, llama-13B, mistral-7B) have very poor positional understanding. Most bigger Models (tested: gpt-3.5-turbo, gpt-4, llama-70B, mixtral-7x8B) have a fairly good positional understanding, as long as no other context is provided. When I tried using positional reasoning with some other unrelated context, the performance of these bigger models dropped imensly. This is my first attempt of trying to embed this understanding directly into the models and not throught context. #### Data Collection and Processing The dataset was generated using a python script. ## Dataset Card Authors [optional] Antoine Angert ## Dataset Card Contact Contact under: antoine.angert@hsbi.de
提供机构:
ebowwa
原始信息汇总

数据集卡片 for Dataset Name

数据集详情

数据集描述

该数据集旨在教授大型语言模型(LLMs)相对定位(例如,上方、左侧、下方等),在我的研究中发现,即使在所有情况下,大多数LLMs(包括SOTA)也无法生成正确的相对定位。将推送一个使用此数据集微调的mixtral-7x8B模型。

数据集结构

  • 特征:
    • prompt: 字符串类型
    • response: 字符串类型

数据集分割

  • 训练集:
    • 字节数: 916486
    • 样本数: 10000

数据集创建

数据收集和处理

该数据集是使用Python脚本生成的。

数据集使用

直接使用

可用于微调语言模型。(尽管尚未测试,将更新)

数据集许可证

  • 许可证: apache-2.0

数据集语言

  • 语言: 英语

数据集标签

  • 标签: code

数据集名称

  • 名称: relative-positioning

数据集大小分类

  • 大小分类: 10K<n<100K

数据集作者

  • 作者: Antoine Angert

数据集联系信息

  • 联系信息: antoine.angert@hsbi.de
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作