DLIlab/webarena_paraphrased_instruction_instruction_web
收藏Hugging Face2024-03-26 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/DLIlab/webarena_paraphrased_instruction_instruction_web
下载链接
链接失效反馈官方服务:
资源简介:
In this case, given_tasks are made through the following steps.
1. make vectorDB with only instruction in mind2Web.
2. for each website in WebArena, retrieve relevant instructions in mind2web, top_30, with a threshold 0.47(L2 distance)
3. every retrieved instructions(from mind2web) are used as given_task.
---
dataset_info:
features:
- name: given_task
dtype: string
- name: generated_task
sequence: string
- name: target_web_name
dtype: string
- name: annotation_id
dtype: string
splits:
- name: train
num_bytes: 30103
num_examples: 39
download_size: 20295
dataset_size: 30103
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
提供机构:
DLIlab
原始信息汇总
数据集概述
数据集结构
- 特征信息:
- given_task:字符串类型
- generated_task:字符串序列类型
- target_web_name:字符串类型
- annotation_id:字符串类型
数据集分割
- 训练集:
- 样本数量:39个
- 数据大小:30103字节
数据集大小
- 下载大小:20295字节
- 数据集总大小:30103字节
配置信息
- 默认配置:
- 数据文件路径:
data/train-*
- 数据文件路径:



