mzio/aprm-finqa_reasoning-gpt5m_med-gs8-s0-r1-train
收藏Hugging Face2026-03-23 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/mzio/aprm-finqa_reasoning-gpt5m_med-gs8-s0-r1-train
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: state
list:
- name: content
dtype: string
- name: role
dtype: string
- name: action
struct:
- name: content
dtype: string
- name: role
dtype: string
- name: next_obs
list:
- name: content
dtype: string
- name: role
dtype: string
- name: state_len
dtype: int64
- name: temperature
dtype: float64
- name: reward
dtype: float64
- name: done
dtype: bool
- name: truncated
dtype: bool
- name: timestep
dtype: int64
- name: try_step
dtype: int64
- name: batch_id
dtype: int64
- name: unique_data_sample_id
dtype: int64
- name: generation_id
dtype: int64
- name: split
dtype: string
- name: return_
dtype: float64
- name: advantage
dtype: int64
- name: return_is_computed
dtype: bool
- name: advantage_is_computed
dtype: bool
- name: constant_reward_group
dtype: bool
- name: tools
list:
- name: description
dtype: string
- name: name
dtype: string
- name: parameters
struct:
- name: properties
struct:
- name: company_name
struct:
- name: description
dtype: string
- name: type
dtype: string
- name: expression
struct:
- name: description
dtype: string
- name: type
dtype: string
- name: query
struct:
- name: description
dtype: string
- name: type
dtype: string
- name: table_name
struct:
- name: description
dtype: string
- name: type
dtype: string
- name: text
struct:
- name: description
dtype: string
- name: type
dtype: string
- name: required
list: string
- name: type
dtype: string
- name: type
dtype: string
- name: action_prob
dtype: float64
- name: system_prompt
dtype: string
splits:
- name: train
num_bytes: 44630842
num_examples: 3000
download_size: 42467620
dataset_size: 44630842
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
提供机构:
mzio



