TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_AT_OURS-SFT-longmult_4dig__v1
Source: Hugging Face. Updated 2025-11-10. Indexed 2025-11-15.
Download link:
https://hf-mirror.com/datasets/TAUR-dev/D-ExpTracker__FinEval_16k_fulleval_AT_OURS-SFT-longmult_4dig__v1
Description:
This dataset supports evaluation on the longmult_4dig task in the FinEval_16k_fulleval_AT_OURS-SFT experiment. It includes test and training data; the test set has 1000 samples. Its features include questions, answers, task configurations, task sources, prompts, model responses, and evaluation information, along with metadata about the experiment. The dataset is used to track the stages of the experiment, including model training, hyperparameter configuration, logging, and evaluation results.
Provider:
TAUR-dev



