tomg-group-umd/CLRS-Text-test

Name: tomg-group-umd/CLRS-Text-test
Creator: tomg-group-umd
Published: 2024-07-10 15:21:43
License: 暂无描述

Hugging Face2024-07-10 更新2024-07-22 收录

下载链接：

https://hf-mirror.com/datasets/tomg-group-umd/CLRS-Text-test

下载链接

链接失效反馈

官方服务：

资源简介：

CLRS文本测试数据集包含5个不同的测试分割，每个分割使用不同的随机种子生成，涵盖了30种算法。每个样本包含问题、答案、算法名称和长度四个特征。数据集的下载大小为219757341字节，总大小为923909071字节。数据集的语言为英语，许可证为Apache-2.0。

The CLRS Text Testing Datasets contain 5 different test splits, each generated with a different random seed, covering 30 algorithms. Each sample includes four features: question, answer, algorithm name, and length. The dataset has a download size of 219757341 bytes and a total size of 923909071 bytes. The dataset is in English and is licensed under Apache-2.0.

提供机构：

tomg-group-umd

原始信息汇总

CLRS Text Testing Datasets

数据集概述

名称: CLRS Text Testing Datasets
语言: 英语
大小类别: 100K < n < 1M
许可证: Apache-2.0

数据集特征

特征:
- question: 字符串类型
- answer: 字符串类型
- algo_name: 字符串类型
- length: 整数类型

数据集分割

分割:
- test_1: 100,400个样本，183,920,334字节
- test_2: 100,600个样本，185,222,175字节
- test_3: 100,600个样本，184,881,343字节
- test_4: 100,800个样本，186,159,042字节
- test_5: 100,400个样本，183,726,177字节

数据集配置

配置名称: default
数据文件路径:
- test_1: data/test_1-*
- test_2: data/test_2-*
- test_3: data/test_3-*
- test_4: data/test_4-*
- test_5: data/test_5-*

数据集大小

下载大小: 219,757,341字节
数据集大小: 923,909,071字节

包含的算法

activity_selector
articulation_points
bellman_ford
bfs
binary_search
bridges
bubble_sort
dag_shortest_paths
dfs
dijkstra
find_maximum_subarray_kadane
floyd_warshall
graham_scan
heapsort
insertion_sort
jarvis_march
kmp_matcher
lcs_length
matrix_chain_order
minimum
mst_kruskal
mst_prim
naive_string_matcher
optimal_bst
quickselect
quicksort
segments_intersect
strongly_connected_components
task_scheduling
topological_sort

5,000+

优质数据集

54 个

任务类型

进入经典数据集