five

TAPAS

收藏
魔搭社区2025-11-27 更新2025-05-24 收录
下载链接:
https://modelscope.cn/datasets/facebook/TAPAS
下载链接
链接失效反馈
官方服务:
资源简介:
# TAPAS: Datasets for Learning the Learning with Errors Problem ## About this Data AI-powered attacks on Learning with Errors (LWE)—an important hard math problem in post-quantum cryptography—rival or outperform "classical" attacks on LWE under certain parameter settings. Despite the promise of this approach, a dearth of accessible data limits AI practitioners' ability to study and improve these attacks. Creating LWE data for AI model training is time- and compute-intensive and requires significant domain expertise. To fill this gap and accelerate AI research on LWE attacks, we propose the TAPAS datasets, a **t**oolkit for **a**nalysis of **p**ost-quantum cryptography using **A**I **s**ystems. These datasets cover several LWE settings and can be used off-the-shelf by AI practitioners to prototype new approaches to cracking LWE. The table below gives an overview of the datasets provided in this work: | n | log q | omega | rho | # samples | |--------|-----------|----------|--------|------------| | 256 | 20 | 10 | 0.4284 | 400M | | 512 | 12 | 10 | 0.9036 | 40M | | 512 | 28 | 10 | 0.6740 | 40M | | 512 | 41 | 10 | 0.3992 | 40M | | 1024 | 26 | 10 | 0.8600 | 40M | ## Usage These datasets are intended to be used in conjunction with the code at: https://github.com/facebookresearch/LWE-benchmarking Download and unzip the .tar.gz files into a directory with enough storage. For the datasets split into different chunks, concatenate all the files into one data.prefix file after unzipping. Then, follow the instructions in this [README](https://github.com/facebookresearch/LWE-benchmarking/blob/main/README.md) to generate the full sets of LWE pairs and train AI models on this data. Due to storage constraints, we only provide 40M of the n=256 data here on huggingface. The rest can be found at this directory (append filenames ranging from chunk_ab.tar.gz to chunk_aj.tar.gz to download): http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/ Here are the exact links to each remaining section of the n=256 data (each link has 40M examples): [1](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ab.tar.gz) [2](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ac.tar.gz) [3](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ad.tar.gz) [4](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ae.tar.gz) [5](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_af.tar.gz) [6](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ag.tar.gz) [7](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ah.tar.gz) [8](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ai.tar.gz) [9](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_aj.tar.gz)

# TAPAS: 用于学习容错学习问题的数据集 ## 关于本数据集 基于人工智能的容错学习(Learning with Errors, LWE)攻击——后量子密码学中的一类核心困难数学问题——在部分参数配置场景下,其性能已可媲美甚至超越传统的LWE经典攻击方法。尽管该技术路径前景可观,但可用数据集的匮乏极大限制了AI从业者研究与优化此类攻击的能力。为LWE模型训练生成专属数据集不仅耗时耗力,还需要深厚的领域专业知识储备。为填补这一研究空白,加速LWE攻击相关的AI研究进程,我们推出了TAPAS数据集,即借助人工智能系统开展后量子密码学分析(**t**oolkit for **a**nalysis of **p**ost-quantum cryptography using **A**I **s**ystems)的工具集。本数据集覆盖多种LWE参数配置场景,AI从业者可直接开箱即用,以此为基础原型化开发破解LWE的全新方案。 下表为本工作提供的数据集概览: | 参数n | log q | omega | rho | 样本总量 | |--------|-----------|----------|--------|------------| | 256 | 20 | 10 | 0.4284 | 4亿 | | 512 | 12 | 10 | 0.9036 | 4000万 | | 512 | 28 | 10 | 0.6740 | 4000万 | | 512 | 41 | 10 | 0.3992 | 4000万 | | 1024 | 26 | 10 | 0.8600 | 4000万 | ## 使用方式 本数据集需配合下述代码仓库使用:https://github.com/facebookresearch/LWE-benchmarking 请将下载的.tar.gz文件解压至具备足够存储空间的目录中。若数据集被拆分为多个分块,解压完成后需将所有文件合并为一个名为data.prefix的文件。 随后,请遵循此[README文档](https://github.com/facebookresearch/LWE-benchmarking/blob/main/README.md)中的说明,生成完整的LWE样本对集,并基于此数据训练AI模型。 受限于存储空间,我们仅在Hugging Face平台上提供了n=256配置下的4000万条数据。其余数据可通过以下目录下载(需将文件名范围从chunk_ab.tar.gz至chunk_aj.tar.gz追加至目录路径后):http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/ 以下为n=256数据集各剩余分块的精准下载链接(每个链接对应4000万条样本): [1](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ab.tar.gz) [2](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ac.tar.gz) [3](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ad.tar.gz) [4](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ae.tar.gz) [5](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_af.tar.gz) [6](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ag.tar.gz) [7](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ah.tar.gz) [8](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_ai.tar.gz) [9](http://dl.fbaipublicfiles.com/large_objects/lwe-benchmarking/n256_logq20/chunk_aj.tar.gz)
提供机构:
maas
创建时间:
2025-05-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作