euclaise/tex-stackexchange
收藏Hugging Face2023-10-20 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/euclaise/tex-stackexchange
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: parent_url
dtype: string
- name: parent_score
dtype: string
- name: parent_body
dtype: string
- name: parent_user
dtype: string
- name: parent_title
dtype: string
- name: accepted
dtype: bool
- name: body
dtype: string
- name: score
dtype: string
- name: user
dtype: string
- name: answer_id
dtype: string
- name: __index_level_0__
dtype: int64
splits:
- name: train
num_bytes: 505025688
num_examples: 190807
download_size: 221660047
dataset_size: 505025688
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
license: cc-by-sa-4.0
---
# Dataset Card for "tex-stackexchange"
This is a dump of [the TeX StackExchange community](https://tex.stackexchange.com/), converted to markdown.
Data from [The StackExchange data dump](https://archive.org/details/stackexchange), 2023-09-12 release.
Posts where the *questions* included images are excluded. Images in the answers are stripped out.
提供机构:
euclaise
原始信息汇总
数据集概述
数据集信息
-
特征列表:
parent_url: 字符串类型parent_score: 字符串类型parent_body: 字符串类型parent_user: 字符串类型parent_title: 字符串类型accepted: 布尔类型body: 字符串类型score: 字符串类型user: 字符串类型answer_id: 字符串类型__index_level_0__: 整数类型
-
数据分割:
train: 包含 190807 个样本,占用 505025688 字节
-
数据集大小:
- 下载大小: 221660047 字节
- 数据集大小: 505025688 字节
配置信息
- 默认配置:
- 数据文件路径:
data/train-*
- 数据文件路径:
许可证
- 许可证: CC BY-SA 4.0



