Kongongong/Thai-Physics-Data-40K
Hugging Face · Updated 2024-05-02 · Indexed 2024-06-12
Download link:
https://hf-mirror.com/datasets/Kongongong/Thai-Physics-Data-40K
Description:
---
task_categories:
- question-answering
language:
- th
tags:
- physics
size_categories:
- 10K<n<100K
---
***Thai-Physics-Data*** is a Thai-language physics dataset with more than 40k lines of data.
**Data Sources:**
- ArtifactAI/arxiv-physics-instruct-tune-30k (CC BY-NC 2.0)
- camel-ai/physics
**How to load Data (Hugging Face)**
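```python
from datasets import Dataset, load_dataset

# Load the dataset from the Hugging Face Hub and keep the train split
Thai_Physics_Data = load_dataset("Kongongong/Thai-Physics-Data-40K")
Thai_Physics_Data = Thai_Physics_Data['train']

def format_data():
    # Formatting logic is elided in the original card;
    # it is expected to append formatted strings to `data`.
    ...

data = []
format_data()

# Wrap the formatted strings in a single-column Dataset
data = Dataset.from_dict({"text": data})
```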
**How to load Data (CSV)**
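```python
import pandas as pd
from datasets import Dataset

# Read the local CSV export and keep only the question/answer columns
raw_datasets = pd.read_csv("./physic_thai.csv")
raw_datasets = raw_datasets[['question','answer']]

def format_data():
    # Formatting logic is elided in the original card;
    # it is expected to append formatted strings to `data`.
    ...

data = []
format_data()

# Wrap the formatted strings in a single-column Dataset
data = Dataset.from_dict({"text": data})
```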
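The body of `format_data()` is left out of both snippets above. As a rough illustration only, the sketch below shows one way question/answer rows could be turned into the list of strings that `Dataset.from_dict({"text": data})` expects. The function name `format_rows`, the `TEMPLATE` string, and the sample rows are all assumptions made for this example and do not come from the dataset card; only the `question` and `answer` column names are taken from the CSV snippet.

```python
from typing import Iterable, List, Mapping

# Hypothetical prompt template; the dataset card does not specify one.
TEMPLATE = "### Question:\n{question}\n\n### Answer:\n{answer}"

def format_rows(rows: Iterable[Mapping[str, object]]) -> List[str]:
    """Turn question/answer rows into one training string per row.

    Rows whose question or answer is missing, non-string, or blank are skipped.
    """
    texts = []
    for row in rows:
        question = row.get("question")
        answer = row.get("answer")
        # Skip missing values (e.g. None, or NaN coming from a CSV)
        if not isinstance(question, str) or not isinstance(answer, str):
            continue
        question, answer = question.strip(), answer.strip()
        if not question or not answer:
            continue
        texts.append(TEMPLATE.format(question=question, answer=answer))
    return texts

# Made-up sample rows for illustration; not taken from the dataset.
sample = [
    {"question": "แรงคืออะไร", "answer": "แรงคือสิ่งที่ทำให้วัตถุเปลี่ยนสภาพการเคลื่อนที่"},
    {"question": "   ", "answer": "ไม่มีคำถาม"},       # blank question, skipped
    {"question": "มวลคืออะไร", "answer": None},        # missing answer, skipped
]
texts = format_rows(sample)
```

Under these assumptions, the resulting list could stand in for `data` in either snippet, for example `Dataset.from_dict({"text": format_rows(Thai_Physics_Data)})` or `format_rows(raw_datasets.to_dict("records"))` for the CSV path.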
Provider:
Kongongong
Original information summary
Dataset overview
Basic information
- Name: Thai-Physics-Data
- Task category: question-answering
- Language: Thai (th)
- Tags: physics
- Size: 10K<n<100K
Data sources
- ArtifactAI/arxiv-physics-instruct-tune-30k (CC BY-NC 2.0)
- camel-ai/physics
Loading the data
Using Hugging Face
```python
from datasets import Dataset, load_dataset

Thai_Physics_Data = load_dataset("Kongongong/Thai-Physics-Data-40K")
Thai_Physics_Data = Thai_Physics_Data['train']

def format_data():
    # data-formatting code goes here
    pass

data = []
format_data()
data = Dataset.from_dict({"text": data})
```
Using CSV
```python
from datasets import Dataset
import pandas as pd

raw_datasets = pd.read_csv("./physic_thai.csv")
raw_datasets = raw_datasets[['question','answer']]

def format_data():
    # data-formatting code goes here
    pass

data = []
format_data()
data = Dataset.from_dict({"text": data})
```



