Mathematics-Class10-Tnsb
ModelScope community; updated 2025-11-27; indexed 2025-05-10
Download link:
https://modelscope.cn/datasets/prithivMLmods/Mathematics-Class10-Tnsb
Description:
# Mathematics-Class10-Tnsb
This dataset contains scanned images from a Class 10 Mathematics textbook under the TNSB (Tamil Nadu State Board) curriculum. It is intended for educational machine learning tasks such as image-to-text (OCR), textbook digitization, or educational content understanding.
## Dataset Details
* **Source**: Tamil Nadu State Board Class 10 Mathematics textbook
* **Task**: Image-to-Text
* **Language**: English
* **Split**: `train` only
* **Rows**: 352
* **Format**: Images only (scanned textbook pages)
* **License**: [Apache 2.0](https://www.apache.org/licenses/LICENSE-2.0)
## Features
| Feature | Type |
| ------- | ----- |
| image | Image |
This dataset contains **352 images** in the `train` split, with a total size of **223 MB**.
## Use Cases
* Optical Character Recognition (OCR) model training
* Educational content parsing and summarization
* Visual question answering for school-level curriculum
* PDF-to-digital-textbook conversion systems
## How to Use
You can use the dataset directly with the 🤗 Datasets library:
```python
from datasets import load_dataset

# Fetches the dataset from the Hugging Face Hub; the result holds a single "train" split.
dataset = load_dataset("prithivMLmods/Mathematics-Class10-Tnsb")
```
Each sample in the dataset contains:
```python
{
"image": <PIL.Image>
}
```
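As a minimal sketch of how these samples might be consumed, the snippet below writes each page image to disk as a numbered PNG, a common first step before running an OCR engine. The helper names `page_filename` and `export_pages`, the output directory, and the file-naming scheme are illustrative assumptions, not part of the dataset. The code relies only on each sample exposing a PIL-style `save()` method under the `"image"` key.

```python
from pathlib import Path


def page_filename(index: int, width: int = 4) -> str:
    """Return a zero-padded file name such as 'page_0007.png' (hypothetical scheme)."""
    return f"page_{index:0{width}d}.png"


def export_pages(samples, out_dir):
    """Save the "image" field of every sample into out_dir and return the paths.

    `samples` is any iterable of dicts shaped like {"image": <PIL.Image>}.
    """
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)
    written = []
    for index, sample in enumerate(samples):
        target = out_path / page_filename(index)
        sample["image"].save(target)  # PIL infers the PNG format from the extension
        written.append(target)
    return written


# Example usage (requires the `datasets` package and network access):
#   from datasets import load_dataset
#   train = load_dataset("prithivMLmods/Mathematics-Class10-Tnsb", split="train")
#   export_pages(train, "tnsb_pages")  # "tnsb_pages" is an assumed directory name
```

Passing `split="train"` returns the split itself rather than a dictionary of splits, so the samples can be iterated directly.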
## Citation
If you use this dataset in your research or application, please cite the source and attribute the original textbook to the Tamil Nadu State Board.
Provider:
maas
Created:
2025-05-06



