DataOrigin/educational-concept-videos-india
收藏Hugging Face2026-04-06 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/DataOrigin/educational-concept-videos-india
下载链接
链接失效反馈官方服务:
资源简介:
---
license: other
task_categories:
- video-classification
- text-to-video
language:
- hi
- en
- ta
- te
- bn
- ml
- mr
- or
- as
- gu
- kn
- pa
tags:
- education,
- k12,
- india,
- multilingual,
- youtube,
- expert-led,
- concept-learning,
- indic-languages
pretty_name: Educational Concept Videos India
size_categories:
- 1K<n<10K
---
# Educational Concept Videos India
## Dataset Description
A curated collection of expert-led educational concept and learning videos
published on YouTube, covering K-12 curriculum topics across Indian boards.
Produced by Prepp, India's largest K-12 learning platform, operated by
Collegedunia Web Private Limited.
## Dataset Summary
- **Total videos:** 4,400 videos
- **Content type:** Expert-led concept explanation and learning videos
- **Subjects:** Mathematics, Science, Social Studies, English, Regional Languages
- **Grades:** Class 1 through Class 12
- **Boards:** 33 Indian educational boards including CBSE, ICSE, and all
major state boards
- **Languages:** Hindi, English, Tamil, Telugu, Kannada, Malayalam, Marathi,
Bengali, Gujarati, Odia, Punjabi, Assamese
- **Format:** Video (MP4) with structured expert narration and visual aids
- **Distribution:** Originally published on YouTube; full dataset available
for commercial licensing
## Sample Data
Three sample videos are available in this repository demonstrating:
- Sample 1: Mathematics concept explanation (Algebra, Class 9, CBSE)
- Sample 2: Science concept video (Light and Optics, Class 10, CBSE)
- Sample 3: Regional language concept video (Hindi medium, Class 7)
## Key Features
- **Expert-led:** All videos delivered by qualified subject matter experts
with structured pedagogical approach
- **Concept-first design:** Videos structured for conceptual understanding
rather than rote memorisation — high signal for reasoning model training
- **Curriculum-mapped:** Each video tagged to specific board, grade, subject,
chapter, and learning objective
- **Multilingual delivery:** Same concepts taught across multiple Indian
languages enabling cross-lingual training signal
- **Verified accuracy:** All content reviewed for factual correctness before
publication
## Intended Uses
- Training video-language models for educational content understanding
- Concept explanation generation model development
- Multilingual educational AI and tutoring system training
- Indic language instructional video understanding
- Curriculum-aligned content recommendation systems
- Teacher AI and pedagogical model fine-tuning
## Data Collection and Rights
All content is proprietary, produced by Prepp's in-house expert faculty and
content team. Content is curriculum-mapped, factually verified, and ethically
sourced. Full dataset licensing is available for commercial AI training purposes.
## Licensing and Commercial Access
This repository contains sample data only. The full dataset of 4,400
expert-led concept videos is available for commercial AI training licensing.
**For licensing inquiries contact:**
Ankit Dubey — Head of AI Data Partnerships, Collegedunia
ankit.dubey@collegedunia.com
## Dataset Curator
[Collegedunia Web Private Limited](https://collegedunia.com) |
[Prepp](https://prepp.in)
Gurugram, Haryana, India
提供机构:
DataOrigin



