five

DataOrigin/educational-concept-videos-india

收藏
Hugging Face2026-04-06 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/DataOrigin/educational-concept-videos-india
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: other task_categories: - video-classification - text-to-video language: - hi - en - ta - te - bn - ml - mr - or - as - gu - kn - pa tags: - education, - k12, - india, - multilingual, - youtube, - expert-led, - concept-learning, - indic-languages pretty_name: Educational Concept Videos India size_categories: - 1K<n<10K --- # Educational Concept Videos India ## Dataset Description A curated collection of expert-led educational concept and learning videos published on YouTube, covering K-12 curriculum topics across Indian boards. Produced by Prepp, India's largest K-12 learning platform, operated by Collegedunia Web Private Limited. ## Dataset Summary - **Total videos:** 4,400 videos - **Content type:** Expert-led concept explanation and learning videos - **Subjects:** Mathematics, Science, Social Studies, English, Regional Languages - **Grades:** Class 1 through Class 12 - **Boards:** 33 Indian educational boards including CBSE, ICSE, and all major state boards - **Languages:** Hindi, English, Tamil, Telugu, Kannada, Malayalam, Marathi, Bengali, Gujarati, Odia, Punjabi, Assamese - **Format:** Video (MP4) with structured expert narration and visual aids - **Distribution:** Originally published on YouTube; full dataset available for commercial licensing ## Sample Data Three sample videos are available in this repository demonstrating: - Sample 1: Mathematics concept explanation (Algebra, Class 9, CBSE) - Sample 2: Science concept video (Light and Optics, Class 10, CBSE) - Sample 3: Regional language concept video (Hindi medium, Class 7) ## Key Features - **Expert-led:** All videos delivered by qualified subject matter experts with structured pedagogical approach - **Concept-first design:** Videos structured for conceptual understanding rather than rote memorisation — high signal for reasoning model training - **Curriculum-mapped:** Each video tagged to specific board, grade, subject, chapter, and learning objective - **Multilingual delivery:** Same concepts taught across multiple Indian languages enabling cross-lingual training signal - **Verified accuracy:** All content reviewed for factual correctness before publication ## Intended Uses - Training video-language models for educational content understanding - Concept explanation generation model development - Multilingual educational AI and tutoring system training - Indic language instructional video understanding - Curriculum-aligned content recommendation systems - Teacher AI and pedagogical model fine-tuning ## Data Collection and Rights All content is proprietary, produced by Prepp's in-house expert faculty and content team. Content is curriculum-mapped, factually verified, and ethically sourced. Full dataset licensing is available for commercial AI training purposes. ## Licensing and Commercial Access This repository contains sample data only. The full dataset of 4,400 expert-led concept videos is available for commercial AI training licensing. **For licensing inquiries contact:** Ankit Dubey — Head of AI Data Partnerships, Collegedunia ankit.dubey@collegedunia.com ## Dataset Curator [Collegedunia Web Private Limited](https://collegedunia.com) | [Prepp](https://prepp.in) Gurugram, Haryana, India
提供机构:
DataOrigin
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作