five

BrajText-Saar

收藏
Mendeley Data2026-04-09 收录
下载链接:
https://data.mendeley.com/datasets/pg624k2rky/1
下载链接
链接失效反馈
官方服务:
资源简介:
Cultural texts provide a profound understanding of the emotions, social values, and symbolic representations of communities. India is one of the most culturally and linguistically diverse nations. Computational research often overlooks cultural texts due to a lack of structured digital resources. Braj, one of the Indian regional languages, remains unexplored for text analytics. The raw unprocessed data is collected from the Manmandir Santhans website which promotes Braj Language. The Braj region’s Holi festival and the related stories of Radha and Krishna are the main subjects of the dataset. A hybrid pre-processing technique were implemented to identify Stopwords, special characters, and Numbers. The final developed BrajText-Saar is cleaned and pre-processed suitable for further cultural text analytics, pattern mining and further natural language processing tasks.
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作