five

NafisAhmed/ShilpoBangla

收藏
Hugging Face2026-03-18 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/NafisAhmed/ShilpoBangla
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 task_categories: - image-to-text language: - bn size_categories: - 1K<n<10K --- --- license: cc-by-4.0 task_categories: - image-to-text language: - bn size_categories: - 1K<n<10K pretty_name: ShilpoBangla --- # ShilpoBangla: A Bangla Image-Text Dataset for Cultural Heritage ## Dataset Description **ShilpoBangla** is a multimodal dataset consisting of culturally significant Bangladeshi product images paired with human-written Bangla alternative text descriptions. The dataset is designed to facilitate research in image captioning, multimodal learning, and low-resource language processing, particularly for Bangla. The dataset captures diverse aspects of Bangladeshi cultural heritage, enabling models to learn meaningful visual-text relationships grounded in cultural context. --- ## Dataset Summary - **Total samples:** 1,200 - **Number of categories:** 5 - **Image format:** JPG - **Annotation format:** CSV - **Language:** Bangla ### Categories - Traditional Clothing - Foods - Musical Instruments - Folk Arts and Crafts - Jewelry Each category contains **240 images**, ensuring a balanced distribution across classes. --- ## Data Fields Each entry in the dataset is represented in the CSV annotation file with the following fields: - **class_name** (`string`): Category label of the item - **image_path** (`string`): Relative path formatted as `class_name/image_name.jpg` - **alt_text_bn** (`string`): Human-written Bangla description The descriptions include details about: - visual appearance - material composition - cultural significance - usage context --- ## Data Collection Process The dataset was created using a combination of: - **Crowdsourced data collection** via Google Forms - **Direct field photography** conducted by the authors All textual descriptions were manually written in Bangla to ensure high-quality, culturally accurate annotations. --- ## Intended Uses This dataset can be used for: - Bangla Image Captioning - Image-to-Text Generation - Multimodal Representation Learning - Cultural AI and Digital Heritage Research --- ## Limitations - Limited to five predefined categories - Does not include regional dialect variations of Bangla - Potential bias due to data collection sources --- Citation If you use this dataset, please cite: @misc{s._m._nafis_ahmed_2026, author = { S. M. Nafis Ahmed and Syed Nafees Kaiser and Muhammad Nazrul Islam }, title = { ShilpoBangla (Revision 59f6ad3) }, year = 2026, url = { https://huggingface.co/datasets/NafisAhmed/ShilpoBangla }, doi = { 10.57967/hf/8064 }, publisher = { Hugging Face } } ## Acknowledgements We thank all contributors who provided images and descriptions, and acknowledge the effort involved in preserving and digitizing Bangladeshi cultural heritage.
提供机构:
NafisAhmed
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作