five

LipBengal Dataset

收藏
Figshare2024-06-28 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/LipBengal_Dataset/26008285/1
下载链接
链接失效反馈
官方服务:
资源简介:
We introduce the "LipBengal" dataset, marking a significant advancement in the field of Bengali lip-reading and visual speech recognition research. This dataset addresses a critical gap in the research landscape. Despite Bengali being the seventh most spoken language globally, with over 265 million speakers, it has been largely underrepresented in this domain.LipBengal offers a comprehensive resource for researchers, comprising visual data from 150 speakers across 73 classes, covering Bengali phonemes, alphabets, and symbols. Captured under diverse and uncontrolled conditions, LipBengal is the most extensive Bengali lip-reading dataset to date. Detailed annotations, ranging from phoneme-level classifications to full sentence constructions, further enhance its value. The dataset's thorough coverage of Bengali phonemes captures the nuances of lip movements associated with distinct sounds.This rich resource holds promise for training accurate lip-reading models, with applications in accessibility improvements, enhanced speech recognition, silent speech interfaces, and linguistic research. The diversity in speaker backgrounds ensures broader representability of Bengali pronunciation patterns, while meticulous annotation and curation processes guarantee quality and reliability. LipBengal is a valuable asset for researchers and developers working in Bengali lip-reading and visual speech recognition.<br>The LipBengal dataset can be accessed through:<br>https://drive.google.com/drive/folders/1CgOg35Cfs3H6-vHmG11LDt0qlS6q--jt?usp=sharing
提供机构:
Shahed, Md. Tanvir Rahman; Wahed, Md. Abdul; Kundu, Manab Kumar; Nyeem, Hussain; Aronno, Md.Tanjil Islam; Sadeef, Jane Alam; Islam, R Rafiul; Ahsan, Tashrif
创建时间:
2024-06-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作