five

nisheeth/English-GujaratiBrailleCorpus

收藏
Hugging Face2024-04-22 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/nisheeth/English-GujaratiBrailleCorpus
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - translation size_categories: - 100M<n<1B --- ### 446280 parallel aligned sentences for English to Gujarati Braille Language Pair #### This work is supported by the funding received from the Anusandhan National Research Foundation (previously SERB), Government of India for the project “Development of English to Bharti Braille Machine Assisted Translation System” through grant no. CRG/2020/004246 ## Project Sumary Life is tough for people who are blind or deafblind. Braille is a medium through which they can read the text available in the world. Unfortunately, due to the lack of resources that can transcribe textual material into braille, they are deprived of a lot of knowledge available in books. This project would help address this issue. We wish to develop a system that can transcribe the text available in English into Bharati Braille. We have specifically chosen Bharati Braille as it is a unified braille script for writing the languages of India. We would be developing a hybrid Machine Assisted Translation System, which can translate English Text into Bharti Braille. For this, we would be developing a domain-specific corpus for the English-Hindi language pair (for domains like Art and Culture), which will be followed by the development of the machine translation system and its evaluation. This research would be helpful to people in our country who have a desire to learn and gain knowledge but are unable to do so due to the unavailability of literature in Bharti Braille. This system would be critically important to the lives of visually impaired people because the ability to read and write in braille opens the door to literacy, intellectual freedom, equal opportunity, and personal security for them. more details can be found at: https://prism.serbonline.in/SRPIProfile/MTAxMDQ5MTQ= International Standard Langauge Resources Number (ISLRN) : 628-503-908-669-4 (https://islrn.org/resources/628-503-908-669-4/)
提供机构:
nisheeth
原始信息汇总

数据集概述

基本信息

  • 许可证: MIT
  • 任务类别: 翻译
  • 大小类别: 100M<n<1B

数据集内容

  • 描述: 包含446280对平行对齐的句子,用于英语到古吉拉特盲文语言对的翻译。

项目背景与目标

  • 支持机构: 由印度政府Anusandhan国家研究基金会(原SERB)资助。
  • 项目名称: “Development of English to Bharti Braille Machine Assisted Translation System”
  • 项目编号: CRG/2020/004246
  • 目标: 开发一个系统,能够将英文文本转录成Bharti Braille,这是一种印度语言的统一盲文脚本。
  • 应用领域: 特别关注艺术和文化领域。
  • 预期影响: 帮助视觉障碍人士通过盲文获取知识和学习机会,提高他们的识字率、智力自由、平等机会和个人安全。

国际标准语言资源编号

  • ISLRN: 628-503-908-669-4
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作