five

deshanksuman/Swabhasha_RomanizedSinhala_Dataset

收藏
Hugging Face2024-03-12 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/deshanksuman/Swabhasha_RomanizedSinhala_Dataset
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - si metrics: - accuracy - bleu tags: - Romanized Sinhala - Sinhala - Transliteration --- --- # Model Card for Model ID This Repo is about Romanized Sinhala to Sinhala Transliteration using the Ngram and Rule Base Model. ### Model Description This dataset is capable of handling short-hand typing(Adhoc Transliteration). eg Input: khmda Output : කොහොමද If you are using this work: Kindly cite : T. G. D. K. Sumanathilaka, R. Weerasinghe and Y. H. P. P. Priyadarshana, "Swa-Bhasha: Romanized Sinhala to Sinhala Reverse Transliteration using a Hybrid Approach," 2023 3rd International Conference on Advanced Research in Computing (ICARC), Belihuloya, Sri Lanka, 2023, pp. 136-141, doi: 10.1109/ICARC57651.2023.10145648. keywords: {Terminology;Social networking (online);Computational modeling;Knowledge based systems;Message services;Data structures;Data models;Romanized Sinhala;Transliteration;Tri-gram;Rule-based;Prediction;Suggestion}, The datasets used can be accessed Through : https://github.com/Sumanathilaka/Swa-Bhasha-Sinhala-Singlish-Dataset - **Developed by:** Deshan sumanathilaka - **Model type:** Ngram Model - **Language(s) (NLP):** Python - **License:** IEEE - **Paper [optional]:** T. G. D. K. Sumanathilaka, R. Weerasinghe and Y. H. P. P. Priyadarshana, "Swa-Bhasha: Romanized Sinhala to Sinhala Reverse Transliteration using a Hybrid Approach," 2023 3rd International Conference on Advanced Research in Computing (ICARC), Belihuloya, Sri Lanka, 2023, pp. 136-141, doi: 10.1109/ICARC57651.2023.10145648. keywords: {Terminology;Social networking (online);Computational modeling;Knowledge based systems;Message services;Data structures;Data models;Romanized Sinhala;Transliteration;Tri-gram;Rule-based;Prediction;Suggestion}, - **Demo [optional]:** https://youtu.be/w6kdIDzoov4 ## Uses Romanized Sinhala to Sinhala Transliteration ## How to Get Started with the Model Download all the files from the repo. You can open Transliterator.py file Call the function triGramTranslate(inputStr) with an input String in Romanized Sinhala. ## Model Card Authors [optional] Deshan Sumanathilaka ## Model Card Contact deshankoshala@gmail.com
提供机构:
deshanksuman
原始信息汇总

数据集概述

基本信息

  • 语言:
    • si
  • 评估指标:
    • 准确率
    • BLEU
  • 标签:
    • Romanized Sinhala
    • Sinhala
    • Transliteration

模型描述

  • 模型类型: Ngram模型
  • 开发语言: Python
  • 许可证: IEEE

数据集用途

  • 主要用途: Romanized Sinhala到Sinhala的转写

如何开始使用模型

  • 下载所有文件。
  • 打开Transliterator.py文件。
  • 调用函数triGramTranslate(inputStr),传入罗马化的Sinhala字符串作为输入。

开发者信息

  • 开发者: Deshan Sumanathilaka
  • 联系方式: deshankoshala@gmail.com
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作