deshanksuman/Swabhasha_RomanizedSinhala_Dataset
收藏Hugging Face2024-03-12 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/deshanksuman/Swabhasha_RomanizedSinhala_Dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- si
metrics:
- accuracy
- bleu
tags:
- Romanized Sinhala
- Sinhala
- Transliteration
---
---
# Model Card for Model ID
This Repo is about Romanized Sinhala to Sinhala Transliteration using the Ngram and Rule Base Model.
### Model Description
This dataset is capable of handling short-hand typing(Adhoc Transliteration).
eg
Input: khmda
Output : කොහොමද
If you are using this work:
Kindly cite :
T. G. D. K. Sumanathilaka, R. Weerasinghe and Y. H. P. P. Priyadarshana, "Swa-Bhasha: Romanized Sinhala to Sinhala Reverse Transliteration using a Hybrid Approach," 2023 3rd International Conference on Advanced Research in Computing (ICARC), Belihuloya, Sri Lanka, 2023, pp. 136-141, doi: 10.1109/ICARC57651.2023.10145648. keywords: {Terminology;Social networking (online);Computational modeling;Knowledge based systems;Message services;Data structures;Data models;Romanized Sinhala;Transliteration;Tri-gram;Rule-based;Prediction;Suggestion},
The datasets used can be accessed Through :
https://github.com/Sumanathilaka/Swa-Bhasha-Sinhala-Singlish-Dataset
- **Developed by:** Deshan sumanathilaka
- **Model type:** Ngram Model
- **Language(s) (NLP):** Python
- **License:** IEEE
- **Paper [optional]:**
T. G. D. K. Sumanathilaka, R. Weerasinghe and Y. H. P. P. Priyadarshana, "Swa-Bhasha: Romanized Sinhala to Sinhala Reverse Transliteration using a Hybrid Approach," 2023 3rd International Conference on Advanced Research in Computing (ICARC), Belihuloya, Sri Lanka, 2023, pp. 136-141, doi: 10.1109/ICARC57651.2023.10145648. keywords: {Terminology;Social networking (online);Computational modeling;Knowledge based systems;Message services;Data structures;Data models;Romanized Sinhala;Transliteration;Tri-gram;Rule-based;Prediction;Suggestion},
- **Demo [optional]:** https://youtu.be/w6kdIDzoov4
## Uses
Romanized Sinhala to Sinhala Transliteration
## How to Get Started with the Model
Download all the files from the repo.
You can open Transliterator.py file
Call the function triGramTranslate(inputStr) with an input String in Romanized Sinhala.
## Model Card Authors [optional]
Deshan Sumanathilaka
## Model Card Contact
deshankoshala@gmail.com
提供机构:
deshanksuman
原始信息汇总
数据集概述
基本信息
- 语言:
- si
- 评估指标:
- 准确率
- BLEU
- 标签:
- Romanized Sinhala
- Sinhala
- Transliteration
模型描述
- 模型类型: Ngram模型
- 开发语言: Python
- 许可证: IEEE
数据集用途
- 主要用途: Romanized Sinhala到Sinhala的转写
如何开始使用模型
- 下载所有文件。
- 打开Transliterator.py文件。
- 调用函数triGramTranslate(inputStr),传入罗马化的Sinhala字符串作为输入。
开发者信息
- 开发者: Deshan Sumanathilaka
- 联系方式: deshankoshala@gmail.com



