five

Gold dataset for checking transliteration output from ChatGPT 4.0 and local tool

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13318808
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset includes 385 names and short phrases in Ancient and Modern Greek. This dataset has been used as gold dataset to evaluate the transliteration output from ChatGPT 4.0 and a local transliteration tool developed by the 'International Hellenic University, Department of Information and Electronic Engineering' and 'Open Knowledge Greece'. On the 13th-14th August 2024, both ChatGPT and local tool were tested against two well-known standards, i.e, the ALA-LC Romanization for Greek and the ISO 843 standard.  The Gold dataset includes the names and short phrases in Ancient and Modern Greek, the expected output, the ChatGPT output, and the local tool output. For evaluating the results the EXACT excel formula was used. The results of the evaluation are also included in the dataset.  The Gold dataset complements a manuscript submitted to the Metadata and Semantics Research (MTSR'24) conference.
创建时间:
2024-08-14
二维码
社区交流群
二维码
科研交流群
商业服务