Classifier Performance Ranking.

Figshare2026-01-09 更新2026-04-28 收录

下载链接：

https://figshare.com/articles/dataset/_p_Classifier_Performance_Ranking_p_/31039129

下载链接

链接失效反馈

官方服务：

资源简介：

This study addresses the challenge of distinguishing human translations from those generated by Large Language Models (LLMs) by utilizing dependency triplet features and evaluating 16 machine learning classifiers. Using 10-fold cross-validation, the SVM model achieves the highest mean F1-score of 93%, while all other classifiers consistently differentiate between human and machine translations. SHAP analysis helps identify key dependency features that distinguish human and machine translations, improving our understanding of how LLMs produce translationese. The findings provide practical insights for enhancing translation quality assessment and refining translation models across various languages and text genres, contributing to the advancement of natural language processing techniques. The dataset and implementation code of our study are available at: https://github.com/KiemaG5/LLM-translationese.

创建时间：

2026-01-09

5,000+

优质数据集

54 个

任务类型

进入经典数据集