csariyildiz/turkish-wordlist
Source: Hugging Face. Updated: 2024-12-26. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/csariyildiz/turkish-wordlist
Description:
This is a Turkish word list derived from Wikipedia text, containing 2,510,327 words. The list is a UTF-8 encoded CSV file with a header row. The words were obtained by processing the text of approximately 500,000 Wikipedia articles. They contain Turkish letters and quotation mark characters, contain no English letters, and have had older Turkish characters replaced. All words are lowercase and at most 30 characters long.
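As a rough illustration of the format described above (a UTF-8 CSV with a header row, lowercase words of at most 30 characters), the following minimal sketch reads such a file and checks those two properties. The header name `word`, the assumption that the word sits in the first column, and the helper names `load_words` and `matches_description` are all hypothetical; the listing does not state the file's actual column layout or file name.

```python
import csv
import io

MAX_LEN = 30  # maximum word length stated in the description


def load_words(lines, column=0):
    """Read words from CSV lines, skipping the header row.

    Assumes the word is in the first column (not confirmed by the listing).
    """
    reader = csv.reader(lines)
    next(reader, None)  # skip the header row
    return [row[column] for row in reader if row]


def matches_description(word):
    """Check the stated properties: non-empty, no uppercase, at most 30 chars."""
    return 0 < len(word) <= MAX_LEN and not any(ch.isupper() for ch in word)


# Hypothetical in-memory sample standing in for the real file.
sample = io.StringIO("word\nağaç\nışık\nİstanbul\n")
words = load_words(sample)
valid = [w for w in words if matches_description(w)]
```

For the real file, the same function could be given a handle opened with `open(path, encoding="utf-8", newline="")`, where `path` is wherever the downloaded CSV was saved.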
Provider:
csariyildiz



