Stoplists for African languages generated from the ASP corpus
收藏SSH Open MarketPlace2022-11-03 更新2024-08-03 收录
下载链接:
https://marketplace.sshopencloud.eu/dataset/nF5h0a
下载链接
链接失效反馈官方服务:
资源简介:
This project uses the source texts provided by the African Storybook Project as a corpus and provides a number of tools to extract frequency lists and lists of stopwords from this corpus for the 60+ languages covered by ASP. The immediate goal is to create freely-licensed stoplists for languages that are not currently included in the stopwords project, and to then submit those lists as pull requests to the upstream project.
创建时间:
2022-11-03



