five

StopWords dataset: Integration of a set of stopwords in English and Portuguese - rev. 1

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14176111
下载链接
链接失效反馈
官方服务:
资源简介:
StopWords dataset: Integration of a set of stopwords in English and Portuguese - rev. 1 ================================ StopWords dataset - rev. 1 (two MS-Excel files) -------------StopWords IntegratedBasic integration of a set of stopwords (English and Portuguese) for use in Text Mining tasks. File name 1: StopWords_Integrated_Favaretto.xlsx Tab 1 of MS-Excel: pt_accent (215 words)Column Label: stopwords_pt Tab 2 of MS-Excel: pt_noaccent (208 words)Column Label: stopwords_pt_na Tab 3 of MS-Excel: en (213 words)Column Label: stopwords_en -------------StopWords ExtendedExtension of a set of stopwords (English and Portuguese) for use in Text Mining tasks. File name 2: StopWords_Extended_Favaretto.xlsx Tab 1 of MS-Excel: pt_extend (614 words)Column Label: stopwords_pt_extend Tab 2 of MS-Excel: en_extended (483 words)Column Label: stopwords_en_extend ================================ Warning: Some words in this set of stopwords may even be misspelled intentionally, as they may occur in practice in texts that are not written correctly. Aviso: Algumas palavras deste conjunto de stopwords podem até mesmo ter grafia errada de forma intencional, pois podem ocorrer na prática em textos não escritos corretamente. ================================ Source: elaborated by Prof. Dr. José Eduardo Ricciardi Favaretto based on a mix of several different sources https://orcid.org/0000-0002-0143-0809https://lattes.cnpq.br/3790103269421610https://linkedin.com/in/favaretto ================================
创建时间:
2024-11-17
二维码
社区交流群
二维码
科研交流群
商业服务