StopWords dataset: Integration of a set of stopwords in English and Portuguese - rev. 1
Indexed by NIAID Data Ecosystem on 2026-05-02
Download link:
https://zenodo.org/record/14176111
Description:
================================
StopWords dataset - rev. 1 (two MS-Excel files)
-------------
StopWords Integrated
Basic integration of a set of stopwords (English and Portuguese) for use in Text Mining tasks.
File name 1: StopWords_Integrated_Favaretto.xlsx
Tab 1 of MS-Excel: pt_accent (215 words), Column Label: stopwords_pt
Tab 2 of MS-Excel: pt_noaccent (208 words), Column Label: stopwords_pt_na
Tab 3 of MS-Excel: en (213 words), Column Label: stopwords_en
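The listing does not say how the pt_noaccent tab was derived from pt_accent. A minimal sketch, assuming the unaccented forms come from stripping diacritics: distinct accented words can collapse into the same unaccented word, which is one plausible reason a no-accent list would be shorter. The function name strip_accents and the sample words are illustrative and are not taken from the dataset files.

```python
import unicodedata


def strip_accents(word: str) -> str:
    """Return the word with combining diacritical marks removed."""
    # NFD splits each accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Illustrative Portuguese words only (hypothetical sample, not the dataset).
sample_pt = ["é", "e", "não", "às", "as", "você"]

# Deduplicate after stripping: "é"/"e" and "às"/"as" collapse into one entry each.
sample_pt_na = sorted({strip_accents(w) for w in sample_pt})

print(sample_pt_na)  # ['as', 'e', 'nao', 'voce']
```

In this sample, six accented entries reduce to four unaccented ones.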
-------------
StopWords Extended
Extension of a set of stopwords (English and Portuguese) for use in Text Mining tasks.
File name 2: StopWords_Extended_Favaretto.xlsx
Tab 1 of MS-Excel: pt_extend (614 words), Column Label: stopwords_pt_extend
Tab 2 of MS-Excel: en_extended (483 words), Column Label: stopwords_en_extend
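The tab names and column labels above are enough to load any of the lists for a Text Mining task. The following is a sketch rather than an official loader: it assumes each tab has its column label in the first row, that pandas and openpyxl are installed, and that the workbook has been downloaded locally. The helper names load_stopwords and remove_stopwords are hypothetical.

```python
from typing import List, Set


def load_stopwords(path: str, sheet: str, column: str) -> Set[str]:
    """Read one tab of a stopword workbook into a set of lowercase strings."""
    # Imported lazily: pandas is third-party, and reading .xlsx needs openpyxl.
    import pandas as pd

    frame = pd.read_excel(path, sheet_name=sheet)
    return {str(w).strip().lower() for w in frame[column].dropna()}


def remove_stopwords(tokens: List[str], stopwords: Set[str]) -> List[str]:
    """Keep only tokens whose lowercase form is not in the stopword set."""
    return [t for t in tokens if t.lower() not in stopwords]


# With the downloaded file in the working directory, a call might look like:
# stop_en = load_stopwords("StopWords_Extended_Favaretto.xlsx",
#                          "en_extended", "stopwords_en_extend")

# Demo with a tiny hand-written set, so the example runs without the file.
demo_stopwords = {"the", "of", "a"}
print(remove_stopwords("The integration of a set".split(), demo_stopwords))
# ['integration', 'set']
```

Holding the words in a set keeps each membership check constant-time, which matters when filtering large corpora.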
================================
Warning: Some words in this set of stopwords may even be misspelled intentionally, as they may occur in practice in texts that are not written correctly.
Aviso: Algumas palavras deste conjunto de stopwords podem até mesmo ter grafia errada de forma intencional, pois podem ocorrer na prática em textos não escritos corretamente.
================================
Source: compiled by Prof. Dr. José Eduardo Ricciardi Favaretto from a mix of several different sources
https://orcid.org/0000-0002-0143-0809
https://lattes.cnpq.br/3790103269421610
https://linkedin.com/in/favaretto
================================
Created:
2024-11-17



