SMS Spam Collection Data Set
Source: academictorrents.com (indexed 2025-03-21)
Download link:
https://academictorrents.com/details/25932ba42d983dd7b4474d8f59ab56cdc25d9107
Resource description:
Data Set Information: This corpus has been collected from sources on the Internet that are free or free for research:
-> A collection of 425 SMS spam messages was manually extracted from the Grumbletext Web site. This is a UK forum in which cell phone users make public claims about SMS spam messages, most of them without reporting the very spam message received. Identifying the text of spam messages in the claims is a very hard and time-consuming task, and it involved carefully scanning hundreds of web pages. The Grumbletext Web site is: [Web Link].
-> A subset of 3,375 randomly chosen SMS ham messages from the NUS SMS Corpus (NSC), which is a dataset of about 10,000 legitimate messages collected for research at the Department of Computer Science at the National University of Singapore. The messages largely originate from Singaporeans and mostly from students attending the University. These messages were collected from volunteers who were made aware that their contributions were
Provider:
academictorrents.com



