Plasmoxy/Discord-Unvelied-Extracted-Backup
收藏Hugging Face2025-09-12 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/Plasmoxy/Discord-Unvelied-Extracted-Backup
下载链接
链接失效反馈官方服务:
资源简介:
Discord Unveiled - Filtered Dataset是一个包含来自Discord Unveiled数据集的表面过滤和处理后的消息数据。该数据集经过转换为CSV格式、移除机器人消息、过滤掉只含URL、提及、频道或Discord表情的消息,并且通过FastText语言模型过滤非英语消息。数据集包含时间戳、用户ID、用户名和消息内容等字段。
The Discord Unveiled - Filtered Dataset is a collection of superficially filtered and processed message data from the Discord Unveiled dataset. The dataset has been converted to CSV format, with bot messages removed, messages containing only URLs, mentions, channels, or Discord emojis filtered out, and non-English messages filtered using a FastText language identification model. It includes fields such as timestamp, user ID, username, and message content.
提供机构:
Plasmoxy



