five

PAN23 Profiling Cryptocurrency Influencers with Few-shot Learning

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7616218
下载链接
链接失效反馈
官方服务:
资源简介:
This is the dataset for the shared task on Profiling Cryptocurrency Influencers with Few-shot Learning. Please consult the task's page for further details on the format, the dataset's creation, and links to baselines and utility code.   Task: In this shared task we aim to profile cryptocurrency influencers in social media, from a low-resource perspective. Moreover, we propose to categorize other related aspects of the influencers, also using a low-resource setting. Specifically, we focus on English Twitter posts for three different sub-tasks: Low-resource influencer profiling (subtask1): Input: 32 users per label with a maximum of 10 English tweets each. Classes: (1) null, (2) nano, (3) micro, (4) macro, (5) mega Official evaluation metric: Macro F1 Submission: TIRA. Baselines: User-character Logistic Regression; t5-large (bi-encoders) - zero shot [7], t5-large (label tuning) - few shot [7] Low-resource influencer interest identification (subtask2): Input: 64 users per label with 1 English tweet each. Classes: (1) technical information, (2) price update, (3) trading matters, (4) gaming, (5) other Official evaluation metric: Macro F1 Submission: TIRA. Baselines: User-character Logistic Regression; t5-large (bi-encoders) - zero shot [7], t5-large (label tuning) - few shot [7] Low-resource influencer intent identification (subtask3): Input: 64 users per label with 1 English tweets each. Classes: (1) subjective opinion, (2) financial information, (3) advertising, (4) announcement Official evaluation metric: Macro F1 Submission: TIRA. Baselines: User-character Logistic Regression; t5-large (bi-encoders) - zero shot [7], t5-large (label tuning) - few shot [7] Versioning:  1.0: initial upload 1.1 fixed a minor bug where some users contained some non-English text. Since English is the target language in the competition, all non-English texts have been replaced or removed.
创建时间:
2023-03-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作