five

SYNCode (Case Study Datasets)

收藏
DataCite Commons2025-05-01 更新2025-09-08 收录
下载链接:
https://figshare.com/articles/dataset/SYNCode_Case_Study_Datasets_/28779662/2
下载链接
链接失效反馈
官方服务:
资源简介:
SYNCode is a collaborative annotation framework that combines human expertise with large language models (LLMs) to improve the quality of annotations for complex, code-centric datasets like Stack Overflow. The system integrates TF-IDF filtering, advanced transformer models (NLP Transformer and UniXcoder), and iterative human refinement to enhance annotation accuracy and reduce bias. A prototype interface enables real-time human-LLM collaboration, demonstrating strong potential for improving annotation reliability in software engineering and NLP applications.
提供机构:
figshare
创建时间:
2025-04-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作