
NIST GenAI Code Pilot Evaluation: Scoring Software

NIST Chemistry WebBook | Updated 2025-12-11 | Indexed 2026-03-14
Download link:
https://data.nist.gov/od/id/mds2-3858
Description:
NIST Generative AI (GenAI) is a new evaluation program administered by the NIST Information Technology Laboratory to assess generative AI technologies developed by research groups around the world. It is an umbrella program that provides a test and evaluation platform supporting a range of evaluations for research and measurement science in generative AI. One component, NIST GenAI Code, currently addresses the question: can AI reliably generate code for testing software? This item is the open-source software used to score NIST GenAI Code submissions. The GenAI Code Pilot software is available on GitHub under the USNISTGOV organization, in the repository "test_code_eval".