NIST GenAI Code Pilot Evaluation: Scoring Software
Source: NIST Chemistry WebBook. Updated: 2025-12-11. Indexed: 2026-03-14.
Download link:
https://data.nist.gov/od/id/mds2-3858
Resource description:
NIST Generative AI (GenAI) is a new evaluation program administered by the NIST Information Technology Laboratory to assess generative AI technologies developed by the research community worldwide. NIST GenAI is an umbrella program that supports a range of evaluations for research and measurement science in generative AI by providing a platform for test and evaluation. One component, NIST GenAI Code, currently asks: can AI reliably generate code for testing software? This item is the open-source software used to score NIST GenAI Code submissions. The GenAI Code Pilot software is available on GitHub under the USNISTGOV organization, in the repository "test_code_eval".



