tanaymehta/labelled_regex
收藏Hugging Face2025-12-16 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/tanaymehta/labelled_regex
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含正则表达式及其描述性标签。据我所知,这是该平台上最大且标签清晰的正则表达式数据集。数据集是通过获取[innovatorved/regex_dataset]并使用[gemma-3-27b-it] LLM为每个正则表达式生成简洁合适的标题而构建的。对于每个超过100个字符的正则表达式,使用了稍微不同的提示来生成更详细的描述。
This dataset consists of Regexes and their descriptive labels. As far as I am aware, this is the largest, cleanly labelled regex dataset on this platform. I constructed this dataset by taking [innovatorved/regex_dataset] and using [gemma-3-27b-it] LLM to generate a concise and suitable title for each regex. For each regex that was larger than 100 characters, I used a slightly different prompt to generate an even more detailed description.
提供机构:
tanaymehta



