bogdanrivera/legal_civiles_oaxaca_llama_unsloth_template
收藏Hugging Face2025-10-12 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/bogdanrivera/legal_civiles_oaxaca_llama_unsloth_template
下载链接
链接失效反馈官方服务:
资源简介:
这是一个完整的、为大型语言模型(LLM)微调(SFT)而优化的墨西哥瓦哈卡州民法典数据集。数据集不仅包含法律条文的文本,还经过了深度清洗和增强,包括通过LLM生成合成问题(概念性、摘要等)以及使用正则表达式进行最终标记,以提高模型训练的准确性。数据集使用LLM模型进行微调,并遵循对话式的数据结构。
This is a complete and enriched dataset of the Civil Code of the State of Oaxaca, Mexico, formatted specifically for fine-tuning (SFT) of large language models (LLMs). The dataset not only contains the text of the legal articles but has also been deeply cleaned and augmented, including the generation of synthetic questions (conceptual, summary, etc.) by LLMs and final tagging with regular expressions to improve the accuracy of the model during training. The dataset uses LLM models for fine-tuning and follows a conversational data structure.
提供机构:
bogdanrivera



