raymondzmc/stackoverflow_Llama-3.2-1B-Instruct_vocab_2000_last
收藏Hugging Face2025-12-16 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/raymondzmc/stackoverflow_Llama-3.2-1B-Instruct_vocab_2000_last
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含用于自然语言处理任务的结构化数据,特征包括ID、上下文文本、下一个单词、下一个单词的logits、输入嵌入、词袋表示和标签。数据集包含20,000个训练样本,可能用于下一个单词预测或文本生成等任务。
This dataset contains structured data for natural language processing tasks, with features including ID, context text, next word, next word logits, input embeddings, bag-of-words representation, and labels. The dataset includes 20,000 training examples and is likely used for tasks such as next-word prediction or text generation.
提供机构:
raymondzmc



