jackboyla/gone_and_growned_my_own_dataset
收藏数据集概述
许可证
- Apache 2.0
数据规模
- 1K < n < 10K
数据集信息
- 特征:
id:int64text:字符串序列tokenized_text:字符串序列的序列model_name:字符串instruction:字符串ents:列表的列表head:字符串序列tail:字符串序列
generation:字符串序列ner:字符串序列的序列__index_level_0__:int64
数据分割
- 训练集:
- 字节数:1148647044
- 样本数:26688
数据大小
- 下载大小:141700312
- 数据集大小:1148647044
配置
- 默认配置:
- 数据文件:
- 分割:训练
- 路径:data/train-*
- 数据文件:
标签
- 合成数据
数据结构示例(默认配置)
json { "ents": { "head": ["41", "42", "PRODUCT", "IPhone"], "tail": ["0", "5", "DATE", "This time of the year"] }, "generation": "NO_RELATION", "instruction": "You are a fantastic relation extraction model who only outputs valid JSON. Extract the relation between the given entities using the context in the below text. If no relation exists, use the label "NO_RELATION". ONLY RETURN THE RELATION LABEL. Pay VERY close attention to which entity is the head and tail; this dictates the direction of the relationship.
Entities: {head: [41, 42, PRODUCT, IPhone], tail: [0, 5, DATE, This time of the year]}
Text: This time of the year... ...my heart sings, my energy level pumps up and I feel inspired. Does that happen to you too? Just thought I would pop in to share some quick IPhone photos that I have taken in the last few days. I love playing with my lilacs. If you have followed along these last four years, you know that our lilacs are family blooms... some were my grandmothers, Mr. Fleas grandmothers and the deep double French were my Moms. The dogwoods this past week were stunning. We have just one pink...the rest are white. And...of course, I love using my vintage, rustic treasures to hold the all the beautiful blossoms we are lucky enough to have on the property. My fascination with Instagram continues, but my son Dan will be happy to know that I have been using my Canon big girl camera again and now that we have replaced my 8 year old laptop (may she rest in peace) with a new one that is fast and has lots of storage... I hope to blog a bit more often as well. Wishing you a lovely spring! I cant wait to get back up to the lake!", "model_name": "mistralai/Mistral-7B-Instruct-v0.2", "text": "This time of the year... ...my heart sings, my energy level pumps up and I feel inspired. Does that happen to you too? Just thought I would pop in to share some quick IPhone photos that I have taken in the last few days. I love playing with my lilacs. If you have followed along these last four years, you know that our lilacs are family blooms... some were my grandmothers, Mr. Fleas grandmothers and the deep double French were my Moms. The dogwoods this past week were stunning. We have just one pink...the rest are white. And...of course, I love using my vintage, rustic treasures to hold the all the beautiful blossoms we are lucky enough to have on the property. My fascination with Instagram continues, but my son Dan will be happy to know that I have been using my Canon big girl camera again and now that we have replaced my 8 year old laptop (may she rest in peace) with a new one that is fast and has lots of storage... I hope to blog a bit more often as well. Wishing you a lovely spring! I cant wait to get back up to the lake!", "tokenized_text": ["This", "time", "of", "the", "year", "...", " ", "...", "my", "heart", "sings", ",", "my", "energy", "level", "pumps", "up", "and", "I", "feel", "inspired", ".", " ", "Does", "that", "happen", "to", "you", "too", "?", " ", "Just", "thought", "I", "would", "pop", "in", "to", "share", "some", "quick", "IPhone", "photos", " ", "that", "I", "have", "taken", "in", "the", "last", "few", "days", ".", " ", "I", "love", "playing", "with", "my", "lilacs", ".", " ", "If", "you", "have", "followed", "along", "these", "last", "four", "years", ",", " ", "you", "know", "that", "our", "lilacs", "are", "family", "blooms", "...", " ", "some", "were", "my", "grandmothers", ",", "Mr.", "Flea", "grandmothers", " ", "and", "the", "deep", "double", "French", "were", "my", "Moms", ".", " ", "The", "dogwoods", "this", "past", "week", "were", "stunning", ".", " ", "We", "have", "just", "one", "pink", "...", "the", "rest", "are", "white", ".", " ", "And", "...", "of", "course", ",", "I", "love", "using", "my", "vintage", ",", "rustic", "treasures", " ", "to", "hold", "the", "all", "the", "beautiful", "blossoms", " ", "we", "are", "lucky", "enough", "to", "have", "on", "the", "property", ".", " ", "My", "fascination", "with", "Instagram", "continues", ",", " ", "but", "my", "son", "Dan", "will", "be", "happy", "to", "know", " ", "that", "I", "have", "been", "using", "my", "Canon", "big", "girl", "camera", "again", " ", "and", "now", "that", "we", "have", "replaced", "my", "8", "year", "old", "laptop", " ", "(", "may", "she", "rest", "in", "peace", ")", " ", "with", "a", "new", "one", "that", "is", "fast", "and", "has", "lots", "of", "storage", "...", " ", "I", "hope", "to", "blog", "a", "bit", "more", "often", "as", "well", ".", " ", "Wishing", "you", "a", "lovely", "spring", "!", " ", "I", "ca", "nt", "wait", "to", "get", "back", "up", "to", "the", "lake", "!"] }
加载数据集
python from datasets import load_dataset
ds = load_dataset("jackboyla/gone_and_growned_my_own_dataset", "default")
或简化为: python from datasets import load_dataset
ds = load_dataset("jackboyla/gone_and_growned_my_own_dataset")



