five

carecodeconnect/jhana-sentences

收藏
Hugging Face2024-03-05 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/carecodeconnect/jhana-sentences
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 task_categories: - text-generation language: - en --- # Dataset Card for "Jhana Sentences" ## Table of Contents - [Dataset Description](#dataset-description) - [Dataset Summary](#dataset-summary) - [Supported Tasks](#supported-tasks) - [Languages](#languages) - [Dataset Structure](#dataset-structure) - [Data Instances](#data-instances) - [Data Fields](#data-fields) - [Data Splits](#data-splits) - [Usage](#usage) - [Citation](#citation) - [Contact](#contact) ## Dataset Description ### Dataset Summary This dataset, named "Jhana Sentences," contains sentences and passages focused on Jhana meditation practices, teachings, and insights. It is intended for use in training language models for applications related to meditation guidance, spiritual advice, and related conversational agents. ![Semantic Similarity Heatmap](images/semantic_similarity_heatmap.png) ### Supported Tasks - `text-generation`: The dataset can be used to train models for generating meditation-related content. - `language-modeling`: Suitable for training models to understand context and semantics in meditation and mindfulness contexts. ### Languages The text in the dataset is primarily in English and Pali. ## Dataset Structure ### Data Instances A data instance in "Jhana Sentences" dataset might look as follows: ``` It's how quickly can I generate access concentration? Once I've got access concentration, then, yeah, the first jhana is mine as soon as I want it. ``` ### Data Splits This dataset is provided as a single file without explicit training/validation/test splits. Users are encouraged to create splits as needed for their specific tasks. ## Usage This dataset is suitable for training language models that require an understanding of meditation-related discourse. Example applications include conversational agents providing meditation guidance and systems generating content on meditation topics. ## Citation Please cite this dataset as: ```bibtex @misc{jhana_sentences_dataset, author = {carecodeconnect}, title = {Jhana Sentences: A Dataset for Meditation-Focused Language Models}, year = {2024}, publisher = {Hugging Face}, journal = {Hugging Face Dataset Hub}, } ```
提供机构:
carecodeconnect
原始信息汇总

数据集卡片 "Jhana Sentences"

数据集描述

数据集概述

"Jhana Sentences" 数据集包含专注于 Jhana 冥想实践、教义和洞见的句子和段落。该数据集旨在用于训练与冥想指导、精神建议及相关对话代理相关的语言模型。

支持的任务

  • text-generation: 该数据集可用于训练生成冥想相关内容的模型。
  • language-modeling: 适用于训练模型以理解冥想和正念语境中的上下文和语义。

语言

数据集中的文本主要为英语和巴利语。

数据集结构

数据实例

"Jhana Sentences" 数据集中的一个数据实例可能如下所示:

Its how quickly can I generate access concentration? Once Ive got access concentration, then, yeah, the first jhana is mine as soon as I want it.

数据分割

该数据集以单个文件形式提供,未明确划分训练/验证/测试集。用户可根据具体任务需要自行创建分割。

使用

该数据集适用于训练需要理解冥想相关论述的语言模型。示例应用包括提供冥想指导的对话代理和生成冥想主题内容的系统。

引用

请按如下方式引用该数据集:

bibtex @misc{jhana_sentences_dataset, author = {carecodeconnect}, title = {Jhana Sentences: A Dataset for Meditation-Focused Language Models}, year = {2024}, publisher = {Hugging Face}, journal = {Hugging Face Dataset Hub}, }

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作