chenle015/OpenMP_Question_Answering
收藏Hugging Face2023-08-29 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/chenle015/OpenMP_Question_Answering
下载链接
链接失效反馈官方服务:
资源简介:
---
license: bsd
---
# OpenMP Question Answering Dataset
OpenMP Question Answering Dataset is a new OpenMP question answering introduced in paper "LM4HPC: Towards Effective Language Model Application in High-Performance Computing".
It is designed to probe the capabilities of language models in single-turn interactions with users. Similar to other QA datasets, we include
some request-response pairs which are not strictly question-answering pairs. The categories and examples of questions in the OMPQA dataset can be found in below table.
| **Category** | **Count** | **Example Questions** |
| ------------ | --------- | --------------------- |
| Basics | 40 | What is a worksharing construct in OpenMP? |
| Examples | 20 | Give an example OpenMP C code for computing PI using numerical integration. |
| Compilers | 24 | In what language is LLVM written? <br> How is a parallel region represented in Clang? |
| Benchmarks | 23 | What are the NAS Parallel benchmarks? <br> Which benchmark assesses data race detection tools? |
# Data Usage
The dataset is provided in a CSV file, with each entry in the CSV table representing a pair of question and answer.
# Contribute
Welcome to join us and become a contributor to this project! If you want to share some datasets, put them in csv file and email to lechen AT iastate.edu. Thank you!
# Citation
If you use the data collection, code, or experimental findings in this repository, please cite our IWOMP paper:
@article{chen2023lm4hpc,
title={LM4HPC: Towards Effective Language Model Application in High-Performance Computing},
author={Chen, Le and Lin, Pei-Hung and Vanderbruggen, Tristan and Liao, Chunhua and Emani, Murali and de Supinski, Bronis},
journal={arXiv preprint arXiv:2306.14979},
year={2023}
}
提供机构:
chenle015
原始信息汇总
OpenMP Question Answering Dataset
概述
OpenMP Question Answering Dataset 是一个新引入的 OpenMP 问答数据集,该数据集在论文 "LM4HPC: Towards Effective Language Model Application in High-Performance Computing" 中被介绍。该数据集旨在测试语言模型在单轮用户交互中的能力。与其他问答数据集类似,该数据集包含一些请求-响应对,这些对并不严格是问答对。
数据集分类及示例
数据集中的问题分为以下几类,并提供了相应的示例问题:
| Category | Count | Example Questions |
|---|---|---|
| Basics | 40 | What is a worksharing construct in OpenMP? |
| Examples | 20 | Give an example OpenMP C code for computing PI using numerical integration. |
| Compilers | 24 | In what language is LLVM written? <br> How is a parallel region represented in Clang? |
| Benchmarks | 23 | What are the NAS Parallel benchmarks? <br> Which benchmark assesses data race detection tools? |
数据格式
数据集以 CSV 文件形式提供,每个条目在 CSV 表中表示一对问答。
引用
如果您使用该数据集、代码或实验结果,请引用以下论文:
plaintext @article{chen2023lm4hpc, title={LM4HPC: Towards Effective Language Model Application in High-Performance Computing}, author={Chen, Le and Lin, Pei-Hung and Vanderbruggen, Tristan and Liao, Chunhua and Emani, Murali and de Supinski, Bronis}, journal={arXiv preprint arXiv:2306.14979}, year={2023} }



