five

BIOSNAP_DDI_and_genes_data

收藏
DataCite Commons2025-06-01 更新2024-08-18 收录
下载链接:
https://figshare.com/articles/dataset/BIOSNAP_DDI_and_genes_data/23600565/1
下载链接
链接失效反馈
资源简介:
<strong># NamE Datasets</strong><br> Jeffrey Seathrún Sardina<br> <br> <strong>## Information and Use</strong><br> Datasets for the paper "NamE: Named Graph Embeddings for Context Modelling and Expert Knowledge Integration in Knowledge Graph Embeddings". Datasets are given with all variant forms, and with the original train-test-valid spits used in the paper. Train, test, and valid files are marked as such in the file prefix (e.g. "*-train.csv.gz").<br> <br> Datasets beginning in "triples-" contain triples in a comma-separated format; i.e.<br> ```<br> subject,predicate,object<br> subject,predicate,object<br> subject,predicate,object<br> ```<br> <br> Datasets beginning in "quads-" contain quads in a comma-separated format; i.e.<br> ```<br> subject,predicate,object,namedGraph<br> subject,predicate,object,namedGraph<br> subject,predicate,object,namedGraph<br> ```<br> <br> All datasets are gzipped; on most systems (Linux / MAC) you can uncompress these with "gunzip (file)" if needed.<br> <br> <strong>## Citation</strong><br> If you use these datasets or NamE in your work, please cite "NamE: Named Graph Embeddings for Context Modelling and Expert Knowledge Integration in Knowledge Graph Embeddings"<br> ```<br> BIBTEX citation pending<br> ```<br> <br> Included below are citations to the data sources from which these datasets were created. Note that the triples form of FB15K-237, as it appears in this repository, is unmodified from its original version.<br> <br> <strong>**BIOSNAP dataset citation:**</strong><br> ```<br> @misc{biosnap,<br> author = {Marinka Zitnik, Rok Sosi\v{c}, Sagar Maheshwari, and Jure Leskovec},<br> title = {{BioSNAP Datasets}: {Stanford} Biomedical Network Dataset Collection},<br> howpublished = {\url{http://snap.stanford.edu/biodata}},<br> month = aug,<br> year = 2018<br> }<br> ```

<strong># NamE数据集</strong><br> Jeffrey Seathrún Sardina<br> <br> <strong>## 数据集信息与使用规范</strong><br> 本数据集对应论文《NamE: Named Graph Embeddings for Context Modelling and Expert Knowledge Integration in Knowledge Graph Embeddings》。数据集包含所有变体形式,并附带论文中使用的原始训练集(train set)、测试集(test set)与验证集(valid set)划分方案。训练、测试与验证文件会在文件名前缀中进行标注(例如"*-train.csv.gz")。<br> <br> 文件名以"triples-"开头的数据集采用逗号分隔格式存储三元组(triple),格式如下:<br> <br> subject,predicate,object<br> subject,predicate,object<br> subject,predicate,object<br> <br> 其中`subject`为头实体,`predicate`为关系,`object`为尾实体。<br> <br> 文件名以"quads-"开头的数据集采用逗号分隔格式存储四元组(quad),格式如下:<br> <br> subject,predicate,object,namedGraph<br> subject,predicate,object,namedGraph<br> subject,predicate,object,namedGraph<br> <br> 其中`namedGraph`为命名图。<br> <br> 所有数据集均采用gzip压缩;在大多数Linux/macOS系统中,若有需要可通过`gunzip <文件名>`命令进行解压。<br> <br> <strong>## 引用说明</strong><br> 若您在研究工作中使用本数据集或NamE方法,请引用论文《NamE: Named Graph Embeddings for Context Modelling and Expert Knowledge Integration in Knowledge Graph Embeddings》。当前BibTeX引用格式待补充。<br> <br> 以下列出了本数据集所依托的原始数据源的引用信息。需注意,本仓库中FB15K-237的三元组形式未做任何修改,与原始版本完全一致。<br> <br> <strong>**BIOSNAP数据集引用:**</strong><br> <br> @misc{biosnap,<br> author = {Marinka Zitnik, Rok Sosiv{c}, Sagar Maheshwari, and Jure Leskovec},<br> title = {{BioSNAP Datasets}: {Stanford} Biomedical Network Dataset Collection},<br> howpublished = {url{http://snap.stanford.edu/biodata}},<br> month = aug,<br> year = 2018<br> }<br>
提供机构:
figshare
创建时间:
2023-06-29
AI搜集汇总
数据集介绍
main_image_url
以上内容由AI搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作