Taxonomic punchlines: metadata in biology | Biological taxonomy dataset | Social science metadata dataset

DataCite Commons | Updated 2021-05-05 | Indexed 2024-07-27

Biological taxonomy

Social science metadata

Download link:

https://tandf.figshare.com/articles/dataset/Taxonomic_punchlines_metadata_in_biology/8188262/1

Description:

Biological nomenclature contains metadata that can inform researchers about a taxon’s place in nature and the namer’s place in contemporary science and culture. The socio-scientific content of that metadata, and the story it conveys about the origin of a scientific name, hold value for taxonomy and interest for the public in general. However, such metadata are perishable if not hard-coded into literature. Accordingly, the present paper attempts to document the use and value of socio-scientific metadata through examples of whimsical taxonomic names. In the process, I capture hitherto unpublished views on this topic expressed by George Gaylord Simpson, the twentieth century's most distinguished vertebrate palaeontologist and a co-founder of the modern synthetic theory of evolution, along with personal perspectives of many of the eminent palaeozoologists and biologists of his time. The principal conclusion is that whimsical names will surely increase in their ubiquity in scientific literature, and this commends acknowledgement in the international zoological code to encourage the preservation of their origin stories.

Credit: Cartoon from New Scientist, its masked arthropod grumbling about whimsical scientific nomenclature, originally appeared in McClellan (1982). The artist, David Austin (1935-2005), began his cartooning career in the 1970s, later becoming well-known for his political commentary in pocket cartoons featured in British dailies including Today, The Daily Telegraph, and The Guardian, and also Labour Weekly, The Spectator, Field and Mail. (Used with permission of Mr. Austin’s estate, courtesy of Janet Slee, 2018.)

Provider:

Taylor & Francis

Created:

2019-05-27
