five

Medical Repository Archive (1797–1824)

收藏
Snowflake2026-04-16 更新2026-04-17 收录
下载链接:
https://app.snowflake.com/marketplace/listing/GZSXZGPW3W89
下载链接
链接失效反馈
官方服务:
资源简介:
Complete archive of the Medical Repository, the first medical journal published in the United States. **10,189 rows** of clean, structured text documenting the birth of American medical literature, beginning with **Volume 1, Number 1, 1797**. **What this data does for your model:** - Your model learns authentic late 18th‑century American medicine from the very first medical journal published in the United States. - Your model retrieves original observations on yellow fever, smallpox inoculation, and other epidemic diseases that shaped early American public health. - Your model trains on the language of pre‑germ theory medicine, when physicians relied on clinical observation without knowledge of bacteria or viruses. - Your model understands the foundations of American medical publishing, capturing the transition from Revolutionary War medicine to the early Republic's professional aspirations. **What's inside:** - Volume 1, Number 1 (1797): the founding document of American medical literature - Early American clinical case studies and disease observations - Original reports on yellow fever, smallpox inoculation, and other epidemic diseases - Pre‑germ theory medical concepts and therapeutic approaches - The birth of medical publishing in the post‑Revolutionary United States **Perfect for:** - LLM fine‑tuning on early American medical text - History of medicine and digital humanities - Rare book and special collections research - Medical terminology evolution studies **Format:** Snowflake-native JSONL with columns: ISSUE, TITLE, AUTHOR, TYPE, TEXT. Fully cleaned, bias‑audited, and ready for AI training. <p><br/></p>
提供机构:
Devin Media Corp.
创建时间:
2026-04-16
原始信息汇总

Medical Repository Archive (1797–1824) 数据集概述

数据集基本信息

  • 数据集名称:Medical Repository Archive (1797–1824)
  • 提供商:Devin Media Corp.
  • 数据描述:Complete archive of the Medical Repository, the first medical journal published in the United States.
  • 数据规模:10,189 rows
  • 数据格式:Snowflake-native JSONL
  • 数据列:ISSUE, TITLE, AUTHOR, TYPE, TEXT
  • 数据质量:Fully cleaned, bias-audited, and ready for AI training
  • 时间范围:1797年至1824年
  • 起始点:Volume 1, Number 1, 1797
  • 地理覆盖范围:United States
  • 数据更新频率:Annually

数据内容详情

  • 内容性质:Clean, structured text documenting the birth of American medical literature.
  • 历史意义:Founding document of American medical publishing.
  • 主题涵盖
    • Post-Revolutionary War medicine and public health
    • Early American clinical case studies
    • Pre-germ theory medical practice
    • Rare 18th-century medical observations and discoveries

适用场景

  • LLM fine-tuning on early American medical text
  • History of medicine and digital humanities
  • Rare book and special collections research
  • Medical terminology evolution studies

业务需求对应

  • Machine Learning:Train, fine-tune, and deploy machine learning models on 10,000+ rows of curated early American medical text spanning 1797–1824. Ideal for historical medical terminology extraction and digital humanities research.
  • Real World Data (RWD):Leverage historically documented clinical cases, public health observations, and medical practices from the early Republic as real-world data for research and analysis. This archive captures medicine before the germ theory revolution.
  • Life Sciences Commercialization:Support pharmaceutical and medical history research with curated early American medical literature documenting pre-modern therapeutic approaches and disease concepts.

使用示例

  1. 查看元数据文档 sql SELECT TITLE, TEXT FROM MR_CORPUS WHERE TYPE = metadata LIMIT 5;

  2. 搜索早期美国医学内容 sql SELECT TITLE, ISSUE FROM MR_CORPUS WHERE TYPE = article AND TEXT ILIKE %yellow fever% OR TEXT ILIKE %smallpox% OR TEXT ILIKE %inoculation% LIMIT 10;

  3. 按类型统计行数 sql SELECT TYPE, COUNT(*) FROM MR_CORPUS GROUP BY TYPE;

云区域可用性 (Azure)

  • Central US (Iowa)
  • East US (Virginia)
  • East US 2 (Virginia)
  • South Central US (Texas)
  • 3 More

提供商信息

  • 提供商名称:Devin Media Corp.
  • 提供商专长:Specializes in premium historical data for AI training. Provides comprehensive, provenance tracked, bias-audited, pre-1930 publications and archives, professionally cleaned and structured for machine learning applications.
  • 数据集特点:Pre-1930 and verified public domain/Copyright free; Professionally OCRd and aggressively cleaned; Provenance-tracked and bias-audited; Formatted as JSONL for AI-readiness; Delivered via secure API (no file downloads).

联系方式

  • 销售:hello@devinmediacorp.com
  • 支持:hello@devinmediacorp.com

法律条款

  • Standard
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作