Medical Repository Archive (1797–1824)
收藏Snowflake2026-04-16 更新2026-04-17 收录
下载链接:
https://app.snowflake.com/marketplace/listing/GZSXZGPW3W89
下载链接
链接失效反馈官方服务:
资源简介:
Complete archive of the Medical Repository, the first medical journal published in the United States. **10,189 rows** of clean, structured text documenting the birth of American medical literature, beginning with **Volume 1, Number 1, 1797**.
**What this data does for your model:**
- Your model learns authentic late 18th‑century American medicine from the very first medical journal published in the United States.
- Your model retrieves original observations on yellow fever, smallpox inoculation, and other epidemic diseases that shaped early American public health.
- Your model trains on the language of pre‑germ theory medicine, when physicians relied on clinical observation without knowledge of bacteria or viruses.
- Your model understands the foundations of American medical publishing, capturing the transition from Revolutionary War medicine to the early Republic's professional aspirations.
**What's inside:**
- Volume 1, Number 1 (1797): the founding document of American medical literature
- Early American clinical case studies and disease observations
- Original reports on yellow fever, smallpox inoculation, and other epidemic diseases
- Pre‑germ theory medical concepts and therapeutic approaches
- The birth of medical publishing in the post‑Revolutionary United States
**Perfect for:**
- LLM fine‑tuning on early American medical text
- History of medicine and digital humanities
- Rare book and special collections research
- Medical terminology evolution studies
**Format:** Snowflake-native JSONL with columns: ISSUE, TITLE, AUTHOR, TYPE, TEXT. Fully cleaned, bias‑audited, and ready for AI training.
<p><br/></p>
提供机构:
Devin Media Corp.
创建时间:
2026-04-16
原始信息汇总
Medical Repository Archive (1797–1824) 数据集概述
数据集基本信息
- 数据集名称:Medical Repository Archive (1797–1824)
- 提供商:Devin Media Corp.
- 数据描述:Complete archive of the Medical Repository, the first medical journal published in the United States.
- 数据规模:10,189 rows
- 数据格式:Snowflake-native JSONL
- 数据列:ISSUE, TITLE, AUTHOR, TYPE, TEXT
- 数据质量:Fully cleaned, bias-audited, and ready for AI training
- 时间范围:1797年至1824年
- 起始点:Volume 1, Number 1, 1797
- 地理覆盖范围:United States
- 数据更新频率:Annually
数据内容详情
- 内容性质:Clean, structured text documenting the birth of American medical literature.
- 历史意义:Founding document of American medical publishing.
- 主题涵盖:
- Post-Revolutionary War medicine and public health
- Early American clinical case studies
- Pre-germ theory medical practice
- Rare 18th-century medical observations and discoveries
适用场景
- LLM fine-tuning on early American medical text
- History of medicine and digital humanities
- Rare book and special collections research
- Medical terminology evolution studies
业务需求对应
- Machine Learning:Train, fine-tune, and deploy machine learning models on 10,000+ rows of curated early American medical text spanning 1797–1824. Ideal for historical medical terminology extraction and digital humanities research.
- Real World Data (RWD):Leverage historically documented clinical cases, public health observations, and medical practices from the early Republic as real-world data for research and analysis. This archive captures medicine before the germ theory revolution.
- Life Sciences Commercialization:Support pharmaceutical and medical history research with curated early American medical literature documenting pre-modern therapeutic approaches and disease concepts.
使用示例
-
查看元数据文档 sql SELECT TITLE, TEXT FROM MR_CORPUS WHERE TYPE = metadata LIMIT 5;
-
搜索早期美国医学内容 sql SELECT TITLE, ISSUE FROM MR_CORPUS WHERE TYPE = article AND TEXT ILIKE %yellow fever% OR TEXT ILIKE %smallpox% OR TEXT ILIKE %inoculation% LIMIT 10;
-
按类型统计行数 sql SELECT TYPE, COUNT(*) FROM MR_CORPUS GROUP BY TYPE;
云区域可用性 (Azure)
- Central US (Iowa)
- East US (Virginia)
- East US 2 (Virginia)
- South Central US (Texas)
- 3 More
提供商信息
- 提供商名称:Devin Media Corp.
- 提供商专长:Specializes in premium historical data for AI training. Provides comprehensive, provenance tracked, bias-audited, pre-1930 publications and archives, professionally cleaned and structured for machine learning applications.
- 数据集特点:Pre-1930 and verified public domain/Copyright free; Professionally OCRd and aggressively cleaned; Provenance-tracked and bias-audited; Formatted as JSONL for AI-readiness; Delivered via secure API (no file downloads).
联系方式
- 销售:hello@devinmediacorp.com
- 支持:hello@devinmediacorp.com
法律条款
- Standard



