American Journal of Public Health 1899-1930
收藏Snowflake2026-04-13 更新2026-04-15 收录
下载链接:
https://app.snowflake.com/marketplace/listing/GZSXZGPW3VD7
下载链接
链接失效反馈官方服务:
资源简介:
Complete pre-1930 archive of the American Journal of Public Health (AJPH), the official journal of the American Public Health Association. **45,696 rows** of clean, structured text documenting the formative decades of public health in the United States.
**What this data does for your model:**
- Your model learns authentic early 20th‑century public health practice from the official journal of the American Public Health Association.
- Your model retrieves original research on sanitation reform, infectious disease control (tuberculosis, diphtheria, typhoid, Spanish flu), and vital statistics.
- Your model trains on the language of Progressive Era health advocacy, housing policy, child welfare, and occupational hygiene.
- Your model understands the evolution of epidemiology, from outbreak investigation to population‑level prevention strategies.
**What's inside:**
- Early epidemiology and vital statistics
- Sanitation reform and housing policy
- Infectious disease control (tuberculosis, diphtheria, typhoid, Spanish flu)
- Child welfare and school health programs
- Occupational health and industrial hygiene
- The Progressive Era public health movement
**Perfect for:**
- LLM fine-tuning on public health and epidemiology
- Public health policy research and history
- Infectious disease modeling and historical epidemiology
- Health equity and social determinants research
**Format:** Snowflake-native JSONL with columns: ISSUE, TITLE, AUTHOR, TYPE, TEXT. Fully cleaned, bias-audited, and ready for AI training.
*From sanitation to social medicine, the journal that defined public health in America, now ready for AI.*
<p><br/></p>
提供机构:
Devin Media Corp.
创建时间:
2026-04-13
原始信息汇总
American Journal of Public Health 1899-1930 数据集概述
数据集基本信息
- 数据集名称:American Journal of Public Health 1899-1930
- 提供方:Devin Media Corp.
- 数据描述:Complete pre-1930 archive of the American Journal of Public Health (AJPH), the official journal of the American Public Health Association.
- 数据量:45,696 rows
- 数据内容:Clean, structured text documenting the formative decades of public health in the United States.
数据内容详情
涵盖主题
- Early epidemiology and vital statistics
- Sanitation reform and housing policy
- Infectious disease control (tuberculosis, diphtheria, typhoid, Spanish flu)
- Child welfare and school health programs
- Occupational health and industrial hygiene
- The Progressive Era public health movement
适用场景
- LLM fine-tuning on public health and epidemiology
- Public health policy research and history
- Infectious disease modeling and historical epidemiology
- Health equity and social determinants research
数据结构与格式
数据格式
- Snowflake-native JSONL
- 列字段:ISSUE, TITLE, AUTHOR, TYPE, TEXT
- 数据状态:Fully cleaned, bias-audited, and ready for AI training
数据字典(AJPH_CORPUS表)
| 列名 | 数据类型 | 描述 |
|---|---|---|
| ISSUE | Varchar | |
| TITLE | Varchar | |
| AUTHOR | Varchar | |
| TYPE | Varchar | |
| TEXT | Varchar | |
| INGESTION_DATE | Timestamp_NTZ |
业务应用场景
机器学习
- Train, fine-tune, and deploy machine learning models on 45,000+ rows of curated public health text spanning the Progressive Era through 1930.
- Ideal for domain-specific LLM fine-tuning, public health terminology extraction, and epidemiology NLP.
真实世界数据
- Leverage historically documented sanitation practices, infectious disease control, and vital statistics as real-world data for research and analysis.
- This archive captures the birth of modern public health in America.
生命科学商业化
- Support public health research with curated historical literature documenting the development of epidemiology, sanitation, and health promotion.
- Track the evolution of population health from 1911–1930.
使用示例
查看元数据文档
sql SELECT TITLE, TEXT FROM AJPH_CORPUS WHERE TYPE = metadata LIMIT 5;
搜索卫生与健康
sql SELECT ISSUE, TITLE FROM AJPH_CORPUS WHERE TYPE = article AND TEXT ILIKE %sanitation% OR TEXT ILIKE %hygiene% LIMIT 10;
搜索传染病
sql SELECT TITLE, ISSUE FROM AJPH_CORPUS WHERE TYPE = article AND TEXT ILIKE %tuberculosis% OR TEXT ILIKE %influenza% OR TEXT ILIKE %typhoid% LIMIT 10;
数据集技术详情
更新频率
- Annually
地理覆盖范围
- United States
云区域可用性(AWS)
- Canada (Central)
- US East (N. Virginia)
- US East (Ohio)
- US West (Oregon)
法律条款
- Standard
提供商信息
关于Devin Media Corp.
- 专注于为AI训练提供优质历史数据。
- 提供全面、来源可追溯、偏见审核、1930年以前的出版物和档案,经过专业清洗和结构化处理,适用于机器学习应用。
- 数据集涵盖医学、金融、时尚、法律和文化领域,包括一些社会最负盛名和标志性的出版物。
- 每个数据集都具备以下特点:
- Pre-1930 and verified public domain/Copyright free
- Professionally OCRd and aggressively cleaned
- Provenance-tracked and bias-audited
- Formatted as JSONL for AI-readiness
- Delivered via secure API (no file downloads)
联系方式
- 销售:hello@devinmediacorp.com
- 支持:hello@devinmediacorp.com



