
American Journal of Public Health 1899-1930

Snowflake, updated 2026-04-13, indexed 2026-04-15
Download link:
https://app.snowflake.com/marketplace/listing/GZSXZGPW3VD7
Resource overview:
Complete pre-1930 archive of the American Journal of Public Health (AJPH), the official journal of the American Public Health Association. 45,696 rows of clean, structured text documenting the formative decades of public health in the United States.

What this data does for your model:

  • Your model learns authentic early 20th-century public health practice from the official journal of the American Public Health Association.
  • Your model retrieves original research on sanitation reform, infectious disease control (tuberculosis, diphtheria, typhoid, Spanish flu), and vital statistics.
  • Your model trains on the language of Progressive Era health advocacy, housing policy, child welfare, and occupational hygiene.
  • Your model understands the evolution of epidemiology, from outbreak investigation to population-level prevention strategies.

What's inside:

  • Early epidemiology and vital statistics
  • Sanitation reform and housing policy
  • Infectious disease control (tuberculosis, diphtheria, typhoid, Spanish flu)
  • Child welfare and school health programs
  • Occupational health and industrial hygiene
  • The Progressive Era public health movement

Perfect for:

  • LLM fine-tuning on public health and epidemiology
  • Public health policy research and history
  • Infectious disease modeling and historical epidemiology
  • Health equity and social determinants research

Format: Snowflake-native JSONL with columns: ISSUE, TITLE, AUTHOR, TYPE, TEXT. Fully cleaned, bias-audited, and ready for AI training.

From sanitation to social medicine, the journal that defined public health in America, now ready for AI.
Provider:
Devin Media Corp.
Created:
2026-04-13
Original information summary

American Journal of Public Health 1899-1930 Dataset Overview

Basic dataset information

  • Dataset name: American Journal of Public Health 1899-1930
  • Provider: Devin Media Corp.
  • Description: Complete pre-1930 archive of the American Journal of Public Health (AJPH), the official journal of the American Public Health Association.
  • Size: 45,696 rows
  • Content: Clean, structured text documenting the formative decades of public health in the United States.

Data content details

Topics covered

  • Early epidemiology and vital statistics
  • Sanitation reform and housing policy
  • Infectious disease control (tuberculosis, diphtheria, typhoid, Spanish flu)
  • Child welfare and school health programs
  • Occupational health and industrial hygiene
  • The Progressive Era public health movement

Use cases

  • LLM fine-tuning on public health and epidemiology
  • Public health policy research and history
  • Infectious disease modeling and historical epidemiology
  • Health equity and social determinants research

Data structure and format

Data format

  • Snowflake-native JSONL
  • Columns: ISSUE, TITLE, AUTHOR, TYPE, TEXT
  • Data status: Fully cleaned, bias-audited, and ready for AI training
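As a rough illustration of the record layout described above, the following sketch parses JSONL lines that carry the five listed columns. It assumes the rows have already been exported to JSONL text; the listing itself does not document an export path. The `CorpusRow` class, the `parse_jsonl` helper, and the sample rows are hypothetical and are not taken from the corpus.

```python
import json
from dataclasses import dataclass

# Column names as listed on the page; everything else here is illustrative.
COLUMNS = ("ISSUE", "TITLE", "AUTHOR", "TYPE", "TEXT")


@dataclass
class CorpusRow:
    issue: str
    title: str
    author: str
    type: str
    text: str


def parse_jsonl(lines):
    """Parse JSONL lines into CorpusRow objects, skipping blank lines."""
    rows = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        missing = [c for c in COLUMNS if c not in record]
        if missing:
            raise ValueError(f"row is missing columns: {missing}")
        rows.append(CorpusRow(*(record[c] for c in COLUMNS)))
    return rows


# Made-up sample data (hypothetical values, not from the dataset).
SAMPLE = [
    '{"ISSUE": "sample-issue-1", "TITLE": "Sample title", "AUTHOR": "A. Author", '
    '"TYPE": "article", "TEXT": "Sample text about sanitation."}',
    "",
    '{"ISSUE": "sample-issue-1", "TITLE": "About this file", "AUTHOR": "", '
    '"TYPE": "metadata", "TEXT": "Sample metadata."}',
]

rows = parse_jsonl(SAMPLE)
articles = [r for r in rows if r.type == "article"]
print(len(rows), len(articles))
```

Raising on a missing column, rather than silently filling in a default, is one possible design choice; it surfaces schema drift early if an export ever omits a field.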

Data dictionary (AJPH_CORPUS table)

Column name     Data type       Description
ISSUE           Varchar
TITLE           Varchar
AUTHOR          Varchar
TYPE            Varchar
TEXT            Varchar
INGESTION_DATE  Timestamp_NTZ

Business use cases

Machine learning

  • Train, fine-tune, and deploy machine learning models on 45,000+ rows of curated public health text spanning the Progressive Era through 1930.
  • Ideal for domain-specific LLM fine-tuning, public health terminology extraction, and epidemiology NLP.
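One very simple starting point for the terminology extraction mentioned above is counting how often known terms appear in the TEXT column. The sketch below is only an assumption about how that might be approached; the term list, the `count_terms` helper, and the sample texts are hypothetical and not part of the dataset.

```python
import re
from collections import Counter

# Hypothetical term list; a real vocabulary would come from the project at hand.
TERMS = ["tuberculosis", "typhoid", "diphtheria", "sanitation"]


def count_terms(texts, terms=TERMS):
    """Count whole-word, case-insensitive occurrences of each term across texts."""
    patterns = {
        term: re.compile(rf"\b{re.escape(term)}\b", re.IGNORECASE)
        for term in terms
    }
    counts = Counter()
    for text in texts:
        for term, pattern in patterns.items():
            counts[term] += len(pattern.findall(text))
    return counts


# Made-up sample texts standing in for values of the TEXT column.
sample_texts = [
    "Typhoid and tuberculosis were discussed; typhoid more often.",
    "Sanitation reform was the main topic.",
]

counts = count_terms(sample_texts)
print(counts)
```

Whole-word matching avoids counting a term inside a longer word, at the cost of missing plurals and period spelling variants, which OCR'd historical text is likely to contain.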

Real-world data

  • Leverage historically documented sanitation practices, infectious disease control, and vital statistics as real-world data for research and analysis.
  • This archive captures the birth of modern public health in America.

Life sciences commercialization

  • Support public health research with curated historical literature documenting the development of epidemiology, sanitation, and health promotion.
  • Track the evolution of population health from 1911 to 1930.

Usage examples

View metadata documents

SELECT TITLE, TEXT
FROM AJPH_CORPUS
WHERE TYPE = 'metadata'
LIMIT 5;

Search for sanitation and hygiene

SELECT ISSUE, TITLE
FROM AJPH_CORPUS
WHERE TYPE = 'article'
  AND (TEXT ILIKE '%sanitation%' OR TEXT ILIKE '%hygiene%')
LIMIT 10;

Search for infectious diseases

SELECT TITLE, ISSUE
FROM AJPH_CORPUS
WHERE TYPE = 'article'
  AND (TEXT ILIKE '%tuberculosis%' OR TEXT ILIKE '%influenza%' OR TEXT ILIKE '%typhoid%')
LIMIT 10;
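How the OR alternatives are grouped matters in searches like these, because SQL evaluates AND before OR. The sketch below illustrates the difference using Python's built-in `sqlite3` module as a local stand-in for Snowflake. It is an approximation under stated assumptions: SQLite has no ILIKE, so its default case-insensitive (ASCII) LIKE is used instead, and the table contents and the `titles` helper are hypothetical.

```python
import sqlite3

# In-memory stand-in for the AJPH_CORPUS table (made-up rows, reduced columns).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE AJPH_CORPUS (ISSUE TEXT, TITLE TEXT, TYPE TEXT, TEXT TEXT)")
conn.executemany(
    "INSERT INTO AJPH_CORPUS VALUES (?, ?, ?, ?)",
    [
        ("i1", "Article on sanitation", "article", "Municipal Sanitation reform"),
        ("i1", "Article on hygiene", "article", "Industrial hygiene notes"),
        ("i1", "Metadata mentioning hygiene", "metadata", "Covers hygiene topics"),
        ("i2", "Unrelated article", "article", "Vital statistics report"),
    ],
)


def titles(where_clause):
    """Return matching titles, sorted, for a given WHERE clause."""
    sql = f"SELECT TITLE FROM AJPH_CORPUS WHERE {where_clause} ORDER BY TITLE"
    return [row[0] for row in conn.execute(sql)]


# Without parentheses, the hygiene condition is not restricted to articles.
loose = titles("TYPE = 'article' AND TEXT LIKE '%sanitation%' OR TEXT LIKE '%hygiene%'")

# With parentheses, the TYPE filter applies to every keyword alternative.
strict = titles("TYPE = 'article' AND (TEXT LIKE '%sanitation%' OR TEXT LIKE '%hygiene%')")

print(loose)
print(strict)
```

In this sample, the ungrouped filter also returns the metadata row that mentions hygiene, while the grouped filter returns only the two matching articles.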

Dataset technical details

Update frequency

  • Annually

Geographic coverage

  • United States

Cloud region availability (AWS)

  • Canada (Central)
  • US East (N. Virginia)
  • US East (Ohio)
  • US West (Oregon)

Legal terms

  • Standard

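Provider information

About Devin Media Corp.

  • Focuses on providing high-quality historical data for AI training.
  • Provides comprehensive, provenance-tracked, bias-audited pre-1930 publications and archives, professionally cleaned and structured for machine learning applications.
  • Datasets span medicine, finance, fashion, law, and culture, including some of society's most prestigious and iconic publications.
  • Every dataset has the following characteristics:
    • Pre-1930 and verified public domain/copyright-free
    • Professionally OCR'd and aggressively cleaned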
    • Provenance-tracked and bias-audited
    • Formatted as JSONL for AI-readiness
    • Delivered via secure API (no file downloads)

Contact

  • Sales: hello@devinmediacorp.com
  • Support: hello@devinmediacorp.com