Medical Journal of Australia Archive (1914-1930)
收藏Snowflake2026-04-10 更新2026-04-11 收录
下载链接:
https://app.snowflake.com/marketplace/listing/GZSXZGPW3TFV
下载链接
链接失效反馈官方服务:
资源简介:
Complete pre-1930 archive of the Medical Journal of Australia (MJA), the official journal of the Australian Medical Association. **72,758 rows** of clean, structured medical text beginning with **Volume 1, Number 1, July 4, 1914,** the very first issue, published just weeks before the outbreak of World War I.
**What this data does for your model:**
- Your model learns authentic early 20th‑century Australian medicine from the very first issue (July 4, 1914), published weeks before WWI.
- Your model retrieves original research on cardiac arrhythmia, auricular flutter, and early electro‑cardiograph technology.
- Your model trains on WWI‑era Australian clinical practice, public health reporting, and the rise of cardiology as a specialty.
- Your model understands the language of Australian medical journalism, adding geographic diversity to your training corpus.
**What's inside:**
- Early cardiology research on cardiac arrhythmia and auricular flutter
- Electro-cardiograph technology and heart muscle lesions
- WWI-era Australian medicine and public health
- Clinical case studies from early 20th-century Australia
- Foundational research from Australian physicians
**Perfect for:**
- LLM fine-tuning on Australian medical text
- Cardiology and heart rhythm research
- WWI medical history
- Geographic diversification of medical training data
**Format:** Snowflake-native JSONL with columns: ISSUE, TITLE, AUTHOR, TYPE, TEXT. Fully cleaned, bias-audited, and ready for AI training.
*From the first issue in 1914 through 1930, Australian medicine at the dawn of modern cardiology, now ready for the age of AI.*
<p><br/></p>
提供机构:
Devin Media Corp.
创建时间:
2026-04-09
原始信息汇总
Medical Journal of Australia Archive (1914-1930) 数据集概述
数据集基本信息
- 数据集名称: Medical Journal of Australia Archive (1914-1930)
- 提供商: Devin Media Corp.
- 描述: 澳大利亚医学协会官方期刊《澳大利亚医学杂志》(MJA) 完整的1930年前档案。包含从第一卷第一期(1914年7月4日)开始的72,758行清洁、结构化的医学文本,该期出版于第一次世界大战爆发前几周。
- 数据量: 72,758 行
- 时间范围: 1914年至1930年
- 地理覆盖范围: 澳大利亚
数据内容
- 主要内容:
- 早期心脏病学研究,包括心律失常和心房扑动
- 心电图技术和心肌病变
- 第一次世界大战时期的澳大利亚医学和公共卫生
- 20世纪早期澳大利亚的临床病例研究
- 澳大利亚医生的基础研究
- 数据格式: Snowflake原生JSONL
- 数据列:
- ISSUE (Varchar)
- TITLE (Varchar)
- AUTHOR (Varchar)
- TYPE (Varchar)
- TEXT (Varchar)
- INGESTION_DATE (Timestamp_NTZ)
- 数据状态: 完全清洁、经过偏见审核,可用于AI训练
适用场景
- 机器学习: 针对澳大利亚医学文本进行领域特定的LLM微调、心脏病学术语提取和临床NLP模型开发。
- 真实世界数据(RWD): 利用历史记录的心脏病例、临床观察和治疗结果作为研究和分析的真实世界数据。
- 生命科学商业化: 支持生命科学研究,提供记录20世纪早期澳大利亚心脏病学的历史医学文献。
数据字典示例
- 表名: MJA_CORPUS
- 数据预览示例:
- 1924年11月: "The direct local action of light on a deep-seated"
- 1926年2月: "The method of removing the synovial sheath in toto"
- 1924年5月: "The growth was then pushed down by means of the finger"
- 1927年10月: "the visits and advice of the social service worker"
- 1923年6月: "the variations in the blood formula in response to"
使用示例
-
查看元数据文档: sql SELECT TITLE, TEXT FROM MJA_CORPUS WHERE TYPE = metadata LIMIT 5;
-
搜索心脏病学主题: sql SELECT ISSUE, TITLE, AUTHOR FROM MJA_CORPUS WHERE TYPE = article AND TEXT ILIKE %cardiac% OR TEXT ILIKE %heart% OR TEXT ILIKE %arrhythmia% LIMIT 10;
-
按类型统计行数: sql SELECT TYPE, COUNT(*) FROM MJA_CORPUS GROUP BY TYPE;
-
搜索一战时期医学: sql SELECT TITLE, AUTHOR, ISSUE FROM MJA_CORPUS WHERE TYPE = article AND ISSUE LIKE 1914% OR ISSUE LIKE 1915% OR ISSUE LIKE 1916% OR ISSUE LIKE 1917% OR ISSUE LIKE 1918% LIMIT 10;
技术详情
- 更新频率: 每年
- 云区域可用性: AWS(包括亚太地区雅加达、马来西亚、孟买、大阪等53个区域)
- 法律条款: 标准条款
提供商信息
- 提供商名称: Devin Media Corp.
- 销售联系: hello@devinmediacorp.com
- 支持联系: hello@devinmediacorp.com
- 提供商专长: 专注于为AI训练提供优质历史数据,提供全面、来源可追溯、经过偏见审核的1930年前出版物和档案,经过专业清洁和结构化处理,适用于机器学习应用。



