
MarineLives/Line-Insertions

Source: Hugging Face, updated 2024-09-17, indexed 2025-04-12
Download link:
https://hf-mirror.com/datasets/MarineLives/Line-Insertions
Description:
---
language:
- en
license: cc-by-sa-4.0
---

Small dataset of line sections extracted from English High Court of Admiralty deposition volume HCA 13/58 (covering 1642-44).

Each example is a series of four consecutive lines of text extracted from a page. The lines are grouped in pairs of four-line groups: the first group is the raw machine-transcribed text and the second group is the corrected text.

Raw text group of four lines:
* Line two or three of each group of four lines contains an insertion mark '⁁'
* The line above or below the insertion mark consists of the text which should be inserted

Corrected text group:
* Reduced to three lines, with the text inserted at the insertion mark
* In some cases additional text changes are made, correcting HTR errors and expanding English and Latin abbreviations and contractions, to ensure accurate ground truth

Structured as follows:

**EXAMPLE FOURTEEN: RAW TEXT**
**EXAMPLE FOURTEEN: CORRECTION**

WORKED EXAMPLE:

**EXAMPLE SIX: RAW TEXT**
the said parties and Benedicte Stafforde the Englishe master of for that voyage the Sta Cara ⁁ and the said shippe was laden with bayes Cloth and wynes and iron and pitch when shee came to Sta

**EXAMPLE SIX: CORRECTION**
the said parties and Benedicte Stafforde the Englishe master of the Santa Clara for that voyage and the said shippe was laden with bayes Cloth and wynes and iron and pitch when shee came to Santa
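The insertion rule described above (four raw lines reduced to three by moving the interlinear text to the '⁁' mark) can be sketched in Python. This is a minimal illustration, not part of the dataset or its tooling: the function name `apply_insertion` is hypothetical, the caller is assumed to know which adjacent line holds the inserted text, and the line breaks in the example are an assumption, since the listing shows the four lines run together. The sketch performs only the mechanical merge; it does not expand abbreviations such as "Sta" or correct transcription errors such as "Cara".

```python
INSERTION_MARK = "⁁"  # caret insertion point used in the raw transcriptions


def apply_insertion(lines, insertion_index):
    """Merge the interlinear text at `insertion_index` into the marked line.

    `lines` is a group of raw lines; exactly one must contain the mark, and
    `insertion_index` must point at the line directly above or below it.
    Returns a new list with one line fewer.
    """
    mark_rows = [i for i, line in enumerate(lines) if INSERTION_MARK in line]
    if len(mark_rows) != 1:
        raise ValueError("expected exactly one line containing the insertion mark")
    mark_index = mark_rows[0]
    if not 0 <= insertion_index < len(lines):
        raise ValueError("insertion_index is out of range")
    if abs(insertion_index - mark_index) != 1:
        raise ValueError("insertion text must be directly above or below the marked line")

    insertion = lines[insertion_index].strip()
    merged = lines[mark_index].replace(INSERTION_MARK, insertion, 1)
    merged = " ".join(merged.split())  # normalise spacing around the inserted text

    # Keep every line except the interlinear one, swapping in the merged line.
    return [merged if i == mark_index else line
            for i, line in enumerate(lines) if i != insertion_index]


# Example Six, with line breaks ASSUMED for illustration only.
raw = [
    "the said parties and Benedicte Stafforde the Englishe master of",
    "for that voyage",
    "the Sta Cara ⁁ and the said shippe was laden with bayes Cloth",
    "and wynes and iron and pitch when shee came to Sta",
]
merged_lines = apply_insertion(raw, insertion_index=1)
print("\n".join(merged_lines))
```

Under these assumptions the merged output matches the published correction in line count and word order. The remaining differences ("Sta Cara" versus "Santa Clara", "Sta" versus "Santa") are the additional manual corrections the description mentions.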
Provider:
MarineLives