five

1922 Film Industry Trade Press Corpus

收藏
DataONE2024-01-22 更新2025-08-02 收录
下载链接:
https://search.dataone.org/view/sha256:faaaed19d2147d1a8336d33f0a9e8ed140dfd3f143a07cdd607ebf406c055f02
下载链接
链接失效反馈
官方服务:
资源简介:
For the first half of the twentieth century, no American industry boasted a more motley and prolific trade press than the movie business—a cutthroat landscape that set the stage for battle by ink. In 1930, Martin Quigley, publisher of Exhibitors Herald, conspired with Hollywood studios to eliminate all competing trade papers, yet this attempt and each one thereafter collapsed. Exploring the communities of exhibitors and creative workers that constituted key subscribers, Ink-Stained Hollywood tells the story of how a heterogeneous trade press triumphed by appealing to the foundational aspects of industry culture—taste, vanity, partisanship, and exclusivity. In captivating detail, Eric Hoyt chronicles the histories of well-known trade papers (Variety, Motion Picture Herald) alongside important yet forgotten publications (Film Spectator, Film Mercury, and Camera!), and challenges the canon of film periodicals, offering new interpretative frameworks for understanding print journalism’s rel..., The files in the corpus were scanned and OCRed by The Internet Archive using Teseract and hOCR. They are the versions which we used in our analysis. The similarity testing itself was conducted via two python script one which downloaded the scans from the Internet Archive, this one can be skipped if using the corpus in this deposit), and the other which compared the text files using Euclidean Distance, Cosine Distance, and Levenshtein distance metrics. The levenshtein distance metrics were calculate via RapidFuzz and were the standard Levenshtein Distance Ratio (LDR), the Sorted LDR which orders the words into alphabetical order, and the Set LDR which orders the words into alphabetical order and then removes any duplicates., , # 1922 Film Industry Trade Press Corpus ## Description of the data and file structure The data is in a .ZIP archive and consists of 23 DJVU text files, one for each publication in the corpus. The titles, dates, and links to the full scans of the publications on the Internet Archive are listed below.  | **Publication** | **Location** | **Dates** | **URL** | | :-------------------------------------------------------------- | :-------------- | :---------------------- | :----------------------------------------------------------------------- | | *The American Cinematographer* | Los Angeles, US | July 1922 | | | *Camera* | Los Angeles, US | April 1922–April 1923 | | | *Canadian Moving Picture Digest* ...
创建时间:
2025-07-26
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作