1922 Film Industry Trade Press Corpus
收藏DataONE2024-01-22 更新2025-08-02 收录
下载链接:
https://search.dataone.org/view/sha256:faaaed19d2147d1a8336d33f0a9e8ed140dfd3f143a07cdd607ebf406c055f02
下载链接
链接失效反馈官方服务:
资源简介:
For the first half of the twentieth century, no American industry boasted a more motley and prolific trade press than the movie businessâa cutthroat landscape that set the stage for battle by ink. In 1930, Martin Quigley, publisher of Exhibitors Herald, conspired with Hollywood studios to eliminate all competing trade papers, yet this attempt and each one thereafter collapsed. Exploring the communities of exhibitors and creative workers that constituted key subscribers, Ink-Stained Hollywood tells the story of how a heterogeneous trade press triumphed by appealing to the foundational aspects of industry cultureâtaste, vanity, partisanship, and exclusivity. In captivating detail, Eric Hoyt chronicles the histories of well-known trade papers (Variety, Motion Picture Herald) alongside important yet forgotten publications (Film Spectator, Film Mercury, and Camera!), and challenges the canon of film periodicals, offering new interpretative frameworks for understanding print journalismâs rel..., The files in the corpus were scanned and OCRed by The Internet Archive using Teseract and hOCR. They are the versions which we used in our analysis.
The similarity testing itself was conducted via two python script one which downloaded the scans from the Internet Archive, this one can be skipped if using the corpus in this deposit), and the other which compared the text files using Euclidean Distance, Cosine Distance, and Levenshtein distance metrics. The levenshtein distance metrics were calculate via RapidFuzz and were the standard Levenshtein Distance Ratio (LDR), the Sorted LDR which orders the words into alphabetical order, and the Set LDR which orders the words into alphabetical order and then removes any duplicates., , # 1922 Film Industry Trade Press Corpus
## Description of the data and file structure
The data is in a .ZIP archive and consists of 23 DJVU text files, one for each publication in the corpus.
The titles, dates, and links to the full scans of the publications on the Internet Archive are listed below.Â
| **Publication** | **Location** | **Dates** | **URL** |
| :-------------------------------------------------------------- | :-------------- | :---------------------- | :----------------------------------------------------------------------- |
| *The American Cinematographer* | Los Angeles, US | July 1922 | |
| *Camera* | Los Angeles, US | April 1922âApril 1923 | |
| *Canadian Moving Picture Digest* ...
创建时间:
2025-07-26



