mattismegevand/IMSDb
收藏Hugging Face2023-08-24 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/mattismegevand/IMSDb
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
task_categories:
- text-classification
- text-generation
- question-answering
- summarization
language:
- en
size_categories:
- 1K<n<10K
viewer: false
---
## IMSDb Scraper
A Python script that scrapes movie script details from the Internet Movie Script Database (IMSDb) website.
### Features:
- Fetches all script links available on IMSDb.
- Retrieves details for each movie script including:
- Title
- Poster Image URL
- IMSDb Opinion
- IMSDb Rating
- Average User Rating
- Writers
- Genres
- Script Date
- Movie Release Date
- Submitted By
- Full Script Text
### Installation
1. Clone repository.
2. Install the required Python packages.
```bash
pip install -r requirements.txt
```
### Usage
1. Simply run the script.
```bash
python scraper.py
```
2. After the script runs, the extracted movie script details will be saved to `data.jsonl`.
### Error Handling
If any errors occur during scraping, they will be written to `error.txt` with the respective URL and error message.
### Contributing
Feel free to open issues or PRs if you find any problems or have improvements in mind.
### License
This project is licensed under the MIT License. See `LICENSE` for more details.
提供机构:
mattismegevand
原始信息汇总
数据集概述
许可证
- MIT许可证
任务类别
- 文本分类
- 文本生成
- 问答
- 摘要
语言
- 英语
数据集大小
- 1K<n<10K
数据集内容
- 电影剧本的详细信息,包括:
- 标题
- 海报图片URL
- IMSDb评价
- IMSDb评分
- 平均用户评分
- 编剧
- 类型
- 剧本日期
- 电影上映日期
- 提交者
- 完整剧本文本



