five

mattismegevand/IMSDb

收藏
Hugging Face2023-08-24 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/mattismegevand/IMSDb
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit task_categories: - text-classification - text-generation - question-answering - summarization language: - en size_categories: - 1K<n<10K viewer: false --- ## IMSDb Scraper A Python script that scrapes movie script details from the Internet Movie Script Database (IMSDb) website. ### Features: - Fetches all script links available on IMSDb. - Retrieves details for each movie script including: - Title - Poster Image URL - IMSDb Opinion - IMSDb Rating - Average User Rating - Writers - Genres - Script Date - Movie Release Date - Submitted By - Full Script Text ### Installation 1. Clone repository. 2. Install the required Python packages. ```bash pip install -r requirements.txt ``` ### Usage 1. Simply run the script. ```bash python scraper.py ``` 2. After the script runs, the extracted movie script details will be saved to `data.jsonl`. ### Error Handling If any errors occur during scraping, they will be written to `error.txt` with the respective URL and error message. ### Contributing Feel free to open issues or PRs if you find any problems or have improvements in mind. ### License This project is licensed under the MIT License. See `LICENSE` for more details.
提供机构:
mattismegevand
原始信息汇总

数据集概述

许可证

  • MIT许可证

任务类别

  • 文本分类
  • 文本生成
  • 问答
  • 摘要

语言

  • 英语

数据集大小

  • 1K<n<10K

数据集内容

  • 电影剧本的详细信息,包括:
    • 标题
    • 海报图片URL
    • IMSDb评价
    • IMSDb评分
    • 平均用户评分
    • 编剧
    • 类型
    • 剧本日期
    • 电影上映日期
    • 提交者
    • 完整剧本文本
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作