five

sayurio/ryans-bd-product-data

收藏
Hugging Face2026-04-05 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/sayurio/ryans-bd-product-data
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: other task_categories: - text-generation - table-question-answering language: - en - bn tags: - e-commerce - hardware - electronics - bangladesh - web-scraped pretty_name: Ryans Computers Product Archive size_categories: - 10K<n<100K --- # Ryans Computers Product Archive ## Overview This repository contains a comprehensive dataset scraped from [ryans.com](https://www.ryans.com/), one of the largest retail chains for computer hardware, laptops, and consumer electronics in Bangladesh. The dataset serves as a structured archive of product catalogs, technical specifications, pricing, and descriptions. ## Purpose and Usage This dataset is published publicly and strictly for **educational, research, and analytical purposes**. When combined with other regional e-commerce datasets, it becomes an incredibly powerful tool for data scientists, developers, and hardware enthusiasts looking to: * Perform comparative market analysis and track historical pricing trends for consumer electronics in Bangladesh. * Train or evaluate models for e-commerce parsing, tabular data extraction, and structured JSON generation. * Build Retrieval-Augmented Generation (RAG) systems for tech and hardware recommendations. * Analyze product specifications and component availability across different retail platforms. ## Dataset Details * **Source:** ryans.com * **Collection Method:** Web scraping * **Content Type:** E-commerce product data (including titles, prices, descriptions, specifications, brand metadata, and categorical hierarchy). * **Repository:** `sayurio/ryans-bd-product-data` ## Copyright and Fair Use Disclaimer This archive is created under the principles of **Fair Use** (under Section 107 of the Copyright Act) for purposes such as criticism, comment, research, and scholarship. * **No Ownership Claimed:** The creator of this repository does not claim any ownership, authorship, or copyright over the original product descriptions, images, or branding. All rights, title, and interest in the original content, logos, and trademarks remain entirely with their respective manufacturers, brands, and Ryans IT Limited. * **Non-Commercial:** This dataset is provided completely free of charge and is strictly not intended for commercial gain, competitive market manipulation, or profit. * **Transformative Use:** The data has been aggregated, extracted from its original web formatting, and compiled specifically for computational analysis and educational study. This represents a transformative use of the original publicly available material. **Takedown Requests:** If you are a copyright holder or representative of the source website and wish for specific data to be removed from this archive, please open an issue or contact the repository owner directly. Please submit a removal request specifying the exact URLs, SKUs, or product identifiers you wish to have taken down so they can be accurately located within the dataset and removed.

--- 许可证:其他 任务类别: - 文本生成 - 表格问答 语言: - 英语 - 孟加拉语 标签: - 电子商务 - 硬件 - 电子产品 - 孟加拉国 - 网络爬取 数据集名称:瑞安电脑产品档案(Ryans Computers Product Archive) 规模类别:10000 < 数据量 < 100000 --- # 瑞安电脑产品档案(Ryans Computers Product Archive) ## 概览 本仓库包含从[ryans.com](https://www.ryans.com/)爬取的大规模数据集,该网站是孟加拉国规模最大的电脑硬件、笔记本电脑及消费电子产品零售连锁品牌之一。本数据集为产品目录、技术规格、定价信息及商品描述提供了结构化存档。 ## 用途与使用场景 本数据集公开发布,**仅用于教育、研究及分析用途**。若与其他区域电子商务数据集结合使用,将成为面向数据科学家、开发者及硬件爱好者的极具实用价值的工具,可用于以下场景: * 针对孟加拉国消费电子产品开展对比性市场分析,并追踪其历史定价趋势。 * 训练或评估用于电子商务内容解析、表格数据提取及结构化JSON生成的模型。 * 构建面向科技与硬件产品推荐的检索增强生成(Retrieval-Augmented Generation,RAG)系统。 * 分析不同零售平台间的产品规格及组件供货情况。 ## 数据集详情 * **数据来源**:ryans.com * **采集方式**:网络爬取 * **内容类型**:电子商务产品数据(涵盖商品标题、售价、商品描述、技术规格、品牌元数据及分类层级结构)。 * **仓库地址**:`sayurio/ryans-bd-product-data` ## 版权与合理使用声明 本档案依据版权法第107条规定的**合理使用(Fair Use)**原则创建,用于评论、点评、研究及学术探讨等场景。 * **无所有权声明**:本仓库创建者不对原始产品描述、图片或品牌标识主张任何所有权、著作权或版权。原始内容、标识及商标的全部权利、所有权及相关权益均归属于其各自的制造商、品牌方及瑞安信息技术有限公司(Ryans IT Limited)。 * **非商业用途声明**:本数据集完全免费提供,严格禁止用于商业牟利、竞争性市场操纵或任何盈利性活动。 * **改造性使用声明**:本数据已从原始网页格式中提取并聚合,专门为计算分析与学术研究而整理,属于对原始公开素材的改造性使用。 **下架申请**:若您为版权方或源网站代表,希望从本档案中移除特定数据,请提交Issue或直接联系仓库所有者。提交下架申请时,请注明需移除数据对应的具体URL、库存单位(Stock Keeping Unit,SKU)或产品标识符,以便我们在数据集中准确定位并完成移除。
提供机构:
sayurio
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作