five

AhmadBabajoytk/nigerian_retail_and_ecommerce_point_of_sale_records

收藏
Hugging Face2026-03-04 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/AhmadBabajoytk/nigerian_retail_and_ecommerce_point_of_sale_records
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: gpl task_categories: - tabular-regression - time-series-forecasting language: - en tags: - retail - ecommerce - nigeria - synthetic-data - sales-analytics - transactions size_categories: - 100K<n<1M pretty_name: Point Of Sale Records --- # Point Of Sale Records ## Dataset Description Comprehensive point of sale records for Nigerian retail and e-commerce analysis ## Dataset Information - **Category**: Sales and Transactions - **Industry**: Retail & E-Commerce - **Country**: Nigeria - **Format**: CSV, Parquet - **Rows**: 800,000 - **Columns**: 11 - **Date Generated**: 2025-10-06 - **Location**: `data/point_of_sale_records/` - **License**: GPL ## Schema | Column | Type | Sample Values | |--------|------|---------------| | `transaction_id` | String | POS00000000 | | `store_name` | String | Balogun Market | | `city` | String | Kano | | `transaction_date` | String | 2024-08-03 16:00:00 | | `cashier_id` | String | CASH990 | | `items_count` | Integer | 13 | | `total_amount_ngn` | Float | 33837.28 | | `payment_method` | String | cash | | `discount_applied` | Boolean | False | | `loyalty_points_earned` | Integer | 662 | | `receipt_number` | String | RCP0000000000 | ## Sample Data ``` transaction_id store_name city transaction_date cashier_id items_count total_amount_ngn payment_method discount_applied loyalty_points_earned receipt_number POS00000000 Balogun Market Kano 2024-08-03 16:00:00 CASH990 13 33837.28 cash False 662 RCP0000000000 POS00000001 Game Kano 2024-09-07 20:00:00 CASH840 2 23883.91 cash False 531 RCP0000000001 POS00000002 Game Port Harcourt 2024-07-04 09:00:00 CASH148 19 35400.49 mobile_money False 840 RCP0000000002 ``` ## Use Cases - Data analysis and insights - Machine learning model training - Business intelligence - Research and education - Predictive analytics ## Nigerian Context This dataset incorporates authentic Nigerian retail and e-commerce characteristics: ### E-Commerce Platforms - **Jumia** (35% market share) - Leading marketplace - **Konga** (25% market share) - Major competitor - **Jiji** (20% market share) - Classifieds platform - PayPorte, Slot, and other platforms ### Physical Retail - **Shoprite**, **Spar**, **Game** - Major supermarket chains - **Slot**, **Pointek** - Electronics retailers - **Mr Price** - Fashion retail - Traditional markets: Balogun Market, Computer Village ### Payment Methods - Cash on Delivery (45%) - Most popular - Bank Transfer (25%) - Debit Card (15%) - USSD (8%) - Mobile Money (5%) - Credit Card (2%) ### Logistics & Delivery - **GIG Logistics** - Nationwide coverage - **Kwik Delivery** - Fast urban delivery - **DHL**, **FedEx** - International and express - **Red Star Express** - Nationwide courier - Local dispatch riders ### Geographic Coverage Major Nigerian cities including: - **Lagos** - Commercial capital, highest retail density - **Abuja** - Federal capital, high e-commerce penetration - **Kano** - Northern commercial hub - **Port Harcourt** - Oil city, strong purchasing power - **Ibadan** - Large urban market - Plus 10+ other major cities ### Products & Categories - **Electronics**: Tecno, Infinix, Samsung phones; laptops, TVs - **Fashion**: Ankara fabric, Agbada, Kaftan, sneakers - **Groceries**: Rice (50kg bags), Garri, Palm Oil, Indomie - **Beauty**: Shea butter, Black soap, hair extensions - **Home**: Generators, inverters, solar panels ### Currency & Pricing - **Currency**: Nigerian Naira (NGN, ₦) - **Exchange Rate**: ~₦1,500/USD - **Price Ranges**: Realistic Nigerian market prices - **Time Zone**: West Africa Time (WAT, UTC+1) ## File Formats ### CSV ``` data/point_of_sale_records/nigerian_retail_and_ecommerce_point_of_sale_records.csv ``` ### Parquet (Recommended) ``` data/point_of_sale_records/nigerian_retail_and_ecommerce_point_of_sale_records.parquet ``` ## Nigerian Retail and E-Commerce - Loading the Dataset ### Hugging Face Datasets ```python from datasets import load_dataset # Load dataset dataset = load_dataset("electricsheepafrica/nigerian_retail_and_ecommerce_point_of_sale_records") # Convert to pandas df = dataset['train'].to_pandas() print(f"Loaded {len(df):,} rows") ``` ### Pandas (Direct) ```python import pandas as pd # Load CSV df = pd.read_csv('data/point_of_sale_records/nigerian_retail_and_ecommerce_point_of_sale_records.csv') # Load Parquet (recommended for large datasets) df = pd.read_parquet('data/point_of_sale_records/nigerian_retail_and_ecommerce_point_of_sale_records.parquet') ``` ### PyArrow ```python import pyarrow.parquet as pq # Load Parquet table = pq.read_table('data/point_of_sale_records/nigerian_retail_and_ecommerce_point_of_sale_records.parquet') df = table.to_pandas() ``` ## Data Quality - ✅ **Realistic Distributions**: Based on Nigerian retail patterns - ✅ **No Missing Critical Fields**: Complete core data - ✅ **Proper Data Types**: Appropriate types for each column - ✅ **Consistent Naming**: Clear, descriptive column names - ✅ **Nigerian Context**: Authentic local characteristics - ✅ **Production Scale**: Suitable for real-world applications ## Ethical Considerations - This is **synthetic data** generated for research and development - No real customer data or personally identifiable information - Designed to reflect realistic patterns without privacy concerns - Safe for public use, testing, and education ## License **GPL License** - General Public License This dataset is free to use for: - Research and academic purposes - Commercial applications - Educational projects - Open source development ## Citation ```bibtex @dataset{nigerian_retail_point_of_sale_records_2025, title={Point Of Sale Records}, author={Electric Sheep Africa}, year={2025}, publisher={Hugging Face}, howpublished={\url{https://huggingface.co/datasets/electricsheepafrica/nigerian-retail-point-of-sale-records}} } ``` ## Related Datasets This dataset is part of the **Nigerian Retail & E-Commerce Datasets** collection, which includes 42 datasets covering: - Customer & Shopper Data - Sales & Transactions - Product & Inventory - Marketing & Engagement - Operations & Workforce - Pricing & Revenue - Customer Support - Emerging & Advanced Technologies **Browse all datasets**: https://huggingface.co/electricsheepafrica ## Updates & Maintenance - **Version**: 1.0 - **Last Updated**: 2025-10-06 - **Maintenance**: Active - **Issues**: Report via Hugging Face discussions ## Contact For questions, feedback, or collaboration: - **Hugging Face**: electricsheepafrica - **Issues**: Open a discussion on the dataset page - **General Inquiries**: Via Hugging Face profile --- **Part of the Nigerian Industry Datasets Initiative** Building comprehensive, authentic datasets for African markets.

--- 许可证:GPL 任务类别: - 表格回归(tabular-regression) - 时间序列预测(time-series-forecasting) 语言: - 英语 标签: - 零售 - 电子商务 - 尼日利亚 - 合成数据(synthetic-data) - 销售分析(sales-analytics) - 交易记录(transactions) 规模类别: - 100K<数据量<1M 展示名称:销售点(Point Of Sale)记录 --- # 销售点记录 ## 数据集描述 面向尼日利亚零售与电子商务分析的全量销售点交易记录 ## 数据集详情 - **数据集类别**:销售与交易记录 - **所属行业**:零售与电子商务 - **覆盖国家**:尼日利亚 - **存储格式**:CSV、Parquet - **数据行数**:800,000 - **字段数量**:11 - **生成日期**:2025-10-06 - **存储路径**:`data/point_of_sale_records/` - **许可证**:GPL ## 字段结构 | 字段名 | 数据类型 | 示例值 | |--------|----------|--------| | `transaction_id` | 字符串 | POS00000000 | | `store_name` | 字符串 | Balogun Market | | `city` | 字符串 | Kano | | `transaction_date` | 字符串 | 2024-08-03 16:00:00 | | `cashier_id` | 字符串 | CASH990 | | `items_count` | 整数 | 13 | | `total_amount_ngn` | 浮点数 | 33837.28 | | `payment_method` | 字符串 | cash | | `discount_applied` | 布尔值 | False | | `loyalty_points_earned` | 整数 | 662 | | `receipt_number` | 字符串 | RCP0000000000 | ## 示例数据 transaction_id store_name city transaction_date cashier_id items_count total_amount_ngn payment_method discount_applied loyalty_points_earned receipt_number POS00000000 Balogun Market Kano 2024-08-03 16:00:00 CASH990 13 33837.28 cash False 662 RCP0000000000 POS00000001 Game Kano 2024-09-07 20:00:00 CASH840 2 23883.91 cash False 531 RCP0000000001 POS00000002 Game Port Harcourt 2024-07-04 09:00:00 CASH148 19 35400.49 mobile_money False 840 RCP0000000002 ## 应用场景 - 数据分析与洞察挖掘 - 机器学习模型训练 - 商业智能分析 - 研究与教学用途 - 预测性分析 ## 尼日利亚本土适配说明 本数据集融入了尼日利亚零售与电子商务的真实特征: ### 电子商务平台 - **Jumia**(市场份额35%):头部电商平台 - **Konga**(市场份额25%):主要竞品平台 - **Jiji**(市场份额20%):分类信息平台 - PayPorte、Slot等其他平台 ### 实体零售 - **Shoprite**、**Spar**、**Game**:大型连锁超市 - **Slot**、**Pointek**:电子产品零售商 - **Mr Price**:时尚零售品牌 - 传统市场:巴洛贡市场、电脑村 ### 支付方式 - 货到付款(占比45%):最主流支付方式 - 银行转账(25%) - 借记卡支付(15%) - USSD支付(8%) - 移动支付(5%) - 信用卡支付(2%) ### 物流配送 - **GIG Logistics**:全国覆盖配送 - **Kwik Delivery**:快速城市配送 - **DHL**、**FedEx**:国际与加急配送服务 - **Red Star Express**:全国 Courier 配送服务 - 本地配送骑手 ### 地理覆盖范围 尼日利亚主要城市包括: - **拉各斯**:商业首都,零售密度最高 - **阿布贾**:联邦首都,电子商务渗透率最高 - **卡诺**:北部商业枢纽 - **哈科特港**:石油城市,购买力强劲 - **伊巴丹**:大型城市市场 - 以及其他10余个主要城市 ### 商品品类 - **电子产品**:Tecno、Infinix、三星手机、笔记本电脑、电视等 - **时尚服饰**:安卡拉面料、阿巴达长袍、卡夫坦长衫、运动鞋等 - **食品杂货**:50公斤装大米、加里木薯粉、棕榈油、Indomie方便面等 - **美妆产品**:乳木果油、黑皂、接发片等 - **家居用品**:发电机、逆变器、太阳能板等 ### 货币与定价规则 - **货币**:尼日利亚奈拉(NGN,₦) - **汇率**:约1美元兑换1500奈拉 - **价格区间**:贴合尼日利亚本土市场的真实定价 - **时区**:西非时间(WAT,UTC+1) ## 文件格式 ### CSV格式 data/point_of_sale_records/nigerian_retail_and_ecommerce_point_of_sale_records.csv ### Parquet格式(推荐) data/point_of_sale_records/nigerian_retail_and_ecommerce_point_of_sale_records.parquet ## 尼日利亚零售与电子商务数据集加载方法 ### Hugging Face Datasets 加载方式 python from datasets import load_dataset # 加载数据集 dataset = load_dataset("electricsheepafrica/nigerian_retail_and_ecommerce_point_of_sale_records") # 转换为pandas数据框 df = dataset['train'].to_pandas() print(f"已加载 {len(df):,} 行数据") ### 直接使用Pandas加载 python import pandas as pd # 加载CSV文件 df = pd.read_csv('data/point_of_sale_records/nigerian_retail_and_ecommerce_point_of_sale_records.csv') # 加载Parquet文件(推荐用于大型数据集) df = pd.read_parquet('data/point_of_sale_records/nigerian_retail_and_ecommerce_point_of_sale_records.parquet') ### PyArrow加载方式 python import pyarrow.parquet as pq # 加载Parquet文件 table = pq.read_table('data/point_of_sale_records/nigerian_retail_and_ecommerce_point_of_sale_records.parquet') df = table.to_pandas() ## 数据质量保障 - ✅ **真实分布贴合市场**:基于尼日利亚零售市场的真实交易模式 - ✅ **无关键字段缺失**:核心交易字段完整无缺失 - ✅ **数据类型规范统一**:各字段匹配适配的数据类型 - ✅ **命名清晰一致**:字段命名清晰且具备强描述性 - ✅ **本土特征完整**:融入尼日利亚本地零售与电商的真实特征 - ✅ **生产级规模**:适配真实世界的业务应用场景 ## 伦理考量 本数据集为**合成数据**,仅用于研究与开发用途;未包含任何真实客户数据或个人可识别信息;旨在还原真实交易模式的同时规避隐私风险;可安全用于公开使用、测试与教学场景。 ## 许可证 **GPL许可证**(通用公共许可证) 本数据集可免费用于以下场景: - 研究与学术用途 - 商业应用 - 教育项目 - 开源开发 ## 引用格式 bibtex @dataset{nigerian_retail_point_of_sale_records_2025, title={Point Of Sale Records}, author={Electric Sheep Africa}, year={2025}, publisher={Hugging Face}, howpublished={url{https://huggingface.co/datasets/electricsheepafrica/nigerian-retail-point-of-sale-records}} } ## 相关数据集 本数据集隶属于**尼日利亚零售与电子商务数据集合集**,该合集包含42个数据集,覆盖以下领域: - 客户与购物者数据 - 销售与交易记录 - 商品与库存数据 - 营销与用户互动数据 - 运营与劳动力数据 - 定价与营收数据 - 客户支持数据 - 新兴与前沿技术相关数据 **浏览全部数据集**:https://huggingface.co/electricsheepafrica ## 更新与维护 - **版本号**:1.0 - **最后更新日期**:2025-10-06 - **维护状态**:活跃 - **问题反馈**:通过Hugging Face讨论区提交 ## 联系方式 如有疑问、反馈或合作意向: - **Hugging Face账号**:electricsheepafrica - **问题反馈**:在数据集页面开启讨论 - **通用咨询**:通过Hugging Face个人主页联系 --- **隶属于尼日利亚行业数据集倡议项目** 本项目旨在为非洲市场打造全面且真实的行业数据集。
提供机构:
AhmadBabajoytk
二维码
社区交流群
二维码
科研交流群
商业服务