financial_credit_dataset
Source: ModelScope community. Updated 2025-11-27; indexed 2025-11-15.
Download link:
https://modelscope.cn/datasets/syncora/financial_credit_dataset
Description:
# Financial Credit Card Dataset — Free Financial Dataset 💳
High-Fidelity **Financial Dataset** for ML & AI Research, Credit Risk Modeling, and LLM Training
---
## 🌟 About This Dataset
This **financial dataset** provides synthetic credit and debit card records, including card brand, type, credit limits, issuance dates, CVV, and more.
All records are synthetic and **privacy-safe**, which makes the dataset suitable for **ML experimentation, AI research, and LLM training**.
**Visit our website** to learn more about the tool powering this dataset:
[🌐 Syncora.ai](https://syncora.ai)
---
## 📊 Dataset Features
| Feature | Description |
|---------|-------------|
| `id` | Unique record ID |
| `client_id` | Client identifier |
| `card_brand` | Visa, Mastercard, etc. |
| `card_type` | Credit, Debit, Prepaid |
| `card_number` | Synthetic card number |
| `expires` | Card expiration date |
| `cvv` | Security code |
| `has_chip` | Chip-enabled status |
| `num_cards_issued` | Total cards issued |
| `credit_limit` | Credit limit in USD |
| `acct_open_date` | Account opening date |
| `year_pin_last_changed` | Last PIN update year |
| `card_on_dark_web` | Flag for compromised cards |
---
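The feature table above can double as a schema check when loading the CSV. The following is a minimal sketch using only the Python standard library. The helper names `check_schema` and `load_cards` are hypothetical, and the sample rows are invented for illustration; the value formats in the real file (date layout, flag spelling, currency symbols) may differ.

```python
import csv
import io

# Column names copied from the feature table above.
EXPECTED_COLUMNS = [
    "id", "client_id", "card_brand", "card_type", "card_number", "expires",
    "cvv", "has_chip", "num_cards_issued", "credit_limit", "acct_open_date",
    "year_pin_last_changed", "card_on_dark_web",
]

# Hypothetical sample rows; the value formats are assumptions, not taken from the real file.
SAMPLE_CSV = """id,client_id,card_brand,card_type,card_number,expires,cvv,has_chip,num_cards_issued,credit_limit,acct_open_date,year_pin_last_changed,card_on_dark_web
0,101,Visa,Debit,4000000000000002,12/2027,123,YES,2,15000,09/2015,2019,No
1,102,Mastercard,Credit,5100000000000008,06/2026,456,NO,1,8000,03/2018,2020,No
"""


def check_schema(fieldnames):
    """Return (missing, unexpected) column names relative to the documented schema."""
    present = set(fieldnames or [])
    expected = set(EXPECTED_COLUMNS)
    missing = sorted(expected - present)
    unexpected = sorted(present - expected)
    return missing, unexpected


def load_cards(handle):
    """Read card records from an open text handle, failing fast on missing columns."""
    reader = csv.DictReader(handle)
    missing, _unexpected = check_schema(reader.fieldnames)
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


records = load_cards(io.StringIO(SAMPLE_CSV))
print(len(records), records[0]["card_brand"])
```

To run the same check against a downloaded copy, pass a file handle opened with `open("finance_cards_data.csv", newline="", encoding="utf-8")` to `load_cards` instead of the in-memory sample.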
## 📦 What This Repo Contains
- **Financial Dataset CSV** – Ready for ML and analytics
[⬇️ Download Dataset](https://huggingface.co/datasets/syncora/financia_credit_dataset/blob/main/finance_cards_data.csv)
- **Jupyter Notebook** – Explore and analyze the financial dataset
[📓 Open Notebook](https://huggingface.co/datasets/syncora/financia_credit_dataset/blob/main/Finance_banking_notebook.ipynb)
---
## 🤖 Machine Learning & AI Use Cases
- **💳 Fraud Detection & Risk Analysis**: Identify anomalous card patterns and assess credit risk
- **🛠 Feature Engineering**: Create features from card activity, limits, and account history
- **🧠 LLM Training**: Convert structured financial records into text for model fine-tuning
- **📊 Benchmarking**: Evaluate ML model performance on realistic synthetic financial scenarios
- **🔍 Explainability & Insights**: Apply SHAP, LIME, or ELI5 to interpret predictions
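
As a rough illustration of the feature engineering and LLM training use cases listed above, the sketch below derives a few numeric features from one record and then serializes the record as a sentence. It uses only the standard library. The record is hypothetical, and the parsing rules (`MM/YYYY` dates, a plain-integer `credit_limit`, `YES`/`NO` chip flags) are assumptions that should be checked against the actual CSV. The function names are illustrative, not part of any published tooling.

```python
from datetime import date

# Hypothetical record; value formats (MM/YYYY dates, plain-integer limit,
# YES/NO flags) are assumptions and may differ in the real CSV.
record = {
    "id": "0", "client_id": "101", "card_brand": "Visa", "card_type": "Debit",
    "card_number": "4000000000000002", "expires": "12/2027", "cvv": "123",
    "has_chip": "YES", "num_cards_issued": "2", "credit_limit": "15000",
    "acct_open_date": "09/2015", "year_pin_last_changed": "2019",
    "card_on_dark_web": "No",
}


def parse_month_year(text):
    """Parse an assumed 'MM/YYYY' string into the first day of that month."""
    month, year = text.split("/")
    return date(int(year), int(month), 1)


def engineer_features(rec, as_of):
    """Derive a few numeric features from one raw record."""
    opened = parse_month_year(rec["acct_open_date"])
    expires = parse_month_year(rec["expires"])
    account_age_months = (as_of.year - opened.year) * 12 + (as_of.month - opened.month)
    months_to_expiry = (expires.year - as_of.year) * 12 + (expires.month - as_of.month)
    cards = int(rec["num_cards_issued"])
    return {
        "account_age_months": account_age_months,
        "months_to_expiry": months_to_expiry,
        "years_since_pin_change": as_of.year - int(rec["year_pin_last_changed"]),
        "limit_per_card": float(rec["credit_limit"]) / cards if cards else 0.0,
        "has_chip": rec["has_chip"].strip().upper() == "YES",
    }


def record_to_text(rec, feats):
    """Serialize a record as one sentence, leaving out card_number and cvv."""
    chip = "with" if feats["has_chip"] else "without"
    return (
        f"Client {rec['client_id']} holds a {rec['card_brand']} {rec['card_type']} card "
        f"{chip} a chip, opened {feats['account_age_months']} months ago, "
        f"with a credit limit of {rec['credit_limit']} USD."
    )


features = engineer_features(record, as_of=date(2025, 1, 1))
print(features)
print(record_to_text(record, features))
```

The text serializer deliberately omits `card_number` and `cvv`: even though the values are synthetic, these fields carry little signal for a language model and are easy to leave out. Passing `as_of` explicitly keeps the derived ages reproducible across runs.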
## 🔗 Resources
- **⚡ Synthetic Data Generator** – Build your own financial datasets
[Open Generator](https://huggingface.co/spaces/syncora/synthetic-generation)
- **🌐 Syncora.ai** – Learn more about the platform powering this dataset
[Visit Website](https://syncora.ai)
## 📜 License
Released under **Apache License 2.0**.
This is a **free financial dataset**, suitable for ML research, credit analytics, and **LLM training**.
Provider:
maas
Created:
2025-10-09



