违约贷款数据集
收藏阿里云天池2026-06-03 更新2024-03-07 收录
下载链接:
https://tianchi.aliyun.com/dataset/140861
下载链接
链接失效反馈官方服务:
资源简介:
包含了违约贷款测试集训练集共100万条数据,id: 贷款清单分配的唯一信用证标识
loanAmnt :贷款金额
term :贷款期限(year)
interestRate :贷款利率
installment :分期付款金额
grade: 贷款等级
subGrade: 贷款等级之子级
employmentTitle :就业职称
employmentLength :就业年限(年)
homeOwnership :借款人在登记时提供的房屋所有权状况
annualIncome: 年收入
verificationStatus: 验证状态
issueDate :贷款发放的月份
purpose :借款人在贷款申请时的贷款用途类别
postCode :借款人在贷款申请中提供的邮政编码的前3位数字
regionCode :地区编码
dti :债务收入比
delinquency_2years :借款人过去2年信用档案中逾期30天以上的违约事件数
ficoRangeLow :借款人在贷款发放时的fico所属的下限范围
ficoRangeHigh :借款人在贷款发放时的fico所属的上限范围
openAcc :借款人信用档案中未结信用额度的数量
pubRec :贬损公共记录的数量
pubRecBankruptcies :公开记录清除的数量
revolBal :信贷周转余额合计
revolUtil :循环额度利用率,或借款人使用的相对于所有可用循环信贷的信贷金额
totalAcc :借款人信用档案中当前的信用额度总数
initialListStatus :贷款的初始列表状态
applicationType :表明贷款是个人申请还是与两个共同借款人的联合申请
earliesCreditLine :借款人最早报告的信用额度开立的月份
title :借款人提供的贷款名称
policyCode :公开可用的策略代码=1新产品不公开可用的策略代码=2
n:系列匿名特征 匿名特征n0-n14,为一些贷款人行为计数特征的处理
This dataset includes a total of 1,000,000 records for default loan training and test sets. The detailed field descriptions are as follows:
1. id: Unique credit identifier assigned to the loan list
2. loanAmnt: Loan amount
3. term: Loan term (year)
4. interestRate: Loan interest rate
5. installment: Installment payment amount
6. grade: Loan grade
7. subGrade: Sub-level of the loan grade
8. employmentTitle: Job title
9. employmentLength: Length of employment (years)
10. homeOwnership: Home ownership status provided by the borrower at the time of registration
11. annualIncome: Annual income
12. verificationStatus: Verification status
13. issueDate: Month when the loan was issued
14. purpose: Category of loan purpose indicated by the borrower when applying for the loan
15. postCode: First 3 digits of the postal code provided by the borrower in the loan application
16. regionCode: Regional code
17. dti: Debt-to-income ratio
18. delinquency_2years: Number of default events with overdue over 30 days in the borrower's credit history over the past 2 years
19. ficoRangeLow: Lower bound of the FICO score range when the loan was issued
20. ficoRangeHigh: Upper bound of the FICO score range when the loan was issued
21. openAcc: Number of open credit lines in the borrower's credit file
22. pubRec: Number of derogatory public records
23. pubRecBankruptcies: Number of cleared public record bankruptcies
24. revolBal: Total revolving credit balance
25. revolUtil: Revolving line utilization rate, i.e., the amount of credit utilized by the borrower relative to all available revolving credit
26. totalAcc: Total number of current credit lines in the borrower's credit file
27. initialListStatus: Initial listing status of the loan
28. applicationType: Indicates whether the loan is an individual application or a joint application with two co-borrowers
29. earliesCreditLine: Month when the borrower's earliest reported credit line was opened
30. title: Loan title provided by the borrower
31. policyCode: Publicly available policy code = 1; non-publicly available policy code = 2
32. n: Series of anonymous features n0-n14, which are processed count features of lender behaviors
提供机构:
阿里云天池
创建时间:
2022-11-11
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集是一个包含100万条违约贷款记录的公共数据集,用于信用风险评估和机器学习建模。数据涵盖了贷款金额、期限、利率、借款人收入、就业状况、信用历史等30多个特征,并提供了训练集和测试集文件,适用于预测贷款违约的算法开发。
以上内容由遇见数据集搜集并总结生成



