spectrallabs/credit-scoring-training-dataset
收藏Hugging Face2024-05-02 更新2024-05-25 收录
下载链接:
https://hf-mirror.com/datasets/spectrallabs/credit-scoring-training-dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
---
The training dataset includes all addresses that had undertaken at least one borrow transaction on Aave v2 Ethereum or Compound v2 Ethereum any time between 7 May 2019 and 31 August 2023, inclusive (called the observation window).
Data Structure & Shape
There are almost 0.5 million observations with each representing a single borrow event. Therefore, all feature values are calculated as at the timestamp of a borrow event and represent the cumulative positions just before the borrow event's timestamp.
Given that the data is at the borrow transaction level, an address with 10 borrow events during the observation window will appear 10 times in the dataset with different feature values across the 10 borrow timestamps.
As an example, if address 0xa initiated the following transactions on Aave v2 Ethereum:
At block_0: deposit 100 ETH
At block_1: borrow 25 ETH
At block_3: repay 10 ETH
At block_4: borrow 10 ETH
then 0xa will appear twice in the dataset as follows:
At block_1: all feature values will be calculated using all Ethereum and wallet transactions up to block_0, i.e., excluding the borrow event and other transactions at block_1
At block_4: all feature values will be calculated using all Ethereum and wallet transactions up to block_3, i.e., excluding the borrow event and other transactions at block_4
The dataset has 79 features in total, including the target label, transaction block number, transaction timestamp, and ETH address of the borrower.
Features
Other than counts related features, all feature values are denominated in ETH. All price conversions of the underlying tokens to ETH were done with the applicable rates as obtained from Transpose, Coinpaprika, CoinGecko, and Compound and Aave's price feeds at the respective event timestamps.
提供机构:
spectrallabs
原始信息汇总
数据集概述
数据集内容
- 数据集类型:训练数据集
- 数据来源:Aave v2 Ethereum 和 Compound v2 Ethereum
- 时间范围:2019年5月7日至2023年8月31日
- 数据粒度:每个观察代表一次借款事件
- 数据量:约50万条记录
数据结构
- 特征数量:79个特征
- 特征内容:包括目标标签、交易区块号、交易时间戳、借款人ETH地址等
- 特征值单位:除计数相关特征外,其他特征值以ETH计价
- 价格转换:使用Transpose、Coinpaprika、CoinGecko、Compound和Aave的价格数据进行转换
数据集更新
- 更新频率:根据借款事件发生时间更新
- 重复记录:同一地址在观察窗口内多次借款将产生多条记录,每条记录反映不同时间点的特征值
许可协议
- 许可证:Apache-2.0



