thomaserhel/ethusdt-binance-spot-kline-1m-daily-2023-2024
收藏Hugging Face2024-05-04 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/thomaserhel/ethusdt-binance-spot-kline-1m-daily-2023-2024
下载链接
链接失效反馈官方服务:
资源简介:
---
license: gpl-3.0
language:
- en
tags:
- finance
size_categories:
- 10M<n<100M
---
# Dataset Card for ETHUSDT 1-minute Dataset
## Dataset Details
### Dataset Description
<!-- Provide a longer summary of what this dataset is. -->
This dataset contains 1-minute granularity historical data for the ETHUSDT pair on Binance spot market, covering the period from May 1st, 2023 to May 1st, 2024. The data is formatted into a single CSV file, with each row representing a 1-minute candlestick.
- **Curated by:** Thomas Erhel
- **Language(s) (NLP):** English
- **License:** GNU General Public License v3.0 or later
## Uses
<!-- Address questions around how the dataset is intended to be used. -->
### Direct Use
<!-- This section describes suitable use cases for the dataset. -->
This dataset is suitable for various applications, including:
- Backtesting trading strategies
- Training machine learning models for cryptocurrency price prediction
- Analyzing market trends and patterns
### Out-of-Scope Use
<!-- This section addresses misuse, malicious use, and uses that the dataset will not work well for. -->
This dataset is not intended for use in high-frequency trading or real-time market analysis, as the data is aggregated at a 1-minute granularity.
## Dataset Structure
<!-- This section provides a description of the dataset fields, and additional information about the dataset structure such as criteria used to create the splits, relationships between data points, etc. -->
The dataset consists of a single CSV file, ethusdt_1min.csv, with the following columns:
- **date**: datetime object representing the timestamp of each 1-minute candlestick
- **open_price**: opening price of each 1-minute candlestick
- **high_price**: highest price of each 1-minute candlestick
- **low_price**: lowest price of each 1-minute candlestick
- **close_price**: closing price of each 1-minute candlestick
- **volume**: trading volume of each 1-minute candlestick
## Dataset Creation
### Curation Rationale
<!-- Motivation for the creation of this dataset. -->
The dataset was created to provide a comprehensive and easily accessible resource for cryptocurrency market analysis and machine learning applications.
### Source Data
<!-- This section describes the source data (e.g. news text and headlines, social media posts, translated sentences, ...). -->
The source data was obtained from Binance's API, using the k-line daily endpoint with a granularity of 1 minute.
#### Data Collection and Processing
<!-- This section describes the data collection and processing process such as data selection criteria, filtering and normalization methods, tools and libraries used, etc. -->
The data was collected using a Python script, which aggregated the data from multiple CSV files into a single file. The script used the **pandas** library to read and process the data.
#### Who are the source data producers?
<!-- This section describes the people or systems who originally created the data. It should also include self-reported demographic or identity information for the source data creators if this information is available. -->
The source data was produced by Binance, a leading cryptocurrency exchange.
#### Annotation process
<!-- This section describes the annotation process such as annotation tools used in the process, the amount of data annotated, annotation guidelines provided to the annotators, interannotator statistics, annotation validation, etc. -->
No annotation process was involved in the creation of this dataset.
#### Who are the annotators?
<!-- This section describes the people or systems who created the annotations. -->
N/A
#### Personal and Sensitive Information
<!-- State whether the dataset contains data that might be considered personal, sensitive, or private (e.g., data that reveals addresses, uniquely identifiable names or aliases, racial or ethnic origins, sexual orientations, religious beliefs, political opinions, financial or health data, etc.). If efforts were made to anonymize the data, describe the anonymization process. -->
This dataset does not contain any personal or sensitive information.
## Bias, Risks, and Limitations
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
This dataset may contain biases and limitations inherent to the data collection process, such as:
- Data quality issues due to API errors or connectivity problems
- Limited coverage of the cryptocurrency market, as the dataset only includes data for the ETHUSDT pair
- Potential for data manipulation or tampering
### Recommendations
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
Users should be aware of these limitations and take necessary precautions when using the dataset. Additionally, users should verify the data quality and integrity before using it for any applications.
## Dataset Card Contact
thomas.erhel@gmail.com
提供机构:
thomaserhel
原始信息汇总
数据集卡片:ETHUSDT 1分钟数据集
数据集详情
数据集描述
该数据集包含从2023年5月1日至2024年5月1日期间,Binance现货市场上ETHUSDT对的1分钟粒度历史数据。数据格式为单个CSV文件,每行代表一个1分钟的蜡烛图。
- 策划者: Thomas Erhel
- 语言(NLP): 英语
- 许可证: GNU通用公共许可证v3.0或更高版本
适用用途
直接用途
该数据集适用于以下应用:
- 回测交易策略
- 训练机器学习模型进行加密货币价格预测
- 分析市场趋势和模式
非适用用途
该数据集不适用于高频交易或实时市场分析,因为数据是以1分钟粒度聚合的。
数据集结构
数据集由单个CSV文件 ethusdt_1min.csv 组成,包含以下列:
- date:表示每个1分钟蜡烛图时间戳的日期时间对象
- open_price:每个1分钟蜡烛图的开盘价
- high_price:每个1分钟蜡烛图的最高价
- low_price:每个1分钟蜡烛图的最低价
- close_price:每个1分钟蜡烛图的收盘价
- volume:每个1分钟蜡烛图的交易量
数据集创建
创建理由
该数据集的创建旨在为加密货币市场分析和机器学习应用提供一个全面且易于访问的资源。
源数据
源数据从Binance的API获取,使用1分钟粒度的k线每日端点。
数据收集和处理
数据通过Python脚本收集,该脚本使用pandas库将多个CSV文件聚合到一个文件中。
源数据生产者
源数据由领先的加密货币交易所Binance生产。
标注过程
该数据集的创建过程中未涉及标注过程。
标注者
N/A
个人和敏感信息
该数据集不包含任何个人或敏感信息。
偏差、风险和限制
该数据集可能包含数据收集过程中固有的偏差和限制,例如:
- 由于API错误或连接问题导致的数据质量问题
- 加密货币市场覆盖范围有限,因为数据集仅包括ETHUSDT对的数据
- 数据操纵或篡改的可能性
建议
用户应了解这些限制,并在使用数据集之前采取必要的预防措施。此外,用户在使用数据集之前应验证数据的质量和完整性。
搜集汇总
数据集介绍

以上内容由遇见数据集搜集并总结生成



