five

STEAD subsample 4 CDiffSD

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10972600
下载链接
链接失效反馈
官方服务:
资源简介:
STEAD Subsample Dataset for CDiffSD Training Overview This dataset is a subsampled version of the STEAD dataset, specifically tailored for training our CDiffSD model (Cold Diffusion for Seismic Denoising). It consists of four HDF5 files, each saved in a format that requires Python's `h5py` method for opening. Dataset Files The dataset includes the following files: train: Used for both training and validation phases (with validation train split). Contains earthquake ground truth traces. noise_train: Used for both training and validation phases. Contains noise used to contaminate the traces. test: Used for the testing phase, structured similarly to train. noise_test: Used for the testing phase, contains noise data for testing. Each file is structured to support the training and evaluation of seismic denoising models. Data The HDF5 files named noise contain two main datasets: traces: This dataset includes N number of events, with each event being 6000 in size, representing the length of the traces. Each trace is organized into three channels in the following order: E (East-West), N (North-South), Z (Vertical). metadata: This dataset contains the names of the traces for each event. Similarly, the train and test files, which contain earthquake data, include the same traces and metadata datasets, but also feature two additional datasets: p_arrival: Contains the arrival indices of P-waves, expressed in counts. s_arrival: Contains the arrival indices of S-waves, also expressed in counts. Usage To load these files in a Python environment, use the following approach: ```python import h5pyimport numpy as np # Open the HDF5 file in read modewith h5py.File('train_noise.hdf5', 'r') as file:    # Print all the main keys in the file    print("Keys in the HDF5 file:", list(file.keys()))     if 'traces' in file:        # Access the dataset        data = file['traces'][:10]  # Load the first 10 traces     if 'metadata' in file:        # Access the dataset        trace_name = file['metadata'][:10]  # Load the first 10 metadata entries``` Ensure that the path to the file is correctly specified relative to your Python script. Requirements To use this dataset, ensure you have Python installed along with the Pandas library, which can be installed via pip if not already available: ```bashpip install numpypip install h5py```
创建时间:
2024-04-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作