five

Data Set of Existing Summary Statistics from Equipment Sensor Data

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/4533817
下载链接
链接失效反馈
官方服务:
资源简介:
This data set was generated in accordance with the semiconductor industry and contains sensor recordings from high-precision and high-tech production equipment. Basically, the semiconductor production consists of hundreds of process steps performing physical and chemical operations on so-called wafers, i.e. slices based on semiconductor material. Typically, bunches of wafers are aggregated into so-called lots of size 25, which always pass through the same operations in the production chain. In the production chain, each process equipment is equipped with several sensors recording physical parameters like gas flow, temperature, voltage, etc., resulting in so-called sensor data recorded during each process step. Out of these time-dependent sensor data, so called key numbers (KNs) are extracted using a certain time frame in the individual sensor recordings judged by experts to be important for the process. To keep the entire production as stable as possible, the KNs are used in order to intervene in case of deviations. After the production, each device on the wafer is tested in the most careful way resulting in so-called wafer test data. In some cases, suspicious patterns occur in the wafer test data potentially leading to failure. In this case the root cause must be found in the production chain. For this purpose, the given KNs are provided. The aim is to find correlations between the wafer test data and the KNs in order to identify the root cause. The given data is divided into three data sets: "process1.csv", "process2.csv" and "response.csv". "process1.csv" and "process2.csv" represent the extracted KNs from two process equipment. The "response.csv" data set contains the corresponding wafer test data. For the unique identification, the first two columns in each data set are the lot number and the wafer number respectively. The exact column structure is given as follows: for "process1.csv" and "process2.csv": lot:            the lot number wafer:            the wafer number KN1:        the recordings of the first sensor KN2:        the recordings of the second sensor . . . KN50:        the recordings of the last sensor "KN1"-"KN36" belongs to "process1" and "KN37"-"KN50" belongs to "process2". for "response.csv": lot:            the lot number wafer:            the wafer number response:        the numerical test values class:            the "good"/"bad" classification depending on the response value (threshold: 0,75)
创建时间:
2021-02-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作