five

Simulated datasets for detector and particle flow reconstruction: CLD detector model for FCC-ee

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14930757
下载链接
链接失效反馈
官方服务:
资源简介:
Data description Datasets generated using Key4HEP and the CLD detector model for FCC-ee suitable for particle flow reconstruction studies. The datasets contain generator particles, reconstructed tracks and calorimeter hits, reconstructed Pandora PF particles and their respective links in the EDM4HEP format. The following processes have been simulated with Pythia 8: p8_ee_tt_ecm365: ee -> ttbar, center of mass energy at 365 GeV The detector simulation has been done with Geant4, the reconstruction with Marlin interfaced via Key4HEP which includes PF reconstruction with Pandora, all using publicly available models and code.   Contents This record includes the following files: p8_ee_tt_ecm365_rootfiles.tgz: small subset of files suitable for testing dataset_full.txt: the full list of files, hosted on the EOS storage system at CERN, ~2TB total p8_ee_tt_ecm365.cmd: the Pythia8 card pythia.py: the pythia steering code for Key4HEP run_sim.sh: the steering script for generating, simulating and reconstructing a single file of 100 events from the p8_ee_tt_ecm365 PandoraSettings.zip: the settings used for Pandora PF reconstruction CLDReconstruction.py: the steering configuration of the reconstruction modules in Key4HEP cld_steer.py: the steering configuration of the Geant4 simulation modules in Key4HEP   Dataset semantics Each file consists of event records. Each event contains structured branches of the relevant physics data. The branches relevant to particle flow reconstruction include: MCParticles: the ground truth generator particles ECALBarrel, ECALEndcap, ECALOther, HCALBarrel, HCALEndcap, HCALOther, MUON: reconstructed hits in the various calorimeter subsystems SiTracks_Refitted: the reconstructed tracks PandoraClusters: the calorimeter hits, clustered by Pandora to calorimeter clusters MergedRecoParticles: the reconstructed particles from the Pandora particle flow algorithm CalohitMCTruthLink: the links between MC particles and reconstructed calorimeter hits SiTracksMCTruthLink: the links between MC particles and reconstructed tracks The full details of the EDM4HEP format are available here.   Dataset characteristics The p8_ee_tt_ecm365_rootfiles.tgz consists of about 50k events stored in a total of 500 files, 22GB in the ROOT EDM4HEP format. The full dataset is hosted on the EOS storage system at CERN, ~2TB total.   Detector geometry The key4hep CLD geometry version used to generate the events is CLD_o2_v05 and can be found on github here.   How can you use these data? The ROOT files can be directly loaded with the uproot Python library.   Disclaimer These are simulated samples suitable for conceptual machine learning R&D and software performance studies. They have not been calibrated with respect to real data, and should not be used to derive physics projections about the detectors. Neither FCC nor CERN endorse any works, scientific or otherwise, produced using these data. All releases will have a unique DOI that you are requested to cite in any applications or publications.
创建时间:
2025-02-26
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作