five

Sub-pixel accuracy in electron detection using a convolutional neural network

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/3635922
下载链接
链接失效反馈
官方服务:
资源简介:
Abstract Modern direct electron detectors (DEDs) provided a giant leap in the use of cryogenic electron microscopy (cryo-EM) to study the structures of macromolecules and complexes thereof. However, the currently available commercial DEDs, all based on the monolithic active pixel sensor, still require relative long exposure times and their best results have been obtained at 300 keV. There is a need for pixelated electron counting detectors that can be operated at a broader range of energies, at higher throughput and higher dynamic range. Hybrid Pixel Detectors (HPDs) of the Medipix family were reported to be unsuitable for cryo-EM at energies above 80 keV as those electrons would affect too many pixels. Here we show that the Timepix3, part of the Medipix family, can be used for cryo-EM applications at higher energies. We tested Timepix3 detectors on a 200 keV FEI Tecnai Arctica microscope and a 300 keV FEI Tecnai G2 Polara microscope. A correction method was developed to correct for per-pixel differences in output. Timepix3 data were simulated for individual electron events using the package Geant4Medipix. Global statistical characteristics of the simulated detector response were in good agreement with experimental results. A convolutional neural network (CNN) was trained using the simulated data to predict the incident position of the electron within a pixel cluster. After training, the CNN predicted, on average, .39 pixel and 0.42 pixel from the incident electron position for 200 keV and 300 keV electrons respectively. The CNN improved the MTF of experimental data at half Nyquist from 0.39 to 0.70 at 200 keV, and from 0.06 to 0.65 at 300 keV respectively. We illustrate that the useful dose-lifetime of a protein can be measured within a 1 second exposure using Timepix3. Data description Data has been split up in experimental data, simulations, neural net models and ToT correction results. In general: each directory contains individual READMEs with steps how to reproduce the data. Experimental data For each type of data at 200 kV or 300 kV only the input raw data has been added and the resulting image file. Intermediate files have been left out. Models The models are the CNN models generated at 200 and 300 kV. Simulated data The simulated data consists of the dataset used for training the neural network and the indepedently simulated validation set.  ToT correction The ToT correction file only contain the resulting correction matrix. The experimental flat field data has been left out, due to its volume. It's about 300 GiB of data for both 200 and 300 kV. Software  The software used can be found as related identifiers to this deposit.
创建时间:
2020-03-03
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作