five

Code and Data for "Multiple re-reads of single proteins at single-amino-acid resolution using nanopores"

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/5506739
下载链接
链接失效反馈
官方服务:
资源简介:
The primary structures containing data and analysis products are peptidereads_fig2.mat (for figure 2) and peptiderereads_fig3.mat (for figure 3). The main analysis scripts for these data structures are callvariants_fig2.m and reread_analysis_fig3.m respectively. Data for figures S6 (S6_reread_data.dat) and S8 (S8_hetero_data.dat), and the analysis script used to produce figure S6 (S6_reread_analysis.m) are also included. Other files are dependencies of these main scripts.   The fields in peptidereads_fig2 are as follows:     folder, eventnum, reducedStart, reducedEnd, suspicious, hasreread: notes for internal use variant: the true identity of the single-amino-acid substitution variant data: the ion current data for each read downsampled to 5 kHz. omit: whether the read was omitted from analysis due to length relativeDNAend: the index in the data where the DNA portion of the read ends. relativeLinkerEnd: the index in the data where the linker portion of the read ends. DNAlevels, Peplevels, Alllevels: extracted ion current levels for the DNA region, the peptide region, and everything. cal: the multiplicative and additive constants applied to calibrate the read caldata: the data with calibration constants applied cons0D, cons0W, cons0G, cons0DNA: initial guesses for consensuses based on hand curation of data. pepDcons0, pepWcons0, pepGcons0: the portion of the handmade consensus with the variant levels. pepDcons, pepWcons, pepGcons: the portion of the iterated consensus with the variant levels. inhandconsensus: whether the read was used in generation of the inital guess consensuses. inconsensus: whether the read was used in generation of either the initial guess or iterated consensuses. confidence: the relative likelihood of each variant assigned to the read incalls: whether the data was used in variant calling (i.e., not used in consensus generation) params: the analysis parameters used
创建时间:
2021-10-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作