Flagging False Positives Following Untargeted LC–MS Characterization of Histone Post-Translational Modification Combinations

Name: Flagging False Positives Following Untargeted LC–MS Characterization of Histone Post-Translational Modification Combinations
Creator: ACS Publications
Published: 2023-05-31 00:00:00
License: 暂无描述

acs.figshare.com2023-05-31 更新2025-01-21 收录

下载链接：

https://acs.figshare.com/articles/dataset/Flagging_False_Positives_Following_Untargeted_LC_MS_Characterization_of_Histone_Post-Translational_Modification_Combinations/4291295/1

下载链接

链接失效反馈

官方服务：

资源简介：

Epigenetic changes can be studied with an untargeted characterization of histone post-translational modifications (PTMs) by liquid chromatography–mass spectrometry (LC–MS). While prior information about more than 20 types of histone PTMs exists, little is known about histone PTM combinations (PTMCs). Because of the combinatorial explosion it is intrinsically impossible to consider all potential PTMCs in a database search. Consequentially, high-scoring false positives with unconsidered but correct alternative isobaric PTMCs can occur. Current quality controls can neither estimate the amount of unconsidered alternatives nor flag potential false positives. Here, we propose a conceptual workflow that provides such options. In this workflow, an in silico modeling of all candidate isoforms with known-to-exist PTMs is made. The most frequently occurring PTM sets of these candidate isoforms are determined and used in several database searches. This suppresses the combinatorial explosion while considering as many candidate isoforms as possible. Finally, annotations can be classified as unique or ambiguous, the latter implying false positives. This workflow was evaluated on an LC–MS data set containing 44 histone extracts. We were able to consider 60% of all candidate isoforms. Importantly, 40% of all annotations were classified as ambiguous. This highlights the need for a more thorough evaluation of modified peptide annotations.

表观遗传学变化可通过液相色谱-质谱联用（LC-MS）技术对组蛋白翻译后修饰（PTM）进行非定向表征进行研究。尽管已知超过20种类型的组蛋白PTM，但对于组蛋白PTM组合（PTMCs）的了解却甚少。由于组合爆炸效应，在数据库搜索中考虑所有潜在的PTMCs在本质上是不可能的。因此，可能会出现高评分的假阳性结果，其中包含未考虑但正确的同位素质量比PTMCs。目前的质量控制方法既无法估算未考虑的替代方案的数量，也无法标记潜在的假阳性结果。在此，我们提出了一种概念性工作流程，提供了此类选项。在此工作流程中，对已知存在的PTMs的所有候选异构体进行计算机模拟建模。确定这些候选异构体中最频繁出现的PTM集，并用于多次数据库搜索。这种方法在考虑尽可能多的候选异构体的同时，抑制了组合爆炸。最终，注释可以被分类为独特或模糊，后者暗示了假阳性。该工作流程在包含44个组蛋白提取物的大型LC-MS数据集上进行了评估。我们能够考虑所有候选异构体的60%。值得注意的是，所有注释中有40%被分类为模糊。这突显了对修饰肽注释进行更彻底评估的必要性。

提供机构：

ACS Publications

5,000+

优质数据集

54 个

任务类型

进入经典数据集