Evaluating validity of synthetic data generation scheme with respect to real patient data (removal of low frequency variants based on mean effective coverage).
收藏NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://figshare.com/articles/dataset/Evaluating_validity_of_synthetic_data_generation_scheme_with_respect_to_real_patient_data_removal_of_low_frequency_variants_based_on_mean_effective_coverage_/19674911
下载链接
链接失效反馈官方服务:
资源简介:
To examine the specification of the synthetic data generation process, we iterated the VAF distribution feature vectors from each patient through the entire simulated training data set of 40 million synthetic tumours, searching for the nearest neighbours based on euclidean distance. We provide overlaid histograms of each patient VAF distribution with the closest nearest neighbour. All samples can be examined by using the dropdown menu. The feature vectors for this analysis were generated by using the mean effective coverage (mean sequencing depth * purity) to remove low frequency variants as described in Methods.
(ZIP)
创建时间:
2022-04-28



