five

An example of a reverse-complementary distance-sensitive n-gram profile (RCDSNGP) representation with n = 4, 5, and 6 for a given sequence (AAGCTTGAGACACAGCT) with the reference subsequence marked in bold*.

收藏
Figshare2015-12-02 更新2026-04-29 收录
下载链接:
https://figshare.com/articles/dataset/_An_example_of_a_reverse_complementary_distance_sensitive_n_gram_profile_RCDSNGP_representation_with_n_4_5_and_6_for_a_given_sequence_AAGCTT_GAGACA_CAGCT_with_the_reference_subsequence_marked_in_bold_/1002690
下载链接
链接失效反馈
官方服务:
资源简介:
*Given an m-length sequence s = s1, s2… si…si+j…sm, the RCDSNGP of s with respect to an j-length reference subsequence x = si…si+j−1 is a set of K 2-tuples, denoted as RCDSNGP(s) RCDSNGP(s) = {({f1, r1, d1}, c1),({f2, r2, d2}, c2)…({fK, rK, dK}, cK)}, fk being a distinct n-gram, rk being the reverse complement of fk, dk being the relative distance parameter, and ck being the sum of frequency counts of fk and rk with the same dk relative to x in s. Each set in a 2-tuple ({fk, rk, dk}) is a reverse-complementary distance-sensitive n-gram (RCDSNG), or a feature in our study. This RCDSNGP representation was adopted for all training sequences. For testing processes, each sequence was converted to RCDSNGP first, and then represented according to the selected RCDSNGs generated from the training datasets, including those with zero count.
创建时间:
2015-12-02
二维码
社区交流群
二维码
科研交流群
商业服务