
Reprocessed CAGE peaks generated by FANTOM5 (mm10 v3)_CAGE_peaks_expression

Source: DataCite Commons. Updated 2020-09-02; indexed 2024-07-25.
Download link:
https://figshare.com/articles/dataset/Re-processing_of_the_data_generated_by_the_FANTOM5_project_mm10_v3_CAGE_peaks_expression/4880312/2
Resource description:
<pre>
CAGE peaks
===

This directory contains the CAGE peaks defined in the FANTOM5 project as a
robust set, where genomic coordinates originally defined on mm9 are lifted
to mm10 by the liftOver tool. Following this update, their annotations are
also updated - see the “CAGE_peaks_annotation” directory in the parent
directory.

- Inquiries: fantom-help@gsc.riken.jp
- Update:
    2015-09-18 initial release
    2016-07-22 ver.2 release
    2017-04-14 ver.3 release

Data files
---

- mm10_liftover_CAGE_peaks_phase1and2.bed.gz : all lifted-over CAGE peaks using UCSC Lift-over
- mm10_fair_CAGE_peaks_phase1and2.bed : fairly remapped CAGE peaks
- mm10_problematic_CAGE_peaks_phase1and2.bed : problematic CAGE peaks that were filtered out by QC
- mm10_new_CAGE_peaks_phase1and2.bed : newly identified CAGE peaks in mm10 DPI clustering
- mm9_droppped_CAGE_peaks_phase1and2.bed : unmapped CAGE peaks (in mm9 coordinates)

Description of the columns in CAGE peaks coordinates files
---

The files are based on the BED9 format, where thickStart and thickEnd mark
the representative TSS position.
(https://genome.ucsc.edu/FAQ/FAQformat.html#format1)

- chromosome
- start of CAGE peak region
- end of CAGE peak region
- name (ID) of the CAGE peak
- score
- strand of the CAGE peak
- start of the representative TSS position
- end of the representative TSS position (Note: end is always start+1)
- rgb string for color coding (plus or minus strand only)
</pre>
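The nine-column layout described above can be read with a few lines of Python. The following is a minimal sketch, not part of the FANTOM5 distribution: the names `CagePeak`, `parse_bed9_line`, and `read_peaks` are hypothetical, the files are assumed to be tab-separated with an integer score (as in the UCSC BED specification), and the sample record at the bottom is synthetic rather than taken from the data.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator


@dataclass(frozen=True)
class CagePeak:
    """One record of a BED9 CAGE peak file (hypothetical container)."""
    chrom: str
    start: int       # start of the CAGE peak region
    end: int         # end of the CAGE peak region
    name: str        # name (ID) of the CAGE peak
    score: int       # assumed to be an integer, per the BED specification
    strand: str      # "+" or "-"
    tss_start: int   # thickStart: start of the representative TSS position
    tss_end: int     # thickEnd: end of the representative TSS position
    rgb: str         # color string encoding the strand


def parse_bed9_line(line: str) -> CagePeak:
    """Parse one tab-separated BED9 line into a CagePeak."""
    fields = line.rstrip("\r\n").split("\t")
    if len(fields) != 9:
        raise ValueError(f"expected 9 columns, found {len(fields)}")
    chrom, start, end, name, score, strand, thick_start, thick_end, rgb = fields
    peak = CagePeak(chrom, int(start), int(end), name, int(score),
                    strand, int(thick_start), int(thick_end), rgb)
    # The README states that the TSS end is always start + 1.
    if peak.tss_end != peak.tss_start + 1:
        raise ValueError(f"{name}: TSS end is not TSS start + 1")
    return peak


def read_peaks(lines: Iterable[str]) -> Iterator[CagePeak]:
    """Yield peaks, skipping blank lines and BED header/comment lines."""
    for line in lines:
        if not line.strip() or line.startswith(("#", "track", "browser")):
            continue
        yield parse_bed9_line(line)


# Synthetic example record (made up for illustration, not real data).
example = "chr1\t100\t120\tpeak_example\t0\t+\t110\t111\t255,0,0\n"
peak = parse_bed9_line(example)
print(peak.name, peak.chrom, peak.strand, peak.tss_start)
```

To read one of the uncompressed files, an open file handle can be passed directly, for example `with open("mm10_fair_CAGE_peaks_phase1and2.bed") as fh: peaks = list(read_peaks(fh))`. For the `.bed.gz` file, `gzip.open(path, "rt")` from the standard library provides a text-mode handle that works the same way.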

Provider:
figshare
Created:
2017-04-17