statistics-all-columns-V2.pl

Name: statistics-all-columns-V2.pl
Creator: Repositorio de datos de investigación de la Universidad de Chile
Published: 2024-06-22 14:57:32
License: No description provided

Source: DataCite Commons. Updated: 2024-06-22. Indexed: 2024-07-13.

Download link:

http://datos.uchile.cl/file.xhtml?persistentId=doi:10.34691/UCHILE/AYDRZL/XVWNOP


Description:

The script calculates a series of statistical values for each column of the indicated file. In this work it was used to estimate the variability of codon usage between proteobacterial genomes, based on standard deviations and on the distance between the 90th and 10th percentiles. The program additionally calculates other simple statistics for each column, such as the mean, median, skewness, kurtosis and the values of selected percentiles. The script requires the Perl module “Statistics::Descriptive” to be installed. The data file to be analyzed should be organized in tab-separated columns. The first column is skipped (it is assumed to contain the genome names or accession codes). The first line of the file must be commented and should contain the column titles. Other commented lines are ignored. Usage: perl statistics-all-columns-V2.pl
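The per-column computation described above can be illustrated with a minimal Python sketch. This is not the original Perl script and does not use Statistics::Descriptive. The function names, the `#` comment character, the linear-interpolation percentile and the moment-based skewness and kurtosis formulas are all assumptions chosen for illustration, so the numbers may differ slightly from those of the published script.

```python
import math
import statistics


def percentile(sorted_values, p):
    """Percentile by linear interpolation between closest ranks (assumed convention)."""
    if len(sorted_values) == 1:
        return sorted_values[0]
    rank = (len(sorted_values) - 1) * p / 100.0
    low, high = math.floor(rank), math.ceil(rank)
    return sorted_values[low] + (sorted_values[high] - sorted_values[low]) * (rank - low)


def column_statistics(values):
    """Summary statistics for one numeric column (needs at least two values)."""
    ordered = sorted(values)
    n = len(ordered)
    mean = statistics.fmean(ordered)
    # Population central moments; an assumed definition of skewness and excess kurtosis.
    m2 = sum((x - mean) ** 2 for x in ordered) / n
    m3 = sum((x - mean) ** 3 for x in ordered) / n
    m4 = sum((x - mean) ** 4 for x in ordered) / n
    p10, p90 = percentile(ordered, 10), percentile(ordered, 90)
    return {
        "mean": mean,
        "median": statistics.median(ordered),
        "stdev": statistics.stdev(ordered),  # sample standard deviation (n - 1)
        "skewness": m3 / m2 ** 1.5 if m2 > 0 else 0.0,
        "kurtosis": m4 / m2 ** 2 - 3.0 if m2 > 0 else 0.0,
        "p10": p10,
        "p90": p90,
        "p90_minus_p10": p90 - p10,
    }


def statistics_all_columns(lines, comment_char="#"):
    """Parse tab-separated lines and return {column title: statistics}."""
    titles = None
    columns = []
    for line in lines:
        line = line.rstrip("\r\n")
        if not line.strip():
            continue
        if line.startswith(comment_char):
            if titles is None:
                # The first commented line holds the column titles; drop the first one.
                fields = line.lstrip(comment_char).split("\t")[1:]
                titles = [field.strip() for field in fields]
            continue  # any other commented line is ignored
        fields = line.split("\t")[1:]  # skip the genome name / accession column
        if not columns:
            columns = [[] for _ in fields]
        for column, field in zip(columns, fields):
            column.append(float(field))
    if titles is None:
        titles = [f"column_{i + 2}" for i in range(len(columns))]
    return {title: column_statistics(col) for title, col in zip(titles, columns)}


if __name__ == "__main__":
    example = [
        "#genome\tGCA\tGCC\n",
        "# a commented line that is ignored\n",
        "g1\t1\t10\n",
        "g2\t2\t20\n",
        "g3\t3\t30\n",
        "g4\t4\t40\n",
        "g5\t5\t50\n",
    ]
    for title, stats in statistics_all_columns(example).items():
        print(title, stats)
```

In the sketch, `p90_minus_p10` corresponds to the distance between the 90th and 10th percentiles that the description names, alongside the standard deviation, as a measure of variability.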


Provider:

Repositorio de datos de investigación de la Universidad de Chile

Created:

2024-03-06
