five

Data and code files for the Adaptive Skip-Train Structured Ensemble for Temporal Networks

收藏
DataCite Commons2024-03-24 更新2024-07-27 收录
下载链接:
https://springernature.figshare.com/articles/dataset/Data_and_code_files_for_the_Adaptive_Skip-Train_Structured_Ensemble_for_Temporal_Networks/5444500
下载链接
链接失效反馈
官方服务:
资源简介:
This fileset contains the data and source code related to the paper: <br>Pavlovski, M., Zhou, F., Stojkovic, I., Kocarev, L., &amp; Obradovic, Z. <b>"Adaptive Skip-Train Structured Regression for Temporal Networks"</b>, Proc. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2017<br><br>The code files contain the experimental setups and code for running the various models:<br><b>AST-SE: </b>Adaptive Skip-Train Structured Ensemble, a sampling-based structured regression ensemble for prediction on top of temporal networks<b>LR:</b> An L1-regularized linear regression. LR was employed as an unstructured predictor for each of the following models in order to achieve efficiency.<b>GCRF: </b>Standard GCRF model that enables the chosen unstructured predictor to learn the network structure. <b>SE: </b>Structured ensemble composed of multiple GCRF models. <b>WSE: </b>Weighted structured ensemble that combines the predictions of multiple GCRFs in a weighted mixture in order to predict the nodes' outputs in the next timestep.<br>The data file <b>H3N2_data.mat </b>contains temporally collected gene expression measurements (12,032 genes) of a human subject infected with the H3N2 virus.<br>For further details see the related Conference paper.<br>All code is written in MATLAB and is available in <b>.m</b> format files. Raw code can be accessed from these files using openly-accessible text edit software. Data are provided in <b>.mat </b>format, accessible using the MATLAB computing environment.<br><b>Background </b>A broad range of high impact applications involve learning a predictive model in a temporal network environment. In weather forecasting, predicting effectiveness of treatments, outcomes in healthcare and in many other domains, networks are often large, while intervals between consecutive time moments are brief. Therefore, models are required to forecast in a more scalable and efficient way, without compromising accuracy. The Gaussian Conditional Random Field (GCRF) is a widely used graphical model for performing structured regression on networks. However, GCRF is not applicable to large networks and it cannot capture different network substructures (communities) since it considers the entire network while learning. In this study, we present a novel model, Adaptive Skip-Train Structured Ensemble (AST-SE), which is a sampling-based structured regression ensemble for prediction on top of temporal networks. AST-SE takes advantage of the scheme of ensemble methods to allow multiple GCRFs to learn from several subnetworks. The proposed model is able to automatically skip the entire training or some phases of the training process. The prediction accuracy and efficiency of AST-SE were assessed and compared against alternatives on synthetic temporal networks and the H3N2 Virus Influenza network. The obtained results provide evidence that (1) AST-SE is ~140 times faster than GCRF as it skips retraining quite frequently; (2) It still captures the original network structure more accurately than GCRF while operating solely on partial views of the network; (3) It outperforms both unweighted and weighted GCRF ensembles which also operate on sub- networks but require retraining at each timestep.<br>
提供机构:
figshare
创建时间:
2017-11-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作