WebRequests
收藏DataCite Commons2024-04-23 更新2024-07-13 收录
下载链接:
https://redata.anii.org.uy/citation?persistentId=doi:10.60895/redata/RWUUSV
下载链接
链接失效反馈官方服务:
资源简介:
A dataset of labeled requests assembled from several public datasets, namely, Malicious-URLs, PKDD, and CSIC 2010 (also included). To merge the datasets, only the URI of each web request was used. To construct a feature vector to train the networks, each URI was tokenized in unigrams following a bag-of-words approach. For each URI, the values of the unigrams were computed using term frequency–inverse document frequency (TF–IDF). Each URI was represented by an l1-normalized vector composed of the 500 most frequent tokens across the entire dataset.
提供机构:
Repositorio de datos abiertos de investigación de Uruguay
创建时间:
2024-04-03



