Criteo
收藏魔搭社区2025-11-20 更新2024-08-31 收录
下载链接:
https://modelscope.cn/datasets/OmniData/Criteo
下载链接
链接失效反馈官方服务:
资源简介:
displayName: Criteo (Display Advertising Challenge)
license:
- MIT
paperUrl: ""
publishDate: "2014"
publishUrl: https://ailab.criteo.com/display-advertising-challenge-criteo/
publisher:
- Criteo Research
tags:
- Advdertisment CTR
---
# 数据集介绍
## 简介
Criteo包含7天的点击数据,广泛用于CTR预测基准测试。Criteo数据集中有26个匿名分类字段和13个连续字段。显示广告是十亿美元的努力,也是机器学习在互联网上的主要用途之一。但是,其数据和方法通常保持在锁和钥匙下。在这项研究竞赛中,CriteoLabs将分享一周的数据,供您开发预测广告点击率 (CTR) 的模型。给定一个用户和他正在访问的页面,他点击给定广告的概率是多少?这项挑战的目标是为CTR估计提供最准确的ML算法基准。
## Download dataset
:modelscope-code[]{type="git"}
displayName: Criteo (Display Advertising Challenge)
license:
- MIT
paperUrl: ""
publishDate: "2014"
publishUrl: https://ailab.criteo.com/display-advertising-challenge-criteo/
publisher:
- Criteo Research
tags:
- Advertisement CTR
---
# Dataset Introduction
## Introduction
Criteo contains 7 days of click-through data, which is widely utilized as a benchmark for CTR prediction tasks. The Criteo dataset comprises 26 anonymized categorical fields and 13 continuous fields. Display advertising is a multi-billion-dollar industry and one of the core applications of machine learning on the Internet. However, its relevant data and technical methods are usually kept confidential. For this research competition, CriteoLabs released one week of such data to facilitate the development of models for predicting advertisement click-through rates (CTR). Given a user and the webpage they are browsing, what is the probability that they will click on a given advertisement? The goal of this challenge is to benchmark the most accurate ML algorithms for CTR estimation.
## Download dataset
:modelscope-code[]{type="git"}
提供机构:
maas
创建时间:
2024-07-11
搜集汇总
数据集介绍

背景与挑战
背景概述
Criteo是一个用于广告点击率预测的基准数据集,包含7天的点击数据,具有26个分类字段和13个连续字段,总大小为4.58GB。该数据集旨在为CTR估计提供最准确的机器学习算法基准。
以上内容由遇见数据集搜集并总结生成



