five

DLRM Recommendation Model Training

收藏
Databricks2024-05-09 收录
下载链接:
https://marketplace.databricks.com/details/db8353e3-ea71-4437-a80b-6f584cffa42b/Databricks_DLRM-Recommendation-Model-Training
下载链接
链接失效反馈
官方服务:
资源简介:
**Overview** This is a sample implementation of the [DLRM Recommendation Model](https://arxiv.org/abs/1906.00091) on Databricks with the following features: 1. [TorchRec](https://pytorch.org/torchrec/): for handling large datasets with many categorical features (where the embedding tables can't fit inside one GPU) 2. [TorchDistributor](https://docs.databricks.com/en/machine-learning/train-model/distributed-training/spark-pytorch-distributor.html): for doing distributed training on Databricks 3. [Mosaic StreamingDataset](https://docs.mosaicml.com/projects/streaming/en/stable/): for efficient data loading in a distributed environment **Use case** The DLRM model is an effective architecture for predicting click-through rates. The example provided uses a randomized synthetic dataset that has a similar layout to other publicly available datasets, such as the [Criteo](https://ailab.criteo.com/download-criteo-1tb-click-logs-dataset/) click logs dataset. The provided example notebooks can be updated to support those public datasets as well as your own recommendation use cases. **Product details** For more specific details, refer to the embedded notebook which contains a guide for how to train deep recommendation models on Databricks. **Licenses** - The implementation of DLRM on Databricks is based on this [repository](https://github.com/facebookresearch/dlrm/tree/main) by Facebook with the MIT License.
提供机构:
Databricks
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作