Comprehensive Skincare Product Data Extracted from Dermstore – Ingredient List in JSON Format|护肤产品数据集|成分分析数据集

www.crawlfeeds.com2024-11-03 更新2025-03-22 收录

护肤产品

成分分析

下载链接：

https://www.crawlfeeds.com/datasets/comprehensive-skincare-product-data-from-dermstore-ingredient-list-in-json

下载链接

链接失效反馈

资源简介：

This dataset contains meticulously extracted information on 100+ skincare products from Dermstore, one of the leading online skincare retailers. Designed for beauty analysts, researchers, and data scientists, this dataset provides a deep dive into ingredient compositions, product categories, and targeted skin concerns. Each entry is enriched with essential details, including full ingredient lists, enabling in-depth ingredient analysis and skincare product comparisons. Data Highlights: <ul> <li>Ingredient Listings: Comprehensive ingredient details for each product, ideal for ingredient-based research and analysis.</li> <li>Product and Brand Details: Product name, brand, category (e.g., cleansers, serums, sunscreens), and skin concerns (such as acne, aging, or sensitivity) offer full product context.</li> <li>User Ratings and Reviews (where available): Consumer ratings add valuable insights into product performance, satisfaction, and ingredient effectiveness.</li> </ul> Dataset Scope: The data includes essential product details and ingredient information for a variety of skincare products available on Dermstore. It covers skincare categories such as moisturizers, cleansers, serums, and sunscreens, making it a versatile tool for anyone interested in the ingredients behind skincare. Dataset Use Cases: <ol> <li>Trend Analysis: Identify ingredient trends, such as the popularity of specific actives like hyaluronic acid, niacinamide, or retinol.</li> <li>Recommendation Models: Build recommendation algorithms for skincare products based on ingredient compatibility and skin concerns.</li> <li>Product Comparison: Conduct comparative studies on products with similar or unique ingredients.</li> <li>Health and Safety Analysis: Research ingredient safety, allergens, and natural vs. synthetic composition trends.</li> </ol> Ideal For: <ul> <li>Data Scientists: For projects in predictive modeling, trend analysis, and building recommendation systems.</li> <li>Beauty and Health Researchers: For ingredient efficacy studies, consumer trend analysis, and product formulation research.</li> <li>Skincare Enthusiasts: Provides insights into product compositions and ingredient effectiveness.</li> </ul>

本数据集收录了来自Dermstore，一家领先的在线护肤零售商，关于100多个护肤产品的精心提取信息。专为美容分析师、研究人员和数据科学家设计，该数据集深入探讨了成分组成、产品类别以及针对的皮肤问题。每个条目都包含了详尽的详细信息，包括完整的成分列表，从而实现了深入成分分析和护肤产品的比较。 数据亮点： <ul> <li>成分列表：每个产品的全面成分细节，非常适合基于成分的研究和分析。</li> <li>产品和品牌详情：产品名称、品牌、类别（例如，洁面乳、精华液、防晒霜）以及皮肤问题（如痤疮、衰老或敏感性）提供了全面的产品背景。</li> <li>用户评分和评论（如有）：消费者评分为产品性能、满意度和成分有效性提供了宝贵的见解。</li> </ul> 数据集范围： 数据包括Dermstore上各种护肤产品的必要产品详情和成分信息。涵盖了护肤类别，如保湿霜、洁面乳、精华液和防晒霜，使其成为任何对护肤成分感兴趣的人的通用工具。 数据集应用场景： <ol> <li>趋势分析：识别成分趋势，例如透明质酸、烟酰胺或视黄醇等特定活性成分的流行程度。</li> <li>推荐模型：基于成分兼容性和皮肤问题构建护肤产品的推荐算法。</li> <li>产品比较：对具有相似或独特成分的产品进行对比研究。</li> <li>健康和安全分析：研究成分安全性、过敏原以及天然与合成成分组成的趋势。</li> </ol> 适用对象： <ul> <li>数据科学家：适用于预测建模、趋势分析和构建推荐系统等项目的项目。</li> <li>美容与健康研究人员：适用于成分功效研究、消费者趋势分析和产品配方研究。</li> <li>护肤爱好者：提供了关于产品组成和成分有效性的见解。</li> </ul>

提供机构：

Crawl Feeds

用户留言

有没有相关的论文或文献参考？

这个数据集是基于什么背景创建的？

数据集的作者是谁？

能帮我联系到这个数据集的作者吗？

这个数据集如何下载？

点击留言

数据主题

具身智能

数据集 4099个

机构 8个

大模型

数据集 439个

机构 10个

无人机

数据集 37个

机构 6个

指令微调

数据集 36个

机构 6个

蛋白质结构

数据集 50个

机构 8个

空间智能

数据集 21个

机构 5个

5,000+

优质数据集

54 个

任务类型

进入经典数据集

热门数据集

中国区域地面气象要素驱动数据集 v2.0（1951-2024）

中国区域地面气象要素驱动数据集（China Meteorological Forcing Data，以下简称 CMFD）是为支撑中国区域陆面、水文、生态等领域研究而研发的一套高精度、高分辨率、长时间序列数据产品。本页面发布的 CMFD 2.0 包含了近地面气温、气压、比湿、全风速、向下短波辐射通量、向下长波辐射通量、降水率等气象要素，时间分辨率为 3 小时，水平空间分辨率为 0.1°，时间长度为 74 年（1951~2024 年），覆盖了 70°E~140°E，15°N~55°N 空间范围内的陆地区域。CMFD 2.0 融合了欧洲中期天气预报中心 ERA5 再分析数据与气象台站观测数据，并在辐射、降水数据产品中集成了采用人工智能技术制作的 ISCCP-ITP-CNN 和 TPHiPr 数据产品，其数据精度较 CMFD 的上一代产品有显著提升。 CMFD 历经十余年的发展，其间发布了多个重要版本。2019 年发布的 CMFD 1.6 是完全采用传统数据融合技术制作的最后一个 CMFD 版本，而本次发布的 CMFD 2.0 则是 CMFD 转向人工智能技术制作的首个版本。此版本与 1.6 版具有相同的时空分辨率和基础变量集，但在其它诸多方面存在大幅改进。除集成了采用人工智能技术制作的辐射和降水数据外，在制作 CMFD 2.0 的过程中，研发团队尽可能采用单一来源的再分析数据作为输入并引入气象台站迁址信息，显著缓解了 CMFD 1.6 中因多源数据拼接和气象台站迁址而产生的虚假气候突变。同时，CMFD 2.0 数据的时间长度从 CMFD 1.6 的 40 年大幅扩展到了 74 年，并将继续向后延伸。CMFD 2.0 的网格空间范围虽然与 CMFD 1.6 相同，但其有效数据扩展到了中国之外，能够更好地支持跨境区域研究。为方便用户使用，CMFD 2.0 还在基础变量集之外提供了若干衍生变量，包括近地面相对湿度、雨雪分离降水产品等。此外，CMFD 2.0 摒弃了 CMFD 1.6 中通过 scale_factor 和 add_offset 参数将实型数据化为整型数据的压缩技术，转而直接将实型数据压缩存储于 NetCDF4 格式文件中，从而消除了用户使用数据时进行解压换算的困扰。本数据集原定版本号为 1.7，但鉴于本数据集从输入数据到研制技术都较上一代数据产品有了大幅的改变，故将其版本号重新定义为 2.0。CMFD 2.0 的数据内容与此前宣传的 CMFD 1.7 基本一致，仅对 1983 年 7 月以后的向下短/长波辐射通量数据进行了更新，以修正其长期趋势存在的问题。

国家青藏高原科学数据中心收录

lmarena-ai/arena-hard-auto-v0.1

--- license: apache-2.0 dataset_info: features: - name: question_id dtype: string - name: category dtype: string - name: cluster dtype: string - name: turns list: - name: content dtype: string splits: - name: train num_bytes: 251691 num_examples: 500 download_size: 154022 dataset_size: 251691 configs: - config_name: default data_files: - split: train path: data/train-* --- ## Arena-Hard-Auto **Arena-Hard-Auto-v0.1** ([See Paper](https://arxiv.org/abs/2406.11939)) is an automatic evaluation tool for instruction-tuned LLMs. It contains 500 challenging user queries sourced from Chatbot Arena. We prompt GPT-4-Turbo as judge to compare the models' responses against a baseline model (default: GPT-4-0314). Notably, Arena-Hard-Auto has the highest *correlation* and *separability* to Chatbot Arena among popular open-ended LLM benchmarks ([See Paper](https://arxiv.org/abs/2406.11939)). If you are curious to see how well your model might perform on Chatbot Arena, we recommend trying Arena-Hard-Auto. Please checkout our GitHub repo on how to evaluate models using Arena-Hard-Auto and more information about the benchmark. If you find this dataset useful, feel free to cite us! ``` @article{li2024crowdsourced, title={From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline}, author={Li, Tianle and Chiang, Wei-Lin and Frick, Evan and Dunlap, Lisa and Wu, Tianhao and Zhu, Banghua and Gonzalez, Joseph E and Stoica, Ion}, journal={arXiv preprint arXiv:2406.11939}, year={2024} } ```

hugging_face 收录

MeSH

MeSH（医学主题词表）是一个用于索引和检索生物医学文献的标准化词汇表。它包含了大量的医学术语和概念，用于描述医学文献中的主题和内容。MeSH数据集包括主题词、副主题词、树状结构、历史记录等信息，广泛应用于医学文献的分类和检索。

www.nlm.nih.gov 收录

中国交通事故深度调查（CIDAS）数据集

交通事故深度调查数据通过采用科学系统方法现场调查中国道路上实际发生交通事故相关的道路环境、道路交通行为、车辆损坏、人员损伤信息，以探究碰撞事故中车损和人伤机理。目前已积累深度调查事故10000余例，单个案例信息包含人、车、路和环境多维信息组成的3000多个字段。该数据集可作为深入分析中国道路交通事故工况特征，探索事故预防和损伤防护措施的关键数据源，为制定汽车安全法规和标准、完善汽车测评试验规程、

北方大数据交易中心收录

中国逐日格点降水数据集V2（1960–2024，0.1°）

CHM_PRE V2数据集是一套高精度的中国大陆逐日格点降水数据集。该数据集基于1960年至今共3476个观测站的长期日降水观测数据，并纳入11个降水相关变量，用于表征降水的相关性。数据集采用改进的反距离加权方法，并结合基于机器学习的LGBM算法构建。CHM_PRE V2与现有的格点降水数据集（包括CHM_PRE V1、GSMaP、IMERG、PERSIANN-CDR和GLDAS）表现出良好的时空一致性。数据集基于63,397个高密度自动雨量站2015–2019年的观测数据进行验证，发现该数据集显著提高了降水测量精度，降低了降水事件的高估，为水文建模和气候评估提供了可靠的基础。CHM_PRE V2 数据集提供分辨率为0.1°的逐日降水数据，覆盖整个中国大陆（18°N–54°N，72°E–136°E）。该数据集涵盖1960–2024年，并将每年持续更新。日值数据以NetCDF格式提供，为了方便用户，我们还提供NetCDF和GeoTIFF格式的年度和月度总降水数据。

国家青藏高原科学数据中心收录