five

bert-base-chinese.zip

收藏
阿里云天池2026-05-16 更新2024-03-07 收录
下载链接:
https://tianchi.aliyun.com/dataset/150782
下载链接
链接失效反馈
官方服务:
资源简介:
bert-base-chinese Copied like 295 Fill-Mask PyTorch TensorFlow JAX Safetensors Transformers Chinese bert AutoTrain Compatible arxiv: 1810.04805 Model card Files and versions Community 9 Bert-base-chinese Table of Contents Model Details Uses Risks, Limitations and Biases Training Evaluation How to Get Started With the Model Model Details Model Description This model has been pre-trained for Chinese, training and random input masking has been applied independently to word pieces (as in the original BERT paper). Developed by: HuggingFace team Model Type: Fill-Mask Language(s): Chinese License: [More Information needed] Parent Model: See the BERT base uncased model for more information about the BERT base model. Model Sources Paper: BERT Uses Direct Use This model can be used for masked language modeling Risks, Limitations and Biases CONTENT WARNING: Readers should be aware this section contains content that is disturbing, offensive, and can propagate historical and current stereotypes. Significant research has explored bias and fairness issues with language models (see, e.g., Sheng et al. (2021) and Bender et al. (2021)). Training Training Procedure type_vocab_size: 2 vocab_size: 21128 num_hidden_layers: 12 Training Data [More Information Needed] Evaluation Results [More Information Needed] How to Get Started With the Model from transformers import AutoTokenizer, AutoModelForMaskedLM tokenizer = AutoTokenizer.from_pretrained("bert-base-chinese") model = AutoModelForMaskedLM.from_pretrained("bert-base-chinese"
提供机构:
阿里云天池
创建时间:
2023-04-13
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
bert-base-chinese.zip 是一个由HuggingFace团队开发的中文预训练语言模型,属于填充掩码(Fill-Mask)类型,采用与原始BERT相同的词片段随机掩码训练方法。该模型包含12个隐藏层,词汇表大小为21128,专门用于中文掩码语言建模任务。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作