five

kawaii4mano/genz-slang-pairs-1k

收藏
Hugging Face2026-04-27 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/kawaii4mano/genz-slang-pairs-1k
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc task_categories: - text-generation - text2text-generation language: - en tags: - slang - style-transfer - informal-language - casual-dialogue - gen-z - colloquial-english - paraphrase - text-augmentation - synthetic-data - data-augmentation - social-media-language - chatbot-training - conversational-data - creative-writing - nlp - language-modeling - paired-data - slang-translation pretty_name: Gen Z Slang Pairs Corpus (1 K) size_categories: - 1K<n<10K --- # Gen Z Slang Pairs Corpus (1 K) The **Gen Z Slang Pairs Corpus (1 K)** contains **1,000** everyday English sentences alongside their Gen Z–style slang rewrites. This dataset is designed for **style-transfer**, **informal-language generation**, and **paraphrasing** research. Use it to train models that transform formal or neutral sentences into expressive, youth‑oriented slang. ## Dataset Details *This dataset was generated programmatically using OpenAI GPT-4.1 Nano.* * **Language:** English * **Format:** CSV file * **Filename:** `genz_dataset.csv` * **Columns:** * `normal`: Original everyday English sentence * `gen_z`: Corresponding slang rewrite in Gen Z style ## Repository Structure ``` genz-slang-pairs-1k/ ├── README.md # This dataset card ├── genz_dataset.csv # 1,000 rows, columns: normal, gen_z └── license.txt # Apache 2.0 license text ``` ## Usage Example ```python from datasets import load_dataset ds = load_dataset("Programmer-RD-AI/genz-slang-pairs-1k") print(ds["train"][0]) # { # "normal": "How’s it going?", # "gen_z": "Yo, what’s poppin’?" # } ``` ## Tasks & Evaluation * **Text Generation:** Evaluate with BLEU, ROUGE, or METEOR. * **Style Transfer:** Measure semantic similarity (e.g., BERTScore) and slang authenticity. * **Paraphrasing:** Use chrF or BLEU to assess paraphrase quality. ## Citation If you use this dataset, please cite: ```bibtex @misc{ranuga_disansa_gamage_2025, author = { Ranuga Disansa Gamage }, title = { genz-slang-pairs-1k (Revision 9728f69) }, year = 2025, url = { https://huggingface.co/datasets/Programmer-RD-AI/genz-slang-pairs-1k }, doi = { 10.57967/hf/5742 }, publisher = { Hugging Face } } ``` ## License This dataset is licensed under the **Apache 2.0 License**. See [license.txt](license.txt) for full text.
提供机构:
kawaii4mano
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作