five

prnshv/teleembed-bench-main

收藏
Hugging Face2026-04-08 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/prnshv/teleembed-bench-main
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en pretty_name: TeleEmbed Benchmark (Main) tags: - retrieval - telecommunications - benchmarking license: apache-2.0 size_categories: - 1K<n<10K --- # TeleEmbed Benchmark — **Main** track Dense **embedding retrieval** benchmark over telecommunications corpora (**O-RAN**, **3GPP**, **srsRAN**). This repo ships **passage corpora** (`chunks.json`) and **labeled queries** (`benchmark_*.json`). **There is no fixed score in the files:** you choose a **sentence embedding model**, encode passages and questions, run retrieval, then read off MRR / Recall@K for **that** model. **Companion dataset:** the **Clean** track (different benchmark JSON, **full copy** of the same passage corpora under its own `TeleEmbed-Clean/` tree) is published as a separate standalone dataset. Link it from this card when the URL is set, e.g. `https://huggingface.co/datasets/<your_org>/<your_clean_dataset>`. --- ## What you must specify: the embedding model 1. Install deps (`requirements.txt` includes `sentence-transformers`). 2. Run `scripts/evaluate_retrieval.py` with **`--model <id-or-path>`** — any [SentenceTransformer](https://www.sbert.net/)–compatible checkpoint (e.g. `intfloat/e5-base-v2`, `BAAI/bge-small-en-v1.5`, or a local folder). 3. The script encodes **every corpus chunk** and **every question** with that same model, L2-normalizes, ranks by dot product, and prints metrics. 4. **Publish or cite the exact `--model` string** (and corpus, `chunk_size`, `track`) next to your numbers. Omitting `--model` uses a tiny default only for smoke tests. --- ## Layout (this repo) ``` main/ oran/chunks/<512|1024|2048>/chunks.json oran/benchmark_512.json (and 1024, 2048) 3gpp/... srsran/... scripts/ evaluate_retrieval.py paths.py requirements.txt .gitattributes ``` --- ## Task For each sample: `question` = query embedding; corpus = all `chunk_text` in `chunks.json`; gold = `chunk_id`. Metrics: **MRR**, **Recall@K** (see script defaults for K). --- ## Quick start (scoring) ```bash python -m venv .venv && source .venv/bin/activate pip install -U pip && pip install -r requirements.txt cd scripts python evaluate_retrieval.py --corpus oran --track main --chunk-size 512 \ --model intfloat/e5-base-v2 ``` Use `--device cuda`, `--batch-size`, `--top-k`, `--save-report`, `--max-samples` as needed. --- ## Hugging Face download ```bash git clone https://huggingface.co/datasets/<YOUR_USER>/<THIS_REPO> cd <THIS_REPO> ``` Use Git LFS (`git lfs install` before clone if needed). --- ## Citation Cite this dataset URL/DOI once published, and the underlying specifications as appropriate.
提供机构:
prnshv
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作