SriRamanaAtmic/AtmicCrosslingual
收藏Hugging Face2026-04-16 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/SriRamanaAtmic/AtmicCrosslingual
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
- hi
- ta
- te
- bn
- gu
license: cc-by-4.0
task_categories:
- sentence-similarity
tags:
- advaita-vedanta
- ramana-maharshi
- multilingual
- embeddings
- spiritual-texts
pretty_name: AtmicCrosslingual
---
# AtmicCrosslingual
Cross-lingual alignment dataset for Atmic multilingual embedding model
Trained model: [SriRamanaAtmic/AtmicEmbeddingv1](https://huggingface.co/SriRamanaAtmic/AtmicEmbeddingv1)
## Splits
| Split | Records |
|-------|---------|
| train | 5,711 |
| validation | 173 |
| test | 1,800 |
## Schema
All columns are `string` type. Columns not applicable to a split are filled with `"Unknown"`.
Columns: `anchor`, `collection`, `en_query`, `lang1`, `lang2`, `positive`, `quality`, `source`, `type`
## Usage
```python
from datasets import load_dataset
ds = load_dataset("SriRamanaAtmic/AtmicCrosslingual")
print(ds["train"][0])
```
## Related Resources
| Resource | Link |
|----------|------|
| Embedding Model | [SriRamanaAtmic/AtmicEmbeddingv1](https://huggingface.co/SriRamanaAtmic/AtmicEmbeddingv1) |
| AtmicMLM | [datasets/SriRamanaAtmic/AtmicMLM](https://huggingface.co/datasets/SriRamanaAtmic/AtmicMLM) |
| AtmicContrastive | [datasets/SriRamanaAtmic/AtmicContrastive](https://huggingface.co/datasets/SriRamanaAtmic/AtmicContrastive) |
| AtmicCrosslingual | [datasets/SriRamanaAtmic/AtmicCrosslingual](https://huggingface.co/datasets/SriRamanaAtmic/AtmicCrosslingual) |
提供机构:
SriRamanaAtmic



