alwaysgood/earnings_call_mono
Source: Hugging Face. Updated 2026-03-21; indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/alwaysgood/earnings_call_mono
Resource description:
---
pretty_name: Motley Fool Earnings Call Mono (Private)
language:
- en
license: other
size_categories:
- 100K<n<1M
tags:
- finance
- earnings-call
- transcripts
- monodata
---
# Motley Fool Earnings Call Mono (Private)
This private dataset contains cleaned monolingual earnings-call text chunks prepared from:
- Kaggle dataset: `tpotterer/motley-fool-scraped-earnings-call-transcripts`
## Split
- `train`: `135306` rows
## Columns
- `id`: chunk identifier
- `text`: cleaned source text chunk
- `ticker`: ticker symbol
- `exchange`: exchange string from source metadata
- `date`: call date string from source metadata
- `section`: `prepared` or `qa`
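
As a minimal sketch of how the column list above could be checked against a loaded row, the helper below verifies that the six documented columns are present and that `section` takes one of its two documented values. The function name `validate_row` and the sample row are hypothetical and used only for illustration; they are not part of the dataset or its tooling.

```python
# Column names and section values are taken from the card above.
EXPECTED_COLUMNS = ("id", "text", "ticker", "exchange", "date", "section")
VALID_SECTIONS = {"prepared", "qa"}


def validate_row(row: dict) -> list[str]:
    """Return a list of human-readable problems; an empty list means the row looks fine."""
    problems = []
    missing = [col for col in EXPECTED_COLUMNS if col not in row]
    if missing:
        problems.append(f"missing columns: {missing}")
    # Only check the section value when the column is actually present.
    if "section" in row and row["section"] not in VALID_SECTIONS:
        problems.append(f"unexpected section value: {row['section']!r}")
    return problems


# Hypothetical row, invented for illustration (not real dataset content).
sample_row = {
    "id": "example-0001",
    "text": "Thank you for joining us today to discuss our quarterly results.",
    "ticker": "XYZ",
    "exchange": "NASDAQ",
    "date": "2020-01-01",
    "section": "prepared",
}
print(validate_row(sample_row))
```

Because the repository is private, rows would in practice come from an authenticated download (for example via the `datasets` library with an access token); that step is omitted here so the sketch stays self-contained.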
## Processing Summary
The uploaded file is the final output of the following steps:
1. chunk construction for context-preserving mono data
2. normalized deduplication
3. boilerplate removal (operator/opening control text)
4. removal of rows containing the word `operator`
5. minimum length filter (`word_count >= 20`)
Detailed processing metadata is in `processing_manifest.json`.
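
The filtering steps listed above (normalized deduplication, the `operator` row filter, and the minimum length filter) can be sketched roughly as follows. This is an illustration under stated assumptions, not the script that produced the dataset: the normalization rule (NFKC, lowercasing, whitespace collapsing), the case-insensitive whole-word match on `operator`, and whitespace-based word counting are all guesses, and the names `normalize` and `filter_chunks` are hypothetical. Chunk construction and boilerplate removal are not shown because the card does not describe their rules.

```python
import re
import unicodedata

MIN_WORDS = 20  # from the card: word_count >= 20

# Assumption: "the word operator" means a case-insensitive whole-word match.
OPERATOR_RE = re.compile(r"\boperator\b", re.IGNORECASE)


def normalize(text: str) -> str:
    """Hypothetical dedup key: Unicode NFKC, lowercase, collapsed whitespace."""
    text = unicodedata.normalize("NFKC", text).lower()
    return re.sub(r"\s+", " ", text).strip()


def filter_chunks(chunks):
    """Keep the first copy of each normalized chunk that passes both filters."""
    seen = set()
    kept = []
    for chunk in chunks:
        key = normalize(chunk)
        if key in seen:
            continue  # normalized duplicate of an earlier chunk
        seen.add(key)
        if OPERATOR_RE.search(chunk):
            continue  # row contains the word "operator"
        if len(chunk.split()) < MIN_WORDS:
            continue  # below the minimum length (assumed whitespace word count)
        kept.append(chunk)
    return kept
```

The authoritative parameters for each step are those recorded in `processing_manifest.json`; the values above only mirror what this card states.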
## Privacy / Access
This repository is intentionally **private**.
## Important Note
The upstream Kaggle dataset lists its license as unknown. Keep this repository private unless licensing is clarified.
Provided by:
alwaysgood



