five

alwaysgood/earnings_call_mono

收藏
Hugging Face2026-03-21 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/alwaysgood/earnings_call_mono
下载链接
链接失效反馈
官方服务:
资源简介:
--- pretty_name: Motley Fool Earnings Call Mono (Private) language: - en license: other size_categories: - 100K<n<1M tags: - finance - earnings-call - transcripts - monodata --- # Motley Fool Earnings Call Mono (Private) This private dataset contains cleaned monolingual earnings-call text chunks prepared from: - Kaggle dataset: `tpotterer/motley-fool-scraped-earnings-call-transcripts` ## Split - `train`: `135306` rows ## Columns - `id`: chunk identifier - `text`: cleaned source text chunk - `ticker`: ticker symbol - `exchange`: exchange string from source metadata - `date`: call date string from source metadata - `section`: `prepared` or `qa` ## Processing Summary The uploaded file is the final output after: 1. chunk construction for context-preserving mono data 2. normalized deduplication 3. boilerplate removal (operator/opening control text) 4. removing rows containing the word `operator` 5. minimum length filter (`word_count >= 20`) Detailed processing metadata is in `processing_manifest.json`. ## Privacy / Access This repository is intentionally **private**. ## Important Note The upstream Kaggle dataset lists license as unknown. Keep this repository private unless licensing is clarified.
提供机构:
alwaysgood
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作