AION-Analytics/indian_financial_news_42k
收藏Hugging Face2026-04-05 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/AION-Analytics/indian_financial_news_42k
下载链接
链接失效反馈官方服务:
资源简介:
INDIAN FINANCIAL NEWS DATASET (42K)
42,214 Indian financial news headlines with sentiment labels and taxonomy-enriched annotations.
CONTENTS
Total rows: 42,214
Event-labeled rows: 2,979 (with event_id, macro_signal, sector impacts)
Sentiment-only rows: 39,235 (labeled by AION-Sentiment-IN-v3)
Synthetic data: 200 rows for macro_inr_appreciation (rupee appreciation with negative sentiment)
COLUMNS
headline: News headline text
event_id: Taxonomy event identifier (95 events, empty for sentiment-only rows)
macro_signal: Market impact score (-1 to +1, 0 for sentiment-only rows)
sentiment_label: 0=negative, 1=neutral, 2=positive
confidence: Model confidence
match_score: Taxonomy match score
impact_level: Event impact level
Financial Services, Banks, NBFC, IT, Healthcare, FMCG, etc.: 32 sector impact scores
sentiment_label: Sentiment label (repeated for clarity)
SECTOR COLUMNS (32 total)
Financial Services, Banks, NBFC, IT, Healthcare, FMCG, Automobile and Auto Components,
Metals & Mining, Energy, Oil Gas & Consumable Fuels, Power, Telecommunication,
Consumer Durables, Consumer Services, Construction, Construction Materials, Realty,
Chemicals, Media Entertainment & Publication, Services, Diversified, Textiles,
Forest Materials, Transportation, Materials, Utilities, Aviation, Broking Company,
Cement, Fertilizer, Manufacturing, Capital Goods
SOURCE
Economic Times, Nifty 100 constituents, baptle dataset, synthetic augmentation.
Date range: 2024-2026
LICENSE
Apache License 2.0
印度金融新闻数据集(42K)
包含42214条印度金融新闻标题,附带情感标签与富分类学注释。
## 内容说明
总条目数:42214
带事件标注条目:2979条(包含事件ID、宏观信号、行业影响字段)
仅情感标注条目:39235条(由AION-Sentiment-IN-v3模型标注)
合成数据:200条,针对宏观主题卢比升值(附带负面情感标签)
## 字段列表
headline:新闻标题文本
event_id:分类学事件标识符(共95类事件,仅情感标注条目此字段为空)
macro_signal:市场影响得分(取值范围为-1至+1,仅情感标注条目此字段为0)
sentiment_label:情感标签,0=负面,1=中性,2=正面
confidence:模型置信度
match_score:分类学匹配得分
impact_level:事件影响等级
金融服务、银行、非银行金融公司(NBFC)、信息技术(IT)、医疗保健、快速消费品(FMCG)等:共32个行业影响得分
sentiment_label:情感标签(为便于清晰展示重复列出)
## 行业字段(共32个)
金融服务、银行、非银行金融公司(NBFC)、信息技术(IT)、医疗保健、快速消费品(FMCG)、汽车及汽车零部件、金属与采矿、能源、石油天然气及消费燃料、电力、电信、耐用消费品、消费者服务、建筑业、建筑材料、房地产、化工、媒体娱乐与出版、服务业、多元化行业、纺织业、林业材料、交通运输、原材料、公用事业、航空业、经纪公司、水泥、化肥、制造业、资本货物
## 数据来源
经济时报、印度Nifty 100指数成分股、baptle数据集、合成增强数据
时间范围:2024年-2026年
## 授权协议
Apache许可证2.0
提供机构:
AION-Analytics



