five

Financial News dataset for text mining

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/5569112
下载链接
链接失效反馈
官方服务:
资源简介:
please cite this dataset by : Nicolas Turenne, Ziwei Chen, Guitao Fan, Jianlong Li, Yiwen Li, Siyuan Wang, Jiaqi Zhou  (2021) Mining an English-Chinese parallel Corpus of Financial News,  BNU HKBU UIC, technical report   The dataset comes from Financial Times news website (https://www.ft.com/) news are written in both languages Chinese and English. FTIE.zip contains all documents in a file individually FT-en-zh.rar contains all documents in one file Below is a sample document in the dataset defined by these fields and syntax :  id;time;english_title;chinese_title;integer;english_body;chinese_body   1021892;2008-09-10T00:00:00Z;FLAW IN TWIN TOWERS REVEALED;科学家发现纽约双子塔倒塌的根本原因;1;Scientists have discovered the fundamental reason the Twin Towers collapsed on September 11 2001. The steel used in the buildings softened fatally at 500?C – far below its melting point – as a result of a magnetic change in the metal. @ The finding, announced at the BA Festival of Science in Liverpool yesterday, should lead to a new generation of steels capable of retaining strength at much higher temperatures.;科学家发现了纽约世贸双子大厦(Twin Towers)在2001年9月11日倒塌的根本原因。由于磁性变化,大厦使用的钢在500摄氏度——远远低于其熔点——时变软,从而产生致命后果。 @ 这一发现在昨日利物浦举行的BA科学节(BA Festival of Science)上公布。这应会推动能够在更高温度下保持强度的新一代钢铁的问世。   The dataset contains 60,473 bilingual  documents. Time range is from 2007 and 2020.    This dataset has been used for parallel bilingual news mining in Finance domain.
创建时间:
2021-10-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作