
Tense-Annotation

Indexed by NIAID Data Ecosystem on 2026-03-12
Download link:
https://zenodo.org/records/4068283
Description:
This dataset contains parallel English and French texts from the Europarl corpus (Koehn, 2005). The files provide alignments of EN and FR verbs, along with their position, tense and voice. They can therefore be used in translation studies for these languages and/or for training translation systems that can make use of the labels in this resource. Although the resource was created semi-automatically, the verb alignment and inferred tenses are of high precision, especially in the second file contained in the package:

Tense-Annotation-full.txt : complete alignment.
Tense-Annotation-gold.txt : alignments only for cases where both an EN and an FR tense were inferred from the verbs.

The format in the two files is the following:

EN sentence
FR sentence
Position_in_EN \tab EN_verb1 \tab EN_tense \tab EN_voice \tab FR_verb \tab FR_tense \tab FR_voice
Position_in_EN \tab EN_verb2 \tab EN_tense \tab EN_voice \tab FR_verb \tab FR_tense \tab FR_voice
EN sentence
FR sentence
Position_in_EN \tab EN_verb1 \tab EN_tense \tab EN_voice \tab FR_verb \tab FR_tense \tab FR_voice
...
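The layout described above can be read with a small state machine. The following is a minimal sketch, not the authors' tooling. It assumes that `\tab` stands for a literal tab character, that sentence lines never contain exactly seven tab-separated fields, and that blank lines (if any) can be ignored. The names `VerbAlignment`, `SentencePair` and `parse_tense_annotation` are hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass, field
from typing import Iterable, Iterator, List, Optional


@dataclass
class VerbAlignment:
    # Field order follows the seven columns listed in the description.
    position_in_en: str
    en_verb: str
    en_tense: str
    en_voice: str
    fr_verb: str
    fr_tense: str
    fr_voice: str


@dataclass
class SentencePair:
    en: str
    fr: str = ""
    alignments: List[VerbAlignment] = field(default_factory=list)


def parse_tense_annotation(lines: Iterable[str]) -> Iterator[SentencePair]:
    """Yield one SentencePair per EN/FR block (assumed layout, see lead-in)."""
    current: Optional[SentencePair] = None
    have_fr = False
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line.strip():
            continue  # assumption: blank lines carry no information
        fields = [f.strip() for f in line.split("\t")]
        if len(fields) == 7 and current is not None and have_fr:
            # Seven tab-separated fields after both sentences: an alignment row.
            current.alignments.append(VerbAlignment(*fields))
        elif current is not None and not have_fr:
            # The line right after the EN sentence is the FR sentence.
            current.fr = line.strip()
            have_fr = True
        else:
            # Any other line starts a new block with its EN sentence.
            if current is not None:
                yield current
            current = SentencePair(en=line.strip())
            have_fr = False
    if current is not None:
        yield current
```

To process one of the files, the function could be called as `parse_tense_annotation(open("Tense-Annotation-gold.txt", encoding="utf-8"))`; the UTF-8 encoding is also an assumption and may need adjusting.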
The following is an explanation of the labels used:

EN_tense:
past_perf_cont = Past Perfect Continuous
past_perf = Past Perfect
past_cont = Simple Past Continuous
sim_past = Simple Past
pres_perf_cont = Present Perfect Continuous
pres_perf = Present Perfect
pres_cont = Present Continuous
pres = Present
fut_perf_cont = Future Perfect Continuous
fut_perf = Future Perfect
fut_cont = Future Continuous
fut = Future
cond_perf_cont = conditional verb group in continuous past tense
cond_perf = conditional verb group in past tense
cond_cont = conditional verb group in continuous present tense
cond = conditional verb group in present tense
infinitif = base verb form
no_tag = tense not found

EN_voice: active, passive, unknown

FR_tense:
pres = présent
passe_comp = passé composé
imparfait = imparfait
plus_que_parf = plus-que-parfait
passe_sim = passé simple
passe_rec = passé récent
passe_ant = passé antérieur
imperatif = impératif
subjonctif = subjonctif
conditionnel = conditionnel
futur_proche = futur proche
futur = futur
futur_ant = futur antérieur
no_tag = tense not found

FR_voice: active, passive, unknown

@ = unaligned words
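As an illustration of how these labels might be used, the sketch below counts how often each EN tense co-occurs with each FR tense, skipping rows where either side is `no_tag`. The label sets are copied from the list above; the function name `count_tense_pairs` and its input shape (an iterable of `(EN_tense, FR_tense)` tuples) are assumptions made for this example. Dropping `no_tag` rows is only an approximation of the criterion described for Tense-Annotation-gold.txt, not a reproduction of how that file was built.

```python
from collections import Counter
from typing import Iterable, Tuple

# Label inventories as listed in the dataset description.
EN_TENSES = {
    "past_perf_cont", "past_perf", "past_cont", "sim_past",
    "pres_perf_cont", "pres_perf", "pres_cont", "pres",
    "fut_perf_cont", "fut_perf", "fut_cont", "fut",
    "cond_perf_cont", "cond_perf", "cond_cont", "cond",
    "infinitif", "no_tag",
}
FR_TENSES = {
    "pres", "passe_comp", "imparfait", "plus_que_parf", "passe_sim",
    "passe_rec", "passe_ant", "imperatif", "subjonctif", "conditionnel",
    "futur_proche", "futur", "futur_ant", "no_tag",
}
NO_TAG = "no_tag"


def count_tense_pairs(pairs: Iterable[Tuple[str, str]]) -> Counter:
    """Count (EN_tense, FR_tense) co-occurrences where both tenses are known."""
    counts: Counter = Counter()
    for en_tense, fr_tense in pairs:
        if en_tense not in EN_TENSES or fr_tense not in FR_TENSES:
            continue  # label outside the documented inventory: skip it
        if en_tense == NO_TAG or fr_tense == NO_TAG:
            continue  # tense not found on at least one side
        counts[(en_tense, fr_tense)] += 1
    return counts
```

Calling `count_tense_pairs(...).most_common(10)` on the tense columns of a file would then list the ten most frequent EN/FR tense correspondences.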
Created:
2020-10-08