
RIMES Dataset

Indexed from paperswithcode.com on 2025-03-22
Download link:
https://paperswithcode.com/dataset/rimes
Resource description:
The RIMES database (Reconnaissance et Indexation de données Manuscrites et de fac similÉS / Recognition and Indexing of handwritten documents and faxes) was created to evaluate automatic systems for the recognition and indexing of handwritten letters. Of particular interest are letters sent by postal mail or fax by individuals to companies or administrations. The database was collected by asking volunteers to write handwritten letters in exchange for gift vouchers. Each volunteer was given a fictional identity (of the same sex as their real one) and up to 5 scenarios. Each scenario was chosen from among 9 realistic themes: change of personal information (address, bank account), information request, opening and closing (customer account), modification of contract or order, complaint (bad service quality…), payment difficulties (asking for a delay, tax exemption…), reminder letter, and damage declaration, with further circumstances and a destination (an administration or a service provider such as telephone, power, bank, or insurance). The volunteers composed a letter from these pieces of information using their own words. The layout was free; volunteers were asked only to use white paper and to write legibly in black ink. The collection was a success: more than 1,300 people took part in creating the RIMES database, each writing up to 5 letters. The resulting database contains 12,723 pages corresponding to 5,605 letters of two to three pages each.

Provider:
Papers with Code