
Romanised Arabic Chat Data

Indexed from NIAID Data Ecosystem on 2026-03-12
Download link:
https://zenodo.org/record/4448379
Description:
Chat room conversations in Romanised Arabic (44.83%), including code-switching into English (4.68%) and French (9.44%). The Romanised Arabic form primarily represented the Levantine dialect (Lebanese, Egyptian) among 10 participants. The conversations were recorded under natural observation on 10th June 2015 in intervals of 90 minutes. All instances of data collection occurred between midday and 2pm, Lebanese local time; they are listed in consecutive order. Conversations were collected from the instant messaging website www.icq.com, specifically from the chat room server titled ‘#icq-lebanon’. Under the terms of the website's company policy as of 10th June 2015, no consent was required to use the chat room entries or other information from the online users, who constitute the participants of this study. Any data containing personally identifiable information has been excluded, unless considered relevant for the purpose of the linguistic analysis.
Created:
2021-02-27