five

[SAMPLE] Nexdata | Lip Multimodal Data | 2,000 ID |Lip Sync Data |Audio Image AI Training Data | ...

收藏
Databricks2024-05-31 收录
下载链接:
https://marketplace.databricks.com/details/0e7b69f9-f7c9-4852-a414-1261ce6e9097/Nexdata_SAMPLE-Nexdata-Lip-Multimodal-Data-2,000 ID-Lip-Sync-Data-Audio-Image-AI-Training-Data-
下载链接
链接失效反馈
官方服务:
资源简介:
1. Specifications Data size : 2,000 id, each person collects the audio and video data from 13 different angles +1 txt document People distribution : race distribution: Asian, Caucasian, Black, Brown, gender distribution: gender balance, age distribution: people aged 18-60 Collecting environment : indoor natural light scenes, indoor fluorescent lamp scenes Annotated Imagery Data diversity : including multiple scenes, different ages, different shooting angles Device : cellphone, the resolution is 1,920*1,080 Collecting angle : audio and video data of front face, 3 angles left side face, 3 angles right side face, looking down, looking up, left side face down, right side face down, left side face up and right side face up all 13 different angles were collected at the same time Recording content : general field, unlimited content Language : 10 languages, each video is more than 20 seconds Data format : the video data format is .mp4, the audio is greater than or equal to 16KHz, 16bit, the frame rate is 25-30 fps Accuracy rata : the accuracy rate of sentence is more than 95% 2. About Nexdata Nexdata owns off-the-shelf 200,000 hours of speech recognition data, 800TB of Annotated Imagery Data, about 2 billion pieces of Natural Language Processing (NLP) Data. These ready-to-go Annotated Imagery Data support instant delivery, quickly improve the accuracy of AI models. For more details, please visit us at https://www.nexdata.ai/datasets/computervision?source=Datarade
提供机构:
Nexdata
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作