owid_charts_en
收藏Our World In Data (OWID) Evaluation Dataset 概述
数据集基本信息
- 来源:Our World In Data (https://ourworldindata.org)
- 样本量:1000个独特图表(从完整数据集中随机抽取的子样本)
- 完整数据集:https://huggingface.co/datasets/jjinaai/owid_charts
- 下载大小:111210969字节
- 数据集大小:111924520字节
数据集结构
- 特征:
query:字符串类型,文章中对图表的引用文本片段image:图像类型,图表图像image_filename:字符串类型,图像文件名text_description:字符串类型,使用EasyOCR从图像中提取的OCR文本
- 拆分:
test:1000个样本
示例数据
json { query: "Unsafe water is one of the worlds largest health and environmental problems, particularly for the poorest people . The Global Burden of Disease is a major global study on the causes and risk factors for death and disease published in the medical journal The Lancet . These estimates of the annual number of deaths attributed to a wide range of risk factors are shown here. Lack of access to safe water sources is a leading risk factor for infectious diseases, including cholera, diarrhea , dysentery, hepatitis A, typhoid, and polio . 1 It also exacerbates malnutrition and, in particular, childhood stunting . The chart shows that it ranks globally as a significant risk factor for death.", image: <PIL.PngImagePlugin.PngImageFile image mode=RGBA size=850x600 at 0x7F97BBB9A620>, image_filename: images/clean_water_number-of-deaths-by-risk-factor_60c21d43.png }
引用信息
bibtex @misc{OurWorldInData, author = {Our World in Data}, title = {Our World in Data}, year = {n.d.}, note = {License: CC BY. Data from Our World in Data is made available under the Creative Commons Attribution License.}, url = {https://ourworldindata.org/}, howpublished = {https://ourworldindata.org/}, note = {Accessed: 2024-12-11} }
免责声明
- 数据集可能包含公开可用的图像或文本数据。
- 所有数据仅供研究和教育用途。
- 如有知识产权或版权问题,请联系 "support-data (at) jina.ai"。
版权信息
- 所有权利归文档原作者所有。




