five

harvardairobotics/FairVLMed

收藏
Hugging Face2026-04-08 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/harvardairobotics/FairVLMed
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-nc-nd-4.0 task_categories: - image-classification - image-to-text modality: - image - text language: - en tags: - medical - ophthalmology - fairness - vision-language - glaucoma - fundus - clinical-notes pretty_name: Harvard-FairVLMed size_categories: - 10K<n<100K --- # Dataset Card: Harvard-FairVLMed ## Dataset Summary Harvard-FairVLMed is the **first fair vision-language medical dataset** designed for studying fairness in medical vision-language (VL) foundation models. It contains 10,000 SLO fundus images paired with de-identified clinical notes and comprehensive demographic annotations, enabling in-depth fairness analysis across four protected attributes: race, gender, ethnicity, and preferred language. This dataset was introduced at CVPR 2024: [FairCLIP: Harnessing Fairness in Vision-Language Learning](https://openaccess.thecvf.com/content/CVPR2024/papers/Luo_FairCLIP_Harnessing_Fairness_in_Vision-Language_Learning_CVPR_2024_paper.pdf). ## Dataset Details ### Dataset Description | Field | Value | |-----------------|-------| | **Institution** | Department of Ophthalmology, Harvard Medical School | | **Task** | Glaucoma classification (vision-language) | | **Modality** | Scanning Laser Ophthalmoscopy (SLO) fundus images + clinical notes | | **Scale** | 10,000 patients, 10,000 samples | | **Splits** | 7,000 train / 1,000 validation / 2,000 test | | **License** | [CC BY-NC-ND 4.0](https://creativecommons.org/licenses/by-nc-nd/4.0/) | - **Curated by:** Yan Luo, Min Shi, Muhammad Osama Khan, Muhammad Muneeb Afzal, Hao Huang, Shuaihang Yuan, Yu Tian, Luo Song, Ava Kouhana, Tobias Elze, Yi Fang, Mengyu Wang - **License:** [CC BY-NC-ND 4.0](https://creativecommons.org/licenses/by-nc-nd/4.0/) — non-commercial research only - **Paper:** [CVPR 2024](https://openaccess.thecvf.com/content/CVPR2024/papers/Luo_FairCLIP_Harnessing_Fairness_in_Vision-Language_Learning_CVPR_2024_paper.pdf) - **Contact:** harvardophai@gmail.com, harvardairobotics@gmail.com ### Data Fields Each subject is stored as a `.npz` file (`data_xxxxx.npz`) containing: | Field | Description | |-----------------|-------------| | `slo_fundus` | Scanning Laser Ophthalmoscopy (SLO) fundus image | | `md` | Visual field mean deviation | | `tds` | 52 visual field total deviation values | | `age` | Patient age | | `gender` | `0` = Female, `1` = Male | | `race` | `0` = Asian, `1` = Black, `2` = White | | `ethnicity` | `0` = Non-Hispanic, `1` = Hispanic, `-1` = Unknown | | `language` | `0` = English, `1` = Spanish, `2` = Other, `-1` = Unknown | | `maritalstatus` | `0` = Married/Partnered, `1` = Single, `2` = Divorced, `3` = Widowed, `4` = Legally Separated, `-1` = Unknown | | `glaucoma` | `0` = Non-Glaucoma, `1` = Glaucoma | | `note` | Original de-identified clinical note | | `note_extra` | De-identified clinical note with demographic attributes prepended | A metadata overview is provided in `data_summary.csv`. GPT-4-summarized versions of all notes are available in `gpt-4_summarized_notes.csv`. ### Demographics 6 demographic attributes are annotated per patient: age, gender, race, ethnicity, preferred language, and marital status. ## Uses ### Direct Use - Fairness benchmarking for medical vision-language foundation models (e.g., CLIP, BLIP-2) - Analysis of demographic disparities in ophthalmic AI across race, gender, ethnicity, and language - Development and evaluation of bias-mitigation methods for VL models in healthcare - Medical note summarization and multimodal learning research ### Out-of-Scope Use Clinical decisions, patient care, or any commercial application. This dataset shall not be used for clinical decisions at any time. ## Access The "Harvard" designation indicates the dataset originates from the Department of Ophthalmology at Harvard Medical School. It does not imply endorsement, sponsorship, or legal responsibility by Harvard University or Harvard Medical School. ## Citation **BibTeX:** ```bibtex @inproceedings{luo2024fairclip, title={Fairclip: Harnessing fairness in vision-language learning}, author={Luo, Yan and Shi, Min and Khan, Muhammad Osama and Afzal, Muhammad Muneeb and Huang, Hao and Yuan, Shuaihang and Tian, Yu and Song, Luo and Kouhana, Ava and Elze, Tobias and others}, booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition}, pages={12289--12301}, year={2024} } ``` **APA:** Luo, Y., Shi, M., Khan, M. O., Afzal, M. M., Huang, H., Yuan, S., Tian, Y., Song, L., Kouhana, A., Elze, T., Fang, Y., & Wang, M. (2024). FairCLIP: Harnessing Fairness in Vision-Language Learning. *Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)*, 12289–12301.
提供机构:
harvardairobotics
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作