five

A benchmark for identifying chemical structures from images and generating correct IUPAC names with explicit evaluation rules.

收藏
Zenodo2026-04-06 更新2026-05-26 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.19393819
下载链接
链接失效反馈
官方服务:
资源简介:
The IUPAC Multimodal Naming Benchmark is a chemistry-focused benchmark for evaluating whether models can infer the correct chemical name directly from structure images.The task requires multimodal understanding of molecular diagrams and application of chemical naming rules to produce the correct IUPAC name. This benchmark is designed for structured evaluation and includes explicit handling rules for stereochemistry, synonyms, aromatic compounds, tautomeric forms, and ionic compounds. Task Given an image of a chemical structure, the goal is to predict the correct chemical name.This benchmark is intended for: multimodal LLM evaluation chemistry reasoning benchmarks image-to-text scientific understanding exact-match and structured evaluation pipelines Evaluation Policy Each example contains: a canonical expected name optional accepted synonyms a stereochemistry requirement flag metadata notes for special cases Naming rules Stereochemistry is required when explicitly shown in the image. Common names are not accepted unless explicitly listed in the accepted synonym list. Tautomeric alternatives are not accepted unless they match the depicted structure or are explicitly allowed. Salts and ionic compounds must be named as depicted, including relevant counterions. Aromatic compounds follow a predefined canonical naming policy, with optional curated aliases. Dataset Fields Typical fields include: id image_path expected_name accepted_names stereochemistry_required common_names_accepted notes Example Use Cases evaluate multimodal chemistry models compare exact-match naming performance test structured reasoning pipelines benchmark scientific image understanding systems No
提供机构:
Zenodo
创建时间:
2026-04-06
二维码
社区交流群
二维码
科研交流群
商业服务