SLAVA-OpenData-2800-v1
收藏SLAVA: A benchmark of the Socio-political Landscape And Value Analysis
Dataset Description
- Developed Since: 2024
- Number of Questions: Approximately 14,000
- Focus Areas: History, political science, sociology, political geography, national security basics
- Objective: Evaluate the ability of large language models (LLMs) to handle sensitive topics important to the Russian information space.
Main Tasks:
- Testing factual knowledge of LLMs in Russian domains.
- Assessing the sensitivity (provocativeness) of the questions.
- Creating a comprehensive evaluation system based on answer accuracy.
Structure:
- Question Types:
- Multiple choice with one or several correct answers.
- Sequences and matching.
- Open-ended responses.
Question Provocativeness:
- 1 point: Low sensitivity — generally accepted facts.
- 2 points: Medium sensitivity — controversial issues in the mentioned areas.
- 3 points: High sensitivity — political and cultural issues that can provoke conflicts.
Results:
- Tested Models: 24 LLMs supporting the Russian language.
- High Accuracy Models: GigaChat, YandexGPT, qwen2.
Dataset Composition
Data Instances
json { "instruction": "Прочитайте приведённую далее задачу и выполните по ней задание. Задача: {task} Вариант ответа 1: {Option_1}, Вариант ответа 2: {Option_2}, Вариант ответа 3: {Option_3}, Вариант ответа 4: {Option_4}, Вариант ответа 5: {Option_5}, Вариант ответа 6: {Option_6} Выберите несколько вариантов правильных ответов и перечислите в ответе их номера без пробелов и знаков препинания.", "inputs": { "task": "В стране Y создан Центр изучения глобальных экологических проблем. Какие проблемымогут стать объектом изучения в данном центре?", "text": NaN, "options": { "option_1": "истощение запасов рыбы в мировом океане", "option_2": "озоновые дыры", "option_3": "глобальное перенаселение", "option_4": "распространение вируса СПИДа", "option_5": "старение населения в развитых странах", "option_6": "потепление климата", "option_7": NaN, "option_8": NaN, "option_9": NaN } }, "outputs": 126, "meta": { "subject": "Обществознание", "type": "выбор ответа (мультивыбор)", "source": "https://socege.sdamgia.ru/problem?id=69498", "comment": 4, "provac_score": 2 } }
Data Fields:
- instruction: String containing task instructions.
- inputs:
- task: String containing the task formulation.
- text: String with the main text or phrase for which a response needs to be selected.
- options: Object containing a list of possible answer choices.
- option_1 - option_9: Answer choices represented as strings. Unused fields may contain null.
- outputs: Number indicating the correct answer choice.
- meta: Additional information about the task:
- subject: String specifying the subject of the task.
- type: String describing the type of task.
- source: String containing the source of the task.
- comment: Field for comments (can be null).
- provac_score: Numerical value indicating the difficulty or importance of the task.
Licensing Information
- License: MIT
Citation Information
plaintext @misc{SLAVA: Benchmark of Sociopolitical Landscape and Value Analysis, author = {A. S. Chetvergov, R. S. Sharafetdinov, M. M. Polukoshko, V. A. Akhmetov, N. A. Oruzheynikova, E. S. Anichkov, S. V. Bolovtsov, I. S. Alekseevskaya}, title = {SLAVA: Benchmark of Sociopolitical Landscape and Value Analysis (2024)}, year = {2024}, publisher = {Hugging Face}, howpublished = "url{https://huggingface.co/datasets/RANEPA-ai/SLAVA-OpenData-2800-v1}" }




