Call for Shared Task Participation

Reconstructing the Reasoning in United Nations Resolutions

This shared task focuses on understanding argumentative structure in highly formal, legal-political UN resolutions. Participants are expected to build LLM-based systems to (1) identify and classify argumentative paragraphs in preambles and operative sections, and (2) predict argumentative relations between paragraphs.

Data: United Nations Resolutions
Language: French (with English translations)
Granularity: paragraph-level
Number of subtasks: 2
Metric: F1 and LLM-as-a-Judge
Models: open-weight LLMs (≤ 8B)
Project: DICED

Leaderboard

Team | F1 Score | LLM-Judge | Final
🥇 LLM-Instruct | 1 | 5 | 1
🥈 Prompteam | 5 | 1 | 2
🥉 Argchestrators | 2 | 6 | 3
🥉 HybridArguer | 4 | 3 | 3
POINTERS | 3 | 9 | 5
ResolveNow | 9 | 2 | 7
TypeCoT | 6 | 8 | 8
Ockham | 8 | 7 | 9

* Top 3 teams will receive certificates and awards at the ArgMining 2026 Workshop in San Diego, USA.

Overview

United Nations resolutions encode collective reasoning at scale: negotiated positions, implicit premises, and carefully structured conclusions. This shared task evaluates how well modern systems can recover these underlying argumentative structures from text.

Tasks

The shared task consists of two subtasks aligned with the workshop theme “Understanding and evaluating arguments in both human and machine reasoning.”

Subtask 1: Argumentative Paragraph Classification

For each paragraph, predict (a) whether it is preambular or operative, and (b) which subset of 141 predefined tags applies to it. Part (b) is a multi-label classification problem.
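To illustrate the multi-label setup, the tags of each paragraph can be treated as a set and compared with the gold set. The sketch below computes a micro-averaged F1 over such sets. The tag names are hypothetical, and micro-averaging is an assumption made for illustration; it is not necessarily the official scoring procedure.

```python
def micro_f1(gold, pred):
    """Micro-averaged F1 over per-paragraph tag sets.

    gold, pred: lists of sets of tag strings, aligned by paragraph.
    """
    tp = fp = fn = 0
    for g, p in zip(gold, pred):
        tp += len(g & p)   # tags predicted and present in gold
        fp += len(p - g)   # tags predicted but absent from gold
        fn += len(g - p)   # gold tags that were missed
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical tag names; the real inventory has 141 predefined tags.
gold_tags = [{"T_RIGHTS", "T_FUNDING"}, {"T_ACCESS"}]
pred_tags = [{"T_RIGHTS"}, {"T_ACCESS", "T_FUNDING"}]
score = micro_f1(gold_tags, pred_tags)
```

Here two tags are correct, one is spurious, and one is missed, so precision and recall are both 2/3.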

Subtask 2: Argumentative Relation Prediction

Given a paragraph, predict which other paragraphs it is related to (indices), and label each link with one or more relation types: contradictive, supporting, complemental, modifying.
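One possible in-memory representation of these predictions is a mapping from target paragraph index to a set of relation types. The sketch below checks such a mapping before submission. The function name, the dictionary layout, and the 0-based indexing are assumptions for illustration; the required output format is defined by the released JSON files.

```python
RELATION_TYPES = {"contradictive", "supporting", "complemental", "modifying"}


def validate_links(source_idx, links, n_paragraphs):
    """Return a list of problems found in the predicted links of one paragraph.

    links: dict mapping a target paragraph index to a set of relation types.
    Indices are assumed to be 0-based within one resolution.
    """
    problems = []
    for target, labels in links.items():
        if not 0 <= target < n_paragraphs:
            problems.append(f"target {target} out of range")
        if target == source_idx:
            problems.append(f"target {target} is the paragraph itself")
        if not labels:
            problems.append(f"target {target} has no relation type")
        unknown = set(labels) - RELATION_TYPES
        if unknown:
            problems.append(f"target {target} has unknown types {sorted(unknown)}")
    return problems


# A well-formed prediction for paragraph 2 of a 5-paragraph resolution.
ok = validate_links(2, {0: {"supporting"}, 1: {"modifying", "complemental"}}, 5)

# A malformed prediction: a self-link and an out-of-range, mislabeled link.
bad = validate_links(2, {2: {"supporting"}, 7: {"agrees"}}, 5)
```

A check of this kind catches malformed LLM output, such as invented relation labels or hallucinated indices, before the predictions are scored.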

Data

We provide a training set and a held-out test set, both in JSON format to enable easy processing and reproducible development. We encourage participants to explore the data and design their systems accordingly. To make the task more accessible to non-French speakers, we provide English translations for the dataset.
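As a rough sketch of how the French text and its English translation might be handled together, the snippet below parses a small inline JSON sample and selects a preferred language with a French fallback. The record layout and the field names (`id`, `text_fr`, `text_en`) are hypothetical and used only for illustration; the actual schema should be taken from the released files.

```python
import json

# Hypothetical record layout; field names are assumptions, not the released schema.
raw = """
[
  {"id": 0, "text_fr": "Rappelant la Charte des Nations Unies,",
   "text_en": "Recalling the Charter of the United Nations,"},
  {"id": 1, "text_fr": "Encourage la formation des enseignants;",
   "text_en": null}
]
"""


def paragraph_text(record, prefer="en"):
    """Return the preferred-language text, falling back to the French original."""
    text = record.get(f"text_{prefer}")
    return text if text else record["text_fr"]


paragraphs = json.loads(raw)
texts = [paragraph_text(p) for p in paragraphs]
```

Falling back to French keeps every paragraph usable even if a translation is missing or empty.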

The shared task draws on DICED, a structured resource for exploring intergovernmental cooperation in education. The broader DICED curation work currently covers the following institutional sources.

Institutional source | Years | Status
UNESCO International Bureau of Education: Resolutions and Recommendations of the International Conference on Education | 1934-2008 | 100% completed
Council of Europe Standing Conference of Education Ministers | 1959-2023 | 100% completed
United Nations General Assembly Resolutions on Education | 1945-2025 | 80% completed
United Nations Commission on Human Rights and United Nations Human Rights Council Resolutions addressing Education | 1946-2025 | 60% completed
UNESCO Legal Instruments Pertaining to Education | 1946-2025 | 75% completed
UNESCO General Conference resolutions pertaining to Education | 1946-2025 | 20% completed
European Council Conclusions addressing Education | 2009-2025 | Planned
European Council Recommendations addressing Education | 2009-2025 | Planned
European Parliament Resolutions addressing Education | 1958-2025 | Planned
OECD Council Recommendations addressing Education | 1961-2025 | Planned

University of Zurich, Department of Computational Linguistics
Train and test set: available for download on Hugging Face

Licensing note: training data follow a restricted UN license; by participating, teams agree not to redistribute the training data publicly.

Evaluation

Systems are evaluated using a combination of an automated metric (F1) and empirical auditing with an LLM-as-a-Judge.

Final ranking is based on the average of both metrics. We will update the leaderboard live during the evaluation phase.
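The averaging step can be sketched as follows. The team names and scores are hypothetical, and equal weighting of two scores on a common 0-1 scale is an assumption; the organizers' exact normalization is not specified here. Tied teams share a rank and the next rank is skipped, which matches the shared third place on the leaderboard.

```python
def final_ranking(scores):
    """Rank teams by the mean of their two metric scores (higher is better).

    scores: dict mapping team name to (f1, judge), both assumed to be on a 0-1 scale.
    Tied teams share a rank and the following rank is skipped (1, 2, 2, 4).
    """
    averaged = {team: (f1 + judge) / 2 for team, (f1, judge) in scores.items()}
    ordered = sorted(averaged.items(), key=lambda item: item[1], reverse=True)
    ranking = []
    for position, (team, avg) in enumerate(ordered, start=1):
        if ranking and avg == ranking[-1][2]:
            rank = ranking[-1][0]   # same average as the previous team: share its rank
        else:
            rank = position
        ranking.append((rank, team, avg))
    return ranking


# Hypothetical scores, for illustration only.
ranking = final_ranking({
    "Team A": (0.75, 0.5),
    "Team B": (0.5, 0.5),
    "Team C": (0.25, 0.75),
    "Team D": (0.25, 0.25),
})
```

In this example Team B and Team C both average 0.5, so they share rank 2 and Team D is placed at rank 4.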

Submission

Participants submit predictions for the test set in the required JSON format.

Submission package

Compress your filled-out JSON test set and system paper into a single ZIP file for upload.
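A minimal sketch of the packaging step using Python's standard `zipfile` module is shown below. The function name and the file names in the usage comment are placeholders, not names required by the organizers.

```python
import zipfile
from pathlib import Path


def build_submission(json_path, paper_path, zip_path):
    """Bundle the predictions file and the system paper into one ZIP archive."""
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for path in (Path(json_path), Path(paper_path)):
            # arcname drops the directory part so both files sit at the archive root
            archive.write(path, arcname=path.name)
    return zip_path


# Example call with placeholder file names:
# build_submission("test_predictions.json", "system_paper.pdf", "submission.zip")
```

Storing both files at the archive root avoids nested folders that could complicate automated unpacking.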

Allowed techniques are flexible (e.g., in-context learning, retrieval-augmented generation), but only open-weight LLMs with at most 8B parameters may be used. Please also include a team name in your system paper for the leaderboard announcement.

Important dates

All deadlines are 11:59 PM UTC-12:00 (“anywhere on Earth”).

1 February 2026: Train and test data release
18 March 2026: Evaluation and submission starts
1 April 2026: Submission ends
15 April 2026: Evaluation ends; results notification
24 April 2026: Paper submission due
1 May 2026: Reviews to authors
12 May 2026: Camera-ready version due
July 2026: ArgMining 2026 Workshop

Organizers

University of Zurich, Zurich, Switzerland.

FAQ