Topical PISA Quiz Task — Generating Test Items

A task of the 2026 ELOQUENT lab on evaluating quality of generative language models

Contact: eloquent-clef2026-organizers@googlegroups.com

Task overview

This task focuses on automatic test items generated from a given document, targeting students aged 10 to 15. The objective is to generate assessment items in the form of Question–Answer pairs based on a provided text (the “stimulus”).

We provide participants with a ready-to-use question–answer generation prompt as a baseline, along with five automatically generated QA test items and one gold test item, all derived from the same source stimulus. The scope of participation is intentionally open and flexible. Participants may choose to:

For this edition, English is the selected language.

Quick Start

How to participate, in more detail

Example item

Some example items can be found here

Submission instructions

Data

PISA, public items, link to repo here

Quality Criteria

The following quality criteria will in various ways be taken into account.

Scoring

The scoring of submissions will be made using expertise from human editors who have worked with putting together previous PISA editions.

Timeline

Organisers

Contact address for questions or suggestions: eloquent-clef2026-organizers@googlegroups.com

Bibliography

Some relevant previous work – feel free to suggest items for this list e.g. by a pull request!