This task is intended to address the general question “Can text authored by generative language models be distinguished from text written by human authors?”

Motivation

Recent advances in generative language models have made it possible to automatically generate content for websites, news articles, social media, etc. The EU has recently suggested to technology companies that labelling AI-generated content as such could be a useful tool to combat misinformation and to protect consumer rights.

Goals of the task

This task will explore whether automatically generated text can be distinguished from human-authored text. With the increasing quality of generative AI, detecting automatically generated text is becoming quite similar to human authorship verification, and the task is therefore organised in collaboration with the PAN lab, which has years of experience from previous shared tasks on authorship verification and closely related tasks. The task will also investigate whether models can be self-assessed reliably with minimal human effort.

Procedure

  1. Organisers pick a number of human-authored texts of about 500 words in genres such as:
     ○ Newswire
     ○ Wikipedia intro texts
     ○ Fan fiction
     ○ Biographies
     ○ Weather and stock market reports; sports results
     ○ Podcast transcripts
  2. Descriptions of those texts are generated automatically:
     ○ in the form of bullet points
     ○ capturing the genre and some of the stylistic characteristics of the original
  3. Ask participants to use their systems to generate a text from each of those descriptions (a minimal sketch of this step follows the list)
  4. Pass the resulting sets to the PAN detector builders to see whether:
     ○ human texts can be distinguished from generated ones
     ○ system characteristics can be tracked across texts, i.e. whether the output of a system is similar enough across texts and genres to be identifiable
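
As a rough illustration of step 3, the sketch below shows how a participant might generate a text from a constructed prompt. The use of the Hugging Face transformers library, the placeholder model name, and the generation settings are all assumptions for illustration, not part of the task specification.

# Minimal sketch of step 3 (generation), assuming the Hugging Face
# transformers library; the model name and settings are placeholders.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")  # placeholder model

def generate_text(full_prompt: str) -> str:
    # Aim for roughly 500 words of output; the token budget and sampling
    # strategy are entirely up to the participant.
    result = generator(full_prompt, max_new_tokens=700, do_sample=True)
    return result[0]["generated_text"]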

Data

The data are distributed in a format which resembles standard benchmark tests. The test collection has a suggested prompt string and a list of items, each with a Content field and optional Genre and Style field. These can be used together with the suggested prompt string to generate a text. Participants may change the suggested prompt string to something more suitable, but this must be reported upon submission and in the written report of the experiment.

Test data have not yet been released for the 2025 experiment. They will be in the same format and style as the 2024 data. We expect to release the data in the first week of April 2025.
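
A minimal sketch of reading the collection and turning its items into prompts is given below; the file name is illustrative, and the field names ("prompt", "topics", "id", "Content", "Genre and Style") are taken from the sample at the end of this page.

import json

# Load the topic collection; the file name is illustrative.
with open("voight-kampff-test-topics.json") as f:
    collection = json.load(f)["voight-kampfftesttopics"]

def build_prompt(topic, base_prompt=collection["prompt"]):
    # Combine the suggested prompt string with the topic's description items.
    parts = [base_prompt]
    if "Genre and Style" in topic:
        parts.append(f"Genre and style: {topic['Genre and Style']}")
    parts.extend(f"- {item}" for item in topic["Content"])
    return "\n".join(parts)

prompts = {topic["id"]: build_prompt(topic) for topic in collection["topics"]}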

Task 4 sample and 2024 test topics on Huggingface

Submission format

The submission should be a plain zip file containing a directory named after the team, with the generated texts as plain-text files:

OurTeamName/030.txt … OurTeamName/059.txt
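
A minimal sketch of packaging such a submission is shown below; the team name is a placeholder, and the actual file ids come from the released test topics.

import zipfile
from pathlib import Path

# Package the generated texts as TeamName/NNN.txt entries inside a zip.
team = "OurTeamName"  # placeholder
with zipfile.ZipFile(f"{team}.zip", "w") as zf:
    for txt in sorted(Path(team).glob("*.txt")):
        zf.write(txt, arcname=f"{team}/{txt.name}")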

Result scoring

System outputs are scored by how often they fool a classifier into believing the output was human-authored.
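
In other words, the headline number is the fraction of a team's texts that the classifier labels as human-authored. A minimal sketch of that computation follows; the label strings are assumptions, not prescribed by the task.

# Fraction of generated texts that the classifier judged to be human-authored;
# the label strings "human"/"machine" are illustrative.
def fooling_rate(classifier_labels):
    return sum(label == "human" for label in classifier_labels) / len(classifier_labels)

# e.g. fooling_rate(["human", "machine", "human", "human"]) == 0.75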

Sample

{
    "voight-kampfftesttopics": {
        "language": "en",
        "date": "2024",
        "type": "example",
        "source": "eloquent organisers",
        "prompt": "Write a text of about 500 words which covers the following items: ",
        "topics": [
{"id": "001", 
 "Genre and Style": "Encyclopedia",
 "Content": ["Uralic languages descended from Proto-Uralic language from 7,000 to 10,000 years ago.",
	"Uralic languages spoken by 25 million people in northeastern Europe, northern Asia, and North America.",
	"Hungarian, Estonian, and Finnish are the most important Uralic languages.",
	"Attempts to trace genealogy of Uralic languages to earlier periods have been hampered by lack of evidence.",
	"Uralic and Indo-European languages are not thought to be related, but speculation exists.",
	"Uralic languages consist of two groups: Finno-Ugric and Samoyedic.",
	"Finno-Ugric and Samoyedic have given rise to divergent subgroups of languages.",
	"Degree of similarity in Finno-Ugric languages is comparable to that between English and Russian.",
	"Finnish and Estonian, closely related members of Finno-Ugric, differ similarly to diverse dialects of the same language."]},
 ...
 ]
 }
 }