Cover Image for AI Reading Club - In-Person Workshop -The Curious Case of Neural Text Degeneration

Presented by

A biweekly AI Reading Club to read and discuss the papers that built modern LLMs. The goal is to understand the actual engineering breakthroughs (tokenization, self-attention, decoding, efficiency,

Hosted By

26 Went

AI

AI Reading Club - In-Person Workshop -The Curious Case of Neural Text Degeneration

Name: AI Reading Club - In-Person Workshop -The Curious Case of Neural Text Degeneration
Start: 2026-06-04T18:00:00.000+02:00
End: 2026-06-04T19:00:00.000+02:00
Location: Paris, France

AI Reading Club

Register to See Address

Paris, France

Past Event

Welcome! To join the event, please register below.

You will be asked to verify token ownership with your wallet.

About Event

Paper: https://arxiv.org/abs/1904.09751

Before ChatGPT felt smooth, before “top-p” became a default setting in every LLM API, researchers faced a strange problem:

the model was powerful — but the text was falling apart.

In 2019, GPT-2 had just shocked the AI community. For the first time, neural language models felt like they could write convincing long-form text. But underneath the excitement, there was a deep technical puzzle:

Why did strong language models still generate text that became repetitive, generic, incoherent, or stuck in loops?

This paper, The Curious Case of Neural Text Degeneration, is one of the papers that helped change the answer.

Its core insight is simple and important:

a good language model is not enough.

How you decode from the model can completely change the quality of the text.

Beam search, which had worked well for tasks like translation, failed badly for open-ended generation. Pure sampling had the opposite problem: it could become weird, unstable, and incoherent. The paper shows that human writing does not simply follow the most likely next word again and again. Good text needs surprise, variation, and control.

That insight led to Nucleus Sampling, also known as top-p sampling — a decoding method that became foundational for modern language generation.

Instead of always choosing from a fixed number of likely tokens, nucleus sampling dynamically selects the smallest set of tokens that contains most of the probability mass, then samples from that “nucleus.” This helps avoid both extremes: repetitive overconfidence and chaotic randomness.

In this session, we will unpack:

why maximizing likelihood can produce worse text
why beam search breaks down in open-ended generation
why pure sampling can become incoherent
what the “unreliable tail” of a language model distribution means
how nucleus sampling works intuitively
why this paper still matters for understanding modern LLM behavior
what changed between the GPT-2 era and today’s chat models

This is not a session about “AI text used to be boring.”

It is about a historic moment in generative AI: when the field realized that generation quality lives not only inside the model, but also in the decoding strategy between the model and the words we see.

If you use LLMs, build with LLMs, prompt LLMs, evaluate LLMs, or just want to understand what happens under the hood when a model writes text, this session is for you.

No need to be an expert.

Come curious. Leave with a much sharper mental model of how language models actually generate.

Location

Please register to see the exact location of this event.