Cover Image for AI Reading Club - In-Person Workshop -The Curious Case of Neural Text Degeneration
Cover Image for AI Reading Club - In-Person Workshop -The Curious Case of Neural Text Degeneration
Avatar for AI Reading Club
Presented by
AI Reading Club
Hosted By
19 Going

AI Reading Club - In-Person Workshop -The Curious Case of Neural Text Degeneration

Register to See Address
Paris, France
Registration
Approval Required
Your registration is subject to host approval.
Welcome! To join the event, please register below.
About Event

Paper: https://arxiv.org/abs/1904.09751

Before ChatGPT felt smooth, before “top-p” became a default setting in every LLM API, researchers faced a strange problem:

the model was powerful — but the text was falling apart.

In 2019, GPT-2 had just shocked the AI community. For the first time, neural language models felt like they could write convincing long-form text. But underneath the excitement, there was a deep technical puzzle:

Why did strong language models still generate text that became repetitive, generic, incoherent, or stuck in loops?

This paper, The Curious Case of Neural Text Degeneration, is one of the papers that helped change the answer.

Its core insight is simple and important:

a good language model is not enough.

How you decode from the model can completely change the quality of the text.

Beam search, which had worked well for tasks like translation, failed badly for open-ended generation. Pure sampling had the opposite problem: it could become weird, unstable, and incoherent. The paper shows that human writing does not simply follow the most likely next word again and again. Good text needs surprise, variation, and control.

That insight led to Nucleus Sampling, also known as top-p sampling — a decoding method that became foundational for modern language generation.

Instead of always choosing from a fixed number of likely tokens, nucleus sampling dynamically selects the smallest set of tokens that contains most of the probability mass, then samples from that “nucleus.” This helps avoid both extremes: repetitive overconfidence and chaotic randomness.

In this session, we will unpack:

  • why maximizing likelihood can produce worse text

  • why beam search breaks down in open-ended generation

  • why pure sampling can become incoherent

  • what the “unreliable tail” of a language model distribution means

  • how nucleus sampling works intuitively

  • why this paper still matters for understanding modern LLM behavior

  • what changed between the GPT-2 era and today’s chat models

This is not a session about “AI text used to be boring.”

It is about a historic moment in generative AI: when the field realized that generation quality lives not only inside the model, but also in the decoding strategy between the model and the words we see.

If you use LLMs, build with LLMs, prompt LLMs, evaluate LLMs, or just want to understand what happens under the hood when a model writes text, this session is for you.

No need to be an expert.

Come curious. Leave with a much sharper mental model of how language models actually generate.

Location
Please register to see the exact location of this event.
Paris, France
Avatar for AI Reading Club
Presented by
AI Reading Club
Hosted By
19 Going