

Week 45: Attention Residuals: Rethinking Information Flow in LLMs
This paper introduces Attention Residuals (AttnRes), a novel architectural mechanism from the Kimi Team that rethinks how information flows through modern Large Language Models (LLMs). The central challenge the work addresses is that standard residual connections with PreNorm accumulate all layer outputs with fixed unit weights: each layer's input is simply the sum of the embedding and every preceding layer's output. This uniform aggregation causes uncontrolled hidden-state growth with depth and progressively dilutes each layer's unique contribution. To overcome this limitation, the authors replace the fixed accumulation with a softmax attention mechanism over preceding layer outputs, letting each layer selectively aggregate earlier representations with learned, input-dependent weights.
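
To make the mechanism concrete, here is a minimal PyTorch sketch of the idea as summarized above. It is an illustration under stated assumptions, not the paper's implementation: the class name `AttnResBlock`, the query/key projections that score earlier layers, the per-token softmax, and the feed-forward stand-in for the transformer sublayers are all hypothetical.

```python
# Minimal sketch of AttnRes-style aggregation: instead of feeding each block
# the plain sum of all earlier layer outputs (the PreNorm residual stream),
# the block mixes them with learned, input-dependent softmax weights.
# All names and the exact scoring function are assumptions for illustration.
import torch
import torch.nn as nn
import torch.nn.functional as F


class AttnResBlock(nn.Module):
    def __init__(self, d_model: int):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        # Stand-in for the block's attention/MLP sublayers.
        self.sublayer = nn.Sequential(
            nn.Linear(d_model, 4 * d_model),
            nn.GELU(),
            nn.Linear(4 * d_model, d_model),
        )
        # Hypothetical projections that score preceding layer outputs.
        self.q_proj = nn.Linear(d_model, d_model)
        self.k_proj = nn.Linear(d_model, d_model)

    def forward(self, history: list[torch.Tensor]) -> torch.Tensor:
        # history: embedding plus every earlier block's output,
        # each of shape (batch, seq, d_model).
        stack = torch.stack(history, dim=-2)            # (B, S, L, D)
        q = self.q_proj(history[-1]).unsqueeze(-2)      # (B, S, 1, D)
        k = self.k_proj(stack)                          # (B, S, L, D)
        scores = (q * k).sum(-1) / k.shape[-1] ** 0.5   # (B, S, L)
        weights = F.softmax(scores, dim=-1)             # input-dependent mix
        mixed = (weights.unsqueeze(-1) * stack).sum(-2) # (B, S, D)
        # PreNorm-style block update on the mixed representation.
        return mixed + self.sublayer(self.norm(mixed))


# Usage: each block reads the full history and appends its own output.
blocks = nn.ModuleList([AttnResBlock(256) for _ in range(4)])
x = torch.randn(2, 16, 256)  # token embeddings
history = [x]
for block in blocks:
    history.append(block(history))
output = history[-1]
```

One design point worth flagging: this sketch computes a separate softmax over layers for every token, which is one way to read "learned, input-dependent weights"; the paper's actual parameterization (per-token vs. per-layer weights, shared vs. separate projections) may differ.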
Join us at Mox!
🔎 Analyzed-paper discussion at 20:00; (optional) quiet reading from 19:00 to 20:00.