Redwood Research Red Teaming Hackathon

Name: Redwood Research Red Teaming Hackathon
Start: 2025-09-13T10:00:00.000-07:00
End: 2025-09-14T17:00:00.000-07:00
Location: Berkeley

Hosted by Michael Yu

Berkeley, California

Approval Required

Your registration is subject to approval by the host.

About Event

⚡ Who can build the best hacks and defenses for LLMs? ⚡🤖

What if LLMs had hidden motives? Models deployed in the U.S. government and the world's most important companies might secretly be looking for opportunities to sneak backdoors into code or otherwise pursue nefarious objectives. Could we catch these model hacks? This weekend, we're playing a capture-the-flag game with two teams:

Red team: Creates realistic examples of models that hide secret objectives (e.g., a model that secretly tries to share data with attackers).
Blue team: Tries to detect these objectives.

We'll bring the compute and some examples to build on. You bring the research hustle.

More details here: http://bit.ly/3JlJ3Q2

Hosts

Redwood Research (redwoodresearch.org)
Constellation (constellation.org)

🗓️ Schedule (Subject to Change):

Sept 13th 2025

9:30 AM: 🚪 Doors Open
9:45 AM: 🎤 Opening Remarks
11:00 AM: 💻 Start Coding!
12:00 PM: Lunch Provided
6:00 PM: Dinner Provided

Sept 14th 2025

9:30 AM: 🚪 Doors Open
12:00 PM: Lunch Provided
2:00 PM: 📤 Project Submission Deadline
5:30 PM: 👩‍💻 Awards Ceremony

➡️ Next steps ➡️

Join the Discord: https://discord.gg/7qEDgCWAwm 👀 #teamsearch is where you can look for teams
Register on Devpost: https://redwood-af.devpost.com/

Notion: http://bit.ly/3JlJ3Q2

Location

Berkeley

CA, USA
