AI Security Day
AI Security Day connect researchers and builders interested in AI Safety. AI Security is part of MuShanghai's AI Week, which brings together AI builders in the Chinese and global ecosystem. AI Safety researchers and builders welcome! The event will be both in person (Shanghai @Alibaba) and virtual.
**AI Security Day of Schedule**
9:30-10 AM: Welcome and AI Security Day Introduction
10 AM-11 AM: TEE Talk + Workshop: Clawdi: A Secure iCloud for Agent
Speaker: Shelven Zhou (Phala)
11AM-11:30AM: All About Trust and Safety + Introduction to Osprey and Coop
Speaker: Juliet Shen, (Roost, https://roost.tools/)
11:30AM - 12PM: Break with Research Breakout + Highlight
12 PM - 12:30 PM: Quick Dive into Interpretability.
Speaker: Noam Youngerman, Security Researcher. Previous CTO of Epos, a research lab focusing on the intersection of mechanistic interpretability and AI security. Prior to that, Noam was at various AI research and held leadership roles in several applied research companies.
Talk Detail: Do we know what happens under the hood of an LLM? What does a latent space actually consist of? Noam's talk will include subliminal learning, a recently discovered effect that is fundamental to the behavior of LLMs and has some security implications.
12:30PM- 1PM: Speaker: Yujin Potter (Berkeley RDI)
1PM - 2PM: Lunch break
2PM - 2:30PM: AI Agent Control x AI Safety Landscape in China
Speaker: Sarah Sun (Open Community for AI Safety (China))
2:30PM - 3PM: AI Safety Field Building x AI Safety Fellowship
Speaker: Valerie Pang (SASH, https://www.aisafety.sg/)
3PM - 3:30PM: COFFEE BREAK
3:30PM - 4PM: Principles of Least Authority
Speaker: Jiang (Social Layer)
4PM - 4:30PM: Multi-agent Security
Speaker: Marcello Politi (Ethereum Foundation)
4:30PM - 5PM: Speaker: Soumaya Batra (WisePort. Co-author of Llama 2+3)
5PM - 5:30PM: Closing and Research Pod Next Step
Organizations interested in submitting projects for open source contribution or researcher support and exchange should complete the following form:
https://forms.gle/8SbaVEi5icVRjNFe6