Give your AI Agents Eyes and Ears. Perception 101 with VideoDB

Name: Give your AI Agents Eyes and Ears. Perception 101 with VideoDB
Start: 2026-04-07T17:30:00.000-07:00
End: 2026-04-07T20:30:00.000-07:00
Location: San Francisco, California

About Event

Join AI engineers, startups, and creative professionals for a hands-on workshop on building real-time perception for agents.

AI is out of the chatbot phase. It is moving into devices. Soon it will sit on your desk. Then it will sit in your room.

As agents leave text boxes and enter the physical and digital world, they need real-time perception and structured delivery.

VideoDB is building the infrastructure layer that enables that shift: the ability to see, understand, and act on the real world.

This workshop is led by Ashu, founder of VideoDB. We’ll discuss how to convert continuous media streams (screen, mic, camera, RTSP, files) into structured context your agent can use.

Who should attend:

Engineers building agents or products that need continuous, temporal multimodal awareness (not one-shot screenshots).
Anyone exploring skills for OpenClaw or computer-use agents who wants them to have eyes and ears for any task.
Research teams building in physical AI, AI companion robots and wearables.
Product teams building meeting bots, desktop copilots, monitoring/ops, QA/compliance.
Teams whose workflows involve camera streams and searching for events of interest.

What You’ll Discover:

What “perception” actually means for agents: continuous, temporal, multi-source, searchable, actionable.
How to support three input modes with one mental model: files, live streams, desktop capture.
How to build searchable memory so your agent can retrieve results with playable evidence, not vibes.
How to move from batch video AI to real-time event streams your agent can react to immediately.
Claude Code and Codex skills for vibe coding within your stack.
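
As a rough illustration of the "one mental model" idea in the list above, the sketch below has files, live streams, and desktop capture all emit the same timestamped event, which a small in-memory index can search and react to. Every name here (`PerceptionEvent`, `MemoryIndex`, the source labels, the sample descriptions) is hypothetical and chosen for illustration. This is not the VideoDB SDK, and a real system would replace the substring match with a proper retrieval method.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass(frozen=True)
class PerceptionEvent:
    """One timestamped observation, whatever the input mode (hypothetical type)."""
    source: str   # assumed labels: "file", "stream", or "desktop"
    start: float  # seconds from the start of the source
    end: float
    text: str     # description of what was seen or heard


@dataclass
class MemoryIndex:
    """Toy searchable memory that also pushes new events to subscribers."""
    events: List[PerceptionEvent] = field(default_factory=list)
    listeners: List[Callable[[PerceptionEvent], None]] = field(default_factory=list)

    def subscribe(self, callback: Callable[[PerceptionEvent], None]) -> None:
        self.listeners.append(callback)

    def ingest(self, event: PerceptionEvent) -> None:
        # Store for later retrieval, then notify listeners right away.
        self.events.append(event)
        for callback in self.listeners:
            callback(event)

    def search(self, query: str) -> List[PerceptionEvent]:
        # Naive case-insensitive substring match, standing in for real search.
        needle = query.lower()
        return [e for e in self.events if needle in e.text.lower()]


index = MemoryIndex()
alerts: List[PerceptionEvent] = []


def on_event(event: PerceptionEvent) -> None:
    # React as events arrive instead of waiting for a batch job.
    if "error" in event.text.lower():
        alerts.append(event)


index.subscribe(on_event)

# Three input modes, one event shape.
index.ingest(PerceptionEvent("file", 12.0, 15.5, "Speaker introduces the roadmap"))
index.ingest(PerceptionEvent("desktop", 3.0, 4.0, "Terminal shows build error"))
index.ingest(PerceptionEvent("stream", 40.0, 42.0, "Person enters the room"))

hits = index.search("roadmap")
for hit in hits:
    # The (source, start, end) triple is what makes a result playable evidence.
    print(f"{hit.source} [{hit.start}s-{hit.end}s]: {hit.text}")
```

Each hit carries its source and time range, so an agent can point back to the exact clip rather than only a text summary.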

✨ Plus:

Refreshments and a networking session with top builders working on agents and multimodal infra.

Expect production-grade demos, takeaways you can reuse and an hour of networking to share ideas in agentic perception, video, multimodal AI and frontier tech.
For more, check https://github.com/video-db

Presented by

VideoDB

Build agents that watch, listen, understand, and recall in real time

