Cover Image for Voice AI Meetup
Cover Image for Voice AI Meetup
Avatar for Pipecat
Presented by
Pipecat
The most widely used voice agents framework — 100% open source

Voice AI Meetup

Register to See Address
San Francisco, California
Registration
Welcome! Please choose your desired ticket type:
About Event

Join for our next Voice AI Meetup, moderated by Kwindla from Pipecat with leaders from Speechmatics, Tavus, and Daily.

  • We'll discuss benchmarks for voice AI, as well as multimodal AI research and use cases.

  • Speakers include Ricardo Herreros Symons, CSO Speechmatics; Sam Sykes, Director Innovation Speechmatics; Quinn Favret, Cofounder Tavus; Kwindla Hultman Kramer, Pipecat.

  • Plus demos, pizza/drinks, networking and conversations with fellow AI engineers, founders, investors, and teams.

Ricardo and Sam from Speechmatics are in town from London, and glad to talk STT. We're onsite at Tavus HQ, where you can chat with their team.

  • Doors open 6:30p PT in SF / livestream. Demos and fireside chats start 7:15p. More networking at 8p. Office closes 9p.

On benchmarks

One of the top questions we get from the Pipecat community and ecosystem is how to evaluate models (and how we evaluate models).

Pipecat recently released benchmarks evaluating voice AI performance. Our first benchmarks tested LLMs and STT.

Benchmarks are hard to do well, and always are a simplification of reality, at best!

We’re sitting down with the STT lab Speechmatics (which maintains a Pipecat service) to talk about benchmarks. As our Pipecat team designed and compiled our benchmarks, we worked closely with the leading labs, including Speechmatics, to get their input, feedback, and perspective.

Kwindla will continue that conversation, with Ricardo Herreros Symons, Speechmatics CSO:

  • How "hard" should a benchmark be and what should the data mix be?

  • What should you really be testing? (Latency, turn detection, configurability, etc, what else?)

  • What data sets should you train on?

  • How do APIs and orchestration implementations matter?

  • What is the difference between reality and…marketing.

  • What could the next evolution of a benchmark be?

On multimodal AI

We’re also excited for a fireside chat with Quinn Favret, Tavus cofounder. Tavus has long been a leader in multimodal AI. Kwindla will talk with Quinn about the latest research and training realtime models from scratch. We'll hear from Quinn about growing agentic video use cases from startups to the enterprise.


Your meetup hosts

Pipecat is the most widely used voice agents and multimodal AI framework. 100% open source. Vendor neutral.

Speechmatics provides advanced Voice AI technology, delivering real-time and batch transcription across 55+ languages. Their neural models are engineered for high accuracy across accents, noisy environments, and specialized domains, powering scalable speech intelligence for enterprise applications.

Tavus is an SF–based AI research lab pioneering human computing, teaching machines the art of being human. Build, scale, and customize lifelike AI video agents for your products and workflows.

Daily provides realtime voice, video, and AI infrastructure for developers. Its engineers maintain the open source Pipecat framework.

Location
Please register to see the exact location of this event.
San Francisco, California
Avatar for Pipecat
Presented by
Pipecat
The most widely used voice agents framework — 100% open source