Cover Image for The Future of Multimodal AI: From Video Ads to Voice Cloning
Cover Image for The Future of Multimodal AI: From Video Ads to Voice Cloning
Avatar for FounderCoHo
Presented by
FounderCoHo
Share founder story. Fuel founder journey. www.foundercoho.com
19 Going

The Future of Multimodal AI: From Video Ads to Voice Cloning

Registration
Approval Required
Your registration is subject to approval by the host.
Welcome! To join the event, please register below.
About Event

Multimodal AI is everywhere now. Type text/URL → get a video ad. Record 10 seconds → clone any voice. Upload a product photo → generate a complete marketing campaign.

But here's the brutal truth: Most multimodal AI startups have incredible tech and no idea who actually needs it. They're solutions desperately searching for problems.

Between the demo and shipping to millions of users? That's where things get messy. And that's where having cool technology stops being enough.

Three founders who've actually done it will share what breaks, what scales, and what matters.

No theory. No fluff. Just real stories from the trenches.


Why Multimodal AI Matters Right Now

  • 95% of customer interactions will be AI-powered by end of 2025

  • Companies are spending billions trying to automate video, audio, and visual content creation

  • But most multimodal AI products fail in the messy space between demo and production

  • The founders who figure this out first will build the next generation of billion-dollar companies


Meet the Founders

Yinan (Steven) Na – Co-Founder & CEO, Creatify AI(#1 AI Video Ad Platform)
Turning text into video ads at scale. $15.5M Series A from top VCs. $9M ARR. Ex-Snap & Meta engineering leader.

Emmie Chang – Co-Founder @ Yuzu Labs | YC W14 | Serial Entrepreneur
YC-backed founder. Has built (and failed) enough AI products to know what actually matters.

Rissa Cao — Co-Founder & CEO @ Fish Audio & 39 AI | Ex-Co-Founder @ Mewtant
Built the most expressive voice AI platform on the market. 20K+ developers, $5M ARR, 6x cheaper than ElevenLabs.

What You'll Learn

The real cost of shipping multimodal AI (spoiler: it's not just compute)
🔧 Production war stories – what breaks when you hit scale
💰 How to build a business when models change every 3 months
🚀 Distribution secrets – getting users in a crowded market
🔮 What's next – where the biggest opportunities still are

Join us to learn the hard-won lessons of deploying multimodal AI and get a look at where the industry is heading.

Event Details

📅 Date: Dec 5th
🕐 Time:4:30-7:30pm
📍 Location: Stanford

Event Schedule:

  • ​​​​​​​​​​4:30 PM - 5:00 PM: Networking and Dinner

  • 5:00 PM -5:30 PM: Warm up and Presentation from Wan

  • ​​​​​​​​​​5:30 PM - 6:15 PM: Panel and Q&A

  • ​​​​​​​​​​6:15 PM onwards: Socializing, and Continued Conversations

Partner

Wan – An AI platform that turns text and images into animated short-form videos in seconds, lowering the barrier to creative work with AI.​​​​​​​​​​​

Meet your Host!

​​​​​​​​​​​FounderCoHo is a community where founders support each other by sharing knowledge and building connections. Starting a company is a brave endeavor. Many of us have faced numerous challenges and learned valuable lessons along the way. The founder's journey can feel lonely, as it's difficult to discuss its nuances with those who haven't experienced it firsthand. FounderCoho aims to be that supportive community where founders can share their stories and get fueled for their company-building journey.

​​​​​​​​​​Where to find us:

​​​​​​​​​​LinkedIn

​​​​​​​​​YouTube

​​​Substack

Location
EVGR, Building C
726 Serra St, Stanford, CA 94305, USA
Avatar for FounderCoHo
Presented by
FounderCoHo
Share founder story. Fuel founder journey. www.foundercoho.com
19 Going