

Community Huddle: Faster Agents With SLMs
A Live Technical Webinar
AI costs quietly spiral in production.
Between overpowered models, inefficient routing, and unnecessary tokens, many teams are paying 5–20x more than they need to.
Small Language Models (SLMs) offer a practical alternative.
In this live webinar, we’ll break down how to redesign your AI stack to dramatically reduce inference costs without sacrificing performance or reliability.
You’ll learn:
Where large models are overkill
How to route intelligently between SLMs and frontier models
How to use SLMs for classification, extraction, summarization, and guardrails
How to design cost-aware production architectures
How to measure quality-vs-cost tradeoffs with real eval pipelines
Real-world examples of teams cutting AI spend significantly
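To make the routing idea concrete: a cost-aware stack often starts with a cheap dispatcher that sends well-bounded tasks to a small model and escalates everything else. The sketch below is illustrative only; the task names, token heuristic, and thresholds are assumptions for this example, not the approach covered in the session.

```python
# A minimal sketch of cost-aware routing between an SLM and a frontier model.
# Task categories, the 4-chars-per-token estimate, and the token budget are
# all hypothetical values chosen for illustration.

SIMPLE_TASKS = {"classify", "extract", "summarize", "guardrail"}

def route(task: str, prompt: str, max_slm_tokens: int = 2000) -> str:
    """Return which model tier should handle the request."""
    # Rough token estimate: ~4 characters per token for English text.
    est_tokens = len(prompt) // 4
    # Well-bounded tasks with short inputs go to the cheap small model;
    # open-ended or long-context work escalates to the frontier model.
    if task in SIMPLE_TASKS and est_tokens <= max_slm_tokens:
        return "slm"       # e.g. a small open-weights model
    return "frontier"      # e.g. a large hosted model

print(route("classify", "Is this email spam?"))        # slm
print(route("reason", "Plan a multi-step migration"))  # frontier
```

In production this heuristic would typically be replaced by a learned classifier or confidence-based escalation, but even a static rule like this captures the core tradeoff: reserve expensive tokens for requests that need them.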
This session is built for developers, founders, and product teams shipping AI features in production.
If you want your margins back, this is for you.