Beyond Scaling: Making Large Language Models Efficient

Name: Beyond Scaling: Making Large Language Models Efficient
Start: 2026-06-17T18:30:00.000+05:30
End: 2026-06-17T19:30:00.000+05:30
Location: Bengaluru, India

Lossfunk Event Calendar

Register to See Address

Bengaluru, India

Welcome! Please choose your desired ticket type:

You will be asked to verify token ownership with your wallet.

About Event

Transformer architectures have enabled the rapid progress of modern language models, but scaling them efficiently remains a major challenge. As models grow larger, compute cost, memory usage, inference latency, and deployment complexity increase significantly.

In this session, Abhay will discuss practical research directions to improve transformer efficiency in small and medium-sized language models.

The talk will cover architectural experiments, training optimisations, efficient attention mechanisms, memory-efficient techniques, and inference-focused design choices being explored while building open-source LLMs at FrontiersMind.

Speaker:
Abhay Kumar is a co-founder of FrontiersMind, an AI research lab focused on efficient small and medium-sized language models optimised for enterprise and real-world deployment use cases.

Hugging Face: - https://huggingface.co/FrontiersMind

LinkedIn: https://www.linkedin.com/in/akanyaani/

To attend online:

⁠Add to calendar: https://shorturl.at/rGZU1
Gmeet link: meet.google.com/uzb-mmdh-wjs

Pre-read:

⁠Basics of Transformer architectures
The Illustrated Transformer
Memory-Efficient Attention: MHA vs. MQA vs. GQA vs. MLA
Understanding DeepSeek's Multi-Head Latent Attention

Looking forward to seeing you!

Location

Please register to see the exact location of this event.

Bengaluru, India

Presented by

Lossfunk Event Calendar

Your friendly neighborhood AI lab

Hosted By

5 Going

AI