Edge Cases by AGI, Inc. #002 on Near-Memory Execution for LLMs with Lead Architect @NVIDIA

Name: Edge Cases by AGI, Inc. #002 on Near-Memory Execution for LLMs with Lead Architect @NVIDIA
Start: 2026-05-28T17:00:00.000-07:00
End: 2026-05-28T19:30:00.000-07:00
Location: San Francisco, CA

AGI, Inc.

Register to See Address

San Francisco, CA

4 Spots Remaining

Hurry up and register before the event fills up!

Approval Required

Your registration is subject to host approval.

Welcome! To join the event, please register below.

You will be asked to verify token ownership with your wallet.

About Event

A lot of the edge AI work we find interesting is either pre-print or too niche. With the Edge Cases we're creating a space for these topics.

Every other week at AGI HQ, one researcher takes the room through a paper or a work-in-progress. On-device inference, small models, efficient training, anything that lives on constrained hardware.

Case #002: Mochamad Asri (Lead Architect at NVIDIA, PhD
ECE from UT Austin) is presenting “System Architectures for near-memory execution”, how he thinks about bandwidth, latency, and end-to-end QoS for serving generative AI and LLMs.

Limited to AI researchers and students only.

One paper, one presenter, 45 minutes
Off-the-record Q&A
Drinks and conversation after

Approval-only. 15 seats. If you're in, we'll send the calendar invite with the address.

Case #003: Open call.

We're sourcing this session's paper from the community. Pitch your own work or a paper you want to lead a discussion on.

👉 Apply here

Location

Please register to see the exact location of this event.

San Francisco, CA

Presented by

AGI, Inc.

Hosted By

AI