Alumnx AI Build Day: TokenLens
Alumnx AI Build Day: Token Observability
30th May | 11 AM to 12 PM | Hybrid (Hyderabad + Online)
Ever wondered what your AI inference actually costs?
When you upload a PDF into ChatGPT or Claude and ask a question, do you know how many tokens are being consumed? Do you know when it makes more sense to self-host an open-source model like Gemma 4 versus calling a hosted API like Gemini Flash?
Most people building AI applications today don't have answers to these questions. That's an expensive blind spot.
What we are building
Participants from Alumnx AI Labs' Full Stack AI Engineers cohort and AI for Senior Leaders (SLP) program are building a Token Observability web application, live, in 48 hours.
The application will let you:
Upload a PDF and ask questions against it
See exactly how many tokens are consumed and what it costs, in ₹ and $
Compare full-context ingestion vs RAG-based retrieval
Understand when a self-hosted model beats a hosted API, and when it doesn't
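To give a feel for the arithmetic behind these comparisons, here is a minimal sketch of how token counts might translate into cost. It is not the cohort's implementation. Every number in it (per-token prices, exchange rate, PDF size, chunk sizes, GPU hourly rate) is a hypothetical placeholder, and the names `QueryCost` and `break_even_queries_per_hour` are invented for illustration.

```python
from dataclasses import dataclass

# All values below are hypothetical placeholders, not real provider pricing.
USD_PER_MILLION_INPUT = 0.30   # assumed price per 1M input tokens
USD_PER_MILLION_OUTPUT = 2.50  # assumed price per 1M output tokens
INR_PER_USD = 85.0             # assumed exchange rate


@dataclass
class QueryCost:
    """Cost of one question, derived from its input and output token counts."""
    input_tokens: int
    output_tokens: int

    @property
    def usd(self) -> float:
        total = (self.input_tokens * USD_PER_MILLION_INPUT
                 + self.output_tokens * USD_PER_MILLION_OUTPUT)
        return total / 1_000_000

    @property
    def inr(self) -> float:
        return self.usd * INR_PER_USD


def break_even_queries_per_hour(gpu_usd_per_hour: float,
                                api_usd_per_query: float) -> float:
    """Queries per hour above which a fixed-cost GPU is cheaper than the API."""
    if api_usd_per_query <= 0:
        raise ValueError("api_usd_per_query must be positive")
    return gpu_usd_per_hour / api_usd_per_query


# Full-context: the whole PDF (assumed 120,000 tokens) plus a 50-token question.
full_context = QueryCost(input_tokens=120_000 + 50, output_tokens=300)
# RAG: only the retrieved chunks (assumed 4 chunks of 500 tokens) plus the question.
rag = QueryCost(input_tokens=4 * 500 + 50, output_tokens=300)

for name, q in [("full-context", full_context), ("RAG", rag)]:
    print(f"{name}: {q.input_tokens} in / {q.output_tokens} out "
          f"-> ${q.usd:.4f} (₹{q.inr:.2f})")

# Assumed GPU rental of $1.00 per hour, compared against the full-context API cost.
print(f"break-even: {break_even_queries_per_hour(1.00, full_context.usd):.1f} queries/hour")
```

The break-even figure only compares raw hourly spend. A real decision would also weigh throughput, latency, answer quality, and idle GPU time.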
What happens on 30th May
We demo the live, deployed application, built entirely by our cohort in 48 hours. You will see real token counts, real cost breakdowns, and a side-by-side comparison of Gemma 4 on an NVIDIA A10G GPU versus the Gemini Flash API.
Who should attend
Developers building AI applications who want to understand and control their inference costs
Senior leaders evaluating AI investments and ROI
AI enthusiasts curious about how LLMs are priced and deployed
Anyone who wants to see what 48 hours of focused AI building looks like
Want to build with us?
We are open to contributions from AI enthusiasts who want to code alongside our team during the 48-hour build sprint. Limited spots available.
Organised by Alumnx AI Labs | Hyderabad
Building hands-on AI practitioners, one cohort at a time.