Cover Image for AI Book Club: Build a Text-to-Image Generator (from Scratch)
Cover Image for AI Book Club: Build a Text-to-Image Generator (from Scratch)
Avatar for AI Builders and Learners
Hosted By

AI Book Club: Build a Text-to-Image Generator (from Scratch)

Virtual
Registration
Welcome! To join the event, please register below.
About Event

May's book is "Build a Text-to-Image Generator (from Scratch)"!

This is a casual-style event. Not a structured presentation on topics. Sometimes, the discussion even drifts away from the chapters, but feel free to grab the mic to help steer it back.

Feel free to join the discussion even if you have not read the book chapters! :)

Want to discuss the contents during the reading week? Join the Slack Flyte MLOps Slack group and search for the "ai-reading-club" channel. https://slack.flyte.org/

-------------------------------------------------
About the book:
Title: Build a Text-to-Image Generator (from Scratch)
Authors: Mark Liu
Published: December 2025

Manning ():https://www.manning.com/books/build-a-text-to-image-generator-from-scratch

O'rielly platform: https://learning.oreilly.com/library/view/build-a-text-to-image/9781633435421/

Chapters:
Part 1 Understanding attention and transformers
1 A tale of two models: Transformers and diffusions
2 Build a transformer
43% complete
3 Classify images with a vision transformer
4 Add captions to images
Part 2 Introduction to diffusion models
5 Generate images with diffusion models
6 Control what images to generate in diffusion models
7 Generate high-resolution images with diffusion models
Part 3 Text-to-image generation with diffusion models
8 CLIP: A model to measure the similarity between image and text
9 Text-to-image generation with latent diffusion
10 A deep dive into Stable Diffusion
Part 4 Text-to-image generation with transformers
11 VQGAN: Convert images into sequences of integers
12 A minimal implementation of DALL-E
Part 5 New developments and challenges
13 New developments and challenges in text-to-image generation

####

Book Description
Build a Text-to-Image Generator (from Scratch) takes you step-by-step through creating your own AI models that can generate images from text. You’ll explore two methods of image generation—vision transformers and diffusion models—and learn vital AI development techniques as you go.

Build a Text-to-Image Generator (from Scratch) teaches you how to:

  • Build and train models to generate high resolution images based on text descriptions

  • Edit an existing image based on text prompts

  • Build and train a model to add captions to images

  • Build and train a vision transformer to classify images

  • Fine-tune LLMs for downstream tasks such as classification, text or image generation

  • Better differentiate real images from deepfakes

Avatar for AI Builders and Learners
Hosted By