Introduction
Large Language Models (LLMs) have attracted enormous attention in the last few years. From simple autocomplete to agentic workflows, they have unlocked a wide range of applications that simply weren't possible before.
It feels like every day there is a new model from OpenAI, Google, Meta, or Anthropic that everyone is talking about. But for most software engineers who don't work at a frontier AI lab, the challenge is not how to train or host these massive models; it is how to use them to build a working product.
This book, Large Language Models for Software Engineers, is a practical guide to doing exactly that. We are not going to dive into the low-level details of neural network architecture. Instead, we assume that you already have access to an API for a hosted LLM and are tasked with building something valuable on top of it.
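To make that premise concrete, here is a minimal sketch of what "access to an API for a hosted LLM" typically looks like in practice. It assumes the OpenAI Python SDK and an illustrative model name; any hosted provider exposes an equivalent text-in, text-out call.

```python
# A minimal sketch of calling a hosted LLM: plain text in, plain text out.
# Assumes the OpenAI Python SDK (`pip install openai`) and an API key in the
# OPENAI_API_KEY environment variable; the model name is illustrative.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o-mini",  # any hosted chat model works here
    messages=[
        {"role": "user", "content": "Summarize this bug report in one sentence: ..."},
    ],
)

# The reply is just text; everything in this book builds on this interface.
print(response.choices[0].message.content)
```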
We specifically cover:
- the core text-in, text-out interface of LLMs
- how tokenization works
- how to generate the next token
- how embeddings work
- how to use retrieval-augmented generation (RAG) to improve quality
- how to use structured outputs and tool calls to build agents
- how to benchmark and evaluate LLMs
Our goal is to equip you with the fundamentals you need to design, implement, and evaluate your own LLM-powered applications.