Inference Engineering
Inference is the most valuable category in AI, but inference engineering is still in its infancy.
Inference engineers work across the stack from CUDA to Kubernetes in pursuit of faster, less expensive, more reliable serving of generative AI models in production.
While the potential and impact of inference are becoming clear, the space is young. There are relatively few people working on inference, and newcomers can become experts quickly. There are opportunities to solve novel, interesting, and deeply technical problems at every level of the stack.
Inference Engineering is your guide to becoming an expert in inference. This book is based on the hundreds of thousands of words of documentation, blogs, and talks I've published on inference; interviews with dozens of experts from Baseten's engineering team; and countless conversations with customers and builders around the world.

Life-Changing Email
Are you tired of sending 200 job applications every semester and hearing nothing back?
Instead, use this guide to further your career by sending authentic emails to real people. And to prove it works, I've included a few stories from my college career where I applied the principles I'm teaching.
By the end of the guide, you'll have all the tools and confidence you need to send the kind of email that can move you a thousand miles in a hundred words.

Writing for Software Developers
In my first year of writing, I reached 100,000+ readers worldwide, met my heroes, and paid my rent with content. Writing for Software Developers will help you do the same.
By following the principles in this book, you will become a better writer — and structured practice of quality techniques will help you get there faster.
